Analysis of Medication Patterns for Traditional Chinese Medicine in Antithrombotic Therapy Based on Data Mining and Network Pharmacology


Release Date:

2021-02-27

Analysis of Medication Patterns for Traditional Chinese Medicine in Antithrombotic Therapy Based on Data Mining and Network Pharmacology

Analysis of Medication Patterns for Traditional Chinese Medicine in Antithrombotic Therapy Based on Data Mining and Network Pharmacology

Abstract: Objective Deep venous thrombosis is a major contributor to mortality and poor prognosis in critically ill patients with coronavirus disease 2019 (COVID-19). This study aims to elucidate the formulation and medication patterns of traditional Chinese medicine (TCM) for the prevention and treatment of thrombosis, thereby providing a reference for the application of TCM in managing deep venous thrombosis in critically ill COVID-19 patients. Methods Relevant literature indexed in the China National Knowledge Infrastructure and Wanfang Database was used as the data source. Excel 2010 and Clementine 12.0 software were employed to conduct association rule analysis on the included TCM herbs. Network pharmacology was applied to perform gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses on the core herb combinations, with the aim of investigating their mechanisms of action and validating these findings through molecular docking. Results A total of 356 antithrombotic formulas were screened from the China National Knowledge Infrastructure and Wanfang Database, revealing 25 high-frequency TCM herbs. Further association analysis identified seven high-frequency herb combinations. Among them, the core combination “Carthamus tinctorius–Prunus persica seed–Paeonia lactiflora–Ligusticum chuanxiong” contains 23 active chemical constituents and shares 41 common target proteins with thrombosis. The protein–protein interaction (PPI) network primarily involves key target proteins such as IL-6, VEGFA, CASP3, ALB, EGFR, and MAPK8. GO functional enrichment analysis yielded 971 biological process terms ( P <0.05), KEGG enrichment identified 105 pathways ( P (<0.05), primarily involving pathways such as Kaposi’s sarcoma-associated herpesvirus infection, human cytomegalovirus infection, fluid shear stress and atherosclerosis, and hepatitis B; molecular docking screening identified luteolin, quercetin, and baicalein as active compounds with strong binding affinity to IL-6. Conclusion: The key active constituents in the core drug combination exert antithrombotic effects by modulating critical targets such as IL-6 and by influencing pathways including Kaposi’s sarcoma-associated herpesvirus infection, human cytomegalovirus infection, fluid shear stress and atherosclerosis, and hepatitis B, thereby providing a reference for the prevention and treatment of deep vein thrombosis in critically ill patients with COVID-19.

Current international clinical studies have demonstrated that patients with coronavirus disease 2019 (COVID-19) in intensive care units (ICUs) exhibit a markedly increased risk of venous thromboembolic events (VTEs). Moreover, among COVID-19 patients admitted to general wards, the incidence of deep venous thrombosis (DVT) is also high, at 14.7%, and elevated D-dimer levels are a common finding in critically ill COVID-19 patients. [1] , the patient is in a high-risk thrombotic condition, including acute coronary syndrome (ACS), venous thromboembolism such as deep vein thrombosis (DVT) or pulmonary embolism (PE), or stroke. [2] In response, Chinese researchers have also conducted related studies; Zhang et al. [3] A survey of DVT formation in COVID-19 patients at a large referral treatment center in Wuhan revealed a DVT prevalence of 46.1% among these patients. Furthermore, a DVT screening study conducted by the Zhongnan Hospital of Wuhan University on critically ill COVID-19 patients found that 85.4% of them had DVT. [4] Moreover, both clinical studies have suggested that the development of DVT may be associated with poor outcomes in critically ill patients with COVID-19 pneumonia. The “Recommendations for the Prevention and Management of Venous Thromboembolism Associated with Novel Coronavirus Pneumonia (Trial)” state that DVT detachment carries a potential risk of pulmonary embolism. [5] This significantly affects the mortality and prognosis of COVID-19 patients; therefore, effective prevention and management of DVT can help reduce the risk of death in these patients. [6] Modern medicine commonly employs heparin, urokinase, low-molecular-weight dextran, warfarin, and other agents for antithrombotic therapy; however, the incidence of adverse reactions, such as gastrointestinal ulcers, also increases accordingly. [7] Thrombosis bears a striking resemblance to the TCM syndrome of blood stasis in terms of physiological and pathological understanding: adequate blood volume, robust cardiac qi, and unobstructed meridians are the fundamental prerequisites for normal blood circulation; any disruption at any one of these stages can impede blood flow, thereby giving rise to blood stasis. [8] Traditional Chinese medicine is adept at employing internal administration, external treatments, acupuncture, topical applications, fumigation, and localized moxibustion to prevent and treat venous thrombosis. This study investigates the patterns of TCM drug use for antithrombotic therapy and utilizes network pharmacology to explore the mechanisms underlying the antithrombotic effects of core herbal combinations, with the aim of providing reference for the use of TCM in preventing and treating DVT associated with COVID-19.

1 Materials and Methods

1.1 Data Source

Using “traditional Chinese medicine” and “thrombosis” as search terms, a literature search was conducted in the China National Knowledge Infrastructure (CNKI) and Wanfang Data databases from January 1980 to June 2020, yielding a total of 7,993 articles.

1.2 Inclusion criteria

① The literature must involve clinical cases (regardless of whether they are randomized controlled trials); ② The literature must be related to the antithrombotic effects of traditional Chinese medicine compound formulas; ③ The compound formula described in the literature must include the complete drug composition; ④ The literature must report biochemical indicators—such as D-dimer, fibrinogen (Fbg), prothrombin time (PT), thrombin time (TT), and activated partial thromboplastin time (APTT)—that reflect antithrombotic efficacy, or results from color Doppler ultrasound examinations; ⑤ The sample size must be ≥10.

1.3 Exclusion criteria

① Duplicate literature and non-combinatorial formulations; ② Basic research involving cells, animals, etc.; ③ Review articles and theoretical studies; ④ Literature reviews that do not specify particular drug compositions. [9]

1.4 Data Analysis

Network-based visual analysis and presentation of the included traditional Chinese medicines were conducted using Clementine 12.0, followed by association rule analysis via Apriori modeling. [10] ; The R language version 3.6.2 was used to extract information based on the “Bioconductor” package; gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed on the potential targets of the core drug combination for antithrombotic activity; Cytoscape software was employed to construct a “disease–drug–active ingredient–target” network diagram and a protein–protein interaction (PPI) network of common targets shared by the active ingredients and the disease; crystal structures of key target proteins were downloaded from the RCSB Protein Data Bank, water molecules in the crystal structures were removed using AutoDockTools 1.5.6, hydrogenation was performed, and charge calculations were carried out on the proteins, after which the files were saved in PDBQT format; 3D structures of key active compounds screened from the PubChem database were downloaded in SDF format, and Chem3D 17.0 and PyMOL were used to process the protein–small-molecule structures, followed by energy minimization of the compound files; finally, molecular docking was conducted using AutoDock Vina, and the results were visualized with PyMOL. [11]

2 Results and Analysis

2.1 Data Filtering

Based on the aforementioned inclusion and exclusion criteria, 337 articles meeting the standards were identified. The core formulas contained in these articles were entered into the database, with any added or subtracted herbs excluded, resulting in a total of 356 formulas after consolidation. The specific procedures for database retrieval and literature and formula screening are illustrated in Figure 1. [12]

2.2  Usage of Single-Ingredient Traditional Chinese Medicines

Refer to the 2015 Edition of the Chinese Pharmacopoeia. [13] and “Chinese Materia Medica” [14] The drug names in the 356 prescriptions were standardized, revealing a total of 198 types of traditional Chinese medicines with a cumulative occurrence frequency of 3,482. The top 10% by usage frequency were designated as high-frequency drugs. [15] A total of 25 traditional Chinese medicinal herbs were included (with a frequency ≥38), accounting for a cumulative frequency of 2,437 occurrences (69.99% of the total). The top six herbs by frequency of use are Angelica sinensis (frequency: 224, 62.92%), Carthamus tinctorius (frequency: 205, 57.58%), Ligusticum chuanxiong (frequency: 201, 56.46%), Paeonia lactiflora (frequency: 188, 52.81%), Prunus armeniaca seed (frequency: 173, 48.60%), and Achyranthes bidentata (frequency: 165, 46.35%), as shown in Table 1.

2.3 High-Frequency Drug Association Rule Analysis

The Apriori algorithm is a classic method for association rule mining. It is based on the concept of 2-item frequent itemsets and employs a recursive approach to uncover hidden association rules within frequent itemsets. [16-17] In Clementine 12.0 software, association rule analysis was conducted on high-frequency drugs (with a usage frequency ≥38) using the Apriori algorithm, with the following parameter settings: minimum support of 20%, minimum confidence of 90%, maximum number of antecedents set to 5, and minimum lift of 1. [18] Seven potential drug combinations were identified among the 356 formulas, each with an odds ratio greater than 1, indicating that these combinations are statistically significant. [17] The core herbal combination with the highest confidence and lift is safflower–peach kernel–chuanxiong–danggui; specific parameters are provided in Table 2. The association network diagram among high-frequency herbs is shown in Figure 2.

2.4 Associated drug dosage situation

To ensure the accuracy and efficacy of clinical medication for DVT in the prevention and treatment of COVID-19 using associated drug combinations, this study conducted a statistical analysis of the dosages of traditional Chinese medicines involved in the identified drug combinations. The results showed that the primary dosage ranges for Danggui, Taoren, Honghua, Chishao, and Chuanxiong were all 10–15 g, while the primary dosage range for Niuxi was 10–18 g (Table 3).

2.5 Mechanistic Analysis of the Core Drug Combination for Antithrombotic Effects Based on Network Pharmacology

2.5.1 Collection of information on the active ingredients, molecular targets, and thrombotic targets of the core drug combination. The core drug combination consisting of Safflower, Peach Kernel, Red Peony Root, and Chuanxiong exhibits the highest confidence and the greatest degree of enhancement. On the Traditional Chinese Medicine Systems Pharmacology Database and Analysis Platform (TCMSP, http://tcmspw.com/tcmsp.php), the active constituents and molecular targets of these four herbs—Safflower, Peach Kernel, Red Peony Root, and Chuanxiong—are retrieved, with criteria set at oral bioavailability (OB) ≥ 30% and drug-likeness (DL) ≥ 0.18. [19] After removing duplicate entries and non-existent mapping relationships, 46 chemical constituents and 109 putative targets were identified. Using the GeneCards database (https://www.genecards.org/), 798 targets associated with thrombosis were retrieved.

2.5.2 Disease–Drug–Active Ingredient–Target Network and PPI Network Analysis A Venn diagram was used to identify 41 shared target proteins between the active ingredients of four traditional Chinese medicines and the disease. Protein–protein interaction data were obtained from the STRING database (https://string-db.org/), and Cytoscape software was then employed to construct the disease–drug–active ingredient–target network (Figure 3) and the PPI network (Figure 4). A barplot of the target proteins was generated using an R package (Figure 5). The disease–drug–active ingredient–target network comprises 69 nodes, including 41 target genes, 23 drug active ingredients, 4 traditional Chinese medicine names, and 1 disease name; among these, quercetin, β-sitosterol, baicalein, luteolin, kaempferol, and stigmasterol are the chemical constituents with the largest number of associated targets, as shown in Table 3. In the PPI network, the strength of interactions among target proteins is positively correlated with the size and color intensity of the corresponding circles, as well as with the thickness and color intensity of the connecting lines. The target protein barplot visually represents, in the form of a histogram, the number of nodes connected to each target protein in the PPI network; the greater the number of connected nodes, the stronger the interaction with other target proteins. Ultimately, the target proteins exhibiting the strongest interaction strengths (with more than 25 connected nodes) were identified as interleukin-6 (IL-6), vascular endothelial growth factor A (VEGFA), caspase-3 (CASP3), albumin (ALB), epidermal growth factor receptor (EGFR), and mitogen-activated protein kinase 8 (MAPK8).

2.5.3 GO and KEGG pathway enrichment analyses: GO and KEGG enrichment analyses were performed on the antithrombotic targets of the core drug combination using the database (https://org.Hs.eg.db). The ClusterProfilerGO.R package in R was used to generate advanced bubble plots for biological function and pathway enrichment analyses, focusing on the top 20 pathways. In these plots, larger bubbles indicate a greater number of enriched genes. P The lower the value, the redder the color, indicating a closer association with the core drug combination for thrombosis prevention and treatment. [11] . GO analysis identified 971 biological processes (BP), primarily involving oxidative stress response, homeostasis of anatomical structure, cellular response to oxidative stress, response to lipopolysaccharides, response to acidic chemical substances, and response to bacterial-derived molecules, among others. KEGG pathway enrichment analysis revealed 105 signaling pathways ( P (<0.05), primarily involving pathways such as Kaposi’s sarcoma-associated herpesvirus infection, human cytomegalovirus infection, fluid shear stress and atherosclerosis, and hepatitis B (Figures 6 and 7).

 

2.6 Molecular docking validation of the interactions between the common active ingredients of the core drug combination with antithrombotic activity and the core targets.

Molecular docking was performed between the core drug group and the chemically active constituents with the greatest number of intermediate targets and the core target protein IL-6, which has the highest number of connected nodes in the PPI network (Table 4), thereby validating the results obtained from network pharmacology. In molecular docking, a binding energy less than zero indicates that the ligand and receptor can bind spontaneously. [20] Moreover, the more stable the conformation, the lower the binding energy, and the greater the likelihood of small-molecule compounds interacting with proteins. [21] In this study, binding energy ≤ −20 kJ/mol was selected as the screening criterion. [12] Docking results indicate that all six active compounds can bind to IL-6; the docking poses are shown in Figure 8, with luteolin, quercetin, and baicalein exhibiting particularly strong binding affinities.

3 Discussion

Current domestic and international research has confirmed a close association between severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and thromboembolic events; however, the underlying pathophysiological mechanisms remain incompletely elucidated. Available data indicate that D-dimer levels are markedly elevated in patients with COVID-19, suggesting a hypercoagulable state. [22] Some scholars believe this is associated with the cytokine storm induced by COVID-19, as plasma levels of tumor necrosis factor-α (TNF-α), IL-1β, and IL-8 are significantly elevated in COVID-19 patients. [23] This leads to platelet activation and endothelial dysfunction, ultimately resulting in thrombus formation. Recently, foreign researchers have detected evidence of the virus in endothelial cells of COVID-19 patients; therefore, another explanation is that the angiotensin-converting enzyme 2 (ACE2) receptor is widely expressed in endothelial cells, where SARS-CoV-2 binds to it, causing endothelial injury and subsequently triggering arterial and venous thrombosis. [24] In addition, severe pulmonary disease, advanced patient age, and prolonged bed rest are also potential contributors to COVID-19–associated thrombosis and poor clinical outcomes. Most importantly, despite the use of standard or high-dose prophylactic anticoagulation, the risk of thrombosis remains substantial, and the optimal anticoagulant dose has yet to be established. Clinically, there is an urgent need to identify additional predictive biomarkers beyond D-dimer. [25] Therefore, applying the valuable clinical experience of traditional Chinese medicine in antithrombotic therapy to the prevention and treatment of thrombosis in COVID-19 may significantly improve the adverse prognosis of critically ill patients.
This study conducted data mining and analysis on 356 herbal formulas related to antithrombotic therapy retrieved from the CNKI and Wanfang databases, identifying 25 high-frequency herbs and seven associated herb combinations. The study also quantified the dosages of each herb within these combinations, providing a reference for future clinical management of COVID-19–associated thrombosis. Among them, safflower, peach kernel, red peony root, and chuanxiong constitute the core herb combination identified in this study. In traditional Chinese medicine, deep vein thrombosis is classified under the categories of “blood stasis” and “meridian obstruction,” with the pathogenesis attributed to impaired qi flow and blood stasis obstructing the meridians. [26] Among the core herbal combination, peach kernel and safflower primarily promote blood circulation and resolve blood stasis; red peony root cools the blood and disperses stasis; and chuanxiong regulates qi and promotes blood circulation. Together, they exert the effects of breaking up blood stasis, regulating qi, and resolving stasis, which precisely aligns with the fundamental pathogenesis of thrombus formation. According to the results of network pharmacology analysis, the key target proteins of the core herbal combination are mainly IL-6, VEGFA, CASP3, ALB, EGFR, and MAPK8. Among these, IL-6 can activate platelets, enhancing their procoagulant activity; it can also induce the liver to produce acute-phase proteins such as C-reactive protein and fibrinogen, thereby increasing the number of coagulation factors and promoting thrombus formation. [27] ; VEGF can promote the proliferation of mesenchymal endothelial cells. [28] , exerting antiplatelet effects [29] , with VEGFA being its principal member; CASP3 is a member of the Caspase family and serves as a key effector of apoptotic signaling. [30] ; ALB exhibits heparin-like activity, can bind to NO and inhibit its inactivation, thereby relatively prolonging platelet aggregation and exerting an anticoagulant effect. [31] ; EGFR is a transmembrane glycoprotein with tyrosine kinase activity; upon binding its ligand, it exerts regulatory effects on cell proliferation, apoptosis, and angiogenesis. [32] ; MAPK8 plays a critical role in the proliferation, differentiation, and activation of inflammatory cells. [33] In summary, the core drug combination may exert antithrombotic effects by exerting anti-inflammatory and antiplatelet activities and promoting angiogenesis.
GO enrichment analysis of the active ingredients in the core drug combination revealed that the biological process with the highest number of enriched targets is oxidative stress response. Studies have shown that the hyperactive immune response induced by COVID-19 leads to the excessive production of free radicals in the body, resulting in oxidative stress and the accumulation of free radicals. [34] , leading to an imbalance between oxidative and antioxidant processes in the body, causing oxidative damage to vascular endothelial cells and ultimately inducing thrombus formation. [35] The KEGG pathways with the highest enrichment scores include those related to Kaposi’s sarcoma-associated herpesvirus infection, human cytomegalovirus infection, fluid shear stress, and atherosclerosis. Notably, most of these four pathways involve VEGF or IL-6. VEGF not only increases vascular permeability but also binds specifically to receptors on vascular endothelial cells, promoting their division and proliferation; in turn, vascular endothelial cells produce vasodilatory factors such as prostacyclin, thereby dilating blood vessels and inhibiting platelet aggregation. [36] Studies have found that IL-6 levels are only mildly elevated in patients with mild and moderate COVID-19 and decline substantially within a short period following treatment, whereas they show a marked increase on the third day of hospitalization in severe cases—likely reflecting the body’s immune response to SARS-CoV-2 infection, namely a “cytokine storm.” [37] , IL-6 is a key pro-inflammatory mediator in the body. [38] Furthermore, it can stimulate hepatocytes to produce large amounts of coagulation factor VIII and tissue factor (TF), thereby accelerating the coagulation cascade; it can also promote the proliferation and maturation of macrophages, leading to increased platelet production. [21]
Currently, many reports [39-40] It has been confirmed that IL-6 levels in the serum of COVID-19 patients are markedly elevated, and these levels correlate positively with disease severity. Furthermore, IL-6 can activate endothelial cells, promoting a hypercoagulable state that increases the risk of thrombotic disorders; the resulting thrombi, in turn, further stimulate excessive IL-6 expression, thereby establishing a vicious cycle. [41] Therefore, IL-6 may serve as a reference indicator for prognostic assessment and a therapeutic target in the prevention and treatment of thrombosis in critically ill patients with COVID-19. Network pharmacology analysis further reveals that IL-6 is a key target protein through which the core drug combination exerts its antithrombotic effects. Molecular docking results indicate that luteolin, quercetin, and baicalein exhibit strong binding affinities to IL-6: specifically, luteolin forms hydrogen bonds with Glu106, Glu42, and Asp160 on the A chain of the IL-6 protein; quercetin forms hydrogen bonds with Thr43, Glu42, and Glu106 on the A chain; and baicalein forms hydrogen bonds with Glu106, Thr43, and Asp160 on the A chain. Studies have shown that luteolin can markedly inhibit the secretion of the pro-inflammatory cytokine IL-6, thereby exerting anti-inflammatory effects. [42] , Quercetin exerts antiplatelet and antithrombotic effects. [43] Baicalein can inhibit collagen- and arachidonic acid-induced platelet aggregation and suppress thrombin-induced conversion of fibrinogen to fibrin, thereby exerting anticoagulant and antithrombotic effects. [44]

In summary, this study employed data mining and network pharmacology to investigate the formulation patterns and mechanisms of action underlying the anti-thrombotic effects of traditional Chinese medicine. It was found that the key active constituents in the core herbal combination—safflower, peach kernel, red peony root, and chuanxiong—can modulate critical targets such as IL-6 and engage pathways associated with viral infection and inflammatory vascular diseases, thereby counteracting oxidative stress, repairing endothelial injury, inhibiting platelet aggregation, and exerting anti-thrombotic effects. These findings provide both empirical evidence and theoretical guidance for the prevention and treatment of deep vein thrombosis in critically ill patients with COVID-19.

Source: Cui Linlin, Miao Mingsan. Analysis of Medication Patterns for Traditional Chinese Medicine in Anti-thrombotic Therapy Based on Data Mining and Network Pharmacology [J]. Chinese Herbal Medicines, 2021, 52(4):1063–1072.