Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 39
Filter
Add more filters











Publication year range
1.
J Cell Mol Med ; 28(19): e18590, 2024 Oct.
Article in English | MEDLINE | ID: mdl-39347925

ABSTRACT

Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) are two typical types of non-coding RNAs that interact and play important regulatory roles in many animal organisms. Exploring the unknown interactions between lncRNAs and miRNAs contributes to a better understanding of their functional involvement. Currently, studying the interactions between lncRNAs and miRNAs heavily relies on laborious biological experiments. Therefore, it is necessary to design a computational method for predicting lncRNA-miRNA interactions. In this work, we propose a method called MPGK-LMI, which utilizes a graph attention network (GAT) to predict lncRNA-miRNA interactions in animals. First, we construct a meta-path similarity matrix based on known lncRNA-miRNA interaction information. Then, we use GAT to aggregate the constructed meta-path similarity matrix and the computed Gaussian kernel similarity matrix to update the feature matrix with neighbourhood information. Finally, a scoring module is used for prediction. By comparing with three state-of-the-art algorithms, MPGK-LMI achieves the best results in terms of performance, with AUC value of 0.9077, AUPR of 0.9327, ACC of 0.9080, F1-score of 0.9143 and precision of 0.8739. These results validate the effectiveness and reliability of MPGK-LMI. Additionally, we conduct detailed case studies to demonstrate the effectiveness and feasibility of our approach in practical applications. Through these empirical results, we gain deeper insights into the functional roles and mechanisms of lncRNA-miRNA interactions, providing significant breakthroughs and advancements in this field of research. In summary, our method not only outperforms others in terms of performance but also establishes its practicality and reliability in biological research through real-case analysis, offering strong support and guidance for future studies and applications.


Subject(s)
Algorithms , Computational Biology , MicroRNAs , RNA, Long Noncoding , RNA, Long Noncoding/genetics , MicroRNAs/genetics , Computational Biology/methods , Animals , Humans , Gene Regulatory Networks , Normal Distribution
2.
Neural Netw ; 179: 106595, 2024 Nov.
Article in English | MEDLINE | ID: mdl-39159535

ABSTRACT

Graph neural networks (GNNs) leveraging metapaths have garnered extensive utilization. Nevertheless, the escalating parameters and data corpus within graph pre-training models incur mounting training costs. Consequently, GNN models encounter hurdles including diminished generalization capacity and compromised performance amidst small sample datasets. Drawing inspiration from the efficacy demonstrated by self-supervised learning methodologies in natural language processing, we embark on an exploration. We endeavor to imbue graph data with augmentable, learnable prompt vectors targeting node representation enhancement to foster superior adaptability to downstream tasks. This paper proposes a novel approach, the Metapath Integrated Graph Prompt Neural Network (MIGP), which leverages learnable prompt vectors to enhance node representations within a pretrained model framework. By leveraging learnable prompt vectors, MIGP aims to address the limitations posed by mall sample datasets and improve GNNs' model generalization. In the pretraining stage, we split symmetric metapaths in heterogeneous graphs into short metapaths and explicitly propagate information along the metapaths to update node representations. In the prompt-tuning stage, the parameters of the pretrained model are fixed, a set of independent basis vectors is introduced, and an attention mechanism is employed to generate task-specific learnable prompt vectors for each node. Another notable contribution of our work is the introduction of three patent datasets, which is a pioneering application in related fields. We will make these three patent datasets publicly available to facilitate further research on large-scale patent data analysis. Through comprehensive experiments conducted on three patent datasets and three other public datasets, i.e., ACM, IMDB, and DBLP, we demonstrate the superior performance of the MIGP model in enhancing model applicability and performance across a variety of downstream datasets. The source code and datasets are available in the website.1.


Subject(s)
Neural Networks, Computer , Natural Language Processing , Algorithms , Humans
3.
Interdiscip Sci ; 2024 Aug 07.
Article in English | MEDLINE | ID: mdl-39112911

ABSTRACT

The exploration of the interactions between diseases and metabolites holds significant implications for the diagnosis and treatment of diseases. However, traditional experimental methods are time-consuming and costly, and current computational methods often overlook the influence of other biological entities on both. In light of these limitations, we proposed a novel deep learning model based on metapath aggregation of tripartite heterogeneous networks (MAHN) to explore disease-related metabolites. Specifically, we introduced microbes to construct a tripartite heterogeneous network and employed graph convolutional network and enhanced GraphSAGE to learn node features with metapath length 3. Additionally, we utilized node-level and semantic-level attention mechanisms, a more granular approach, to aggregate node features with metapath length 2. Finally, the reconstructed association probability is obtained by fusing features from different metapaths into the bilinear decoder. The experiments demonstrate that the proposed MAHN model achieved superior performance in five-fold cross-validation with Acc (91.85%), Pre (90.48%), Recall (93.53%), F1 (91.94%), AUC (97.39%), and AUPR (97.47%), outperforming four state-of-the-art algorithms. Case studies on two complex diseases, irritable bowel syndrome and obesity, further validate the predictive results, and the MAHN model is a trustworthy prediction tool for discovering potential metabolites. Moreover, deep learning models integrating multi-omics data represent the future mainstream direction for predicting disease-related biological entities.

4.
Molecules ; 29(11)2024 May 24.
Article in English | MEDLINE | ID: mdl-38893359

ABSTRACT

The combinatorial therapy with multiple drugs may lead to unexpected drug-drug interactions (DDIs) and result in adverse reactions to patients. Predicting DDI events can mitigate the potential risks of combinatorial therapy and enhance drug safety. In recent years, deep models based on heterogeneous graph representation learning have attracted widespread interest in DDI event prediction and have yielded satisfactory results, but there is still room for improvement in prediction performance. In this study, we proposed a meta-path-based heterogeneous graph contrastive learning model, MPHGCL-DDI, for DDI event prediction. The model constructs two contrastive views based on meta-paths: an average graph view and an augmented graph view. The former represents that there are connections between drugs, while the latter reveals how the drugs connect with each other. We defined three levels of data augmentation schemes in the augmented graph view and adopted a combination of three losses in the model training phase: multi-relation prediction loss, unsupervised contrastive loss and supervised contrastive loss. Furthermore, the model incorporates indirect drug information, protein-protein interactions (PPIs), to reveal latent relations of drugs. We evaluated MPHGCL-DDI on three different tasks of two datasets. Experimental results demonstrate that MPHGCL-DDI surpasses several state-of-the-art methods in performance.


Subject(s)
Drug Interactions , Humans , Algorithms , Deep Learning , Machine Learning
5.
Sci Prog ; 107(2): 368504241257389, 2024.
Article in English | MEDLINE | ID: mdl-38881338

ABSTRACT

As the Internet and Internet of Things (IoT) continue to develop, Heterogeneous Information Networks (HIN) have formed complex interaction relationships among data objects. These relationships are represented by various types of edges (meta-paths) that contain rich semantic information. In the context of IoT data applications, the widespread adoption of Trigger-Action Patterns makes the management and analysis of heterogeneous data particularly important. This study proposes a meta-path-based clustering method for heterogeneous IoT data called I-RankClus, which aims to improve the modeling and analysis efficiency of IoT data. By combining ranking with clustering algorithms, the PageRank algorithm was used to calculate the intraclass influence of objects in the network. The HITS algorithm then transfers the influence to the core objects, thereby optimizing the classification of objects during the clustering process. The I-RankClus algorithm does not process each meta-path individually, but instead integrates multiple meta-paths to enhance the interpretability and clustering performance of the model. The experimental results show that the I-RankClus algorithm can process complex IoT datasets more effectively than traditional clustering methods and provide more accurate clustering outcomes. Furthermore, through a detailed analysis of meta-paths, this study explored the influence and importance of different meta-paths, thereby validating the effectiveness of the algorithm. Overall, the research presented in this paper not only improves the application effects of HINs in IoT data analysis but also provides valuable methods and insights for future network data processing.

6.
Brief Bioinform ; 25(3)2024 Mar 27.
Article in English | MEDLINE | ID: mdl-38622356

ABSTRACT

Identifying disease-associated microRNAs (miRNAs) could help understand the deep mechanism of diseases, which promotes the development of new medicine. Recently, network-based approaches have been widely proposed for inferring the potential associations between miRNAs and diseases. However, these approaches ignore the importance of different relations in meta-paths when learning the embeddings of miRNAs and diseases. Besides, they pay little attention to screening out reliable negative samples which is crucial for improving the prediction accuracy. In this study, we propose a novel approach named MGCNSS with the multi-layer graph convolution and high-quality negative sample selection strategy. Specifically, MGCNSS first constructs a comprehensive heterogeneous network by integrating miRNA and disease similarity networks coupled with their known association relationships. Then, we employ the multi-layer graph convolution to automatically capture the meta-path relations with different lengths in the heterogeneous network and learn the discriminative representations of miRNAs and diseases. After that, MGCNSS establishes a highly reliable negative sample set from the unlabeled sample set with the negative distance-based sample selection strategy. Finally, we train MGCNSS under an unsupervised learning manner and predict the potential associations between miRNAs and diseases. The experimental results fully demonstrate that MGCNSS outperforms all baseline methods on both balanced and imbalanced datasets. More importantly, we conduct case studies on colon neoplasms and esophageal neoplasms, further confirming the ability of MGCNSS to detect potential candidate miRNAs. The source code is publicly available on GitHub https://github.com/15136943622/MGCNSS/tree/master.


Subject(s)
Colonic Neoplasms , MicroRNAs , Humans , MicroRNAs/genetics , Algorithms , Computational Biology/methods , Software , Colonic Neoplasms/genetics
7.
J Cell Mol Med ; 28(7): e18180, 2024 04.
Article in English | MEDLINE | ID: mdl-38506066

ABSTRACT

Circular RNA (circRNA) is a common non-coding RNA and plays an important role in the diagnosis and therapy of human diseases, circRNA-disease associations prediction based on computational methods can provide a new way for better clinical diagnosis. In this article, we proposed a novel method for circRNA-disease associations prediction based on ensemble learning, named ELCDA. First, the association heterogeneous network was constructed via collecting multiple information of circRNAs and diseases, and multiple similarity measures are adopted here, then, we use metapath, matrix factorization and GraphSAGE-based models to extract features of nodes from different views, the final comprehensive features of circRNAs and diseases via ensemble learning, finally, a soft voting ensemble strategy is used to integrate the predicted results of all classifier. The performance of ELCDA is evaluated by fivefold cross-validation and compare with other state-of-the-art methods, the experimental results show that ELCDA is outperformance than others. Furthermore, three common diseases are used as case studies, which also demonstrate that ELCDA is an effective method for predicting circRNA-disease associations.


Subject(s)
Machine Learning , RNA, Circular , Humans , RNA, Circular/genetics , Computational Biology/methods
8.
Neural Netw ; 169: 496-505, 2024 Jan.
Article in English | MEDLINE | ID: mdl-37939538

ABSTRACT

Graph Convolutional Network (GCN) has become a hotspot in graph-based machine learning due to its powerful graph processing capability. Most of the existing GCN-based approaches are designed for single-view data. In numerous practical scenarios, data is expressed through multiple views, rather than a single view. The ability of GCN to model homogeneous graphs is indisputable, while it is insufficient in facing the heterophily property of multi-view data. In this paper, we revisit multi-view learning to propose an implicit heterogeneous graph convolutional network that efficiently captures the heterogeneity of multi-view data while exploiting the powerful feature aggregation capability of GCN. We automatically assign optimal importance to each view when constructing the meta-path graph. High-order cross-view meta-paths are explored based on the obtained graph, and a series of graph matrices are generated. Combining graph matrices with learnable global feature representation to obtain heterogeneous graph embeddings at various levels. Finally, in order to effectively utilize both local and global information, we introduce a graph-level attention mechanism at the meta-path level that allocates private information to each node individually. Extensive experimental results convincingly support the superior performance of the proposed method compared to other state-of-the-art approaches.


Subject(s)
Machine Learning , Neural Networks, Computer
9.
Math Biosci Eng ; 20(12): 20553-20575, 2023 Nov 14.
Article in English | MEDLINE | ID: mdl-38124565

ABSTRACT

Increasing amounts of experimental studies have shown that circular RNAs (circRNAs) play important regulatory roles in human diseases through interactions with related microRNAs (miRNAs). CircRNAs have become new potential disease biomarkers and therapeutic targets. Predicting circRNA-disease association (CDA) is of great significance for exploring the pathogenesis of complex diseases, which can improve the diagnosis level of diseases and promote the targeted therapy of diseases. However, determination of CDAs through traditional clinical trials is usually time-consuming and expensive. Computational methods are now alternative ways to predict CDAs. In this study, a new computational method, named PCDA-HNMP, was designed. For obtaining informative features of circRNAs and diseases, a heterogeneous network was first constructed, which defined circRNAs, mRNAs, miRNAs and diseases as nodes and associations between them as edges. Then, a deep analysis was conducted on the heterogeneous network by extracting meta-paths connecting to circRNAs (diseases), thereby mining hidden associations between various circRNAs (diseases). These associations constituted the meta-path-induced networks for circRNAs and diseases. The features of circRNAs and diseases were derived from the aforementioned networks via mashup. On the other hand, miRNA-disease associations (mDAs) were employed to improve the model's performance. miRNA features were yielded from the meta-path-induced networks on miRNAs and circRNAs, which were constructed from the meta-paths connecting miRNAs and circRNAs in the heterogeneous network. A concatenation operation was adopted to build the features of CDAs and mDAs. Such representations of CDAs and mDAs were fed into XGBoost to set up the model. The five-fold cross-validation yielded an area under the curve (AUC) of 0.9846, which was better than those of some existing state-of-the-art methods. The employment of mDAs can really enhance the model's performance and the importance analysis on meta-path-induced networks shown that networks produced by the meta-paths containing validated CDAs provided the most important contributions.


Subject(s)
3,4-Methylenedioxyamphetamine , MicroRNAs , Humans , RNA, Circular , Area Under Curve , MicroRNAs/genetics , RNA, Messenger/genetics
10.
BMC Bioinformatics ; 24(1): 335, 2023 Sep 11.
Article in English | MEDLINE | ID: mdl-37697297

ABSTRACT

Circular RNA (CircRNA) is a type of non-coding RNAs in which both ends are covalently linked. Researchers have demonstrated that many circRNAs can act as biomarkers of diseases. However, traditional experimental methods for circRNA-disease associations identification are labor-intensive. In this work, we propose a novel method based on the heterogeneous graph neural network and metapaths for circRNA-disease associations prediction termed as HMCDA. First, a heterogeneous graph consisting of circRNA-disease associations, circRNA-miRNA associations, miRNA-disease associations and disease-disease associations are constructed. Then, six metapaths are defined and generated according to the biomedical pathways. Afterwards, the entity content transformation, intra-metapath and inter-metapath aggregation are implemented to learn the embeddings of circRNA and disease entities. Finally, the learned embeddings are used to predict novel circRNA-disase associations. In particular, the result of extensive experiments demonstrates that HMCDA outperforms four state-of-the-art models in fivefold cross validation. In addition, our case study indicates that HMCDA has the ability to identify novel circRNA-disease associations.


Subject(s)
MicroRNAs , RNA, Circular , Research Design , Learning , MicroRNAs/genetics , Neural Networks, Computer
11.
Brief Bioinform ; 24(4)2023 07 20.
Article in English | MEDLINE | ID: mdl-37328701

ABSTRACT

Circular RNA (circRNA) is closely associated with human diseases. Accordingly, identifying the associations between human diseases and circRNA can help in disease prevention, diagnosis and treatment. Traditional methods are time consuming and laborious. Meanwhile, computational models can effectively predict potential circRNA-disease associations (CDAs), but are restricted by limited data, resulting in data with high dimension and imbalance. In this study, we propose a model based on automatically selected meta-path and contrastive learning, called the MPCLCDA model. First, the model constructs a new heterogeneous network based on circRNA similarity, disease similarity and known association, via automatically selected meta-path and obtains the low-dimensional fusion features of nodes via graph convolutional networks. Then, contrastive learning is used to optimize the fusion features further, and obtain the node features that make the distinction between positive and negative samples more evident. Finally, circRNA-disease scores are predicted through a multilayer perceptron. The proposed method is compared with advanced methods on four datasets. The average area under the receiver operating characteristic curve, area under the precision-recall curve and F1 score under 5-fold cross-validation reached 0.9752, 0.9831 and 0.9745, respectively. Simultaneously, case studies on human diseases further prove the predictive ability and application value of this method.


Subject(s)
Neural Networks, Computer , RNA, Circular , Humans , RNA, Circular/genetics , ROC Curve , Computational Biology/methods , Algorithms
12.
Front Physiol ; 14: 1156286, 2023.
Article in English | MEDLINE | ID: mdl-37228825

ABSTRACT

Introduction: Given the direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. In the past decades, computational models based on the quantitative structure-activity relationship have been proposed to screen out cardiotoxic compounds and have shown promising results. The combination of molecular fingerprint and the machine learning model shows stable performance for a wide spectrum of problems; however, not long after the advent of the graph neural network (GNN) deep learning model and its variant (e.g., graph transformer), it has become the principal way of quantitative structure-activity relationship-based modeling for its high flexibility in feature extraction and decision rule generation. Despite all these progresses, the expressiveness (the ability of a program to identify non-isomorphic graph structures) of the GNN model is bounded by the WL isomorphism test, and a suitable thresholding scheme that relates directly to the sensitivity and credibility of a model is still an open question. Methods: In this research, we further improved the expressiveness of the GNN model by introducing the substructure-aware bias by the graph subgraph transformer network model. Moreover, to propose the most appropriate thresholding scheme, a comprehensive comparison of the thresholding schemes was conducted. Results: Based on these improvements, the best model attains performance with 90.4% precision, 90.4% recall, and 90.5% F1-score with a dual-threshold scheme (active: <1µM; non-active: >30µM). The improved pipeline (graph subgraph transformer network model and thresholding scheme) also shows its advantages in terms of the activity cliff problem and model interpretability.

13.
Brief Bioinform ; 24(2)2023 03 19.
Article in English | MEDLINE | ID: mdl-36892155

ABSTRACT

Drug-target interaction (DTI) prediction can identify novel ligands for specific protein targets, and facilitate the rapid screening of effective new drug candidates to speed up the drug discovery process. However, the current methods are not sensitive enough to complex topological structures, and complicated relations between multiple node types are not fully captured yet. To address the above challenges, we construct a metapath-based heterogeneous bioinformatics network, and then propose a DTI prediction method with metapath-based hierarchical transformer and attention network for drug-target interaction prediction (MHTAN-DTI), applying metapath instance-level transformer, single-semantic attention and multi-semantic attention to generate low-dimensional vector representations of drugs and proteins. Metapath instance-level transformer performs internal aggregation on the metapath instances, and models global context information to capture long-range dependencies. Single-semantic attention learns the semantics of a certain metapath type, introduces the central node weight and assigns different weights to different metapath instances to obtain the semantic-specific node embedding. Multi-semantic attention captures the importance of different metapath types and performs weighted fusion to attain the final node embedding. The hierarchical transformer and attention network weakens the influence of noise data on the DTI prediction results, and enhances the robustness and generalization ability of MHTAN-DTI. Compared with the state-of-the-art DTI prediction methods, MHTAN-DTI achieves significant performance improvements. In addition, we also conduct sufficient ablation studies and visualize the experimental results. All the results demonstrate that MHTAN-DTI can offer a powerful and interpretable tool for integrating heterogeneous information to predict DTIs and provide new insights into drug discovery.


Subject(s)
Drug Development , Drug Discovery , Computer Simulation , Drug Discovery/methods , Proteins/chemistry , Learning
14.
Entropy (Basel) ; 25(2)2023 Feb 04.
Article in English | MEDLINE | ID: mdl-36832662

ABSTRACT

How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sampling technology under the guidance of metapaths, which can compress the network and retain the semantic information in the network as much as possible. At the same time, LHGI adopts the idea of contrastive learning, and takes the mutual information between normal/negative node vectors and the global graph vector as the objective function to guide the learning process. By maximizing the mutual information, LHGI solves the problem of how to train the network without supervised information. The experimental results show that, compared with the baseline models, the LHGI model shows a better feature extraction capability both in medium-scale unsupervised heterogeneous networks and in large-scale unsupervised heterogeneous networks. The node vectors generated by the LHGI model achieve better performance in the downstream mining tasks.

15.
J Transl Med ; 21(1): 48, 2023 01 25.
Article in English | MEDLINE | ID: mdl-36698208

ABSTRACT

BACKGROUND: Drug-target interaction (DTI) prediction has become a crucial prerequisite in drug design and drug discovery. However, the traditional biological experiment is time-consuming and expensive, as there are abundant complex interactions present in the large size of genomic and chemical spaces. For alleviating this phenomenon, plenty of computational methods are conducted to effectively complement biological experiments and narrow the search spaces into a preferred candidate domain. Whereas, most of the previous approaches cannot fully consider association behavior semantic information based on several schemas to represent complex the structure of heterogeneous biological networks. Additionally, the prediction of DTI based on single modalities cannot satisfy the demand for prediction accuracy. METHODS: We propose a multi-modal representation framework of 'DeepMPF' based on meta-path semantic analysis, which effectively utilizes heterogeneous information to predict DTI. Specifically, we first construct protein-drug-disease heterogeneous networks composed of three entities. Then the feature information is obtained under three views, containing sequence modality, heterogeneous structure modality and similarity modality. We proposed six representative schemas of meta-path to preserve the high-order nonlinear structure and catch hidden structural information of the heterogeneous network. Finally, DeepMPF generates highly representative comprehensive feature descriptors and calculates the probability of interaction through joint learning. RESULTS: To evaluate the predictive performance of DeepMPF, comparison experiments are conducted on four gold datasets. Our method can obtain competitive performance in all datasets. We also explore the influence of the different feature embedding dimensions, learning strategies and classification methods. Meaningfully, the drug repositioning experiments on COVID-19 and HIV demonstrate DeepMPF can be applied to solve problems in reality and help drug discovery. The further analysis of molecular docking experiments enhances the credibility of the drug candidates predicted by DeepMPF. CONCLUSIONS: All the results demonstrate the effectively predictive capability of DeepMPF for drug-target interactions. It can be utilized as a useful tool to prescreen the most potential drug candidates for the protein. The web server of the DeepMPF predictor is freely available at http://120.77.11.78/DeepMPF/ , which can help relevant researchers to further study.


Subject(s)
COVID-19 , Deep Learning , Humans , Molecular Docking Simulation , Semantics , Drug Discovery/methods , Proteins
16.
Brief Bioinform ; 24(1)2023 01 19.
Article in English | MEDLINE | ID: mdl-36592060

ABSTRACT

Drug-target interaction (DTI) prediction is an essential step in drug repositioning. A few graph neural network (GNN)-based methods have been proposed for DTI prediction using heterogeneous biological data. However, existing GNN-based methods only aggregate information from directly connected nodes restricted in a drug-related or a target-related network and are incapable of capturing high-order dependencies in the biological heterogeneous graph. In this paper, we propose a metapath-aggregated heterogeneous graph neural network (MHGNN) to capture complex structures and rich semantics in the biological heterogeneous graph for DTI prediction. Specifically, MHGNN enhances heterogeneous graph structure learning and high-order semantics learning by modeling high-order relations via metapaths. Additionally, MHGNN enriches high-order correlations between drug-target pairs (DTPs) by constructing a DTP correlation graph with DTPs as nodes. We conduct extensive experiments on three biological heterogeneous datasets. MHGNN favorably surpasses 17 state-of-the-art methods over 6 evaluation metrics, which verifies its efficacy for DTI prediction. The code is available at https://github.com/Zora-LM/MHGNN-DTI.


Subject(s)
Benchmarking , Drug Repositioning , Drug Delivery Systems , Learning , Neural Networks, Computer
17.
Health Inf Sci Syst ; 11(1): 5, 2023 Dec.
Article in English | MEDLINE | ID: mdl-36660407

ABSTRACT

Traditional Chinese Medicine (TCM) has been widely adopted in clinical practice by Eastern Asia people for thousands of years. Nowadays, TCM still plays a critical role in Chinese society and receives increasing attention worldwide. The existing herb recommenders learn the complex relations between symptoms and herbs by mining the TCM prescriptions. Given a set of symptoms, they will provide a set of herbs and explanations from the TCM theory. However, the foundation of TCM is Yinyangism (i.e. the combination of Five Phases theory with Yin-yang theory), which is very different from modern medicine philosophy. Only recommending herbs from the TCM theory aspect largely prevents TCM from modern medical treatment. As TCM and modern medicine share a common view at the molecular level, it is necessary to integrate the ancient practice of TCM and standards of modern medicine. In this paper, we explore the underlying action mechanisms of herbs from both TCM and modern medicine, and propose a Meta-path guided Graph Attention Network (MGAT) to provide the explainable herb recommendations. Technically, to translate TCM from an experience-based medicine to an evidence-based medicine system, we incorporate the pharmacology knowledge of modern Chinese medicine with the TCM knowledge. We design a meta-path guided information propagation scheme based on the extended knowledge graph, which combines information propagation and decision process. This scheme adopts meta-paths (predefined relation sequences) to guide neighbor selection in the propagation process. Furthermore, the attention mechanism is utilized in aggregation to help distinguish the salience of different paths connecting a symptom with a herb. In this way, our model can distill the long-range semantics along meta-paths and generate fine-grained explanations. We conduct extensive experiments on a public TCM dataset, demonstrating comparable performance to the state-of-the-art herb recommendation models and the strong explainability.

18.
Toxics ; 11(1)2023 Jan 12.
Article in English | MEDLINE | ID: mdl-36668800

ABSTRACT

Limited availability of fish metabolic pathways for PFAS may lead to risk assessments with inherent uncertainties based only upon the parent chemical or the assumption that the biodegradation or mammalian metabolism map data will serve as an adequate surrogate. A rapid and transparent process, utilizing a recently created database of systematically collected information for fish, mammals, poultry, plant, earthworm, sediment, sludge, bacteria, and fungus using data evaluation tools in the previously described metabolism pathway software system MetaPath, is presented. The fish metabolism maps for 10 PFAS, heptadecafluorooctyl(tridecafluorohexyl)phosphinic acid (C6/C8 PFPiA), bis(perfluorooctyl)phosphinic acid (C8/C8 PFPiA), 2-[(6-chloro-1,1,2,2,3,3,4,4,5,5,6,6-dodecafluorohexyl)oxy]-1,1,2,2-tetrafluoroethanesulfonic acid (6:2 Cl-PFESA), N-Ethylperfluorooctane-1-sulfonamide (Sulfuramid; N-EtFOSA), N-Ethyl Perfluorooctane Sulfonamido Ethanol phosphate diester (SAmPAP), Perfluorooctanesulfonamide (FOSA), 8:2 Fluorotelomer phosphate diester (8:2 diPAP), 8:2 fluorotelomer alcohol (8:2 FTOH), 10:2 fluorotelomer alcohol (10:2 FTOH), and 6:2 fluorotelomer sulfonamide alkylbetaine (6:2 FTAB), were compared across multiple species and systems. The approach demonstrates how comparisons of metabolic maps across species are aided by considering the sample matrix in which metabolites were quantified for each species, differences in analytical methods used to identify metabolites in each study, and the relative amounts of metabolites quantified. Overall, the pathways appear to be well conserved across species and systems. For PFAS lacking a fish metabolism study, a composite map consisting of all available maps would serve as the best basis for metabolite prediction. This emphasizes the importance and utility of collating metabolism into a searchable database such as that created in this effort.

19.
Neural Netw ; 157: 65-76, 2023 Jan.
Article in English | MEDLINE | ID: mdl-36334540

ABSTRACT

Heterogeneous information network embedding aims to learn low-dimensional node vectors in heterogeneous information networks (HINs), concerning not only structural information but also heterogeneity of diverse node and relation types. Most existing HIN embedding models mainly rely on metapath to define composite relations between node pairs and thus extract substructures from the original HIN. However, due to the pairwise structure of metapath, these models fail to capture the high-order relations (such as "Multiple authors co-authoring a paper") implicitly contained in HINs. To tackle the limitation, this paper proposes a Metapath-aware HyperGraph Transformer (Meta-HGT) for node embedding in HINs. Meta-HGT first extends metapath to guide the high-order relation extraction from original HIN and constructs multiple metapath based hypergraphs with diverse composite semantics. Then, Meta-HGT learns the latent node and hyperedge embeddings in each metapath based hypergraph through Meta-HGT layers. Each layer consists of two types of components, i.e., intra-hyperedge aggregation and inter-hyperedge aggregation, in which a novel type-dependent attention mechanism is proposed for node and hyperedge feature aggregation. Finally, it fuses multiple node embeddings learned from different metapath based hypergraphs via a semantic attention layer and generates the final node embeddings. Extensive experiments have been conducted on three HIN benchmarks for node classification. The results demonstrate that Meta-HGT achieves state-of-the-art performance on all three datasets.


Subject(s)
Learning , Semantics , Benchmarking , Information Services
20.
Brief Bioinform ; 24(1)2023 01 19.
Article in English | MEDLINE | ID: mdl-36562724

ABSTRACT

Drug combinations could trigger pharmacological therapeutic effects (TEs) and adverse effects (AEs). Many computational methods have been developed to predict TEs, e.g. the therapeutic synergy scores of anti-cancer drug combinations, or AEs from drug-drug interactions. However, most of the methods treated the AEs and TEs predictions as two separate tasks, ignoring the potential mechanistic commonalities shared between them. Based on previous clinical observations, we hypothesized that by learning the shared mechanistic commonalities between AEs and TEs, we could learn the underlying MoAs (mechanisms of actions) and ultimately improve the accuracy of TE predictions. To test our hypothesis, we formulated the TE prediction problem as a multi-task heterogeneous network learning problem that performed TE and AE learning tasks simultaneously. To solve this problem, we proposed Muthene (multi-task heterogeneous network embedding) and evaluated it on our collected drug-drug interaction dataset with both TEs and AEs indications. Our experimental results showed that, by including the AE prediction as an auxiliary task, Muthene generated more accurate TE predictions than standard single-task learning methods, which supports our hypothesis. Using a drug pair Vincristine-Dasatinib as a case study, we demonstrated that our method not only provides a novel way of TE predictions but also helps us gain a deeper understanding of the MoAs of drug combinations.


Subject(s)
Drug-Related Side Effects and Adverse Reactions , Humans , Drug Interactions , Drug Combinations , Machine Learning
SELECTION OF CITATIONS
SEARCH DETAIL