Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
1.
Chem Cent J ; 6(1): 139, 2012 Nov 23.
Article in English | MEDLINE | ID: mdl-23173901

ABSTRACT

BACKGROUND: Src plays various roles in tumour progression, invasion, metastasis, angiogenesis and survival. It is one of the multiple targets of multi-target kinase inhibitors in clinical uses and trials for the treatment of leukemia and other cancers. These successes and appearances of drug resistance in some patients have raised significant interest and efforts in discovering new Src inhibitors. Various in-silico methods have been used in some of these efforts. It is desirable to explore additional in-silico methods, particularly those capable of searching large compound libraries at high yields and reduced false-hit rates. RESULTS: We evaluated support vector machines (SVM) as virtual screening tools for searching Src inhibitors from large compound libraries. SVM trained and tested by 1,703 inhibitors and 63,318 putative non-inhibitors correctly identified 93.53%~ 95.01% inhibitors and 99.81%~ 99.90% non-inhibitors in 5-fold cross validation studies. SVM trained by 1,703 inhibitors reported before 2011 and 63,318 putative non-inhibitors correctly identified 70.45% of the 44 inhibitors reported since 2011, and predicted as inhibitors 44,843 (0.33%) of 13.56M PubChem, 1,496 (0.89%) of 168 K MDDR, and 719 (7.73%) of 9,305 MDDR compounds similar to the known inhibitors. CONCLUSIONS: SVM showed comparable yield and reduced false hit rates in searching large compound libraries compared to the similarity-based and other machine-learning VS methods developed from the same set of training compounds and molecular descriptors. We tested three virtual hits of the same novel scaffold from in-house chemical libraries not reported as Src inhibitor, one of which showed moderate activity. SVM may be potentially explored for searching Src inhibitors from large compound libraries at low false-hit rates.

2.
Mol Biosyst ; 8(10): 2645-56, 2012 Oct.
Article in English | MEDLINE | ID: mdl-22833077

ABSTRACT

Some drugs, such as anticancer EGFR tyrosine kinase inhibitors, elicit markedly different clinical response rates due to differences in drug bypass signaling as well as genetic variations of drug target and downstream drug-resistant genes. The profiles of these bypass signaling are expected to be useful for improved drug response prediction, which have not been systematically explored previously. In this work, we searched and analyzed 16 literature-reported EGFR tyrosine kinase inhibitor bypass signaling routes in the EGFR pathway, which include 5 compensatory routes of EGFR transactivation by another receptor, and 11 alternative routes activated by another receptor. These 16 routes are reportedly regulated by 11 bypass genes. Their expression profiles together with the mutational, amplification and expression profiles of EGFR and 4 downstream drug-resistant genes, were used as new sets of biomarkers for identifying 53 NSCLC cell-lines sensitive or resistant to EGFR tyrosine kinase inhibitors gefitinib, erlotinib and lapatinib. The collective profiles of all 16 genes distinguish sensitive and resistant cell-lines are better than those of individual genes and the combined EGFR and downstream drug resistant genes, and their derived cell-line response rates are consistent with the reported clinical response rates of the three drugs. The usefulness of cell-line data for drug response studies was further analyzed by comparing the expression profiles of EGFR and bypass genes in NSCLC cell-lines and patient samples, and by using a machine learning feature selection method for selecting drug response biomarkers. Our study suggested that the profiles of drug bypass signaling are highly useful for improved drug response prediction.


Subject(s)
Antineoplastic Agents/pharmacology , Carcinoma, Non-Small-Cell Lung/drug therapy , ErbB Receptors/antagonists & inhibitors , Gene Expression Regulation, Neoplastic/drug effects , Lung Neoplasms/drug therapy , Protein Kinase Inhibitors/pharmacology , Signal Transduction/drug effects , Biomarkers, Pharmacological/metabolism , Carcinoma, Non-Small-Cell Lung/genetics , Carcinoma, Non-Small-Cell Lung/metabolism , Carcinoma, Non-Small-Cell Lung/pathology , Drug Resistance, Neoplasm/drug effects , ErbB Receptors/genetics , ErbB Receptors/metabolism , Erlotinib Hydrochloride , Gefitinib , Gene Expression Profiling , Humans , Lapatinib , Lung Neoplasms/genetics , Lung Neoplasms/metabolism , Lung Neoplasms/pathology , Mutation , Predictive Value of Tests , Quinazolines/pharmacology , Signal Transduction/genetics , Support Vector Machine
3.
PLoS One ; 7(6): e39076, 2012.
Article in English | MEDLINE | ID: mdl-22720033

ABSTRACT

Target selective drugs, such as dopamine receptor (DR) subtype selective ligands, are developed for enhanced therapeutics and reduced side effects. In silico methods have been explored for searching DR selective ligands, but encountered difficulties associated with high subtype similarity and ligand structural diversity. Machine learning methods have shown promising potential in searching target selective compounds. Their target selective capability can be further enhanced. In this work, we introduced a new two-step support vector machines target-binding and selectivity screening method for searching DR subtype-selective ligands, which was tested together with three previously-used machine learning methods for searching D1, D2, D3 and D4 selective ligands. It correctly identified 50.6%-88.0% of the 21-408 subtype selective and 71.7%-81.0% of the 39-147 multi-subtype ligands. Its subtype selective ligand identification rates are significantly better than, and its multi-subtype ligand identification rates are comparable to the best rates of the previously used methods. Our method produced low false-hit rates in screening 13.56 M PubChem, 168,016 MDDR and 657,736 ChEMBLdb compounds. Molecular features important for subtype selectivity were extracted by using the recursive feature elimination feature selection method. These features are consistent with literature-reported features. Our method showed similar performance in searching estrogen receptor subtype selective ligands. Our study demonstrated the usefulness of the two-step target binding and selectivity screening method in searching subtype selective ligands from large compound libraries.


Subject(s)
Receptors, Dopamine/metabolism , Support Vector Machine , Ligands , Protein Binding
4.
Nucleic Acids Res ; 40(Database issue): D1128-36, 2012 Jan.
Article in English | MEDLINE | ID: mdl-21948793

ABSTRACT

Knowledge and investigation of therapeutic targets (responsible for drug efficacy) and the targeted drugs facilitate target and drug discovery and validation. Therapeutic Target Database (TTD, http://bidd.nus.edu.sg/group/ttd/ttd.asp) has been developed to provide comprehensive information about efficacy targets and the corresponding approved, clinical trial and investigative drugs. Since its last update, major improvements and updates have been made to TTD. In addition to the significant increase of data content (from 1894 targets and 5028 drugs to 2025 targets and 17,816 drugs), we added target validation information (drug potency against target, effect against disease models and effect of target knockout, knockdown or genetic variations) for 932 targets, and 841 quantitative structure activity relationship models for active compounds of 228 chemical types against 121 targets. Moreover, we added the data from our previous drug studies including 3681 multi-target agents against 108 target pairs, 116 drug combinations with their synergistic, additive, antagonistic, potentiative or reductive mechanisms, 1427 natural product-derived approved, clinical trial and pre-clinical drugs and cross-links to the clinical trial information page in the ClinicalTrials.gov database for 770 clinical trial drugs. These updates are useful for facilitating target discovery and validation, drug lead discovery and optimization, and the development of multi-target drugs and drug combinations.


Subject(s)
Databases, Factual , Drug Discovery , Biological Products/therapeutic use , Clinical Trials as Topic , Drug Therapy, Combination , Quantitative Structure-Activity Relationship , Structure-Activity Relationship
5.
BMC Evol Biol ; 10: 77, 2010 Mar 15.
Article in English | MEDLINE | ID: mdl-20230639

ABSTRACT

BACKGROUND: Even after years of exploration, the terrestrial origin of bio-molecules remains unsolved and controversial. Today, observation of amino acid composition in proteins has become an alternative way for a global understanding of the mystery encoded in whole genomes and seeking clues for the origin of amino acids. RESULTS: In this study, we statistically monitored the frequencies of 20 alpha-amino acids in 549 taxa from three kingdoms of life: archaebacteria, eubacteria, and eukaryotes. We found that the amino acids evolved independently in these three kingdoms; but, conserved linkages were observed in two groups of amino acids, (A, G, H, L, P, Q, R, and W) and (F, I, K, N, S, and Y). Moreover, the amino acids encoded by GC-poor codons (F, Y, N, K, I, and M) were found to "lose" their usage in the development from single cell eukaryotic organisms like S. cerevisiae to H. sapiens, while the amino acids encoded by GC-rich codons (P, A, G, and W) were found to gain usage. These findings further support the co-evolution hypothesis of amino acids and genetic codes. CONCLUSION: We proposed a new chronological order of the appearance of amino acids (L, A, V/E/G, S, I, K, T, R/D, P, N, F, Q, Y, M, H, W, C). Two conserved evolutionary paths of amino acids were also suggested: A-->G-->R-->P and K-->Y.


Subject(s)
Amino Acids/genetics , Evolution, Molecular , Archaea/genetics , Bacteria/genetics , Eukaryota/genetics , Genetic Code , Genome
6.
Nucleic Acids Res ; 38(Database issue): D787-91, 2010 Jan.
Article in English | MEDLINE | ID: mdl-19933260

ABSTRACT

Increasing numbers of proteins, nucleic acids and other molecular entities have been explored as therapeutic targets, hundreds of which are targets of approved and clinical trial drugs. Knowledge of these targets and corresponding drugs, particularly those in clinical uses and trials, is highly useful for facilitating drug discovery. Therapeutic Target Database (TTD) has been developed to provide information about therapeutic targets and corresponding drugs. In order to accommodate increasing demand for comprehensive knowledge about the primary targets of the approved, clinical trial and experimental drugs, numerous improvements and updates have been made to TTD. These updates include information about 348 successful, 292 clinical trial and 1254 research targets, 1514 approved, 1212 clinical trial and 2302 experimental drugs linked to their primary targets (3382 small molecule and 649 antisense drugs with available structure and sequence), new ways to access data by drug mode of action, recursive search of related targets or drugs, similarity target and drug searching, customized and whole data download, standardized target ID, and significant increase of data (1894 targets, 560 diseases and 5028 drugs compared with the 433 targets, 125 diseases and 809 drugs in the original release described in previous paper). This database can be accessed at http://bidd.nus.edu.sg/group/cjttd/TTD.asp.


Subject(s)
Computational Biology/methods , Databases, Factual , Databases, Genetic , Pharmaceutical Preparations/chemistry , Chemistry, Pharmaceutical/methods , Clinical Trials as Topic , Computational Biology/trends , Databases, Protein , Drug Approval , Drug Therapy/methods , Humans , Information Storage and Retrieval/methods , Internet , Software , United States , United States Food and Drug Administration
7.
Nucleic Acids Res ; 35(Web Server issue): W538-42, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17526528

ABSTRACT

The interactions between cytokines and their complementary receptors are the gateways to properly understand a large variety of cytokine-specific cellular activities such as immunological responses and cell differentiation. To discover novel cytokine-receptor interactions, an advanced support vector machines (SVMs) model, CytoSVM, was constructed in this study. This model was iteratively trained using 449 mammal (except rat) cytokine-receptor interactions and about 1 million virtually generated positive and negative vectors in an enriched way. Final independent evaluation by rat's data received sensitivity of 97.4%, specificity of 99.2% and the Matthews correlation coefficient (MCC) of 0.89. This performance is better than normal SVM-based models. Upon this well-optimized model, a web-based server was created to accept primary protein sequence and present its probabilities to interact with one or several cytokines. Moreover, this model was applied to identify putative cytokine-receptor pairs in the whole genomes of human and mouse. Excluding currently known cytokine-receptor interactions, total 1609 novel cytokine-receptor pairs were discovered from human genome with probability approximately 80% after further transmembrane analysis. These cover 220 novel receptors (excluding their isoforms) for 126 human cytokines. The screening results have been deposited in a database. Both the server and the database can be freely accessed at http://bioinf.xmu.edu.cn/software/cytosvm/cytosvm.php.


Subject(s)
Computational Biology/methods , Cytokines/chemistry , Dipeptides/chemistry , Internet , Proteins/chemistry , Algorithms , Animals , Artificial Intelligence , Cytokines/classification , Databases, Protein , Genome , Humans , Reproducibility of Results , Sequence Analysis, Protein , Software , User-Computer Interface
8.
Bioinformatics ; 23(13): 1710-2, 2007 Jul 01.
Article in English | MEDLINE | ID: mdl-17463030

ABSTRACT

MOTIVATION: Drug-induced toxicity related proteins (DITRPs) are proteins that mediate adverse drug reactions (ADRs) or toxicities through their binding to drugs or reactive metabolites. Collection of these proteins facilitates better understanding of the molecular mechanisms of drug-induced toxicity and the rational drug discovery. Drug-induced toxicity related protein database (DITOP) is such a database that is intending to provide comprehensive information of DITRPs. Currently, DITOP contains 1501 records, covering 618 distinct literature-reported DITRPs, 529 drugs/ligands and 418 distinct toxicity terms. These proteins were confirmed experimentally to interact with drugs or their reactive metabolites, thus directly or indirectly cause adverse effects or toxicities. Five major types of drug-induced toxicities or ADRs are included in DITOP, which are the idiosyncratic adverse drug reactions, the dose-dependent toxicities, the drug-drug interactions, the immune-mediated adverse drug effects (IMADEs) and the toxicities caused by genetic susceptibility. Molecular mechanisms underlying the toxicity and cross-links to related resources are also provided while available. Moreover, a series of user-friendly interfaces were designed for flexible retrieval of DITRPs-related information. The DITOP can be accessed freely at http://bioinf.xmu.edu.cn/databases/ADR/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Adverse Drug Reaction Reporting Systems , Databases, Protein , Drug-Related Side Effects and Adverse Reactions , Pharmaceutical Preparations/chemistry , Proteins/chemistry
9.
Nucleic Acids Res ; 34(Web Server issue): W492-7, 2006 Jul 01.
Article in English | MEDLINE | ID: mdl-16845057

ABSTRACT

Gene Expression Pattern Scanner (GEPS) is a web-based server to provide interactive pattern analysis of user-submitted microarray data for facilitating their further interpretation. Putative gene expression patterns such as correlated expression, similar expression and specific expression are determined globally and systematically using geometric comparison and correlation analysis methods. These patterns can be visualized via linear plot with quantitative measures. User-defined threshold value is allowed to customize the format of the pattern search results. For better understanding of gene expression, patterns derived from 329,205 non-redundant gene expression records from the GNF SymAltas and the Gene Expression Omnibus are also provided. These profiles cover 24,277 human genes in 79 tissues, 32,905 mouse genes in 61 tissues and 4201 rat genes in 44 tissues. GEPS is available at http://bioinf.xmu.edu.cn/software/geps/geps.php.


Subject(s)
Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , Software , Animals , Data Interpretation, Statistical , Humans , Internet , Mice , Rats , User-Computer Interface
SELECTION OF CITATIONS
SEARCH DETAIL
...