Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
Mol Divers ; 10(2): 207-12, 2006 May.
Article in English | MEDLINE | ID: mdl-16721627

ABSTRACT

Rule-based ensemble modelling has been used to develop a model with high accuracy and predictive capabilities for distinguishing between four different modes of toxic action for a set of 220 phenols. The model not only predicts the majority class (polar narcotics) well but also the other three classes (weak acid respiratory uncouplers, pro-electrophiles and soft electrophiles) of toxic action despite the severely skewed distribution among the four investigated classes. Furthermore, the investigation also highlights the merits of using ensemble (or consensus) modelling as an alternative to the more traditional development of a single model in order to promote robustness and accuracy with respect to the predictive capability for the derived model.


Subject(s)
Models, Chemical , Models, Statistical , Phenols/toxicity , Quantitative Structure-Activity Relationship , Databases, Factual
2.
J Med Chem ; 46(26): 5781-9, 2003 Dec 18.
Article in English | MEDLINE | ID: mdl-14667231

ABSTRACT

Three different multivariate statistical methods, PLS discriminant analysis, rule-based methods, and Bayesian classification, have been applied to multidimensional scoring data from four different target proteins: estrogen receptor alpha (ERalpha), matrix metalloprotease 3 (MMP3), factor Xa (fXa), and acetylcholine esterase (AChE). The purpose was to build classifiers able to discriminate between active and inactive compounds, given a structure-based virtual screen. Seven different scoring functions were used to generate the scoring matrices. The classifiers were compared to classical consensus scoring and single scoring functions. The classifiers show a superior performance, with rule-based methods being most effective. The precision of correctly predicting an active compound is about 90% for three of the targets and about 25% for acetylcholine esterase. On the basis of these results, a new two-stage approach is suggested for structure-based virtual screening where limited activity information is available.


Subject(s)
Multivariate Analysis , Quantitative Structure-Activity Relationship , Acetylcholinesterase/chemistry , Binding Sites , Estrogen Receptor alpha , Factor Xa/chemistry , Ligands , Matrix Metalloproteinase 3/chemistry , Receptors, Estrogen/chemistry
3.
Int J Med Inform ; 67(1-3): 49-61, 2002 Dec 04.
Article in English | MEDLINE | ID: mdl-12460631

ABSTRACT

A prerequisite for all higher level information extraction tasks is the identification of unknown names in text. Today, when large corpora can consist of billions of words, it is of utmost importance to develop accurate techniques for the automatic detection, extraction and categorization of named entities in these corpora. Although named entity recognition might be regarded a solved problem in some domains, it still poses a significant challenge in others. In this work we focus on one of the more difficult tasks, the identification of protein names in text. This task presents several interesting difficulties because of the named entities variant structural characteristics, their sometimes unclear status as names, the lack of common standards and fixed nomenclatures, and the specifics of the texts in the molecular biology domain in which they appear. We describe how we approached these and other difficulties in the implementation of Yapex, a system for the automatic identification of protein names in text. We also evaluate Yapex under four different notions of correctness and compare its performance to that of another publicly available system for protein name recognition.


Subject(s)
Information Storage and Retrieval , Linguistics , Medical Informatics , Molecular Biology , Names , Natural Language Processing , Proteins , Dictionaries as Topic , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...