Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters










Database
Language
Publication year range
1.
Artif Intell Med ; 49(2): 93-104, 2010 Jun.
Article in English | MEDLINE | ID: mdl-20347582

ABSTRACT

OBJECTIVE: Successful use of classifiers that learn to make decisions from a set of patient examples require robust methods for performance estimation. Recently many promising approaches for determination of an upper bound for the error rate of a single classifier have been reported but the Bayesian credibility interval (CI) obtained from a conventional holdout test still delivers one of the tightest bounds. The conventional Bayesian CI becomes unacceptably large in real world applications where the test set sizes are less than a few hundred. The source of this problem is that fact that the CI is determined exclusively by the result on the test examples. In other words, there is no information at all provided by the uniform prior density distribution employed which reflects complete lack of prior knowledge about the unknown error rate. Therefore, the aim of the study reported here was to study a maximum entropy (ME) based approach to improved prior knowledge and Bayesian CIs, demonstrating its relevance for biomedical research and clinical practice. METHOD AND MATERIAL: It is demonstrated how a refined non-uniform prior density distribution can be obtained by means of the ME principle using empirical results from a few designs and tests using non-overlapping sets of examples. RESULTS: Experimental results show that ME based priors improve the CIs when employed to four quite different simulated and two real world data sets. CONCLUSIONS: An empirically derived ME prior seems promising for improving the Bayesian CI for the unknown error rate of a designed classifier.


Subject(s)
Artificial Intelligence , Bayes Theorem , Data Mining , Databases as Topic , Decision Support Systems, Clinical , Models, Statistical , Algorithms , Breast Neoplasms/classification , Breast Neoplasms/diagnosis , Computer Simulation , Decision Trees , Empirical Research , Female , Fungal Proteins/classification , Fungal Proteins/physiology , Humans , Linear Models , Normal Distribution , Predictive Value of Tests , Prognosis , Reproducibility of Results , Vocabulary, Controlled
2.
Leuk Res ; 31(7): 931-8, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17207527

ABSTRACT

Microarray-based gene expression analysis was performed on diagnostic chronic phase CML patient samples prior to interferon treatment. Fifteen patient samples corresponding to six cytogenetic responders and nine non-responders were included. Genes differentially expressed between responder and non-responder patients were listed and a subsequent leave-one-out cross validation (LOOV) procedure showed that the top 20 genes allowed the highest prediction accuracy. The relevant genes were quantified by real-time PCR that supported the microarray results. We conclude that it might be possible to use gene expression analysis to predict future response to interferon in CML diagnostic samples.


Subject(s)
Antineoplastic Agents/therapeutic use , Biomarkers, Tumor/genetics , Gene Expression Regulation, Leukemic , Gene Expression Regulation, Neoplastic/drug effects , Interferon-alpha/therapeutic use , Leukemia, Myeloid, Chronic-Phase/drug therapy , Leukemia, Myeloid, Chronic-Phase/genetics , Adult , Aged , Biomarkers, Tumor/metabolism , Cytarabine/therapeutic use , Fusion Proteins, bcr-abl/genetics , Gene Expression Profiling , Humans , Hydroxyurea/therapeutic use , Leukemia, Myeloid, Chronic-Phase/metabolism , Middle Aged , Oligonucleotide Array Sequence Analysis , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA, Neoplasm/blood , RNA, Neoplasm/genetics , RNA, Neoplasm/isolation & purification , Reverse Transcriptase Polymerase Chain Reaction
3.
Tumour Biol ; 27(4): 211-20, 2006.
Article in English | MEDLINE | ID: mdl-16675914

ABSTRACT

OBJECTIVE: To identify molecular markers useful for the diagnostic discrimination of benign and malignant follicular thyroid tumors. METHODS: A panel of thyroid tumors was characterized with expression profiling using cDNA microarrays. A robust algorithm for gene selection was developed to identify molecular markers useful for the classification of heterogeneous tumor classes. The study included tumor tissue specimens from 10 patients with benign follicular adenomas and from 10 with malignant tumors. The malignant tumors mainly consisted of clinically relevant minimally invasive follicular carcinomas. The mRNA expression level of a candidate gene, FHL1, was evaluated in an independent series of 61 tumors. RESULTS: 22 gene expression markers were identified as differentially expressed. Several of the identified genes, for example DIO1, CITED1, CA12 and FN1, have previously been observed as differentially expressed in various thyroid tumors. FHL1 was significantly underexpressed in carcinomas compared to adenomas in the independent panel of tumors. The results indicate that a small number of genes can be useful to distinguish follicular adenomas from follicular carcinomas. CONCLUSIONS: Our findings clearly corroborate previous studies and identify novel candidate molecular markers. These genes have the potential for molecular classification of follicular thyroid tumors and for providing improved understanding of the molecular mechanisms involved in thyroid malignancies.


Subject(s)
Adenoma/genetics , Genetic Markers , Thyroid Diseases/diagnosis , Thyroid Neoplasms/genetics , Adenocarcinoma, Follicular/diagnosis , Adenocarcinoma, Follicular/genetics , Adenocarcinoma, Follicular/pathology , Adenoma/diagnosis , Adult , Aged , DNA Primers , Diagnosis, Differential , Female , Humans , Male , Middle Aged , Neoplasm Invasiveness , Oligonucleotide Array Sequence Analysis , Polymerase Chain Reaction , Thyroid Neoplasms/diagnosis , Thyroid Neoplasms/pathology
4.
BMC Bioinformatics ; 7: 127, 2006 Mar 13.
Article in English | MEDLINE | ID: mdl-16533392

ABSTRACT

BACKGROUND: Supervised learning for classification of cancer employs a set of design examples to learn how to discriminate between tumors. In practice it is crucial to confirm that the classifier is robust with good generalization performance to new examples, or at least that it performs better than random guessing. A suggested alternative is to obtain a confidence interval of the error rate using repeated design and test sets selected from available examples. However, it is known that even in the ideal situation of repeated designs and tests with completely novel samples in each cycle, a small test set size leads to a large bias in the estimate of the true variance between design sets. Therefore different methods for small sample performance estimation such as a recently proposed procedure called Repeated Random Sampling (RSS) is also expected to result in heavily biased estimates, which in turn translates into biased confidence intervals. Here we explore such biases and develop a refined algorithm called Repeated Independent Design and Test (RIDT). RESULTS: Our simulations reveal that repeated designs and tests based on resampling in a fixed bag of samples yield a biased variance estimate. We also demonstrate that it is possible to obtain an improved variance estimate by means of a procedure that explicitly models how this bias depends on the number of samples used for testing. For the special case of repeated designs and tests using new samples for each design and test, we present an exact analytical expression for how the expected value of the bias decreases with the size of the test set. CONCLUSION: We show that via modeling and subsequent reduction of the small sample bias, it is possible to obtain an improved estimate of the variance of classifier performance between design sets. However, the uncertainty of the variance estimate is large in the simulations performed indicating that the method in its present form cannot be directly applied to small data sets.


Subject(s)
Artificial Intelligence , Biomarkers, Tumor/analysis , Diagnosis, Computer-Assisted/methods , Gene Expression Profiling/methods , Neoplasm Proteins/analysis , Neoplasms/diagnosis , Analysis of Variance , Bias , Humans , Models, Biological , Models, Statistical , Neoplasms/metabolism , Oligonucleotide Array Sequence Analysis/methods , Pattern Recognition, Automated/methods , Reproducibility of Results , Sample Size , Sensitivity and Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...