Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Nature ; 450(7172): 1091-5, 2007 Dec 13.
Article in English | MEDLINE | ID: mdl-18046333

ABSTRACT

Infection with the malaria parasite Plasmodium falciparum leads to widely different clinical conditions in children, ranging from mild flu-like symptoms to coma and death. Despite the immense medical implications, the genetic and molecular basis of this diversity remains largely unknown. Studies of in vitro gene expression have found few transcriptional differences between different parasite strains. Here we present a large study of in vivo expression profiles of parasites derived directly from blood samples from infected patients. The in vivo expression profiles define three distinct transcriptional states. The biological basis of these states can be interpreted by comparison with an extensive compendium of expression data in the yeast Saccharomyces cerevisiae. The three states in vivo closely resemble, first, active growth based on glycolytic metabolism, second, a starvation response accompanied by metabolism of alternative carbon sources, and third, an environmental stress response. The glycolytic state is highly similar to the known profile of the ring stage in vitro, but the other states have not been observed in vitro. The results reveal a previously unknown physiological diversity in the in vivo biology of the malaria parasite, in particular evidence for a functional mitochondrion in the asexual-stage parasite, and indicate in vivo and in vitro studies to determine how this variation may affect disease manifestations and treatment.


Subject(s)
Malaria, Falciparum/parasitology , Plasmodium falciparum/metabolism , Animals , Cluster Analysis , Fatty Acids/metabolism , Gene Expression Profiling , Gene Expression Regulation , Glycolysis/genetics , Humans , Malaria, Falciparum/blood , Oligonucleotide Array Sequence Analysis , Plasmodium falciparum/genetics , Plasmodium falciparum/growth & development , Plasmodium falciparum/pathogenicity , Transcription, Genetic , Tricarboxylic Acids/metabolism
2.
Bioinformatics ; 20(11): 1797-8, 2004 Jul 22.
Article in English | MEDLINE | ID: mdl-14988123

ABSTRACT

SUMMARY: GeneCluster 2.0 is a software package for analyzing gene expression and other bioarray data, giving users a variety of methods to build and evaluate class predictors, visualize marker lists, cluster data and validate results. GeneCluster 2.0 greatly expands the data analysis capabilities of GeneCluster 1.0 by adding classification, class discovery and permutation test methods. It includes algorithms for building and testing supervised models using weighted voting and k-nearest neighbor algorithms, a module for systematically finding and evaluating clustering via self-organizing maps, and modules for marker gene selection and heat map visualization that allow users to view and sort samples and genes by many criteria. GeneCluster 2.0 is a stand-alone Java application and runs on any platform that supports the Java Runtime Environment version 1.3.1 or greater. AVAILABILITY: http://www.broad.mit.edu/cancer/software


Subject(s)
Algorithms , Cluster Analysis , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , Sequence Analysis, DNA/methods , Software , User-Computer Interface , Computer Graphics , DNA/analysis , DNA/chemistry , DNA/classification , Pattern Recognition, Automated
3.
Proc Natl Acad Sci U S A ; 98(26): 15149-54, 2001 Dec 18.
Article in English | MEDLINE | ID: mdl-11742071

ABSTRACT

The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.


Subject(s)
Gene Expression Profiling , Neoplasms/classification , Neoplasms/diagnosis , Biomarkers, Tumor , Cluster Analysis , Humans , Multigene Family , Neoplasms/genetics
4.
Proc Natl Acad Sci U S A ; 98(19): 10787-92, 2001 Sep 11.
Article in English | MEDLINE | ID: mdl-11553813

ABSTRACT

In an effort to develop a genomics-based approach to the prediction of drug response, we have developed an algorithm for classification of cell line chemosensitivity based on gene expression profiles alone. Using oligonucleotide microarrays, the expression levels of 6,817 genes were measured in a panel of 60 human cancer cell lines (the NCI-60) for which the chemosensitivity profiles of thousands of chemical compounds have been determined. We sought to determine whether the gene expression signatures of untreated cells were sufficient for the prediction of chemosensitivity. Gene expression-based classifiers of sensitivity or resistance for 232 compounds were generated and then evaluated on independent sets of data. The classifiers were designed to be independent of the cells' tissue of origin. The accuracy of chemosensitivity prediction was considerably better than would be expected by chance. Eighty-eight of 232 expression-based classifiers performed accurately (with P < 0.05) on an independent test set, whereas only 12 of the 232 would be expected to do so by chance. These results suggest that at least for a subset of compounds genomic approaches to chemosensitivity prediction are feasible.


Subject(s)
Drug Resistance, Neoplasm/genetics , Neoplasms/genetics , Transcription, Genetic , Gene Expression Profiling , Humans , Neoplasms/drug therapy , Oligonucleotide Array Sequence Analysis/methods , Predictive Value of Tests , Tumor Cells, Cultured
5.
Nature ; 409(6822): 860-921, 2001 Feb 15.
Article in English | MEDLINE | ID: mdl-11237011

ABSTRACT

The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.


Subject(s)
Genome, Human , Human Genome Project , Sequence Analysis, DNA , Animals , Chromosome Mapping , Conserved Sequence , CpG Islands , DNA Transposable Elements , Databases, Factual , Drug Industry , Evolution, Molecular , Forecasting , GC Rich Sequence , Gene Duplication , Genes , Genetic Diseases, Inborn , Genetics, Medical , Humans , Mutation , Private Sector , Proteins/genetics , Proteome , Public Sector , RNA/genetics , Repetitive Sequences, Nucleic Acid , Sequence Analysis, DNA/methods , Species Specificity
6.
Genome Res ; 10(7): 950-8, 2000 Jul.
Article in English | MEDLINE | ID: mdl-10899144

ABSTRACT

We describe a novel analytical approach to gene recognition based on cross-species comparison. We first undertook a comparison of orthologous genomic loci from human and mouse, studying the extent of similarity in the number, size and sequence of exons and introns. We then developed an approach for recognizing genes within such orthologous regions by first aligning the regions using an iterative global alignment system and then identifying genes based on conservation of exonic features at aligned positions in both species. The alignment and gene recognition are performed by new programs called and, respectively. performed well at exact identification of coding exons in 117 orthologous pairs tested.


Subject(s)
Exons/genetics , Genes/genetics , Amino Acids/genetics , Animals , Codon/genetics , Genetic Markers , Genomic Library , Humans , Introns/genetics , Mice , Sequence Alignment , Sequence Analysis, DNA , Software , Species Specificity , Spliceosomes/genetics
7.
Science ; 286(5439): 531-7, 1999 Oct 15.
Article in English | MEDLINE | ID: mdl-10521349

ABSTRACT

Although cancer classification has improved over the past 30 years, there has been no general approach for identifying new cancer classes (class discovery) or for assigning tumors to known classes (class prediction). Here, a generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case. A class discovery procedure automatically discovered the distinction between acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) without previous knowledge of these classes. An automatically derived class predictor was able to determine the class of new leukemia cases. The results demonstrate the feasibility of cancer classification based solely on gene expression monitoring and suggest a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.


Subject(s)
Gene Expression Profiling , Leukemia, Myeloid/classification , Leukemia, Myeloid/genetics , Precursor Cell Lymphoblastic Leukemia-Lymphoma/classification , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics , Acute Disease , Antineoplastic Combined Chemotherapy Protocols/therapeutic use , Cell Adhesion/genetics , Cell Cycle/genetics , Homeodomain Proteins/genetics , Humans , Leukemia, Myeloid/drug therapy , Neoplasm Proteins/genetics , Neoplasms/classification , Neoplasms/genetics , Oligonucleotide Array Sequence Analysis , Oncogenes , Precursor Cell Lymphoblastic Leukemia-Lymphoma/drug therapy , Predictive Value of Tests , Reproducibility of Results , Treatment Outcome
8.
J Mol Biol ; 225(4): 1049-63, 1992 Jun 20.
Article in English | MEDLINE | ID: mdl-1613789

ABSTRACT

We have developed a hybrid system to predict the secondary structures (alpha-helix, beta-sheet and coil) of proteins and achieved 66.4% accuracy, with correlation coefficients of C(coil) = 0.429, C alpha = 0.470 and C beta = 0.387. This system contains three subsystems ("experts"): a neural network module, a statistical module and a memory-based reasoning module. First, the three experts independently learn the mapping between amino acid sequences and secondary structures from the known protein structures, then a Combiner learns to combine automatically the outputs of the experts to make final predictions. The hybrid system was tested with 107 protein structures through k-way cross-validation. Its performance was better than each expert and all previously reported methods with greater than 0.99 statistical significance. It was observed that for 20% of the residues, all three experts produced the same but wrong predictions. This may suggest an upper bound on the accuracy of secondary structure predictions based on local information from the currently available protein structures, and indicate places where non-local interactions may play a dominant role in conformation. For 64% of the residues, at least two experts were the same and correct, which shows that the Combiner performed better than majority vote. For 77% of the residues, at least one expert was correct, thus there may still be room for improvement in this hybrid approach. Rigorous evaluation procedures were used in testing the hybrid system, and statistical significance measures were developed in analyzing the differences among different methods. When measured in terms of the number of secondary structures (rather than the number of residues) that were predicted correctly, the prediction produced by the hybrid system was also better than those of individual experts.


Subject(s)
Protein Conformation , Proteins/chemistry , Amino Acid Sequence , Enzymes/chemistry , Mathematics , Models, Statistical , Molecular Sequence Data , Software
SELECTION OF CITATIONS
SEARCH DETAIL
...