Search | Regional Portal of the BVS

Clinician-centric diagnosis of rare genetic diseases: performance of a gene pertinence metric in decision support for clinicians.

Segal, Michael M; George, Renee; Waltman, Peter; El-Hattab, Ayman W; James, Kiely N; Stanley, Valentina; Gleeson, Joseph.

Orphanet J Rare Dis ; 15(1): 191, 2020 Jul 22.

Article in English | MEDLINE | ID: mdl-32698834

ABSTRACT

BACKGROUND: In diagnosis of rare genetic diseases we face a decision as to the degree to which the sequencing lab offers one or more diagnoses based on clinical input provided by the clinician, or the clinician reaches a diagnosis based on the complete set of variants provided by the lab. We tested a software approach to assist the clinician in making the diagnosis based on clinical findings and an annotated genomic variant table, using cases already solved using less automated processes.

RESULTS: For the 81 cases studied (involving 216 individuals), 70 had genetic abnormalities with phenotypes previously described in the literature, and 11 were not described in the literature at the time of analysis ("discovery genes"). These included cases beyond a trio, including ones with different variants in the same gene. In 100% of cases the abnormality was recognized. Of the 70, the abnormality was ranked #1 in 94% of cases, with an average rank 1.1 for all cases. Large CNVs could be analyzed in an integrated analysis, performed in 24 of the cases. The process is rapid enough to allow for periodic reanalysis of unsolved cases.

CONCLUSIONS: A clinician-friendly environment for clinical correlation can be provided to clinicians who are best positioned to have the clinical information needed for this interpretation.

Subjects

Rare Diseases; Software; DNA Copy Number Variations; Genomics; Humans; Phenotype; Rare Diseases/diagnosis; Rare Diseases/genetics

Identifying Aspects of the Post-Transcriptional Program Governing the Proteome of the Green Alga Micromonas pusilla.

Waltman, Peter H; Guo, Jian; Reistetter, Emily Nahas; Purvine, Samuel; Ansong, Charles K; van Baren, Marijke J; Wong, Chee-Hong; Wei, Chia-Lin; Smith, Richard D; Callister, Stephen J; Stuart, Joshua M; Worden, Alexandra Z.

PLoS One ; 11(7): e0155839, 2016.

Article in English | MEDLINE | ID: mdl-27434306

ABSTRACT

Micromonas is a unicellular motile alga within the Prasinophyceae, a green algal group that is related to land plants. This picoeukaryote (<2 µm diameter) is widespread in the marine environment but is not well understood at the cellular level. Here, we examine shifts in mRNA and protein expression over the course of the day-night cycle using triplicated mid-exponential, nutrient replete cultures of Micromonas pusilla CCMP1545. Samples were collected at key transition points during the diel cycle for evaluation using high-throughput LC-MS proteomics. In conjunction, matched mRNA samples from the same time points were sequenced using pair-ended directional Illumina RNA-Seq to investigate the dynamics and relationship between the mRNA and protein expression programs of M. pusilla. Similar to a prior study of the marine cyanobacterium Prochlorococcus, we found significant divergence in the mRNA and proteomics expression dynamics in response to the light:dark cycle. Additionally, expressional responses of genes and the proteins they encoded could also be variable within the same metabolic pathway, such as we observed in the oxygenic photosynthesis pathway. A regression framework was used to predict protein levels from both mRNA expression and gene-specific sequence-based features. Several features in the genome sequence were found to influence protein abundance including codon usage as well as 3' UTR length and structure. Collectively, our studies provide insights into the regulation of the proteome over a diel cycle as well as the relationships between transcriptional and translational programs in the widespread marine green alga Micromonas.

Subjects

Algal Proteins/genetics; Chlorophyta/genetics; Gene Expression Regulation, Plant; Proteomics; RNA, Algal/genetics; RNA, Messenger/genetics; 3' Untranslated Regions; Algal Proteins/metabolism; Chlorophyta/metabolism; Codon; Gene Ontology; Molecular Sequence Annotation; Photoperiod; Photosynthesis/genetics; Protein Biosynthesis; RNA, Algal/metabolism; RNA, Messenger/metabolism; Sequence Analysis, RNA; Transcription, Genetic
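The Micromonas abstract above mentions a regression framework that predicts protein levels from mRNA expression and gene-specific sequence features, but it does not say which regression model was used. The following is only a minimal sketch of the general idea, assuming ordinary least squares as a stand-in. The feature columns (log mRNA level, a codon-usage score, 3' UTR length), the numbers, and the function names are all hypothetical and are not taken from the paper.

```python
import numpy as np


def design_matrix(features):
    """Prepend an intercept column of ones to the feature matrix."""
    features = np.asarray(features, dtype=float)
    return np.column_stack([np.ones(len(features)), features])


def fit_linear_model(features, protein):
    """Ordinary least-squares fit; returns [intercept, coefficient per feature]."""
    coef, *_ = np.linalg.lstsq(design_matrix(features), protein, rcond=None)
    return coef


def predict(coef, features):
    """Predicted protein level for each row of features."""
    return design_matrix(features) @ coef


# Made-up toy genes. Columns (all hypothetical):
# log mRNA level, codon-usage score, 3' UTR length in nucleotides.
toy_features = np.array([
    [1.0, 0.30, 120.0],
    [2.0, 0.55, 80.0],
    [3.0, 0.40, 200.0],
    [1.5, 0.70, 150.0],
    [2.5, 0.25, 60.0],
    [3.5, 0.60, 110.0],
])

# Toy "protein levels" generated from known coefficients, so the fit can be checked.
true_coef = np.array([0.5, 1.0, 2.0, -0.01])
toy_protein = predict(true_coef, toy_features)

coef = fit_linear_model(toy_features, toy_protein)
```

Because the toy protein values are generated without noise from `true_coef`, the fitted `coef` should match it up to floating-point error. With real measurements the fit would be approximate, and a held-out set would be needed to judge predictive value.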

Comparative microbial modules resource: generation and visualization of multi-species biclusters.

Kacmarczyk, Thadeous; Waltman, Peter; Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard.

PLoS Comput Biol ; 7(12): e1002228, 2011 Dec.

Article in English | MEDLINE | ID: mdl-22144874

ABSTRACT

The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures - results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation.

Subjects

Algorithms; Computational Biology/methods; Databases, Factual; Genome, Bacterial; Software; Cluster Analysis; Escherichia coli/genetics; Escherichia coli/metabolism; Escherichia coli/physiology; Nitrogen/metabolism; Salmonella typhimurium/genetics; Salmonella typhimurium/metabolism; Salmonella typhimurium/physiology; Systems Biology; User-Computer Interface

Multi-species integrative biclustering.

Waltman, Peter; Kacmarczyk, Thadeous; Bate, Ashley R; Kearns, Daniel B; Reiss, David J; Eichenberger, Patrick; Bonneau, Richard.

Genome Biol ; 11(9): R96, 2010.

Article in English | MEDLINE | ID: mdl-20920250

ABSTRACT

We describe an algorithm, multi-species cMonkey, for the simultaneous biclustering of heterogeneous multiple-species data collections and apply the algorithm to a group of bacteria containing Bacillus subtilis, Bacillus anthracis, and Listeria monocytogenes. The algorithm reveals evolutionary insights into the surprisingly high degree of conservation of regulatory modules across these three species and allows data and insights from well-studied organisms to complement the analysis of related but less well studied organisms.

Subjects

Algorithms; Bacillus/genetics; Cluster Analysis; Data Mining; Genomics; Listeria monocytogenes/genetics; Multigene Family; Bacillus anthracis/genetics; Bacillus subtilis/genetics; Base Sequence; Computational Biology/methods; Gene Expression Profiling/methods; Gene Expression Regulation, Bacterial; Gene Regulatory Networks; Genome, Bacterial; Models, Genetic; Pattern Recognition, Automated

FiberID--a technique to identify fibrous protein subclasses.

Waltman, Peter; Blumer, Anselm; Kaplan, David.

Proteins ; 66(1): 127-35, 2007 Jan 01.

Article in English | MEDLINE | ID: mdl-17039548

ABSTRACT

Fibrous proteins such as collagen, silk, and elastin play critical biological roles, yet they have been the subject of few projects that use computational techniques to predict either their class or their structure. In this article, we present FiberID, a simple yet effective method for identifying and distinguishing three fibrous protein subclasses from their primary sequences. Using a combination of amino acid composition and fast Fourier measurements, FiberID can classify fibrous proteins belonging to these subclasses with high accuracy by using two standard machine learning techniques (decision trees and Naïve Bayesian classifiers). After presenting our results, we present several fibrous sequences that are regularly misclassified by FiberID as sequences of potential interest for further study. Finally, we analyze the decision trees developed by FiberID for potential insights regarding the structure of these proteins.

Subjects

Algorithms; Collagen/classification; Computational Biology/methods; Elastin/classification; Silk/classification; Software; Amino Acid Sequence; Animals; Bayes Theorem; Collagen/chemistry; Databases, Protein; Elastin/chemistry; Humans; Molecular Sequence Data; Silk/chemistry
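The FiberID abstract above describes features built from amino acid composition and Fourier measurements of the primary sequence. The sketch below illustrates what such features might look like; it is not FiberID's implementation. It assumes a 0/1 indicator signal per residue and uses a naive discrete Fourier transform from the standard library in place of a fast Fourier transform. The function names and the synthetic Gly-Pro-Pro repeat are illustrative assumptions, and the classification step (decision trees, naive Bayes) is not shown.

```python
import cmath
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def composition(seq):
    """Fraction of each standard amino acid in seq."""
    counts = Counter(seq)
    return {aa: counts.get(aa, 0) / len(seq) for aa in AMINO_ACIDS}


def power_spectrum(seq, residue):
    """Power at frequencies k = 0..n/2 of the 0/1 indicator signal for residue.

    Naive O(n^2) DFT, used here instead of an FFT to stay dependency-free.
    """
    signal = [1.0 if aa == residue else 0.0 for aa in seq]
    n = len(signal)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        spectrum.append(abs(coeff) ** 2 / n)
    return spectrum


def dominant_period(seq, residue):
    """Period, in residues, of the strongest non-zero frequency (needs len(seq) >= 2)."""
    spectrum = power_spectrum(seq, residue)
    k = max(range(1, len(spectrum)), key=spectrum.__getitem__)
    return len(seq) / k


# Synthetic collagen-like toy sequence: glycine at every third position.
toy = "GPP" * 10
print(composition(toy)["G"])      # fraction of glycine in the toy sequence
print(dominant_period(toy, "G"))  # strongest glycine periodicity, in residues
```

For the toy repeat, glycine recurs every third residue, so the strongest periodicity is 3. A feature vector combining the composition with such spectral values could then be passed to a standard classifier.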

Interpreter of maladies: redescription mining applied to biomedical data analysis.

Waltman, Peter; Pearlman, Alex; Mishra, Bud.

Pharmacogenomics ; 7(3): 503-9, 2006 Apr.

Article in English | MEDLINE | ID: mdl-16610960

ABSTRACT

Comprehensive, systematic and integrated data-centric statistical approaches to disease modeling can provide powerful frameworks for understanding disease etiology. Here, one such computational framework based on redescription mining in both its incarnations, static and dynamic, is discussed. The static framework provides bioinformatic tools applicable to multifaceted datasets, containing genetic, transcriptomic, proteomic, and clinical data for diseased patients and normal subjects. The dynamic redescription framework provides systems biology tools to model complex sets of regulatory, metabolic and signaling pathways in the initiation and progression of a disease. As an example, the case of chronic fatigue syndrome (CFS) is considered, which has so far remained intractable and unpredictable in its etiology and nosology. The redescription mining approaches can be applied to the Centers for Disease Control and Prevention's Wichita (KS, USA) dataset, integrating transcriptomic, epidemiological and clinical data, and can also be used to study how pathways in the hypothalamic-pituitary-adrenal axis affect CFS patients.

Subjects

Data Interpretation, Statistical; Databases, Factual; Algorithms; Fatigue Syndrome, Chronic/epidemiology; Fatigue Syndrome, Chronic/genetics; Humans; Models, Statistical
