Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
NPJ Precis Oncol ; 8(1): 134, 2024 Jun 19.
Artigo em Inglês | MEDLINE | ID: mdl-38898127

RESUMO

While alterations in nucleus size, shape, and color are ubiquitous in cancer, comprehensive quantification of nuclear morphology across a whole-slide histologic image remains a challenge. Here, we describe the development of a pan-tissue, deep learning-based digital pathology pipeline for exhaustive nucleus detection, segmentation, and classification and the utility of this pipeline for nuclear morphologic biomarker discovery. Manually-collected nucleus annotations were used to train an object detection and segmentation model for identifying nuclei, which was deployed to segment nuclei in H&E-stained slides from the BRCA, LUAD, and PRAD TCGA cohorts. Interpretable features describing the shape, size, color, and texture of each nucleus were extracted from segmented nuclei and compared to measurements of genomic instability, gene expression, and prognosis. The nuclear segmentation and classification model trained herein performed comparably to previously reported models. Features extracted from the model revealed differences sufficient to distinguish between BRCA, LUAD, and PRAD. Furthermore, cancer cell nuclear area was associated with increased aneuploidy score and homologous recombination deficiency. In BRCA, increased fibroblast nuclear area was indicative of poor progression-free and overall survival and was associated with gene expression signatures related to extracellular matrix remodeling and anti-tumor immunity. Thus, we developed a powerful pan-tissue approach for nucleus segmentation and featurization, enabling the construction of predictive models and the identification of features linking nuclear morphology with clinically-relevant prognostic biomarkers across multiple cancer types.

2.
J Thorac Oncol ; 2023 Dec 07.
Artigo em Inglês | MEDLINE | ID: mdl-38070597

RESUMO

INTRODUCTION: Pathologic response (PathR) by histopathologic assessment of resected specimens may be an early clinical end point associated with long-term outcomes with neoadjuvant therapy. Digital pathology may improve the efficiency and precision of PathR assessment. LCMC3 (NCT02927301) evaluated neoadjuvant atezolizumab in patients with resectable NSCLC and reported a 20% major PathR rate. METHODS: We determined PathR in primary tumor resection specimens using guidelines-based visual techniques and developed a convolutional neural network model using the same criteria to digitally measure the percent viable tumor on whole-slide images. Concordance was evaluated between visual determination of percent viable tumor (n = 151) performed by one of the 47 local pathologists and three central pathologists. RESULTS: For concordance among visual determination of percent viable tumor, the interclass correlation coefficient was 0.87 (95% confidence interval [CI]: 0.84-0.90). Agreement for visually assessed 10% or less viable tumor (major PathR [MPR]) in the primary tumor was 92.1% (Fleiss kappa = 0.83). Digitally assessed percent viable tumor (n = 136) correlated with visual assessment (Pearson r = 0.73; digital/visual slope = 0.28). Digitally assessed MPR predicted visually assessed MPR with outstanding discrimination (area under receiver operating characteristic curve, 0.98) and was associated with longer disease-free survival (hazard ratio [HR] = 0.30; 95% CI: 0.09-0.97, p = 0.033) and overall survival (HR = 0.14, 95% CI: 0.02-1.06, p = 0.027) versus no MPR. Digitally assessed PathR strongly correlated with visual measurements. CONCLUSIONS: Artificial intelligence-powered digital pathology exhibits promise in assisting pathologic assessments in neoadjuvant NSCLC clinical trials. The development of artificial intelligence-powered approaches in clinical settings may aid pathologists in clinical operations, including routine PathR assessments, and subsequently support improved patient care and long-term outcomes.

3.
medRxiv ; 2023 Apr 25.
Artigo em Inglês | MEDLINE | ID: mdl-37162870

RESUMO

Clinical trials in nonalcoholic steatohepatitis (NASH) require histologic scoring for assessment of inclusion criteria and endpoints. However, guidelines for scoring key features have led to variability in interpretation, impacting clinical trial outcomes. We developed an artificial intelligence (AI)-based measurement (AIM) tool for scoring NASH histology (AIM-NASH). AIM-NASH predictions for NASH Clinical Research Network (CRN) grades of necroinflammation and stages of fibrosis aligned with expert consensus scores and were reproducible. Continuous scores produced by AIM-NASH for key histological features of NASH correlated with mean pathologist scores and with noninvasive biomarkers and strongly predicted patient outcomes. In a retrospective analysis of the ATLAS trial, previously unmet pathological endpoints were met when scored by the AIM-NASH algorithm alone. Overall, these results suggest that AIM-NASH may assist pathologists in histologic review of NASH clinical trials, reducing inter-rater variability on trial outcomes and offering a more sensitive and reproducible measure of patient therapeutic response.

4.
Cell Rep Med ; 4(4): 101016, 2023 04 18.
Artigo em Inglês | MEDLINE | ID: mdl-37075704

RESUMO

Nonalcoholic steatohepatitis (NASH) is the most common chronic liver disease globally and a leading cause for liver transplantation in the US. Its pathogenesis remains imprecisely defined. We combined two high-resolution modalities to tissue samples from NASH clinical trials, machine learning (ML)-based quantification of histological features and transcriptomics, to identify genes that are associated with disease progression and clinical events. A histopathology-driven 5-gene expression signature predicted disease progression and clinical events in patients with NASH with F3 (pre-cirrhotic) and F4 (cirrhotic) fibrosis. Notably, the Notch signaling pathway and genes implicated in liver-related diseases were enriched in this expression signature. In a validation cohort where pharmacologic intervention improved disease histology, multiple Notch signaling components were suppressed.


Assuntos
Aprendizado Profundo , Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/complicações , Transcriptoma/genética , Progressão da Doença , Cirrose Hepática/genética , Cirrose Hepática/tratamento farmacológico
5.
Mod Pathol ; 36(6): 100124, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-36841434

RESUMO

Ulcerative colitis is a chronic inflammatory bowel disease that is characterized by a relapsing and remitting course. Assessment of disease activity critically informs treatment decisions. In addition to endoscopic remission, histologic remission is emerging as a treatment target and a key factor in the evaluation of disease activity and therapeutic efficacy. However, manual pathologist evaluation is semiquantitative and limited in granularity. Machine learning approaches are increasingly being developed to aid pathologists in accurate and reproducible scoring of histology, enabling precise quantitation of clinically relevant features. Here, we report the development and validation of convolutional neural network models that quantify histologic features pertinent to ulcerative colitis disease activity, directly from hematoxylin and eosin-stained whole slide images. Tissue and cell model predictions were used to generate quantitative human-interpretable features to fully characterize the histology samples. Tissue and cell predictions showed comparable agreement to pathologist annotations, and the extracted slide-level human-interpretable features demonstrated strong correlations with disease severity and pathologist-assigned Nancy histological index scores. Moreover, using a random forest classifier based on 13 human-interpretable features derived from the tissue and cell models, we were able to accurately predict Nancy histological index scores, with a weighted kappa (κ = 0.91) and Spearman correlation (⍴ = 0.89, P < .001) when compared with pathologist consensus Nancy histological index scores. We were also able to predict histologic remission, based on the absence of neutrophil extravasation, with a high accuracy of 0.97. This work demonstrates the potential of computer vision to enable a standardized and robust assessment of ulcerative colitis histopathology for translational research and improved evaluation of disease activity and prognosis.


Assuntos
Colite Ulcerativa , Doenças Inflamatórias Intestinais , Humanos , Colite Ulcerativa/tratamento farmacológico , Inteligência Artificial , Índice de Gravidade de Doença , Doenças Inflamatórias Intestinais/patologia , Mucosa Intestinal/patologia , Colonoscopia
6.
Mod Pathol ; 35(11): 1529-1539, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-35840720

RESUMO

Assessment of programmed death ligand 1 (PD-L1) expression by immunohistochemistry (IHC) has emerged as an important predictive biomarker across multiple tumor types. However, manual quantitation of PD-L1 positivity can be difficult and leads to substantial inter-observer variability. Although the development of artificial intelligence (AI) algorithms may mitigate some of the challenges associated with manual assessment and improve the accuracy of PD-L1 expression scoring, use of AI-based approaches to oncology biomarker scoring and drug development has been sparse, primarily due to the lack of large-scale clinical validation studies across multiple cohorts and tumor types. We developed AI-powered algorithms to evaluate PD-L1 expression on tumor cells by IHC and compared it with manual IHC scoring in urothelial carcinoma, non-small cell lung cancer, melanoma, and squamous cell carcinoma of the head and neck (prospectively determined during the phase II and III CheckMate clinical trials). 1,746 slides were retrospectively analyzed, the largest investigation of digital pathology algorithms on clinical trial datasets performed to date. AI-powered quantification of PD-L1 expression on tumor cells identified more PD-L1-positive samples compared with manual scoring at cutoffs of ≥1% and ≥5% in most tumor types. Additionally, similar improvements in response and survival were observed in patients identified as PD-L1-positive compared with PD-L1-negative using both AI-powered and manual methods, while improved associations with survival were observed in patients with certain tumor types identified as PD-L1-positive using AI-powered scoring only. Our study demonstrates the potential for implementation of digital pathology-based methods in future clinical practice to identify more patients who would benefit from treatment with immuno-oncology therapy compared with current guidelines using manual assessment.


Assuntos
Carcinoma Pulmonar de Células não Pequenas , Carcinoma de Células de Transição , Neoplasias Pulmonares , Neoplasias da Bexiga Urinária , Humanos , Antígeno B7-H1/metabolismo , Carcinoma Pulmonar de Células não Pequenas/patologia , Nivolumabe/uso terapêutico , Ipilimumab , Inteligência Artificial , Neoplasias Pulmonares/patologia , Estudos Retrospectivos , Anticorpos Monoclonais/uso terapêutico , Biomarcadores Tumorais/metabolismo
7.
Hepatology ; 74(6): 3146-3160, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34333790

RESUMO

BACKGROUND AND AIMS: The hepatic venous pressure gradient (HVPG) is the standard for estimating portal pressure but requires expertise for interpretation. We hypothesized that HVPG could be extrapolated from liver histology using a machine learning (ML) algorithm. APPROACH AND RESULTS: Patients with NASH with compensated cirrhosis from a phase 2b trial were included. HVPG and biopsies from baseline and weeks 48 and 96 were reviewed centrally, and biopsies evaluated with a convolutional neural network (PathAI, Boston, MA). Using trichrome-stained biopsies in the training set (n = 130), an ML model was developed to recognize fibrosis patterns associated with HVPG, and the resultant ML HVPG score was validated in a held-out test set (n = 88). Associations between the ML HVPG score with measured HVPG and liver-related events, and performance of the ML HVPG score for clinically significant portal hypertension (CSPH) (HVPG ≥ 10 mm Hg), were determined. The ML-HVPG score was more strongly correlated with HVPG than hepatic collagen by morphometry (ρ = 0.47 vs. ρ = 0.28; P < 0.001). The ML HVPG score differentiated patients with normal (0-5 mm Hg) and elevated (5.5-9.5 mm Hg) HVPG and CSPH (median: 1.51 vs. 1.93 vs. 2.60; all P < 0.05). The areas under receiver operating characteristic curve (AUROCs) (95% CI) of the ML-HVPG score for CSPH were 0.85 (0.80, 0.90) and 0.76 (0.68, 0.85) in the training and test sets, respectively. Discrimination of the ML-HVPG score for CSPH improved with the addition of a ML parameter for nodularity, Enhanced Liver Fibrosis, platelets, aspartate aminotransferase (AST), and bilirubin (AUROC in test set: 0.85; 95% CI: 0.78, 0.92). Although baseline ML-HVPG score was not prognostic, changes were predictive of clinical events (HR: 2.13; 95% CI: 1.26, 3.59) and associated with hemodynamic response and fibrosis improvement. CONCLUSIONS: An ML model based on trichrome-stained liver biopsy slides can predict CSPH in patients with NASH with cirrhosis.


Assuntos
Hipertensão Portal/diagnóstico , Processamento de Imagem Assistida por Computador/métodos , Cirrose Hepática/complicações , Fígado/patologia , Hepatopatia Gordurosa não Alcoólica/complicações , Biópsia , Ensaios Clínicos Fase II como Assunto , Diagnóstico Diferencial , Feminino , Humanos , Hipertensão Portal/etiologia , Cirrose Hepática/patologia , Aprendizado de Máquina , Masculino , Pessoa de Meia-Idade , Hepatopatia Gordurosa não Alcoólica/patologia , Pressão na Veia Porta , Prognóstico , Curva ROC , Ensaios Clínicos Controlados Aleatórios como Assunto
8.
Nat Commun ; 12(1): 1613, 2021 03 12.
Artigo em Inglês | MEDLINE | ID: mdl-33712588

RESUMO

Computational methods have made substantial progress in improving the accuracy and throughput of pathology workflows for diagnostic, prognostic, and genomic prediction. Still, lack of interpretability remains a significant barrier to clinical integration. We present an approach for predicting clinically-relevant molecular phenotypes from whole-slide histopathology images using human-interpretable image features (HIFs). Our method leverages >1.6 million annotations from board-certified pathologists across >5700 samples to train deep learning models for cell and tissue classification that can exhaustively map whole-slide images at two and four micron-resolution. Cell- and tissue-type model outputs are combined into 607 HIFs that quantify specific and biologically-relevant characteristics across five cancer types. We demonstrate that these HIFs correlate with well-known markers of the tumor microenvironment and can predict diverse molecular signatures (AUROC 0.601-0.864), including expression of four immune checkpoint proteins and homologous recombination deficiency, with performance comparable to 'black-box' methods. Our HIF-based approach provides a comprehensive, quantitative, and interpretable window into the composition and spatial architecture of the tumor microenvironment.


Assuntos
Neoplasias/classificação , Neoplasias/diagnóstico por imagem , Neoplasias/patologia , Patologia Molecular/métodos , Fenótipo , Algoritmos , Aprendizado Profundo , Humanos , Processamento de Imagem Assistida por Computador , Medicina de Precisão , Microambiente Tumoral
9.
Hepatology ; 74(1): 133-147, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-33570776

RESUMO

BACKGROUND AND AIMS: Manual histological assessment is currently the accepted standard for diagnosing and monitoring disease progression in NASH, but is limited by variability in interpretation and insensitivity to change. Thus, there is a critical need for improved tools to assess liver pathology in order to risk stratify NASH patients and monitor treatment response. APPROACH AND RESULTS: Here, we describe a machine learning (ML)-based approach to liver histology assessment, which accurately characterizes disease severity and heterogeneity, and sensitively quantifies treatment response in NASH. We use samples from three randomized controlled trials to build and then validate deep convolutional neural networks to measure key histological features in NASH, including steatosis, inflammation, hepatocellular ballooning, and fibrosis. The ML-based predictions showed strong correlations with expert pathologists and were prognostic of progression to cirrhosis and liver-related clinical events. We developed a heterogeneity-sensitive metric of fibrosis response, the Deep Learning Treatment Assessment Liver Fibrosis score, which measured antifibrotic treatment effects that went undetected by manual pathological staging and was concordant with histological disease progression. CONCLUSIONS: Our ML method has shown reproducibility and sensitivity and was prognostic for disease progression, demonstrating the power of ML to advance our understanding of disease heterogeneity in NASH, risk stratify affected patients, and facilitate the development of therapies.


Assuntos
Aprendizado Profundo , Processamento de Imagem Assistida por Computador/métodos , Cirrose Hepática/diagnóstico , Fígado/patologia , Hepatopatia Gordurosa não Alcoólica/diagnóstico , Biópsia , Humanos , Cirrose Hepática/patologia , Hepatopatia Gordurosa não Alcoólica/patologia , Ensaios Clínicos Controlados Aleatórios como Assunto , Reprodutibilidade dos Testes , Índice de Gravidade de Doença
10.
Hepatology ; 73(2): 625-643, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33169409

RESUMO

BACKGROUND AND AIMS: Advanced fibrosis attributable to NASH is a leading cause of end-stage liver disease. APPROACH AND RESULTS: In this phase 2b trial, 392 patients with bridging fibrosis or compensated cirrhosis (F3-F4) were randomized to receive placebo, selonsertib 18 mg, cilofexor 30 mg, or firsocostat 20 mg, alone or in two-drug combinations, once-daily for 48 weeks. The primary endpoint was a ≥1-stage improvement in fibrosis without worsening of NASH between baseline and 48 weeks based on central pathologist review. Exploratory endpoints included changes in NAFLD Activity Score (NAS), liver histology assessed using a machine learning (ML) approach, liver biochemistry, and noninvasive markers. The majority had cirrhosis (56%) and NAS ≥5 (83%). The primary endpoint was achieved in 11% of placebo-treated patients versus cilofexor/firsocostat (21%; P = 0.17), cilofexor/selonsertib (19%; P = 0.26), firsocostat/selonsertib (15%; P = 0.62), firsocostat (12%; P = 0.94), and cilofexor (12%; P = 0.96). Changes in hepatic collagen by morphometry were not significant, but cilofexor/firsocostat led to a significant decrease in ML NASH CRN fibrosis score (P = 0.040) and a shift in biopsy area from F3-F4 to ≤F2 fibrosis patterns. Compared to placebo, significantly higher proportions of cilofexor/firsocostat patients had a ≥2-point NAS reduction; reductions in steatosis, lobular inflammation, and ballooning; and significant improvements in alanine aminotransferase (ALT), aspartate aminotransferase (AST), bilirubin, bile acids, cytokeratin-18, insulin, estimated glomerular filtration rate, ELF score, and liver stiffness by transient elastography (all P ≤ 0.05). Pruritus occurred in 20%-29% of cilofexor versus 15% of placebo-treated patients. CONCLUSIONS: In patients with bridging fibrosis and cirrhosis, 48 weeks of cilofexor/firsocostat was well tolerated, led to improvements in NASH activity, and may have an antifibrotic effect. This combination offers potential for fibrosis regression with longer-term therapy in patients with advanced fibrosis attributable to NASH.


Assuntos
Azetidinas/administração & dosagem , Doença Hepática Terminal/prevenção & controle , Isobutiratos/administração & dosagem , Ácidos Isonicotínicos/administração & dosagem , Cirrose Hepática/tratamento farmacológico , Hepatopatia Gordurosa não Alcoólica/tratamento farmacológico , Oxazóis/administração & dosagem , Pirimidinas/administração & dosagem , Idoso , Azetidinas/efeitos adversos , Benzamidas/administração & dosagem , Benzamidas/efeitos adversos , Biomarcadores/sangue , Biópsia , Esquema de Medicação , Quimioterapia Combinada/efeitos adversos , Quimioterapia Combinada/métodos , Doença Hepática Terminal/patologia , Feminino , Humanos , Imidazóis/administração & dosagem , Imidazóis/efeitos adversos , Isobutiratos/efeitos adversos , Ácidos Isonicotínicos/efeitos adversos , Fígado/efeitos dos fármacos , Fígado/patologia , Cirrose Hepática/complicações , Cirrose Hepática/diagnóstico , Cirrose Hepática/patologia , Testes de Função Hepática , Masculino , Pessoa de Meia-Idade , Hepatopatia Gordurosa não Alcoólica/diagnóstico , Hepatopatia Gordurosa não Alcoólica/etiologia , Hepatopatia Gordurosa não Alcoólica/patologia , Oxazóis/efeitos adversos , Piridinas/administração & dosagem , Piridinas/efeitos adversos , Pirimidinas/efeitos adversos , Índice de Gravidade de Doença , Resultado do Tratamento
11.
Microb Genom ; 4(11)2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30418868

RESUMO

Accurate orthologue identification is a vital component of bacterial comparative genomic studies, but many popular sequence-similarity-based approaches do not scale well to the large numbers of genomes that are now generated routinely. Furthermore, most approaches do not take gene synteny into account, which is useful information for disentangling paralogues. Here, we present SynerClust, a user-friendly synteny-aware tool based on synergy that can process thousands of genomes. SynerClust was designed to analyse genomes with high levels of local synteny, particularly prokaryotes, which have operon structure. SynerClust's run-time is optimized by selecting cluster representatives at each node in the phylogeny; thus, avoiding the need for exhaustive pairwise similarity searches. In benchmarking against Roary, Hieranoid2, PanX and Reciprocal Best Hit, SynerClust was able to more completely identify sets of core genes for datasets that included diverse strains, while using substantially less memory, and with scalability comparable to the fastest tools. Due to its scalability, ease of installation and use, and suitability for a variety of computing environments, orthogroup clustering using SynerClust will enable many large-scale prokaryotic comparative genomics efforts.


Assuntos
Genoma Bacteriano , Software , Sintenia , Algoritmos , Análise por Conglomerados , Enterobacteriaceae/genética , Escherichia coli/genética , Genômica/métodos , Mycobacterium tuberculosis/genética
12.
Cell Syst ; 4(3): 291-305.e7, 2017 03 22.
Artigo em Inglês | MEDLINE | ID: mdl-28189581

RESUMO

A systems-level understanding of Gram-positive bacteria is important from both an environmental and health perspective and is most easily obtained when high-quality, validated genomic resources are available. To this end, we constructed two ordered, barcoded, erythromycin-resistance- and kanamycin-resistance-marked single-gene deletion libraries of the Gram-positive model organism, Bacillus subtilis. The libraries comprise 3,968 and 3,970 genes, respectively, and overlap in all but four genes. Using these libraries, we update the set of essential genes known for this organism, provide a comprehensive compendium of B. subtilis auxotrophic genes, and identify genes required for utilizing specific carbon and nitrogen sources, as well as those required for growth at low temperature. We report the identification of enzymes catalyzing several missing steps in amino acid biosynthesis. Finally, we describe a suite of high-throughput phenotyping methodologies and apply them to provide a genome-wide analysis of competence and sporulation. Altogether, we provide versatile resources for studying gene function and pathway and network architecture in Gram-positive bacteria.


Assuntos
Bacillus subtilis/genética , Ensaios de Triagem em Larga Escala/métodos , Aminoácidos , Deleção de Genes , Biblioteca Gênica , Biblioteca Genômica , Genômica , Deleção de Sequência/genética , Esporos Bacterianos/genética
13.
Mol Cell Proteomics ; 14(2): 430-40, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25473088

RESUMO

The function of a large percentage of proteins is modulated by post-translational modifications (PTMs). Currently, mass spectrometry (MS) is the only proteome-wide technology that can identify PTMs. Unfortunately, the inability to detect a PTM by MS is not proof that the modification is not present. The detectability of peptides varies significantly making MS potentially blind to a large fraction of peptides. Learning from published algorithms that generally focus on predicting the most detectable peptides we developed a tool that incorporates protein abundance into the peptide prediction algorithm with the aim to determine the detectability of every peptide within a protein. We tested our tool, "Peptide Prediction with Abundance" (PPA), on in-house acquired as well as published data sets from other groups acquired on different instrument platforms. Incorporation of protein abundance into the prediction allows us to assess not only the detectability of all peptides but also whether a peptide of interest is likely to become detectable upon enrichment. We validated the ability of our tool to predict changes in protein detectability with a dilution series of 31 purified proteins at several different concentrations. PPA predicted the concentration dependent peptide detectability in 78% of the cases correctly, demonstrating its utility for predicting the protein enrichment needed to observe a peptide of interest in targeted experiments. This is especially important in the analysis of PTMs. PPA is available as a web-based or executable package that can work with generally applicable defaults or retrained from a pilot MS data set.


Assuntos
Algoritmos , Espectrometria de Massas/métodos , Peptídeos/metabolismo , Sequência de Aminoácidos , Bases de Dados de Proteínas , Humanos , Dados de Sequência Molecular , Biblioteca de Peptídeos , Peptídeos/química , Reprodutibilidade dos Testes , Saccharomyces cerevisiae/metabolismo
15.
Elife ; 2: e00603, 2013 06 18.
Artigo em Inglês | MEDLINE | ID: mdl-23795289

RESUMO

Divergence in gene regulation can play a major role in evolution. Here, we used a phylogenetic framework to measure mRNA profiles in 15 yeast species from the phylum Ascomycota and reconstruct the evolution of their modular regulatory programs along a time course of growth on glucose over 300 million years [corrected]. We found that modules have diverged proportionally to phylogenetic distance, with prominent changes in gene regulation accompanying changes in lifestyle and ploidy, especially in carbon metabolism. Paralogs have significantly contributed to regulatory divergence, typically within a very short window from their duplication. Paralogs from a whole genome duplication (WGD) event have a uniquely substantial contribution that extends over a longer span. Similar patterns occur when considering the evolution of the heat shock regulatory program measured in eight of the species, suggesting that these are general evolutionary principles. DOI:http://dx.doi.org/10.7554/eLife.00603.001.


Assuntos
Evolução Molecular , Regulação Fúngica da Expressão Gênica , Genes Fúngicos , Ascomicetos/classificação , Ascomicetos/genética , Duplicação Gênica , Perfilação da Expressão Gênica , Filogenia , Transcrição Gênica
16.
Genome Res ; 23(6): 1039-50, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23640720

RESUMO

Comparative functional genomics studies the evolution of biological processes by analyzing functional data, such as gene expression profiles, across species. A major challenge is to compare profiles collected in a complex phylogeny. Here, we present Arboretum, a novel scalable computational algorithm that integrates expression data from multiple species with species and gene phylogenies to infer modules of coexpressed genes in extant species and their evolutionary histories. We also develop new, generally applicable measures of conservation and divergence in gene regulatory modules to assess the impact of changes in gene content and expression on module evolution. We used Arboretum to study the evolution of the transcriptional response to heat shock in eight species of Ascomycota fungi and to reconstruct modules of the ancestral environmental stress response (ESR). We found substantial conservation in the stress response across species and in the reconstructed components of the ancestral ESR modules. The greatest divergence was in the most induced stress, primarily through module expansion. The divergence of the heat stress response exceeds that observed in the response to glucose depletion in the same species. Arboretum and its associated analyses provide a comprehensive framework to systematically study regulatory evolution of condition-specific responses.


Assuntos
Algoritmos , Biologia Computacional/métodos , Evolução Molecular , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Análise por Conglomerados , Duplicação Gênica , Resposta ao Choque Térmico/genética , Especificidade da Espécie , Estresse Fisiológico/genética , Leveduras/genética
17.
Mol Syst Biol ; 8: 619, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23089682

RESUMO

Evolutionary rewiring of regulatory networks is an important source of diversity among species. Previous evidence suggested substantial divergence of regulatory networks across species. However, systematically assessing the extent of this plasticity and its functional implications has been challenging due to limited experimental data and the noisy nature of computational predictions. Here, we introduce a novel approach to study cis-regulatory evolution, and use it to trace the regulatory history of 88 DNA motifs of transcription factors across 23 Ascomycota fungi. While motifs are conserved, we find a pervasive gain and loss in the regulation of their target genes. Despite this turnover, the biological processes associated with a motif are generally conserved. We explain these trends using a model with a strong selection to conserve the overall function of a transcription factor, and a much weaker selection over the specific genes it targets. The model also accounts for the turnover of bound targets measured experimentally across species in yeasts and mammals. Thus, selective pressures on regulatory networks mostly tolerate local rewiring, and may allow for subtle fine-tuning of gene regulation during evolution.


Assuntos
Ascomicetos/genética , Evolução Molecular , Redes Reguladoras de Genes/genética , Modelos Genéticos , Seleção Genética , Animais , Sequência de Bases , Sequência Conservada , DNA Fúngico/genética , Proteínas Fúngicas/metabolismo , Mamíferos/genética , Dados de Sequência Molecular , Motivos de Nucleotídeos/genética , Filogenia , Ligação Proteica , Sequências Reguladoras de Ácido Nucleico/genética , Especificidade da Espécie , Fatores de Transcrição/metabolismo
18.
Nature ; 487(7407): 370-4, 2012 Jul 19.
Artigo em Inglês | MEDLINE | ID: mdl-22722833

RESUMO

Novel protein-coding genes can arise either through re-organization of pre-existing genes or de novo. Processes involving re-organization of pre-existing genes, notably after gene duplication, have been extensively described. In contrast, de novo gene birth remains poorly understood, mainly because translation of sequences devoid of genes, or 'non-genic' sequences, is expected to produce insignificant polypeptides rather than proteins with specific biological functions. Here we formalize an evolutionary model according to which functional genes evolve de novo through transitory proto-genes generated by widespread translational activity in non-genic sequences. Testing this model at the genome scale in Saccharomyces cerevisiae, we detect translation of hundreds of short species-specific open reading frames (ORFs) located in non-genic sequences. These translation events seem to provide adaptive potential, as suggested by their differential regulation upon stress and by signatures of retention by natural selection. In line with our model, we establish that S. cerevisiae ORFs can be placed within an evolutionary continuum ranging from non-genic sequences to genes. We identify ~1,900 candidate proto-genes among S. cerevisiae ORFs and find that de novo gene birth from such a reservoir may be more prevalent than sporadic gene duplication. Our work illustrates that evolution exploits seemingly dispensable sequences to generate adaptive functional innovation.


Assuntos
Evolução Molecular , Genes Fúngicos/genética , Saccharomyces/genética , Sequência de Bases , Sequência Conservada , Variação Genética , Dados de Sequência Molecular , Fases de Leitura Aberta , Filogenia , Biossíntese de Proteínas , Saccharomyces/classificação , Saccharomyces cerevisiae/classificação , Saccharomyces cerevisiae/genética , Alinhamento de Sequência
19.
BMC Genomics ; 13: 120, 2012 Mar 28.
Artigo em Inglês | MEDLINE | ID: mdl-22452820

RESUMO

BACKGROUND: The sequence of the pathogen Mycobacterium tuberculosis (Mtb) strain H37Rv has been available for over a decade, but the biology of the pathogen remains poorly understood. Genome sequences from other Mtb strains and closely related bacteria present an opportunity to apply the power of comparative genomics to understand the evolution of Mtb pathogenesis. We conducted a comparative analysis using 31 genomes from the Tuberculosis Database (TBDB.org), including 8 strains of Mtb and M. bovis, 11 additional Mycobacteria, 4 Corynebacteria, 2 Streptomyces, Rhodococcus jostii RHA1, Nocardia farcinia, Acidothermus cellulolyticus, Rhodobacter sphaeroides, Propionibacterium acnes, and Bifidobacterium longum. RESULTS: Our results highlight the functional importance of lipid metabolism and its regulation, and reveal variation between the evolutionary profiles of genes implicated in saturated and unsaturated fatty acid metabolism. It also suggests that DNA repair and molybdopterin cofactors are important in pathogenic Mycobacteria. By analyzing sequence conservation and gene expression data, we identify nearly 400 conserved noncoding regions. These include 37 predicted promoter regulatory motifs, of which 14 correspond to previously validated motifs, as well as 50 potential noncoding RNAs, of which we experimentally confirm the expression of four. CONCLUSIONS: Our analysis of protein evolution highlights gene families that are associated with the adaptation of environmental Mycobacteria to obligate pathogenesis. These families include fatty acid metabolism, DNA repair, and molybdopterin biosynthesis. Our analysis reinforces recent findings suggesting that small noncoding RNAs are more common in Mycobacteria than previously expected. Our data provide a foundation for understanding the genome and biology of Mtb in a comparative context, and are available online and through TBDB.org.


Assuntos
Actinobacteria/genética , Evolução Molecular , Mycobacterium tuberculosis/genética , Mycobacterium/genética , Actinobacteria/classificação , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Coenzimas/genética , Coenzimas/metabolismo , Reparo do DNA , Bases de Dados Genéticas , Ácidos Graxos/genética , Ácidos Graxos/metabolismo , Genoma Bacteriano , Genômica , Metabolismo dos Lipídeos/genética , Metaloproteínas/genética , Metaloproteínas/metabolismo , Cofatores de Molibdênio , Mycobacterium/classificação , Mycobacterium tuberculosis/classificação , Filogenia , Pteridinas/metabolismo , RNA não Traduzido/química , RNA não Traduzido/metabolismo
20.
Science ; 332(6032): 930-6, 2011 May 20.
Artigo em Inglês | MEDLINE | ID: mdl-21511999

RESUMO

The fission yeast clade--comprising Schizosaccharomyces pombe, S. octosporus, S. cryophilus, and S. japonicus--occupies the basal branch of Ascomycete fungi and is an important model of eukaryote biology. A comparative annotation of these genomes identified a near extinction of transposons and the associated innovation of transposon-free centromeres. Expression analysis established that meiotic genes are subject to antisense transcription during vegetative growth, which suggests a mechanism for their tight regulation. In addition, trans-acting regulators control new genes within the context of expanded functional modules for meiosis and stress response. Differences in gene content and regulation also explain why, unlike the budding yeast of Saccharomycotina, fission yeasts cannot use ethanol as a primary carbon source. These analyses elucidate the genome structure and gene regulation of fission yeast and provide tools for investigation across the Schizosaccharomyces clade.


Assuntos
Genoma Fúngico , Schizosaccharomyces/genética , Centrômero/genética , Centrômero/fisiologia , Centrômero/ultraestrutura , Elementos de DNA Transponíveis , Evolução Molecular , Perfilação da Expressão Gênica , Regulação Fúngica da Expressão Gênica , Genes Fúngicos Tipo Acasalamento , Genômica , Glucose/metabolismo , Meiose , Anotação de Sequência Molecular , Dados de Sequência Molecular , Filogenia , RNA Antissenso/genética , RNA Fúngico/genética , RNA Interferente Pequeno/genética , RNA não Traduzido/genética , Elementos Reguladores de Transcrição , Schizosaccharomyces/crescimento & desenvolvimento , Schizosaccharomyces/metabolismo , Proteínas de Schizosaccharomyces pombe/genética , Proteínas de Schizosaccharomyces pombe/metabolismo , Análise de Sequência de DNA , Especificidade da Espécie , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transcrição Gênica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...