Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Evol Appl ; 13(4): 781-793, 2020 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-32211067

RESUMO

The ultimate causes of correlated evolution among sites in a genome remain difficult to tease apart. To address this problem directly, we performed a high-throughput search for correlated evolution among sites associated with resistance to a fluoroquinolone antibiotic using whole-genome data from clinical strains of Pseudomonas aeruginosa, before validating our computational predictions experimentally. We show that for at least two sites, this correlation is underlain by epistasis. Our analysis also revealed eight additional pairs of synonymous substitutions displaying correlated evolution underlain by physical linkage, rather than selection associated with antibiotic resistance. Our results provide direct evidence that both epistasis and physical linkage among sites can drive the correlated evolution identified by high-throughput computational tools. In other words, the observation of correlated evolution is not by itself sufficient evidence to guarantee that the sites in question are epistatic; such a claim requires additional evidence, ideally coming from direct estimates of epistasis, based on experimental evidence.

2.
BMC Genomics ; 20(1): 470, 2019 Jun 10.
Artigo em Inglês | MEDLINE | ID: mdl-31182025

RESUMO

BACKGROUND: A critical goal in biology is to relate the phenotype to the genotype, that is, to find the genetic determinants of various traits. However, while simple monofactorial determinants are relatively easy to identify, the underpinnings of complex phenotypes are harder to predict. While traditional approaches rely on genome-wide association studies based on Single Nucleotide Polymorphism data, the ability of machine learning algorithms to find these determinants in whole proteome data is still not well known. RESULTS: To better understand the applicability of machine learning in this case, we implemented two such algorithms, adaptive boosting (AB) and repeated random forest (RRF), and developed a chunking layer that facilitates the analysis of whole proteome data. We first assessed the performance of these algorithms and tuned them on an influenza data set, for which the determinants of three complex phenotypes (infectivity, transmissibility, and pathogenicity) are known based on experimental evidence. This allowed us to show that chunking improves runtimes by an order of magnitude. Based on simulations, we showed that chunking also increases sensitivity of the predictions, reaching 100% with as few as 20 sequences in a small proteome as in the influenza case (5k sites), but may require at least 30 sequences to reach 90% on larger alignments (500k sites). While RRF has less specificity than random forest, it was never <50%, and RRF sensitivity was significantly higher at smaller chunk sizes. We then used these algorithms to predict the determinants of three types of drug resistance (to Ciprofloxacin, Ceftazidime, and Gentamicin) in a bacterium, Pseudomonas aeruginosa. While both algorithms performed well in the case of the influenza data, results were more nuanced in the bacterial case, with RRF making more sensible predictions, with smaller errors rates, than AB. CONCLUSIONS: Altogether, we demonstrated that ML algorithms can be used to identify genetic determinants in small proteomes (viruses), even when trained on small numbers of individuals. We further showed that our RRF algorithm may deserve more scrutiny, which should be facilitated by the decreasing costs of both sequencing and phenotyping of large cohorts of individuals.


Assuntos
Aprendizado de Máquina , Fenótipo , Sequenciamento Completo do Genoma , Algoritmos , Farmacorresistência Bacteriana/genética , Humanos , Vírus da Influenza A/genética , Vírus da Influenza A/patogenicidade , Influenza Humana/transmissão , Influenza Humana/virologia , Proteômica , Pseudomonas aeruginosa/efeitos dos fármacos , Pseudomonas aeruginosa/genética
3.
Genetics ; 205(1): 409-420, 2017 01.
Artigo em Inglês | MEDLINE | ID: mdl-28049709

RESUMO

In systems biology and genomics, epistasis characterizes the impact that a substitution at a particular location in a genome can have on a substitution at another location. This phenomenon is often implicated in the evolution of drug resistance or to explain why particular "disease-causing" mutations do not have the same outcome in all individuals. Hence, uncovering these mutations and their locations in a genome is a central question in biology. However, epistasis is notoriously difficult to uncover, especially in fast-evolving organisms. Here, we present a novel statistical approach that replies on a model developed in ecology and that we adapt to analyze genetic data in fast-evolving systems such as the influenza A virus. We validate the approach using a two-pronged strategy: extensive simulations demonstrate a low-to-moderate sensitivity with excellent specificity and precision, while analyses of experimentally validated data recover known interactions, including in a eukaryotic system. We further evaluate the ability of our approach to detect correlated evolution during antigenic shifts or at the emergence of drug resistance. We show that in all cases, correlated evolution is prevalent in influenza A viruses, involving many pairs of sites linked together in chains; a hallmark of historical contingency. Strikingly, interacting sites are separated by large physical distances, which entails either long-range conformational changes or functional tradeoffs, for which we find support with the emergence of drug resistance. Our work paves a new way for the unbiased detection of epistasis in a wide range of organisms by performing whole-genome scans.


Assuntos
Alphainfluenzavirus/genética , Modelos Estatísticos , Teorema de Bayes , Epistasia Genética , Evolução Molecular , Glicoproteínas de Hemaglutininação de Vírus da Influenza/genética , Humanos , Influenza Humana/virologia , Alphainfluenzavirus/patogenicidade , Modelos Genéticos , Mutação , Filogenia
4.
BMC Microbiol ; 12: 40, 2012 Mar 22.
Artigo em Inglês | MEDLINE | ID: mdl-22439760

RESUMO

BACKGROUND: Bacteria excrete costly toxins to defend their ecological niche. The evolution of such antagonistic interactions between individuals is expected to depend on both the social environment and the strength of resource competition. Antagonism is expected to be weak among highly similar genotypes because most individuals are immune to antagonistic agents and among dissimilar genotypes because these are unlikely to be competing for the same resources and antagonism should not yield much benefit. The strength of antagonism is therefore expected to peak at intermediate genetic distance. RESULTS: We studied the ability of laboratory strains of Pseudomonas aeruginosa to prevent growth of 55 different clinical P. aeruginosa isolates derived from cystic fibrosis patients. Genetic distance was determined using genetic fingerprints. We found that the strength of antagonism was maximal among genotypes of intermediate genetic distance and we show that genetic distance and resource use are linked. CONCLUSIONS: Our results suggest that the importance of social interactions like antagonism may be modulated by the strength of resource competition.


Assuntos
Antibiose , Variação Genética , Pseudomonas aeruginosa/genética , Pseudomonas aeruginosa/fisiologia , Adolescente , Adulto , Fibrose Cística/complicações , Impressões Digitais de DNA , DNA Bacteriano/genética , Humanos , Tipagem Molecular , Infecções por Pseudomonas/microbiologia , Pseudomonas aeruginosa/classificação , Pseudomonas aeruginosa/isolamento & purificação , Adulto Jovem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...