Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Bioinformatics ; 31(24): 3946-52, 2015 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-26315912

RESUMO

MOTIVATION: Detecting positive selection in genomic regions is a recurrent topic in natural population genetic studies. However, there is little consistency among the regions detected in several genome-wide scans using different tests and/or populations. Furthermore, few methods address the challenge of classifying selective events according to specific features such as age, intensity or state (completeness). RESULTS: We have developed a machine-learning classification framework that exploits the combined ability of some selection tests to uncover different polymorphism features expected under the hard sweep model, while controlling for population-specific demography. As a result, we achieve high sensitivity toward hard selective sweeps while adding insights about their completeness (whether a selected variant is fixed or not) and age of onset. Our method also determines the relevance of the individual methods implemented so far to detect positive selection under specific selective scenarios. We calibrated and applied the method to three reference human populations from The 1000 Genome Project to generate a genome-wide classification map of hard selective sweeps. This study improves detection of selective sweep by overcoming the classical selection versus no-selection classification strategy, and offers an explanation to the lack of consistency observed among selection tests when applied to real data. Very few signals were observed in the African population studied, while our method presents higher sensitivity in this population demography. AVAILABILITY AND IMPLEMENTATION: The genome-wide results for three human populations from The 1000 Genomes Project and an R-package implementing the 'Hierarchical Boosting' framework are available at http://hsb.upf.edu/.


Assuntos
Genética Populacional/métodos , Genômica/métodos , Aprendizado de Máquina , Demografia , Humanos , Polimorfismo Genético , Seleção Genética
2.
Bioinformatics ; 31(3): 438-9, 2015 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-25282646

RESUMO

SUMMARY: A wealth of large-scale genome sequencing projects opens the doors to new approaches to study the relationship between genotype and phenotype. One such opportunity is the possibility to apply genotype networks analysis to population genetics data. Genotype networks are a representation of the set of genotypes associated with a single phenotype, and they allow one to estimate properties such as the robustness of the phenotype to mutations, and the ability of its associated genotypes to evolve new adaptations. So far, though, genotype networks analysis has rarely been applied to population genetics data. To help fill this gap, here we present VCF2Networks, a tool to determine and study genotype network structure from single-nucleotide variant data. AVAILABILITY AND IMPLEMENTATION: VCF2Networks is available at https://bitbucket.org/dalloliogm/vcf2networks. CONTACT: giovanni.dallolio@kcl.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Redes Reguladoras de Genes , Genética Populacional , Genoma Humano , Polimorfismo de Nucleotídeo Único/genética , Software , Evolução Biológica , Genótipo , Humanos , Fenótipo
3.
PLoS One ; 9(6): e99424, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24911413

RESUMO

Genotype networks are a concept used in systems biology to study sets of genotypes having the same phenotype, and the ability of these to bring forth novel phenotypes. In the past they have been applied to determine the genetic heterogeneity, and stability to mutations, of systems such as metabolic networks and RNA folds. Recently, they have been the base for reconciling the neutralist and selectionist views on evolution. Here, we adapted this concept to the study of population genetics data. Specifically, we applied genotype networks to the human 1000 genomes dataset, and analyzed networks composed of short haplotypes of Single Nucleotide Variants (SNV). The result is a scan of how properties related to genetic heterogeneity and stability to mutations are distributed along the human genome. We found that genes involved in acquired immunity, such as some HLA and MHC genes, tend to have the most heterogeneous and connected networks, and that coding regions tend to be more heterogeneous and stable to mutations than non-coding regions. We also found, using coalescent simulations, that regions under selection have more extended and connected networks. The application of the concept of genotype networks can provide a new opportunity to understand the evolutionary processes that shaped our genome. Learning how the genotype space of each region of our genome has been explored during the evolutionary history of the human species can lead to a better understanding on how selective pressures and neutral factors have shaped genetic diversity within populations and among individuals. Combined with the availability of larger datasets of sequencing data, genotype networks represent a new approach to the study of human genetic diversity that looks to the whole genome, and goes beyond the classical division between selection and neutrality methods.


Assuntos
Estudos de Associação Genética , Variação Genética , Genoma Humano , Genótipo , Evolução Biológica , Simulação por Computador , Heterogeneidade Genética , Loci Gênicos , Genética Populacional , Estudo de Associação Genômica Ampla , Haplótipos , Humanos , Modelos Genéticos , Fases de Leitura Aberta , Biologia de Sistemas
4.
BMC Evol Biol ; 12: 98, 2012 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-22731960

RESUMO

BACKGROUND: Asparagine N-Glycosylation is one of the most important forms of protein post-translational modification in eukaryotes. This metabolic pathway can be subdivided into two parts: an upstream sub-pathway required for achieving proper folding for most of the proteins synthesized in the secretory pathway, and a downstream sub-pathway required to give variability to trans-membrane proteins, and involved in adaptation to the environment and innate immunity. Here we analyze the nucleotide variability of the genes of this pathway in human populations, identifying which genes show greater population differentiation and which genes show signatures of recent positive selection. We also compare how these signals are distributed between the upstream and the downstream parts of the pathway, with the aim of exploring how forces of population differentiation and positive selection vary among genes involved in the same metabolic pathway but subject to different functional constraints. RESULTS: Our results show that genes in the downstream part of the pathway are more likely to show a signature of population differentiation, while events of positive selection are equally distributed among the two parts of the pathway. Moreover, events of positive selection are frequent on genes that are known to be at bifurcation points, and that are identified as being in key position by a network-level analysis such as MGAT3 and GCS1. CONCLUSIONS: These findings indicate that the upstream part of the Asparagine N-Glycosylation pathway has lower diversity among populations, while the downstream part is freer to tolerate diversity among populations. Moreover, the distribution of signatures of population differentiation and positive selection can change between parts of a pathway, especially between parts that are exposed to different functional constraints. Our results support the hypothesis that genes involved in constitutive processes can be expected to show lower population differentiation, while genes involved in traits related to the environment should show higher variability. Taken together, this work broadens our knowledge on how events of population differentiation and of positive selection are distributed among different parts of a metabolic pathway.


Assuntos
Asparagina/genética , Genética Populacional , Redes e Vias Metabólicas , Processamento de Proteína Pós-Traducional , Seleção Genética , Evolução Molecular , Variação Genética , Genoma Humano , Glicosilação , Humanos , Modelos Genéticos
5.
Mol Biol Evol ; 29(5): 1379-92, 2012 May.
Artigo em Inglês | MEDLINE | ID: mdl-22135191

RESUMO

Genes and proteins rarely act in isolation, but they rather operate as components of complex networks of interacting molecules. Therefore, for understanding their evolution, it may be helpful to take into account the interaction networks in which they participate. It has been shown that selective constraints acting on genes depend on the position that they occupy in the network. Less understood is how the impact of local adaptation at the intraspecific level is affected by the network structure. Here, we analyzed the patterns of molecular evolution of 67 genes involved in the insulin/target of rapamycin (TOR) signal transduction pathway. This well-characterized pathway plays a key role in fundamental processes such as energetic metabolism, growth, reproduction, and aging and is involved in metabolic disorders such as obesity, insulin resistance, and diabetes. For that purpose, we combined genotype data from worldwide human populations with current knowledge of the structure and function of the pathway. We identified the footprint of recent positive selection in nine of the studied genomic regions. Most of the adaptation signals were observed among Middle East and North African, European, and Central South Asian populations. We found that positive selection preferentially targets the most central elements in the pathway, in contrast to previous observations in the whole human interactome. This observation indicates that the impact of positive selection on genes involved in the insulin/TOR pathway is affected by the pathway structure.


Assuntos
Insulina/genética , Insulina/metabolismo , Grupos Raciais/genética , Serina-Treonina Quinases TOR/genética , Serina-Treonina Quinases TOR/metabolismo , Genética Populacional , Humanos , Polimorfismo de Nucleotídeo Único , Seleção Genética , Transdução de Sinais
7.
PLoS One ; 6(3): e17913, 2011 Mar 28.
Artigo em Inglês | MEDLINE | ID: mdl-21464928

RESUMO

Recombination varies greatly among species, as illustrated by the poor conservation of the recombination landscape between humans and chimpanzees. Thus, shorter evolutionary time frames are needed to understand the evolution of recombination. Here, we analyze its recent evolution in humans. We calculated the recombination rates between adjacent pairs of 636,933 common single-nucleotide polymorphism loci in 28 worldwide human populations and analyzed them in relation to genetic distances between populations. We found a strong and highly significant correlation between similarity in the recombination rates corrected for effective population size and genetic differentiation between populations. This correlation is observed at the genome-wide level, but also for each chromosome and when genetic distances and recombination similarities are calculated independently from different parts of the genome. Moreover, and more relevant, this relationship is robustly maintained when considering presence/absence of recombination hotspots. Simulations show that this correlation cannot be explained by biases in the inference of recombination rates caused by haplotype sharing among similar populations. This result indicates a rapid pace of evolution of recombination, within the time span of differentiation of modern humans.


Assuntos
Genética Populacional , Recombinação Genética , Cromossomos Humanos/genética , Simulação por Computador , Frequência do Gene/genética , Humanos , Polimorfismo de Nucleotídeo Único/genética , Densidade Demográfica
8.
Glycobiology ; 21(11): 1395-400, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21199820

RESUMO

Asparagine N-linked glycosylation is one of the most important forms of protein post-translational modification in eukaryotes and is one of the first metabolic pathways described at a biochemical level. Here, we report a new annotation of this pathway for the Human species, published after passing a peer-review process in Reactome. The new annotation presented here offers a high level of detail and provides references and descriptions for each reaction, along with integration with GeneOntology and other databases. The open-source approach of Reactome toward annotation encourages feedback from its users, making it easier to keep the annotation of this pathway updated with future knowledge. Reactome's web interface allows easy navigation between steps involved in the pathway to compare it with other pathways and resources in other scientific databases and to export it to BioPax and SBML formats, making it accessible for computational studies. This new entry in Reactome expands and complements the annotations already published in databases for biological pathways and provides a common reference to researchers interested in studying this important pathway in the human species. Finally, we discuss the status of the annotation of this pathway and point out which steps are worth further investigation or need better experimental validation.


Assuntos
Asparagina/metabolismo , Bases de Dados de Proteínas , Anotação de Sequência Molecular , Processamento de Proteína Pós-Traducional , Glicosilação , Humanos , Redes e Vias Metabólicas , Polissacarídeos/biossíntese , Transporte Proteico
9.
Mol Biol Evol ; 28(1): 813-23, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20924085

RESUMO

N-glycosylation is one of the most important forms of protein modification, serving key biological functions in multicellular organisms. N-glycans at the cell surface mediate the interaction between cells and the surrounding matrix and may act as pathogen receptors, making the genes responsible for their synthesis good candidates to show signatures of adaptation to different pathogen environments. Here, we study the forces that shaped the evolution of the genes involved in the synthesis of the N-glycans during the divergence of primates within the framework of their functional network. We have found that, despite their function of producing glycan repertoires capable of evading rapidly evolving pathogens, genes involved in the synthesis of the glycans are highly conserved, and no signals of positive selection have been detected within the time of divergence of primates. This suggests strong functional constraints as the main force driving their evolution. We studied the strength of the purifying selection acting on the genes in relation to the network structure considering the position of each gene along the pathway, its connectivity, and the rates of evolution in neighboring genes. We found a strong and highly significant negative correlation between the strength of purifying selection and the connectivity of each gene, indicating that genes encoding for highly connected enzymes evolve slower and thus are subject to stronger selective constraints. This result confirms that network topology does shape the evolution of the genes and that the connectivity within metabolic pathways and networks plays a major role in constraining evolutionary rates.


Assuntos
Evolução Molecular , Redes e Vias Metabólicas/genética , Primatas/genética , Primatas/metabolismo , Seleção Genética/genética , Adaptação Fisiológica/genética , Animais , Sequência de Bases , Glicosilação , Dados de Sequência Molecular , Análise Multivariada , Polissacarídeos/química , Polissacarídeos/metabolismo , Alinhamento de Sequência
10.
Database (Oxford) ; 2010: baq035, 2010.
Artigo em Inglês | MEDLINE | ID: mdl-21186182

RESUMO

Since the publication of their longtime predecessor The Atlas of Protein Sequences and Structures in 1965 by Margaret Dayhoff, scientific databases have become a key factor in the organization of modern science. All the information and knowledge described in the novel scientific literature is translated into entries in many different scientific databases, making it possible to obtain very accurate information on a biological entity like genes or proteins without having to manually review the literature on it. However, even for the databases with the finest annotation procedures, errors or unclear parts sometimes appear in the publicly released version and influence the research of unaware scientists using them. The researcher that finds an error in a database is often left in a uncertain state, and often abandons the effort of reporting it because of a lack of a standard procedure to do so. In the present work, we propose that the simple adoption of a public error tracker application, as in many open software projects, could improve the quality of the annotations in many databases and encourage feedback from the scientific community on the data annotated publicly. In order to illustrate the situation, we describe a series of errors that we found and helped solve on the genes of a very well-known pathway in various biomedically relevant databases. We would like to show that, even if a majority of the most important scientific databases have procedures for reporting errors, these are usually not publicly visible, making the process of reporting errors time consuming and not useful. Also, the effort made by the user that reports the error often goes unacknowledged, putting him in a discouraging position.


Assuntos
Indexação e Redação de Resumos , Biologia Computacional , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Animais , Humanos , Internet
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...