Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Sci Data ; 4: 170115, 2017 09 21.
Artigo em Inglês | MEDLINE | ID: mdl-28933420

RESUMO

Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.


Assuntos
Genoma Humano , Humanos , Mutação INDEL , Islândia , Polimorfismo de Nucleotídeo Único
2.
Nat Genet ; 49(11): 1654-1660, 2017 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-28945251

RESUMO

A fundamental requirement for genetic studies is an accurate determination of sequence variation. While human genome sequence diversity is increasingly well characterized, there is a need for efficient ways to use this knowledge in sequence analysis. Here we present Graphtyper, a publicly available novel algorithm and software for discovering and genotyping sequence variants. Graphtyper realigns short-read sequence data to a pangenome, a variation-aware graph structure that encodes sequence variation within a population by representing possible haplotypes as graph paths. Our results show that Graphtyper is fast, highly scalable, and provides sensitive and accurate genotype calls. Graphtyper genotyped 89.4 million sequence variants in the whole genomes of 28,075 Icelanders using less than 100,000 CPU days, including detailed genotyping of six human leukocyte antigen (HLA) genes. We show that Graphtyper is a valuable tool in characterizing sequence variation in both small and population-scale sequencing studies.


Assuntos
Algoritmos , Genoma Humano , Técnicas de Genotipagem/instrumentação , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/estatística & dados numéricos , Alelos , Sequência de Bases , Gráficos por Computador , Antígenos HLA/genética , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Alinhamento de Sequência , Análise de Sequência de DNA/métodos , Software
3.
Nature ; 549(7673): 519-522, 2017 09 28.
Artigo em Inglês | MEDLINE | ID: mdl-28959963

RESUMO

The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.


Assuntos
Envelhecimento/genética , Mutação em Linhagem Germinativa/genética , Idade Materna , Mutagênese , Pais , Idade Paterna , Adolescente , Adulto , Idoso , Animais , Criança , Cromossomos Humanos Par 8/genética , Evolução Molecular , Feminino , Sequência Rica em GC , Genoma Humano/genética , Gorilla gorilla/genética , Humanos , Mutação INDEL , Islândia , Desequilíbrio de Ligação/genética , Masculino , Pessoa de Meia-Idade , Taxa de Mutação , Pan troglodytes/genética , Polimorfismo de Nucleotídeo Único , Pongo/genética , Adulto Jovem
4.
Nat Genet ; 48(6): 634-9, 2016 06.
Artigo em Inglês | MEDLINE | ID: mdl-27135400

RESUMO

Sequence variants affecting blood lipids and coronary artery disease (CAD) may enhance understanding of the atherogenicity of lipid fractions. Using a large resource of whole-genome sequence data, we examined rare and low-frequency variants for association with non-HDL cholesterol, HDL cholesterol, LDL cholesterol, and triglycerides in up to 119,146 Icelanders. We discovered 13 variants with large effects (within ANGPTL3, APOB, ABCA1, NR1H3, APOA1, LIPC, CETP, LDLR, and APOC1) and replicated 14 variants. Five variants within PCSK9, APOA1, ANGPTL4, and LDLR associate with CAD (33,090 cases and 236,254 controls). We used genetic risk scores for the lipid fractions to examine their causal relationship with CAD. The non-HDL cholesterol genetic risk score associates most strongly with CAD (P = 2.7 × 10(-28)), and no other genetic risk score associates with CAD after accounting for non-HDL cholesterol. The genetic risk score for non-HDL cholesterol confers CAD risk beyond that of LDL cholesterol (P = 5.5 × 10(-8)), suggesting that targeting atherogenic remnant cholesterol may reduce cardiovascular risk.


Assuntos
Colesterol/sangue , Doença da Artéria Coronariana/genética , Lipídeos/sangue , Triglicerídeos/sangue , Doença da Artéria Coronariana/sangue , Doença da Artéria Coronariana/diagnóstico , Humanos
5.
Sci Data ; 2: 150011, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25977816

RESUMO

We have accumulated considerable data on the genetic makeup of the Icelandic population by sequencing the whole genomes of 2,636 Icelanders to depth of at least 10X and by chip genotyping 101,584 more. The sequencing was done with Illumina technology. The median sequencing depth was 20X and 909 individuals were sequenced to a depth of at least 30X. We found 20 million single nucleotide polymorphisms (SNPs) and 1.5 million insertions/deletions (indels) that passed stringent quality control. Almost all the common SNPs (derived allele frequency (DAF) over 2%) that we identified in Iceland have been observed by either dbSNP (build 137) or the Exome Sequencing Project (ESP) while only 60 and 20% of rare (DAF<0.5%) SNPs and indels in coding regions, the most heavily studied parts of the genome, have been observed in the public databases. Features of our variant data, such as the transition/transversion ratio and the length distribution of indels, are similar to published reports.


Assuntos
Genoma Humano , Análise de Sequência de DNA , Sequência de Bases , Frequência do Gene , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL , Islândia , Polimorfismo de Nucleotídeo Único
6.
Nat Genet ; 47(5): 448-52, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25807282

RESUMO

Loss-of-function mutations cause many mendelian diseases. Here we aimed to create a catalog of autosomal genes that are completely knocked out in humans by rare loss-of-function mutations. We sequenced the whole genomes of 2,636 Icelanders and imputed the sequence variants identified in this set into 101,584 additional chip-genotyped and phased Icelanders. We found a total of 6,795 autosomal loss-of-function SNPs and indels in 4,924 genes. Of the genotyped Icelanders, 7.7% are homozygotes or compound heterozygotes for loss-of-function mutations with a minor allele frequency (MAF) below 2% in 1,171 genes (complete knockouts). Genes that are highly expressed in the brain are less often completely knocked out than other genes. Homozygous loss-of-function offspring of two heterozygous parents occurred less frequently than expected (deficit of 136 per 10,000 transmissions for variants with MAF <2%, 95% confidence interval (CI) = 10-261).


Assuntos
Mutação INDEL , Encéfalo/metabolismo , Feminino , Expressão Gênica , Frequência do Gene , Ontologia Genética , Genoma Humano , Heterozigoto , Homozigoto , Humanos , Masculino , Fases de Leitura Aberta , Especificidade de Órgãos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA
7.
Nat Genet ; 47(5): 435-44, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25807286

RESUMO

Here we describe the insights gained from sequencing the whole genomes of 2,636 Icelanders to a median depth of 20×. We found 20 million SNPs and 1.5 million insertions-deletions (indels). We describe the density and frequency spectra of sequence variants in relation to their functional annotation, gene position, pathway and conservation score. We demonstrate an excess of homozygosity and rare protein-coding variants in Iceland. We imputed these variants into 104,220 individuals down to a minor allele frequency of 0.1% and found a recessive frameshift mutation in MYL4 that causes early-onset atrial fibrillation, several mutations in ABCB4 that increase risk of liver diseases and an intronic variant in GNAS associating with increased thyroid-stimulating hormone levels when maternally inherited. These data provide a study design that can be used to determine how variation in the sequence of the human genome gives rise to human diversity.


Assuntos
Subfamília B de Transportador de Cassetes de Ligação de ATP/genética , Subunidades alfa Gs de Proteínas de Ligação ao GTP/genética , Cadeias Leves de Miosina/genética , Idoso , Idoso de 80 Anos ou mais , Fibrilação Atrial/genética , Paralisia Bulbar Progressiva/genética , Cromograninas , Feminino , Mutação da Fase de Leitura , Frequência do Gene , Predisposição Genética para Doença , Genoma Humano , Estudo de Associação Genômica Ampla , Perda Auditiva Neurossensorial/genética , Humanos , Mutação INDEL , Islândia , Hepatopatias/genética , Masculino , Pessoa de Meia-Idade , Anotação de Sequência Molecular , Filogeografia , Polimorfismo de Nucleotídeo Único , Receptores Acoplados a Proteínas G/genética , Risco , Análise de Sequência de DNA , Tireotropina/sangue
8.
Nat Genet ; 43(11): 1127-30, 2011 Oct 09.
Artigo em Inglês | MEDLINE | ID: mdl-21983786

RESUMO

We tested 16 million SNPs, identified through whole-genome sequencing of 457 Icelanders, for association with gout and serum uric acid levels. Genotypes were imputed into 41,675 chip-genotyped Icelanders and their relatives, for effective sample sizes of 968 individuals with gout and 15,506 individuals for whom serum uric acid measurements were available. We identified a low-frequency missense variant (c.1580C>G) in ALDH16A1 associated with gout (OR = 3.12, P = 1.5 × 10(-16), at-risk allele frequency = 0.019) and serum uric acid levels (effect = 0.36 s.d., P = 4.5 × 10(-21)). We confirmed the association with gout by performing Sanger sequencing on 6,017 Icelanders. The association with gout was stronger in males relative to females. We also found a second variant on chromosome 1 associated with gout (OR = 1.92, P = 0.046, at-risk allele frequency = 0.986) and serum uric acid levels (effect = 0.48 s.d., P = 4.5 × 10(-16)). This variant is close to a common variant previously associated with serum uric acid levels. This work illustrates how whole-genome sequencing data allow the detection of associations between low-frequency variants and complex traits.


Assuntos
Gota/genética , Polimorfismo de Nucleotídeo Único , Ácido Úrico/sangue , Humanos , Islândia , Mutação de Sentido Incorreto
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...