Results 1 - 20 of 86
1.
bioRxiv ; 2024 Mar 13.
Article in English | MEDLINE | ID: mdl-38559261

ABSTRACT

Inference of demographic and evolutionary parameters from a sample of genome sequences often proceeds by first inferring identical-by-descent (IBD) genome segments. By exploiting efficient data encoding based on the ancestral recombination graph (ARG), we obtain three major advantages over current approaches: (i) no need to impose a length threshold on IBD segments, (ii) IBD can be defined without the hard-to-verify requirement of no recombination, and (iii) computation time can be reduced with little loss of statistical efficiency using only the IBD segments from a set of sequence pairs that scales linearly with sample size. We first demonstrate powerful inferences when true IBD information is available from simulated data. For IBD inferred from real data, we propose an approximate Bayesian computation inference algorithm and use it to show that poorly-inferred short IBD segments can improve estimation precision. We show estimation precision similar to a previously-published estimator despite a 4 000-fold reduction in data used for inference. Computational cost limits model complexity in our approach, but we are able to incorporate unknown nuisance parameters and model misspecification, still finding improved parameter inference.
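
The approximate Bayesian computation (ABC) step mentioned in this abstract can be illustrated with a generic rejection sampler. This is a minimal sketch and not the authors' algorithm: the toy model (exponentially distributed segment lengths with an unknown rate), the uniform prior, the mean-length summary statistic, and every function name are assumptions chosen only for illustration.

```python
import random
import statistics


def simulate_lengths(rate, n, rng):
    """Toy simulator (assumption): n segment lengths drawn from Exponential(rate)."""
    return [rng.expovariate(rate) for _ in range(n)]


def abc_rejection(observed, n_draws=5000, tolerance=0.05, seed=1):
    """Rejection ABC: keep prior draws whose simulated summary is close to the observed one."""
    rng = random.Random(seed)
    obs_summary = statistics.fmean(observed)
    accepted = []
    for _ in range(n_draws):
        rate = rng.uniform(0.1, 5.0)  # candidate drawn from the assumed uniform prior
        simulated = simulate_lengths(rate, len(observed), rng)
        if abs(statistics.fmean(simulated) - obs_summary) <= tolerance:
            accepted.append(rate)  # accepted draws approximate the posterior
    return accepted


if __name__ == "__main__":
    # Synthetic "observed" data generated with a true rate of 2.0 (made-up value).
    observed = simulate_lengths(2.0, 200, random.Random(42))
    posterior = abc_rejection(observed)
    print(len(posterior), statistics.fmean(posterior))
```

A smaller tolerance gives a closer approximation to the posterior at the cost of fewer accepted draws, which is one face of the computational cost the abstract refers to.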

2.
Am J Hum Genet ; 109(11): 2080-2087, 2022 11 03.
Article in English | MEDLINE | ID: mdl-36288729

ABSTRACT

Genetic epilepsy with febrile seizures plus (GEFS+) is an autosomal dominant familial epilepsy syndrome characterized by distinctive phenotypic heterogeneity within families. The SCN1B c.363C>G (p.Cys121Trp) variant has been identified in independent, multi-generational families with GEFS+. Although the variant is present in population databases (at very low frequency), there is strong clinical, genetic, and functional evidence to support pathogenicity. Recurrent variants may be due to a founder event in which the variant has been inherited from a common ancestor. Here, we report evidence of a single founder event giving rise to the SCN1B c.363C>G variant in 14 independent families with epilepsy. A common haplotype was observed in all families, and the age of the most recent common ancestor was estimated to be approximately 800 years ago. Analysis of UK Biobank whole-exome-sequencing data identified 74 individuals with the same variant. All individuals carried haplotypes matching the epilepsy-affected families, suggesting all instances of the variant derive from a single mutational event. This unusual finding of a variant causing an autosomal dominant, early-onset disease in an outbred population that has persisted over many generations can be attributed to the relatively mild phenotype in most carriers and incomplete penetrance. Founder events are well established in autosomal recessive and late-onset disorders but are rarely observed in early-onset, autosomal dominant diseases. These findings suggest variants present in the population at low frequencies should be considered potentially pathogenic in mild phenotypes with incomplete penetrance and may be more important contributors to the genetic landscape than previously thought.


Subjects
Epilepsy , Febrile Seizures , Child , Humans , Pedigree , Electroencephalography , Febrile Seizures/genetics , Phenotype , Epilepsy/genetics
3.
Genet Epidemiol ; 46(7): 347-371, 2022 10.
Article in English | MEDLINE | ID: mdl-35842778

ABSTRACT

The inclusion of ancestrally diverse participants in genetic studies can lead to new discoveries and is important to ensure equitable health care benefit from research advances. Here, members of the Ethical, Legal, Social, Implications (ELSI) committee of the International Genetic Epidemiology Society (IGES) offer perspectives on methods and analysis tools for the conduct of inclusive genetic epidemiology research, with a focus on admixed and ancestrally diverse populations in support of reproducible research practices. We emphasize the importance of distinguishing socially defined population categorizations from genetic ancestry in the design, analysis, reporting, and interpretation of genetic epidemiology research findings. Finally, we discuss the current state of genomic resources used in genetic association studies, functional interpretation, and clinical and public health translation of genomic findings with respect to diverse populations.


Subjects
Population Genetics , Genomics , Epidemiologic Studies , Genetic Association Studies , Humans , Molecular Epidemiology
4.
Bioessays ; 44(5): e2100170, 2022 05.
Article in English | MEDLINE | ID: mdl-35279859

ABSTRACT

Complex-trait genetics has advanced dramatically through methods to estimate the heritability tagged by SNPs, both genome-wide and in genomic regions of interest such as those defined by functional annotations. The models underlying many of these analyses are inadequate, and consequently many SNP-heritability results published to date are inaccurate. Here, we review the modelling issues, both for analyses based on individual genotype data and association test statistics, highlighting the role of a low-dimensional model for the heritability of each SNP. We use state-of-the-art models to present updated results about how heritability is distributed with respect to functional annotations in the human genome, and how it varies with allele frequency, which can reflect purifying selection. Our results give finer detail to the picture that has emerged in recent years of complex trait heritability widely dispersed across the genome. Confounding due to population structure remains a problem that summary statistic analyses cannot reliably overcome. Also see the video abstract here: https://youtu.be/WC2u03V65MQ.


Subjects
Genome-Wide Association Study , Single Nucleotide Polymorphism , Gene Frequency , Human Genome/genetics , Genome-Wide Association Study/methods , Genotype , Humans , Genetic Models , Phenotype , Single Nucleotide Polymorphism/genetics , Heritable Quantitative Trait
5.
Mol Ecol Resour ; 22(1): 137-152, 2022 Jan.
Article in English | MEDLINE | ID: mdl-34192415

ABSTRACT

Mapping the genes underlying ecologically relevant traits in natural populations is fundamental to develop a molecular understanding of species adaptation. Current sequencing technologies enable the characterization of a species' genetic diversity across the landscape or even over its whole range. The relevant capture of the genetic diversity across the landscape is critical for a successful genetic mapping of traits and there are no clear guidelines on how to achieve an optimal sampling and which sequencing strategy to implement. Here we determine, through simulation, the sampling scheme that maximizes the power to map the genetic basis of a complex trait in an outbreeding species across an idealized landscape and draw genomic predictions for the trait, comparing individual and pool sequencing strategies. Our results show that quantitative trait locus detection power and prediction accuracy are higher when more populations over the landscape are sampled and this is more cost-effectively done with pool sequencing than with individual sequencing. Additionally, we recommend sampling populations from areas of high genetic diversity. As progress in sequencing enables the integration of trait-based functional ecology into landscape genomics studies, these findings will guide study designs allowing direct measures of genetic effects in natural populations across the environment.


Subjects
Ecology , Genomics , Research Design
6.
Genes (Basel) ; 12(8), 2021 08 05.
Article in English | MEDLINE | ID: mdl-34440383

ABSTRACT

Y chromosome and mitochondrial DNA profiles have been used as evidence in courts for decades, yet the problem of evaluating the weight of evidence has not been adequately resolved. Both are lineage markers (inherited from just one parent), which presents different interpretation challenges compared with standard autosomal DNA profiles (inherited from both parents). We review approaches to the evaluation of lineage marker profiles for forensic identification, focussing on the key roles of profile mutation rate and relatedness (extending beyond known relatives). Higher mutation rates imply fewer individuals matching the profile of an alleged contributor, but they will be more closely related. This makes it challenging to evaluate the possibility that one of these matching individuals could be the true source, because relatives may be plausible alternative contributors, and may not be well mixed in the population. These issues reduce the usefulness of profile databases drawn from a broad population: larger populations can have a lower profile relative frequency because of lower relatedness with the alleged contributor. Many evaluation methods do not adequately take account of distant relatedness, but its effects have become more pronounced with the latest generation of high-mutation-rate Y profiles.


Subjects
Human Y Chromosomes , DNA/genetics , Forensic Genetics , Mitochondrial Genome , Humans
8.
Nat Genet ; 52(4): 458-462, 2020 04.
Article in English | MEDLINE | ID: mdl-32203469

ABSTRACT

There is currently much debate regarding the best model for how heritability varies across the genome. The authors of GCTA recommend the GCTA-LDMS-I model, the authors of LD Score Regression recommend the Baseline LD model, and we have recommended the LDAK model. Here we provide a statistical framework for assessing heritability models using summary statistics from genome-wide association studies. Based on 31 studies of complex human traits (average sample size 136,000), we show that the Baseline LD model is more realistic than other existing heritability models, but that it can be improved by incorporating features from the LDAK model. Our framework also provides a method for estimating the selection-related parameter α from summary statistics. We find strong evidence (P < 1 × 10^-6) of negative genome-wide selection for traits, including height, systolic blood pressure and college education, and that the impact of selection is stronger inside functional categories, such as coding SNPs and promoter regions.


Subjects
Inheritance Patterns/genetics , Genome-Wide Association Study/methods , Genotype , Humans , Linkage Disequilibrium/genetics , Genetic Models , Multifactorial Inheritance/genetics , Phenotype , Single Nucleotide Polymorphism/genetics , Heritable Quantitative Trait , Sample Size , Software
9.
Genet Epidemiol ; 43(8): 930-940, 2019 12.
Article in English | MEDLINE | ID: mdl-31541496

ABSTRACT

Linkage disequilibrium SCore regression (LDSC) has become a popular approach to estimate confounding bias, heritability, and genetic correlation using only genome-wide association study (GWAS) test statistics. SumHer is a newly introduced alternative with similar aims. We show using theory and simulations that both approaches fail to adequately account for confounding bias, even when the assumed heritability model is correct. Consequently, these methods may estimate heritability poorly if there was an inadequate adjustment for confounding in the original GWAS analysis. We also show that the choice of a summary statistic for use in LDSC or SumHer can have a large impact on resulting inferences. Further, covariate adjustments in the original GWAS can alter the target of heritability estimation, which can be problematic for test statistics from a meta-analysis of GWAS with different covariate adjustments.


Subjects
Bias , Statistical Data Interpretation , Inheritance Patterns , Genetic Models , Computer Simulation , Genome-Wide Association Study , Humans , Linkage Disequilibrium , Single Nucleotide Polymorphism
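
For readers unfamiliar with the regression discussed in the abstract above, the basic LD score regression model is usually written as E[chi2_j] = intercept + (N * h2 / M) * l_j, where l_j is the LD score of SNP j, N the sample size and M the number of SNPs. An intercept above 1 is conventionally read as confounding, which is the interpretation the abstract calls into question. The sketch below fits that line by unweighted least squares to synthetic, noise-free numbers. It is an illustration under stated assumptions, not the LDSC or SumHer software: real analyses use weighted regression and resampling-based standard errors, and all function names and values here are hypothetical.

```python
def ols_line(x, y):
    """Unweighted least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def ldsc_estimates(chi2, ld_scores, n_samples, n_snps):
    """Basic model: E[chi2_j] = intercept + (N * h2 / M) * l_j, so h2 = slope * M / N."""
    intercept, slope = ols_line(ld_scores, chi2)
    h2 = slope * n_snps / n_samples
    return intercept, h2


# Synthetic, noise-free illustration (all numbers are made up).
N, M, true_h2, true_intercept = 50_000, 1_000_000, 0.4, 1.1
ld = [10.0, 50.0, 100.0, 200.0, 400.0]
chi2 = [true_intercept + N * true_h2 / M * l for l in ld]
intercept, h2 = ldsc_estimates(chi2, ld, N, M)
print(round(intercept, 3), round(h2, 3))
```

With noise-free inputs the fit recovers the values used to generate the data; the abstract's point is that with real test statistics the intercept need not isolate confounding so cleanly.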
10.
Forensic Sci Int Genet ; 38: 48-53, 2019 01.
Article in English | MEDLINE | ID: mdl-30340211

ABSTRACT

We recently introduced a new approach to the evaluation of weight of evidence (WoE) for Y-chromosome profiles. Rather than attempting to calculate match probabilities, which is particularly problematic for modern Y-profiles with high mutation rates, we proposed using simulation to describe the distribution of the number of males in the population with a matching Y-profile, both the unconditional distribution and conditional on a database frequency of the profile. Here we further validate the new approach by showing that our results are robust to assumptions about the allelic ladder and the founder haplotypes, and we extend the approach in two important directions. Firstly, forensic databases are not the only source of background data relevant to the evaluation of Y-profile evidence: in many cases the Y-profiles of one or more relatives of the accused are also available. To date it has been unclear how to use this additional information, but in our simulation-based approach its effect is readily incorporated. We describe this approach and illustrate how the WoE that a man was the source of an observed Y-profile changes when the Y-profiles of some of his male-line relatives are also available. Secondly, we extend our new approach to mixtures of Y-profiles from two or more males. Surprisingly, our simulation-based approach reveals that observing a 2-male mixture that includes an alleged contributor's profile is almost as strong evidence as observing a matching single-contributor evidence sample, and even 3-male and 4-male mixtures are only slightly weaker.


Subjects
Human Y Chromosomes , DNA/genetics , Alleles , DNA Fingerprinting , Haplotypes , Humans , Likelihood Functions , Male , Microsatellite Repeats , Software
11.
Nat Genet ; 51(2): 277-284, 2019 02.
Article in English | MEDLINE | ID: mdl-30510236

ABSTRACT

We present SumHer, software for estimating confounding bias, SNP heritability, enrichments of heritability and genetic correlations using summary statistics from genome-wide association studies. The key difference between SumHer and the existing software LD Score Regression (LDSC) is that SumHer allows the user to specify the heritability model. We apply SumHer to results from 24 large-scale association studies (average sample size 121,000) using our recommended heritability model. We show that these studies tended to substantially over-correct for confounding, and as a result the number of genome-wide significant loci was under-reported by about a quarter. We also estimate enrichments for 24 categories of SNPs defined by functional annotations. A previous study using LDSC reported that conserved regions were 13-fold enriched, and found a further six categories with above threefold enrichment. By contrast, our analysis using SumHer finds that none of the categories have enrichment above twofold. SumHer provides an improved understanding of the genetic architecture of complex traits, which enables more efficient analysis of future genetic data.


Subjects
Single Nucleotide Polymorphism/genetics , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study/methods , Humans , Inheritance Patterns/genetics , Genetic Models , Phenotype , Heritable Quantitative Trait , Software
12.
PLoS Genet ; 14(11): e1007774, 2018 11.
Article in English | MEDLINE | ID: mdl-30383746

ABSTRACT

Mitochondrial DNA (mtDNA) is useful to assist with identification of the source of a biological sample, or to confirm matrilineal relatedness. Although the autosomal genome is much larger, mtDNA has an advantage for forensic applications of multiple copy number per cell, allowing better recovery of sequence information from degraded samples. In addition, biological samples such as fingernails, old bones, teeth and hair have mtDNA but little or no autosomal DNA. The relatively low mutation rate of the mitochondrial genome (mitogenome) means that there can be large sets of matrilineal-related individuals sharing a common mitogenome. Here we present the mitolina simulation software that we use to describe the distribution of the number of mitogenomes in a population that match a given mitogenome, and investigate its dependence on population size and growth rate, and on a database count of the mitogenome. Further, we report on the distribution of the number of meioses separating pairs of individuals with matching mitogenome. Our results have important implications for assessing the weight of mtDNA profile evidence in forensic science, but mtDNA analysis has many non-human applications, for example in tracking the source of ivory. Our methods and software can also be used for simulations to help validate models of population history in human or non-human populations.


Subjects
Mitochondrial DNA/genetics , Mitochondrial Genome , Genetic Models , Human Y Chromosomes/genetics , Computer Simulation , Nucleic Acid Databases , Female , Forensic Genetics/statistics & numerical data , Genetic Variation , Population Genetics , Haplotypes , Humans , Iran , Male , Mutation , Software , United States
13.
PLoS Genet ; 13(11): e1007028, 2017 Nov.
Article in English | MEDLINE | ID: mdl-29099833

ABSTRACT

The introduction of forensic autosomal DNA profiles was controversial, but the problems were successfully addressed, and DNA profiling has gone on to revolutionise forensic science. Y-chromosome profiles are valuable when there is a mixture of male-source and female-source DNA, and interest centres on the identity of the male source(s) of the DNA. The problem of evaluating evidential weight is even more challenging for Y profiles than for autosomal profiles. Numerous approaches have been proposed, but they fail to deal adequately with the fact that men with matching Y-profiles are related in extended patrilineal clans, many of which may not be represented in available databases. The higher mutation rates of modern profiling kits have led to increased discriminatory power but they have also exacerbated the problem of fairly conveying evidential value. Because the relevant population is difficult to define, yet the number of matching relatives is fixed as population size varies, it is typically infeasible to derive population-based match probabilities relevant to a specific crime. We propose a conceptually simple solution, based on a simulation model and software to approximate the distribution of the number of males with a matching Y profile. We show that this distribution is robust to different values for the variance in reproductive success and the population growth rate. We also use importance sampling reweighting to derive the distribution of the number of matching males conditional on a database frequency, finding that this conditioning typically has only a modest impact. We illustrate the use of our approach to quantify the value of Y profile evidence for a court in a way that is both scientifically valid and easily comprehensible by a judge or juror.


Subjects
Human Y Chromosomes/genetics , DNA/genetics , DNA Fingerprinting/methods , Forensic Genetics/methods , Humans , Male , Probability , Reproduction , Software
14.
Genome Res ; 27(10): 1715-1729, 2017 10.
Article in English | MEDLINE | ID: mdl-28864458

ABSTRACT

Gene panel and exome sequencing have revealed a high rate of molecular diagnoses among diseases where the genetic architecture has proven suitable for sequencing approaches, with a large number of distinct and highly penetrant causal variants identified among a growing list of disease genes. The challenge is, given the DNA sequence of a new patient, to distinguish disease-causing from benign variants. Large samples of human standing variation data highlight regional variation in the tolerance to missense variation within the protein-coding sequence of genes. This information is not well captured by existing bioinformatic tools, but is effective in improving variant interpretation. To address this limitation in existing tools, we introduce the missense tolerance ratio (MTR), which summarizes available human standing variation data within genes to encapsulate population level genetic variation. We find that patient-ascertained pathogenic variants preferentially cluster in low MTR regions (P < 0.005) of well-informed genes. By evaluating 20 publicly available predictive tools across genes linked to epilepsy, we also highlight the importance of understanding the empirical null distribution of existing prediction tools, as these vary across genes. Subsequently integrating the MTR with the empirically selected bioinformatic tools in a gene-specific approach demonstrates a clear improvement in the ability to predict pathogenic missense variants from background missense variation in disease genes. Among an independent test sample of case and control missense variants, case variants (0.83 median score) consistently achieve higher pathogenicity prediction probabilities than control variants (0.02 median score; Mann-Whitney U test, P < 1 × 10^-16). We focus on the application to epilepsy genes; however, the framework is applicable to disease genes beyond epilepsy.


Subjects
Computational Biology/methods , Epilepsy/genetics , Genomics/methods , Pharmacogenomic Variants , Precision Medicine/methods , Epilepsy/diagnosis , Humans
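
The abstract above does not spell out how the missense tolerance ratio is computed. As generally described, it compares the observed proportion of missense variants among missense plus synonymous variants in a window of the coding sequence with the proportion expected from all possible single-nucleotide changes in that window. The sketch below encodes that ratio for one window. The formula should be checked against the paper itself, and the function name, argument names, and counts are hypothetical.

```python
def missense_tolerance_ratio(obs_missense, obs_synonymous,
                             possible_missense, possible_synonymous):
    """Observed missense proportion divided by the proportion expected from all possible variants."""
    observed_total = obs_missense + obs_synonymous
    possible_total = possible_missense + possible_synonymous
    if observed_total == 0 or possible_missense == 0:
        return None  # window carries no usable information
    observed_fraction = obs_missense / observed_total
    expected_fraction = possible_missense / possible_total
    return observed_fraction / expected_fraction


# Hypothetical counts for a single window (made-up numbers).
mtr = missense_tolerance_ratio(obs_missense=6, obs_synonymous=14,
                               possible_missense=210, possible_synonymous=70)
print(round(mtr, 2))
```

Values below 1 indicate fewer missense variants than expected, which is the sense in which the abstract speaks of pathogenic variants clustering in low-MTR regions.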
15.
Nat Genet ; 49(7): 986-992, 2017 Jul.
Article in English | MEDLINE | ID: mdl-28530675

ABSTRACT

SNP heritability, the proportion of phenotypic variance explained by SNPs, has been reported for many hundreds of traits. Its estimation requires strong prior assumptions about the distribution of heritability across the genome, but current assumptions have not been thoroughly tested. By analyzing imputed data for a large number of human traits, we empirically derive a model that more accurately describes how heritability varies with minor allele frequency (MAF), linkage disequilibrium (LD) and genotype certainty. Across 19 traits, our improved model leads to estimates of common SNP heritability on average 43% (s.d. 3%) higher than those obtained from the widely used software GCTA and 25% (s.d. 2%) higher than those from the recently proposed extension GCTA-LDMS. Previously, DNase I hypersensitivity sites were reported to explain 79% of SNP heritability; using our improved heritability model, their estimated contribution is only 24%.


Subjects
Genome-Wide Association Study/methods , Genetic Models , Multifactorial Inheritance/genetics , Single Nucleotide Polymorphism/genetics , Alleles , Case-Control Studies , Chromosome Fragile Sites , Cohort Studies , Computer Simulation , Deoxyribonuclease I , Gene Frequency , Genetic Association Studies , Humans , Inheritance Patterns , Heritable Quantitative Trait
16.
Bioinformatics ; 33(8): 1246-1247, 2017 04 15.
Article in English | MEDLINE | ID: mdl-28003266

ABSTRACT

Motivation: Sequencing pools of individuals (Pool-Seq) is a cost-effective way to gain insight into the genetics of complex traits, but as yet no parametric method has been developed to both test for genetic effects and estimate their magnitude. Here, we propose GWAlpha, a flexible method to obtain parametric estimates of genetic effects genome-wide from Pool-Seq experiments. Results: We showed that GWAlpha powerfully replicates the results of Genome-Wide Association Studies (GWAS) from model organisms. We perform simulation studies that illustrate the effect on power of sample size and number of pools and test the method on different experimental data. Availability and Implementation: GWAlpha is implemented in Python, designed to run on Linux operating systems and tested on Mac OS. It is freely available at https://github.com/aflevel/GWAlpha. Contact: afournier@unimelb.edu.au. Supplementary information: Supplementary data are available at Bioinformatics online.


Subjects
Genome-Wide Association Study/methods , Software , Genome , Phenotype , Sample Size
18.
Hum Mutat ; 38(1): 78-85, 2017 01.
Article in English | MEDLINE | ID: mdl-27650164

ABSTRACT

The aryl hydrocarbon receptor interacting protein (AIP) founder mutation R304* (or p.R304*; NM_003977.3:c.910C>T, p.Arg304Ter) identified in Northern Ireland (NI) predisposes to acromegaly/gigantism; its population health impact remains unexplored. We measured R304* carrier frequency in 936 Mid Ulster, 1,000 Greater Belfast (both in NI) and 2,094 Republic of Ireland (ROI) volunteers and in 116 NI or ROI acromegaly/gigantism patients. Carrier frequencies were 0.0064 in Mid Ulster (95% CI = 0.0027-0.013; P = 0.0005 vs. ROI), 0.001 in Greater Belfast (0.00011-0.0047) and zero in ROI (0-0.0014). R304* prevalence was elevated in acromegaly/gigantism patients in NI (11/87, 12.6%, P < 0.05), but not in ROI (2/29, 6.8%) versus non-Irish patients (0-2.41%). Haploblock conservation supported a common ancestor for all the 18 identified Irish pedigrees (81 carriers, 30 affected). Time to most recent common ancestor (tMRCA) was 2,550 (1,275-5,000) years. tMRCA-based simulations predicted 432 (90-5,175) current carriers, including 86 affected (18-1,035) for 20% penetrance. In conclusion, R304* is frequent in Mid Ulster, resulting in numerous acromegaly/gigantism cases. tMRCA is consistent with historical/folklore accounts of Irish giants. Forward simulations predict many undetected carriers; geographically targeted population screening improves asymptomatic carrier identification, complementing clinical testing of patients/relatives. We generated disease awareness locally, necessary for early diagnosis and improved outcomes of AIP-related disease.


Subjects
Acromegaly/epidemiology , Acromegaly/genetics , Genetic Predisposition to Disease , Gigantism/epidemiology , Gigantism/genetics , Intracellular Signaling Peptides and Proteins/genetics , Acromegaly/diagnosis , Adolescent , Adult , Aged , Aged 80 and over , Alleles , Amino Acid Substitution , Chromosome Mapping , Cross-Sectional Studies , Female , Gene Frequency , Genotype , Gigantism/diagnosis , Heterozygote , Humans , Ireland/epidemiology , Male , Mass Screening , Middle Aged , Phenotype , Risk , Young Adult
19.
Forensic Sci Int ; 272: e7-e9, 2017 Mar.
Article in English | MEDLINE | ID: mdl-27817943

ABSTRACT

This letter comments on the report "Forensic science in criminal courts: Ensuring scientific validity of feature-comparison methods" recently released by the President's Council of Advisors on Science and Technology (PCAST). The report advocates a procedure for evaluation of forensic evidence that is a two-stage procedure in which the first stage is "match"/"non-match" and the second stage is empirical assessment of sensitivity (correct acceptance) and false alarm (false acceptance) rates. Almost always, quantitative data from feature-comparison methods are continuously-valued and have within-source variability. We explain why a two-stage procedure is not appropriate for this type of data, and recommend use of statistical procedures which are appropriate.

20.
Forensic Sci Int Genet ; 25: 227-239, 2016 11.
Article in English | MEDLINE | ID: mdl-27768939

ABSTRACT

Many DNA profiles recovered from crime scene samples are of a quality that does not allow them to be searched against, nor entered into, databases. We propose a method for the comparison of profiles arising from two DNA samples, one or both of which can have multiple donors and be affected by low DNA template or degraded DNA. We compute likelihood ratios to evaluate the hypothesis that the two samples have a common DNA donor, and hypotheses specifying the relatedness of two donors. Our method uses a probability distribution for the genotype of the donor of interest in each sample. This distribution can be obtained from a statistical model, or we can exploit the ability of trained human experts to assess genotype probabilities, thus extracting much information that would be discarded by standard interpretation rules. Our method is compatible with established methods in simple settings, but is more widely applicable and can make better use of information than many current methods for the analysis of mixed-source, low-template DNA profiles. It can accommodate uncertainty arising from relatedness instead of or in addition to uncertainty arising from noisy genotyping. We describe a computer program GPMDNA, available under an open source licence, to calculate LRs using the method presented in this paper.


Subjects
Necrotic DNA Degradation , DNA Fingerprinting , DNA/analysis , Nucleic Acid Databases , Statistical Models , Software , Forensic Genetics , Genotype , Humans , Likelihood Functions , Probability
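
The likelihood-ratio idea in the abstract above can be made concrete for a single locus. Suppose p1(g) and p2(g) are the probabilities assigned to genotype g for the donor of interest in each sample, both formed from the same population genotype frequencies prior(g), and suppose the alternative hypothesis is two unrelated donors. Bayes' theorem then gives LR = sum over g of p1(g) * p2(g) / prior(g). The sketch below is a textbook consequence of those assumptions and is not taken from the GPMDNA program; the function name, genotype labels, and all probabilities are hypothetical.

```python
def common_donor_lr(p1, p2, prior):
    """Single-locus LR for a common donor versus two unrelated donors.

    p1, p2: genotype probability distributions for the donor of interest in each sample.
    prior:  population genotype frequencies used to form p1 and p2.
    """
    return sum(p1.get(g, 0.0) * p2.get(g, 0.0) / prior[g]
               for g in prior if prior[g] > 0)


# Hypothetical locus with alleles A (0.2) and B (0.8) under Hardy-Weinberg proportions.
prior = {"AA": 0.04, "AB": 0.32, "BB": 0.64}
sample1 = {"AA": 0.9, "AB": 0.1, "BB": 0.0}
sample2 = {"AA": 0.7, "AB": 0.3, "BB": 0.0}
lr = common_donor_lr(sample1, sample2, prior)
print(lr)
```

Two checks on the formula: when both distributions put all their mass on one genotype, the LR reduces to the reciprocal of that genotype's frequency, and when one sample is uninformative (its distribution equals the prior) the LR is 1. Across loci assumed independent, per-locus LRs would be multiplied.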