Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
BMC Genet ; 14: 108, 2013 Nov 07.
Article in English | MEDLINE | ID: mdl-24199751

ABSTRACT

BACKGROUND: The advent of genome-wide association studies has led to many novel disease-SNP associations, opening the door to focused study on their biological underpinnings. Because of the importance of analyzing these associations, numerous statistical methods have been devoted to them. However, fewer methods have attempted to associate entire genes or genomic regions with outcomes, which is potentially more useful knowledge from a biological perspective and those methods currently implemented are often permutation-based. RESULTS: One property of some permutation-based tests is that their power varies as a function of whether significant markers are in regions of linkage disequilibrium (LD) or not, which we show from a theoretical perspective. We therefore develop two methods for quantifying the degree of association between a genomic region and outcome, both of whose power does not vary as a function of LD structure. One method uses dimension reduction to "filter" redundant information when significant LD exists in the region, while the other, called the summary-statistic test, controls for LD by scaling marker Z-statistics using knowledge of the correlation matrix of markers. An advantage of this latter test is that it does not require the original data, but only their Z-statistics from univariate regressions and an estimate of the correlation structure of markers, and we show how to modify the test to protect the type 1 error rate when the correlation structure of markers is misspecified. We apply these methods to sequence data of oral cleft and compare our results to previously proposed gene tests, in particular permutation-based ones. We evaluate the versatility of the modification of the summary-statistic test since the specification of correlation structure between markers can be inaccurate. CONCLUSION: We find a significant association in the sequence data between the 8q24 region and oral cleft using our dimension reduction approach and a borderline significant association using the summary-statistic based approach. We also implement the summary-statistic test using Z-statistics from an already-published GWAS of Chronic Obstructive Pulmonary Disorder (COPD) and correlation structure obtained from HapMap. We experiment with the modification of this test because the correlation structure is assumed imperfectly known.


Subject(s)
Genetic Testing/methods , Models, Statistical , Algorithms , Cleft Lip/genetics , Cleft Lip/pathology , Genome-Wide Association Study , HapMap Project , Humans , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Pulmonary Disease, Chronic Obstructive/genetics , Pulmonary Disease, Chronic Obstructive/pathology
2.
Bioinformatics ; 28(23): 3027-33, 2012 Dec 01.
Article in English | MEDLINE | ID: mdl-23044548

ABSTRACT

MOTIVATION: For the analysis of rare variants in sequence data, numerous approaches have been suggested. Fixed and flexible threshold approaches collapse the rare variant information of a genomic region into a test statistic with reduced dimensionality. Alternatively, the rare variant information can be combined in statistical frameworks that are based on suitable regression models, machine learning, etc. Although the existing approaches provide powerful tests that can incorporate information on allele frequencies and prior biological knowledge, differences in the spatial clustering of rare variants between cases and controls cannot be incorporated. Based on the assumption that deleterious variants and protective variants cluster or occur in different parts of the genomic region of interest, we propose a testing strategy for rare variants that builds on spatial cluster methodology and that guides the identification of the biological relevant segments of the region. Our approach does not require any assumption about the directions of the genetic effects. RESULTS: In simulation studies, we assess the power of the clustering approach and compare it with existing methodology. Our simulation results suggest that the clustering approach for rare variants is well powered, even in situations that are ideal for standard methods. The efficiency of our spatial clustering approach is not affected by the presence of rare variants that have opposite effect size directions. An application to a sequencing study for non-syndromic cleft lip with or without cleft palate (NSCL/P) demonstrates its practical relevance. The proposed testing strategy is applied to a genomic region on chromosome 15q13.3 that was implicated in NSCL/P etiology in a previous genome-wide association study, and its results are compared with standard approaches. AVAILABILITY: Source code and documentation for the implementation in R will be provided online. Currently, the R-implementation only supports genotype data. We currently are working on an extension for VCF files. CONTACT: heide.fier@googlemail.com.


Subject(s)
Alleles , Cleft Lip/genetics , Cleft Palate/genetics , Computer Simulation , Brain/abnormalities , Chromosomes, Human, Pair 15/genetics , Cluster Analysis , Gene Frequency , Genome-Wide Association Study , Genotype , Humans
3.
Nat Genet ; 44(9): 968-71, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22863734

ABSTRACT

We have conducted the first meta-analyses for nonsyndromic cleft lip with or without cleft palate (NSCL/P) using data from the two largest genome-wide association studies published to date. We confirmed associations with all previously identified loci and identified six additional susceptibility regions (1p36, 2p21, 3p11.1, 8q21.3, 13q31.1 and 15q22). Analysis of phenotypic variability identified the first specific genetic risk factor for NSCLP (nonsyndromic cleft lip plus palate) (rs8001641; P(NSCLP) = 6.51 × 10(-11); homozygote relative risk = 2.41, 95% confidence interval (CI) 1.84-3.16).


Subject(s)
Cleft Lip/genetics , Cleft Palate/genetics , Genome-Wide Association Study/statistics & numerical data , Adult , Child , Cleft Lip/complications , Cleft Lip/epidemiology , Cleft Palate/complications , Cleft Palate/epidemiology , Female , Genetic Predisposition to Disease , Humans , Male , Parents , Polymorphism, Single Nucleotide/physiology , Risk Factors , Syndrome
4.
Genet Epidemiol ; 36(5): 419-29, 2012 Jul.
Article in English | MEDLINE | ID: mdl-22549767

ABSTRACT

Many genome-wide association studies (GWAS) have signals with unknown etiology. This paper addresses the question-is such an association signal caused by rare or common variants that lead to increased disease risk? For a genomic region implicated by a GWAS, we use single nucleotide polymorphism (SNP) data in a case-control setting to predict how many common or rare variants there are, using a Bayesian analysis. Our objective is to compute posterior probabilities for configurations of rare and/or common variants. We use an extension of coalescent trees--the ancestral recombination graphs--to model the genealogical history of the samples based on marker data. As we expect SNPs to be in linkage disequilibrium with common disease variants, we can expect the trees to reflect the type of variants. To demonstrate the application, we apply our method to candidate gene sequencing data from a German case-control study on nonsyndromic cleft lip with or without cleft palate.


Subject(s)
Molecular Epidemiology/methods , Bayes Theorem , Case-Control Studies , Genetic Markers , Genetic Variation , Germany , Haplotypes , Humans , Likelihood Functions , Linkage Disequilibrium , Models, Genetic , Models, Statistical , Models, Theoretical , Mutation , Polymorphism, Single Nucleotide , Probability , Risk
5.
Genet Epidemiol ; 35(8): 880-6, 2011 Dec.
Article in English | MEDLINE | ID: mdl-22125225

ABSTRACT

Despite the numerous and successful applications of genome-wide association studies (GWASs), there has been a lot of difficulty in discovering disease susceptibility loci (DSLs). This is due to the fact that the GWAS approach is an indirect mapping technique, often identifying markers. For the identification of DSLs, which is required for the understanding of the genetic pathways for complex diseases, sequencing data that examines every genetic locus directly is necessary. Yet, there is currently a lack of methodology targeted at the identification of the DSLs in sequencing data: existing methods localize the causal variant to a region but not to a single variant, and therefore do not allow one to identify unique loci that cause the phenotype association. Here, we have developed such a method to determine if there is evidence that an individual loci affects case/control status with sequencing data. This methodology differs from other rare variant approaches: rather than testing an entire region comprised of many loci for association with the phenotype, we can identify the individual genetic locus that causes the association between the phenotype and the genetic region. For each variant, the test determines if the pattern of linkage disequilibrium (LD) across the other variants coincides with the pattern expected if that variant were a DSL. Power simulations show that the method successfully detects the causal variant, distinguishing it from other nearby variants (in high LD with the causal variant), and outperforms the standard tests. The efficiency of the method is especially apparent with small samples, which are currently realistic for studies due to sequencing data costs. The practical relevance of the approach is illustrated by an application to a sequencing dataset for nonsyndromic cleft lip with or without cleft palate. The proposed method implicated one variant (P = 0.002, 0.062 after Bonferroni correction), which was not found by standard analyses. Code for implementation is available.


Subject(s)
Genetic Predisposition to Disease , Linkage Disequilibrium , Sequence Analysis, DNA/methods , Cleft Lip/epidemiology , Cleft Lip/genetics , Cleft Palate/epidemiology , Cleft Palate/genetics , Computing Methodologies , Data Interpretation, Statistical , Gene Frequency , Genetic Markers , Humans , Sequence Analysis, DNA/statistics & numerical data
SELECTION OF CITATIONS
SEARCH DETAIL
...