Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
1.
Genome Med ; 13(1): 66, 2021 04 21.
Article in English | MEDLINE | ID: mdl-33883027

ABSTRACT

BACKGROUND: The large airway epithelial barrier provides one of the first lines of defense against respiratory viruses, including SARS-CoV-2 that causes COVID-19. Substantial inter-individual variability in individual disease courses is hypothesized to be partially mediated by the differential regulation of the genes that interact with the SARS-CoV-2 virus or are involved in the subsequent host response. Here, we comprehensively investigated non-genetic and genetic factors influencing COVID-19-relevant bronchial epithelial gene expression. METHODS: We analyzed RNA-sequencing data from bronchial epithelial brushings obtained from uninfected individuals. We related ACE2 gene expression to host and environmental factors in the SPIROMICS cohort of smokers with and without chronic obstructive pulmonary disease (COPD) and replicated these associations in two asthma cohorts, SARP and MAST. To identify airway biology beyond ACE2 binding that may contribute to increased susceptibility, we used gene set enrichment analyses to determine if gene expression changes indicative of a suppressed airway immune response observed early in SARS-CoV-2 infection are also observed in association with host factors. To identify host genetic variants affecting COVID-19 susceptibility in SPIROMICS, we performed expression quantitative trait (eQTL) mapping and investigated the phenotypic associations of the eQTL variants. RESULTS: We found that ACE2 expression was higher in relation to active smoking, obesity, and hypertension that are known risk factors of COVID-19 severity, while an association with interferon-related inflammation was driven by the truncated, non-binding ACE2 isoform. We discovered that expression patterns of a suppressed airway immune response to early SARS-CoV-2 infection, compared to other viruses, are similar to patterns associated with obesity, hypertension, and cardiovascular disease, which may thus contribute to a COVID-19-susceptible airway environment. eQTL mapping identified regulatory variants for genes implicated in COVID-19, some of which had pheWAS evidence for their potential role in respiratory infections. CONCLUSIONS: These data provide evidence that clinically relevant variation in the expression of COVID-19-related genes is associated with host factors, environmental exposures, and likely host genetic variation.


Subject(s)
Bronchi , COVID-19/genetics , Respiratory Mucosa , SARS-CoV-2 , Adult , Aged , Aged, 80 and over , Angiotensin-Converting Enzyme 2/genetics , Asthma/genetics , COVID-19/immunology , Cardiovascular Diseases/genetics , Cardiovascular Diseases/immunology , Gene Expression , Genetic Variation , Humans , Middle Aged , Obesity/genetics , Obesity/immunology , Pulmonary Disease, Chronic Obstructive/genetics , Quantitative Trait Loci , Risk Factors , Smoking/genetics
2.
Cell Rep ; 34(1): 108573, 2021 01 05.
Article in English | MEDLINE | ID: mdl-33406429

ABSTRACT

Whereas the human fetal immune system is poised to generate immune tolerance and suppress inflammation in utero, an adult-like immune system emerges to orchestrate anti-pathogen immune responses in post-natal life. It has been posited that cells of the adult immune system arise as a discrete ontological "layer" of hematopoietic stem-progenitor cells (HSPCs) and their progeny; evidence supporting this model in humans has, however, been inconclusive. Here, we combine bulk and single-cell transcriptional profiling of lymphoid cells, myeloid cells, and HSPCs from fetal, perinatal, and adult developmental stages to demonstrate that the fetal-to-adult transition occurs progressively along a continuum of maturity-with a substantial degree of inter-individual variation at the time of birth-rather than via a transition between discrete waves. These findings have important implications for the design of strategies for prophylaxis against infection in the newborn and for the use of umbilical cord blood (UCB) in the setting of transplantation.


Subject(s)
Fetus/metabolism , Hematopoietic Stem Cells/metabolism , Lymphocytes/metabolism , Myeloid Cells/metabolism , Single-Cell Analysis , T-Lymphocytes/metabolism , Transcriptome , Bone Marrow/metabolism , Cell Culture Techniques , Female , Fetal Blood , Humans , Immunity , Pregnancy , Sequence Analysis, RNA
3.
Am J Respir Crit Care Med ; 197(3): 313-324, 2018 02 01.
Article in English | MEDLINE | ID: mdl-29064281

ABSTRACT

RATIONALE: Quantification of type 2 inflammation provided a molecular basis for heterogeneity in asthma. Non-type 2 pathways that contribute to asthma pathogenesis are not well understood. OBJECTIVES: To identify dysregulated pathways beyond type 2 inflammation. METHODS: We applied RNA sequencing to airway epithelial brushings obtained from subjects with stable mild asthma not on corticosteroids (n = 19) and healthy control subjects (n = 16). Sequencing reads were mapped to human and viral genomes. In the same cohort, and in a separate group with severe asthma (n = 301), we profiled blood gene expression with microarrays. MEASUREMENTS AND MAIN RESULTS: In airway brushings from mild asthma on inhaled corticosteroids, RNA sequencing yielded 1,379 differentially expressed genes (false discovery rate < 0.01). Pathway analysis revealed increased expression of type 2 markers, IFN-stimulated genes (ISGs), and endoplasmic reticulum (ER) stress-related genes. Airway epithelial ISG expression was not associated with type 2 inflammation in asthma or with viral transcripts but was associated with reduced lung function by FEV1 (ρ = -0.72; P = 0.0004). ER stress was confirmed by an increase in XBP1 (X-box binding protein 1) splicing in mild asthma and was associated with both type 2 inflammation and ISG expression. ISGs were also the most activated genes in blood cells in asthma and were correlated with airway ISG expression (ρ = 0.55; P = 0.030). High blood ISG expression in severe asthma was similarly unrelated to type 2 inflammation. CONCLUSIONS: ISG activation is prominent in asthma, independent of viral transcripts, orthogonal to type 2 inflammation, and associated with distinct clinical features. ER stress is associated with both type 2 inflammation and ISG expression.


Subject(s)
Asthma/genetics , Asthma/physiopathology , Endoplasmic Reticulum/genetics , Gene Expression Regulation , Interferon Regulatory Factor-3/genetics , Adult , Case-Control Studies , Eosinophils/immunology , Female , Humans , Inflammation Mediators/metabolism , Male , Middle Aged , Oxidative Stress/genetics , RNA/genetics , Reference Values , Sensitivity and Specificity , Signal Transduction
4.
Alcohol Clin Exp Res ; 41(4): 711-718, 2017 04.
Article in English | MEDLINE | ID: mdl-28196272

ABSTRACT

BACKGROUND: Previous genomewide association studies (GWASs) have identified a number of putative risk loci for alcohol dependence (AD). However, only a few loci have replicated and these replicated variants only explain a small proportion of AD risk. Using an innovative approach, the goal of this study was to generate hypotheses about potentially causal variants for AD that can be explored further through functional studies. METHODS: We employed targeted capture of 71 candidate loci and flanking regions followed by next-generation deep sequencing (mean coverage 78X) in 806 European Americans. Regions included in our targeted capture library were genes identified through published GWAS of alcohol, all human alcohol and aldehyde dehydrogenases, reward system genes including dopaminergic and opioid receptors, prioritized candidate genes based on previous associations, and genes involved in the absorption, distribution, metabolism, and excretion of drugs. We performed single-locus tests to determine if any single variant was associated with AD symptom count. Sets of variants that overlapped with biologically meaningful annotations were tested for association in aggregate. RESULTS: No single, common variant was significantly associated with AD in our study. We did, however, find evidence for association with several variant sets. Two variant sets were significant at the q-value <0.10 level: a genic enhancer for ADHFE1 (p = 1.47 × 10-5 ; q = 0.019), an alcohol dehydrogenase, and ADORA1 (p = 5.29 × 10-5 ; q = 0.035), an adenosine receptor that belongs to a G-protein-coupled receptor gene family. CONCLUSIONS: To our knowledge, this is the first sequencing study of AD to examine variants in entire genes, including flanking and regulatory regions. We found that in addition to protein coding variant sets, regulatory variant sets may play a role in AD. From these findings, we have generated initial functional hypotheses about how these sets may influence AD.


Subject(s)
Alcoholism/diagnosis , Alcoholism/genetics , Genetic Association Studies/methods , Genetic Variation/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Adult , Alcoholism/epidemiology , Female , Humans , Male , Young Adult
5.
Genome Biol ; 16: 291, 2015 Dec 23.
Article in English | MEDLINE | ID: mdl-26699738

ABSTRACT

BACKGROUND: Genetic influence on DNA methylation is potentially an important mechanism affecting individual differences in humans. We use next-generation sequencing to assay blood DNA methylation at approximately 4.5 million loci, each comprising 2.9 CpGs on average, in 697 normal subjects. Methylation measures at each locus are tested for association with approximately 4.5 million single nucleotide polymorphisms (SNPs) to exhaustively screen for methylation quantitative trait loci (meQTLs). RESULTS: Using stringent false discovery rate control, 15 % of methylation sites show genetic influence. Most meQTLs are local, where the associated SNP and methylation site are in close genomic proximity. Distant meQTLs and those spanning different chromosomes are less common. Most local meQTLs encompass common SNPs that alter CpG sites (CpG-SNPs). Local meQTLs encompassing CpG-SNPs are enriched in regions of inactive chromatin in blood cells. In contrast, local meQTLs lacking CpG-SNPs are enriched in regions of active chromatin and transcription factor binding sites. Of 393 local meQTLs that overlap disease-associated regions from genome-wide studies, a high percentage encompass common CpG-SNPs. These meQTLs overlap active enhancers, differentiating them from CpG-SNP meQTLs in inactive chromatin. CONCLUSIONS: Genetic influence on the human blood methylome is common, involves several heterogeneous processes and is predominantly dependent on local sequence context at the meQTL site. Most meQTLs involve CpG-SNPs, while sequence-dependent effects on chromatin binding are also important in regions of active chromatin. An abundance of local meQTLs resulting from methylation of CpG-SNPs in inactive chromatin suggests that many meQTLs lack functional consequence. Integrating meQTL and Roadmap Epigenomics data could assist fine-mapping efforts.


Subject(s)
DNA Methylation , Genome, Human , Quantitative Trait Loci , Blood/metabolism , CpG Islands , Female , High-Throughput Nucleotide Sequencing , Humans , Male , Polymorphism, Single Nucleotide , Sequence Analysis, DNA
6.
Alcohol Clin Exp Res ; 39(8): 1396-405, 2015 Aug.
Article in English | MEDLINE | ID: mdl-26146898

ABSTRACT

BACKGROUND: Methylome-wide association (MWAS) studies present a new way to advance the search for biological correlates for alcohol use. A challenge with methylation studies of alcohol involves the causal direction of significant methylation-alcohol associations. One way to address this issue is to combine MWAS data with genomewide association study (GWAS) data. METHODS: Here, we combined MWAS and GWAS results for alcohol use from 619 individuals. Our MWAS data were generated by next-generation sequencing of the methylated genomic DNA fraction, producing over 60 million reads per subject to interrogate methylation levels at ~27 million autosomal CpG sites in the human genome. Our GWAS included 5,571,786 single nucleotide polymorphisms (SNPs) imputed with 1000 Genomes. RESULTS: When combining the MWAS and GWAS data, our top finding was a region in an intron of CNTN4 (p = 2.55 × 10(-8) ), located between chr3: 2,555,403 and 2,555,524, encompassing SNPs rs1382874 and rs1382875. This finding was then replicated in an independent sample of 730 individuals. We used bisulfite pyrosequencing to measure methylation and found significant association with regular alcohol use in the same direction as the MWAS (p = 0.021). Rs1382874 and rs1382875 were genotyped and found to be associated in the same direction as the GWAS (p = 0.008 and p = 0.009). After integrating the MWAS and GWAS findings from the replication sample, we replicated our combined analysis finding (p = 0.0017) in CNTN4. CONCLUSIONS: Through combining methylation and SNP data, we have identified CNTN4 as a risk factor for regular alcohol use.


Subject(s)
Alcohol Drinking/genetics , Contactins/genetics , DNA Methylation/genetics , Genome-Wide Association Study/methods , Adult , Aged , Female , High-Throughput Nucleotide Sequencing/methods , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide/genetics
7.
Hum Genet ; 134(1): 77-87, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25284466

ABSTRACT

Recent genome-wide association studies (GWAS) have made substantial progress in identifying disease loci. The next logical step is to design functional experiments to identify disease mechanisms. This step, however, is often hampered by the large size of loci identified in GWAS that is caused by linkage disequilibrium between SNPs. In this study, we demonstrate how integrating methylome-wide association study (MWAS) results with GWAS findings can narrow down the location for a subset of the putative casual sites. We use the disease schizophrenia as an example. To handle "data analytic" variation, we first combined our MWAS results with two GWAS meta-analyses (N = 32,143 and 21,953), that had largely overlapping samples but different data analysis pipelines, separately. Permutation tests showed significant overlapping association signals between GWAS and MWAS findings. This significant overlap justified prioritizing loci based on the concordance principle. To further ensure that the methylation signal was not driven by chance, we successfully replicated the top three methylation findings near genes SDCCAG8, CREB1 and ATXN7 in an independent sample using targeted pyrosequencing. In contrast to the SNPs in the selected region, the methylation sites were largely uncorrelated explaining why the methylation signals implicated much smaller regions (median size 78 bp). The refined loci showed considerable enrichment of genomic elements of possible functional importance and suggested specific hypotheses about schizophrenia etiology. Several hypotheses involved possible variation in transcription factor-binding efficiencies.


Subject(s)
Biomarkers/analysis , DNA Methylation , Epigenomics , Genome-Wide Association Study , Polymorphism, Single Nucleotide/genetics , Schizophrenia/genetics , Case-Control Studies , Computational Biology , Databases, Genetic , Follow-Up Studies , Humans , Linkage Disequilibrium , Meta-Analysis as Topic
8.
JAMA Psychiatry ; 71(3): 255-64, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24402055

ABSTRACT

IMPORTANCE: Epigenetic studies present unique opportunities to advance schizophrenia research because they can potentially account for many of its clinical features and suggest novel strategies to improve disease management. OBJECTIVE: To identify schizophrenia DNA methylation biomarkers in blood. DESIGN, SETTING, AND PARTICIPANTS: The sample consisted of 759 schizophrenia cases and 738 controls (N = 1497) collected in Sweden. We used methyl-CpG-binding domain protein-enriched genome sequencing of the methylated genomic fraction, followed by next-generation DNA sequencing. We obtained a mean (SD) number of 68 (26.8) million reads per sample. This massive data set was processed using a specifically designed data analysis pipeline. Critical top findings from our methylome-wide association study (MWAS) were replicated in independent case-control participants using targeted pyrosequencing of bisulfite-converted DNA. MAIN OUTCOMES AND MEASURES: Status of schizophrenia cases and controls. RESULTS: Our MWAS suggested a considerable number of effects, with 25 sites passing the highly conservative Bonferroni correction and 139 sites significant at a false discovery rate of 0.01. Our top MWAS finding, which was located in FAM63B, replicated with P = 2.3 × 10-10. It was part of the networks regulated by microRNA that can be linked to neuronal differentiation and dopaminergic gene expression. Many other top MWAS results could be linked to hypoxia and, to a lesser extent, infection, suggesting that a record of pathogenic events may be preserved in the methylome. Our findings also implicated a site in RELN, one of the most frequently studied candidates in methylation studies of schizophrenia. CONCLUSIONS AND RELEVANCE: To our knowledge, the present study is one of the first MWASs of disease with a large sample size using a technology that provides good coverage of methylation sites across the genome. Our results demonstrated one of the unique features of methylation studies that can capture signatures of environmental insults in peripheral tissues. Our MWAS suggested testable hypotheses about disease mechanisms and yielded biomarkers that can potentially be used to improve disease management.


Subject(s)
DNA Methylation , Genome-Wide Association Study/methods , Registries , Schizophrenia/etiology , Biomarkers/blood , Epigenomics/methods , Genome-Wide Association Study/instrumentation , Humans , Reelin Protein , Schizophrenia/genetics , Sequence Analysis, DNA , Sweden
9.
Hum Mol Genet ; 23(5): 1175-85, 2014 Mar 01.
Article in English | MEDLINE | ID: mdl-24135035

ABSTRACT

The central importance of epigenetics to the aging process is increasingly being recognized. Here we perform a methylome-wide association study (MWAS) of aging in whole blood DNA from 718 individuals, aged 25-92 years (mean = 55). We sequenced the methyl-CpG-enriched genomic DNA fraction, averaging 67.3 million reads per subject, to obtain methylation measurements for the ∼27 million autosomal CpGs in the human genome. Following extensive quality control, we adaptively combined methylation measures for neighboring, highly-correlated CpGs into 4 344 016 CpG blocks with which we performed association testing. Eleven age-associated differentially methylated regions (DMRs) passed Bonferroni correction (P-value < 1.15 × 10(-8)). Top findings replicated in an independent sample set of 558 subjects using pyrosequencing of bisulfite-converted DNA (min P-value < 10(-30)). To examine biological themes, we selected 70 DMRs with false discovery rate of <0.1. Of these, 42 showed hypomethylation and 28 showed hypermethylation with age. Hypermethylated DMRs were more likely to overlap with CpG islands and shores. Hypomethylated DMRs were more likely to be in regions associated with polycomb/regulatory proteins (e.g. EZH2) or histone modifications H3K27ac, H3K4m1, H3K4m2, H3K4m3 and H3K9ac. Among genes implicated by the top DMRs were protocadherins, homeobox genes, MAPKs and ryanodine receptors. Several of our DMRs are at genes with potential relevance for age-related disease. This study successfully demonstrates the application of next-generation sequencing to MWAS, by interrogating a large proportion of the methylome and returning potentially novel age DMRs, in addition to replicating several loci implicated in previous studies using microarrays.


Subject(s)
Aging/genetics , CpG Islands , DNA Methylation , Epigenomics , Adult , Aged , Aged, 80 and over , Computational Biology , DNA/genetics , DNA/metabolism , DNA-Binding Proteins/metabolism , Epigenesis, Genetic , Female , Gene Regulatory Networks , Genome-Wide Association Study , High-Throughput Nucleotide Sequencing , Humans , Male , Middle Aged , Protein Binding , Protein Interaction Maps , Sex Factors , Signal Transduction , Transcription Factors/metabolism
10.
Epigenomics ; 5(4): 367-77, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23895651

ABSTRACT

AIM: As the primary relevant tissue (brain) for psychiatric disorders is commonly not available, we aimed to investigate whether blood can be used as a proxy in methylation studies on the basis of two models. In the 'signature' model methylation-disease associations occur because a disease-causing factor affected methylation in the blood. In the 'mirror-site' model the methylation status in the blood is correlated with the corresponding disease-causing site in the brain. MATERIALS, METHODS & RESULTS: Methyl-binding domain enrichment and next-generation sequencing of the blood, cortex and hippocampus from four haloperidol-treated and ten untreated C57BL/6 mice revealed high levels of correlation in methylation across tissues. Despite the treatment inducing a large number of methylation changes, this correlation remains high. CONCLUSION: Our results show that, consistent with the signature model, factors that affect brain processes (i.e., haloperidol) leave biomarker signatures in the blood and, consistent with the mirror-site model, the methylation status of many sites in the blood mirror those in the brain.


Subject(s)
Biomarkers/blood , Brain/drug effects , DNA Methylation , DNA-Binding Proteins/blood , Models, Biological , Animals , Antipsychotic Agents/pharmacology , Brain/metabolism , Computational Biology , CpG Islands , DNA-Binding Proteins/genetics , Epigenesis, Genetic , Haloperidol/pharmacology , Male , Mental Disorders/blood , Mice , Mice, Inbred C57BL
11.
Epigenetics ; 8(5): 542-7, 2013 May.
Article in English | MEDLINE | ID: mdl-23644822

ABSTRACT

The potential importance of DNA methylation in the etiology of complex diseases has led to interest in the development of methylome-wide association studies (MWAS) aimed at interrogating all methylation sites in the human genome. When using blood as biomaterial for a MWAS the DNA is typically extracted directly from fresh or frozen whole blood that was collected via venous puncture. However, DNA extracted from dry blood spots may also be an alternative starting material. In the present study, we apply a methyl-CpG binding domain (MBD) protein enrichment-based technique in combination with next generation sequencing (MBD-seq) to assess the methylation status of the ~27 million CpGs in the human autosomal reference genome. We investigate eight methylomes using DNA from blood spots. This data are compared with 1,500 methylomes previously assayed with the same MBD-seq approach using DNA from whole blood. When investigating the sequence quality and the enrichment profile across biological features, we find that DNA extracted from blood spots gives comparable results with DNA extracted from whole blood. Only if the amount of starting material is ≤ 0.5µg DNA we observe a slight decrease in the assay performance. In conclusion, we show that high quality methylome-wide investigations using MBD-seq can be conducted in DNA extracted from archived dry blood spots without sacrificing quality and without bias in enrichment profile as long as the amount of starting material is sufficient. In general, the amount of DNA extracted from a single blood spot is sufficient for methylome-wide investigations with the MBD-seq approach.


Subject(s)
Blood Banks , DNA Methylation/genetics , DNA/genetics , Dried Blood Spot Testing , High-Throughput Nucleotide Sequencing/methods , CpG Islands/genetics , Humans
12.
BMC Bioinformatics ; 14: 74, 2013 Mar 02.
Article in English | MEDLINE | ID: mdl-23452721

ABSTRACT

BACKGROUND: In methylome-wide association studies (MWAS) there are many possible differences between cases and controls (e.g. related to life style, diet, and medication use) that may affect the methylome and produce false positive findings. An effective approach to control for these confounders is to first capture the major sources of variation in the methylation data and then regress out these components in the association analyses. This approach is, however, computationally very challenging due to the extremely large number of methylation sites in the human genome. RESULT: We introduce MethylPCA that is specifically designed to control for potential confounders in studies where the number of methylation sites is extremely large. MethylPCA offers a complete and flexible data analysis including 1) an adaptive method that performs data reduction prior to PCA by empirically combining methylation data of neighboring sites, 2) an efficient algorithm that performs a principal component analysis (PCA) on the ultra high-dimensional data matrix, and 3) association tests. To accomplish this MethylPCA allows for parallel execution of tasks, uses C++ for CPU and I/O intensive calculations, and stores intermediate results to avoid computing the same statistics multiple times or keeping results in memory. Through simulations and an analysis of a real whole methylome MBD-seq study of 1,500 subjects we show that MethylPCA effectively controls for potential confounders. CONCLUSIONS: MethylPCA provides users a convenient tool to perform MWAS. The software effectively handles the challenge in memory and speed to perform tasks that would be impossible to accomplish using existing software when millions of sites are interrogated with the sample sizes required for MWAS.


Subject(s)
DNA Methylation , Software , Algorithms , Genetic Association Studies , Genome, Human , Humans , Principal Component Analysis , Sequence Analysis, DNA
13.
BMC Bioinformatics ; 14: 50, 2013 Feb 12.
Article in English | MEDLINE | ID: mdl-23398781

ABSTRACT

BACKGROUND: Methylation studies are a promising complement to genetic studies of DNA sequence. However, detailed prior biological knowledge is typically lacking, so methylome-wide association studies (MWAS) will be critical to detect disease relevant sites. A cost-effective approach involves the next-generation sequencing (NGS) of single-end libraries created from samples that are enriched for methylated DNA fragments. A limitation of single-end libraries is that the fragment size distribution is not observed. This hampers several aspects of the data analysis such as the calculation of enrichment measures that are based on the number of fragments covering the CpGs. RESULTS: We developed a non-parametric method that uses isolated CpGs to estimate sample-specific fragment size distributions from the empirical sequencing data. Through simulations we show that our method is highly accurate. While the traditional (extended) read count methods resulted in severely biased coverage estimates and introduces artificial inter-individual differences, through the use of the estimated fragment size distributions we could remove these biases almost entirely. Furthermore, we found correlations of 0.999 between coverage estimates obtained using fragment size distributions that were estimated with our method versus those that were "observed" in paired-end sequencing data. CONCLUSIONS: We propose a non-parametric method for estimating fragment size distributions that is highly precise and can improve the analysis of cost-effective MWAS studies that sequence single-end libraries created from samples that are enriched for methylated DNA fragments.


Subject(s)
CpG Islands , DNA Methylation , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Animals , Male , Mice , Mice, Inbred C57BL
14.
Epigenomics ; 4(6): 605-21, 2012 Dec.
Article in English | MEDLINE | ID: mdl-23244307

ABSTRACT

AIM: We studied the use of methyl-CpG binding domain (MBD) protein-enriched genome sequencing (MBD-seq) as a cost-effective screening tool for methylome-wide association studies (MWAS). MATERIALS & METHODS: Because MBD-seq has not yet been applied on a large scale, we first developed and tested a pipeline for data processing using 1500 schizophrenia cases and controls plus 75 technical replicates with an average of 68 million reads per sample. This involved the use of technical replicates to optimize quality control for multi- and duplicate-reads, an in silico experiment to identify CpGs in loci with alignment problems, CpG coverage calculations based on multiparametric estimates of the fragment size distribution, a two-stage adaptive algorithm to combine data from correlated adjacent CpG sites, principal component analyses to control for confounders and new software tailored to handle the large data set. RESULTS: We replicated MWAS findings in independent samples using a different technology that provided single base resolution. In an MWAS of age-related methylation changes, one of our top findings was a previously reported robust association involving GRIA2. Our results also suggested that owing to the many confounding effects, a considerable challenge in MWAS is to identify those effects that are informative about disease processes. CONCLUSION: This study showed the potential of MBD-seq as a cost-effective tool in large-scale disease studies.


Subject(s)
DNA Methylation , Genetic Association Studies , Genome, Human , Schizophrenia/genetics , Case-Control Studies , CpG Islands , DNA-Binding Proteins/metabolism , High-Throughput Nucleotide Sequencing , Humans , Receptors, AMPA/genetics , Sequence Analysis, DNA
15.
Eur J Hum Genet ; 20(9): 953-5, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22378283

ABSTRACT

DNA from Epstein-Barr virus-transformed lymphocyte cell lines (LCLs) has proven useful for studies of genetic sequence polymorphisms. Whether LCL DNA is suitable for methylation studies is less clear. We conduct a genome-wide methylation investigation using an array set with 45 million probes to investigate the methylome of LCL DNA and technical duplicates of WB DNA from the same 10 individuals. We focus specifically on methylation sites that show variation between individuals and, therefore, are potentially useful as biomarkers. The sample correlations for the methylation variable probes ranged from 0.69 to 0.78 for the WB duplicates and from 0.27 to 0.72 for WB vs LCL. To compare the pattern of the methylation signals, we grouped adjacent probes based on their inter-correlations. These analyses showed ∼29 000 and ∼14 000 blocks in WB and LCL, respectively. Merely 31% of the methylated regions detected in WB were detectable in LCLs. Furthermore, we observed significant differences in mean difference between WB and LCL as compared with duplicates of WB (P-value =2.2 × 10(-16)). Our study shows that there are substantial differences in the DNA methylation patterns between LCL and WB. Thus, LCL DNA should not be used as a proxy for WB DNA in methylome-wide studies.


Subject(s)
DNA Methylation , DNA/genetics , Lymphocytes/metabolism , Adult , Aged , Aged, 80 and over , Biomarkers/metabolism , Cell Line, Transformed , DNA/blood , DNA/isolation & purification , DNA Probes , Female , Genetic Loci , Herpesvirus 4, Human/genetics , Humans , Lymphocytes/virology , Male , Middle Aged , Organ Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...