Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 18 de 18
Filter
1.
Bioinformatics ; 35(20): 3906-3912, 2019 10 15.
Article in English | MEDLINE | ID: mdl-30903145

ABSTRACT

MOTIVATION: Non-coding rare variants (RVs) may contribute to Mendelian disorders but have been challenging to study due to small sample sizes, genetic heterogeneity and uncertainty about relevant non-coding features. Previous studies identified RVs associated with expression outliers, but varying outlier definitions were employed and no comprehensive open-source software was developed. RESULTS: We developed Outlier-RV Enrichment (ORE) to identify biologically-meaningful non-coding RVs. We implemented ORE combining whole-genome sequencing and cardiac RNAseq from congenital heart defect patients from the Pediatric Cardiac Genomics Consortium and deceased adults from Genotype-Tissue Expression. Use of rank-based outliers maximized sensitivity while a most extreme outlier approach maximized specificity. Rarer variants had stronger associations, suggesting they are under negative selective pressure and providing a basis for investigating their contribution to Mendelian disorders. AVAILABILITY AND IMPLEMENTATION: ORE, source code, and documentation are available at https://pypi.python.org/pypi/ore under the MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genomics , Software , Child , Documentation , Humans , Uncertainty , Whole Genome Sequencing
2.
Nat Commun ; 8(1): 1943, 2017 12 05.
Article in English | MEDLINE | ID: mdl-29203772

ABSTRACT

Mechanisms driving acute food allergic reactions have not been fully characterized. We profile the dynamic transcriptome of acute peanut allergic reactions using serial peripheral blood samples obtained from 19 children before, during, and after randomized, double-blind, placebo-controlled oral challenges to peanut. We identify genes with changes in expression triggered by peanut, but not placebo, during acute peanut allergic reactions. Network analysis reveals that these genes comprise coexpression networks for acute-phase response and pro-inflammatory processes. Key driver analysis identifies six genes (LTB4R, PADI4, IL1R2, PPP1R3D, KLHL2, and ECHDC3) predicted to causally modulate the state of coregulated networks in response to peanut. Leukocyte deconvolution analysis identifies changes in neutrophil, naive CD4+ T cell, and macrophage populations during peanut challenge. Analyses in 21 additional peanut allergic subjects replicate major findings. These results highlight key genes, biological processes, and cell types that can be targeted for mechanistic study and therapeutic targeting of peanut allergy.


Subject(s)
Acute-Phase Reaction/genetics , Peanut Hypersensitivity/genetics , RNA, Messenger/metabolism , Acute-Phase Reaction/immunology , Adolescent , CD4-Positive T-Lymphocytes/immunology , Child , Double-Blind Method , Enoyl-CoA Hydratase/genetics , Female , Gene Expression Profiling , Gene Regulatory Networks , Humans , Inflammation/genetics , Inflammation/immunology , Macrophages/immunology , Male , Microfilament Proteins/genetics , Nerve Tissue Proteins/genetics , Neutrophils/immunology , Peanut Hypersensitivity/immunology , Protein Phosphatase 1/genetics , Protein-Arginine Deiminase Type 4 , Protein-Arginine Deiminases/genetics , Random Allocation , Receptors, Interleukin-1 Type II/genetics , Receptors, Leukotriene B4/genetics , Reproducibility of Results
3.
Transl Psychiatry ; 6(8): e880, 2016 08 30.
Article in English | MEDLINE | ID: mdl-27576169

ABSTRACT

Meditation is becoming increasingly practiced, especially for stress-related medical conditions. Meditation may improve cellular health; however, studies have not separated out effects of meditation from vacation-like effects in a residential randomized controlled trial. We recruited healthy women non-meditators to live at a resort for 6 days and randomized to either meditation retreat or relaxing on-site, with both groups compared with 'regular meditators' already enrolled in the retreat. Blood drawn at baseline and post intervention was assessed for transcriptome-wide expression patterns and aging-related biomarkers. Highly significant gene expression changes were detected across all groups (the 'vacation effect') that could accurately predict (96% accuracy) between baseline and post-intervention states and were characterized by improved regulation of stress response, immune function and amyloid beta (Aß) metabolism. Although a smaller set of genes was affected, regular meditators showed post-intervention differences in a gene network characterized by lower regulation of protein synthesis and viral genome activity. Changes in well-being were assessed post intervention relative to baseline, as well as 1 and 10 months later. All groups showed equivalently large immediate post-intervention improvements in well-being, but novice meditators showed greater maintenance of lower distress over time compared with those in the vacation arm. Regular meditators showed a trend toward increased telomerase activity compared with randomized women, who showed increased plasma Aß42/Aß40 ratios and tumor necrosis factor alpha (TNF-α) levels. This highly controlled residential study showed large salutary changes in gene expression networks due to the vacation effect, common to all groups. For those already trained in the practice of meditation, a retreat appears to provide additional benefits to cellular health beyond the vacation effect.


Subject(s)
Aging/metabolism , Amyloid beta-Peptides/metabolism , Immunity , Meditation/methods , Mental Health , Stress, Psychological/therapy , Tumor Necrosis Factor-alpha/immunology , Adult , Aging/immunology , Female , Gene Expression Profiling , Gene Regulatory Networks , Holidays , Humans , Middle Aged , Peptide Fragments/metabolism , Phenotype , Stress, Physiological , Stress, Psychological/immunology , Stress, Psychological/metabolism
5.
Mol Psychiatry ; 21(8): 1099-111, 2016 08.
Article in English | MEDLINE | ID: mdl-26552589

ABSTRACT

Identification and characterization of molecular mechanisms that connect genetic risk factors to initiation and evolution of disease pathophysiology represent major goals and opportunities for improving therapeutic and diagnostic outcomes in Alzheimer's disease (AD). Integrative genomic analysis of the human AD brain transcriptome holds potential for revealing novel mechanisms of dysfunction that underlie the onset and/or progression of the disease. We performed an integrative genomic analysis of brain tissue-derived transcriptomes measured from two lines of mice expressing distinct mutant AD-related proteins. The first line expresses oligomerogenic mutant APP(E693Q) inside neurons, leading to the accumulation of amyloid beta (Aß) oligomers and behavioral impairment, but never develops parenchymal fibrillar amyloid deposits. The second line expresses APP(KM670/671NL)/PSEN1(Δexon9) in neurons and accumulates fibrillar Aß amyloid and amyloid plaques accompanied by neuritic dystrophy and behavioral impairment. We performed RNA sequencing analyses of the dentate gyrus and entorhinal cortex from each line and from wild-type mice. We then performed an integrative genomic analysis to identify dysregulated molecules and pathways, comparing transgenic mice with wild-type controls as well as to each other. We also compared these results with datasets derived from human AD brain. Differential gene and exon expression analysis revealed pervasive alterations in APP/Aß metabolism, epigenetic control of neurogenesis, cytoskeletal organization and extracellular matrix (ECM) regulation. Comparative molecular analysis converged on FMR1 (Fragile X Mental Retardation 1), an important negative regulator of APP translation and oligomerogenesis in the post-synaptic space. Integration of these transcriptomic results with human postmortem AD gene networks, differential expression and differential splicing signatures identified significant similarities in pathway dysregulation, including ECM regulation and neurogenesis, as well as strong overlap with AD-associated co-expression network structures. The strong overlap in molecular systems features supports the relevance of these findings from the AD mouse models to human AD.


Subject(s)
Amyloid beta-Protein Precursor/genetics , Amyloid beta-Protein Precursor/metabolism , Presenilin-1/genetics , Alzheimer Disease/genetics , Amyloid beta-Peptides/metabolism , Animals , Brain/metabolism , Disease Models, Animal , Fibrillar Collagens , Fragile X Mental Retardation Protein/metabolism , Humans , Mice , Mice, Transgenic , Mutation , Neurogenesis , Neurons/metabolism , Plaque, Amyloid/pathology , Risk Factors , Transcriptome/genetics
6.
Transl Psychiatry ; 5: e679, 2015 Nov 17.
Article in English | MEDLINE | ID: mdl-26575220

ABSTRACT

Regulators of the histone H3-trimethyl lysine-4 (H3K4me3) mark are significantly associated with the genetic risk architecture of common neurodevelopmental disease, including schizophrenia and autism. Typical H3K4me3 is primarily localized in the form of sharp peaks, extending in neuronal chromatin on average only across 500-1500 base pairs mostly in close proximity to annotated transcription start sites. Here, through integrative computational analysis of epigenomic and transcriptomic data based on next-generation sequencing, we investigated H3K4me3 landscapes of sorted neuronal and non-neuronal nuclei in human postmortem, non-human primate and mouse prefrontal cortex (PFC), and blood. To explore whether H3K4me3 peak signals could also extend across much broader domains, we examined broadest domain cell-type-specific H3K4me3 peaks in an unbiased manner with an innovative approach on 41+12 ChIP-seq and RNA-seq data sets. In PFC neurons, broadest H3K4me3 distribution ranged from 3.9 to 12 kb, with extremely broad peaks (~10 kb or broader) related to synaptic function and GABAergic signaling (DLX1, ELFN1, GAD1, IGSF9B and LINC00966). Broadest neuronal peaks showed distinct motif signatures and were centrally positioned in prefrontal gene-regulatory Bayesian networks and sensitive to defective neurodevelopment. Approximately 120 of the broadest H3K4me3 peaks in human PFC neurons, including many genes related to glutamatergic and dopaminergic signaling, were fully conserved in chimpanzee, macaque and mouse cortical neurons. Exploration of spread and breadth of lysine methylation markings could provide novel insights into epigenetic mechanism involved in neuropsychiatric disease and neuronal genome evolution.


Subject(s)
Brain/metabolism , Epigenesis, Genetic/genetics , Gene Regulatory Networks/genetics , Histones/genetics , Histones/metabolism , Adult , Animals , Female , Humans , Macaca , Male , Mice , Pan troglodytes
7.
Diabetologia ; 55(8): 2205-13, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22584726

ABSTRACT

AIMS/HYPOTHESIS: While genome-wide association studies (GWASs) have been successful in identifying novel variants associated with various diseases, it has been much more difficult to determine the biological mechanisms underlying these associations. Expression quantitative trait loci (eQTL) provide another dimension to these data by associating single nucleotide polymorphisms (SNPs) with gene expression. We hypothesised that integrating SNPs known to be associated with type 2 diabetes with eQTLs and coexpression networks would enable the discovery of novel candidate genes for type 2 diabetes. METHODS: We selected 32 SNPs associated with type 2 diabetes in two or more independent GWASs. We used previously described eQTLs mapped from genotype and gene expression data collected from 1,008 morbidly obese patients to find genes with expression associated with these SNPs. We linked these genes to coexpression modules, and ranked the other genes in these modules using an inverse sum score. RESULTS: We found 62 genes with expression associated with type 2 diabetes SNPs. We validated our method by linking highly ranked genes in the coexpression modules back to SNPs through a combined eQTL dataset. We showed that the eQTLs highlighted by this method are significantly enriched for association with type 2 diabetes in data from the Wellcome Trust Case Control Consortium (WTCCC, p = 0.026) and the Gene Environment Association Studies (GENEVA, p = 0.042), validating our approach. Many of the highly ranked genes are also involved in the regulation or metabolism of insulin, glucose or lipids. CONCLUSIONS/INTERPRETATION: We have devised a novel method, involving the integration of datasets of different modalities, to discover novel candidate genes for type 2 diabetes.


Subject(s)
Diabetes Mellitus, Type 2/genetics , Obesity, Morbid/genetics , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Animals , Case-Control Studies , Gene Expression Profiling , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Mice , Mice, Inbred C57BL , Reproducibility of Results
8.
Genes Immun ; 12(6): 428-33, 2011 Sep.
Article in English | MEDLINE | ID: mdl-21346778

ABSTRACT

A genome-wide association study identified single nucleotide polymorphisms (SNPs) rs3077 and rs9277535 located in the 3' untranslated regions of human leukocyte antigen (HLA) class II genes HLA-DPA1 and HLA-DPB1, respectively, as the independent variants most strongly associated with chronic hepatitis B. We examined whether these SNPs are associated with mRNA expression of HLA-DPA1 and HLA-DPB1. We identified gene expression-associated SNPs (eSNPs) in normal liver samples obtained from 651 individuals of European ancestry by integrating genotype (~650 000 SNPs) and gene expression (>39 000 transcripts) data from each sample. We used the Kruskal-Wallis test to determine associations between gene expression and genotype. To confirm findings, we measured allelic expression imbalance (AEI) of complementary DNA compared with DNA in liver specimens from subjects who were heterozygous for rs3077 and rs9277535. On a genome-wide basis, rs3077 was the SNP most strongly associated with HLA-DPA1 expression (p=10(-48)), and rs9277535 was strongly associated with HLA-DPB1 expression (p=10(-15)). Consistent with these gene expression associations, we observed AEI for both rs3077 (p=3.0 × 10(-7); 17 samples) and rs9277535 (p=0.001; 17 samples). We conclude that the variants previously associated with chronic hepatitis B are also strongly associated with mRNA expression of HLA-DPA1 and HLA-DPB1, suggesting that expression of these genes is important in control of HBV.


Subject(s)
Genetic Predisposition to Disease , HLA-DP alpha-Chains/genetics , HLA-DP beta-Chains/genetics , Hepatitis B, Chronic/genetics , Hepatitis B, Chronic/immunology , Liver/immunology , RNA, Messenger/biosynthesis , 3' Untranslated Regions , Alleles , Gene Expression , Genome-Wide Association Study , Genotype , HLA-DP alpha-Chains/immunology , HLA-DP beta-Chains/immunology , Hepatitis B virus/immunology , Humans , Polymorphism, Single Nucleotide
10.
Anim Genet ; 37 Suppl 1: 18-23, 2006 Aug.
Article in English | MEDLINE | ID: mdl-16886998

ABSTRACT

Forward genetics is a common approach to dissecting complex traits like common human diseases. The ultimate aim of this approach was the identification of genes that are causal for disease or other phenotypes of interest. However, the forward genetics approach is by definition restricted to the identification of genes that have incurred mutations over the course of evolution or that incurred mutations as a result of chemical mutagenesis, and that as a result lead to disease or to variations in other phenotypes of interest. Genes that harbour no such mutations, but that play key roles in parts of the biological network that lead to disease, are systematically missed by this class of approaches. Recently, a class of novel integrative genomics approaches has been devised to elucidate the complexity of common human diseases by intersecting genotypic, molecular profiling, and clinical data in segregating populations. These novel approaches take a more holistic view of biological systems and leverage the vast network of gene-gene interactions, in combination with DNA variation data, to establish causal relationships among molecular profiling traits and between molecular profiling and disease (or other classic phenotypes). A number of novel genes for disease phenotypes have been identified as a result of these approaches, highlighting the utility of integrating orthogonal sources of data to get at the underlying causes of disease.


Subject(s)
Genes , Genomics/methods , Animals , Chromosome Mapping , Genetic Predisposition to Disease/genetics , Humans , Mice , Models, Genetic , Quantitative Trait Loci
11.
Am J Hum Genet ; 75(6): 1094-105, 2004 Dec.
Article in English | MEDLINE | ID: mdl-15514893

ABSTRACT

Combining genetic inheritance information, for both molecular profiles and complex traits, is a promising strategy not only for detecting quantitative trait loci (QTLs) for complex traits but for understanding which genes, pathways, and biological processes are also under the influence of a given QTL. As a primary step in determining the feasibility of such an approach in humans, we present the largest survey to date, to our knowledge, of the heritability of gene-expression traits in segregating human populations. In particular, we measured expression for 23,499 genes in lymphoblastoid cell lines for members of 15 Centre d'Etude du Polymorphisme Humain (CEPH) families. Of the total set of genes, 2,340 were found to be expressed, of which 31% had significant heritability when a false-discovery rate of 0.05 was used. QTLs were detected for 33 genes on the basis of at least one P value <.000005. Of these, 13 genes possessed a QTL within 5 Mb of their physical location. Hierarchical clustering was performed on the basis of both Pearson correlation of gene expression and genetic correlation. Both reflected biologically relevant activity taking place in the lymphoblastoid cell lines, with greater coherency represented in Kyoto Encyclopedia of Genes and Genomes database (KEGG) pathways than in Gene Ontology database pathways. However, more pathway coherence was observed in KEGG pathways when clustering was based on genetic correlation than when clustering was based on Pearson correlation. As more expression data in segregating populations are generated, viewing clusters or networks based on genetic correlation measures and shared QTLs will offer potentially novel insights into the relationship among genes that may underlie complex traits.


Subject(s)
Gene Expression Profiling , Genetic Linkage , Lymphocytes/metabolism , Quantitative Trait Loci , Cell Line , Cluster Analysis , Databases, Genetic , Family , Humans , Models, Statistical , Oligonucleotide Array Sequence Analysis , Reverse Transcriptase Polymerase Chain Reaction
12.
Cytogenet Genome Res ; 105(2-4): 363-74, 2004.
Article in English | MEDLINE | ID: mdl-15237224

ABSTRACT

The reconstruction of genetic networks in mammalian systems is one of the primary goals in biological research, especially as such reconstructions relate to elucidating not only common, polygenic human diseases, but living systems more generally. Here we propose a novel gene network reconstruction algorithm, derived from classic Bayesian network methods, that utilizes naturally occurring genetic variations as a source of perturbations to elucidate the network. This algorithm incorporates relative transcript abundance and genotypic data from segregating populations by employing a generalized scoring function of maximum likelihood commonly used in Bayesian network reconstruction problems. The utility of this novel algorithm is demonstrated via application to liver gene expression data from a segregating mouse population. We demonstrate that the network derived from these data using our novel network reconstruction algorithm is able to capture causal associations between genes that result in increased predictive power, compared to more classically reconstructed networks derived from the same data.


Subject(s)
Algorithms , Genetics, Population , Genomics , 11-beta-Hydroxysteroid Dehydrogenases/genetics , Animals , Bayes Theorem , Cluster Analysis , Female , Gene Expression Profiling , Linkage Disequilibrium , Mice , Mice, Inbred C57BL , Mice, Inbred DBA , Models, Genetic , Quantitative Trait Loci
13.
Biochem Soc Trans ; 31(2): 437-43, 2003 Apr.
Article in English | MEDLINE | ID: mdl-12653656

ABSTRACT

Application of statistical genetics approaches to variations in mRNA transcript abundances in segregating populations can be used to identify genes and pathways associated with common human diseases. The combination of this genetic information with gene expression and clinical trait data can also be used to identify subtypes of a disease and the genetic loci specific to each subtype. Here we highlight results from some of our recent work in this area and further explore the many possibilities that exist in employing a more comprehensive genetics and functional genomics approach to the functional annotation of genomes, and in applying such methods to the validation of targets for complex traits in the drug discovery process.


Subject(s)
Drug Design , Animals , Gene Expression Regulation , Genetic Linkage , Genomics/methods , Humans , Synteny
14.
Nature ; 409(6822): 922-7, 2001 Feb 15.
Article in English | MEDLINE | ID: mdl-11237012

ABSTRACT

The most important product of the sequencing of a genome is a complete, accurate catalogue of genes and their products, primarily messenger RNA transcripts and their cognate proteins. Such a catalogue cannot be constructed by computational annotation alone; it requires experimental validation on a genome scale. Using 'exon' and 'tiling' arrays fabricated by ink-jet oligonucleotide synthesis, we devised an experimental approach to validate and refine computational gene predictions and define full-length transcripts on the basis of co-regulated expression of their exons. These methods can provide more accurate gene numbers and allow the detection of mRNA splice variants and identification of the tissue- and disease-specific conditions under which genes are expressed. We apply our technique to chromosome 22q under 69 experimental condition pairs, and to the entire human genome under two experimental conditions. We discuss implications for more comprehensive, consistent and reliable genome annotation, more efficient, full-length complementary DNA cloning strategies and application to complex diseases.


Subject(s)
Chromosomes, Human, Pair 22 , Computational Biology , Genome, Human , Oligonucleotide Array Sequence Analysis , Algorithms , Alternative Splicing , Cell Line , DNA, Complementary , Exons , Human Genome Project , Humans , Oligonucleotide Probes
15.
J Cell Biochem Suppl ; Suppl 37: 120-5, 2001.
Article in English | MEDLINE | ID: mdl-11842437

ABSTRACT

Algorithms for performing feature extraction and normalization on high-density oligonucleotide gene expression arrays, have not been fully explored, and the impact these algorithms have on the downstream analysis is not well understood. Advances in such low-level analysis methods are essential to increase the sensitivity and specificity of detecting whether genes are present and/or differentially expressed. We have developed and implemented a number of algorithms for the analysis of expression array data in a software application, the DNA-Chip Analyzer (dChip). In this report, we describe the algorithms for feature extraction and normalization, and present validation data and comparison results with some of the algorithms currently in use.


Subject(s)
Algorithms , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , Reproducibility of Results , Software
16.
J Cell Biochem ; 80(2): 192-202, 2000 Oct 20.
Article in English | MEDLINE | ID: mdl-11074587

ABSTRACT

We have developed methods and identified problems associated with the analysis of data generated by high-density, oligonuceotide gene expression arrays. Our methods are aimed at accounting for many of the sources of variation that make it difficult, at times, to realize consistent results. We present here descriptions of some of these methods and how they impact the analysis of oligonucleotide gene expression array data. We will discuss the process of recognizing the "spots" (or features) on the Affymetrix GeneChip(R) probe arrays, correcting for background and intensity gradients in the resulting images, scaling/normalizing an array to allow array-to-array comparisons, monitoring probe performance with respect to hybridization efficiency, and assessing whether a gene is present or differentially expressed. Examples from the analyses of gene expression validation data are presented to contrast the different methods applied to these types of data.


Subject(s)
Gene Expression Profiling , Oligonucleotide Array Sequence Analysis , Base Sequence , DNA , Expressed Sequence Tags , Molecular Sequence Data
17.
J Cell Biochem ; 80(2): 212-5, 2000 Oct 20.
Article in English | MEDLINE | ID: mdl-11074591
18.
Genome Res ; 8(3): 222-33, 1998 Mar.
Article in English | MEDLINE | ID: mdl-9521926

ABSTRACT

We have developed a generalization of Kimura's Markov chain model for base substitution at a single nucleotide site. This generalized model incorporates more flexible transition rates and consequently allows irreversible as well as reversible chains. Because the model embodies just the right amount of symmetry, it permits explicit calculation of finite-time transition probabilities and equilibrium distributions. The model also meshes well with maximum likelihood methods for phylogenetic analysis. Quick calculation of likelihoods and their derivatives can be carried out by adapting Baum's forward and backward algorithms from the theory of hidden Markov chains. Analysis of HIV sequence data illustrates the speed of the algorithms on trees with many contemporary taxa. Analysis of some of Lake's data on the origin of the eukaryotic nucleus contrasts the reversible and irreversible versions of the model.


Subject(s)
Computational Biology/statistics & numerical data , Evolution, Molecular , Models, Statistical , Phylogeny , Algorithms , Animals , Computer Simulation , Humans , Likelihood Functions , Markov Chains , Software
SELECTION OF CITATIONS
SEARCH DETAIL
...