Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Annu Rev Biomed Data Sci ; 5: 119-139, 2022 08 10.
Article in English | MEDLINE | ID: mdl-35483347

ABSTRACT

Thousands of common genetic variants in the human population have been associated with disease risk and phenotypic variation by genome-wide association studies (GWAS). However, the majority of GWAS variants fall into noncoding regions of the genome, complicating our understanding of their regulatory functions, and few molecular mechanisms of GWAS variant effects have been clearly elucidated. Here, we set out to review genetic variant effects, focusing on expression quantitative trait loci (eQTLs), including their utility in interpreting GWAS variant mechanisms. We discuss the interrelated challenges and opportunities for eQTL analysis, covering determining causal variants, elucidating molecular mechanisms of action, and understanding context variability. Addressing these questions can enable better functional characterization of disease-associated loci and provide insights into fundamental biological questions of the noncoding genetic regulatory code and its control of gene expression.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Humans , Polymorphism, Single Nucleotide/genetics , Quantitative Trait Loci/genetics
2.
PLoS Genet ; 18(1): e1009719, 2022 01.
Article in English | MEDLINE | ID: mdl-35100260

ABSTRACT

Tens of thousands of genetic variants associated with gene expression (cis-eQTLs) have been discovered in the human population. These eQTLs are active in various tissues and contexts, but the molecular mechanisms of eQTL variability are poorly understood, hindering our understanding of genetic regulation across biological contexts. Since many eQTLs are believed to act by altering transcription factor (TF) binding affinity, we hypothesized that analyzing eQTL effect size as a function of TF level may allow discovery of mechanisms of eQTL variability. Using GTEx Consortium eQTL data from 49 tissues, we analyzed the interaction between eQTL effect size and TF level across tissues and across individuals within specific tissues and generated a list of 10,098 TF-eQTL interactions across 2,136 genes that are supported by at least two lines of evidence. These TF-eQTLs were enriched for various TF binding measures, supporting with orthogonal evidence that these eQTLs are regulated by the implicated TFs. We also found that our TF-eQTLs tend to overlap genes with gene-by-environment regulatory effects and to colocalize with GWAS loci, implying that our approach can help to elucidate mechanisms of context-specificity and trait associations. Finally, we highlight an interesting example of IKZF1 TF regulation of an APBB1IP gene eQTL that colocalizes with a GWAS signal for blood cell traits. Together, our findings provide candidate TF mechanisms for a large number of eQTLs and offer a generalizable approach for researchers to discover TF regulators of genetic variant effects in additional QTL datasets.


Subject(s)
Quantitative Trait Loci , Transcription Factors/physiology , Alleles , Binding Sites , Gene Knockdown Techniques , Gene-Environment Interaction , Genome-Wide Association Study , Humans , Interferon Regulatory Factor-1/genetics , Models, Genetic , Phenotype , Transcription Factors/metabolism
3.
Mol Cell ; 74(6): 1189-1204.e6, 2019 06 20.
Article in English | MEDLINE | ID: mdl-31226278

ABSTRACT

RNA-binding proteins (RBPs) regulate post-transcriptional gene expression by recognizing short and degenerate sequence motifs in their target transcripts, but precisely defining their binding specificity remains challenging. Crosslinking and immunoprecipitation (CLIP) allows for mapping of the exact protein-RNA crosslink sites, which frequently reside at specific positions in RBP motifs at single-nucleotide resolution. Here, we have developed a computational method, named mCross, to jointly model RBP binding specificity while precisely registering the crosslinking position in motif sites. We applied mCross to 112 RBPs using ENCODE eCLIP data and validated the reliability of the discovered motifs by genome-wide analysis of allelic binding sites. Our analyses revealed that the prototypical SR protein SRSF1 recognizes clusters of GGA half-sites in addition to its canonical GGAGGA motif. Therefore, SRSF1 regulates splicing of a much larger repertoire of transcripts than previously appreciated, including HNRNPD and HNRNPDL, which are involved in multivalent protein assemblies and phase separation.


Subject(s)
Heterogeneous-Nuclear Ribonucleoprotein D/chemistry , Models, Molecular , RNA/chemistry , Serine-Arginine Splicing Factors/chemistry , Base Sequence , Binding Sites , Cross-Linking Reagents/chemistry , Gene Expression , HeLa Cells , Hep G2 Cells , Heterogeneous Nuclear Ribonucleoprotein D0 , Heterogeneous-Nuclear Ribonucleoprotein D/genetics , Heterogeneous-Nuclear Ribonucleoprotein D/metabolism , Humans , K562 Cells , Nucleic Acid Conformation , Protein Binding , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Interaction Domains and Motifs , RNA/genetics , RNA/metabolism , Sequence Alignment , Sequence Homology, Nucleic Acid , Serine-Arginine Splicing Factors/genetics , Serine-Arginine Splicing Factors/metabolism
4.
Front Med (Lausanne) ; 4: 62, 2017.
Article in English | MEDLINE | ID: mdl-28603714

ABSTRACT

Traditionally, the use of genomic information for personalized medical decisions relies on prior discovery and validation of genotype-phenotype associations. This approach constrains care for patients presenting with undescribed problems. The National Institutes of Health (NIH) Undiagnosed Diseases Program (UDP) hypothesized that defining disease as maladaptation to an ecological niche allows delineation of a logical framework to diagnose and evaluate such patients. Herein, we present the philosophical bases, methodologies, and processes implemented by the NIH UDP. The NIH UDP incorporated use of the Human Phenotype Ontology, developed a genomic alignment strategy cognizant of parental genotypes, pursued agnostic biochemical analyses, implemented functional validation, and established virtual villages of global experts. This systematic approach provided a foundation for the diagnostic or non-diagnostic answers provided to patients and serves as a paradigm for scalable translational research.

5.
Genet Med ; 18(6): 608-17, 2016 06.
Article in English | MEDLINE | ID: mdl-26562225

ABSTRACT

PURPOSE: Medical diagnosis and molecular or biochemical confirmation typically rely on the knowledge of the clinician. Although this is very difficult in extremely rare diseases, we hypothesized that the recording of patient phenotypes in Human Phenotype Ontology (HPO) terms and computationally ranking putative disease-associated sequence variants improves diagnosis, particularly for patients with atypical clinical profiles. METHODS: Using simulated exomes and the National Institutes of Health Undiagnosed Diseases Program (UDP) patient cohort and associated exome sequence, we tested our hypothesis using Exomiser. Exomiser ranks candidate variants based on patient phenotype similarity to (i) known disease-gene phenotypes, (ii) model organism phenotypes of candidate orthologs, and (iii) phenotypes of protein-protein association neighbors. RESULTS: Benchmarking showed Exomiser ranked the causal variant as the top hit in 97% of known disease-gene associations and ranked the correct seeded variant in up to 87% when detectable disease-gene associations were unavailable. Using UDP data, Exomiser ranked the causative variant(s) within the top 10 variants for 11 previously diagnosed variants and achieved a diagnosis for 4 of 23 cases undiagnosed by clinical evaluation. CONCLUSION: Structured phenotyping of patients and computational analysis are effective adjuncts for diagnosing patients with genetic disorders.Genet Med 18 6, 608-617.


Subject(s)
Exome Sequencing/methods , Exome/genetics , Rare Diseases/genetics , Rare Diseases/physiopathology , Animals , Computational Biology , Databases, Genetic , Disease Models, Animal , Genetic Association Studies , Genetic Variation , Humans , Mice , National Institutes of Health (U.S.) , Patients , Phenotype , Rare Diseases/diagnosis , Rare Diseases/epidemiology , United States , Zebrafish
SELECTION OF CITATIONS
SEARCH DETAIL
...