Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 29
Filter
1.
PLOS Digit Health ; 3(2): e0000447, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38335183

ABSTRACT

Distinguishing between alcohol-associated hepatitis (AH) and alcohol-associated cirrhosis (AC) remains a diagnostic challenge. In this study, we used machine learning with transcriptomics and proteomics data from liver tissue and peripheral mononuclear blood cells (PBMCs) to classify patients with alcohol-associated liver disease. The conditions in the study were AH, AC, and healthy controls. We processed 98 PBMC RNAseq samples, 55 PBMC proteomic samples, 48 liver RNAseq samples, and 53 liver proteomic samples. First, we built separate classification and feature selection pipelines for transcriptomics and proteomics data. The liver tissue models were validated in independent liver tissue datasets. Next, we built integrated gene and protein expression models that allowed us to identify combined gene-protein biomarker panels. For liver tissue, we attained 90% nested-cross validation accuracy in our dataset and 82% accuracy in the independent validation dataset using transcriptomic data. We attained 100% nested-cross validation accuracy in our dataset and 61% accuracy in the independent validation dataset using proteomic data. For PBMCs, we attained 83% and 89% accuracy with transcriptomic and proteomic data, respectively. The integration of the two data types resulted in improved classification accuracy for PBMCs, but not liver tissue. We also identified the following gene-protein matches within the gene-protein biomarker panels: CLEC4M-CLC4M, GSTA1-GSTA2 for liver tissue and SELENBP1-SBP1 for PBMCs. In this study, machine learning models had high classification accuracy for both transcriptomics and proteomics data, across liver tissue and PBMCs. The integration of transcriptomics and proteomics into a multi-omics model yielded improvement in classification accuracy for the PBMC data. The set of integrated gene-protein biomarkers for PBMCs show promise toward developing a liquid biopsy for alcohol-associated liver disease.

2.
Obesity (Silver Spring) ; 32(1): 176-186, 2024 Jan.
Article in English | MEDLINE | ID: mdl-37823211

ABSTRACT

OBJECTIVE: Metabolic syndrome (MetS) is defined by clustering of cardiometabolic components, which may be present in different combinations. The authors evaluated clustering in individuals and extended families within and across ancestry groups. METHODS: The prevalence of different combinations of MetS components (high fasting glucose, low high-density lipoprotein cholesterol, high triglycerides, high blood pressure, and abdominal obesity) was estimated in 1651 individuals (340 families) self-reporting as European American (EA), Hispanic/Mexican American (MA), African American (AA), and Japanese American (JA). Odds ratios were estimated using logistic regression with generalized estimating equations comparing individual MetS components, number, and combinations of components for each ancestry group versus EA. RESULTS: Clustering of all five components (Combination #16) was more prevalent in EA (29.9%) and MA (25.2%) individuals than in AA (18.7%) and JA (15.5%) individuals. Compared with EA individuals, AA individuals were 64% and 66% less likely to have high triglycerides and low high-density lipoprotein cholesterol, whereas JA individuals were 85% and 56% less likely to have abdominal obesity and high blood pressure, respectively. Compared with EA individuals, the odds of having two, four, or five components were at least 77% lower in JA individuals, whereas the odds of having three, four, or five components were at least 3.79 times greater in MA individuals. CONCLUSIONS: Understanding heterogeneity in MetS clustering may identify factors important in reducing health disparities.


Subject(s)
Hypertension , Metabolic Syndrome , Humans , Metabolic Syndrome/epidemiology , Metabolic Syndrome/genetics , Obesity, Abdominal/epidemiology , Triglycerides , Obesity , Hypertension/epidemiology , Cluster Analysis , Lipoproteins, HDL , Cholesterol , Risk Factors
3.
Hum Immunol ; 85(1): 110735, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38040543

ABSTRACT

Alcohol-associated hepatitis (AH) is often diagnosed at advanced stages, and severe AH usually carries poor prognosis and high short-term mortality. In addition, it is challenging to understand the molecular mechanisms of immune dysregulation and inflammation in AH due to the cellular complexity and heterogeneity. Using single-cell RNA sequencing, previous studies found that AH causes dysfunctional innate immune response in monocytes, involving activation of pattern recognition receptors (PRRs) and cytokine signaling pathways. To better understand the coordinated systemic immune response in AH patients, we performed combined single-cell transcriptome, cell-surface protein, and lymphocyte antigen receptor analysis of peripheral blood mononuclear cell (PBMC) samples. Our results showed inflammatory cytokines and chemokines were highly expressed in AH, including IL-2, IL-32, CXC3R1 and CXCL16 in monocytes and NK cells, whereas HLA-DR genes were reduced in monocytes. In addition, we also found altered differentiation of T-helper cells (TH1 and TH17), which could further lead to neutrophil recruitment and macrophage activation. Lastly, our results also suggest impaired NK-cell activation and differentiation in AH with reduced gene expression of KLRC2 and increased gene expression of KLRG1. Our findings indicate different mechanisms may be involved in impaired immune and inflammatory responses for the cellular subtypes of the PBMCs in AH.


Subject(s)
Hepatitis, Alcoholic , Leukocytes, Mononuclear , Humans , Leukocytes, Mononuclear/metabolism , Cytokines/metabolism , Chemokines/metabolism , Hepatitis, Alcoholic/genetics , Hepatitis, Alcoholic/metabolism , Gene Expression Profiling , NK Cell Lectin-Like Receptor Subfamily C
4.
Hepatology ; 2023 Oct 02.
Article in English | MEDLINE | ID: mdl-37796138

ABSTRACT

Excessive alcohol use is a major risk factor for the development of an alcohol use disorder (AUD) and contributes to a wide variety of other medical illnesses, including alcohol-associated liver disease (ALD). Both AUD and ALD are complex and causally interrelated diseases, and multiple factors other than alcohol consumption are implicated in the disease pathogenesis. While the underlying pathophysiology of AUD and ALD is complex, there is substantial evidence for a genetic susceptibility of both diseases. Current genome-wide association studies indicate that the genes associated with clinical AUD only poorly overlap with the genes identified for heavy drinking and, in turn, neither overlap with the genes identified for ALD. Uncovering the main genetic factors will enable us to identify molecular drivers underlying the pathogenesis, discover potential targets for therapy, and implement patient care early in disease progression. In this review, we described multiple genomic approaches and their implications to investigate the susceptibility and pathogenesis of both AUD and ALD. We concluded our review with a discussion of the knowledge gaps and future research on genomic studies in these 2 diseases.

5.
Int J Hyg Environ Health ; 252: 114211, 2023 07.
Article in English | MEDLINE | ID: mdl-37393842

ABSTRACT

Animal and epidemiologic studies suggest that there may be adverse health effects from exposure to glyphosate, the most highly used pesticide in the world, and its metabolite aminomethylphosphonic acid (AMPA). Meanwhile, consumption of organic foods (presumably grown free of chemical pesticides) has increased in recent years. However, there have been limited biomonitoring studies assessing the levels of human glyphosate and AMPA exposure in the United States. We examined urinary levels of glyphosate and AMPA in the context of organic eating behavior in a cohort of healthy postmenopausal women residing in Southern California and evaluated associations with demographics, dietary intake, and other lifestyle factors. 338 women provided two first-morning urine samples and at least one paired 24-h dietary recall reporting the previous day's dietary intake. Urinary glyphosate and AMPA were measured using LC-MS/MS. Participants reported on demographic and lifestyle factors via questionnaires. Potential associations were examined between these factors and urinary glyphosate and AMPA concentrations. Glyphosate was detected in 89.9% of urine samples and AMPA in 67.2%. 37.9% of study participants reported often or always eating organic food, 30.2% sometimes, and 32.0% seldom or never. Frequency of organic food consumption was associated with several demographic and lifestyle factors. Frequent organic eaters had significantly lower urinary glyphosate and AMPA levels, but not after adjustment for covariates. Grain consumption was significantly associated with higher urinary glyphosate levels, even among women who reported often or always eating organic grains. Soy protein and alcohol consumption as well as high frequency of eating fast food were associated with higher urinary AMPA levels. In conclusion, in the largest study to date examining paired dietary recall data and measurements of first-void urinary glyphosate and AMPA, the vast majority of subjects sampled had detectable levels, and significant dietary sources in the American diet were identified.


Subject(s)
Herbicides , Pesticides , Animals , Humans , Female , Cross-Sectional Studies , alpha-Amino-3-hydroxy-5-methyl-4-isoxazolepropionic Acid , Herbicides/urine , Chromatography, Liquid , Postmenopause , Tandem Mass Spectrometry , Feeding Behavior , Eating , Glyphosate
6.
Am J Pathol ; 192(12): 1658-1669, 2022 12.
Article in English | MEDLINE | ID: mdl-36243044

ABSTRACT

Alcohol-associated hepatitis (AH) is a form of liver failure with high short-term mortality. Recent studies have shown that defective function of hepatocyte nuclear factor 4 alpha (HNF4a) and systemic inflammation are major disease drivers of AH. Plasma biomarkers of hepatocyte function could be useful for diagnostic and prognostic purposes. Herein, an integrative analysis of hepatic RNA sequencing and liquid chromatography-tandem mass spectrometry was performed to identify plasma protein signatures for patients with mild and severe AH. Alcohol-related liver disease cirrhosis, nonalcoholic fatty liver disease, and healthy subjects were used as comparator groups. Levels of identified proteins primarily involved in hepatocellular function were decreased in patients with AH, which included hepatokines, clotting factors, complement cascade components, and hepatocyte growth activators. A protein signature of AH disease severity was identified, including thrombin, hepatocyte growth factor α, clusterin, human serum factor H-related protein, and kallistatin, which exhibited large abundance shifts between severe and nonsevere AH. The combination of thrombin and hepatocyte growth factor α discriminated between severe and nonsevere AH with high sensitivity and specificity. These findings were correlated with the liver expression of genes encoding secreted proteins in a similar cohort, finding a highly consistent plasma protein signature reflecting HNF4A and HNF1A functions. This unbiased proteomic-transcriptome analysis identified plasma protein signatures and pathways associated with disease severity, reflecting HNF4A/1A activity useful for diagnostic assessment in AH.


Subject(s)
Carcinoma, Hepatocellular , Hepatitis, Alcoholic , Liver Neoplasms , Humans , Transcriptome , Hepatocyte Growth Factor/genetics , Proteomics , Thrombin/metabolism , Hepatitis, Alcoholic/diagnosis , Proteins/genetics , Biomarkers
7.
JHEP Rep ; 4(10): 100560, 2022 Oct.
Article in English | MEDLINE | ID: mdl-36119721

ABSTRACT

Background & Aims: Liver disease carries significant healthcare burden and frequently requires a combination of blood tests, imaging, and invasive liver biopsy to diagnose. Distinguishing between inflammatory liver diseases, which may have similar clinical presentations, is particularly challenging. In this study, we implemented a machine learning pipeline for the identification of diagnostic gene expression biomarkers across several alcohol-associated and non-alcohol-associated liver diseases, using either liver tissue or blood-based samples. Methods: We collected peripheral blood mononuclear cells (PBMCs) and liver tissue samples from participants with alcohol-associated hepatitis (AH), alcohol-associated cirrhosis (AC), non-alcohol-associated fatty liver disease, chronic HCV infection, and healthy controls. We performed RNA sequencing (RNA-seq) on 137 PBMC samples and 67 liver tissue samples. Using gene expression data, we implemented a machine learning feature selection and classification pipeline to identify diagnostic biomarkers which distinguish between the liver disease groups. The liver tissue results were validated using a public independent RNA-seq dataset. The biomarkers were computationally validated for biological relevance using pathway analysis tools. Results: Utilizing liver tissue RNA-seq data, we distinguished between AH, AC, and healthy conditions with overall accuracies of 90% in our dataset, and 82% in the independent dataset, with 33 genes. Distinguishing 4 liver conditions and healthy controls yielded 91% overall accuracy in our liver tissue dataset with 39 genes, and 75% overall accuracy in our PBMC dataset with 75 genes. Conclusions: Our machine learning pipeline was effective at identifying a small set of diagnostic gene biomarkers and classifying several liver diseases using RNA-seq data from liver tissue and PBMCs. The methodologies implemented and genes identified in this study may facilitate future efforts toward a liquid biopsy diagnostic for liver diseases. Lay summary: Distinguishing between inflammatory liver diseases without multiple tests can be challenging due to their clinically similar characteristics. To lay the groundwork for the development of a non-invasive blood-based diagnostic across a range of liver diseases, we compared samples from participants with alcohol-associated hepatitis, alcohol-associated cirrhosis, chronic hepatitis C infection, and non-alcohol-associated fatty liver disease. We used a machine learning computational approach to demonstrate that gene expression data generated from either liver tissue or blood samples can be used to discover a small set of gene biomarkers for effective diagnosis of these liver diseases.

8.
Environ Health Perspect ; 130(4): 47001, 2022 04.
Article in English | MEDLINE | ID: mdl-35377194

ABSTRACT

BACKGROUND: Glyphosate is the most commonly used herbicide in the world and is purported to have a variety of health effects, including endocrine disruption and an elevated risk of several types of cancer. Blood DNA methylation has been shown to be associated with many other environmental exposures, but to our knowledge, no studies to date have examined the association between blood DNA methylation and glyphosate exposure. OBJECTIVE: We conducted an epigenome-wide association study to identify DNA methylation loci associated with urinary glyphosate and its metabolite aminomethylphosphonic acid (AMPA) levels. Secondary goals were to determine the association of epigenetic age acceleration with glyphosate and AMPA and develop blood DNA methylation indices to predict urinary glyphosate and AMPA levels. METHODS: For 392 postmenopausal women, white blood cell DNA methylation was measured using the Illumina Infinium MethylationEPIC BeadChip array. Glyphosate and AMPA were measured in two urine samples per participant using liquid chromatography-tandem mass spectrometry. Methylation differences at the probe and regional level associated with glyphosate and AMPA levels were assessed using a resampling-based approach. Probes and regions that had an false discovery rate q<0.1 in ≥90% of 1,000 subsamples of the study population were considered differentially methylated. Differentially methylated sites from the probe-specific analysis were combined into a methylation index. Epigenetic age acceleration from three epigenetic clocks and an epigenetic measure of pace of aging were examined for associations with glyphosate and AMPA. RESULTS: We identified 24 CpG sites whose methylation level was associated with urinary glyphosate concentration and two associated with AMPA. Four regions, within the promoters of the MSH4, KCNA6, ABAT, and NDUFAF2/ERCC8 genes, were associated with glyphosate levels, along with an association between ESR1 promoter hypomethylation and AMPA. The methylation index accurately predicted glyphosate levels in an internal validation cohort. AMPA, but not glyphosate, was associated with greater epigenetic age acceleration. DISCUSSION: Glyphosate and AMPA exposure were associated with DNA methylation differences that could promote the development of cancer and other diseases. Further studies are warranted to replicate our results, determine the functional impact of glyphosate- and AMPA-associated differential DNA methylation, and further explore whether DNA methylation could serve as a biomarker of glyphosate exposure. https://doi.org/10.1289/EHP10174.


Subject(s)
DNA Methylation , Postmenopause , Cross-Sectional Studies , DNA Repair Enzymes , Female , Glycine/analogs & derivatives , Humans , Kv1.6 Potassium Channel , Transcription Factors , Glyphosate
9.
BMC Bioinformatics ; 23(1): 17, 2022 Jan 06.
Article in English | MEDLINE | ID: mdl-34991439

ABSTRACT

BACKGROUND: A limitation of traditional differential expression analysis on small datasets involves the possibility of false positives and false negatives due to sample variation. Considering the recent advances in deep learning (DL) based models, we wanted to expand the state-of-the-art in disease biomarker prediction from RNA-seq data using DL. However, application of DL to RNA-seq data is challenging due to absence of appropriate labels and smaller sample size as compared to number of genes. Deep learning coupled with transfer learning can improve prediction performance on novel data by incorporating patterns learned from other related data. With the emergence of new disease datasets, biomarker prediction would be facilitated by having a generalized model that can transfer the knowledge of trained feature maps to the new dataset. To the best of our knowledge, there is no Convolutional Neural Network (CNN)-based model coupled with transfer learning to predict the significant upregulating (UR) and downregulating (DR) genes from both trained and untrained datasets. RESULTS: We implemented a CNN model, DEGnext, to predict UR and DR genes from gene expression data obtained from The Cancer Genome Atlas database. DEGnext uses biologically validated data along with logarithmic fold change values to classify differentially expressed genes (DEGs) as UR and DR genes. We applied transfer learning to our model to leverage the knowledge of trained feature maps to untrained cancer datasets. DEGnext's results were competitive (ROC scores between 88 and 99[Formula: see text]) with those of five traditional machine learning methods: Decision Tree, K-Nearest Neighbors, Random Forest, Support Vector Machine, and XGBoost. DEGnext was robust and effective in terms of transferring learned feature maps to facilitate classification of unseen datasets. Additionally, we validated that the predicted DEGs from DEGnext were mapped to significant Gene Ontology terms and pathways related to cancer. CONCLUSIONS: DEGnext can classify DEGs into UR and DR genes from RNA-seq cancer datasets with high performance. This type of analysis, using biologically relevant fine-tuning data, may aid in the exploration of potential biomarkers and can be adapted for other disease datasets.


Subject(s)
Neoplasms , Neural Networks, Computer , Humans , Machine Learning , RNA-Seq , Support Vector Machine
10.
Epigenetics ; 17(5): 531-546, 2022 05.
Article in English | MEDLINE | ID: mdl-34116608

ABSTRACT

BACKGROUND: Altered DNA methylation may be an intermediate phenotype between breast cancer risk factors and disease. Mammographic density is a strong risk factor for breast cancer. However, no studies to date have identified an epigenetic signature of mammographic density. We performed an epigenome-wide association study of mammographic density. METHODS: White blood cell DNA methylation was measured for 385 postmenopausal women using the Illumina Infinium MethylationEPIC BeadChip array. Differential methylation was assessed using genome-wide, probe-level, and regional analyses. We implemented a resampling-based approach to improve the stability of our findings. RESULTS: On average, women with elevated mammographic density exhibited DNA hypermethylation within CpG islands and gene promoters compared to women with lower mammographic density. We identified 250 CpG sites for which DNA methylation was significantly associated with mammographic density. The top sites were located within genes associated with cancer, including HDLBP, TGFB2, CCT4, and PAX8, and were more likely to be located in regulatory regions of the genome. We also identified differential DNA methylation in 37 regions, including within the promoters of PAX8 and PF4, a gene involved in the regulation of angiogenesis. Overall, our results paint a picture of epigenetic dysregulation associated with mammographic density. CONCLUSION: Mammographic density is associated with differential DNA methylation throughout the genome, including within genes associated with cancer. Our results suggest the potential involvement of several genes in the biological mechanisms behind differences in breast density between women. Further studies are warranted to explore these potential mechanisms and potential links to breast cancer risk.


Subject(s)
Breast Density , Breast Neoplasms , Breast Neoplasms/diagnostic imaging , Breast Neoplasms/genetics , CpG Islands , DNA Methylation , Epigenesis, Genetic , Epigenomics , Female , Genome-Wide Association Study/methods , Humans
11.
Aging Cell ; 20(7): e13362, 2021 07.
Article in English | MEDLINE | ID: mdl-34197020

ABSTRACT

Gene variants associated with longevity are also associated with protection against cognitive decline, dementia and Alzheimer's disease, suggesting that common physiologic pathways act at the interface of longevity and cognitive function. To test the hypothesis that variants in genes implicated in cognitive function may promote exceptional longevity, we performed a comprehensive 3-stage study to identify functional longevity-associated variants in ~700 candidate genes in up to 450 centenarians and 500 controls by target capture sequencing analysis. We found an enrichment of longevity-associated genes in the nPKC and NF-κB signaling pathways by gene-based association analyses. Functional analysis of the top three gene variants (NFKBIA, CLU, PRKCH) suggests that non-coding variants modulate the expression of cognate genes, thereby reducing signaling through the nPKC and NF-κB. This matches genetic studies in multiple model organisms, suggesting that the evolutionary conservation of reduced PKC and NF-κB signaling pathways in exceptional longevity may include humans.


Subject(s)
Longevity/genetics , NF-kappa B/genetics , Peptide Fragments/genetics , Protein Kinase C/genetics , Genetic Variation , Humans , Signal Transduction
12.
Diabetol Metab Syndr ; 13(1): 59, 2021 Jun 01.
Article in English | MEDLINE | ID: mdl-34074324

ABSTRACT

BACKGROUND: To identify genetic associations of quantitative metabolic syndrome (MetS) traits and characterize heterogeneity across ethnic groups. METHODS: Data was collected from GENetics of Noninsulin dependent Diabetes Mellitus (GENNID), a multiethnic resource of Type 2 diabetic families and included 1520 subjects in 259 African-American, European-American, Japanese-Americans, and Mexican-American families. We focused on eight MetS traits: weight, waist circumference, systolic and diastolic blood pressure, high-density lipoprotein, triglycerides, fasting glucose, and insulin. Using genotyped and imputed data from Illumina's Multiethnic array, we conducted genome-wide association analyses with linear mixed models for all ethnicities, except for the smaller Japanese-American group, where we used additive genetic models with gene-dropping. RESULTS: Findings included ethnic-specific genetic associations and heterogeneity across ethnicities. Most significant associations were outside our candidate linkage regions and were coincident within a gene or intergenic region, with two exceptions in European-American families: (a) within previously identified linkage region on chromosome 2, two significant GLI2-TFCP2L1 associations with weight, and (b) one chromosome 11 variant near CADM1-LINC00900 with pleiotropic blood pressure effects. CONCLUSIONS: This multiethnic family study found genetic heterogeneity and coincident associations (with one case of pleiotropy), highlighting the importance of including diverse populations in genetic research and illustrating the complex genetic architecture underlying MetS.

13.
BMC Med Genet ; 21(1): 152, 2020 07 22.
Article in English | MEDLINE | ID: mdl-32698886

ABSTRACT

BACKGROUND: Intermediate filament proteins that construct the nuclear lamina of a cell include the Lamin A/C proteins encoded by the LMNA gene, and are implicated in fundamental processes such as nuclear structure, gene expression, and signal transduction. LMNA mutations predominantly affect mesoderm-derived cell lineages in diseases collectively termed as laminopathies that include dilated cardiomyopathy with conduction defects, different forms of muscular dystrophies, and premature aging syndromes as Hutchinson-Gilford Progeria Syndrome. At present, our understanding of the molecular mechanisms regulating tissue-specific manifestations of laminopathies are still limited. METHODS: To gain deeper insight into the molecular mechanism of a novel LMNA splice-site mutation (c.357-2A > G) in an affected family with cardiac disease, we conducted deep RNA sequencing and pathway analysis for nine fibroblast samples obtained from three patients with cardiomyopathy, three unaffected family members, and three unrelated, unaffected individuals. We validated our findings by quantitative PCR and protein studies. RESULTS: We identified eight significantly differentially expressed genes between the mutant and non-mutant fibroblasts, that included downregulated insulin growth factor binding factor protein 5 (IGFBP5) in patient samples. Pathway analysis showed involvement of the ERK/MAPK signaling pathway consistent with previous studies. We found no significant differences in gene expression for Lamin A/C and B-type lamins between the groups. In mutant fibroblasts, RNA-seq confirmed that only the LMNA wild type allele predominately was expressed, and Western Blot showed normal Lamin A/C protein levels. CONCLUSIONS: IGFBP5 may contribute in maintaining signaling pathway homeostasis, which may lead to the absence of notable molecular and structural abnormalities in unaffected tissues such as fibroblasts. Compensatory mechanisms from other nuclear membrane proteins were not found. Our results also demonstrate that only one copy of the wild type allele is sufficient for normal levels of Lamin A/C protein to maintain physiological function in an unaffected cell type. This suggests that affected cell types such as cardiac tissues may be more sensitive to haploinsufficiency of Lamin A/C. These results provide insight into the molecular mechanism of disease with a possible explanation for the tissue specificity of LMNA-related dilated cardiomyopathy.


Subject(s)
Cardiomyopathies/genetics , Fibroblasts/metabolism , Fibroblasts/pathology , Gene Expression Profiling , Lamin Type A/genetics , Signal Transduction/genetics , Base Sequence , Family , Gene Expression Regulation , Humans , MAP Kinase Signaling System/genetics , Nuclear Lamina/metabolism
14.
Article in English | MEDLINE | ID: mdl-32182891

ABSTRACT

Environmental factors have been linked to many diseases and health conditions, but reliable assessment of environmental exposures is challenging. Developing biomarkers of environmental exposures, rather than relying on self-report, will improve our ability to assess the association of such exposures with disease. Epigenetic markers, most notably DNA methylation, have been identified for some environmental exposures, but identification of markers for additional exposures is still needed. The rationale behind the Markers for Environmental Exposures (MEE) Study was to (1) identify biomarkers, especially epigenetic markers, of environmental exposures, such as pesticides, air/food/water contaminants, and industrial chemicals that are commonly encountered in the general population; and (2) support the study of potential relationships between environmental exposures and health and health-related factors. The MEE Study is a cross-sectional study with potential for record linkage and follow-up. The well-characterized cohort of 400 postmenopausal women has generated a repository of biospecimens, including blood, urine, and saliva samples. Paired data include an environmental exposures questionnaire, a breast health questionnaire, dietary recalls, and a food frequency questionnaire. This work describes the rationale, study design, and cohort characteristics of the MEE Study. In addition to our primary research goals, we hope that the data and biorepository generated by this study will serve as a resource for future studies and collaboration.


Subject(s)
Environmental Exposure , Biomarkers , Cohort Studies , Cross-Sectional Studies , Female , Humans , Pesticides
15.
Hear Res ; 387: 107875, 2020 03 01.
Article in English | MEDLINE | ID: mdl-31896498

ABSTRACT

BACKGROUND: This study investigated the relationship between smoking and hearing loss and deafness (HLD) and whether the relationship is modified by genetic variation. Data for these analyses was from the subset of Japanese American families collected as part of the American Diabetes Association Genetics of Non-insulin Dependent Diabetes Mellitus study. Logistic regression with generalized estimating equations assessed the relationship between HLD and smoking. Nonparametric linkage analysis identified genetic regions harboring HLD susceptibility genes and ordered subset analysis was used to identify regions showing evidence for gene-smoking interactions. Genetic variants within these candidate regions were then each tested for interaction with smoking using logistic regression models. RESULTS: After adjusting for age, sex, diabetes status and smoking duration, for each pack of cigarettes smoked per day, risk of HLD increased 4.58 times (odds ratio (OR) = 4.58; 95% Confidence Interval (CI): (1.40,15.03)), and ever smokers were over 5 times more likely than nonsmokers to report HLD (OR = 5.22; 95% CI: (1.24, 22.03)). Suggestive evidence for linkage for HLD was observed in multiple genomic regions (Chromosomes 5p15, 8p23 and 17q21), and additional suggestive regions were identified when considering interactions with smoking status (Chromosomes 7p21, 11q23, 12q32, 15q26, and 20q13) and packs-per-day (Chromosome 8q21). CONCLUSIONS: To our knowledge this was the first report of possible gene-by-smoking interactions in HLD using family data. Additional work, including independent replication, is needed to understand the basis of these findings. HLD are important public health issues and understanding the contributions of genetic and environmental factors may inform public health messages and policies.


Subject(s)
Asian/genetics , Deafness/genetics , Gene-Environment Interaction , Hearing/genetics , Polymorphism, Single Nucleotide , Smoking/adverse effects , Adaptor Proteins, Signal Transducing/genetics , Adult , Aged , Cyclic Nucleotide Phosphodiesterases, Type 7/genetics , Deafness/ethnology , Deafness/physiopathology , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Japan/ethnology , Male , Membrane Proteins/genetics , Middle Aged , Phenotype , Prevalence , Repressor Proteins/genetics , Risk Assessment , Risk Factors , Smoking/ethnology , United States/epidemiology
16.
Genet Epidemiol ; 44(1): 16-25, 2020 01.
Article in English | MEDLINE | ID: mdl-31647587

ABSTRACT

Genome-wide association studies (GWAS) have been used to establish thousands of genetic associations across numerous phenotypes. To improve the power of GWAS and generalize associations across ethnic groups, transethnic meta-analysis methods are used to combine the results of several GWAS from diverse ancestries. The goal of this study is to identify genetic associations for eight quantitative metabolic syndrome (MetS) traits through a meta-analysis across four ethnic groups. Traits were measured in the GENetics of Noninsulin dependent Diabetes Mellitus (GENNID) Study which consists of African-American (families = 73, individuals = 288), European-American (families = 79, individuals = 519), Japanese-American (families = 17, individuals = 132), and Mexican-American (families = 113, individuals = 610) samples. Genome-wide association results from these four ethnic groups were combined using four meta-analysis methods: fixed effects, random effects, TransMeta, and MR-MEGA. We provide an empirical comparison of the four meta-analysis methods from the GENNID results, discuss which types of loci (characterized by allelic heterogeneity) appear to be better detected by each of the four meta-analysis methods in the GENNID Study, and validate our results using previous genetic discoveries. We specifically compare the two transethnic methods, TransMeta and MR-MEGA, and discuss how each transethnic method's framework relates to the types of loci best detected by each method.


Subject(s)
Genome-Wide Association Study/methods , Meta-Analysis as Topic , Metabolic Syndrome/ethnology , Metabolic Syndrome/genetics , Black or African American/genetics , Asian/genetics , Diabetes Mellitus, Type 2/genetics , Humans , Male , Mexican Americans/genetics , Phenotype , Polymorphism, Single Nucleotide/genetics , White People/genetics
17.
BMC Bioinformatics ; 20(1): 595, 2019 Nov 19.
Article in English | MEDLINE | ID: mdl-31744472

ABSTRACT

BACKGROUND: Researchers commonly analyze lists of differentially expressed entities (DEEs), such as differentially expressed genes (DEGs), differentially expressed proteins (DEPs), and differentially methylated positions/regions (DMPs/DMRs), across multiple pairwise comparisons. Large biological studies can involve multiple conditions, tissues, and timepoints that result in dozens of pairwise comparisons. Manually filtering and comparing lists of DEEs across multiple pairwise comparisons, typically done by writing custom code, is a cumbersome task that can be streamlined and standardized. RESULTS: A-Lister is a lightweight command line and graphical user interface tool written in Python. It can be executed in a differential expression mode or generic name list mode. In differential expression mode, A-Lister accepts as input delimited text files that are output by differential expression tools such as DESeq2, edgeR, Cuffdiff, and limma. To allow for the most flexibility in input ID types, to avoid database installation requirements, and to allow for secure offline use, A-Lister does not validate or impose restrictions on entity ID names. Users can specify thresholds to filter the input file(s) by column(s) such as p-value, q-value, and fold change. Additionally, users can filter the pairwise comparisons within the input files by fold change direction (sign). Queries composed of intersection, fuzzy intersection, difference, and union set operations can also be performed on any number of pairwise comparisons. Thus, the user can filter and compare any number of pairwise comparisons within a single A-Lister differential expression command. In generic name list mode, A-Lister accepts delimited text files containing lists of names as input. Queries composed of intersection, fuzzy intersection, difference, and union set operations can then be performed across these lists of names. CONCLUSIONS: A-Lister is a flexible tool that enables the user to rapidly narrow down large lists of DEEs to a small number of most significant entities. These entities can then be further analyzed using visualization, pathway analysis, and other bioinformatics tools.


Subject(s)
Genomics/methods , Software , Computational Biology , Databases, Factual , Fuzzy Logic , Gene Expression Profiling , Humans , Search Engine , User-Computer Interface
18.
Mutat Res ; 809: 24-31, 2018 05.
Article in English | MEDLINE | ID: mdl-29677560

ABSTRACT

Identification of all genetic variants associated with complex traits is one of the most important goals in modern human genetics. Genome-wide association studies (GWAS) have been successfully applied to identify common variants, which thus far explain only small portion of heritability. Interests in rare variants have been increasingly growing as an answer for this missing heritability. While next-generation sequencing allows detection of rare variants, its cost is still prohibitively high to sequence a large number of human DNA samples required for rare variant association studies. In this study, we evaluated the sensitivity and specificity of sequencing for pooled DNA samples of multiple individuals (Pool-seq) as a cost-effective and robust approach for rare variant discovery. We comparatively analyzed Pool-seq vs. individual-seq of indexed target capture of up to 960 genes in ∼1000 individuals, followed by independent genotyping validation studies. We found that Pool-seq was as effective and accurate as individual-seq in detecting rare variants and accurately estimating their minor allele frequencies (MAFs). Our results suggest that Pool-seq can be used as an efficient and cost-effective method for discovery of rare variants for population-based sequencing studies in individual laboratories.


Subject(s)
Genetic Variation , Genome-Wide Association Study/methods , High-Throughput Nucleotide Sequencing/methods , Female , Humans , Male
19.
Twin Res Hum Genet ; 18(6): 727-37, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26608796

ABSTRACT

Neurophysiological measurements of the response to pre-pulse and startle stimuli have been suggested to represent an important endophenotype for both substance dependence and other select psychiatric disorders. We have previously shown, in young adult Mexican Americans (MA), that presentation of a short delay acoustic pre-pulse, prior to the startle stimuli can elicit a late negative component at about 400 msec (N4S), in the event-related potential (ERP), recorded from frontal cortical areas. In the present study, we investigated whether genetic factors associated with this endophenotype could be identified. The study included 420 (age 18-30 years) MA men (n = 170), and women (n = 250). DNA was genotyped using an Affymetrix Axiom Exome1A chip. An association analysis revealed that the CCKAR and CCKBR (cholecystokinin A and B receptor) genes each had a nearby variant that showed suggestive significance with the amplitude of the N4S component to pre-pulse stimuli. The neurotransmitter cholecystokinin (CCK), along with its receptors, CCKAR and CCKBR, have been previously associated with psychiatric disorders, suggesting that variants near these genes may play a role in the pre-pulse/startle response in this cohort.


Subject(s)
Mexican Americans/genetics , Receptor, Cholecystokinin A/genetics , Receptor, Cholecystokinin B/genetics , Reflex, Startle/genetics , Adolescent , Adult , Cohort Studies , Electrophysiological Phenomena , Female , Humans , Male , Risk Assessment , Young Adult
20.
Sci Rep ; 5: 14840, 2015 Oct 07.
Article in English | MEDLINE | ID: mdl-26443302

ABSTRACT

Microorganisms almost always exist as mixed communities in nature. While the significance of microbial community activities is well appreciated, a thorough understanding about how microbial communities respond to environmental perturbations has not yet been achieved. Here we have used a combination of metagenomic, genome binning, and stimulus-induced metatranscriptomic approaches to estimate the metabolic network and stimuli-induced metabolic switches existing in a complex microbial biofilm that was producing electrical current via extracellular electron transfer (EET) to a solid electrode surface. Two stimuli were employed: to increase EET and to stop EET. An analysis of cell activity marker genes after stimuli exposure revealed that only two strains within eleven binned genomes had strong transcriptional responses to increased EET rates, with one responding positively and the other responding negatively. Potential metabolic switches between eleven dominant members were mainly observed for acetate, hydrogen, and ethanol metabolisms. These results have enabled the estimation of a multi-species metabolic network and the associated short-term responses to EET stimuli that induce changes to metabolic flow and cooperative or competitive microbial interactions. This systematic meta-omics approach represents a next step towards understanding complex microbial roles within a community and how community members respond to specific environmental stimuli.


Subject(s)
Biofilms , Gene Expression Profiling , Genes, Bacterial/genetics , Metabolic Networks and Pathways , Metagenomics , Transcriptome , Electron Transport , Electrons , Genome, Bacterial , Microbial Interactions
SELECTION OF CITATIONS
SEARCH DETAIL
...