Search | VHL Regional Portal

1.

Automated Cytometric Gating with Human-Level Performance Using Bivariate Segmentation.

Chen, Jiong; Ionita, Matei; Feng, Yanbo; Lu, Yinfeng; Orzechowski, Patryk; Garai, Sumita; Hassinger, Kenneth; Bao, Jingxuan; Wen, Junhao; Duong-Tran, Duy; Wagenaar, Joost; McKeague, Michelle L; Painter, Mark M; Mathew, Divij; Pattekar, Ajinkya; Meyer, Nuala J; John Wherry, E; Greenplate, Allison R; Shen, Li.

bioRxiv ; 2024 May 09.

Article in English | MEDLINE | ID: mdl-38766268

ABSTRACT

Recent advances in cytometry technology have enabled high-throughput data collection with multiple single-cell protein expression measurements. The significant biological and technical variance between samples in cytometry has long posed a formidable challenge during the gating process, especially for the initial gates, which deal with unpredictable events such as debris and technical artifacts. Even with the same experimental machine and protocol, the target population, as well as the cell population that needs to be excluded, may vary across different measurements. To address this challenge and mitigate the labor-intensive manual gating process, we propose a deep learning framework, UNITO, to rigorously identify the hierarchical cytometric subpopulations. The UNITO framework transforms a cell-level classification task into an image-based semantic segmentation problem. For reproducibility purposes, the framework was applied to three independent cohorts and successfully detected initial gates that were required to identify single cellular events as well as subsequent cell gates. We validated the UNITO framework by comparing its results with previous automated methods and the consensus of at least four experienced immunologists. UNITO outperformed existing automated methods and differed from human consensus by no more than each individual human did. Most critically, the UNITO framework functions as a fully automated pipeline after training and does not require human hints or prior knowledge. Unlike existing multi-channel classification or clustering pipelines, UNITO can reproduce a contour similar to that of manual gating for each intermediate gate, which improves interpretability and allows post hoc visual inspection. Beyond acting as a pioneering framework that uses image segmentation for auto-gating, UNITO gives a fast and interpretable way to assign cell subtype membership, and its speed is not affected by the number of cells in each sample. The pre-gating and gating inference takes approximately 2 minutes for each sample using our pre-defined 9-gate system, and it can also adapt to any sequential prediction with different configurations.
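
The central idea of this abstract, recasting per-cell classification as segmentation of a bivariate image, can be illustrated with a small sketch. It is not the UNITO implementation: the bin count, the function names (`to_density_image`, `cells_in_mask`), the toy measurements, and the hand-written mask that stands in for a trained segmentation network's output are all assumptions made for illustration.

```python
def bin_index(value, lo, hi, n_bins):
    """Map a measurement to a bin index in [0, n_bins - 1]."""
    if hi == lo:
        return 0
    idx = int((value - lo) / (hi - lo) * n_bins)
    return min(max(idx, 0), n_bins - 1)


def to_density_image(xs, ys, n_bins=4):
    """Rasterize two marker channels into an n_bins x n_bins count image.

    Also returns the (row, col) pixel each cell fell into, so that a
    pixel-level mask can later be mapped back to individual cells.
    """
    x_lo, x_hi = min(xs), max(xs)
    y_lo, y_hi = min(ys), max(ys)
    image = [[0] * n_bins for _ in range(n_bins)]
    bins = []
    for x, y in zip(xs, ys):
        col = bin_index(x, x_lo, x_hi, n_bins)
        row = bin_index(y, y_lo, y_hi, n_bins)
        image[row][col] += 1
        bins.append((row, col))
    return image, bins


def cells_in_mask(bins, mask):
    """Translate a pixel-level mask back into per-cell gate membership."""
    return [bool(mask[row][col]) for row, col in bins]


# Toy data: five events measured on two channels (hypothetical values).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [0.0, 1.0, 2.0, 3.0, 4.0]
image, bins = to_density_image(xs, ys, n_bins=4)

# Stand-in for a segmentation model's output: keep the high/high quadrant.
mask = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
in_gate = cells_in_mask(bins, mask)
```

In this sketch the segmentation step would operate on a fixed-size image regardless of how many events were rasterized into it, which is one plausible reading of the abstract's remark that speed does not depend on the number of cells.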

2.

Identifying Shared Neuroanatomic Architecture between Cognitive Traits through Multiscale Morphometric Correlation Analysis.

Wen, Zixuan; Bao, Jingxuan; Yang, Shu; Risacher, Shannon L; Saykin, Andrew J; Thompson, Paul M; Davatzikos, Christos; Huang, Heng; Zhao, Yize; Shen, Li.

Med Image Comput Comput Assist Interv MICCAI 2023 Workshops (2023) ; 14394: 227-240, 2024.

Article in English | MEDLINE | ID: mdl-38584725

ABSTRACT

We introduce an informative metric, called morphometric correlation, as a measure of shared neuroanatomic similarity between two cognitive traits. Traditional estimates of trait correlations can be confounded by factors beyond brain morphology. To exclude these confounding factors, we adopt a Gaussian kernel to measure the morphological similarity between individuals and compare pure neuroanatomic correlations among cognitive traits. In our empirical study, we employ a multiscale strategy. Given a set of cognitive traits, we first perform morphometric correlation analysis for each pair of traits to reveal their shared neuroanatomic correlation at the whole brain (or global) level. After that, we extend our whole brain concept to regional morphometric correlation and estimate shared neuroanatomic similarity between two cognitive traits at the regional (or local) level. Our results demonstrate that morphometric correlation can provide insights into shared neuroanatomic architecture between cognitive traits. Furthermore, we also estimate the morphometricity of each cognitive trait at both global and local levels, which can be used to better understand how neuroanatomic changes influence individuals' cognitive status.
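
As a rough illustration of the Gaussian-kernel step mentioned in this abstract, the sketch below builds a pairwise morphological similarity matrix between individuals. It assumes each individual is described by a vector of standardized morphometric features and that similarity is exp(-||x_i - x_j||^2 / (2*sigma^2)); the bandwidth `sigma`, the feature values, and the function name are illustrative assumptions, since the abstract does not give the exact kernel parameterization.

```python
import math


def gaussian_similarity(features, sigma=1.0):
    """Pairwise Gaussian-kernel similarity between individuals.

    features: list of per-individual feature vectors (assumed standardized).
    sigma:    kernel bandwidth (an assumed, user-chosen value).
    """
    n = len(features)
    kernel = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
            kernel[i][j] = math.exp(-sq_dist / (2 * sigma ** 2))
    return kernel


# Three hypothetical individuals with two morphometric features each.
features = [[0.0, 0.0], [0.0, 0.0], [3.0, 4.0]]
K = gaussian_similarity(features, sigma=5.0)
```

Individuals with identical features get similarity 1, and similarity decays smoothly with squared distance. A matrix of this kind could then play the role that a genetic relationship matrix plays in heritability-style variance component models, though that modeling step is beyond this sketch.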

3.

Multiplexed Serum Biomarkers to Discriminate Nonviable and Ectopic Pregnancy.

Barnhart, Kurt T; Bollig, Kassie J; Senapati, Suneeta; Takacs, Peter; Robins, Jared C; Haisenleder, Daniel J; Beer, Lynn A; Savaris, Ricardo F; Koelper, Nathanael C; Speicher, David W; Chittams, Jesse; Bao, Jingxuan; Wen, Zixuan; Feng, Yanbo; Kim, Mansu; Mumford, Sunni; Shen, Li; Gimotty, Phyllis.

Fertil Steril ; 2024 Apr 25.

Article in English | MEDLINE | ID: mdl-38677710

ABSTRACT

OBJECTIVE: The use of multiplexed biomarkers may improve the diagnosis of normal and abnormal early pregnancies. In this study we assessed 24 markers with multiple machine learning-based methodologies to evaluate combinations of top candidates to develop a multiplexed prediction model for identification of 1) viability and 2) location of an early pregnancy. DESIGN: A nested case-control design evaluating the predictive ability and discrimination of biomarkers in patients at risk of early pregnancy failure in the first trimester to classify viability and location. SUBJECTS: 218 individuals with a symptomatic (pain and/or bleeding) early pregnancy: 75 with an ongoing intrauterine gestation, 68 ectopic pregnancies, and 75 miscarriages. INTERVENTIONS: Serum values of 24 biomarkers were assessed in the same patients. Multiple machine learning-based methodologies were used to evaluate combinations of these top candidates to develop a multiplexed prediction model for identification of 1) a nonviable pregnancy (ongoing intrauterine pregnancy vs miscarriage or ectopic pregnancy) and 2) an ectopic pregnancy (ectopic pregnancy vs ongoing intrauterine pregnancy or miscarriage). MAIN OUTCOME MEASURES: The predicted classification by each model was compared to actual diagnosis, and sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), conclusive classification, and accuracy were calculated. RESULTS: Models using classification regression tree analysis with three markers (PSG3, CG-Alpha and PAPPA) achieved a maximum sensitivity of 93.3% and a maximum specificity of 98.6%. The model with the highest accuracy reached 97.4% (with 70.2% receiving classification). Models using an overlapping group of three markers (sFLT, PSG3 and TFPI2) achieved a maximum sensitivity of 98.5% and a maximum specificity of 95.3%. The model with the highest accuracy reached 94.4% (with 65.6% receiving classification). When the models were used simultaneously, the conclusive classification increased to 72.7% with an accuracy of 95.9%. A random forest using 11 predictive markers produced similar test characteristics. CONCLUSION: We have demonstrated a pool of biomarkers from divergent biological pathways that can be used to classify individuals with potential early pregnancy loss. The biomarkers CG-Alpha, PAPPA and PSG3 can be used to predict viability, and sFLT, TFPI2 and PSG3 can be used to predict pregnancy location.
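
The outcome measures named in this abstract follow from a 2x2 confusion table. The sketch below shows the standard formulas; the counts are invented for illustration and are not the study's data, and `conclusive_fraction` reflects one assumed reading of "conclusive classification" (the share of patients for whom a model returns a definite class).

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Standard test characteristics from confusion-table counts."""
    return {
        "sensitivity": tp / (tp + fn),   # true positives among all positives
        "specificity": tn / (tn + fp),   # true negatives among all negatives
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


def conclusive_fraction(n_classified, n_total):
    """Share of patients who received a definite classification (assumed definition)."""
    return n_classified / n_total


# Hypothetical counts among patients who received a classification.
metrics = diagnostic_metrics(tp=45, fp=5, tn=40, fn=10)
coverage = conclusive_fraction(n_classified=100, n_total=125)
```

Accuracy here is computed only over patients who were classified, which is why the abstract reports it alongside the percentage receiving classification.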

4.

The genetic architecture of multimodal human brain age.

Wen, Junhao; Zhao, Bingxin; Yang, Zhijian; Erus, Guray; Skampardoni, Ioanna; Mamourian, Elizabeth; Cui, Yuhan; Hwang, Gyujoon; Bao, Jingxuan; Boquet-Pujadas, Aleix; Zhou, Zhen; Veturi, Yogasudha; Ritchie, Marylyn D; Shou, Haochang; Thompson, Paul M; Shen, Li; Toga, Arthur W; Davatzikos, Christos.

Nat Commun ; 15(1): 2604, 2024 Mar 23.

Article in English | MEDLINE | ID: mdl-38521789

ABSTRACT

The complex biological mechanisms underlying human brain aging remain incompletely understood. This study investigated the genetic architecture of three brain age gaps (BAG) derived from gray matter volume (GM-BAG), white matter microstructure (WM-BAG), and functional connectivity (FC-BAG). We identified sixteen genomic loci that reached genome-wide significance (P-value < 5×10⁻⁸). A gene-drug-disease network highlighted genes linked to GM-BAG for treating neurodegenerative and neuropsychiatric disorders and WM-BAG genes for cancer therapy. GM-BAG displayed the most pronounced heritability enrichment in genetic variants within conserved regions. Oligodendrocytes and astrocytes, but not neurons, exhibited notable heritability enrichment in WM and FC-BAG, respectively. Mendelian randomization identified potential causal effects of several chronic diseases on brain aging, such as type 2 diabetes on GM-BAG and AD on WM-BAG. Our results provide insights into the genetics of human brain aging, with clinical implications for potential lifestyle and therapeutic interventions. All results are publicly available at https://labs.loni.usc.edu/medicine.

Subject(s)

Diabetes Mellitus, Type 2 , White Matter , Humans , Brain , Gray Matter , Magnetic Resonance Imaging/methods , White Matter/physiology , Mendelian Randomization Analysis

5.

Gene-SGAN: discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering.

Yang, Zhijian; Wen, Junhao; Abdulkadir, Ahmed; Cui, Yuhan; Erus, Guray; Mamourian, Elizabeth; Melhem, Randa; Srinivasan, Dhivya; Govindarajan, Sindhuja T; Chen, Jiong; Habes, Mohamad; Masters, Colin L; Maruff, Paul; Fripp, Jurgen; Ferrucci, Luigi; Albert, Marilyn S; Johnson, Sterling C; Morris, John C; LaMontagne, Pamela; Marcus, Daniel S; Benzinger, Tammie L S; Wolk, David A; Shen, Li; Bao, Jingxuan; Resnick, Susan M; Shou, Haochang; Nasrallah, Ilya M; Davatzikos, Christos.

Nat Commun ; 15(1): 354, 2024 Jan 08.

Article in English | MEDLINE | ID: mdl-38191573

ABSTRACT

Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and single nucleotide polymorphism data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subtyping and endophenotype discovery, and is herein tested on disease-related, genetically-associated neuroimaging phenotypes.

Subject(s)

Alzheimer Disease , Neuroimaging , Humans , Endophenotypes , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Brain/diagnostic imaging , Cluster Analysis

6.

Genomic loci influence patterns of structural covariance in the human brain.

Wen, Junhao; Nasrallah, Ilya M; Abdulkadir, Ahmed; Satterthwaite, Theodore D; Yang, Zhijian; Erus, Guray; Robert-Fitzgerald, Timothy; Singh, Ashish; Sotiras, Aristeidis; Boquet-Pujadas, Aleix; Mamourian, Elizabeth; Doshi, Jimit; Cui, Yuhan; Srinivasan, Dhivya; Skampardoni, Ioanna; Chen, Jiong; Hwang, Gyujoon; Bergman, Mark; Bao, Jingxuan; Veturi, Yogasudha; Zhou, Zhen; Yang, Shu; Dazzan, Paola; Kahn, Rene S; Schnack, Hugo G; Zanetti, Marcus V; Meisenzahl, Eva; Busatto, Geraldo F; Crespo-Facorro, Benedicto; Pantelis, Christos; Wood, Stephen J; Zhuo, Chuanjun; Shinohara, Russell T; Gur, Ruben C; Gur, Raquel E; Koutsouleris, Nikolaos; Wolf, Daniel H; Saykin, Andrew J; Ritchie, Marylyn D; Shen, Li; Thompson, Paul M; Colliot, Olivier; Wittfeld, Katharina; Grabe, Hans J; Tosun, Duygu; Bilgel, Murat; An, Yang; Marcus, Daniel S; LaMontagne, Pamela; Heckbert, Susan R.

Proc Natl Acad Sci U S A ; 120(52): e2300842120, 2023 Dec 26.

Article in English | MEDLINE | ID: mdl-38127979

ABSTRACT

Normal and pathologic neurobiological processes influence brain morphology in coordinated ways that give rise to patterns of structural covariance (PSC) across brain regions and individuals during brain aging and diseases. The genetic underpinnings of these patterns remain largely unknown. We apply a stochastic multivariate factorization method to a diverse population of 50,699 individuals (12 studies and 130 sites) and derive data-driven, multi-scale PSCs of regional brain size. PSCs were significantly correlated with 915 genomic loci in the discovery set, 617 of which are newly identified, and 72% were independently replicated. Key pathways influencing PSCs involve reelin signaling, apoptosis, neurogenesis, and appendage development, while pathways of breast cancer indicate potential interplays between brain metastasis and PSCs associated with neurodegeneration and dementia. Using support vector machines, multi-scale PSCs effectively derive imaging signatures of several brain diseases. Our results elucidate genetic and biological underpinnings that influence structural covariance patterns in the human brain.

Subject(s)

Brain Neoplasms , Magnetic Resonance Imaging , Humans , Magnetic Resonance Imaging/methods , Brain/pathology , Brain Mapping/methods , Genomics , Brain Neoplasms/pathology

7.

An interpretable Alzheimer's disease oligogenic risk score informed by neuroimaging biomarkers improves risk prediction and stratification.

Suh, Erica H; Lee, Garam; Jung, Sang-Hyuk; Wen, Zixuan; Bao, Jingxuan; Nho, Kwangsik; Huang, Heng; Davatzikos, Christos; Saykin, Andrew J; Thompson, Paul M; Shen, Li; Kim, Dokyoon.

Front Aging Neurosci ; 15: 1281748, 2023.

Article in English | MEDLINE | ID: mdl-37953885

ABSTRACT

Introduction: Stratification of Alzheimer's disease (AD) patients into risk subgroups using Polygenic Risk Scores (PRS) presents novel opportunities for the development of clinical trials and disease-modifying therapies. However, the heterogeneous nature of AD continues to pose significant challenges for the broad-scale clinical use of PRS. PRS still falls short in demonstrating sufficient accuracy in risk prediction, particularly for individuals with mild cognitive impairment (MCI), and in allowing feasible interpretation of specific genes or SNPs contributing to disease risk. We propose adORS, a novel oligogenic risk score for AD, to better predict risk of disease by using an optimized list of relevant genetic risk factors. Methods: Using whole genome sequencing data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort (n = 1,545), we selected 20 genes that exhibited the strongest correlations with FDG-PET and AV45-PET, recognized neuroimaging biomarkers that detect functional brain changes in AD. This subset of genes was incorporated into adORS to assess, in comparison to PRS, the prediction accuracy of CN vs. AD classification and MCI conversion prediction, risk stratification of the ADNI cohort, and interpretability of the genetic information included in the scores. Results: adORS improved AUC scores over PRS in both CN vs. AD classification and MCI conversion prediction. The oligogenic model also refined risk-based stratification, even without the assistance of APOE, thus reflecting the true prevalence rate of the ADNI cohort compared to PRS. Interpretation analysis shows that genes included in adORS, such as ATF6, EFCAB11, ING5, SIK3, and CD46, have been observed in similar neurodegenerative disorders and/or are supported by AD-related literature. Discussion: Compared to conventional PRS, adORS may prove to be a more appropriate choice for differentiating patients into high or low genetic risk of AD in clinical studies or settings. Additionally, the ability to interpret specific genetic information allows the focus to be shifted from general relative risk based on a given population to the information that adORS can provide for a single individual, thus permitting the possibility of personalized treatments for AD.

8.

Brain-wide genome-wide colocalization study for integrating genetics, transcriptomics and brain morphometry in Alzheimer's disease.

Bao, Jingxuan; Wen, Junhao; Wen, Zixuan; Yang, Shu; Cui, Yuhan; Yang, Zhijian; Erus, Guray; Saykin, Andrew J; Long, Qi; Davatzikos, Christos; Shen, Li.

Neuroimage ; 280: 120346, 2023 Oct 15.

Article in English | MEDLINE | ID: mdl-37634885

ABSTRACT

Alzheimer's disease (AD) is one of the most common neurodegenerative diseases. However, the AD mechanism has not yet been fully elucidated, hindering the development of effective therapies. In our work, we perform a brain imaging genomics study to link genetics, single-cell gene expression data, tissue-specific gene expression data, brain imaging-derived volumetric endophenotypes, and disease diagnosis to discover potential underlying neurobiological pathways for AD. To do so, we perform brain-wide genome-wide colocalization analyses to integrate multidimensional imaging genomic biobank data. Specifically, we use (1) the individual-level imputed genotyping data and magnetic resonance imaging (MRI) data from the UK Biobank, (2) the summary statistics of the genome-wide association study (GWAS) from multiple European ancestry cohorts, and (3) the tissue-specific cis-expression quantitative trait loci (cis-eQTL) summary statistics from the GTEx project. We apply a Bayes factor colocalization framework and mediation analysis to these multi-modal imaging genomic data. As a result, we derive the brain regional level GWAS summary statistics for 145 brain regions with 482,831 single nucleotide polymorphisms (SNPs) followed by post hoc functional annotations. Our analysis yields the discovery of a potential AD causal pathway from a systems biology perspective: the SNP chr10:124165615:G>A (rs6585827) mutation upregulates the expression of the BTBD16 gene in oligodendrocytes, a specialized type of glial cell, in the brain cortex, leading to a reduced risk of volumetric loss in the entorhinal cortex and resulting in a protective effect on AD. We substantiate our findings with multiple lines of evidence from existing imaging, genetic and genomic studies in the AD literature. Our study connects genetics, molecular and cellular signatures, regional brain morphologic endophenotypes, and AD diagnosis, providing new insights into the mechanistic understanding of the disease. Our findings can provide valuable guidance for subsequent therapeutic target identification and drug discovery in AD.

Subject(s)

Alzheimer Disease , Humans , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Bayes Theorem , Genome-Wide Association Study , Transcriptome , Brain/diagnostic imaging , Entorhinal Cortex

9.

Preference matrix guided sparse canonical correlation analysis for mining brain imaging genetic associations in Alzheimer's disease.

Sha, Jiahang; Bao, Jingxuan; Liu, Kefei; Yang, Shu; Wen, Zixuan; Wen, Junhao; Cui, Yuhan; Tong, Boning; Moore, Jason H; Saykin, Andrew J; Davatzikos, Christos; Long, Qi; Shen, Li.

Methods ; 218: 27-38, 2023 Oct.

Article in English | MEDLINE | ID: mdl-37507059

ABSTRACT

Investigating the relationship between genetic variation and phenotypic traits is a key issue in quantitative genetics. Specifically for Alzheimer's disease, the association between genetic markers and quantitative traits remains unclear; once identified, it will provide valuable guidance for the study and development of genetics-based treatment approaches. Currently, to analyze the association between two modalities, sparse canonical correlation analysis (SCCA) is commonly used to compute one sparse linear combination of the variable features for each modality, giving a pair of linear combination vectors that maximizes the cross-correlation between the analyzed modalities. One drawback of the plain SCCA model is that existing findings and knowledge cannot be integrated into the model as priors to help extract interesting correlations and identify biologically meaningful genetic and phenotypic markers. To bridge this gap, we introduce preference matrix guided SCCA (PM-SCCA), which not only takes priors encoded as a preference matrix but also maintains computational simplicity. A simulation study and a real-data experiment are conducted to investigate the effectiveness of the model. Both experiments demonstrate that the proposed PM-SCCA model can effectively capture not only genotype-phenotype correlation but also relevant features.

Subject(s)

Alzheimer Disease , Neuroimaging , Humans , Neuroimaging/methods , Canonical Correlation Analysis , Algorithms , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Brain , Magnetic Resonance Imaging

10.

The Genetic Architecture of Multimodal Human Brain Age.

Wen, Junhao; Zhao, Bingxin; Yang, Zhijian; Erus, Guray; Skampardoni, Ioanna; Mamourian, Elizabeth; Cui, Yuhan; Hwang, Gyujoon; Bao, Jingxuan; Boquet-Pujadas, Aleix; Zhou, Zhen; Veturi, Yogasudha; Ritchie, Marylyn D; Shou, Haochang; Thompson, Paul M; Shen, Li; Toga, Arthur W; Davatzikos, Christos.

bioRxiv ; 2023 Sep 30.

Article in English | MEDLINE | ID: mdl-37333190

ABSTRACT

The complex biological mechanisms underlying human brain aging remain incompletely understood, involving multiple body organs and chronic diseases. In this study, we used multimodal magnetic resonance imaging and artificial intelligence to examine the genetic architecture of the brain age gap (BAG) derived from gray matter volume (GM-BAG, N=31,557 European ancestry), white matter microstructure (WM-BAG, N=31,674), and functional connectivity (FC-BAG, N=32,017). We identified sixteen genomic loci that reached genome-wide significance (P-value<5×10⁻⁸). A gene-drug-disease network highlighted genes linked to GM-BAG for treating neurodegenerative and neuropsychiatric disorders and WM-BAG genes for cancer therapy. GM-BAG showed the highest heritability enrichment for genetic variants in conserved regions, whereas WM-BAG exhibited the highest heritability enrichment in the 5' untranslated regions; oligodendrocytes and astrocytes, but not neurons, showed significant heritability enrichment in WM and FC-BAG, respectively. Mendelian randomization identified potential causal effects of several exposure variables on brain aging, such as type 2 diabetes on GM-BAG (odds ratio=1.05 [1.01, 1.09], P-value=1.96×10⁻²) and AD on WM-BAG (odds ratio=1.04 [1.02, 1.05], P-value=7.18×10⁻⁵). Overall, our results provide valuable insights into the genetics of human brain aging, with clinical implications for potential lifestyle and therapeutic interventions. All results are publicly available at the MEDICINE knowledge portal: https://labs.loni.usc.edu/medicine.

11.

Comparing Amyloid Imaging Normalization Strategies for Alzheimer's Disease Classification using an Automated Machine Learning Pipeline.

Tong, Boning; Risacher, Shannon L; Bao, Jingxuan; Feng, Yanbo; Wang, Xinkai; Ritchie, Marylyn D; Moore, Jason H; Urbanowicz, Ryan; Saykin, Andrew J; Shen, Li.

AMIA Jt Summits Transl Sci Proc ; 2023: 525-533, 2023.

Article in English | MEDLINE | ID: mdl-37350880

ABSTRACT

Amyloid imaging has been widely used in Alzheimer's disease (AD) diagnosis and biomarker discovery through detecting the regional amyloid plaque density. Amyloid measures must be normalized by a reference region to reduce noise and artifacts. To explore an optimal normalization strategy, we employ an automated machine learning (AutoML) pipeline, STREAMLINE, to conduct the AD diagnosis binary classification and perform permutation-based feature importance analysis with thirteen machine learning models. In this work, we perform a comparative study to evaluate the prediction performance and biomarker discovery capability of three amyloid imaging measures, including one original measure and two normalized measures using two reference regions (i.e., the whole cerebellum and the composite reference region). Our AutoML results indicate that the composite reference region normalization dataset yields a higher balanced accuracy and identifies more AD-related regions based on the fractioned feature importance ranking.
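
Reference-region normalization of the kind compared in this abstract can be sketched as a simple ratio. The example assumes that normalization means dividing each regional uptake value by the mean uptake of the chosen reference region(s), which is the usual uptake-ratio convention; the region names, uptake values, and the makeup of the "composite" reference are hypothetical and are not taken from the paper.

```python
def normalize_by_reference(regional_uptake, reference_regions):
    """Divide every regional value by the mean uptake of the reference regions."""
    ref_values = [regional_uptake[name] for name in reference_regions]
    ref_mean = sum(ref_values) / len(ref_values)
    if ref_mean == 0:
        raise ValueError("reference region uptake must be non-zero")
    return {name: value / ref_mean for name, value in regional_uptake.items()}


# Hypothetical raw uptake values for one scan.
uptake = {
    "precuneus": 3.0,
    "frontal": 2.5,
    "cerebellum": 2.0,
    "white_matter": 3.0,
}

# Two alternative normalizations of the same scan.
by_cerebellum = normalize_by_reference(uptake, ["cerebellum"])
by_composite = normalize_by_reference(uptake, ["cerebellum", "white_matter"])
```

The same scan yields different feature values under each reference choice, which is why the choice can change downstream classification accuracy and feature importance rankings.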

12.

Exploring Automated Machine Learning for Cognitive Outcome Prediction from Multimodal Brain Imaging using STREAMLINE.

Wang, Xinkai; Feng, Yanbo; Tong, Boning; Bao, Jingxuan; Ritchie, Marylyn D; Saykin, Andrew J; Moore, Jason H; Urbanowicz, Ryan; Shen, Li.

AMIA Jt Summits Transl Sci Proc ; 2023: 544-553, 2023.

Article in English | MEDLINE | ID: mdl-37350896

ABSTRACT

STREAMLINE is a simple, transparent, end-to-end automated machine learning (AutoML) pipeline for easily conducting rigorous machine learning (ML) modeling and analysis. The initial version is limited to binary classification. In this work, we extend STREAMLINE through implementing multiple regression-based ML models, including linear regression, elastic net, group lasso, and L21 norm. We demonstrate the effectiveness of the regression version of STREAMLINE by applying it to the prediction of Alzheimer's disease (AD) cognitive outcomes using multimodal brain imaging data. Our empirical results demonstrate the feasibility and effectiveness of the newly expanded STREAMLINE as an AutoML pipeline for evaluating AD regression models, and for discovering multimodal imaging biomarkers.

13.

Integrative analysis of multi-omics and imaging data with incorporation of biological information via structural Bayesian factor analysis.

Bao, Jingxuan; Chang, Changgee; Zhang, Qiyiwen; Saykin, Andrew J; Shen, Li; Long, Qi.

Brief Bioinform ; 24(2), 2023 Mar 19.

Article in English | MEDLINE | ID: mdl-36882008

ABSTRACT

MOTIVATION: With the rapid development of modern technologies, massive data are available for the systematic study of Alzheimer's disease (AD). Though many existing AD studies mainly focus on single-modality omics data, multi-omics datasets can provide a more comprehensive understanding of AD. To bridge this gap, we proposed a novel structural Bayesian factor analysis framework (SBFA) to extract the information shared by multi-omics data through the aggregation of genotyping data, gene expression data, neuroimaging phenotypes and prior biological network knowledge. Our approach can extract common information shared by different modalities and encourage biologically related features to be selected, guiding future AD research in a biologically meaningful way. METHOD: Our SBFA model decomposes the mean parameters of the data into a sparse factor loading matrix and a factor matrix, where the factor matrix represents the common information extracted from multi-omics and imaging data. Our framework is designed to incorporate prior biological network information. Our simulation study demonstrated that our proposed SBFA framework could achieve the best performance compared with the other state-of-the-art factor-analysis-based integrative analysis methods. RESULTS: We apply our proposed SBFA model together with several state-of-the-art factor analysis models to extract the latent common information from genotyping, gene expression and brain imaging data simultaneously from the ADNI biobank database. The latent information is then used to predict the functional activities questionnaire score, an important measurement for diagnosis of AD quantifying subjects' abilities in daily life. Our SBFA model shows the best prediction performance compared with the other factor analysis models. AVAILABILITY: Code is publicly available at https://github.com/JingxuanBao/SBFA. CONTACT: qlong@upenn.edu.

Subject(s)

Multiomics , Neuroimaging , Bayes Theorem , Neuroimaging/methods , Brain/diagnostic imaging , Phenotype

14.

Gene-SGAN: a method for discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering.

Yang, Zhijian; Wen, Junhao; Abdulkadir, Ahmed; Cui, Yuhan; Erus, Guray; Mamourian, Elizabeth; Melhem, Randa; Srinivasan, Dhivya; Govindarajan, Sindhuja T; Chen, Jiong; Habes, Mohamad; Masters, Colin L; Maruff, Paul; Fripp, Jurgen; Ferrucci, Luigi; Albert, Marilyn S; Johnson, Sterling C; Morris, John C; LaMontagne, Pamela; Marcus, Daniel S; Benzinger, Tammie L S; Wolk, David A; Shen, Li; Bao, Jingxuan; Resnick, Susan M; Shou, Haochang; Nasrallah, Ilya M; Davatzikos, Christos.

ArXiv ; 2023 Jan 25.

Article in English | MEDLINE | ID: mdl-36748000

ABSTRACT

Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and SNP data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subtyping and endophenotype discovery, and is herein tested on disease-related, genetically-driven neuroimaging phenotypes.

15.

Homological landscape of human brain functional sub-circuits.

Duong-Tran, Duy; Kaufmann, Ralph; Chen, Jiong; Wang, Xuan; Garai, Sumita; Xu, Frederick; Bao, Jingxuan; Amico, Enrico; Kaplan, Alan David; Petri, Giovanni; Goni, Joaquin; Zhao, Yize; Shen, Li.

bioRxiv ; 2023 Dec 22.

Article in English | MEDLINE | ID: mdl-38187668

ABSTRACT

Human whole-brain functional connectivity networks have been shown to exhibit both local/quasilocal (e.g., set of functional sub-circuits induced by node or edge attributes) and non-local (e.g., higher-order functional coordination patterns) properties. Nonetheless, the non-local properties of topological strata induced by local/quasilocal functional sub-circuits have yet to be addressed. To that end, we proposed a homological formalism that enables the quantification of higher-order characteristics of human brain functional sub-circuits. Our results indicated that each homological order uniquely unravels diverse, complementary properties of human brain functional sub-circuits. Notably, the H1 homological distance between rest and motor task was observed at both the whole-brain and sub-circuit consolidated levels, which suggests a self-similarity property of human brain functional connectivity unraveled by the homological kernel. Furthermore, at the whole-brain level, the rest-task differentiation was found to be most prominent between rest and different tasks at different homological orders: i) Emotion task H0, ii) Motor task H1, and iii) Working memory task H2. At the functional sub-circuit level, the rest-task functional dichotomy of the default mode network was found to be most prominent at the first and second homological scaffolds. Also at this scale, we found that the limbic network plays a significant role in homological reconfiguration across both task and subject domains, which sheds light on subsequent investigations of the complex neurophysiological role of this network. From a wider perspective, our formalism can be applied, beyond brain connectomics, to study non-localized coordination patterns of localized structures stretching across complex network fibers.

16.

Mining High-Level Imaging Genetic Associations via Clustering AD Candidate Variants with Similar Brain Association Patterns.

Wu, Ruiming; Bao, Jingxuan; Kim, Mansu; Saykin, Andrew J; Moore, Jason H; Shen, Li.

Genes (Basel) ; 13(9)2022 08 24.

Article in English | MEDLINE | ID: mdl-36140686

ABSTRACT

Brain imaging genetics examines associations between imaging quantitative traits (QTs) and genetic factors such as single nucleotide polymorphisms (SNPs) to provide important insights into the pathogenesis of Alzheimer's disease (AD). The individual-level SNP-QT signals are high dimensional and typically have small effect sizes, making them hard to detect and replicate. To overcome this limitation, this work proposes a new approach that identifies high-level imaging genetic associations by applying multigraph clustering to the SNP-QT association maps. Given an SNP set and a brain QT set, the association between each SNP and each QT is evaluated using a linear regression model. Based on the resulting SNP-QT association map, five SNP-SNP similarity networks (or graphs) are created using five different scoring functions, respectively. Multigraph clustering is applied to these networks to identify SNP clusters with similar association patterns with all the brain QTs. After that, functional annotation is performed for each identified SNP cluster and its corresponding brain association pattern. We applied this pipeline to an AD imaging genetic study, which yielded promising results. For example, in an association study between 54 AD SNPs and 116 amyloid QTs, we identified two SNP clusters, one responsible for amyloid beta clearance and the other regulating amyloid beta formation. These high-level findings have the potential to provide valuable insights into relevant genetic pathways and brain circuits, which can help form new hypotheses for more detailed imaging and genetics studies in independent cohorts.
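
The first two steps described in this abstract (a per-pair linear regression giving an SNP-QT association map, then an SNP-SNP similarity network) can be illustrated with a minimal sketch. The abstract does not name its five scoring functions, so the Pearson correlation used here is only an assumed stand-in, the function names are hypothetical, and the data are synthetic; the multigraph clustering step is not shown.

```python
import numpy as np

def association_map(snps, qts):
    """Slope of the simple regression QT ~ SNP for every SNP-QT pair.

    snps: (n_subjects, n_snps) genotype dosages
    qts:  (n_subjects, n_qts) imaging quantitative traits
    Returns an (n_snps, n_qts) association map.
    """
    snps_c = snps - snps.mean(axis=0)
    qts_c = qts - qts.mean(axis=0)
    cov = snps_c.T @ qts_c             # cross-products, (n_snps, n_qts)
    var = (snps_c ** 2).sum(axis=0)    # per-SNP sum of squares
    return cov / var[:, None]

def snp_similarity(assoc):
    """One possible scoring function (an assumption, not the paper's):
    Pearson correlation between the association profiles of two SNPs."""
    return np.corrcoef(assoc)

# Synthetic demo data (not from the study): 200 subjects, 6 SNPs, 4 QTs.
rng = np.random.default_rng(0)
snps = rng.integers(0, 3, size=(200, 6)).astype(float)
qts = rng.normal(size=(200, 4))
qts[:, 0] += 0.5 * snps[:, 0]          # plant one SNP-QT effect

assoc = association_map(snps, qts)     # shape (6, 4)
sim = snp_similarity(assoc)            # shape (6, 6), one similarity graph
```

Each additional scoring function would yield another (n_snps, n_snps) matrix, and the resulting set of graphs is what a multigraph clustering method would take as input.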

Subject(s)

Alzheimer Disease , Algorithms , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , Amyloid beta-Peptides , Brain/metabolism , Cluster Analysis , Humans , Neuroimaging/methods

17.

Identifying genes associated with brain volumetric differences through tissue specific transcriptomic inference from GWAS summary data.

Mai, Hung; Bao, Jingxuan; Thompson, Paul M; Kim, Dokyoon; Shen, Li.

BMC Bioinformatics ; 23(Suppl 3): 398, 2022 Sep 28.

Article in English | MEDLINE | ID: mdl-36171548

ABSTRACT

BACKGROUND: Brain volume has been widely studied in the neuroimaging field, since it is an important and heritable trait associated with brain development, aging and various neurological and psychiatric disorders. Genome-wide association studies (GWAS) have successfully identified numerous associations between genetic variants such as single nucleotide polymorphisms and complex traits like brain volume. However, it is unclear how these genetic variations influence regional gene expression levels, which may subsequently lead to phenotypic changes. S-PrediXcan is a tissue-specific transcriptomic data analysis method that can be applied to bridge this gap. In this work, we perform an S-PrediXcan analysis on GWAS summary data from two large imaging genetics initiatives, the UK Biobank and Enhancing Neuroimaging Genetics through Meta Analysis, to identify tissue-specific transcriptomic effects on two closely related brain volume measures: total brain volume (TBV) and intracranial volume (ICV). RESULTS: As a result of the analysis, we identified 10 genes that are highly associated with both TBV and ICV. Nine out of 10 genes were found to be associated with TBV in another study using a different gene-based association analysis. Moreover, most of our discovered genes were also found to be correlated with multiple cognitive and behavioral traits. Further analyses revealed the protein-protein interactions, associated molecular pathways and biological functions that offer insight into how these genes function and interact with others. CONCLUSIONS: These results confirm that S-PrediXcan can identify genes with tissue-specific transcriptomic effects on complex traits. The analysis also suggested novel genes whose expression levels are related to brain volumetric traits. This provides important insights into the genetic mechanisms of the human brain.

Subject(s)

Genome-Wide Association Study , Transcriptome , Brain/diagnostic imaging , Genome-Wide Association Study/methods , Humans , Multifactorial Inheritance , Phenotype , Polymorphism, Single Nucleotide

18.

Identifying Alzheimer's genes via brain transcriptome mapping.

Baik, Jae Young; Kim, Mansu; Bao, Jingxuan; Long, Qi; Shen, Li.

BMC Med Genomics ; 15(Suppl 2): 116, 2022 05 19.

Article in English | MEDLINE | ID: mdl-35590321

ABSTRACT

BACKGROUND: Alzheimer's disease (AD) is one of the most common neurodegenerative disorders characterized by progressive decline in cognitive function. Targeted genetic analyses, genome-wide association studies, and imaging genetic analyses have been performed to detect AD risk and protective genes and have successfully identified dozens of AD susceptibility loci. Recently, brain imaging transcriptomics analyses have also been conducted to investigate the relationship between neuroimaging traits and gene expression measures to identify interesting gene-trait associations. These imaging transcriptomic studies typically do not involve the disease outcome in the analysis, and thus the identified brain or transcriptomic markers may not be related or specific to the disease outcome. RESULTS: We propose an innovative two-stage approach to identify genes whose expression profiles are related to diagnosis phenotype via brain transcriptome mapping. Specifically, we first map the effects of a diagnosis phenotype onto imaging traits across the brain using a linear regression model. Then, the gene-diagnosis association is assessed by spatially correlating the brain transcriptome map with the diagnostic effect map on the brain-wide imaging traits. To demonstrate the promise of our approach, we apply it to the integrative analysis of the brain transcriptome data from the Allen Human Brain Atlas (AHBA) and the amyloid imaging data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort. Our method identifies 12 genes whose brain-wide transcriptome patterns are highly correlated with six different diagnostic effect maps on the amyloid imaging traits. These 12 genes include four confirmatory findings (i.e., AD genes reported in DisGeNET) and eight novel genes that have not been associated with AD in DisGeNET.
CONCLUSION: We have proposed a novel disease-related brain transcriptomic mapping method to identify genes whose expression profiles are spatially correlated with regional diagnostic effects on a studied brain trait. Our empirical study on the AHBA and ADNI data shows the promise of the approach, and the resulting AD gene discoveries provide valuable information for better understanding biological pathways from transcriptomic signatures to intermediate brain traits and to phenotypic disease outcomes.
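
The two-stage idea in this abstract can be sketched roughly as follows: stage 1 regresses each regional imaging trait on diagnosis to obtain a diagnostic effect map, and stage 2 spatially correlates each gene's regional expression with that map. This is only an illustration on synthetic data. The function names are hypothetical, covariates and significance testing are omitted, and Pearson correlation is an assumed choice of spatial correlation.

```python
import numpy as np

def diagnostic_effect_map(diagnosis, traits):
    """Stage 1: per-region slope of the regression trait ~ diagnosis.

    diagnosis: (n_subjects,) coded 0/1
    traits:    (n_subjects, n_regions) regional imaging measures
    Returns an (n_regions,) effect map.
    """
    d = diagnosis - diagnosis.mean()
    t = traits - traits.mean(axis=0)
    return (d @ t) / (d @ d)

def gene_diagnosis_correlation(expression, effect_map):
    """Stage 2: Pearson correlation across regions between each gene's
    expression profile and the effect map.

    expression: (n_genes, n_regions) regional expression levels
    """
    e = expression - expression.mean(axis=1, keepdims=True)
    m = effect_map - effect_map.mean()
    return (e @ m) / (np.linalg.norm(e, axis=1) * np.linalg.norm(m))

# Synthetic demo data (not from AHBA or ADNI): 100 subjects, 8 regions, 5 genes.
rng = np.random.default_rng(1)
diagnosis = np.repeat([0.0, 1.0], 50)
traits = rng.normal(size=(100, 8)) + np.outer(diagnosis, np.linspace(0, 1, 8))

effect_map = diagnostic_effect_map(diagnosis, traits)
expression = rng.normal(size=(5, 8))
expression[0] = 2.0 * effect_map + 1.0   # gene 0 tracks the effect map exactly

r = gene_diagnosis_correlation(expression, effect_map)
```

In this toy setup, gene 0 is constructed as a linear function of the effect map, so its spatial correlation is 1, while the other genes are random noise.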

Subject(s)

Alzheimer Disease , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Brain/diagnostic imaging , Genome-Wide Association Study , Humans , Transcriptome

19.

Characterizing Heterogeneity in Neuroimaging, Cognition, Clinical Symptoms, and Genetics Among Patients With Late-Life Depression.

Wen, Junhao; Fu, Cynthia H Y; Tosun, Duygu; Veturi, Yogasudha; Yang, Zhijian; Abdulkadir, Ahmed; Mamourian, Elizabeth; Srinivasan, Dhivya; Skampardoni, Ioanna; Singh, Ashish; Nawani, Hema; Bao, Jingxuan; Erus, Guray; Shou, Haochang; Habes, Mohamad; Doshi, Jimit; Varol, Erdem; Mackin, R Scott; Sotiras, Aristeidis; Fan, Yong; Saykin, Andrew J; Sheline, Yvette I; Shen, Li; Ritchie, Marylyn D; Wolk, David A; Albert, Marilyn; Resnick, Susan M; Davatzikos, Christos.

JAMA Psychiatry ; 79(5): 464-474, 2022 05 01.

Article in English | MEDLINE | ID: mdl-35262657

ABSTRACT

Importance: Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity might aid in elucidating etiological mechanisms and support precision and individualized medicine. Objective: To cross-sectionally and longitudinally delineate disease-related heterogeneity in LLD associated with neuroanatomy, cognitive functioning, clinical symptoms, and genetic profiles. Design, Setting, and Participants: The Imaging-Based Coordinate System for Aging and Neurodegenerative Diseases (iSTAGING) study is an international multicenter consortium investigating brain aging in pooled and harmonized data from 13 studies with more than 35 000 participants, including a subset of individuals with major depressive disorder. Multimodal data from a multicenter sample (N = 996), including neuroimaging, neurocognitive assessments, and genetics, were analyzed in this study. A semisupervised clustering method (heterogeneity through discriminative analysis) was applied to regional gray matter (GM) brain volumes to derive dimensional representations. Data were collected from July 2017 to July 2020 and analyzed from July 2020 to December 2021. Main Outcomes and Measures: Two dimensions were identified to delineate LLD-associated heterogeneity in voxelwise GM maps, white matter (WM) fractional anisotropy, neurocognitive functioning, clinical phenotype, and genetics. Results: A total of 501 participants with LLD (mean [SD] age, 67.39 [5.56] years; 332 women) and 495 healthy control individuals (mean [SD] age, 66.53 [5.16] years; 333 women) were included. Patients in dimension 1 demonstrated relatively preserved brain anatomy without WM disruptions relative to healthy control individuals. In contrast, patients in dimension 2 showed widespread brain atrophy and WM integrity disruptions, along with cognitive impairment and higher depression severity.
Moreover, 1 de novo independent genetic variant (rs13120336; chromosome: 4, 186387714; minor allele, G) was significantly associated with dimension 1 (odds ratio, 2.35; SE, 0.15; P = 3.14 × 10⁻⁸) but not with dimension 2. The 2 dimensions demonstrated significant single-nucleotide variant-based heritability of 18% to 27% within the general population (N = 12 518 in UK Biobank). In a subset of individuals having longitudinal measurements, those in dimension 2 experienced a more rapid longitudinal change in GM and brain age (Cohen f² = 0.03; P = .02) and were more likely to progress to Alzheimer disease (Cohen f² = 0.03; P = .03) compared with those in dimension 1 (N = 1431 participants and 7224 scans from the Alzheimer's Disease Neuroimaging Initiative [ADNI], Baltimore Longitudinal Study of Aging [BLSA], and Biomarkers for Older Controls at Risk for Dementia [BIOCARD] data sets). Conclusions and Relevance: This study characterized heterogeneity in LLD into 2 dimensions with distinct neuroanatomical, cognitive, clinical, and genetic profiles. This dimensional approach provides a potential mechanism for investigating the heterogeneity of LLD and the relevance of the latent dimensions to possible disease mechanisms, clinical outcomes, and responses to interventions.

Subject(s)

Alzheimer Disease , Depressive Disorder, Major , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Brain/diagnostic imaging , Cognition , Depression , Depressive Disorder, Major/diagnostic imaging , Depressive Disorder, Major/genetics , Female , Humans , Longitudinal Studies , Magnetic Resonance Imaging/methods , Male , Neuroimaging

20.

Identifying imaging genetic associations via regional morphometricity estimation.

Bao, Jingxuan; Wen, Zixuan; Kim, Mansu; Saykin, Andrew J; Thompson, Paul M; Zhao, Yize; Shen, Li.

Pac Symp Biocomput ; 27: 97-108, 2022.

Article in English | MEDLINE | ID: mdl-34890140

ABSTRACT

Brain imaging genetics is an emerging research field aiming to reveal the genetic basis of brain traits captured by imaging data. Inspired by heritability analysis, the concept of morphometricity was recently introduced to assess trait association with whole brain morphology. In this study, we extend the concept of morphometricity from its original definition at the whole brain level to a more focal level based on a region of interest (ROI). We propose a novel framework to identify the SNP-ROI association via regional morphometricity estimation of each studied single nucleotide polymorphism (SNP). We perform an empirical study on the structural MRI and genotyping data from a landmark Alzheimer's disease (AD) biobank and obtain promising results. Our findings indicate that the AD-related SNPs have higher overall regional morphometricity estimates than the SNPs not yet related to AD. This observation suggests that the variance of AD SNPs can be explained more by regional morphometric features than that of non-AD SNPs, supporting the value of imaging traits as targets in studying AD genetics. Also, we identified 11 ROIs where the AD/non-AD status of SNPs and the significance of the corresponding morphometricity estimates show strong dependency. Supplementary motor area (SMA) and dorsolateral prefrontal cortex (DPC) are enriched by these ROIs. Our results also demonstrate that using all the detailed voxel-level measures within the ROI to incorporate morphometric information outperforms using only a single average ROI measure, and thus provides improved power to detect imaging genetic associations.

Subject(s)

Alzheimer Disease , Dorsolateral Prefrontal Cortex , Alzheimer Disease/diagnostic imaging , Alzheimer Disease/genetics , Brain/diagnostic imaging , Computational Biology , Humans , Neuroimaging , Polymorphism, Single Nucleotide
