Your browser doesn't support javascript.
loading
Estimating gene expression from DNA methylation and copy number variation: A deep learning regression model for multi-omics integration.
Seal, Dibyendu Bikash; Das, Vivek; Goswami, Saptarsi; De, Rajat K.
Afiliação
  • Seal DB; A. K. Choudhury School of Information Technology, University of Calcutta, JD-2, Sector III, Salt Lake City, Kolkata 700106, India.
  • Das V; Novo Nordisk Research Center Seattle, Inc., 530 Fairview Ave N # 5000, Seattle, WA 98109, United States.
  • Goswami S; Bangabasi Morning College, 35 Rajkumar Chakraborty Sarani, Scott Ln, Kolkata 700009, India.
  • De RK; Machine Intelligence Unit, Indian Statistical Institute, 203 Barrackpore Trunk Road, Kolkata 700108, India. Electronic address: rajat@isical.ac.in.
Genomics ; 112(4): 2833-2841, 2020 07.
Article em En | MEDLINE | ID: mdl-32234433
Gene expression analysis plays a significant role for providing molecular insights in cancer. Various genetic and epigenetic factors (being dealt under multi-omics) affect gene expression giving rise to cancer phenotypes. A recent growth in understanding of multi-omics seems to provide a resource for integration in interdisciplinary biology since they altogether can draw the comprehensive picture of an organism's developmental and disease biology in cancers. Such large scale multi-omics data can be obtained from public consortium like The Cancer Genome Atlas (TCGA) and several other platforms. Integrating these multi-omics data from varied platforms is still challenging due to high noise and sensitivity of the platforms used. Currently, a robust integrative predictive model to estimate gene expression from these genetic and epigenetic data is lacking. In this study, we have developed a deep learning-based predictive model using Deep Denoising Auto-encoder (DDAE) and Multi-layer Perceptron (MLP) that can quantitatively capture how genetic and epigenetic alterations correlate with directionality of gene expression for liver hepatocellular carcinoma (LIHC). The DDAE used in the study has been trained to extract significant features from the input omics data to estimate the gene expression. These features have then been used for back-propagation learning by the multilayer perceptron for the task of regression and classification. We have benchmarked the proposed model against state-of-the-art regression models. Finally, the deep learning-based integration model has been evaluated for its disease classification capability, where an accuracy of 95.1% has been obtained.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Metilação de DNA / Variações do Número de Cópias de DNA / Aprendizado Profundo / RNA-Seq Tipo de estudo: Prognostic_studies Idioma: En Revista: Genomics Assunto da revista: GENETICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Índia País de publicação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Metilação de DNA / Variações do Número de Cópias de DNA / Aprendizado Profundo / RNA-Seq Tipo de estudo: Prognostic_studies Idioma: En Revista: Genomics Assunto da revista: GENETICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Índia País de publicação: Estados Unidos