Your browser doesn't support javascript.
Single-cell multi-omics integration for unpaired data by a siamese network with graph-based contrastive loss.
Liu, Chaozhong; Wang, Linhua; Liu, Zhandong.
  • Liu C; Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, USA.
  • Wang L; Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, USA.
  • Liu Z; Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston, USA. zhandong.liu@bcm.edu.
BMC Bioinformatics ; 24(1): 5, 2023 Jan 04.
Article in English | MEDLINE | ID: covidwho-2196037
ABSTRACT

BACKGROUND:

Single-cell omics technology is rapidly developing to measure the epigenome, genome, and transcriptome across a range of cell types. However, it is still challenging to integrate omics data from different modalities. Here, we propose a variation of the Siamese neural network framework called MinNet, which is trained to integrate multi-omics data on the single-cell resolution by using graph-based contrastive loss.

RESULTS:

By training the model and testing it on several benchmark datasets, we showed its accuracy and generalizability in integrating scRNA-seq with scATAC-seq, and scRNA-seq with epitope data. Further evaluation demonstrated our model's unique ability to remove the batch effect, a common problem in actual practice. To show how the integration impacts downstream analysis, we established model-based smoothing and cis-regulatory element-inferring method and validated it with external pcHi-C evidence. Finally, we applied the framework to a COVID-19 dataset to bolster the original work with integration-based analysis, showing its necessity in single-cell multi-omics research.

CONCLUSIONS:

MinNet is a novel deep-learning framework for single-cell multi-omics sequencing data integration. It ranked top among other methods in benchmarking and is especially suitable for integrating datasets with batch and biological variances. With the single-cell resolution integration results, analysis of the interplay between genome and transcriptome can be done to help researchers understand their data and question.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 / Multiomics Type of study: Experimental Studies / Prognostic study Limits: Humans Language: English Journal: BMC Bioinformatics Journal subject: Medical Informatics Year: 2023 Document Type: Article Affiliation country: S12859-022-05126-7

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 / Multiomics Type of study: Experimental Studies / Prognostic study Limits: Humans Language: English Journal: BMC Bioinformatics Journal subject: Medical Informatics Year: 2023 Document Type: Article Affiliation country: S12859-022-05126-7