Robust pathway-based multi-omics data integration using directed random walks for survival prediction in multiple cancer studies.
Biol Direct
; 14(1): 8, 2019 04 29.
Article
en En
| MEDLINE
| ID: mdl-31036036
BACKGROUND: Integrating the rich information from multi-omics data has been a popular approach to survival prediction and bio-marker identification for several cancer studies. To facilitate the integrative analysis of multiple genomic profiles, several studies have suggested utilizing pathway information rather than using individual genomic profiles. METHODS: We have recently proposed an integrative directed random walk-based method utilizing pathway information (iDRW) for more robust and effective genomic feature extraction. In this study, we applied iDRW to multiple genomic profiles for two different cancers, and designed a directed gene-gene graph which reflects the interaction between gene expression and copy number data. In the experiments, the performances of the iDRW method and four state-of-the-art pathway-based methods were compared using a survival prediction model which classifies samples into two survival groups. RESULTS: The results show that the integrative analysis guided by pathway information not only improves prediction performance, but also provides better biological insights into the top pathways and genes prioritized by the model in both the neuroblastoma and the breast cancer datasets. The pathways and genes selected by the iDRW method were shown to be related to the corresponding cancers. CONCLUSIONS: In this study, we demonstrated the effectiveness of a directed random walk-based multi-omics data integration method applied to gene expression and copy number data for both breast cancer and neuroblastoma datasets. We revamped a directed gene-gene graph considering the impact of copy number variation on gene expression and redefined the weight initialization and gene-scoring method. The benchmark result for iDRW with four pathway-based methods demonstrated that the iDRW method improved survival prediction performance and jointly identified cancer-related pathways and genes for two different cancer datasets. REVIEWERS: This article was reviewed by Helena Molina-Abril and Marta Hidalgo.
Palabras clave
Texto completo:
1
Colección:
01-internacional
Base de datos:
MEDLINE
Asunto principal:
Neoplasias de la Mama
/
Regulación Neoplásica de la Expresión Génica
/
Genoma Humano
/
Variaciones en el Número de Copia de ADN
/
Neuroblastoma
Tipo de estudio:
Clinical_trials
/
Prognostic_studies
/
Risk_factors_studies
Límite:
Humans
Idioma:
En
Revista:
Biol Direct
Año:
2019
Tipo del documento:
Article
País de afiliación:
Corea del Sur
Pais de publicación:
Reino Unido