Your browser doesn't support javascript.
loading
Robust prediction of colorectal cancer via gut microbiome 16S rRNA sequencing data.
Porreca, Annamaria; Ibrahimi, Eliana; Maturo, Fabrizio; Marcos Zambrano, Laura Judith; Meto, Melisa; Lopes, Marta B.
Afiliação
  • Porreca A; Department of Economics, Statistics and Business, Faculty of Economics and Law, Universitas Mercatorum, Rome, Italy.
  • Ibrahimi E; Department of Biology, University of Tirana, Tirana, Albania.
  • Maturo F; Department of Economics, Statistics and Business, Faculty of Technological and Innovation Sciences, Universitas Mercatorum, Rome, Italy.
  • Marcos Zambrano LJ; Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain.
  • Meto M; Department of Biology, University of Tirana, Tirana, Albania.
  • Lopes MB; Center for Mathematics and Applications (NOVA Math), NOVA School of Science and Technology, Caparica, Portugal.
J Med Microbiol ; 73(10)2024 Oct.
Article em En | MEDLINE | ID: mdl-39377779
ABSTRACT
Introduction. The study addresses the challenge of utilizing human gut microbiome data for the early detection of colorectal cancer (CRC). The research emphasizes the potential of using machine learning techniques to analyze complex microbiome datasets, providing a non-invasive approach to identifying CRC-related microbial markers.Hypothesis/Gap Statement. The primary hypothesis is that a robust machine learning-based analysis of 16S rRNA microbiome data can identify specific microbial features that serve as effective biomarkers for CRC detection, overcoming the limitations of classical statistical models in high-dimensional settings.Aim. The primary objective of this study is to explore and validate the potential of the human microbiome, specifically in the colon, as a valuable source of biomarkers for colorectal cancer (CRC) detection and progression. The focus is on developing a classifier that effectively predicts the presence of CRC and normal samples based on the analysis of three previously published faecal 16S rRNA sequencing datasets.Methodology. To achieve the aim, various machine learning techniques are employed, including random forest (RF), recursive feature elimination (RFE) and a robust correlation-based technique known as the fuzzy forest (FF). The study utilizes these methods to analyse the three datasets, comparing their performance in predicting CRC and normal samples. The emphasis is on identifying the most relevant microbial features (taxa) associated with CRC development via partial dependence plots, i.e. a machine learning tool focused on explainability, visualizing how a feature influences the predicted outcome.Results. The analysis of the three faecal 16S rRNA sequencing datasets reveals the consistent and superior predictive performance of the FF compared to the RF and RFE. Notably, FF proves effective in addressing the correlation problem when assessing the importance of microbial taxa in explaining the development of CRC. The results highlight the potential of the human microbiome as a non-invasive means to detect CRC and underscore the significance of employing FF for improved predictive accuracy.Conclusion. In conclusion, this study underscores the limitations of classical statistical techniques in handling high-dimensional information such as human microbiome data. The research demonstrates the potential of the human microbiome, specifically in the colon, as a valuable source of biomarkers for CRC detection. Applying machine learning techniques, particularly the FF, is a promising approach for building a classifier to predict CRC and normal samples. The findings advocate for integrating FF to overcome the challenges associated with correlation when identifying crucial microbial features linked to CRC development.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: RNA Ribossômico 16S / Neoplasias Colorretais / Fezes / Microbioma Gastrointestinal / Aprendizado de Máquina Limite: Humans Idioma: En Revista: J Med Microbiol / J. medical microbiol / Journal of medical microbiology Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Itália País de publicação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: RNA Ribossômico 16S / Neoplasias Colorretais / Fezes / Microbioma Gastrointestinal / Aprendizado de Máquina Limite: Humans Idioma: En Revista: J Med Microbiol / J. medical microbiol / Journal of medical microbiology Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Itália País de publicação: Reino Unido