Search | BVS Regional Portal

Performance of a Shotgun Prediction Model for Colorectal Cancer When Using 16S rRNA Sequencing Data.

Ramon, Elies; Obón-Santacana, Mireia; Khannous-Lleiffe, Olfat; Saus, Ester; Gabaldón, Toni; Guinó, Elisabet; Bars-Cortina, David; Ibáñez-Sanz, Gemma; Rodríguez-Alonso, Lorena; Mata, Alfredo; García-Rodríguez, Ana; Moreno, Victor.

Int J Mol Sci ; 25(2), 2024 Jan 18.

Article in English | MEDLINE | ID: mdl-38256252

ABSTRACT

Colorectal cancer (CRC), the third most common cancer globally, has shown links to disturbed gut microbiota. While significant efforts have been made to establish a microbial signature indicative of CRC using shotgun metagenomic sequencing, the challenge lies in validating this signature with 16S ribosomal RNA (16S) gene sequencing. The primary obstacle is reconciling the differing outputs of these two methodologies, which often lead to divergent statistical models and conclusions. In this study, we introduce an algorithm designed to bridge this gap by mapping shotgun-derived taxa to their 16S counterparts. This mapping enables us to assess the predictive performance of a shotgun-based microbiome signature using 16S data. Our results demonstrate a reduction in performance when applying the 16S-mapped taxa in the shotgun prediction model, though it retains statistical significance. This suggests that while an exact match between shotgun and 16S data may not yet be feasible, our approach provides a viable method for comparative analysis and validation in the context of CRC-associated microbiome research.
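The abstract does not describe the mapping algorithm itself, so the following is only a rough illustration of the general idea and not the authors' method. It assumes that shotgun taxa are given as species-level lineage strings, that the 16S table is resolved at a coarser rank, and that each shotgun taxon is matched to the 16S taxon sharing its longest lineage prefix. The function names, the lineage format, and the toy abundances are all hypothetical.

```python
def map_to_16s(shotgun_lineage, taxa_16s):
    """Return the 16S taxon sharing the deepest lineage prefix, or None."""
    ranks = shotgun_lineage.split(";")
    # Try the full lineage first, then drop one rank at a time.
    for depth in range(len(ranks), 0, -1):
        candidate = ";".join(ranks[:depth])
        if candidate in taxa_16s:
            return candidate
    return None


def features_from_16s(signature_taxa, abundances_16s):
    """Build the shotgun model's inputs from a 16S abundance table.

    Each signature taxon receives the abundance of its 16S counterpart,
    or 0.0 when no counterpart is found (an assumption of this sketch).
    """
    features = {}
    for taxon in signature_taxa:
        match = map_to_16s(taxon, abundances_16s)
        features[taxon] = abundances_16s[match] if match is not None else 0.0
    return features


# Toy data (hypothetical lineages and relative abundances).
signature = [
    "k__Bacteria;g__Fusobacterium;s__Fusobacterium_nucleatum",
    "k__Archaea;g__Unknown;s__Unknown_sp",
]
table_16s = {
    "k__Bacteria;g__Fusobacterium": 0.03,
    "k__Bacteria;g__Parvimonas": 0.01,
}
print(features_from_16s(signature, table_16s))
```

In this simplified form, two shotgun species from the same genus would both receive the same genus-level abundance. That loss of resolution is one plausible reason for a model trained on shotgun data to perform worse on 16S-mapped inputs.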

Subjects

Colorectal Neoplasms, Gastrointestinal Microbiome, Humans, 16S Ribosomal RNA/genetics, Algorithms, Gastrointestinal Microbiome/genetics, Health Personnel, Colorectal Neoplasms/genetics

kernInt: A Kernel Framework for Integrating Supervised and Unsupervised Analyses in Spatio-Temporal Metagenomic Datasets.

Ramon, Elies; Belanche-Muñoz, Lluís; Molist, Francesc; Quintanilla, Raquel; Pérez-Enciso, Miguel; Ramayo-Caldas, Yuliaxis.

Front Microbiol ; 12: 609048, 2021.

Article in English | MEDLINE | ID: mdl-33584612

ABSTRACT

The advent of next-generation sequencing technologies allowed relative quantification of microbiome communities and their spatial and temporal variation. In recent years, supervised learning (i.e., prediction of a phenotype of interest) from taxonomic abundances has become increasingly common in the microbiome field. However, a gap exists between supervised and classical unsupervised analyses, based on computing ecological dissimilarities for visualization or clustering. Despite this, both approaches face common challenges, like the compositional nature of next-generation sequencing data or the integration of the spatial and temporal dimensions. Here we propose a kernel framework to place on a common ground the unsupervised and supervised microbiome analyses, including the retrieval of microbial signatures (taxa importances). We define two compositional kernels (Aitchison-RBF and compositional linear) and discuss how to transform non-compositional beta-dissimilarity measures into kernels. Spatial data is integrated with multiple kernel learning, while longitudinal data is evaluated by specific kernels. We illustrate our framework through a single point soil dataset, a human dataset with a spatial component, and a previously unpublished longitudinal dataset concerning pig production. The proposed framework and the case studies are freely available in the kernInt package at https://github.com/elies-ramon/kernInt.
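The two compositional kernels named in the abstract can be sketched as follows, assuming the usual definitions: the compositional linear kernel as the inner product of centered log-ratio (clr) transformed samples, and the Aitchison-RBF kernel as an RBF kernel on the Aitchison distance (the Euclidean distance between clr vectors). This is not the kernInt API; the function names, the pseudocount used to handle zeros, and the `gamma` value are assumptions made for illustration.

```python
import math


def clr(x, pseudocount=1.0):
    """Centered log-ratio transform of a vector of counts or proportions."""
    logs = [math.log(v + pseudocount) for v in x]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


def compositional_linear(x, y, pseudocount=1.0):
    """Inner product of the clr-transformed samples."""
    cx, cy = clr(x, pseudocount), clr(y, pseudocount)
    return sum(a * b for a, b in zip(cx, cy))


def aitchison_rbf(x, y, gamma=0.1, pseudocount=1.0):
    """RBF kernel on the squared Aitchison distance between two samples."""
    cx, cy = clr(x, pseudocount), clr(y, pseudocount)
    squared_distance = sum((a - b) ** 2 for a, b in zip(cx, cy))
    return math.exp(-gamma * squared_distance)


# Toy taxon counts for two samples (hypothetical values).
sample_a = [120, 30, 0, 50]
sample_b = [80, 60, 10, 50]
print(compositional_linear(sample_a, sample_b))
print(aitchison_rbf(sample_a, sample_b))
```

Because clr vectors are invariant to rescaling of a strictly positive sample, both kernels depend only on the relative abundances when no pseudocount is needed, which is what makes them suitable for compositional sequencing data.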

HIV drug resistance prediction with weighted categorical kernel functions.

Ramon, Elies; Belanche-Muñoz, Lluís; Pérez-Enciso, Miguel.

BMC Bioinformatics ; 20(1): 410, 2019 Jul 30.

Article in English | MEDLINE | ID: mdl-31362714

ABSTRACT

BACKGROUND: Antiretroviral drugs are a very effective therapy against HIV infection. However, the high mutation rate of HIV permits the emergence of variants that can be resistant to the drug treatment. Predicting drug resistance to previously unobserved variants is therefore very important for an optimum medical treatment. In this paper, we propose the use of weighted categorical kernel functions to predict drug resistance from virus sequence data. These kernel functions are very simple to implement and are able to take into account HIV data particularities, such as allele mixtures, and to weigh the different importance of each protein residue, as it is known that not all positions contribute equally to the resistance. RESULTS: We analyzed 21 drugs of four classes: protease inhibitors (PI), integrase inhibitors (INI), nucleoside reverse transcriptase inhibitors (NRTI) and non-nucleoside reverse transcriptase inhibitors (NNRTI). We compared two categorical kernel functions, Overlap and Jaccard, against two well-known noncategorical kernel functions (Linear and RBF) and Random Forest (RF). Weighted versions of these kernels were also considered, where the weights were obtained from the RF decrease in node impurity. The Jaccard kernel was the best method, either in its weighted or unweighted form, for 20 out of the 21 drugs. CONCLUSIONS: Results show that kernels that take into account both the categorical nature of the data and the presence of mixtures consistently result in the best prediction model. The advantage of including weights depended on the protein targeted by the drug. In the case of reverse transcriptase, weights based on the relative importance of each position clearly increased the prediction performance, while the improvement in the protease was much smaller. This seems to be related to the distribution of weights, as measured by the Gini index. All methods described, together with documentation and examples, are freely available at https://bitbucket.org/elies_ramon/catkern.
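A minimal sketch of the two categorical kernels may help make the comparison concrete. It assumes the simplest form of each: Overlap as the weighted fraction of positions with identical residues, and Jaccard as the weighted mean of per-position Jaccard indices, where each position holds a set of alleles so that mixtures are represented. The exact formulation in the catkern code may differ, and the function names, sequences, and weights below are hypothetical (in the paper the weights come from the RF decrease in node impurity).

```python
def overlap_kernel(x, y, weights=None):
    """Weighted fraction of positions whose residues match exactly."""
    w = weights or [1.0] * len(x)
    matched = sum(wi for wi, a, b in zip(w, x, y) if a == b)
    return matched / sum(w)


def jaccard_kernel(x, y, weights=None):
    """Weighted mean of per-position Jaccard indices.

    Each position is a non-empty set of alleles, so a mixture such as
    {"K", "R"} partially matches a pure {"K"}.
    """
    w = weights or [1.0] * len(x)
    total = 0.0
    for wi, a, b in zip(w, x, y):
        total += wi * len(a & b) / len(a | b)
    return total / sum(w)


# Toy three-residue sequences; the second position of seq_1 is a mixture.
seq_1 = [{"M"}, {"K", "R"}, {"V"}]
seq_2 = [{"M"}, {"K"}, {"I"}]
position_weights = [2.0, 1.0, 1.0]  # hypothetical importances

print(overlap_kernel(seq_1, seq_2))                    # mixture counts as a mismatch
print(jaccard_kernel(seq_1, seq_2))                    # mixture gets partial credit
print(jaccard_kernel(seq_1, seq_2, position_weights))  # first position counts double
```

The toy example shows why a set-based kernel can use information that an exact-match kernel discards: the mixture at the second position contributes nothing under Overlap but half a match under Jaccard.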

Subjects

Algorithms, Computational Biology/methods, Viral Drug Resistance/genetics, HIV-1/genetics, Anti-HIV Agents/pharmacology, Viral Drug Resistance/drug effects, HIV Infections/virology, HIV-1/drug effects, HIV-1/isolation & purification, Humans, Linear Models, Principal Component Analysis
