Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
1.
Microbiome ; 11(1): 231, 2023 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-37858269

RESUMO

BACKGROUND: With the emergence of metagenomic data, multiple links between the gut microbiome and the host health have been shown. Deciphering these complex interactions require evolved analysis methods focusing on the microbial ecosystem functions. Despite the fact that host or diet-derived fibres are the most abundant nutrients available in the gut, the presence of distinct functional traits regarding fibre and mucin hydrolysis, fermentation and hydrogenotrophic processes has never been investigated. RESULTS: After manually selecting 91 KEGG orthologies and 33 glycoside hydrolases further aggregated in 101 functional descriptors representative of fibre and mucin degradation pathways in the gut microbiome, we used nonnegative matrix factorization to mine metagenomic datasets. Four distinct metabolic profiles were further identified on a training set of 1153 samples, thoroughly validated on a large database of 2571 unseen samples from 5 external metagenomic cohorts and confirmed with metatranscriptomic data. Profiles 1 and 2 are the main contributors to the fibre-degradation-related metagenome: they present contrasted involvement in fibre degradation and sugar metabolism and are differentially linked to dysbiosis, metabolic disease and inflammation. Profile 1 takes over Profile 2 in healthy samples, and unbalance of these profiles characterize dysbiotic samples. Furthermore, high fibre diet favours a healthy balance between profiles 1 and profile 2. Profile 3 takes over profile 2 during Crohn's disease, inducing functional reorientations towards unusual metabolism such as fucose and H2S degradation or propionate, acetone and butanediol production. Profile 4 gathers under-represented functions, like methanogenesis. Two taxonomic makes up of the profiles were investigated, using either the covariation of 203 prevalent genomes or metagenomic species, both providing consistent results in line with their functional characteristics. This taxonomic characterization showed that profiles 1 and 2 were respectively mainly composed of bacteria from the phyla Bacteroidetes and Firmicutes while profile 3 is representative of Proteobacteria and profile 4 of methanogens. CONCLUSIONS: Integrating anaerobic microbiology knowledge with statistical learning can narrow down the metagenomic analysis to investigate functional profiles. Applying this approach to fibre degradation in the gut ended with 4 distinct functional profiles that can be easily monitored as markers of diet, dysbiosis, inflammation and disease. Video Abstract.


Assuntos
Microbioma Gastrointestinal , Microbiota , Humanos , Microbioma Gastrointestinal/genética , Mucinas , Disbiose , Microbiota/genética , Metagenoma , Fibras na Dieta , Inflamação , Metagenômica/métodos
2.
Plant Methods ; 19(1): 54, 2023 Jun 07.
Artigo em Inglês | MEDLINE | ID: mdl-37287031

RESUMO

BACKGROUND: The time between the appearance of successive leaves, or phyllochron, characterizes the vegetative development of annual plants. Hypothesis testing models, which allow the comparison of phyllochrons between genetic groups and/or environmental conditions, are usually based on regression of thermal time on the number of leaves; most of the time a constant leaf appearance rate is assumed. However regression models ignore auto-correlation of the leaf number process and may lead to biased testing procedures. Moreover, the hypothesis of constant leaf appearance rate may be too restrictive. METHODS: We propose a stochastic process model in which emergence of new leaves is considered to result from successive time-to-events. This model provides a flexible and more accurate modeling as well as unbiased testing procedures. It was applied to an original maize dataset collected in the field over three years on plants originating from two divergent selection experiments for flowering time in two maize inbred lines. RESULTS AND CONCLUSION: We showed that the main differences in phyllochron were not observed between selection populations but rather between ancestral lines, years of experimentation and leaf ranks. Our results highlight a strong departure from the assumption of a constant leaf appearance rate over a season which could be related to climate variations, even if the impact of individual climate variables could not be clearly determined.

3.
PeerJ ; 10: e13525, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35769140

RESUMO

One of the difficulties encountered in the statistical analysis of metaproteomics data is the high proportion of missing values, which are usually treated by imputation. Nevertheless, imputation methods are based on restrictive assumptions regarding missingness mechanisms, namely "at random" or "not at random". To circumvent these limitations in the context of feature selection in a multi-class comparison, we propose a univariate selection method that combines a test of association between missingness and classes, and a test for difference of observed intensities between classes. This approach implicitly handles both missingness mechanisms. We performed a quantitative and qualitative comparison of our procedure with imputation-based feature selection methods on two experimental data sets, as well as simulated data with various scenarios regarding the missingness mechanisms and the nature of the difference of expression (differential intensity or differential presence). Whereas we observed similar performances in terms of prediction on the experimental data set, the feature ranking and selection from various imputation-based methods were strongly divergent. We showed that the combined test reaches a compromise by correlating reasonably with other methods, and remains efficient in all simulated scenarios unlike imputation-based feature selection methods.


Assuntos
Proteômica , Projetos de Pesquisa , Confiabilidade dos Dados
4.
Clin Microbiol Infect ; 28(1): 137.e1-137.e8, 2022 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-34111580

RESUMO

OBJECTIVES: Bacillus cereus is responsible for food poisoning and rare but severe clinical infections. The pathogenicity of strains varies from harmless to lethal strains. However, there are currently no markers, either alone or in combination, to differentiate pathogenic from non-pathogenic strains. The objective of the study was to identify new genetic biomarkers to differentiate non-pathogenic from clinically relevant B. cereus strains. METHODS: A first set of 15 B. cereus strains were compared by RNAseq. A logistic regression model with lasso penalty was applied to define combination of genes whose expression was associated with strain pathogenicity. The identified markers were checked for their presence/absence in a collection of 95 B. cereus strains with varying pathogenic potential (food-borne outbreaks, clinical and non-pathogenic). Receiver operating characteristic area under the curve (AUC) analysis was used to determine the combination of biomarkers, which best differentiate between the "disease" versus "non-disease" groups. RESULTS: Seven genes were identified during the RNAseq analysis with a prediction to differentiate between pathogenic and non-pathogenic strains. The validation of the presence/absence of these genes in a larger collection of strains coupled with AUC prediction showed that a combination of four biomarkers was sufficient to accurately discern clinical strains from harmless strains, with an AUC of 0.955, sensitivity of 0.9 and specificity of 0.86. CONCLUSIONS: These new findings help in the understanding of B. cereus pathogenic potential and complexity and may provide tools for a better assessment of the risks associated with B. cereus contamination to improve patient health and food safety.


Assuntos
Bacillus cereus , Microbiologia de Alimentos , Marcadores Genéticos , Bacillus cereus/genética , Bacillus cereus/isolamento & purificação , Filogenia , RNA-Seq , Virulência
5.
J Proteome Res ; 20(3): 1522-1534, 2021 03 05.
Artigo em Inglês | MEDLINE | ID: mdl-33528260

RESUMO

The gut microbiota are increasingly considered as a main partner of human health. Metaproteomics enables us to move from the functional potential revealed by metagenomics to the functions actually operating in the microbiome. However, metaproteome deciphering remains challenging. In particular, confident interpretation of a myriad of MS/MS spectra can only be pursued with smart database searches. Here, we compare the interpretation of MS/MS data sets from 48 individual human gut microbiomes using three interrogation strategies of the dedicated Integrated nonredundant Gene Catalog (IGC 9.9 million genes from 1267 individual fecal samples) together with the Homo sapiens database: the classical single-step interrogation strategy and two iterative strategies (in either two or three steps) aimed at preselecting a reduced-sized, more targeted search space for the final peptide spectrum matching. Both iterative searches outperformed the single-step classical search in terms of the number of peptides and protein clusters identified and the depth of taxonomic and functional knowledge, and this was the most convincing with the three-step approach. However, iterative searches do not help in reducing variability of repeated analyses, which is inherent to the traditional data-dependent acquisition mode, but this variability did not affect the hierarchical relationship between replicates and all other samples.


Assuntos
Microbioma Gastrointestinal , Microbiota , Microbioma Gastrointestinal/genética , Humanos , Metagenômica , Proteômica , Espectrometria de Massas em Tandem
6.
Sci Rep ; 10(1): 15880, 2020 Sep 23.
Artigo em Inglês | MEDLINE | ID: mdl-32968096

RESUMO

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

7.
Sci Rep ; 9(1): 9620, 2019 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-31270376

RESUMO

An integrated analysis of gut microbiota, blood biochemical and metabolome in 52 endurance horses was performed. Clustering by gut microbiota revealed the existence of two communities mainly driven by diet as host properties showed little effect. Community 1 presented lower richness and diversity, but higher dominance and rarity of species, including some pathobionts. Moreover, its microbiota composition was tightly linked to host blood metabolites related to lipid metabolism and glycolysis at basal time. Despite the lower fiber intake, community type 1 appeared more specialized to produce acetate as a mean of maintaining the energy supply as glucose concentrations fell during the race. On the other hand, community type 2 showed an enrichment of fibrolytic and cellulolytic bacteria as well as anaerobic fungi, coupled to a higher production of propionate and butyrate. The higher butyrate proportion in community 2 was not associated with protective effects on telomere lengths but could have ameliorated mucosal inflammation and oxidative status. The gut microbiota was neither associated with the blood biochemical markers nor metabolome during the endurance race, and did not provide a biomarker for race ranking or risk of failure to finish the race.

8.
Sci Rep ; 8(1): 5098, 2018 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-29572473

RESUMO

Enterococci, in particular vancomycin-resistant enterococci (VRE), are a leading cause of hospital-acquired infections. Promoting intestinal resistance against enterococci could reduce the risk of VRE infections. We investigated the effects of two Lactobacillus strains to prevent intestinal VRE. We used an intestinal colonisation mouse model based on an antibiotic-induced microbiota dysbiosis to mimic enterococci overgrowth and VRE persistence. Each Lactobacillus spp. was administered daily to mice starting one week before antibiotic treatment until two weeks after antibiotic and VRE inoculation. Of the two strains, Lactobacillus paracasei CNCM I-3689 decreased significantly VRE numbers in the feces demonstrating an improvement of the reduction of VRE. Longitudinal microbiota analysis showed that supplementation with L. paracasei CNCM I-3689 was associated with a better recovery of members of the phylum Bacteroidetes. Bile salt analysis and expression analysis of selected host genes revealed increased level of lithocholate and of ileal expression of camp (human LL-37) upon L. paracasei CNCM I-3689 supplementation. Although a direct effect of L. paracasei CNCM I-3689 on the VRE reduction was not ruled out, our data provide clues to possible anti-VRE mechanisms supporting an indirect anti-VRE effect through the gut microbiota. This work sustains non-antibiotic strategies against opportunistic enterococci after antibiotic-induced dysbiosis.


Assuntos
Bacteroidetes/fisiologia , Lacticaseibacillus paracasei/fisiologia , Probióticos/administração & dosagem , Enterococos Resistentes à Vancomicina/fisiologia , Animais , Antibacterianos/farmacologia , Bacteroidetes/efeitos dos fármacos , Clindamicina/farmacologia , Fezes/microbiologia , Microbioma Gastrointestinal/efeitos dos fármacos , Humanos , Intestinos/microbiologia , Masculino , Camundongos , Probióticos/farmacologia , Enterococos Resistentes à Vancomicina/efeitos dos fármacos
9.
PLoS Comput Biol ; 12(12): e1005252, 2016 12.
Artigo em Inglês | MEDLINE | ID: mdl-27984592

RESUMO

Whole Genome Shotgun (WGS) metagenomics is increasingly used to study the structure and functions of complex microbial ecosystems, both from the taxonomic and functional point of view. Gene inventories of otherwise uncultured microbial communities make the direct functional profiling of microbial communities possible. The concept of community aggregated trait has been adapted from environmental and plant functional ecology to the framework of microbial ecology. Community aggregated traits are quantified from WGS data by computing the abundance of relevant marker genes. They can be used to study key processes at the ecosystem level and correlate environmental factors and ecosystem functions. In this paper we propose a novel model based approach to infer combinations of aggregated traits characterizing specific ecosystemic metabolic processes. We formulate a model of these Combined Aggregated Functional Traits (CAFTs) accounting for a hierarchical structure of genes, which are associated on microbial genomes, further linked at the ecosystem level by complex co-occurrences or interactions. The model is completed with constraints specifically designed to exploit available genomic information, in order to favor biologically relevant CAFTs. The CAFTs structure, as well as their intensity in the ecosystem, is obtained by solving a constrained Non-negative Matrix Factorization (NMF) problem. We developed a multicriteria selection procedure for the number of CAFTs. We illustrated our method on the modelling of ecosystemic functional traits of fiber degradation by the human gut microbiota. We used 1408 samples of gene abundances from several high-throughput sequencing projects and found that four CAFTs only were needed to represent the fiber degradation potential. This data reduction highlighted biologically consistent functional patterns while providing a high quality preservation of the original data. Our method is generic and can be applied to other metabolic processes in the gut or in other ecosystems.


Assuntos
Microbioma Gastrointestinal/genética , Metagenômica/métodos , Algoritmos , Bactérias/genética , Bactérias/metabolismo , Metabolismo dos Carboidratos/genética , Metabolismo dos Carboidratos/fisiologia , Fibras na Dieta/metabolismo , Fezes/microbiologia , Fermentação , Humanos
10.
Sci Rep ; 6: 22932, 2016 Mar 10.
Artigo em Inglês | MEDLINE | ID: mdl-26960911

RESUMO

The adaptive response to extreme endurance exercise might involve transcriptional and translational regulation by microRNAs (miRNAs). Therefore, the objective of the present study was to perform an integrated analysis of the blood transcriptome and miRNome (using microarrays) in the horse before and after a 160 km endurance competition. A total of 2,453 differentially expressed genes and 167 differentially expressed microRNAs were identified when comparing pre- and post-ride samples. We used a hypergeometric test and its generalization to gain a better understanding of the biological functions regulated by the differentially expressed microRNA. In particular, 44 differentially expressed microRNAs putatively regulated a total of 351 depleted differentially expressed genes involved variously in glucose metabolism, fatty acid oxidation, mitochondrion biogenesis, and immune response pathways. In an independent validation set of animals, graphical Gaussian models confirmed that miR-21-5p, miR-181b-5p and miR-505-5p are candidate regulatory molecules for the adaptation to endurance exercise in the horse. To the best of our knowledge, the present study is the first to provide a comprehensive, integrated overview of the microRNA-mRNA co-regulation networks that may have a key role in controlling post-transcriptomic regulation during endurance exercise.


Assuntos
Cavalos/fisiologia , MicroRNAs/genética , Resistência Física/genética , RNA Mensageiro/genética , Animais , Biomarcadores/metabolismo , Regulação da Expressão Gênica , Cavalos/genética , MicroRNAs/isolamento & purificação , RNA Mensageiro/isolamento & purificação
11.
BMC Med Res Methodol ; 16: 28, 2016 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-26944545

RESUMO

BACKGROUND: The understanding of changes in temporal processes related to human carcinogenesis is limited. One approach for prospective functional genomic studies is to compile trajectories of differential expression of genes, based on measurements from many case-control pairs. We propose a new statistical method that does not assume any parametric shape for the gene trajectories. METHODS: The trajectory of a gene is defined as the curve representing the changes in gene expression levels in the blood as a function of time to cancer diagnosis. In a nested case-control design it consists of differences in gene expression levels between cases and controls. Genes can be grouped into curve groups, each curve group corresponding to genes with a similar development over time. The proposed new statistical approach is based on a set of hypothesis testing that can determine whether or not there is development in gene expression levels over time, and whether this development varies among different strata. Curve group analysis may reveal significant differences in gene expression levels over time among the different strata considered. This new method was applied as a "proof of concept" to breast cancer in the Norwegian Women and Cancer (NOWAC) postgenome cohort, using blood samples collected prospectively that were specifically preserved for transcriptomic analyses (PAX tube). Cohort members diagnosed with invasive breast cancer through 2009 were identified through linkage to the Cancer Registry of Norway, and for each case a random control from the postgenome cohort was also selected, matched by birth year and time of blood sampling, to create a case-control pair. After exclusions, 441 case-control pairs were available for analyses, in which we considered strata of lymph node status at time of diagnosis and time of diagnosis with respect to breast cancer screening visits. RESULTS: The development of gene expression levels in the NOWAC postgenome cohort varied in the last years before breast cancer diagnosis, and this development differed by lymph node status and participation in the Norwegian Breast Cancer Screening Program. The differences among the investigated strata appeared larger in the year before breast cancer diagnosis compared to earlier years. CONCLUSIONS: This approach shows good properties in term of statistical power and type 1 error under minimal assumptions. When applied to a real data set it was able to discriminate between groups of genes with non-linear similar patterns before diagnosis.


Assuntos
Neoplasias da Mama/genética , Detecção Precoce de Câncer/métodos , Perfilação da Expressão Gênica/estatística & dados numéricos , Predisposição Genética para Doença/epidemiologia , Modelos Estatísticos , Sistema de Registros , Adulto , Fatores Etários , Idoso , Neoplasias da Mama/epidemiologia , Estudos de Casos e Controles , Estudos de Coortes , Feminino , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Humanos , Incidência , Pessoa de Meia-Idade , Noruega , Valores de Referência , Medição de Risco , Sensibilidade e Especificidade
12.
Med Hypotheses ; 85(4): 494-7, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26187155

RESUMO

Traditionally, the prospective design has been chosen for risk factor analyses of lifestyle and cancer using mainly estimation by survival analysis methods. With new technologies, epidemiologists can expand their prospective studies to include functional genomics given either as transcriptomics, mRNA and microRNA, or epigenetics in blood or other biological materials. The novel functional analyses should not be assessed using classical survival analyses since the main goal is not risk estimation, but the analysis of functional genomics as part of the dynamic carcinogenic process over time, i.e., a "processual" approach. In the risk factor model, time to event is analysed as a function of exposure variables known at start of follow-up (fixed covariates) or changing over the follow-up period (time-dependent covariates). In the processual model, transcriptomics or epigenetics is considered as functions of time and exposures. The success of this novel approach depends on the development of new statistical methods with the capacity of describing and analysing the time-dependent curves or trajectories for tens of thousands of genes simultaneously. This approach also focuses on multilevel or integrative analyses introducing novel statistical methods in epidemiology. The processual approach as part of systems epidemiology might represent in a near future an alternative to human in vitro studies using human biological material for understanding the mechanisms and pathways involved in carcinogenesis.


Assuntos
Carcinogênese , Neoplasias/fisiopatologia , Projetos de Pesquisa , Carcinógenos , Epigênese Genética , Estudo de Associação Genômica Ampla , Humanos , Estilo de Vida , Modelos Teóricos , Neoplasias/etiologia , Estudos Observacionais como Assunto , Estudos Prospectivos , Análise de Regressão , Fatores de Risco , Fatores de Tempo , Transcriptoma
13.
Bioinformatics ; 31(12): i320-8, 2015 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-26072499

RESUMO

MOTIVATION: Motility is a fundamental cellular attribute, which plays a major part in processes ranging from embryonic development to metastasis. Traditionally, single cell motility is often studied by live cell imaging. Yet, such studies were so far limited to low throughput. To systematically study cell motility at a large scale, we need robust methods to quantify cell trajectories in live cell imaging data. RESULTS: The primary contribution of this article is to present Motility study Integrated Workflow (MotIW), a generic workflow for the study of single cell motility in high-throughput time-lapse screening data. It is composed of cell tracking, cell trajectory mapping to an original feature space and hit detection according to a new statistical procedure. We show that this workflow is scalable and demonstrates its power by application to simulated data, as well as large-scale live cell imaging data. This application enables the identification of an ontology of cell motility patterns in a fully unsupervised manner. AVAILABILITY AND IMPLEMENTATION: Python code and examples are available online (http://cbio.ensmp.fr/∼aschoenauer/motiw.html)


Assuntos
Movimento Celular , Rastreamento de Células/métodos , Imagem com Lapso de Tempo/métodos , Células HeLa , Humanos , Análise de Célula Única , Software , Fluxo de Trabalho
14.
BMC Bioinformatics ; 13: 329, 2012 Dec 11.
Artigo em Inglês | MEDLINE | ID: mdl-23231059

RESUMO

BACKGROUND: Illumina BeadArray technology includes non specific negative control features that allow a precise estimation of the background noise. As an alternative to the background subtraction proposed in BeadStudio which leads to an important loss of information by generating negative values, a background correction method modeling the observed intensities as the sum of the exponentially distributed signal and normally distributed noise has been developed. Nevertheless, Wang and Ye (2012) display a kernel-based estimator of the signal distribution on Illumina BeadArrays and suggest that a gamma distribution would represent a better modeling of the signal density. Hence, the normal-exponential modeling may not be appropriate for Illumina data and background corrections derived from this model may lead to wrong estimation. RESULTS: We propose a more flexible modeling based on a gamma distributed signal and a normal distributed background noise and develop the associated background correction, implemented in the R-package NormalGamma. Our model proves to be markedly more accurate to model Illumina BeadArrays: on the one hand, it is shown on two types of Illumina BeadChips that this model offers a more correct fit of the observed intensities. On the other hand, the comparison of the operating characteristics of several background correction procedures on spike-in and on normal-gamma simulated data shows high similarities, reinforcing the validation of the normal-gamma modeling. The performance of the background corrections based on the normal-gamma and normal-exponential models are compared on two dilution data sets, through testing procedures which represent various experimental designs. Surprisingly, we observe that the implementation of a more accurate parametrisation in the model-based background correction does not increase the sensitivity. These results may be explained by the operating characteristics of the estimators: the normal-gamma background correction offers an improvement in terms of bias, but at the cost of a loss in precision. CONCLUSIONS: This paper addresses the lack of fit of the usual normal-exponential model by proposing a more flexible parametrisation of the signal distribution as well as the associated background correction. This new model proves to be considerably more accurate for Illumina microarrays, but the improvement in terms of modeling does not lead to a higher sensitivity in differential analysis. Nevertheless, this realistic modeling makes way for future investigations, in particular to examine the characteristics of pre-processing strategies.


Assuntos
Modelos Químicos , Análise de Sequência com Séries de Oligonucleotídeos/métodos
15.
Int J Mol Epidemiol Genet ; 3(2): 107-14, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22724047

RESUMO

Transcriptomics as the analysis of mRNA and microRNA could be implemented in prospective studies both in peripheral blood and tissues. Its application in cancer epidemiology could provide a new understanding of the functional changes underlying the multistage model of carcinogenesis, as well as the relationship between these changes and exposure to carcinogens. Transcriptomics is not merely another -omics technology for risk assessment in traditional prospective studies. Instead, this novel approach has the potential to estimate the distribution of gene expression conditionally on different exposures, and to study the length of the different stages of carcinogenesis. If it proves to be a valid approach, transcriptomics could be an opportunity to make meaningful advances in our understanding of the carcinogenic process.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...