Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
Synth Biol (Oxf) ; 7(1): ysac012, 2022.
Article in English | MEDLINE | ID: mdl-36035514

ABSTRACT

Sequencing technologies, in particular RNASeq, have become critical tools in the design, build, test and learn cycle of synthetic biology. They provide a better understanding of synthetic designs, and they help identify ways to improve and select designs. While these data are beneficial to design, their collection and analysis is a complex, multistep process that has implications on both discovery and reproducibility of experiments. Additionally, tool parameters, experimental metadata, normalization of data and standardization of file formats present challenges that are computationally intensive. This calls for high-throughput pipelines expressly designed to handle the combinatorial and longitudinal nature of synthetic biology. In this paper, we present a pipeline to maximize the analytical reproducibility of RNASeq for synthetic biologists. We also explore the impact of reproducibility on the validation of machine learning models. We present the design of a pipeline that combines traditional RNASeq data processing tools with structured metadata tracking to allow for the exploration of the combinatorial design in a high-throughput and reproducible manner. We then demonstrate utility via two different experiments: a control comparison experiment and a machine learning model experiment. The first experiment compares datasets collected from identical biological controls across multiple days for two different organisms. It shows that a reproducible experimental protocol for one organism does not guarantee reproducibility in another. The second experiment quantifies the differences in experimental runs from multiple perspectives. It shows that the lack of reproducibility from these different perspectives can place an upper bound on the validation of machine learning models trained on RNASeq data. Graphical Abstract.

2.
Bioinformatics ; 38(2): 404-409, 2022 01 03.
Article in English | MEDLINE | ID: mdl-34570169

ABSTRACT

MOTIVATION: Applications in synthetic and systems biology can benefit from measuring whole-cell response to biochemical perturbations. Execution of experiments to cover all possible combinations of perturbations is infeasible. In this paper, we present the host response model (HRM), a machine learning approach that maps response of single perturbations to transcriptional response of the combination of perturbations. RESULTS: The HRM combines high-throughput sequencing with machine learning to infer links between experimental context, prior knowledge of cell regulatory networks, and RNASeq data to predict a gene's dysregulation. We find that the HRM can predict the directionality of dysregulation to a combination of inducers with an accuracy of >90% using data from single inducers. We further find that the use of prior, known cell regulatory networks doubles the predictive performance of the HRM (an R2 from 0.3 to 0.65). The model was validated in two organisms, Escherichia coli and Bacillus subtilis, using new experiments conducted after training. Finally, while the HRM is trained with gene expression data, the direct prediction of differential expression makes it possible to also conduct enrichment analyses using its predictions. We show that the HRM can accurately classify >95% of the pathway regulations. The HRM reduces the number of RNASeq experiments needed as responses can be tested in silico prior to the experiment. AVAILABILITY AND IMPLEMENTATION: The HRM software and tutorial are available at https://github.com/sd2e/CDM and the configurable differential expression analysis tools and tutorials are available at https://github.com/SD2E/omics_tools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Machine Learning , Software , Systems Biology , Escherichia coli/genetics , High-Throughput Nucleotide Sequencing
3.
Genetics ; 206(3): 1403-1416, 2017 07.
Article in English | MEDLINE | ID: mdl-28533440

ABSTRACT

Gene silencing mediated by dsRNA (RNAi) can persist for multiple generations in Caenorhabditis elegans (termed RNAi inheritance). Here we describe the results of a forward genetic screen in C. elegans that has identified six factors required for RNAi inheritance: GLH-1/VASA, PUP-1/CDE-1, MORC-1, SET-32, and two novel nematode-specific factors that we term here (heritable RNAi defective) HRDE-2 and HRDE-4 The new RNAi inheritance factors exhibit mortal germline (Mrt) phenotypes, which we show is likely caused by epigenetic deregulation in germ cells. We also show that HRDE-2 contributes to RNAi inheritance by facilitating the binding of small RNAs to the inheritance Argonaute (Ago) HRDE-1 Together, our results identify additional components of the RNAi inheritance machinery whose conservation provides insights into the molecular mechanism of RNAi inheritance, further our understanding of how the RNAi inheritance machinery promotes germline immortality, and show that HRDE-2 couples the inheritance Ago HRDE-1 with the small RNAs it needs to direct RNAi inheritance and germline immortality.


Subject(s)
Caenorhabditis elegans Proteins/metabolism , Caenorhabditis elegans/genetics , Gene Silencing , RNA, Small Interfering/genetics , Animals , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/genetics , Epigenesis, Genetic , Phenotype
SELECTION OF CITATIONS
SEARCH DETAIL
...