1.
Nat Protoc ; 18(12): 3690-3731, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37989764

ABSTRACT

Non-negative matrix factorization (NMF) is an unsupervised learning method well suited to high-throughput biology. However, inferring biological processes from an NMF result still requires additional post hoc statistics and annotation to interpret the learned features. Here, we introduce a suite of computational tools that implement NMF and provide methods for accurate and clear biological interpretation and analysis. A generalized discussion of NMF covering its benefits, limitations and open questions is followed by four procedures for the Bayesian NMF algorithm Coordinated Gene Activity across Pattern Subsets (CoGAPS). Each procedure demonstrates NMF analysis to quantify cell state transitions in a public domain single-cell RNA-sequencing dataset. The first demonstrates PyCoGAPS, our new Python implementation that enhances runtime for large datasets, and the second allows its deployment in Docker. The third procedure steps through the same single-cell NMF analysis using our R CoGAPS interface. The fourth introduces a beginner-friendly CoGAPS platform using GenePattern Notebook, aimed at users with a working conceptual knowledge of data analysis but without basic proficiency in the R or Python programming languages. We also constructed a user-facing website to serve as a central repository for information and instructional materials about CoGAPS and its application programming interfaces. Setting up the packages and conducting a test run is expected to take around 15 min, with an additional 30 min to conduct analyses on a precomputed result. The expected runtime on the user's desired dataset can vary from hours to days depending on factors such as dataset size or input parameters.


Subject(s)
Algorithms , Programming Languages , Bayes Theorem , Single-Cell Analysis
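
To make the factorization in the abstract above concrete, here is a minimal sketch of plain NMF using the classic multiplicative update rules. It is not CoGAPS, which the abstract describes as a Bayesian algorithm, and it does not use the PyCoGAPS API. The function names, the toy "genes by cells" matrix and the parameter defaults are all illustrative assumptions.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def frobenius_error(v, w, h):
    """Frobenius norm of the residual V - WH."""
    wh = matmul(w, h)
    return sum(
        (v[i][j] - wh[i][j]) ** 2 for i in range(len(v)) for j in range(len(v[0]))
    ) ** 0.5


def nmf(v, rank, n_iter=200, seed=0, eps=1e-9):
    """Approximate V (n x m, non-negative) as W (n x rank) times H (rank x m).

    Uses multiplicative updates for the Frobenius objective. This is an
    illustrative stand-in, NOT the Bayesian sampler used by CoGAPS.
    """
    rng = random.Random(seed)
    n_rows, n_cols = len(v), len(v[0])
    # Strictly positive random start so the updates are well defined.
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n_rows)]
    h = [[rng.random() + 0.1 for _ in range(n_cols)] for _ in range(rank)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(n_cols)]
             for k in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(n_rows)]
    return w, h


# Hypothetical toy data: 5 "genes" (rows) by 4 "cells" (columns), built from
# two underlying expression patterns.
toy_expression = [
    [5, 5, 0, 0],
    [4, 4, 0, 0],
    [0, 0, 3, 3],
    [0, 0, 6, 6],
    [1, 1, 1, 1],
]

if __name__ == "__main__":
    w, h = nmf(toy_expression, rank=2)
    print("reconstruction error:", frobenius_error(toy_expression, w, h))
```

In this sketch the rows of H play the role of learned patterns across cells, and the columns of W give each gene's weight in each pattern. The post hoc statistics and annotation the abstract mentions would operate on factors like these.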
2.
bioRxiv ; 2023 Nov 05.
Article in English | MEDLINE | ID: mdl-37745323

ABSTRACT

Cells are fundamental units of life, constantly interacting and evolving as dynamical systems. While recent spatial multi-omics can quantitate individual cells' characteristics and regulatory programs, forecasting their evolution ultimately requires mathematical modeling. We develop a conceptual framework, a cell behavior hypothesis grammar, that uses natural language statements (cell rules) to create mathematical models. This allows us to systematically integrate biological knowledge and multi-omics data to make them computable. We can then perform virtual "thought experiments" that challenge and extend our understanding of multicellular systems, and ultimately generate new testable hypotheses. In this paper, we motivate and describe the grammar, provide a reference implementation, and demonstrate its potential through a series of examples in tumor biology and immunotherapy. Altogether, this approach provides a bridge between biological, clinical, and systems biology researchers for mathematical modeling of biological systems at scale, allowing the community to extrapolate from single-cell characterization to emergent multicellular behavior.
