Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 34
Filter
1.
Nat Commun ; 15(1): 4234, 2024 May 18.
Article in English | MEDLINE | ID: mdl-38762544

ABSTRACT

Interactions between genetic perturbations and segregating loci can cause perturbations to show different phenotypic effects across genetically distinct individuals. To study these interactions on a genome scale in many individuals, we used combinatorial DNA barcode sequencing to measure the fitness effects of 8046 CRISPRi perturbations targeting 1721 distinct genes in 169 yeast cross progeny (or segregants). We identified 460 genes whose perturbation has different effects across segregants. Several factors caused perturbations to show variable effects, including baseline segregant fitness, the mean effect of a perturbation across segregants, and interacting loci. We mapped 234 interacting loci and found four hub loci that interact with many different perturbations. Perturbations that interact with a given hub exhibit similar epistatic relationships with the hub and show enrichment for cellular processes that may mediate these interactions. These results suggest that an individual's response to perturbations is shaped by a network of perturbation-locus interactions that cannot be measured by approaches that examine perturbations or natural variation alone.


Subject(s)
Epistasis, Genetic , Genome, Fungal , Saccharomyces cerevisiae , Saccharomyces cerevisiae/genetics , Genetic Variation , Genetic Fitness , CRISPR-Cas Systems , Phenotype , DNA Barcoding, Taxonomic
2.
Nucleic Acids Res ; 52(10): e47, 2024 Jun 10.
Article in English | MEDLINE | ID: mdl-38709890

ABSTRACT

Sequence verification of plasmid DNA is critical for many cloning and molecular biology workflows. To leverage high-throughput sequencing, several methods have been developed that add a unique DNA barcode to individual samples prior to pooling and sequencing. However, these methods require an individual plasmid extraction and/or in vitro barcoding reaction for each sample processed, limiting throughput and adding cost. Here, we develop an arrayed in vivo plasmid barcoding platform that enables pooled plasmid extraction and library preparation for Oxford Nanopore sequencing. This method has a high accuracy and recovery rate, and greatly increases throughput and reduces cost relative to other plasmid barcoding methods or Sanger sequencing. We use in vivo barcoding to sequence verify >45 000 plasmids and show that the method can be used to transform error-containing dispersed plasmid pools into sequence-perfect arrays or well-balanced pools. In vivo barcoding does not require any specialized equipment beyond a low-overhead Oxford Nanopore sequencer, enabling most labs to flexibly process hundreds to thousands of plasmids in parallel.


Subject(s)
Gene Library , High-Throughput Nucleotide Sequencing , Plasmids , Plasmids/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , DNA/genetics , DNA Barcoding, Taxonomic/methods , Nanopore Sequencing/methods
3.
bioRxiv ; 2024 Jan 16.
Article in English | MEDLINE | ID: mdl-38293072

ABSTRACT

Interactions between genetic perturbations and segregating loci can cause perturbations to show different phenotypic effects across genetically distinct individuals. To study these interactions on a genome scale in many individuals, we used combinatorial DNA barcode sequencing to measure the fitness effects of 7,700 CRISPRi perturbations targeting 1,712 distinct genes in 169 yeast cross progeny (or segregants). We identified 460 genes whose perturbation has different effects across segregants. Several factors caused perturbations to show variable effects, including baseline segregant fitness, the mean effect of a perturbation across segregants, and interacting loci. We mapped 234 interacting loci and found four hub loci that interact with many different perturbations. Perturbations that interact with a given hub exhibit similar epistatic relationships with the hub and show enrichment for cellular processes that may mediate these interactions. These results suggest that an individual's response to perturbations is shaped by a network of perturbation-locus interactions that cannot be measured by approaches that examine perturbations or natural variation alone.

4.
bioRxiv ; 2023 Oct 18.
Article in English | MEDLINE | ID: mdl-37873145

ABSTRACT

Sequence verification of plasmid DNA is critical for many cloning and molecular biology workflows. To leverage high-throughput sequencing, several methods have been developed that add a unique DNA barcode to individual samples prior to pooling and sequencing. However, these methods require an individual plasmid extraction and/or in vitro barcoding reaction for each sample processed, limiting throughput and adding cost. Here, we develop an arrayed in vivo plasmid barcoding platform that enables pooled plasmid extraction and library preparation for Oxford Nanopore sequencing. This method has a high accuracy and recovery rate, and greatly increases throughput and reduces cost relative to other plasmid barcoding methods or Sanger sequencing. We use in vivo barcoding to sequence verify >45,000 plasmids and show that the method can be used to transform error-containing dispersed plasmid pools into sequence-perfect arrays or well-balanced pools. In vivo barcoding does not require any specialized equipment beyond a low-overhead Oxford Nanopore sequencer, enabling most labs to flexibly process hundreds to thousands of plasmids in parallel.

5.
PLoS One ; 18(3): e0283548, 2023.
Article in English | MEDLINE | ID: mdl-36989327

ABSTRACT

As synthetic biology expands and accelerates into real-world applications, methods for quantitatively and precisely engineering biological function become increasingly relevant. This is particularly true for applications that require programmed sensing to dynamically regulate gene expression in response to stimuli. However, few methods have been described that can engineer biological sensing with any level of quantitative precision. Here, we present two complementary methods for precision engineering of genetic sensors: in silico selection and machine-learning-enabled forward engineering. Both methods use a large-scale genotype-phenotype dataset to identify DNA sequences that encode sensors with quantitatively specified dose response. First, we show that in silico selection can be used to engineer sensors with a wide range of dose-response curves. To demonstrate in silico selection for precise, multi-objective engineering, we simultaneously tune a genetic sensor's sensitivity (EC50) and saturating output to meet quantitative specifications. In addition, we engineer sensors with inverted dose-response and specified EC50. Second, we demonstrate a machine-learning-enabled approach to predictively engineer genetic sensors with mutation combinations that are not present in the large-scale dataset. We show that the interpretable machine learning results can be combined with a biophysical model to engineer sensors with improved inverted dose-response curves.


Subject(s)
Machine Learning , Synthetic Biology , Synthetic Biology/methods
6.
Genetics ; 222(3)2022 11 01.
Article in English | MEDLINE | ID: mdl-36103708

ABSTRACT

Determining how genetic polymorphisms enable certain fungi to persist in mammalian hosts can improve understanding of opportunistic fungal pathogenesis, a source of substantial human morbidity and mortality. We examined the genetic basis of fungal persistence in mice using a cross between a clinical isolate and the lab reference strain of the budding yeast Saccharomyces cerevisiae. Employing chromosomally encoded DNA barcodes, we tracked the relative abundances of 822 genotyped, haploid segregants in multiple organs over time and performed linkage mapping of their persistence in hosts. Detected loci showed a mix of general and antagonistically pleiotropic effects across organs. General loci showed similar effects across all organs, while antagonistically pleiotropic loci showed contrasting effects in the brain vs the kidneys, liver, and spleen. Persistence in an organ required both generally beneficial alleles and organ-appropriate pleiotropic alleles. This genetic architecture resulted in many segregants persisting in the brain or in nonbrain organs, but few segregants persisting in all organs. These results show complex combinations of genetic polymorphisms collectively cause and constrain fungal persistence in different parts of the mammalian body.


Subject(s)
Mycoses , Animals , Humans , Mice , Alleles , Chromosome Mapping/methods , Saccharomyces cerevisiae/genetics , Mycoses/microbiology , Brain/microbiology , Kidney/microbiology , Liver/microbiology , Spleen/microbiology
7.
Genetics ; 221(2)2022 05 31.
Article in English | MEDLINE | ID: mdl-35435209

ABSTRACT

Identification of adaptive targets in experimental evolution typically relies on extensive replication and genetic reconstruction. An alternative approach is to directly assay all mutations in an evolved clone by generating pools of segregants that contain random combinations of evolved mutations. Here, we apply this method to 6 Saccharomyces cerevisiae clones isolated from 4 diploid populations that were clonally evolved for 2,000 generations in rich glucose medium. Each clone contains 17-26 mutations relative to the ancestor. We derived intermediate genotypes between the founder and the evolved clones by bulk mating sporulated cultures of the evolved clones to a barcoded haploid version of the ancestor. We competed the resulting barcoded diploids en masse and quantified fitness in the experimental and alternative environments by barcode sequencing. We estimated average fitness effects of evolved mutations using barcode-based fitness assays and whole-genome sequencing for a subset of segregants. In contrast to our previous work with haploid evolved clones, we find that diploids carry fewer beneficial mutations, with modest fitness effects (up to 5.4%) in the environment in which they arose. In agreement with theoretical expectations, reconstruction experiments show that all mutations with a detectable fitness effect manifest some degree of dominance over the ancestral allele, and most are overdominant. Genotypes with lower fitness effects in alternative environments allowed us to identify conditions that drive adaptation in our system.


Subject(s)
Diploidy , Saccharomyces cerevisiae , Adaptation, Physiological/genetics , Genetic Fitness , Haploidy , Mutation , Saccharomyces cerevisiae/genetics
8.
Nat Commun ; 13(1): 1463, 2022 03 18.
Article in English | MEDLINE | ID: mdl-35304450

ABSTRACT

In diploid species, genetic loci can show additive, dominance, and epistatic effects. To characterize the contributions of these different types of genetic effects to heritable traits, we use a double barcoding system to generate and phenotype a panel of ~200,000 diploid yeast strains that can be partitioned into hundreds of interrelated families. This experiment enables the detection of thousands of epistatic loci, many whose effects vary across families. Here, we show traits are largely specified by a small number of hub loci with major additive and dominance effects, and pervasive epistasis. Genetic background commonly influences both the additive and dominance effects of loci, with multiple modifiers typically involved. The most prominent dominance modifier in our data is the mating locus, which has no effect on its own. Our findings show that the interplay between additivity, dominance, and epistasis underlies a complex genotype-to-phenotype map in diploids.


Subject(s)
Diploidy , Saccharomyces cerevisiae , Epistasis, Genetic , Exercise , Humans , Models, Genetic , Phenotype , Saccharomyces cerevisiae/genetics
11.
Mol Syst Biol ; 17(3): e10179, 2021 03.
Article in English | MEDLINE | ID: mdl-33784029

ABSTRACT

Allostery is a fundamental biophysical mechanism that underlies cellular sensing, signaling, and metabolism. Yet a quantitative understanding of allosteric genotype-phenotype relationships remains elusive. Here, we report the large-scale measurement of the genotype-phenotype landscape for an allosteric protein: the lac repressor from Escherichia coli, LacI. Using a method that combines long-read and short-read DNA sequencing, we quantitatively measure the dose-response curves for nearly 105 variants of the LacI genetic sensor. The resulting data provide a quantitative map of the effect of amino acid substitutions on LacI allostery and reveal systematic sequence-structure-function relationships. We find that in many cases, allosteric phenotypes can be quantitatively predicted with additive or neural-network models, but unpredictable changes also occur. For example, we were surprised to discover a new band-stop phenotype that challenges conventional models of allostery and that emerges from combinations of nearly silent amino acid substitutions.


Subject(s)
Genotype , Lac Repressors/metabolism , Phenotype , Allosteric Regulation , Amino Acid Substitution , Escherichia coli/genetics , Genetic Variation
12.
Elife ; 92020 09 14.
Article in English | MEDLINE | ID: mdl-32924934

ABSTRACT

To characterize how protein-protein interaction (PPI) networks change, we quantified the relative PPI abundance of 1.6 million protein pairs in the yeast Saccharomyces cerevisiae across nine growth conditions, with replication, for a total of 44 million measurements. Our multi-condition screen identified 13,764 pairwise PPIs, a threefold increase over PPIs identified in one condition. A few 'immutable' PPIs are present across all conditions, while most 'mutable' PPIs are rarely observed. Immutable PPIs aggregate into highly connected 'core' network modules, with most network remodeling occurring within a loosely connected 'accessory' module. Mutable PPIs are less likely to co-express, co-localize, and be explained by simple mass action kinetics, and more likely to contain proteins with intrinsically disordered regions, implying that environment-dependent association and binding is critical to cellular adaptation. Our results show that protein interactomes are larger than previously thought and contain highly dynamic regions that reorganize to drive or respond to cellular changes.


Subject(s)
Protein Interaction Maps , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/metabolism , Environment
13.
Nature ; 575(7783): 494-499, 2019 11.
Article in English | MEDLINE | ID: mdl-31723263

ABSTRACT

In rapidly adapting asexual populations, including many microbial pathogens and viruses, numerous mutant lineages often compete for dominance within the population1-5. These complex evolutionary dynamics determine the outcomes of adaptation, but have been difficult to observe directly. Previous studies have used whole-genome sequencing to follow molecular adaptation6-10; however, these methods have limited resolution in microbial populations. Here we introduce a renewable barcoding system to observe evolutionary dynamics at high resolution in laboratory budding yeast. We find nested patterns of interference and hitchhiking even at low frequencies. These events are driven by the continuous appearance of new mutations that modify the fates of existing lineages before they reach substantial frequencies. We observe how the distribution of fitness within the population changes over time, and find a travelling wave of adaptation that has been predicted by theory11-17. We show that clonal competition creates a dynamical 'rich-get-richer' effect: fitness advantages that are acquired early in evolution drive clonal expansions, which increase the chances of acquiring future mutations. However, less-fit lineages also routinely leapfrog over strains of higher fitness. Our results demonstrate that this combination of factors, which is not accounted for in existing models of evolutionary dynamics, is critical in determining the rate, predictability and molecular basis of adaptation.


Subject(s)
Adaptation, Physiological/genetics , Cell Lineage , Evolution, Molecular , Laboratories , Mutation , Saccharomyces cerevisiae/cytology , Saccharomyces cerevisiae/genetics , Clone Cells/cytology , Clone Cells/metabolism , DNA Barcoding, Taxonomic , Genetic Fitness/genetics
14.
Cell Syst ; 8(4): 338-344.e8, 2019 04 24.
Article in English | MEDLINE | ID: mdl-30954477

ABSTRACT

We developed a flexible toolkit for combinatorial screening in Saccharomyces cerevisiae, which generates large libraries of cells, each uniquely barcoded to mark a combination of DNA elements. This interaction sequencing platform (iSeq 2.0) includes genomic landing pads that assemble combinations through sequential integration of plasmids or yeast mating, 15 barcoded plasmid libraries containing split selectable markers (URA3AI, KanMXAI, HphMXAI, and NatMXAI), and an array of ∼24,000 "double-barcoder" strains that can make existing yeast libraries iSeq compatible. Various DNA elements are compatible with iSeq: DNA introduced on integrating plasmids, engineered genomic modifications, or entire genetic backgrounds. DNA element libraries are modular and interchangeable, and any two libraries can be combined, making iSeq capable of performing many new combinatorial screens by short-read sequencing.


Subject(s)
Protein Interaction Mapping/methods , Sequence Analysis, DNA/methods , Software , Fungal Proteins/chemistry , Fungal Proteins/metabolism , Gene Library , Recombinant Proteins/chemistry , Recombinant Proteins/metabolism , Saccharomyces cerevisiae
15.
Genome Res ; 29(4): 668-681, 2019 04.
Article in English | MEDLINE | ID: mdl-30782640

ABSTRACT

Large-scale genetic interaction (GI) screens in yeast have been invaluable for our understanding of molecular systems biology and for characterizing novel gene function. Owing in part to the high costs and long experiment times required, a preponderance of GI data has been generated in a single environmental condition. However, an unknown fraction of GIs may be specific to other conditions. Here, we developed a pooled-growth CRISPRi-based sequencing assay for GIs, CRISPRiSeq, which increases throughput such that GIs can be easily assayed across multiple growth conditions. We assayed the fitness of approximately 17,000 strains encompassing approximately 7700 pairwise interactions in five conditions and found that the additional conditions increased the number of GIs detected nearly threefold over the number detected in rich media alone. In addition, we found that condition-specific GIs are prevalent and improved the power to functionally classify genes. Finally, we found new links during respiratory growth between members of the Ras nutrient-sensing pathway and both the COG complex and a gene of unknown function. Our results highlight the potential of conditional GI screens to improve our understanding of cellular genetic networks.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , Environment , Epistasis, Genetic , Gene Regulatory Networks , Genetic Techniques , Sequence Analysis, DNA/methods , Genes, Fungal , Saccharomyces cerevisiae/genetics
16.
Nat Ecol Evol ; 3(2): 293-301, 2019 02.
Article in English | MEDLINE | ID: mdl-30598529

ABSTRACT

The dynamics of genetic diversity in large clonally evolving cell populations are poorly understood, despite having implications for the treatment of cancer and microbial infections. Here, we combine barcode lineage tracking, sequencing of adaptive clones and mathematical modelling of mutational dynamics to understand adaptive diversity changes during experimental evolution of Saccharomyces cerevisiae under nitrogen and carbon limitation. We find that, despite differences in beneficial mutational mechanisms and fitness effects, early adaptive genetic diversity increases predictably, driven by the expansion of many single-mutant lineages. However, a crash in adaptive diversity follows, caused by highly fit double-mutant 'jackpot' clones that are fed from exponentially growing single mutants, a process closely related to the classic Luria-Delbrück experiment. The diversity crash is likely to be a general feature of asexual evolution with clonal interference; however, both its timing and magnitude are stochastic and depend on the population size, the distribution of beneficial fitness effects and patterns of epistasis.


Subject(s)
Adaptation, Biological , Clonal Evolution , Genetic Variation/genetics , Saccharomyces cerevisiae/genetics , Models, Genetic , Mutation
17.
PLoS Biol ; 16(12): e3000069, 2018 12.
Article in English | MEDLINE | ID: mdl-30562346

ABSTRACT

Copy number variants (CNVs) are a pervasive source of genetic variation and evolutionary potential, but the dynamics and diversity of CNVs within evolving populations remain unclear. Long-term evolution experiments in chemostats provide an ideal system for studying the molecular processes underlying CNV formation and the temporal dynamics with which they are generated, selected, and maintained. Here, we developed a fluorescent CNV reporter to detect de novo gene amplifications and deletions in individual cells. We used the CNV reporter in Saccharomyces cerevisiae to study CNV formation at the GAP1 locus, which encodes the general amino acid permease, in different nutrient-limited chemostat conditions. We find that under strong selection, GAP1 CNVs are repeatedly generated and selected during the early stages of adaptive evolution, resulting in predictable dynamics. Molecular characterization of CNV-containing lineages shows that the CNV reporter detects different classes of CNVs, including aneuploidies, nonreciprocal translocations, tandem duplications, and complex CNVs. Despite GAP1's proximity to repeat sequences that facilitate intrachromosomal recombination, breakpoint analysis revealed that short inverted repeat sequences mediate formation of at least 50% of GAP1 CNVs. Inverted repeat sequences are also found at breakpoints at the DUR3 locus, where CNVs are selected in urea-limited chemostats. Analysis of 28 CNV breakpoints indicates that inverted repeats are typically 8 nucleotides in length and separated by 40 bases. The features of these CNVs are consistent with origin-dependent inverted-repeat amplification (ODIRA), suggesting that replication-based mechanisms of CNV formation may be a common source of gene amplification. We combined the CNV reporter with barcode lineage tracking and found that 102-104 independent CNV-containing lineages initially compete within populations, resulting in extreme clonal interference. However, only a small number (18-21) of CNV lineages ever constitute more than 1% of the CNV subpopulation, and as selection progresses, the diversity of CNV lineages declines. Our study introduces a novel means of studying CNVs in heterogeneous cell populations and provides insight into their dynamics, diversity, and formation mechanisms in the context of adaptive evolution.


Subject(s)
Adaptation, Biological/genetics , Amino Acid Transport Systems/genetics , DNA Copy Number Variations/genetics , Saccharomyces cerevisiae Proteins/genetics , Amino Acid Transport Systems/metabolism , DNA Mutational Analysis/methods , DNA Replication/genetics , Gene Amplification/genetics , Genes, Reporter/genetics , Membrane Transport Proteins/genetics , Recombination, Genetic , Repetitive Sequences, Nucleic Acid/genetics , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/physiology , Saccharomyces cerevisiae Proteins/metabolism , Single-Cell Analysis/methods
18.
Cell Syst ; 7(5): 521-525.e4, 2018 11 28.
Article in English | MEDLINE | ID: mdl-30391162

ABSTRACT

Standard practice for phenotyping complex cell pools is to measure the fold enrichment of genotype-specific amplicons after a period of competitive growth. Here, we show that fold-enrichment measures cannot be compared across genotype pools with different fitness distributions. We develop a method to calculate an unbiased estimate of relative fitness by tracking abundances over several time points and show how to optimize experimental protocols to minimize fitness measurement error.


Subject(s)
Algorithms , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , DNA Barcoding, Taxonomic/methods
19.
Bioinformatics ; 34(5): 739-747, 2018 03 01.
Article in English | MEDLINE | ID: mdl-29069318

ABSTRACT

Motivation: Barcode sequencing (bar-seq) is a high-throughput, and cost effective method to assay large numbers of cell lineages or genotypes in complex cell pools. Because of its advantages, applications for bar-seq are quickly growing-from using neutral random barcodes to study the evolution of microbes or cancer, to using pseudo-barcodes, such as shRNAs or sgRNAs to simultaneously screen large numbers of cell perturbations. However, the computational pipelines for bar-seq clustering are not well developed. Available methods often yield a high frequency of under-clustering artifacts that result in spurious barcodes, or over-clustering artifacts that group distinct barcodes together. Here, we developed Bartender, an accurate clustering algorithm to detect barcodes and their abundances from raw next-generation sequencing data. Results: In contrast with existing methods that cluster based on sequence similarity alone, Bartender uses a modified two-sample proportion test that also considers cluster size. This modification results in higher accuracy and lower rates of under- and over-clustering artifacts. Additionally, Bartender includes unique molecular identifier handling and a 'multiple time point' mode that matches barcode clusters between different clustering runs for seamless handling of time course data. Bartender is a set of simple-to-use command line tools that can be performed on a laptop at comparable run times to existing methods. Availability and implementation: Bartender is available at no charge for non-commercial use at https://github.com/LaoZZZZZ/bartender-1.1. Contact: sasha.levy@stonybrook.edu or song.wu@stonybrook.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Subject(s)
Cluster Analysis , High-Throughput Nucleotide Sequencing/methods , Software , Algorithms , Animals , Artifacts , Bacteria , Data Accuracy , Humans , Sequence Analysis, RNA
20.
Nat Commun ; 8: 15586, 2017 05 25.
Article in English | MEDLINE | ID: mdl-28541284

ABSTRACT

Several large-scale efforts have systematically catalogued protein-protein interactions (PPIs) of a cell in a single environment. However, little is known about how the protein interactome changes across environmental perturbations. Current technologies, which assay one PPI at a time, are too low throughput to make it practical to study protein interactome dynamics. Here, we develop a highly parallel protein-protein interaction sequencing (PPiSeq) platform that uses a novel double barcoding system in conjunction with the dihydrofolate reductase protein-fragment complementation assay in Saccharomyces cerevisiae. PPiSeq detects PPIs at a rate that is on par with current assays and, in contrast with current methods, quantitatively scores PPIs with enough accuracy and sensitivity to detect changes across environments. Both PPI scoring and the bulk of strain construction can be performed with cell pools, making the assay scalable and easily reproduced across environments. PPiSeq is therefore a powerful new tool for large-scale investigations of dynamic PPIs.


Subject(s)
DNA Barcoding, Taxonomic/methods , Protein Interaction Mapping/methods , Protein Interaction Maps , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Tetrahydrofolate Dehydrogenase/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...