1.
PLoS Biol ; 22(7): e3002697, 2024 Jul.
Article in English | MEDLINE | ID: mdl-39024225

ABSTRACT

Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1 Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. 
Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.


Subject(s)
Drosophilidae; Genome, Insect; Genomics; Phylogeny; Animals; Drosophilidae/genetics; Drosophilidae/classification; Genomics/methods; Sequence Analysis, DNA/methods; High-Throughput Nucleotide Sequencing/methods
2.
Mol Biol Evol ; 2024 Jul 23.
Article in English | MEDLINE | ID: mdl-39041198

ABSTRACT

Measuring the fitnesses of genetic variants is a fundamental objective in evolutionary biology. A standard approach for measuring microbial fitnesses in bulk involves labeling a library of genetic variants with unique sequence barcodes, competing the labeled strains in batch culture, and using deep sequencing to track changes in the barcode abundances over time. However, idiosyncratic properties of barcodes can induce non-uniform amplification or uneven sequencing coverage that causes some barcodes to be over- or under-represented in samples. This systematic bias can result in erroneous read count trajectories and misestimates of fitness. Here we develop a computational method, REBAR, for inferring the effects of barcode processing bias by leveraging the structure of systematic deviations in the data. We illustrate this approach by applying it to two independent data sets, and demonstrate that this method estimates and corrects for bias more accurately than standard proxies, such as GC-based corrections. REBAR mitigates bias and improves fitness estimates in high-throughput assays without introducing additional complexity to the experimental protocols, with potential applications in a range of experimental evolution and mutation screening contexts.
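
The standard bulk approach described in this abstract (tracking barcode abundances over time) can be illustrated with a minimal sketch. It is not the REBAR method and applies no bias correction; it only shows the uncorrected baseline estimate, assuming fitness is taken as the slope of the log ratio of a barcode's read count to a reference barcode's read count across generations. The function names, the reference barcode, and the counts are all hypothetical.

```python
import math


def least_squares_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def relative_fitness(counts, reference, generations):
    """Per-generation fitness of each barcode relative to a reference barcode.

    counts: dict mapping barcode -> list of read counts, one per time point.
    reference: key of the barcode used as the neutral reference (an assumption).
    generations: generation number at each time point.
    All counts are assumed to be positive; no pseudocount is applied.
    """
    ref = counts[reference]
    fitness = {}
    for barcode, series in counts.items():
        # Dividing by the reference count cancels differences in sequencing depth.
        log_ratios = [math.log(c / r) for c, r in zip(series, ref)]
        fitness[barcode] = least_squares_slope(generations, log_ratios)
    return fitness


# Hypothetical read counts at generations 0, 8, and 16.
example_counts = {"ref": [1000, 1100, 900], "bc1": [100, 250, 480]}
example_fitness = relative_fitness(example_counts, "ref", [0, 8, 16])
```

Any barcode-specific amplification or coverage bias of the kind the abstract describes would enter these counts directly, which is why an uncorrected slope like this one can misestimate fitness.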

3.
Nat Ecol Evol ; 2024 Jul 17.
Article in English | MEDLINE | ID: mdl-39020024

ABSTRACT

Evolution in a static laboratory environment often proceeds via large-effect beneficial mutations that may become maladaptive in other environments. Conversely, natural settings require populations to endure environmental fluctuations. A sensible assumption is that the fitness of a lineage in a fluctuating environment is the time average of its fitness over the sequence of static conditions it encounters. However, transitions between conditions may pose entirely new challenges, which could cause deviations from this time average. To test this, we tracked hundreds of thousands of barcoded yeast lineages evolving in static and fluctuating conditions and subsequently isolated 900 mutants for pooled fitness assays in 15 environments. Here we find that fitness in fluctuating environments indeed often deviates from the time average, leading to fitness non-additivity. Moreover, closer examination reveals that fitness in one component of a fluctuating environment is often strongly influenced by the previous component. We show that this environmental memory is especially common for mutants with high variance in fitness across tested environments. We use a simple mathematical model and whole-genome sequencing to propose mechanisms underlying this effect, including lag time evolution and sensing mutations. Our results show that environmental fluctuations impact fitness and suggest that variance in static environments can explain these impacts.
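
The time-average assumption stated in this abstract can be written out as a small worked sketch. It assumes the expected fitness in a fluctuating environment is the duration-weighted mean of the static fitnesses, and that non-additivity is the observed value minus that expectation. The function names, environment labels, and numbers are hypothetical and are not taken from the study.

```python
def time_average_fitness(static_fitness, durations):
    """Duration-weighted mean of fitness across the static components.

    static_fitness: dict mapping environment -> fitness measured in that static environment.
    durations: dict mapping environment -> time spent in it within the fluctuating regime.
    """
    total_time = sum(durations.values())
    weighted = sum(static_fitness[env] * durations[env] for env in durations)
    return weighted / total_time


def non_additivity(observed, static_fitness, durations):
    """Deviation of observed fluctuating-environment fitness from the time average."""
    return observed - time_average_fitness(static_fitness, durations)


# Hypothetical mutant: beneficial in A, slightly deleterious in B, equal time in each.
static = {"A": 0.10, "B": -0.02}
time_in_env = {"A": 8, "B": 8}
expected = time_average_fitness(static, time_in_env)
deviation = non_additivity(0.01, static, time_in_env)
```

A nonzero deviation is what the abstract calls fitness non-additivity; the sketch says nothing about its cause, such as the environmental memory the authors report.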

4.
bioRxiv ; 2024 May 31.
Article in English | MEDLINE | ID: mdl-38853826

ABSTRACT

Most cancers are diagnosed in persons over the age of sixty, but little is known about how age impacts tumorigenesis. While aging is accompanied by mutation accumulation - widely understood to contribute to cancer risk - it is also associated with numerous other cellular and molecular changes likely to impact tumorigenesis. Moreover, cancer incidence decreases in the oldest part of the population, suggesting that very old age may reduce carcinogenesis. Here we show that aging represses tumor initiation and growth in genetically engineered mouse models of human lung cancer. Moreover, aging dampens the impact of inactivating many, but not all, tumor suppressor genes with the impact of inactivating PTEN, a negative regulator of the PI3K/AKT pathway, weakened to a disproportionate extent. Single-cell transcriptomic analysis revealed that neoplastic cells from tumors in old mice retain many age-related transcriptomic changes, showing that age has an enduring impact that persists through oncogenic transformation. Furthermore, the consequences of PTEN inactivation were strikingly age-dependent, with PTEN deficiency reducing signatures of aging in cancer cells and the tumor microenvironment. Our findings suggest that the relationship between age and lung cancer incidence may reflect an integration of the competing effects of driver mutation accumulation and tumor suppressive effects of aging.

5.
bioRxiv ; 2024 May 25.
Article in English | MEDLINE | ID: mdl-38496463

ABSTRACT

Somatic genome editing in mouse models has increased our understanding of the in vivo effects of genetic alterations in areas ranging from neuroscience to cancer biology and beyond. However, existing models are limited in their ability to create multiple targeted edits. Thus, our understanding of the complex genetic interactions that underlie development, homeostasis, and disease remains incomplete. Cas12a is an RNA-guided endonuclease with unique attributes that enable simple targeting of multiple genes with crRNA arrays containing tandem guides. To accelerate and expand the generation of complex genotypes in somatic cells, we generated transgenic mice with Cre-regulated and constitutive expression of enhanced Acidaminococcus sp. Cas12a (enAsCas12a). In these mice, enAsCas12a-mediated somatic genome editing robustly generated compound genotypes, as exemplified by the initiation of diverse cancer types driven by homozygous inactivation of trios of tumor suppressor genes. We further integrated these modular crRNA arrays with clonal barcoding to quantify the size and number of tumors with each array, as well as the efficiency of each crRNA. These Cas12a alleles will enable the rapid generation of disease models and broadly facilitate the high-throughput investigation of coincident genomic alterations in somatic cells in vivo.

6.
bioRxiv ; 2024 Mar 08.
Article in English | MEDLINE | ID: mdl-38496564

ABSTRACT

Lung adenocarcinoma, the most common subtype of lung cancer, is genomically complex, with tumors containing tens to hundreds of non-synonymous mutations. However, little is understood about how genes interact with each other to enable tumorigenesis in vivo, largely due to a lack of methods for investigating genetic interactions in a high-throughput and multiplexed manner. Here, we employed a novel platform to generate tumors with all pairwise inactivation of ten tumor suppressor genes within an autochthonous mouse model of oncogenic KRAS-driven lung cancer. By quantifying the fitness of tumors with every single and double mutant genotype, we show that most tumor suppressor genetic interactions exhibited negative epistasis, with diminishing returns on tumor fitness. In contrast, Apc inactivation showed positive epistasis with the inactivation of several other genes, including dramatically synergistic effects on tumor fitness in combination with Lkb1 or Nf1 inactivation. This approach has the potential to expand the scope of genetic interactions that may be functionally characterized in vivo, which could lead to a better understanding of how complex tumor genotypes impact each step of carcinogenesis.

7.
PLoS Comput Biol ; 20(3): e1011937, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38489348

ABSTRACT

The tracking of lineage frequencies via DNA barcode sequencing enables the quantification of microbial fitness. However, experimental noise coming from biotic and abiotic sources complicates the computation of a reliable inference. We present a Bayesian pipeline to infer relative microbial fitness from high-throughput lineage tracking assays. Our model accounts for multiple sources of noise and propagates uncertainties throughout all parameters in a systematic way. Furthermore, using modern variational inference methods based on automatic differentiation, we are able to scale the inference to a large number of unique barcodes. We extend this core model to analyze multi-environment assays, replicate experiments, and barcodes linked to genotypes. On simulations, our method recovers known parameters within posterior credible intervals. This work provides a generalizable Bayesian framework to analyze lineage tracking experiments. The accompanying open-source software library enables the adoption of principled statistical methods in experimental evolution.


Subject(s)
High-Throughput Screening Assays; Software; Bayes Theorem; Sequence Analysis, DNA; Gene Library
8.
bioRxiv ; 2023 Dec 11.
Article in English | MEDLINE | ID: mdl-38076867

ABSTRACT

The long-term success of introduced populations depends on their initial size and ability to compete against existing residents, but it remains unclear how these factors collectively shape colonization. Here, we investigate how initial population (propagule) size and resource competition interact during community coalescence by systematically mixing eight pairs of in vitro microbial communities at ratios that vary over six orders of magnitude, and we compare our results to a neutral ecological model. Although the composition of the resulting co-cultures deviated substantially from neutral expectations, each co-culture contained species whose relative abundance depended on propagule size even after ~40 generations of growth. Using a consumer-resource model, we show that this dose-dependent colonization can arise when resident and introduced species have high niche overlap and consume shared resources at similar rates. This model predicts that propagule size will have larger, longer-lasting effects in diverse communities in which niche overlap is higher, and we experimentally confirm that strain isolates show stronger dose dependence when introduced into diverse communities than in pairwise co-culture. This work shows how neutral-like colonization dynamics can emerge from non-neutral resource competition and have lasting effects on the outcomes of community coalescence.

9.
bioRxiv ; 2023 Oct 02.
Article in English | MEDLINE | ID: mdl-37873137

ABSTRACT

Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. 
Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.

10.
bioRxiv ; 2023 Oct 18.
Article in English | MEDLINE | ID: mdl-37904971

ABSTRACT

The tracking of lineage frequencies via DNA barcode sequencing enables the quantification of microbial fitness. However, experimental noise coming from biotic and abiotic sources complicates the computation of a reliable inference. We present a Bayesian pipeline to infer relative microbial fitness from high-throughput lineage tracking assays. Our model accounts for multiple sources of noise and propagates uncertainties throughout all parameters in a systematic way. Furthermore, using modern variational inference methods based on automatic differentiation, we are able to scale the inference to a large number of unique barcodes. We extend this core model to analyze multi-environment assays, replicate experiments, and barcodes linked to genotypes. On simulations, our method recovers known parameters within posterior credible intervals. This work provides a generalizable Bayesian framework to analyze lineage tracking experiments. The accompanying open-source software library enables the adoption of principled statistical methods in experimental evolution.

11.
Nat Commun ; 14(1): 6422, 2023 10 12.
Article in English | MEDLINE | ID: mdl-37828026

ABSTRACT

Tumors acquire alterations in oncogenes and tumor suppressor genes in an adaptive walk through the fitness landscape of tumorigenesis. However, the interactions between oncogenes and tumor suppressor genes that shape this landscape remain poorly resolved and cannot be revealed by human cancer genomics alone. Here, we use a multiplexed, autochthonous mouse platform to model and quantify the initiation and growth of more than one hundred genotypes of lung tumors across four oncogenic contexts: KRAS G12D, KRAS G12C, BRAF V600E, and EGFR L858R. We show that the fitness landscape is rugged (the effect of tumor suppressor inactivation often switches between beneficial and deleterious depending on the oncogenic context) and shows no evidence of diminishing-returns epistasis within variants of the same oncogene. These findings argue against a simple linear signaling relationship amongst these three oncogenes and imply a critical role for off-axis signaling in determining the fitness effects of inactivating tumor suppressors.


Subject(s)
Lung Neoplasms; Proto-Oncogene Proteins p21(ras); Mice; Humans; Animals; Proto-Oncogene Proteins p21(ras)/genetics; Oncogenes/genetics; Lung Neoplasms/genetics; Lung Neoplasms/pathology; Carcinogenesis/genetics; Cell Transformation, Neoplastic/genetics; Mutation
12.
Elife ; 12, 2023 10 20.
Article in English | MEDLINE | ID: mdl-37861305

ABSTRACT

Adaptation is driven by the selection for beneficial mutations that provide a fitness advantage in the specific environment in which a population is evolving. However, environments are rarely constant or predictable. When an organism well adapted to one environment finds itself in another, pleiotropic effects of mutations that made it well adapted to its former environment will affect its success. To better understand such pleiotropic effects, we evolved both haploid and diploid barcoded budding yeast populations in multiple environments, isolated adaptive clones, and then determined the fitness effects of adaptive mutations in 'non-home' environments in which they were not selected. We find that pleiotropy is common, with most adaptive evolved lineages showing fitness effects in non-home environments. Consistent with other studies, we find that these pleiotropic effects are unpredictable: they are beneficial in some environments and deleterious in others. However, we do find that lineages with adaptive mutations in the same genes tend to show similar pleiotropic effects. We also find that ploidy influences the observed adaptive mutational spectra in a condition-specific fashion. In some conditions, haploids and diploids are selected with adaptive mutations in identical genes, while in others they accumulate mutations in almost completely disjoint sets of genes.


Subject(s)
Diploidy; Saccharomyces cerevisiae; Haploidy; Saccharomyces cerevisiae/genetics; Mutation
13.
bioRxiv ; 2023 Oct 30.
Article in English | MEDLINE | ID: mdl-37808827

ABSTRACT

Humans constantly encounter new microbes, but few become long-term residents of the adult gut microbiome. Classical theories predict that colonization is determined by the availability of open niches, but it remains unclear whether other ecological barriers limit commensal colonization in natural settings. To disentangle these effects, we used a controlled perturbation with the antibiotic ciprofloxacin to investigate the dynamics of gut microbiome transmission in 22 households of healthy, cohabiting adults. Colonization was rare in three-quarters of antibiotic-taking subjects, whose resident strains rapidly recovered in the week after antibiotics ended. In contrast, the remaining antibiotic-taking subjects exhibited lasting responses, with extensive species losses and transient expansions of potential opportunistic pathogens. These subjects experienced elevated rates of commensal colonization, but only after long delays: many new colonizers underwent sudden, correlated expansions months after the antibiotic perturbation. Furthermore, strains that had previously transmitted between cohabiting partners rarely recolonized after antibiotic disruptions, showing that colonization displays substantial historical contingency. This work demonstrates that there remain substantial ecological barriers to colonization even after major microbiome disruptions, suggesting that dispersal interactions and priority effects limit the pace of community change.

14.
Proc Natl Acad Sci U S A ; 120(38): e2303224120, 2023 09 19.
Article in English | MEDLINE | ID: mdl-37695905

ABSTRACT

Cancer genomes are almost invariably complex with genomic alterations cooperating during each step of carcinogenesis. In cancers that lack a single dominant oncogene mutation, cooperation between the inactivation of multiple tumor suppressor genes can drive tumor initiation and growth. Here, we shed light on how the sequential acquisition of genomic alterations generates oncogene-negative lung tumors. We couple tumor barcoding with combinatorial and multiplexed somatic genome editing to characterize the fitness landscapes of three tumor suppressor genes NF1, RASA1, and PTEN, the inactivation of which jointly drives oncogene-negative lung adenocarcinoma initiation and growth. The fitness landscape was surprisingly accessible, with each additional mutation leading to growth advantage. Furthermore, the fitness landscapes remained fully accessible across backgrounds with the inactivation of additional tumor suppressor genes. These results suggest that while predicting cancer evolution will be challenging, acquiring the multiple alterations that drive the growth of oncogene-negative tumors can be facilitated by the lack of constraints on mutational order.


Subject(s)
Adenocarcinoma of Lung; Lung Neoplasms; Humans; Oncogenes/genetics; Adenocarcinoma of Lung/genetics; Mutation; Lung Neoplasms/genetics; Cell Transformation, Neoplastic; p120 GTPase Activating Protein
15.
bioRxiv ; 2023 Sep 21.
Article in English | MEDLINE | ID: mdl-37745585

ABSTRACT

Evolution in a static environment, such as a laboratory setting with constant and uniform conditions, often proceeds via large-effect beneficial mutations that may become maladaptive in other environments. Conversely, natural settings require populations to endure environmental fluctuations. A sensible assumption is that the fitness of a lineage in a fluctuating environment is the time-average of its fitness over the sequence of static conditions it encounters. However, transitions between conditions may pose entirely new challenges, which could cause deviations from this time-average. To test this, we tracked hundreds of thousands of barcoded yeast lineages evolving in static and fluctuating conditions and subsequently isolated 900 mutants for pooled fitness assays in 15 environments. We find that fitness in fluctuating environments indeed often deviates from the expectation based on static components, leading to fitness non-additivity. Moreover, closer examination reveals that fitness in one component of a fluctuating environment is often strongly influenced by the previous component. We show that this environmental memory is especially common for mutants with high variance in fitness across tested environments, even if the components of the focal fluctuating environment are excluded from this variance. We employ a simple mathematical model and whole-genome sequencing to propose mechanisms underlying this effect, including lag time evolution and sensing mutations. Our results demonstrate that environmental fluctuations have large impacts on fitness and suggest that variance in static environments can explain these impacts.

17.
ACS Omega ; 8(28): 25009-25019, 2023 Jul 18.
Article in English | MEDLINE | ID: mdl-37483225

ABSTRACT

The emergence of viral threats such as Ebola, Zika, and severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) requires a rapid and efficient approach for elucidating mechanisms of pathogenesis and developing therapeutics. In this context, cell-free protein synthesis (CFPS) holds promise for resolving the bottlenecks of multiplexed protein production and interaction analysis among host and pathogen proteins. Here, we applied a eukaryotic CFPS system based on Leishmania tarentolae extract (LTE) protein expression in combination with AlphaLISA proximity-based protein interaction technology to identify intraviral and viral-human protein interactions of SARS-CoV-2 virus that can potentially be targeted by existing or novel antiviral therapeutics. We produced and tested 54 putative human-viral protein pairs in vitro and identified 45 direct binary protein interactions. As a case example of the assay's suitability for drug development applications, we analyzed the effect of a putative biologic on the human angiotensin-converting enzyme 2/receptor-binding domain (hACE2/RBD) interaction. This suggests that the presented pathogen characterization platform can facilitate the development of new therapeutic agents.

18.
J Mol Evol ; 91(3): 293-310, 2023 06.
Article in English | MEDLINE | ID: mdl-37237236

ABSTRACT

The phrase "survival of the fittest" has become an iconic descriptor of how natural selection works. And yet, precisely measuring fitness, even for single-celled microbial populations growing in controlled laboratory conditions, remains a challenge. While numerous methods exist to perform these measurements, including recently developed methods utilizing DNA barcodes, all methods are limited in their precision to differentiate strains with small fitness differences. In this study, we rule out some major sources of imprecision, but still find that fitness measurements vary substantially from replicate to replicate. Our data suggest that very subtle and difficult to avoid environmental differences between replicates create systematic variation across fitness measurements. We conclude by discussing how fitness measurements should be interpreted given their extreme environment dependence. This work was inspired by the scientific community who followed us and gave us tips as we live tweeted a high-replicate fitness measurement experiment at #1BigBatch.


Subject(s)
Genetic Fitness; Selection, Genetic
19.
bioRxiv ; 2023 Feb 06.
Article in English | MEDLINE | ID: mdl-36778226

ABSTRACT

Cancer genomes are almost invariably complex with genomic alterations cooperating during each step of carcinogenesis. In cancers that lack a single dominant oncogene mutation, cooperation between the inactivation of multiple tumor suppressor genes can drive tumor initiation and growth. Here, we shed light on how the sequential acquisition of genomic alterations generates oncogene-negative lung tumors. We couple tumor barcoding with combinatorial and multiplexed somatic genome editing to characterize the fitness landscapes of three tumor suppressor genes NF1, RASA1, and PTEN, the inactivation of which jointly drives oncogene-negative lung adenocarcinoma initiation and growth. The fitness landscape was surprisingly accessible, with each additional mutation leading to growth advantage. Furthermore, the fitness landscapes remained fully accessible across backgrounds with additional tumor suppressor mutations. These results suggest that while predicting cancer evolution will be challenging, acquiring the multiple alterations required for the growth of oncogene-negative tumors can be facilitated by the lack of constraints on mutational order.

20.
bioRxiv ; 2023 Feb 02.
Article in English | MEDLINE | ID: mdl-36778235

ABSTRACT

Sequence variation among antigenic var genes enables Plasmodium falciparum malaria parasites to evade host immunity. Using long sequence reads from haploid clones from a mutation accumulation experiment, we detect var diversity inconsistent with simple chromosomal inheritance. We discover putatively circular DNA that is strongly enriched for var genes, which exist in multiple alleles per locus separated by recombination and indel events. Extrachromosomal DNA likely contributes to rapid antigenic diversification in P. falciparum.
