1.
Genome Biol Evol ; 16(5)2024 May 02.
Article in English | MEDLINE | ID: mdl-38758096

ABSTRACT

The coppery titi monkey (Plecturocebus cupreus) is an emerging nonhuman primate model system for behavioral and neurobiological research. At the same time, the near-complete absence of genomic resources for the species has hampered insights into the genetic underpinnings of the phenotypic traits of interest. To facilitate future genotype-to-phenotype studies, we here present a high-quality, fully annotated de novo genome assembly for the species with chromosome-length scaffolds spanning the autosomes and chromosome X (scaffold N50 = 130.8 Mb), constructed using data obtained from several orthogonal short- and long-read sequencing and scaffolding techniques. With a base-level accuracy of ∼99.99% in chromosome-length scaffolds as well as benchmarking universal single-copy ortholog (BUSCO) and k-mer completeness scores of >99.0% and 95.1%, respectively, at the genome level, this assembly represents one of the most complete Pitheciidae genomes to date, making it an invaluable resource for comparative evolutionary genomics research to improve our understanding of lineage-specific changes underlying adaptive traits as well as deleterious mutations associated with disease.
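The scaffold N50 quoted in the abstract is a standard contiguity statistic: the length L such that scaffolds of length L or longer together cover at least half of the assembly. A minimal sketch of that definition follows; the function name and the toy scaffold lengths are illustrative assumptions, not data from the assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths.

    N50 is the length L such that scaffolds of length >= L together
    account for at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold downwards until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical scaffold lengths (arbitrary units), for illustration only.
example_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(example_lengths))
```

With these toy lengths the total is 300, and the two longest scaffolds (80 + 70) already reach half of it, so the reported N50 is 70.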


Subject(s)
Genome , Pitheciidae , Animals , Pitheciidae/genetics , Genomics , Models, Animal
2.
Microbiol Resour Announc ; 13(6): e0018224, 2024 Jun 11.
Article in English | MEDLINE | ID: mdl-38651927

ABSTRACT

Amabiko is a lytic subcluster BE2 bacteriophage that infects Streptomyces scabiei, a bacterium causing common scab in potatoes. Its 131,414-bp genome has a GC content of 49.5% and contains 245 putative protein-coding genes, 45 tRNAs, and one tmRNA. Amabiko is closely related to Streptomyces bacteriophage MindFlayer (gene content similarity: 86.5%).
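GC content, as reported above, is the percentage of G and C bases among the unambiguous bases of a sequence. A minimal sketch, assuming the genome is available as a plain string; the function name and the toy sequences are illustrative assumptions and are not taken from the announcement.

```python
def gc_content(sequence):
    """Return the GC content of a nucleotide sequence as a percentage.

    Ambiguous symbols (e.g. N) are ignored, so the denominator counts
    only A, C, G and T.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy sequences, for illustration only.
print(gc_content("ATGC"))    # two of the four bases are G or C
print(gc_content("ATNNGC"))  # the two N symbols are skipped
```

Ignoring ambiguous bases is one possible convention; some tools instead divide by the full sequence length.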

3.
Virus Evol ; 10(1): vead083, 2024.
Article in English | MEDLINE | ID: mdl-38361822

ABSTRACT

The rapid emergence and spread of antimicrobial resistance across the globe have prompted the usage of bacteriophages (i.e. viruses that infect bacteria) in a variety of applications ranging from agriculture to biotechnology and medicine. To effectively guide the application of bacteriophages in these multifaceted areas, information about their host ranges (that is, the bacterial strains or species that a bacteriophage can successfully infect and kill) is essential. Utilizing sixteen broad-spectrum (polyvalent) bacteriophages with experimentally validated host ranges, we here benchmark the performance of eleven recently developed computational host range prediction tools that provide a promising and highly scalable supplement to traditional, but laborious, experimental procedures. We show that machine- and deep-learning approaches offer the highest levels of accuracy and precision; however, their predominant predictions at the species or genus level render them ill-suited for applications outside of an ecosystems metagenomics framework. In contrast, only moderate sensitivity (<80 per cent) could be reached at the strain level, albeit at low levels of precision (<40 per cent). Taken together, these limitations demonstrate that there remains room for improvement in the active scientific field of in silico host prediction to combat the challenge of guiding experimental designs to identify the most promising bacteriophage candidates for any given application.
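The precision and sensitivity figures above can be read against the textbook definitions: precision is the fraction of predicted hosts that are experimentally confirmed, and sensitivity is the fraction of confirmed hosts that are predicted. The sketch below applies those definitions to one phage; the function name and host labels are hypothetical, and the benchmark itself may have aggregated its metrics differently.

```python
def precision_and_sensitivity(predicted_hosts, validated_hosts):
    """Compare a predicted host range with an experimentally validated one.

    Returns (precision, sensitivity) as fractions between 0 and 1;
    a metric is NaN when its denominator is empty.
    """
    predicted = set(predicted_hosts)
    validated = set(validated_hosts)
    true_positives = len(predicted & validated)
    false_positives = len(predicted - validated)
    false_negatives = len(validated - predicted)
    precision = (true_positives / (true_positives + false_positives)
                 if predicted else float("nan"))
    sensitivity = (true_positives / (true_positives + false_negatives)
                   if validated else float("nan"))
    return precision, sensitivity


# Hypothetical strain labels, for illustration only.
predicted = {"strain_A", "strain_B", "strain_C", "strain_D"}
validated = {"strain_A", "strain_B", "strain_E"}
precision, sensitivity = precision_and_sensitivity(predicted, validated)
print(f"precision={precision:.2f}, sensitivity={sensitivity:.2f}")
```

Here two of four predictions are confirmed (precision 0.50) and two of three confirmed hosts are recovered (sensitivity about 0.67).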

4.
Genome Biol Evol ; 16(2)2024 Feb 01.
Article in English | MEDLINE | ID: mdl-38207127

ABSTRACT

Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavor; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted-for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modeled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in misleading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination before utilizing population genomic data to quantify the effects of genetic drift (i.e. as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modeled in downstream inference.


Subject(s)
Genetic Drift , Selection, Genetic , Demography , Mutation , Recombination, Genetic , Models, Genetic
5.
Genome Biol Evol ; 16(1)2024 Jan 05.
Article in English | MEDLINE | ID: mdl-38051960

ABSTRACT

Meiotic recombination landscapes differ greatly between distantly and closely related taxa, populations, individuals, sexes, and even within genomes; however, the factors driving this variation are yet to be well elucidated. Here, we directly estimate contemporary crossover rates and, for the first time, noncrossover rates in rhesus macaques (Macaca mulatta) from four three-generation pedigrees comprising 32 individuals. We further compare these results with historical, demography-aware, linkage disequilibrium-based recombination rate estimates. From paternal meioses in the pedigrees, 165 crossover events with a median resolution of 22.3 kb were observed, corresponding to a male autosomal map length of 2,357 cM, approximately 15% longer than an existing linkage map based on human microsatellite loci. In addition, 85 noncrossover events with a mean tract length of 155 bp were identified, similar to the tract lengths observed in the only other two primates in which noncrossovers have been studied to date, humans and baboons. Consistent with observations in other placental mammals with PRDM9-directed recombination, crossover (and to a lesser extent noncrossover) events in rhesus macaques clustered in intergenic regions and toward the chromosomal ends in males (a pattern in broad agreement with the historical, sex-averaged recombination rate estimates), and evidence of GC-biased gene conversion was observed at noncrossover sites.


Subject(s)
Genome , Placenta , Pregnancy , Animals , Male , Humans , Female , Macaca mulatta/genetics , Chromosome Mapping/methods , Linkage Disequilibrium , Meiosis , Mammals/genetics , Histone-Lysine N-Methyltransferase/genetics
6.
bioRxiv ; 2023 Nov 13.
Article in English | MEDLINE | ID: mdl-38014252

ABSTRACT

Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavour; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted-for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modelled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in misleading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination prior to utilizing population genomic data to quantify the effects of genetic drift (i.e., as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modelled in downstream inference.

7.
PLoS Pathog ; 19(10): e1011646, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37796819

ABSTRACT

Congenital cytomegalovirus (cCMV) is the leading infectious cause of neurologic defects in newborns with particularly severe sequelae in the setting of primary CMV infection in the first trimester of pregnancy. The majority of cCMV cases worldwide occur after non-primary infection in CMV-seropositive women; yet the extent to which pre-existing natural CMV-specific immunity protects against CMV reinfection or reactivation during pregnancy remains ill-defined. We previously reported on a novel nonhuman primate model of cCMV in rhesus macaques where 100% placental transmission and 83% fetal loss were seen in CD4+ T lymphocyte-depleted rhesus CMV (RhCMV)-seronegative dams after primary RhCMV infection. To investigate the protective effect of preconception maternal immunity, we performed reinfection studies in CD4+ T lymphocyte-depleted RhCMV-seropositive dams inoculated in late first / early second trimester gestation with RhCMV strains 180.92 (n = 2), or RhCMV UCD52 and FL-RhCMVΔRh13.1/SIVgag, a wild-type-like RhCMV clone with SIVgag inserted as an immunological marker, administered separately (n = 3). An early transient increase in circulating monocytes followed by boosting of the pre-existing RhCMV-specific CD8+ T lymphocyte and antibody response was observed in the reinfected dams but not in control CD4+ T lymphocyte-depleted dams. Emergence of SIV Gag-specific CD8+ T lymphocyte responses in macaques inoculated with the FL-RhCMVΔRh13.1/SIVgag virus confirmed reinfection. Placental transmission was detected in only one of five reinfected dams and there were no adverse fetal sequelae. Viral whole genome, short-read, deep sequencing analysis confirmed transmission of both reinfection RhCMV strains across the placenta with ~30% corresponding to FL-RhCMVΔRh13.1/SIVgag and ~70% to RhCMV UCD52, consistent with the mixed human CMV infections reported in infants with cCMV. Our data showing reduced placental transmission and absence of fetal loss after non-primary as opposed to primary infection in CD4+ T lymphocyte-depleted dams indicate that preconception maternal CMV-specific CD8+ T lymphocyte and/or humoral immunity can protect against cCMV infection.


Subject(s)
Cytomegalovirus Infections , Cytomegalovirus , Infant, Newborn , Animals , Female , Pregnancy , Humans , Cytomegalovirus/genetics , Macaca mulatta , Reinfection , Placenta , Immunity, Innate
8.
Mol Biol Evol ; 40(5)2023 May 02.
Article in English | MEDLINE | ID: mdl-37128989

ABSTRACT

Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics (including quantifying the relative contributions of adaptive versus nonadaptive processes) but also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection (e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome (i.e., purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, although strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.


Subject(s)
Evolution, Molecular , Selection, Genetic , Humans , Bayes Theorem , Genetics, Population , Genomics , Models, Genetic
9.
bioRxiv ; 2023 Apr 11.
Article in English | MEDLINE | ID: mdl-37090533

ABSTRACT

Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics (including quantifying the relative contributions of adaptive vs. non-adaptive processes) but it is also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection (e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome (i.e., purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, while strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.

10.
bioRxiv ; 2023 Apr 10.
Article in English | MEDLINE | ID: mdl-37090643

ABSTRACT

Congenital cytomegalovirus (cCMV) is the leading infectious cause of neurologic defects in newborns with particularly severe sequelae in the setting of primary CMV infection in the first trimester of pregnancy. The majority of cCMV cases worldwide occur after non-primary infection in CMV-seropositive women; yet the extent to which pre-existing natural CMV-specific immunity protects against CMV reinfection or reactivation during pregnancy remains ill-defined. We previously reported on a novel nonhuman primate model of cCMV in rhesus macaques where 100% placental transmission and 83% fetal loss were seen in CD4+ T lymphocyte-depleted rhesus CMV (RhCMV)-seronegative dams after primary RhCMV infection. To investigate the protective effect of preconception maternal immunity, we performed reinfection studies in CD4+ T lymphocyte-depleted RhCMV-seropositive dams inoculated in late first / early second trimester gestation with RhCMV strains 180.92 (n = 2), or RhCMV UCD52 and FL-RhCMVΔRh13.1/SIVgag, a wild-type-like RhCMV clone with SIVgag inserted as an immunological marker (n = 3). An early transient increase in circulating monocytes followed by boosting of the pre-existing RhCMV-specific CD8+ T lymphocyte and antibody response was observed in the reinfected dams but not in control CD4+ T lymphocyte-depleted dams. Emergence of SIV Gag-specific CD8+ T lymphocyte responses in macaques inoculated with the FL-RhCMVΔRh13.1/SIVgag virus confirmed reinfection. Placental transmission was detected in only one of five reinfected dams and there were no adverse fetal sequelae. Viral whole genome, short-read, deep sequencing analysis confirmed transmission of both reinfection RhCMV strains across the placenta with ∼30% corresponding to FL-RhCMVΔRh13.1/SIVgag and ∼70% to RhCMV UCD52, consistent with the mixed human CMV infections reported in infants with cCMV. Our data showing reduced placental transmission and absence of fetal loss after non-primary as opposed to primary infection in CD4+ T lymphocyte-depleted dams indicate that preconception maternal CMV-specific CD8+ T lymphocyte and/or humoral immunity can protect against cCMV infection.
Author Summary: Globally, pregnancies in CMV-seropositive women account for the majority of cases of congenital CMV infection, but the immune responses needed for protection against placental transmission in mothers with non-primary infection remain unknown. Recently, we developed a nonhuman primate model of primary rhesus CMV (RhCMV) infection in which placental transmission and fetal loss occurred in RhCMV-seronegative CD4+ T lymphocyte-depleted macaques. By conducting similar studies in RhCMV-seropositive dams, we demonstrated the protective effect of pre-existing natural CMV-specific CD8+ T lymphocytes and humoral immunity against congenital CMV after reinfection. A 5-fold reduction in congenital transmission and complete protection against fetal loss were observed in dams with pre-existing immunity compared to primary CMV in this model. Our study is the first formal demonstration in a relevant model of human congenital CMV that natural pre-existing CMV-specific maternal immunity can limit congenital CMV transmission and its sequelae. The nonhuman primate model of non-primary congenital CMV will be especially relevant to studying immune requirements of a maternal vaccine for women in high CMV seroprevalence areas at risk of repeated CMV reinfections during pregnancy.

11.
PLoS Pathog ; 19(4): e1011265, 2023 Apr.
Article in English | MEDLINE | ID: mdl-37018331

ABSTRACT

Over the past 3 years, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has spread through human populations in several waves, resulting in a global health crisis. In response, genomic surveillance efforts have proliferated in the hopes of tracking and anticipating the evolution of this virus, resulting in millions of patient isolates now being available in public databases. Yet, while there is a tremendous focus on identifying newly emerging adaptive viral variants, this quantification is far from trivial. Specifically, multiple co-occurring and interacting evolutionary processes are constantly in operation and must be jointly considered and modeled in order to perform accurate inference. We here outline critical individual components of such an evolutionary baseline model (mutation rates, recombination rates, the distribution of fitness effects, infection dynamics, and compartmentalization) and describe the current state of knowledge pertaining to the related parameters of each in SARS-CoV-2. We close with a series of recommendations for future clinical sampling, model construction, and statistical analysis.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , Genomics
12.
Genome Biol Evol ; 15(4)2023 Apr 06.
Article in English | MEDLINE | ID: mdl-37071785

ABSTRACT

Human cytomegalovirus (HCMV) represents a major threat to human health, contributing to both birth defects in neonates as well as organ transplant failure and opportunistic infections in immunocompromised individuals. HCMV exhibits considerable interhost and intrahost diversity, which likely influences the pathogenicity of the virus. Therefore, understanding the relative contributions of various evolutionary forces in shaping patterns of variation is of critical importance both mechanistically and clinically. Herein, we present the individual components of an evolutionary baseline model for HCMV (including mutation and recombination rates, the distribution of fitness effects, infection dynamics, and compartmentalization), with a particular focus on congenital infections for the sake of illustration, and describe the current state of knowledge of each. By building this baseline model, researchers will be able to better describe the range of possible evolutionary scenarios contributing to observed variation as well as improve power and reduce false-positive rates when scanning for adaptive mutations in the HCMV genome.


Subject(s)
Cytomegalovirus Infections , Cytomegalovirus , Infant, Newborn , Humans , Cytomegalovirus/genetics , Cytomegalovirus Infections/genetics , Mutation , Biological Evolution
13.
Genome Biol Evol ; 15(3)2023 Mar 03.
Article in English | MEDLINE | ID: mdl-36790107

ABSTRACT

Recent studies have highlighted variation in the mutational spectra among human populations as well as closely related hominoids; yet little remains known about the genetic and nongenetic factors driving these rate changes across the genome. Pinpointing the root causes of these differences is an important endeavor that requires careful comparative analyses of population-specific mutational landscapes at both broad and fine genomic scales. However, several factors can confound such analyses. Although previous studies have shown that technical artifacts, such as sequencing errors and batch effects, can contribute to observed mutational shifts, other potentially confounding parameters have received less attention thus far. Using population genetic simulations of human and chimpanzee populations as an illustrative example, we here show that the sample size required for robust inference of mutational spectra depends on the population-specific demographic history. As a consequence, the power to detect rate changes is high in certain hominoid populations while, for others, currently available sample sizes preclude analyses at fine genomic scales.


Subject(s)
Hominidae , Pan troglodytes , Animals , Humans , Pan troglodytes/genetics , Sample Size , Mutation , Genetics, Population
14.
Microorganisms ; 11(1)2023 Jan 10.
Article in English | MEDLINE | ID: mdl-36677462

ABSTRACT

Bacteriophages are being widely harnessed as an alternative to antibiotics due to the global emergence of drug-resistant pathogens. To guide the usage of these bactericidal agents, characterization of their host specificity is vital; however, host range information remains limited for many bacteriophages. This is particularly the case for bacteriophages infecting the Microbacterium genus, despite their importance in agriculture, biomedicine, and biotechnology. Here, we elucidate the phylogenomic relationships between 125 Microbacterium cluster EA bacteriophages, including members from 11 sub-clusters (EA1 to EA11), and infer their putative host ranges using insights from codon usage bias patterns as well as predictions from both exploratory and confirmatory computational methods. Our computational analyses suggest that cluster EA bacteriophages have a shared infection history across the Microbacterium clade. Interestingly, bacteriophages of all sub-clusters exhibit codon usage preference patterns that resemble those of bacterial strains different from ones used for isolation, suggesting that they might be able to infect additional hosts. Furthermore, host range predictions indicate that certain sub-clusters may be better suited in prospective biotechnological and medical applications such as phage therapy.

15.
Microbiol Resour Announc ; 12(2): e0125122, 2023 Feb 16.
Article in English | MEDLINE | ID: mdl-36645290

ABSTRACT

We characterized the complete genome sequence of Chako, an obligate lytic bacteriophage with siphovirus morphology from subcluster EA1 that infects Microbacterium foliorum NRRL B-24224. Its 41.6-kb genome contains 62 putative protein-coding genes and is highly similar to that of bacteriophage HanSolo (99.26% nucleotide identity).

16.
Heredity (Edinb) ; 130(2): 55-63, 2023 Feb.
Article in English | MEDLINE | ID: mdl-36496447

ABSTRACT

High-throughput sequencing data enables the comprehensive study of genomes and the variation therein. Essential for the interpretation of this genomic data is a thorough understanding of the computational methods used for processing and analysis. Whereas "gold-standard" empirical datasets exist for this purpose in humans, synthetic (i.e., simulated) sequencing data can offer important insights into the capabilities and limitations of computational pipelines for any arbitrary species and/or study design; yet, the ability of read simulator software to emulate genomic characteristics of empirical datasets remains poorly understood. We here compare the performance of six popular short-read simulators (ART, DWGSIM, InSilicoSeq, Mason, NEAT, and wgsim) and discuss important considerations for selecting suitable models for benchmarking.


Subject(s)
Genomics , Software , Humans , Genomics/methods , Genome , High-Throughput Nucleotide Sequencing/methods , Benchmarking
17.
F1000Res ; 11: 530, 2022.
Article in English | MEDLINE | ID: mdl-36262335

ABSTRACT

In October 2021, 59 scientists from 14 countries and 13 U.S. states collaborated virtually in the Third Annual Baylor College of Medicine & DNAnexus Structural Variation hackathon. The goal of the hackathon was to advance research on structural variants (SVs) by prototyping and iterating on open-source software. This led to nine hackathon projects focused on diverse genomics research interests, including various SV discovery and genotyping methods, SV sequence reconstruction, and clinically relevant structural variation, including SARS-CoV-2 variants. Repositories for the projects that participated in the hackathon are available at https://github.com/collaborativebioinformatics.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , Genomics , Software
18.
G3 (Bethesda) ; 12(11)2022 Nov 04.
Article in English | MEDLINE | ID: mdl-36094333

ABSTRACT

Bacteriophages, infecting bacterial hosts in every environment on our planet, are a driver of adaptive evolution in bacterial communities. At the same time, the host range of many bacteriophages (and thus one of the selective pressures acting on complex microbial systems in nature) remains poorly characterized. Here, we computationally inferred the putative host ranges of 40 cluster P mycobacteriophages, including members from 6 subclusters (P1-P6). A series of comparative genomic analyses revealed that mycobacteriophages of subcluster P1 are restricted to the Mycobacterium genus, whereas mycobacteriophages of subclusters P2-P6 are likely also able to infect other genera, several of which are commonly associated with human disease. Further genomic analysis highlighted that the majority of cluster P mycobacteriophages harbor a conserved integration-dependent immunity system, hypothesized to be the ancestral state of a genetic switch that controls the shift between lytic and lysogenic life cycles, a temperate characteristic that impedes their usage in antibacterial applications.


Subject(s)
Bacteriophages , Mycobacteriophages , Humans , Mycobacteriophages/genetics , Phylogeny , Host Specificity/genetics , Genome, Viral , Bacteriophages/genetics
19.
Viruses ; 14(8)2022 Jul 27.
Article in English | MEDLINE | ID: mdl-36016269

ABSTRACT

Bacteriophages infecting bacteria of the genus Gordonia have increasingly gained interest in the scientific community for their diverse applications in agriculture, biotechnology, and medicine, ranging from biocontrol agents in wastewater management to the treatment of opportunistic pathogens in pulmonary disease patients. However, due to the time and costs associated with experimental isolation and cultivation, host ranges for many bacteriophages remain poorly characterized, hindering a more efficient usage of bacteriophages in these areas. Here, we perform a series of computational genomic inferences to predict the putative host ranges of all Gordonia cluster DR bacteriophages known to date. Our analyses suggest that BiggityBass (as well as several of its close relatives) is likely able to infect host bacteria from a wide range of genera, from Gordonia to Nocardia to Rhodococcus, making it a suitable candidate for future phage therapy and wastewater treatment strategies.


Subject(s)
Bacteriophages , Gordonia Bacterium , Bacteriophages/genetics , Genome, Viral , Genomics , Gordonia Bacterium/genetics , Humans , Phylogeny , Wastewater
20.
Microbiol Resour Announc ; 11(9): e0054022, 2022 Sep 15.
Article in English | MEDLINE | ID: mdl-35924939

ABSTRACT

We characterized the complete genome of the cluster P mycobacteriophage Phegasus. Its 47.5-kb genome contains 81 protein-coding genes, 36 of which could be assigned a putative function. Phegasus is most closely related to two subcluster P1 bacteriophages, Mangethe and Majeke, with an average nucleotide identity of 99.63% each.
