Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 215
Filter
1.
Nat Commun ; 15(1): 3699, 2024 May 02.
Article in English | MEDLINE | ID: mdl-38698035

ABSTRACT

In silico identification of viral anti-CRISPR proteins (Acrs) has relied largely on the guilt-by-association method using known Acrs or anti-CRISPR associated proteins (Acas) as the bait. However, the low number and limited spread of the characterized archaeal Acrs and Aca hinders our ability to identify Acrs using guilt-by-association. Here, based on the observation that the few characterized archaeal Acrs and Aca are transcribed immediately post viral infection, we hypothesize that these genes, and many other unidentified anti-defense genes (ADG), are under the control of conserved regulatory sequences including a strong promoter, which can be used to predict anti-defense genes in archaeal viruses. Using this consensus sequence based method, we identify 354 potential ADGs in 57 archaeal viruses and 6 metagenome-assembled genomes. Experimental validation identified a CRISPR subtype I-A inhibitor and the first virally encoded inhibitor of an archaeal toxin-antitoxin based immune system. We also identify regulatory proteins potentially akin to Acas that can facilitate further identification of ADGs combined with the guilt-by-association approach. These results demonstrate the potential of regulatory sequence analysis for extensive identification of ADGs in viruses of archaea and bacteria.


Subject(s)
Archaea , Archaeal Viruses , Archaeal Viruses/genetics , Archaea/genetics , Archaea/virology , Archaea/immunology , Promoter Regions, Genetic/genetics , Clustered Regularly Interspaced Short Palindromic Repeats/genetics , Regulatory Sequences, Nucleic Acid/genetics , Viral Proteins/genetics , Archaeal Proteins/genetics , Archaeal Proteins/metabolism , Metagenome/genetics , CRISPR-Associated Proteins/genetics , CRISPR-Associated Proteins/metabolism , CRISPR-Cas Systems/genetics
2.
Elife ; 132024 May 13.
Article in English | MEDLINE | ID: mdl-38739430

ABSTRACT

A comprehensive census of McrBC systems, among the most common forms of prokaryotic Type IV restriction systems, followed by phylogenetic analysis, reveals their enormous abundance in diverse prokaryotes and a plethora of genomic associations. We focus on a previously uncharacterized branch, which we denote coiled-coil nuclease tandems (CoCoNuTs) for their salient features: the presence of extensive coiled-coil structures and tandem nucleases. The CoCoNuTs alone show extraordinary variety, with three distinct types and multiple subtypes. All CoCoNuTs contain domains predicted to interact with translation system components, such as OB-folds resembling the SmpB protein that binds bacterial transfer-messenger RNA (tmRNA), YTH-like domains that might recognize methylated tmRNA, tRNA, or rRNA, and RNA-binding Hsp70 chaperone homologs, along with RNases, such as HEPN domains, all suggesting that the CoCoNuTs target RNA. Many CoCoNuTs might additionally target DNA, via McrC nuclease homologs. Additional restriction systems, such as Type I RM, BREX, and Druantia Type III, are frequently encoded in the same predicted superoperons. In many of these superoperons, CoCoNuTs are likely regulated by cyclic nucleotides, possibly, RNA fragments with cyclic termini, that bind associated CARF (CRISPR-Associated Rossmann Fold) domains. We hypothesize that the CoCoNuTs, together with the ancillary restriction factors, employ an echeloned defense strategy analogous to that of Type III CRISPR-Cas systems, in which an immune response eliminating virus DNA and/or RNA is launched first, but then, if it fails, an abortive infection response leading to PCD/dormancy via host RNA cleavage takes over.


All organisms, from animals to bacteria, are subject to genetic parasites, such as viruses and transposons. Genetic parasites are pieces of nucleic acids (DNA or RNA) that can use a cell's machinery to copy themselves at the expense of their hosts. This often leads to the host's demise, so organisms evolved many types of defense mechanisms. One of the most ancient and common forms of defense against viruses and transposons is the targeted restriction of nucleic acids, that is, deployment of host enzymes that can destroy or restrict nucleic acids containing specific sequence motifs or modifications. In bacteria, many of the restriction enzymes targeting parasitic genetic elements are formed by fusions of proteins from the so-called McrBC systems with a protein domain called EVE. EVE and other functionally similar domains are a part of proteins that recognize and bind modified bases in nucleic acids. Enzymes can use the ability of these specificity domains to bind modified bases to detect non-host nucleic acids. Bell et al. conducted a comprehensive computational search for McrBC systems and discovered a large and highly diverse branch of this family with unusual characteristic structural and functional domains. These features include regions that form long alpha-helices (coils) that coil with other alpha-helices (known as coiled-coils), as well as several distinct enzymatic domains that break down nucleic acids (known as nucleases). They call these systems CoCoNuTs (coiled-coiled nuclease tandems). All CoCoNuTs contain domains, including EVE-like ones, which are predicted to interact with components of the RNA-based systems responsible for producing proteins in the cell (translation), suggesting that the CoCoNuTs have an important impact on protein abundance and RNA metabolism. Bell et al.'s findings will be of interest to scientists working on prokaryotic immunity and virulence. Furthermore, similarities between CoCoNuTs and components of eukaryotic RNA-degrading systems suggest evolutionary connections between this diverse family of bacterial predicted RNA restriction systems and RNA regulatory pathways of eukaryotes. Further deciphering the mechanisms of CoCoNuTs could shed light on how certain pathways of RNA metabolism and regulation evolved, and how they may contribute to advances in biotechnology.


Subject(s)
RNA, Bacterial , RNA, Bacterial/metabolism , RNA, Bacterial/chemistry , RNA, Bacterial/genetics , Phylogeny , Bacterial Proteins/metabolism , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Bacteria/genetics , Bacteria/metabolism , RNA/metabolism , RNA/genetics , RNA/chemistry
3.
Sci Adv ; 10(17): eadl0164, 2024 Apr 26.
Article in English | MEDLINE | ID: mdl-38657076

ABSTRACT

Type VI CRISPR-Cas systems are among the few CRISPR varieties that target exclusively RNA. The CRISPR RNA-guided, sequence-specific binding of target RNAs, such as phage transcripts, activates the type VI effector, Cas13. Once activated, Cas13 causes collateral RNA cleavage, which induces bacterial cell dormancy, thus protecting the host population from the phage spread. We show here that the principal form of collateral RNA degradation elicited by Leptotrichia shahii Cas13a expressed in Escherichia coli cells is the cleavage of anticodons in a subset of transfer RNAs (tRNAs) with uridine-rich anticodons. This tRNA cleavage is accompanied by inhibition of protein synthesis, thus providing defense from the phages. In addition, Cas13a-mediated tRNA cleavage indirectly activates the RNases of bacterial toxin-antitoxin modules cleaving messenger RNA, which could provide a backup defense. The mechanism of Cas13a-induced antiphage defense resembles that of bacterial anticodon nucleases, which is compatible with the hypothesis that type VI effectors evolved from an abortive infection module encompassing an anticodon nuclease.


Subject(s)
Anticodon , CRISPR-Cas Systems , Escherichia coli , RNA, Transfer , RNA, Transfer/genetics , RNA, Transfer/metabolism , Anticodon/genetics , Escherichia coli/genetics , Escherichia coli/metabolism , Leptotrichia/genetics , Leptotrichia/metabolism , CRISPR-Associated Proteins/metabolism , CRISPR-Associated Proteins/genetics , Bacteriophages/genetics , RNA Cleavage
4.
Nat Microbiol ; 9(3): 712-726, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38443574

ABSTRACT

Cell division is fundamental to all cellular life. Most archaea depend on either the prokaryotic tubulin homologue FtsZ or the endosomal sorting complex required for transport for division but neither system has been robustly characterized. Here, we show that three of the four photosynthesis reaction centre barrel domain proteins of Haloferax volcanii (renamed cell division proteins B1/2/3 (CdpB1/2/3)) play important roles in cell division. CdpB1 interacts directly with the FtsZ membrane anchor SepF and is essential for cell division, whereas deletion of cdpB2 and cdpB3 causes a major and a minor division defect, respectively. Orthologues of CdpB proteins are also involved in cell division in other haloarchaea, indicating a conserved function of these proteins. Phylogenetic analysis shows that photosynthetic reaction centre barrel proteins are widely distributed among archaea and appear to be central to cell division in most if not all archaea.


Subject(s)
Haloferax volcanii , Photosynthetic Reaction Center Complex Proteins , Phylogeny , Cell Division , Haloferax volcanii/genetics , Photosynthesis
5.
mBio ; 15(3): e0033524, 2024 Mar 13.
Article in English | MEDLINE | ID: mdl-38380930

ABSTRACT

Endosomal sorting complexes required for transport (ESCRT) play key roles in protein sorting between membrane-bounded compartments of eukaryotic cells. Homologs of many ESCRT components are identifiable in various groups of archaea, especially in Asgardarchaeota, the archaeal phylum that is currently considered to include the closest relatives of eukaryotes, but not in bacteria. We performed a comprehensive search for ESCRT protein homologs in archaea and reconstructed ESCRT evolution using the phylogenetic tree of Vps4 ATPase (ESCRT IV) as a scaffold and using sensitive protein sequence analysis and comparison of structural models to identify previously unknown ESCRT proteins. Several distinct groups of ESCRT systems in archaea outside of Asgard were identified, including proteins structurally similar to ESCRT-I and ESCRT-II, and several other domains involved in protein sorting in eukaryotes, suggesting an early origin of these components. Additionally, distant homologs of CdvA proteins were identified in Thermoproteales which are likely components of the uncharacterized cell division system in these archaea. We propose an evolutionary scenario for the origin of eukaryotic and Asgard ESCRT complexes from ancestral building blocks, namely, the Vps4 ATPase, ESCRT-III components, wH (winged helix-turn-helix fold) and possibly also coiled-coil, and Vps28-like domains. The last archaeal common ancestor likely encompassed a complex ESCRT system that was involved in protein sorting. Subsequent evolution involved either simplification, as in the TACK superphylum, where ESCRT was co-opted for cell division, or complexification as in Asgardarchaeota. In Asgardarchaeota, the connection between ESCRT and the ubiquitin system that was previously considered a eukaryotic signature was already established.IMPORTANCEAll eukaryotic cells possess complex intracellular membrane organization. Endosomal sorting complexes required for transport (ESCRT) play a central role in membrane remodeling which is essential for cellular functionality in eukaryotes. Recently, it has been shown that Asgard archaea, the archaeal phylum that includes the closest known relatives of eukaryotes, encode homologs of many components of the ESCRT systems. We employed protein sequence and structure comparisons to reconstruct the evolution of ESCRT systems in archaea and identified several previously unknown homologs of ESCRT subunits, some of which can be predicted to participate in cell division. The results of this reconstruction indicate that the last archaeal common ancestor already encoded a complex ESCRT system that was involved in protein sorting. In Asgard archaea, ESCRT systems evolved toward greater complexity, and in particular, the connection between ESCRT and the ubiquitin system that was previously considered a eukaryotic signature was established.


Subject(s)
Archaea , Endosomal Sorting Complexes Required for Transport , Endosomal Sorting Complexes Required for Transport/metabolism , Phylogeny , Amino Acid Sequence , Archaea/metabolism , Adenosine Triphosphatases/metabolism , Ubiquitins/metabolism
7.
mBio ; 15(2): e0309223, 2024 Feb 14.
Article in English | MEDLINE | ID: mdl-38189270

ABSTRACT

The identification of microbial genes essential for survival as those with lethal knockout phenotype (LKP) is a common strategy for functional interrogation of genomes. However, interpretation of the LKP is complicated because a substantial fraction of the genes with this phenotype remains poorly functionally characterized. Furthermore, many genes can exhibit LKP not because their products perform essential cellular functions but because their knockout activates the toxicity of other genes (conditionally essential genes). We analyzed the sets of LKP genes for two archaea, Methanococcus maripaludis and Sulfolobus islandicus, using a variety of computational approaches aiming to differentiate between essential and conditionally essential genes and to predict at least a general function for as many of the proteins encoded by these genes as possible. This analysis allowed us to predict the functions of several LKP genes including previously uncharacterized subunit of the GINS protein complex with an essential function in genome replication and of the KEOPS complex that is responsible for an essential tRNA modification as well as GRP protease implicated in protein quality control. Additionally, several novel antitoxins (conditionally essential genes) were predicted, and this prediction was experimentally validated by showing that the deletion of these genes together with the adjacent genes apparently encoding the cognate toxins caused no growth defect. We applied principal component analysis based on sequence and comparative genomic features showing that this approach can separate essential genes from conditionally essential ones and used it to predict essential genes in other archaeal genomes.IMPORTANCEOnly a relatively small fraction of the genes in any bacterium or archaeon is essential for survival as demonstrated by the lethal effect of their disruption. The identification of essential genes and their functions is crucial for understanding fundamental cell biology. However, many of the genes with a lethal knockout phenotype remain poorly functionally characterized, and furthermore, many genes can exhibit this phenotype not because their products perform essential cellular functions but because their knockout activates the toxicity of other genes. We applied state-of-the-art computational methods to predict the functions of a number of uncharacterized genes with the lethal knockout phenotype in two archaeal species and developed a computational approach to predict genes involved in essential functions. These findings advance the current understanding of key functionalities of archaeal cells.


Subject(s)
Archaea , Archaeal Proteins , Archaea/genetics , Archaea/metabolism , Genes, Essential , Genome, Archaeal , Genomics , Phenotype , Archaeal Proteins/genetics , Archaeal Proteins/metabolism
8.
Nat Commun ; 15(1): 317, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-38182597

ABSTRACT

Thermus thermophilus bacteriophage P23-45 encodes a giant 5,002-residue tail tape measure protein (TMP) that defines the length of its extraordinarily long tail. Here, we show that the N-terminal portion of P23-45 TMP is an unusual RNA polymerase (RNAP) homologous to cellular RNAPs. The TMP-fused virion RNAP transcribes pre-early phage genes, including a gene that encodes another, non-virion RNAP, that transcribes early and some middle phage genes. We report the crystal structures of both P23-45 RNAPs. The non-virion RNAP has a crab-claw-like architecture. By contrast, the virion RNAP adopts a unique flat structure without a clamp. Structure and sequence comparisons of the P23-45 RNAPs with other RNAPs suggest that, despite the extensive functional differences, the two P23-45 RNAPs originate from an ancient gene duplication in an ancestral phage. Our findings demonstrate striking adaptability of RNAPs that can be attained within a single virus species.


Subject(s)
Bacteriophages , Pyridinolcarbamate , Virion/genetics , Bacteriophages/genetics , Bacterial Typing Techniques , DNA-Directed RNA Polymerases/genetics
9.
bioRxiv ; 2024 Feb 10.
Article in English | MEDLINE | ID: mdl-37790407

ABSTRACT

A comprehensive census of McrBC systems, among the most common forms of prokaryotic Type IV restriction systems, followed by phylogenetic analysis, reveals their enormous abundance in diverse prokaryotes and a plethora of genomic associations. We focus on a previously uncharacterized branch, which we denote CoCoNuTs (coiled-coil nuclease tandems) for their salient features: the presence of extensive coiled-coil structures and tandem nucleases. The CoCoNuTs alone show extraordinary variety, with 3 distinct types and multiple subtypes. All CoCoNuTs contain domains predicted to interact with translation system components, such as OB-folds resembling the SmpB protein that binds bacterial transfer-messenger RNA (tmRNA), YTH-like domains that might recognize methylated tmRNA, tRNA, or rRNA, and RNA-binding Hsp70 chaperone homologs, along with RNases, such as HEPN domains, all suggesting that the CoCoNuTs target RNA. Many CoCoNuTs might additionally target DNA, via McrC nuclease homologs. Additional restriction systems, such as Type I RM, BREX, and Druantia Type III, are frequently encoded in the same predicted superoperons. In many of these superoperons, CoCoNuTs are likely regulated by cyclic nucleotides, possibly, RNA fragments with cyclic termini, that bind associated CARF (CRISPR-Associated Rossmann Fold) domains. We hypothesize that the CoCoNuTs, together with the ancillary restriction factors, employ an echeloned defense strategy analogous to that of Type III CRISPR-Cas systems, in which an immune response eliminating virus DNA and/or RNA is launched first, but then, if it fails, an abortive infection response leading to PCD/dormancy via host RNA cleavage takes over.

10.
Proc Natl Acad Sci U S A ; 120(48): e2308224120, 2023 Nov 28.
Article in English | MEDLINE | ID: mdl-37983496

ABSTRACT

The TnpB proteins are transposon-associated RNA-guided nucleases that are among the most abundant proteins encoded in bacterial and archaeal genomes, but whose functions in the transposon life cycle remain unknown. TnpB appears to be the evolutionary ancestor of Cas12, the effector nuclease of type V CRISPR-Cas systems. We performed a comprehensive census of TnpBs in archaeal and bacterial genomes and constructed a phylogenetic tree on which we mapped various features of these proteins. In multiple branches of the tree, the catalytic site of the TnpB nuclease is rearranged, demonstrating structural and probably biochemical malleability of this enzyme. We identified numerous cases of apparent recruitment of TnpB for other functions of which the most common is the evolution of type V CRISPR-Cas effectors on about 50 independent occasions. In many other cases of more radical exaptation, the catalytic site of the TnpB nuclease is apparently inactivated, suggesting a regulatory function, whereas in others, the activity appears to be retained, indicating that the recruited TnpB functions as a nuclease, for example, as a toxin. These findings demonstrate remarkable evolutionary malleability of the TnpB scaffold and provide extensive opportunities for further exploration of RNA-guided biological systems as well as multiple applications.


Subject(s)
Bacteria , Ribonucleases , Ribonucleases/metabolism , Phylogeny , Bacteria/metabolism , Archaea/metabolism , Endonucleases/metabolism , CRISPR-Cas Systems , RNA
11.
Science ; 382(6673): eadi1910, 2023 11 24.
Article in English | MEDLINE | ID: mdl-37995242

ABSTRACT

Microbial systems underpin many biotechnologies, including CRISPR, but the exponential growth of sequence databases makes it difficult to find previously unidentified systems. In this work, we develop the fast locality-sensitive hashing-based clustering (FLSHclust) algorithm, which performs deep clustering on massive datasets in linearithmic time. We incorporated FLSHclust into a CRISPR discovery pipeline and identified 188 previously unreported CRISPR-linked gene modules, revealing many additional biochemical functions coupled to adaptive immunity. We experimentally characterized three HNH nuclease-containing CRISPR systems, including the first type IV system with a specified interference mechanism, and engineered them for genome editing. We also identified and characterized a candidate type VII system, which we show acts on RNA. This work opens new avenues for harnessing CRISPR and for the broader exploration of the vast functional diversity of microbial proteins.


Subject(s)
CRISPR-Associated Proteins , CRISPR-Cas Systems , Data Mining , Gene Editing , CRISPR-Cas Systems/genetics , Humans , HEK293 Cells , Cluster Analysis , Algorithms , CRISPR-Associated Proteins/chemistry , CRISPR-Associated Proteins/classification , CRISPR-Associated Proteins/genetics , DNA Cleavage , RNA, Guide, CRISPR-Cas Systems , Datasets as Topic , Data Mining/methods
12.
CRISPR J ; 6(3): 222-231, 2023 06.
Article in English | MEDLINE | ID: mdl-37272863

ABSTRACT

Many bacterial and archaeal viruses encode anti-CRISPR proteins (Acrs) that specifically inhibit CRISPR-Cas systems via various mechanisms. The majority of the Acrs are small, non-enzymatic proteins that abrogate CRISPR activity by binding to Cas effector proteins. The Acrs evolve fast, due to the arms race with the respective CRISPR-Cas systems, which hampers the elucidation of their evolutionary origins by sequence comparison. We performed comprehensive structural modeling using AlphaFold2 for 3693 experimentally characterized and predicted Acrs, followed by a comparison to the protein structures in the Protein Data Bank database. After clustering the Acrs by sequence similarity, 363 high-quality structural models were obtained that accounted for 102 Acr families. Structure comparisons allowed the identification of homologs for 13 of these families that could be ancestors of the Acrs. Despite the limited extent of structural conservation, the inferred origins of Acrs show distinct trends, in particular, recruitment of toxins and antitoxins and SOS repair system components for the Acr function.


Subject(s)
CRISPR-Associated Protein 9 , CRISPR-Cas Systems , Humans , CRISPR-Cas Systems/genetics , CRISPR-Associated Protein 9/genetics , Gene Editing , Bacteria/metabolism , Viral Proteins/genetics , Viral Proteins/metabolism
13.
Mol Cell ; 83(12): 2122-2136.e10, 2023 Jun 15.
Article in English | MEDLINE | ID: mdl-37267947

ABSTRACT

To spread, transposons must integrate into target sites without disruption of essential genes while avoiding host defense systems. Tn7-like transposons employ multiple mechanisms for target-site selection, including protein-guided targeting and, in CRISPR-associated transposons (CASTs), RNA-guided targeting. Combining phylogenomic and structural analyses, we conducted a broad survey of target selectors, revealing diverse mechanisms used by Tn7 to recognize target sites, including previously uncharacterized target-selector proteins found in newly discovered transposable elements (TEs). We experimentally characterized a CAST I-D system and a Tn6022-like transposon that uses TnsF, which contains an inactivated tyrosine recombinase domain, to target the comM gene. Additionally, we identified a non-Tn7 transposon, Tsy, encoding a homolog of TnsF with an active tyrosine recombinase domain, which we show also inserts into comM. Our findings show that Tn7 transposons employ modular architecture and co-opt target selectors from various sources to optimize target selection and drive transposon spread.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , DNA Transposable Elements , Plasmids , DNA Transposable Elements/genetics , Recombinases/genetics , Tyrosine/genetics
14.
Nucleic Acids Res ; 51(15): 8150-8168, 2023 08 25.
Article in English | MEDLINE | ID: mdl-37283088

ABSTRACT

CRISPR-cas loci typically contain CRISPR arrays with unique spacers separating direct repeats. Spacers along with portions of adjacent repeats are transcribed and processed into CRISPR(cr) RNAs that target complementary sequences (protospacers) in mobile genetic elements, resulting in cleavage of the target DNA or RNA. Additional, standalone repeats in some CRISPR-cas loci produce distinct cr-like RNAs implicated in regulatory or other functions. We developed a computational pipeline to systematically predict crRNA-like elements by scanning for standalone repeat sequences that are conserved in closely related CRISPR-cas loci. Numerous crRNA-like elements were detected in diverse CRISPR-Cas systems, mostly, of type I, but also subtype V-A. Standalone repeats often form mini-arrays containing two repeat-like sequence separated by a spacer that is partially complementary to promoter regions of cas genes, in particular cas8, or cargo genes located within CRISPR-Cas loci, such as toxins-antitoxins. We show experimentally that a mini-array from a type I-F1 CRISPR-Cas system functions as a regulatory guide. We also identified mini-arrays in bacteriophages that could abrogate CRISPR immunity by inhibiting effector expression. Thus, recruitment of CRISPR effectors for regulatory functions via spacers with partial complementarity to the target is a common feature of diverse CRISPR-Cas systems.


Subject(s)
CRISPR-Cas Systems , RNA , Repetitive Sequences, Nucleic Acid
15.
bioRxiv ; 2023 Mar 28.
Article in English | MEDLINE | ID: mdl-37090588

ABSTRACT

Cell division is fundamental to all cellular life. Most of the archaea employ one of two alternative division machineries, one centered around the prokaryotic tubulin homolog FtsZ and the other around the endosomal sorting complex required for transport (ESCRT). However, neither of these mechanisms has been thoroughly characterized in archaea. Here, we show that three of the four PRC (Photosynthetic Reaction Center) barrel domain proteins of Haloferax volcanii (renamed Cell division proteins B1/2/3 (CdpB1/2/3)), play important roles in division. CdpB1 interacts directly with the FtsZ membrane anchor SepF and is essential for division, whereas deletion of cdpB2 and cdpB3 causes a major and a minor division defect, respectively. Orthologs of CdpB proteins are also involved in cell division in other haloarchaea. Phylogenetic analysis shows that PRC barrel proteins are widely distributed among archaea, including the highly conserved CdvA protein of the crenarchaeal ESCRT-based division system. Thus, diverse PRC barrel proteins appear to be central to cell division in most if not all archaea. Further study of these proteins is expected to elucidate the division mechanisms in archaea and their evolution.

16.
bioRxiv ; 2023 Mar 03.
Article in English | MEDLINE | ID: mdl-37090614

ABSTRACT

CRISPR- cas loci typically contain CRISPR arrays with unique spacers separating direct repeats. Spacers along with portions of adjacent repeats are transcribed and processed into CRISPR(cr) RNAs that target complementary sequences (protospacers) in mobile genetic elements, resulting in cleavage of the target DNA or RNA. Additional, standalone repeats in some CRISPR- cas loci produce distinct cr-like RNAs implicated in regulatory or other functions. We developed a computational pipeline to systematically predict crRNA-like elements by scanning for standalone repeat sequences that are conserved in closely related CRISPR- cas loci. Numerous crRNA-like elements were detected in diverse CRISPR-Cas systems, mostly, of type I, but also subtype V-A. Standalone repeats often form mini-arrays containing two repeat-like sequence separated by a spacer that is partially complementary to promoter regions of cas genes, in particular cas8 , or cargo genes located within CRISPR-Cas loci, such as toxins-antitoxins. We show experimentally that a mini-array from a type I-F1 CRISPR-Cas system functions as a regulatory guide. We also identified mini-arrays in bacteriophages that could abrogate CRISPR immunity by inhibiting effector expression. Thus, recruitment of CRISPR effectors for regulatory functions via spacers with partial complementarity to the target is a common feature of diverse CRISPR-Cas systems.

17.
Proc Natl Acad Sci U S A ; 120(16): e2300154120, 2023 04 18.
Article in English | MEDLINE | ID: mdl-37036997

ABSTRACT

The evolution of genomes in all life forms involves two distinct, dynamic types of genomic changes: gene duplication (and loss) that shape families of paralogous genes and extension (and contraction) of low-complexity regions (LCR), which occurs through dynamics of short repeats in protein-coding genes. Although the roles of each of these types of events in genome evolution have been studied, their co-evolutionary dynamics is not thoroughly understood. Here, by analyzing a wide range of genomes from diverse bacteria and archaea, we show that LCR and paralogy represent two distinct routes of evolution that are inversely correlated. The emergence of LCR is a prominent evolutionary mechanism in fast evolving, young protein families, whereas paralogy dominates the comparatively slow evolution of old protein families. The analysis of multiple prokaryotic genomes shows that the formation of LCR is likely a widespread, transient evolutionary mechanism that temporally and locally affects also ancestral functions, but apparently, fades away with time, under mutational and selective pressures, yielding to gene paralogy. We propose that compensatory relationships between short-term and longer-term evolutionary mechanisms are universal in the evolution of life.


Subject(s)
Evolution, Molecular , Prokaryotic Cells , Phylogeny , Bacteria/genetics , Archaea/genetics
18.
Nat Commun ; 14(1): 1827, 2023 04 01.
Article in English | MEDLINE | ID: mdl-37005419

ABSTRACT

Several groups of bacteria have complex life cycles involving cellular differentiation and multicellular structures. For example, actinobacteria of the genus Streptomyces form multicellular vegetative hyphae, aerial hyphae, and spores. However, similar life cycles have not yet been described for archaea. Here, we show that several haloarchaea of the family Halobacteriaceae display a life cycle resembling that of Streptomyces bacteria. Strain YIM 93972 (isolated from a salt marsh) undergoes cellular differentiation into mycelia and spores. Other closely related strains are also able to form mycelia, and comparative genomic analyses point to gene signatures (apparent gain or loss of certain genes) that are shared by members of this clade within the Halobacteriaceae. Genomic, transcriptomic and proteomic analyses of non-differentiating mutants suggest that a Cdc48-family ATPase might be involved in cellular differentiation in strain YIM 93972. Additionally, a gene encoding a putative oligopeptide transporter from YIM 93972 can restore the ability to form hyphae in a Streptomyces coelicolor mutant that carries a deletion in a homologous gene cluster (bldKA-bldKE), suggesting functional equivalence. We propose strain YIM 93972 as representative of a new species in a new genus within the family Halobacteriaceae, for which the name Actinoarchaeum halophilum gen. nov., sp. nov. is herewith proposed. Our demonstration of a complex life cycle in a group of haloarchaea adds a new dimension to our understanding of the biological diversity and environmental adaptation of archaea.


Subject(s)
Halobacteriaceae , Streptomyces , Hyphae/genetics , Proteomics , Phylogeny , RNA, Ribosomal, 16S/genetics , Streptomyces/genetics , Halobacteriaceae/genetics , Spores , Cell Differentiation , Sequence Analysis, DNA , China
19.
J Mol Biol ; 435(7): 168036, 2023 04 01.
Article in English | MEDLINE | ID: mdl-36868398

ABSTRACT

Numerous viruses infecting bacteria and archaea encode CRISPR-Cas system inhibitors, known as anti-CRISPR proteins (Acr). The Acrs typically are highly specific for particular CRISPR variants, resulting in remarkable sequence and structural diversity and complicating accurate prediction and identification of Acrs. In addition to their intrinsic interest for understanding the coevolution of defense and counter-defense systems in prokaryotes, Acrs could be natural, potent on-off switches for CRISPR-based biotechnological tools, so their discovery, characterization and application are of major importance. Here we discuss the computational approaches for Acr prediction. Due to the enormous diversity and likely multiple origins of the Acrs, sequence similarity searches are of limited use. However, multiple features of protein and gene organization have been successfully harnessed to this end including small protein size and distinct amino acid compositions of the Acrs, association of acr genes in virus genomes with genes encoding helix-turn-helix proteins that regulate Acr expression (Acr-associated proteins, Aca), and presence of self-targeting CRISPR spacers in bacterial and archaeal genomes containing Acr-encoding proviruses. Productive approaches for Acr prediction also involve genome comparison of closely related viruses, of which one is resistant and the other one is sensitive to a particular CRISPR variant, and "guilt by association" whereby genes adjacent to a homolog of a known Aca are identified as candidate Acrs. The distinctive features of Acrs are employed for Acr prediction both by developing dedicated search algorithms and through machine learning. New approaches will be needed to identify novel types of Acrs that are likely to exist.


Subject(s)
Archaea , Bacteria , Bacteriophages , CRISPR-Associated Protein 9 , CRISPR-Cas Systems , Viral Proteins , Archaea/genetics , Archaea/virology , Bacteria/genetics , Bacteria/virology , CRISPR-Associated Protein 9/metabolism , CRISPR-Cas Systems/genetics , Viral Proteins/chemistry , Viral Proteins/genetics , Computer Simulation , Bacteriophages/genetics , Bacteriophages/metabolism
20.
Nat Commun ; 13(1): 6719, 2022 11 07.
Article in English | MEDLINE | ID: mdl-36344504

ABSTRACT

Transposon-encoded IscB family proteins are RNA-guided nucleases in the OMEGA (obligate mobile element-guided activity) system, and likely ancestors of the RNA-guided nuclease Cas9 in the type II CRISPR-Cas adaptive immune system. IscB associates with its cognate ωRNA to form a ribonucleoprotein complex that cleaves double-stranded DNA targets complementary to an ωRNA guide segment. Although IscB shares the RuvC and HNH endonuclease domains with Cas9, it is much smaller than Cas9, mainly due to the lack of the α-helical nucleic-acid recognition lobe. Here, we report the cryo-electron microscopy structure of an IscB protein from the human gut metagenome (OgeuIscB) in complex with its cognate ωRNA and a target DNA, at 2.6-Å resolution. This high-resolution structure reveals the detailed architecture of the IscB-ωRNA ribonucleoprotein complex, and shows how the small IscB protein assembles with the ωRNA and mediates RNA-guided DNA cleavage. The large ωRNA scaffold structurally and functionally compensates for the recognition lobe of Cas9, and participates in the recognition of the guide RNA-target DNA heteroduplex. These findings provide insights into the mechanism of the programmable DNA cleavage by the IscB-ωRNA complex and the evolution of the type II CRISPR-Cas9 effector complexes.


Subject(s)
CRISPR-Cas Systems , RNA, Guide, Kinetoplastida , Humans , Cryoelectron Microscopy , RNA, Guide, Kinetoplastida/metabolism , Endonucleases/metabolism , RNA/metabolism , DNA/metabolism , Ribonucleoproteins/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...