Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 27.167
Filter
1.
Wiley Interdiscip Rev RNA ; 15(3): e1852, 2024.
Article in English | MEDLINE | ID: mdl-38715192

ABSTRACT

Small RNAs (sRNAs) with sizes ranging from 15 to 50 nucleotides (nt) are critical regulators of gene expression control. Prior studies have shown that sRNAs are involved in a broad range of biological processes, such as organ development, tumorigenesis, and epigenomic regulation; however, emerging evidence unveils a hidden layer of diversity and complexity of endogenously encoded sRNAs profile in eukaryotic organisms, including novel types of sRNAs and the previously unknown post-transcriptional RNA modifications. This underscores the importance for accurate, unbiased detection of sRNAs in various cellular contexts. A multitude of high-throughput methods based on next-generation sequencing (NGS) are developed to decipher the sRNA expression and their modifications. Nonetheless, distinct from mRNA sequencing, the data from sRNA sequencing suffer frequent inconsistencies and high variations emanating from the adapter contaminations and RNA modifications, which overall skew the sRNA libraries. Here, we summarize the sRNA-sequencing approaches, and discuss the considerations and challenges for the strategies and methods of sRNA library construction. The pros and cons of sRNA sequencing have significant implications for implementing RNA fragment footprinting approaches, including CLIP-seq and Ribo-seq. We envision that this review can inspire novel improvements in small RNA sequencing and RNA fragment footprinting in future. This article is categorized under: RNA Evolution and Genomics > Computational Analyses of RNA RNA Processing > Processing of Small RNAs Regulatory RNAs/RNAi/Riboswitches > Biogenesis of Effector Small RNAs.


Subject(s)
RNA, Small Untranslated , RNA, Small Untranslated/genetics , RNA, Small Untranslated/metabolism , Gene Library , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, RNA/methods , Humans , Animals
2.
J Med Chem ; 67(10): 8141-8160, 2024 May 23.
Article in English | MEDLINE | ID: mdl-38728572

ABSTRACT

Human interleukin-1ß (IL-1ß) is a pro-inflammatory cytokine that plays a critical role in the regulation of the immune response and the development of various inflammatory diseases. In this publication, we disclose our efforts toward the discovery of IL-1ß binders that interfere with IL-1ß signaling. To this end, several technologies were used in parallel, including fragment-based screening (FBS), DNA-encoded library (DEL) technology, peptide discovery platform (PDP), and virtual screening. The utilization of distinct technologies resulted in the identification of new chemical entities exploiting three different sites on IL-1ß, all of them also inhibiting the interaction with the IL-1R1 receptor. Moreover, we identified lysine 103 of IL-1ß as a target residue suitable for the development of covalent, low-molecular-weight IL-1ß antagonists.


Subject(s)
Interleukin-1beta , Humans , Drug Discovery , Interleukin-1beta/metabolism , Ligands , Receptors, Interleukin-1 Type I/metabolism , Receptors, Interleukin-1 Type I/antagonists & inhibitors , Small Molecule Libraries/chemistry , Small Molecule Libraries/pharmacology , Structure-Activity Relationship , DNA/chemistry , Gene Library
3.
Parasit Vectors ; 17(1): 216, 2024 May 11.
Article in English | MEDLINE | ID: mdl-38734639

ABSTRACT

BACKGROUND: Mosquitoes pose a risk to human health worldwide, and correct species identification and detection of cryptic species are the most important keys for surveillance and control of mosquito vectors. In addition to traditional identification based on morphology, DNA barcoding has recently been widely used as a complementary tool for reliable identification of mosquito species. The main objective of this study was to create a reference DNA barcode library for the Croatian mosquito fauna, which should contribute to more accurate and faster identification of species, including cryptic species, and recognition of relevant vector species. METHODS: Sampling was carried out in three biogeographical regions of Croatia over six years (2017-2022). The mosquitoes were morphologically identified; molecular identification was based on the standard barcoding region of the mitochondrial COI gene and the nuclear ITS2 region, the latter to identify species within the Anopheles maculipennis complex. The BIN-RESL algorithm assigned the COI sequences to the corresponding BINs (Barcode Index Number clusters) in BOLD, i.e. to putative MOTUs (Molecular Operational Taxonomic Units). The bPTP and ASAP species delimitation methods were applied to the genus datasets in order to verify/confirm the assignment of specimens to specific MOTUs. RESULTS: A total of 405 mosquito specimens belonging to six genera and 30 morphospecies were collected and processed. Species delimitation methods assigned the samples to 31 (BIN-RESL), 30 (bPTP) and 28 (ASAP) MOTUs, with most delimited MOTUs matching the morphological identification. Some species of the genera Culex, Aedes and Anopheles were assigned to the same MOTUs, especially species that are difficult to distinguish morphologically and/or represent species complexes. In total, COI barcode sequences for 34 mosquito species and ITS2 sequences for three species of the genus Anopheles were added to the mosquito sequence database for Croatia, including one individual from the Intrudens Group, which represents a new record for the Croatian mosquito fauna. CONCLUSION: We present the results of the first comprehensive study combining morphological and molecular identification of most mosquito species present in Croatia, including several invasive and vector species. With the exception of some closely related species, this study confirmed that DNA barcoding based on COI provides a reliable basis for the identification of mosquito species in Croatia.


Subject(s)
Culicidae , DNA Barcoding, Taxonomic , Electron Transport Complex IV , Mosquito Vectors , Animals , Croatia , Mosquito Vectors/genetics , Mosquito Vectors/classification , Mosquito Vectors/anatomy & histology , Culicidae/classification , Culicidae/genetics , Electron Transport Complex IV/genetics , Anopheles/genetics , Anopheles/classification , Phylogeny , Gene Library
4.
Appl Microbiol Biotechnol ; 108(1): 329, 2024 May 10.
Article in English | MEDLINE | ID: mdl-38727750

ABSTRACT

Xylanases are key biocatalysts in the degradation of the ß-1,4-glycosidic linkages in the xylan backbone of hemicellulose. These enzymes are potentially applied in a wide range of bioprocessing industries under harsh conditions. Metagenomics has emerged as powerful tools for the bioprospection and discovery of interesting bioactive molecules from extreme ecosystems with unique features, such as high temperatures. In this study, an innovative combination of function-driven screening of a compost metagenomic library and automatic extraction of halo areas with in-house MATLAB functions resulted in the identification of a promising clone with xylanase activity (LP4). The LP4 clone proved to be an effective xylanase producer under submerged fermentation conditions. Sequence and phylogenetic analyses revealed that the xylanase, Xyl4, corresponded to an endo-1,4-ß-xylanase belonging to glycosyl hydrolase family 10 (GH10). When xyl4 was expressed in Escherichia coli BL21(DE3), the enzyme activity increased about 2-fold compared to the LP4 clone. To get insight on the interaction of the enzyme with the substrate and establish possible strategies to improve its activity, the structure of Xyl4 was predicted, refined, and docked with xylohexaose. Our data unveiled, for the first time, the relevance of the amino acids Glu133 and Glu238 for catalysis, and a close inspection of the catalytic site suggested that the replacement of Phe316 by a bulkier Trp may improve Xyl4 activity. Our current findings contribute to enhancing the catalytic performance of Xyl4 towards industrial applications. KEY POINTS: • A GH10 endo-1,4-ß-xylanase (Xyl4) was isolated from a compost metagenomic library • MATLAB's in-house functions were developed to identify the xylanase-producing clones • Computational analysis showed that Glu133 and Glu238 are crucial residues for catalysis.


Subject(s)
Composting , Endo-1,4-beta Xylanases , Escherichia coli , Metagenomics , Phylogeny , Endo-1,4-beta Xylanases/genetics , Endo-1,4-beta Xylanases/metabolism , Endo-1,4-beta Xylanases/chemistry , Endo-1,4-beta Xylanases/isolation & purification , Escherichia coli/genetics , Escherichia coli/metabolism , Metagenome , Gene Library , Soil Microbiology , Xylans/metabolism , Cloning, Molecular , Fermentation , Gene Expression , Molecular Docking Simulation
5.
Expert Opin Drug Discov ; 19(6): 725-740, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38753553

ABSTRACT

INTRODUCTION: The effectiveness of Fragment-based drug design (FBDD) for targeting challenging therapeutic targets has been hindered by two factors: the small library size and the complexity of the fragment-to-hit optimization process. The DNA-encoded library (DEL) technology offers a compelling and robust high-throughput selection approach to potentially address these limitations. AREA COVERED: In this review, the authors propose the viewpoint that the DEL technology matches perfectly with the concept of FBDD to facilitate hit discovery. They begin by analyzing the technical limitations of FBDD from a medicinal chemistry perspective and explain why DEL may offer potential solutions to these limitations. Subsequently, they elaborate in detail on how the integration of DEL with FBDD works. In addition, they present case studies involving both de novo hit discovery and full ligand discovery, especially for challenging therapeutic targets harboring broad drug-target interfaces. EXPERT OPINION: The future of DEL-based fragment discovery may be promoted by both technical advances and application scopes. From the technical aspect, expanding the chemical diversity of DEL will be essential to achieve success in fragment-based drug discovery. From the application scope side, DEL-based fragment discovery holds promise for tackling a series of challenging targets.


Subject(s)
DNA , Drug Design , Drug Discovery , Small Molecule Libraries , Drug Discovery/methods , Humans , Small Molecule Libraries/pharmacology , Ligands , Chemistry, Pharmaceutical/methods , Gene Library , High-Throughput Screening Assays/methods , Molecular Targeted Therapy , Animals
6.
Nat Commun ; 15(1): 3727, 2024 May 02.
Article in English | MEDLINE | ID: mdl-38697982

ABSTRACT

We report the de novo design of small (<20 kDa) and highly soluble synthetic intrinsically disordered proteins (SynIDPs) that confer solubility to a fusion partner with minimal effect on the activity of the fused protein. To identify highly soluble SynIDPs, we create a pooled gene-library utilizing a one-pot gene synthesis technology to create a large library of repetitive genes that encode SynIDPs. We identify three small (<20 kDa) and highly soluble SynIDPs from this gene library that lack secondary structure and have high solvation. Recombinant fusion of these SynIDPs to three known inclusion body forming proteins rescue their soluble expression and do not impede the activity of the fusion partner, thereby eliminating the need for removal of the SynIDP tag. These findings highlight the utility of SynIDPs as solubility tags, as they promote the soluble expression of proteins in E. coli and are small, unstructured proteins that minimally interfere with the biological activity of the fused protein.


Subject(s)
Escherichia coli , Intrinsically Disordered Proteins , Recombinant Fusion Proteins , Solubility , Intrinsically Disordered Proteins/metabolism , Intrinsically Disordered Proteins/chemistry , Intrinsically Disordered Proteins/genetics , Recombinant Fusion Proteins/metabolism , Recombinant Fusion Proteins/genetics , Recombinant Fusion Proteins/chemistry , Escherichia coli/genetics , Escherichia coli/metabolism , Gene Library , Inclusion Bodies/metabolism
7.
Viruses ; 16(5)2024 05 05.
Article in English | MEDLINE | ID: mdl-38793613

ABSTRACT

African swine fever virus (ASFV) is the causative agent of a severe and highly contagious viral disease affecting domestic and wild swine. The current ASFV pandemic strain has a high mortality rate, severely impacting pig production and, for countries suffering outbreaks, preventing the export of their pig products for international trade. Early detection and diagnosis of ASFV is necessary to control new outbreaks before the disease spreads rapidly. One of the rate-limiting steps to identify ASFV by next-generation sequencing platforms is library preparation. Here, we investigated the capability of the Oxford Nanopore Technologies' VolTRAX platform for automated DNA library preparation with downstream sequencing on Nanopore sequencing platforms as a proof-of-concept study to rapidly identify the strain of ASFV. Within minutes, DNA libraries prepared using VolTRAX generated near-full genome sequences of ASFV. Thus, our data highlight the use of the VolTRAX as a platform for automated library preparation, coupled with sequencing on the MinION Mk1C for field sequencing or GridION within a laboratory setting. These results suggest a proof-of-concept study that VolTRAX is an effective tool for library preparation that can be used for the rapid and real-time detection of ASFV.


Subject(s)
African Swine Fever Virus , African Swine Fever , Gene Library , Genome, Viral , High-Throughput Nucleotide Sequencing , African Swine Fever Virus/genetics , African Swine Fever Virus/isolation & purification , Animals , Swine , African Swine Fever/diagnosis , African Swine Fever/virology , High-Throughput Nucleotide Sequencing/methods , DNA, Viral/genetics , Sequence Analysis, DNA
8.
BMC Genomics ; 25(1): 475, 2024 May 14.
Article in English | MEDLINE | ID: mdl-38745120

ABSTRACT

BACKGROUND: Single nucleotide polymorphism (SNP) markers play significant roles in accelerating breeding and basic crop research. Several soybean SNP panels have been developed. However, there is still a lack of SNP panels for differentiating between wild and cultivated populations, as well as for detecting polymorphisms within both wild and cultivated populations. RESULTS: This study utilized publicly available resequencing data from over 3,000 soybean accessions to identify differentiating and highly conserved SNP and insertion/deletion (InDel) markers between wild and cultivated soybean populations. Additionally, a naturally occurring mutant gene library was constructed by analyzing large-effect SNPs and InDels in the population. CONCLUSION: The markers obtained in this study are associated with numerous genes governing agronomic traits, thus facilitating the evaluation of soybean germplasms and the efficient differentiation between wild and cultivated soybeans. The natural mutant gene library permits the quick identification of individuals with natural mutations in functional genes, providing convenience for accelerating soybean breeding using reverse genetics.


Subject(s)
Glycine max , INDEL Mutation , Polymorphism, Single Nucleotide , Glycine max/genetics , Genome, Plant , Gene Library , Plant Breeding
9.
Genome Biol Evol ; 16(4)2024 Apr 02.
Article in English | MEDLINE | ID: mdl-38597156

ABSTRACT

De novo genes emerge from previously noncoding stretches of the genome. Their encoded de novo proteins are generally expected to be similar to random sequences and, accordingly, with no stable tertiary fold and high predicted disorder. However, structural properties of de novo proteins and whether they differ during the stages of emergence and fixation have not been studied in depth and rely heavily on predictions. Here we generated a library of short human putative de novo proteins of varying lengths and ages and sorted the candidates according to their structural compactness and disorder propensity. Using Förster resonance energy transfer combined with Fluorescence-activated cell sorting, we were able to screen the library for most compact protein structures, as well as most elongated and flexible structures. We find that compact de novo proteins are on average slightly shorter and contain lower predicted disorder than less compact ones. The predicted structures for most and least compact de novo proteins correspond to expectations in that they contain more secondary structure content or higher disorder content, respectively. Our experiments indicate that older de novo proteins have higher compactness and structural propensity compared with young ones. We discuss possible evolutionary scenarios and their implications underlying the age-dependencies of compactness and structural content of putative de novo proteins.


Subject(s)
Protein Folding , Proteins , Humans , Proteins/genetics , Protein Structure, Secondary , Gene Library
10.
Methods Mol Biol ; 2744: 213-221, 2024.
Article in English | MEDLINE | ID: mdl-38683321

ABSTRACT

Double-digest restriction site-associated DNA sequencing is a library preparation protocol that enables capturing variable sites across the genome including single-nucleotide polymorphisms (SNPs). These SNPs can be utilized to gain evolutionary insights into patterns observed in DNA barcodes, to infer population structure and phylogenies, to detect gene flow and introgression, and to perform species delimitation analyses. The protocol includes chemically shearing genomic DNA with restriction enzymes, unique tagging, size selection, and amplification of the resulting DNA fragments. Here we provide a detailed description of each step of the protocol, as well as information on essential equipment and common issues encountered during laboratory work.


Subject(s)
DNA Barcoding, Taxonomic , Polymorphism, Single Nucleotide , Sequence Analysis, DNA , DNA Barcoding, Taxonomic/methods , Sequence Analysis, DNA/methods , High-Throughput Nucleotide Sequencing/methods , DNA/genetics , Gene Library , Humans
11.
Methods Mol Biol ; 2744: 223-238, 2024.
Article in English | MEDLINE | ID: mdl-38683322

ABSTRACT

DNA barcodes are useful in biodiversity research, but sequencing barcodes with dye termination methods ("Sanger sequencing") has been so time-consuming and expensive that DNA barcodes are not as widely used as they should be. Fortunately, MinION sequencers from Oxford Nanopore Technologies have recently emerged as a cost-effective and efficient alternative for barcoding. MinION barcodes are now suitable for large-scale species discovery and enable specimen identification when the target species are represented in barcode databases. With a MinION, it is possible to obtain 10,000 barcodes from a single flow cell at a cost of less than 0.10 USD per specimen. Additionally, a Flongle flow cell can be used for small projects requiring up to 300 barcodes (0.50 USD per specimen). We here describe a cost-effective laboratory workflow for obtaining tagged amplicons, preparing ONT libraries, sequencing amplicon pools, and analyzing the MinION reads with the software ONTbarcoder. This workflow has been shown to yield highly accurate barcodes that are 99.99% identical to Sanger barcodes. Overall, we propose that the use of MinION for DNA barcoding is an attractive option for all researchers in need of a cost-effective and efficient solution for large-scale species discovery and specimen identification.


Subject(s)
DNA Barcoding, Taxonomic , Nanopore Sequencing , DNA Barcoding, Taxonomic/methods , DNA Barcoding, Taxonomic/economics , Nanopore Sequencing/methods , Cost-Benefit Analysis , High-Throughput Nucleotide Sequencing/methods , High-Throughput Nucleotide Sequencing/economics , Software , Gene Library , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/economics , Workflow , DNA/genetics
12.
Methods Mol Biol ; 2744: 155-169, 2024.
Article in English | MEDLINE | ID: mdl-38683317

ABSTRACT

The article presents the several steps to be performed on a plant, fungal, insect, or soil sample to obtain DNA sequences for DNA barcode analysis. The chapter begins with a description of sample preparation including procedures for cleaning and proceeds to DNA extraction with methods adapted for the specific type of sample. Next, DNA quantification is described so the proper amount is used for the amplification of the selected barcode regions. Information is provided for reaction mixes and amplification conditions for several referenced barcode primer pairs tuned for the individual sample of interest. This is followed by a description of procedures to access the success of amplification, cleanup, and quantification of the product ready for either Sanger sequencing or library preparation for massive parallel sequencing (MPS). Finally, procedures are provided for Sanger sequencing, library preparation, and MPS sequencing. The chapter provides several references of barcode regions for different sample types.


Subject(s)
DNA Barcoding, Taxonomic , High-Throughput Nucleotide Sequencing , Plants , DNA Barcoding, Taxonomic/methods , High-Throughput Nucleotide Sequencing/methods , Animals , Plants/genetics , Insecta/genetics , Insecta/classification , Fungi/genetics , Fungi/classification , Sequence Analysis, DNA/methods , Gene Library , DNA/genetics
13.
Methods Mol Biol ; 2744: 445-473, 2024.
Article in English | MEDLINE | ID: mdl-38683335

ABSTRACT

Plant DNA barcoding has a multitude of applications ranging from species detection and biomonitoring to investigating ecological networks and checking food quality. The ability to accurately identify species, using DNA barcoding, depends on the quality and comprehensiveness of the reference library that is used. This chapter describes how to create plant reference libraries using the rbcL, matK, and ITS2 DNA barcode regions. It covers the creation of species lists, the collection of specimens from the field and herbarium, DNA extraction, PCR amplification, and DNA sequencing. This methodology gives special attention to using samples from herbaria, as they represent important collections of easily accessible, taxonomically verified plant material.


Subject(s)
DNA Barcoding, Taxonomic , DNA, Plant , Plants , DNA Barcoding, Taxonomic/methods , Plants/genetics , DNA, Plant/genetics , Polymerase Chain Reaction/methods , Sequence Analysis, DNA/methods , Gene Library
14.
Methods Mol Biol ; 2744: 491-502, 2024.
Article in English | MEDLINE | ID: mdl-38683337

ABSTRACT

All DNA barcode methods rely on reference sequences linked to well-curated voucher specimens. Definitions for and locations of DNA barcode reference libraries are not standardized, and vary throughout the literature. Standardizing, and centralizing reference specimens would provide an unambiguous source, analogous to reference genomes, to reproduce identifications and improve a library. This chapter proposes a working definition of a DNA barcode reference library, consistent with DNA barcode data standards, along with principles and methods to consider when producing or using such a library. These methods allow explicit traceback to sequence-sources which elevate the value of voucher specimens, and create a potential for community curation.


Subject(s)
DNA Barcoding, Taxonomic , Gene Library , DNA Barcoding, Taxonomic/methods , Reference Standards , DNA/genetics , Humans
15.
Microb Genom ; 10(4)2024 Apr.
Article in English | MEDLINE | ID: mdl-38578268

ABSTRACT

Background. PCR amplification is a necessary step in many next-generation sequencing (NGS) library preparation methods [1, 2]. Whilst many PCR enzymes are developed to amplify single targets efficiently, accurately and with specificity, few are developed to meet the challenges imposed by NGS PCR, namely unbiased amplification of a wide range of different sizes and GC content. As a result PCR amplification during NGS library prep often results in bias toward GC neutral and smaller fragments. As NGS has matured, optimized NGS library prep kits and polymerase formulations have emerged and in this study we have tested a wide selection of available enzymes for both short-read Illumina library preparation and long fragment amplification ahead of long-read sequencing.We tested over 20 different hi-fidelity PCR enzymes/NGS amplification mixes on a range of Illumina library templates of varying GC content and composition, and find that both yield and genome coverage uniformity characteristics of the commercially available enzymes varied dramatically. Three enzymes Quantabio RepliQa Hifi Toughmix, Watchmaker Library Amplification Hot Start Master Mix (2X) 'Equinox' and Takara Ex Premier were found to give a consistent performance, over all genomes, that mirrored closely that observed for PCR-free datasets. We also test a range of enzymes for long-read sequencing by amplifying size fractionated S. cerevisiae DNA of average size 21.6 and 13.4 kb, respectively.The enzymes of choice for short-read (Illumina) library fragment amplification are Quantabio RepliQa Hifi Toughmix, Watchmaker Library Amplification Hot Start Master Mix (2X) 'Equinox' and Takara Ex Premier, with RepliQa also being the best performing enzyme from the enzymes tested for long fragment amplification prior to long-read sequencing.


Subject(s)
DNA , Saccharomyces cerevisiae , Polymerase Chain Reaction/methods , Gene Library , High-Throughput Nucleotide Sequencing/methods
16.
Vet Microbiol ; 293: 110099, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38677125

ABSTRACT

Japanese encephalitis virus (JEV) is a pathogen with a substantial impact on both livestock and human health. However, the critical host factors in the virus life cycle remain poorly understood. Using a library comprising 123411 small guide RNAs (sgRNAs) targeting 19050 human genes, we conducted a genome-wide clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-based screen to identify essential genes for JEV replication. By employing knockout or knockdown techniques on genes, we identified eleven human genes crucial for JEV replication, such as prolactin releasing hormone receptor (PRLHR), activating signal cointegrator 1 complex subunit 3 (ASCC3), acyl-CoA synthetase long chain family member 3 (ACSL3), and others. Notably, we found that PRLHR knockdown blocked the autophagic flux, thereby inhibiting JEV infection. Taken together, these findings provide effective data for studying important host factors of JEV replication and scientific data for selecting antiviral drug targets.


Subject(s)
CRISPR-Cas Systems , Encephalitis Virus, Japanese , RNA, Guide, CRISPR-Cas Systems , Virus Replication , Virus Replication/genetics , Encephalitis Virus, Japanese/genetics , Encephalitis Virus, Japanese/physiology , Humans , RNA, Guide, CRISPR-Cas Systems/genetics , Gene Library , Animals , Host-Pathogen Interactions/genetics , Encephalitis, Japanese/virology , Cell Line , HEK293 Cells , Clustered Regularly Interspaced Short Palindromic Repeats
17.
Bioinformatics ; 40(4)2024 Mar 29.
Article in English | MEDLINE | ID: mdl-38569896

ABSTRACT

MOTIVATION: Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. RESULTS: Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or nonunique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues. AVAILABILITY AND IMPLEMENTATION: Pacybara, freely available at https://github.com/rothlab/pacybara, is implemented using R, Python, and bash for Linux. It runs on GNU/Linux HPC clusters via Slurm, PBS, or GridEngine schedulers. A single-machine simplex version is also available.


Subject(s)
High-Throughput Nucleotide Sequencing , Software , Sequence Analysis, DNA/methods , High-Throughput Nucleotide Sequencing/methods , Gene Library , Genotype , Cluster Analysis
18.
Nat Commun ; 15(1): 3577, 2024 Apr 27.
Article in English | MEDLINE | ID: mdl-38678031

ABSTRACT

Genetic interactions mediate the emergence of phenotype from genotype, but technologies for combinatorial genetic perturbation in mammalian cells are challenging to scale. Here, we identify background-independent paralog synthetic lethals from previous CRISPR genetic interaction screens, and find that the Cas12a platform provides superior sensitivity and assay replicability. We develop the in4mer Cas12a platform that uses arrays of four independent guide RNAs targeting the same or different genes. We construct a genome-scale library, Inzolia, that is ~30% smaller than a typical CRISPR/Cas9 library while also targeting ~4000 paralog pairs. Screens in cancer cells demonstrate discrimination of core and context-dependent essential genes similar to that of CRISPR/Cas9 libraries, as well as detection of synthetic lethal and masking/buffering genetic interactions between paralogs of various family sizes. Importantly, the in4mer platform offers a fivefold reduction in library size compared to other genetic interaction methods, substantially reducing the cost and effort required for these assays.


Subject(s)
Bacterial Proteins , CRISPR-Cas Systems , Endodeoxyribonucleases , Gene Knockout Techniques , Humans , Gene Knockout Techniques/methods , RNA, Guide, CRISPR-Cas Systems/genetics , Gene Library , Cell Line, Tumor , Genes, Essential , HEK293 Cells , Epistasis, Genetic , CRISPR-Associated Proteins/genetics , CRISPR-Associated Proteins/metabolism
19.
Diagn Microbiol Infect Dis ; 109(3): 116311, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38657353

ABSTRACT

The detection of patterns associated with the invasive form of Candida albicans, such as Candida albicans germ tube antibodies (CAGTA), is a useful complement to blood culture for Invasive Candidiasis (IC) diagnosis. As CAGTA are detected by a non-standardisable and non-automatable technique, a Candida albicans cDNA expression library was screened with CAGTA isolated from serum of an animal model of invasive candidiasis, and five protein targets were identified: hyphally regulated cell wall protein 1 (Hyr1), enolase 1 (Eno1), coatomer subunit gamma (Sec21), a metallo-aminopeptidase (Ape2) and cystathionine gamma-lyase (Cys3). Homology with proteins from other organisms rules out Cys3 as a good biomarker while Sec21 results suggest that it is not in the germ tubes surface but secreted to the external environment. Our analysis propose Ape2, Sec21 and a region of Hyr1 different from the one currently being studied for immunoprotection as potential biomarker candidates for the diagnosis of IC.


Subject(s)
Antibodies, Fungal , Candida albicans , Candidiasis, Invasive , Fungal Proteins , Gene Library , Candida albicans/genetics , Candidiasis, Invasive/diagnosis , Candidiasis, Invasive/microbiology , Animals , Fungal Proteins/genetics , Antibodies, Fungal/blood , Biomarkers/blood , Disease Models, Animal , Humans , Mice
20.
BMC Cancer ; 24(1): 490, 2024 Apr 17.
Article in English | MEDLINE | ID: mdl-38632528

ABSTRACT

BACKGROUND: Patients with rheumatologic preexisting autoimmune disease (PAD) have not been enrolled in clinical trials of immune checkpoint inhibitors (ICIs). Therefore, the risks and benefits of ICI therapy in such patients are unclear. Herein, we investigated the safety and efficacy of ICIs in rheumatologic PAD patients through a meta-analysis. METHODS: The PubMed, Cochrane Library, Embase and Web of Science databases were searched for additional studies. We analyzed the following data through Stata software: incidence of total irAEs (TirAEs), rate of flares, incidence of new on-set irAEs, rate of discontinuation, objective response rate (ORR) and disease control rate (DCR). RESULTS: We identified 23 articles including 643 patients with rheumatologic PAD. The pooled incidences of TirAEs, flares and new-onset irAEs were 64% (95% CI 55%-72%), 41% (95% CI 31%-50%), and 33% (95% CI 28%-38%), respectively. In terms of severity, the incidences were 7% (95% CI 2%-14%) for Grade 3-4 flares and 12% (95% CI 9%-15%) for Grade 3-4 new-onset irAEs. Patients with RA had a greater risk of flares than patients with other rheumatologic PADs did (RR = 1.35, 95% CI 1.03-1.77). The ORR and DCR were 30% and 44%, respectively. Baseline anti-rheumatic treatment was not significantly associated with the frequency of flares (RR = 1.05, 95% CI 0.63-1.77) or the ORR (RR = 0.45, 95% CI 0.12-1.69). CONCLUSIONS: Patients with rheumatologic PAD, particularly those with RA, are susceptible to relapse of their rheumatologic disease following ICI therapy. ICIs are also effective for treating rheumatologic PAD patients. PROSPECTIVE REGISTER OF SYSTEMATIC REVIEWS (PROSPERO): number CRD 42,023,439,702.


Subject(s)
Arthritis, Rheumatoid , Autoimmune Diseases , Neoplasms , Humans , Immune Checkpoint Inhibitors , Databases, Factual , Gene Library
SELECTION OF CITATIONS
SEARCH DETAIL
...