ABSTRACT
Single-cell whole-genome sequencing (scWGS) detects cell heterogeneity at the aspect of genomic variations, which are inheritable and play an important role in life processes such as aging and cancer progression. The recent explosive development of high-throughput single-cell sequencing methods has enabled high-performance heterogeneity detection through a vast number of novel strategies. Despite the limitation on total cost, technical advances in high-throughput single-cell whole-genome sequencing methods are made for higher genome coverage, parallel throughput, and level of integration. This review highlights the technical advancements in high-throughput scWGS in the aspects of strategies design, data efficiency, parallel handling platforms, and their applications on human genome. The experimental innovations, remaining challenges, and perspectives are summarized and discussed.
ABSTRACT
The design and generation of an optimal protein expression construct is the first and essential step in the characterization of any protein of interest. However, the exchange and modification of the coding and/or noncoding elements to analyze their effect on protein function or generating the optimal result can be a tedious and time-consuming process using standard molecular biology cloning methods. To streamline the process to generate defined expression constructs or libraries of otherwise difficult to express proteins, the Modular Protein Expression Toolbox (MoPET) has been developed (Weber E, PloS One 12(5):e0176314, 2017). The system applies Golden Gate cloning as an assembly method and follows the standardized modular cloning (MoClo) principle (Weber E, PloS One 6(2):e16765, 2011). This cloning platform allows highly efficient DNA assembly of pre-defined, standardized functional DNA modules effecting protein expression with a focus on minimizing the cloning burden in coding regions. The original MoPET system consists of 53 defined DNA modules divided into eight functional main classes and can be flexibly expanded dependent on the need of the experimenter and expression host. However, already with a limited set of only 53 modules, 792,000 different constructs can be rationally designed or used to generate combinatorial expression optimization libraries. We provide here a detailed protocol for the (1) design and generation of level 0 basic parts, (2) generation of defined expressions constructs, and (3) generation of combinatorial expression libraries.
Subject(s)
DNA , Mammals , Animals , Open Reading FramesABSTRACT
The unique palindromic inverted terminal repeats (ITRs) and single-stranded nature of adeno-associated virus (AAV) DNA are major hurdles to current sequencing technologies. Due to these characteristics, sequencing noncanonical AAV genomes present in AAV vector preparations remains challenging. To address this limitation, we developed thorough molecule configuration analysis of noncanonical AAV genomes (TMCA-AAV-seq). TMCA-AAV-seq takes advantage of the documented AAV packaging mechanism in which encapsidation initiates from its 3' ITR, for AAV-seq library construction. Any AAV genome with a 3' ITR is converted to a template suitable to adapter addition by a Bst DNA polymerase-mediated extension reaction. This extension reaction helps fix ITR heterogeneity in the AAV population and allows efficient adapter addition to even noncanonical AAV genomes. The resulting library maintains the original AAV genome configurations without introducing undesired changes. Subsequently, long-read sequencing can be performed by the Pacific Biosciences (PacBio) single-molecule, real-time (SMRT) sequencing technology platform. Finally, through comprehensive data analysis, we can recover canonical, noncanonical AAV DNA, and non-AAV vector DNA sequences, along with their molecular configurations. Our method is a robust tool for profiling thorough AAV-population genomes. TMCA-AAVseq can be further extended to all parvoviruses and their derivative vectors.
ABSTRACT
Enzymatic methyl-seq (EM-seq), an enzyme-based method, identifies genome-wide DNA methylation, which enables us to obtain reliable methylome data from purified genomic DNA by avoiding bisulfite-induced DNA damage. However, the loss of DNA during purification hinders the methylome analysis of limited samples. The crude DNA extraction method is the quickest and minimal sample loss approach for obtaining useable DNA without requiring additional dissolution and purification. However, it remains unclear whether crude DNA can be used directly for EM-seq library construction. In this study, we aimed to assess the quality of EM-seq libraries prepared directly using crude DNA. The crude DNA-derived libraries provided appropriate fragment sizes and concentrations for sequencing similar to those of the purified DNA-derived libraries. However, the sequencing results of crude samples exhibited lower reference sequence mapping efficiencies than those of the purified samples. Additionally, the lower-input crude DNA-derived sample exhibited a marginally lower cytosine-to-thymine conversion efficiency and hypermethylated pattern around gene regulatory elements than the higher-input crude DNA- or purified DNA-derived samples. In contrast, the methylation profiles of the crude and purified samples exhibited a significant correlation. Our findings indicate that crude DNA can be used as a raw material for EM-seq library construction.
Subject(s)
DNA Methylation , DNA , Gene Library , Base Sequence , DNA/genetics , DNA/analysis , Cloning, Molecular , Sequence Analysis, DNA/methods , High-Throughput Nucleotide Sequencing/methods , SulfitesABSTRACT
Protein display systems are powerful techniques used to identify protein molecules that bind with high affinity to target proteins of interest. The initial challenge in implementing a display system is the construction of a high-diversity naïve library. Here, we describe the methods to generate a designed ankyrin repeat protein (DARPin) display library using degenerate oligonucleotides. Specifically described is the construction of a single DARPin repeat module by overlap extension PCR, concatenation of the module by restriction enzyme digestion and ligation, and incorporation of the concatenated modules into a full-length DARPin sequence in a bacterial cloning or display vector containing the hydrophilic N- and C-terminal capping domains. Protocols for PCR amplification of DARPin sequences to estimate diversity of naïve and enriched libraries via next-generation sequencing are included, as is a simple Linux-based program for analysis of naïve and enriched sequences. © 2024 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Generation of a single DARPin repeat by overlap extension PCR Basic Protocol 2: Concatenation of DARPin repeats Basic Protocol 3: Ligation of internal repeats into cloning/display vector containing N- and C-terminal capping repeats Basic Protocol 4: Estimation of library size and diversity by next-generation sequencing (NGS) Basic Protocol 5: NGS analysis of naïve and enriched libraries.
Subject(s)
Designed Ankyrin Repeat Proteins , Gastrointestinal Agents , Gene Library , DNA Restriction Enzymes , High-Throughput Nucleotide SequencingABSTRACT
Yeast surface display (YSD) is a technology that fuses the exogenous target protein gene sequence with a specific vector gene sequence, followed by introduction into yeast cells. Subsequently, the target protein is expressed and localized on the yeast cell surface by using the intracellular protein transport mechanism of yeast cells, whereas the most widely used YSD system is the α-agglutinin expression system. Yeast cells possess the eukaryotic post-translational modification mechanism, which helps the target protein fold correctly. This mechanism could be used to display various eukaryotic proteins, including antibodies, receptors, enzymes, and antigenic peptides. YSD has become a powerful protein engineering tool in biotechnology and biomedicine, and has been used to improve a broad range of protein properties including affinity, specificity, enzymatic function, and stability. This review summarized recent advances in the application of YSD technology from the aspects of library construction and screening, antibody engineering, protein engineering, enzyme engineering and vaccine development.
Subject(s)
Protein Engineering , Saccharomyces cerevisiae , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Biotechnology , Antibodies/metabolism , Amino Acid SequenceABSTRACT
As central players in cellular structure and function, proteins have long been central themes in life science research. Analyzing the impact of protein sequence variation on its structure and function is one of the important means to study proteins. In recent years, a technology called deep mutational scanning (DMS) has been widely used in the field of protein research. It introduces thousands of mutations in parallel in specific regions of proteins through high-abundance DNA libraries. After screening, high-throughput sequencing is employed to score each mutation, revealing sequence-function correlations. Due to its high-throughput, fast and easy, and labor-saving features, DMS has become an important method for protein function research and protein engineering. This review briefly summarizes the principle of DMS technology, highlighting its applications in mammalian cells. Moreover, this review analyzes the current technical bottlenecks, aiming to facilitate relevant research.
Subject(s)
Protein Engineering , Proteins , Animals , Mutation , Proteins/genetics , Proteins/chemistry , High-Throughput Nucleotide Sequencing/methods , Mammals/geneticsABSTRACT
Naturally occurring heavy chain antibodies (HCAbs) in Camelidae species were a surprise discovery in 1993 by Hamers et al. Since that time, antibody fragments derived from HCAbs have garnered considerable attention by researchers and biotechnology companies. Due to their biophysico-chemical advantages over conventional antibody fragments, camelid single-domain antibodies (sdAbs, VHHs, nanobodies) are being increasingly utilized as viable immunotherapeutic modalities. Currently there are multiple VHH-based therapeutic agents in different phases of clinical trials in various formats such as bi- and multivalent, bi- and multi-specific, CAR-T, and antibody-drug conjugates. The first approved VHH, a bivalent humanized VHH (caplacizumab), was approved for treating rare blood clotting disorders in 2018 by the EMA and the FDA in 2019. This was followed by the approval of an anti-BCMA VHH-based CAR-T cell product in 2022 (ciltacabtagene autoleucel; CARVYKTI™) and more recently a trivalent antitumor necrosis factor alpha-based VHH drug (ozoralizumab; Nanozora®) in Japan for the treatment of rheumatoid arthritis. In this chapter we provide protocols describing the latest developments in isolating antigen-specific VHHs including llama immunization, construction of phage-displayed libraries, phage panning and screening of the soluble VHHs by ELISA, affinity measurements by surface plasmon resonance, functional cell binding by flow cytometry, and additional validation by immunoprecipitation. We present and discuss comprehensive, step-by-step methods for isolating and characterization of antigen-specific VHHs. This includes protocols for expression, biotinylation, purification, and characterization of the isolated VHHs. To demonstrate the feasibility of the entire strategy, we present examples of VHHs previously isolated and characterized in our laboratory.
Subject(s)
Arthritis, Rheumatoid , Bacteriophages , Blood Group Antigens , Camelids, New World , Single-Domain Antibodies , Animals , Antibodies, Monoclonal , Bacteriophages/genetics , Biotechnology , Camelidae , Factor VABSTRACT
An important contributor to the successful generation of recombinant affinity reagents via phage display is a large and diverse library. We describe, herein, the application of Kunkel mutagenesis and rolling circle amplification (RCA) to the construction of a 1.1 × 1011 member library, with only 26 electroporations, and isolation of low- to sub-nanomolar monobodies to a number of protein targets, including human COP9 signalosome subunit 5 (COPS5), HIV-1 Rev. binding protein-like protein (HRBL), X-ray repair cross-complementing 5/6 (Ku70/80) heterodimer, the receptor-binding domain (RBD) of SARS-CoV-2, and transforming growth factor beta 1 (TGF-ß1).
Subject(s)
Bacteriophages , COVID-19 , Humans , SARS-CoV-2 , Gene Library , MutagenesisABSTRACT
Liquid biopsy has obvious advantages over invasive sampling methods. Cell-free DNA is now widely used in prenatal and cancer diagnosis. Cell-free RNA is relatively unstable but could contain potential biomarkers, and some studies have demonstrated that cfRNA sequencing may be developed to be a promising diagnostic assay for disease detection. In this chapter, we introduce a modified protocol for the detection of cfRNA based on next-generation sequencing (NGS). This protocol has been specifically optimized for low-input, fragmented cfRNA samples.
Subject(s)
Cell-Free Nucleic Acids , Pregnancy , Female , Humans , Cell-Free Nucleic Acids/genetics , Liquid Biopsy/methods , Biomarkers , High-Throughput Nucleotide Sequencing , Sequence Analysis, RNA , RNA/genetics , Biomarkers, Tumor/geneticsABSTRACT
The human microbiome plays a crucial role in human health. In the past decade, advances in high-throughput sequencing technologies and analytical software have significantly improved our knowledge of the human microbiome. However, most studies concerning the human microbiome did not provide repeatable protocols to guide the sample collection, handling, and processing procedures, which impedes obtaining valid and timely microbial taxonomic and functional results. This protocol provides detailed operation methods of human microbial sample collection, DNA extraction, and library construction for both the amplicon sequencing-based measurements of the microbial samples from the human nasal cavity, oral cavity, and skin, as well as the shotgun metagenomic sequencing-based measurements of the human stool samples among adult participants. This study intends to develop practical procedure standards to improve the reproducibility of microbiota profiling of human samples. Supplementary Information: The online version contains supplementary material available at 10.1007/s43657-023-00097-y.
ABSTRACT
Currently, bioorthogonal coupling reactions have garnered considerable interest due to their high substrate selectivity and less restrictive reaction conditions. During recent decades, bioorthogonal coupling reactions have emerged as powerful tools in drug development. This review describes the current applications of bioorthogonal coupling reactions in compound library building mediated by the copper-catalyzed azide-alkyne cycloaddition (CuAAC) reaction and in situ click chemistry or conjunction with other techniques; druggability optimization with 1,2,3-triazole groups; and intracellular self-assembly platforms with ring tension reactions, which are presented from the viewpoint of drug development. There is a reasonable prospect that bioorthogonal coupling reactions will accelerate the screening of lead compounds, the designing strategies of small molecules and expand the variety of designed compounds, which will be a new trend in drug development in the future.
Subject(s)
Copper , Drug Development , Copper/chemistry , Cycloaddition Reaction , Azides/chemistry , Click Chemistry/methods , Alkynes/chemistry , CatalysisABSTRACT
OBJECTIVE: Comprehensive and reliable genome-wide variant analysis of a small number of cells has been challenging due to genome coverage bias, PCR over-cycling, and the requirement of expensive technologies. To comprehensively identify genome alterations in single colon crypts that reflect genome heterogeneity of stem cells, we developed a method to construct whole-genome sequencing libraries from single colon crypts without DNA extraction, whole-genome amplification, or increased PCR enrichment cycles. RESULTS: We present post-alignment statistics of 81 single-crypts (each contains four- to eight-fold less DNA than the requirement of conventional methods) and 16 bulk-tissue libraries to demonstrate the consistent success in obtaining reliable coverage, both in depth (≥ 30X) and breadth (≥ 92% of the genome covered at ≥ 10X depth), of the human genome. These single-crypt libraries are of comparable quality as libraries generated with the conventional method using high quality and quantities of purified DNA. Conceivably, our method can be applied to small biopsy samples from many tissues and can be combined with single cell targeted sequencing to comprehensively profile cancer genomes and their evolution. The broad potential application of this method offers expanded possibilities in cost-effectively examining genome heterogeneity in small numbers of cells at high resolution.
Subject(s)
DNA , Nucleic Acid Amplification Techniques , Humans , Whole Genome Sequencing , Sequence Analysis, DNA/methods , Polymerase Chain Reaction , High-Throughput Nucleotide Sequencing/methodsABSTRACT
The emergence of antibiotic-resistant-bacteria is a serious public health threat, which prompts us to speed up the discovery of novel antibacterial agents. Phage display technology has great potential to screen peptides or antibodies with high binding capacities for a wide range of targets. This property is significant in the rapid search for new antibacterial agents for the control of bacterial resistance. In this paper, we not only summarized the recent progress of phage display for the discovery of novel therapeutic agents, identification of action sites of bacterial target proteins, and rapid detection of different pathogens, but also discussed several problems of this technology that must be solved. Breakthrough in these problems may further promote the development and application of phage display technology in the biomedical field in the future.
Subject(s)
Bacterial Infections , Bacteriophages , Communicable Diseases , Humans , Peptides/therapeutic use , Peptides/chemistry , Bacterial Infections/diagnosis , Bacterial Infections/drug therapy , Bacterial Infections/microbiology , Antibodies/therapeutic use , Anti-Bacterial Agents/therapeutic use , Bacterial Proteins , Communicable Diseases/diagnosis , Communicable Diseases/drug therapy , Peptide LibraryABSTRACT
Next-generation sequencing (NGS) of small RNA (sRNA) cDNA libraries permits the identification and characterization of sRNA species de novo. However, the method through which these libraries are constructed can often introduce artifacts such as over- or underrepresentation of specific sequences or adapter oligonucleotides due to sequence biases held by the enzymes used. In this chapter we describe a protocol for sRNA library construction making use of high-definition (HD) adapters for the Illumina sequencing platform, which reduce ligation bias. This protocol leads to drastically reduced direct 5'/3' adapter ligation products and can be used for the synthesis of sRNA libraries from total RNA or sRNA of various plant, animal, and fungal samples. This protocol also includes a method for total RNA extraction from plant leaf and cultured cells or body fluids.
Subject(s)
RNA, Small Untranslated , RNA , Animals , Gene Library , Oligonucleotides , High-Throughput Nucleotide Sequencing/methods , Cloning, Molecular , Sequence Analysis, RNA/methods , RNA, Small Untranslated/geneticsABSTRACT
Adaptive laboratory evolution (ALE) is a technique for the selection of strains with better phenotypes by long-term culture under a specific selection pressure or growth environment. Because ALE does not require detailed knowledge of a variety of complex and interactive metabolic networks, and only needs to simulate natural environmental conditions in the laboratory to design a selection pressure, it has the advantages of broad adaptability, strong practicability, and more convenient transformation of strains. In addition, ALE provides a powerful method for studying the evolutionary forces that change the phenotype, performance, and stability of strains, resulting in more productive industrial strains with beneficial mutations. In recent years, ALE has been widely used in the activation of specific microbial metabolic pathways and phenotypic optimization, the efficient utilization of specific substrates, the optimization of tolerance to toxic substance, and the biosynthesis of target products, which is more conducive to the production of industrial strains with excellent phenotypic characteristics. In this paper, typical examples of ALE applications in the development of industrial strains and the research progress of this technology are reviewed, followed by a discussion of its development prospects.
Subject(s)
Metabolic Engineering , Metabolic Networks and Pathways , Metabolic Engineering/methods , MutationABSTRACT
Yeast surface display (YSD) is a technology that fuses the exogenous target protein gene sequence with a specific vector gene sequence, followed by introduction into yeast cells. Subsequently, the target protein is expressed and localized on the yeast cell surface by using the intracellular protein transport mechanism of yeast cells, whereas the most widely used YSD system is the α-agglutinin expression system. Yeast cells possess the eukaryotic post-translational modification mechanism, which helps the target protein fold correctly. This mechanism could be used to display various eukaryotic proteins, including antibodies, receptors, enzymes, and antigenic peptides. YSD has become a powerful protein engineering tool in biotechnology and biomedicine, and has been used to improve a broad range of protein properties including affinity, specificity, enzymatic function, and stability. This review summarized recent advances in the application of YSD technology from the aspects of library construction and screening, antibody engineering, protein engineering, enzyme engineering and vaccine development.
Subject(s)
Saccharomyces cerevisiae/metabolism , Protein Engineering , Biotechnology , Antibodies/metabolism , Amino Acid SequenceABSTRACT
As central players in cellular structure and function, proteins have long been central themes in life science research. Analyzing the impact of protein sequence variation on its structure and function is one of the important means to study proteins. In recent years, a technology called deep mutational scanning (DMS) has been widely used in the field of protein research. It introduces thousands of mutations in parallel in specific regions of proteins through high-abundance DNA libraries. After screening, high-throughput sequencing is employed to score each mutation, revealing sequence-function correlations. Due to its high-throughput, fast and easy, and labor-saving features, DMS has become an important method for protein function research and protein engineering. This review briefly summarizes the principle of DMS technology, highlighting its applications in mammalian cells. Moreover, this review analyzes the current technical bottlenecks, aiming to facilitate relevant research.
Subject(s)
Animals , Mutation , Proteins/chemistry , Protein Engineering , High-Throughput Nucleotide Sequencing/methods , Mammals/geneticsABSTRACT
ABBREVIATIONS: CDR: complementarity determining region; FACS: fluorescence-activated cell sorting; ka: association rate; kd: dissociation rate; KD: dissociation constant; scFv: single-chain variable fragment; SPR: surface plasmon resonance.
Subject(s)
Single-Chain Antibodies , Antibody Affinity , Complementarity Determining Regions , Surface Plasmon ResonanceABSTRACT
Protein engineering through directed evolutison is facilitated by the screening and characterization of protein libraries. Efficient and effective methods for multiple site-saturation mutagenesis, such as Darwin Assembly, can accelerate the sampling of relevant sequence space and the identification of variants with desired functionalities. Here, we present the automation of the Darwin Assembly method, using a Gilson PIPETMAX™ liquid handling platform under the control of the Antha software platform, which resulted in the accelerated construction of complex, multiplexed gene libraries error-free and with minimal hands-on time, while maintaining flexibility over experimental parameters through a graphical user interface rather than requiring user-driven library-dependent programming of the liquid handling platform. We also present an approach for barcoding libraries that overcomes amplicon length limitations in next generation sequencing and enables fast reconstruction of library reads.