Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 18 de 18
Filter
Add more filters










Publication year range
1.
Nat Microbiol ; 5(5): 746-756, 2020 05.
Article in English | MEDLINE | ID: mdl-32152589

ABSTRACT

A complex microbiota inhabits various microenvironments of the gut, with some symbiotic bacteria having evolved traits to invade the epithelial mucus layer and reside deep within the intestinal tissue of animals. Whether these distinct bacterial communities across gut biogeographies exhibit divergent behaviours is largely unknown. Global transcriptomic analysis to investigate microbial physiology in specific mucosal niches has been hampered technically by an overabundance of host RNA. Here, we employed hybrid selection RNA sequencing (hsRNA-Seq) to enable detailed spatial transcriptomic profiling of a prominent human commensal as it colonizes the colonic lumen, mucus or epithelial tissue of mice. Compared to conventional RNA-Seq, hsRNA-Seq increased reads mapping to the Bacteroides fragilis genome by 48- and 154-fold in mucus and tissue, respectively, allowing for high-fidelity comparisons across biogeographic sites. Near the epithelium, B. fragilis upregulated numerous genes involved in protein synthesis, indicating that bacteria inhabiting the mucosal niche are metabolically active. Further, a specific sulfatase (BF3086) and glycosyl hydrolase (BF3134) were highly induced in mucus and tissue compared to bacteria in the lumen. In-frame deletion of these genes impaired in vitro growth on mucus as a carbon source, as well as mucosal colonization of mice. Mutants in either B. fragilis gene displayed a fitness defect in competing for colonization against bacterial challenge, revealing the importance of site-specific gene expression for robust host-microbial symbiosis. As a versatile tool, hsRNA-Seq can be deployed to explore the in vivo spatial physiology of numerous bacterial pathogens or commensals.


Subject(s)
Bacteroides fragilis/genetics , Bacteroides fragilis/physiology , Colon/microbiology , Animals , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Bacteroides fragilis/growth & development , Colitis/microbiology , Female , Gene Expression Regulation, Bacterial , Germ-Free Life , Humans , Intestinal Mucosa/microbiology , Male , Mice , Sulfonic Acids , Symbiosis , Transcriptome
2.
J Virol ; 92(2)2018 01 15.
Article in English | MEDLINE | ID: mdl-29093087

ABSTRACT

Over 90% of the world's population is persistently infected with Epstein-Barr virus. While EBV does not cause disease in most individuals, it is the common cause of acute infectious mononucleosis (AIM) and has been associated with several cancers and autoimmune diseases, highlighting a need for a preventive vaccine. At present, very few primary, circulating EBV genomes have been sequenced directly from infected individuals. While low levels of diversity and low viral evolution rates have been predicted for double-stranded DNA (dsDNA) viruses, recent studies have demonstrated appreciable diversity in common dsDNA pathogens (e.g., cytomegalovirus). Here, we report 40 full-length EBV genome sequences obtained from matched oral wash and B cell fractions from a cohort of 10 AIM patients. Both intra- and interpatient diversity were observed across the length of the entire viral genome. Diversity was most pronounced in viral genes required for establishing latent infection and persistence, with appreciable levels of diversity also detected in structural genes, including envelope glycoproteins. Interestingly, intrapatient diversity declined significantly over time (P < 0.01), and this was particularly evident on comparison of viral genomes sequenced from B cell fractions in early primary infection and convalescence (P < 0.001). B cell-associated viral genomes were observed to converge, becoming nearly identical to the B95.8 reference genome over time (Spearman rank-order correlation test; r = -0.5589, P = 0.0264). The reduction in diversity was most marked in the EBV latency genes. In summary, our data suggest independent convergence of diverse viral genome sequences toward a reference-like strain within a relatively short period following primary EBV infection.IMPORTANCE Identification of viral proteins with low variability and high immunogenicity is important for the development of a protective vaccine. Knowledge of genome diversity within circulating viral populations is a key step in this process, as is the expansion of intrahost genomic variation during infection. We report full-length EBV genomes sequenced from the blood and oral wash of 10 individuals early in primary infection and during convalescence. Our data demonstrate considerable diversity within the pool of circulating EBV strains, as well as within individual patients. Overall viral diversity decreased from early to persistent infection, particularly in latently infected B cells, which serve as the viral reservoir. Reduction in B cell-associated viral genome diversity coincided with a convergence toward a reference-like EBV genotype. Greater convergence positively correlated with time after infection, suggesting that the reference-like genome is the result of selection.


Subject(s)
Epstein-Barr Virus Infections/virology , Genetic Variation , Genome, Viral , Herpesvirus 4, Human/genetics , Computational Biology/methods , Genomics/methods , Genotype , Herpesvirus 4, Human/classification , High-Throughput Nucleotide Sequencing , Humans , Open Reading Frames , Phylogeny
3.
Genome Biol ; 18(1): 78, 2017 04 28.
Article in English | MEDLINE | ID: mdl-28454557

ABSTRACT

BACKGROUND: Artemisinin-based combination therapies are the first line of treatment for Plasmodium falciparum infections worldwide, but artemisinin resistance has risen rapidly in Southeast Asia over the past decade. Mutations in the kelch13 gene have been implicated in this resistance. We used longitudinal genomic surveillance to detect signals in kelch13 and other loci that contribute to artemisinin or partner drug resistance. We retrospectively sequenced the genomes of 194 P. falciparum isolates from five sites in Northwest Thailand, over the period of a rapid increase in the emergence of artemisinin resistance (2001-2014). RESULTS: We evaluate statistical metrics for temporal change in the frequency of individual SNPs, assuming that SNPs associated with resistance increase in frequency over this period. After Kelch13-C580Y, the strongest temporal change is seen at a SNP in phosphatidylinositol 4-kinase, which is involved in a pathway recently implicated in artemisinin resistance. Furthermore, other loci exhibit strong temporal signatures which warrant further investigation for involvement in artemisinin resistance evolution. Through genome-wide association analysis we identify a variant in a kelch domain-containing gene on chromosome 10 that may epistatically modulate artemisinin resistance. CONCLUSIONS: This analysis demonstrates the potential of a longitudinal genomic surveillance approach to detect resistance-associated gene loci to improve our mechanistic understanding of how resistance develops. Evidence for additional genomic regions outside of the kelch13 locus associated with artemisinin-resistant parasites may yield new molecular markers for resistance surveillance, which may be useful in efforts to reduce the emergence or spread of artemisinin resistance in African parasite populations.


Subject(s)
Antimalarials/pharmacology , Artemisinins/pharmacology , Drug Resistance/genetics , Genome, Protozoan , Plasmodium falciparum/genetics , Kelch Repeat , Plasmodium falciparum/drug effects , Polymorphism, Single Nucleotide , Protozoan Proteins/chemistry , Protozoan Proteins/genetics , Protozoan Proteins/metabolism
4.
Proc Natl Acad Sci U S A ; 114(7): E1291-E1300, 2017 02 14.
Article in English | MEDLINE | ID: mdl-28137873

ABSTRACT

Enhancers regulate gene expression through the binding of sequence-specific transcription factors (TFs) to cognate motifs. Various features influence TF binding and enhancer function-including the chromatin state of the genomic locus, the affinities of the binding site, the activity of the bound TFs, and interactions among TFs. However, the precise nature and relative contributions of these features remain unclear. Here, we used massively parallel reporter assays (MPRAs) involving 32,115 natural and synthetic enhancers, together with high-throughput in vivo binding assays, to systematically dissect the contribution of each of these features to the binding and activity of genomic regulatory elements that contain motifs for PPARγ, a TF that serves as a key regulator of adipogenesis. We show that distinct sets of features govern PPARγ binding vs. enhancer activity. PPARγ binding is largely governed by the affinity of the specific motif site and higher-order features of the larger genomic locus, such as chromatin accessibility. In contrast, the enhancer activity of PPARγ binding sites depends on varying contributions from dozens of TFs in the immediate vicinity, including interactions between combinations of these TFs. Different pairs of motifs follow different interaction rules, including subadditive, additive, and superadditive interactions among specific classes of TFs, with both spatially constrained and flexible grammars. Our results provide a paradigm for the systematic characterization of the genomic features underlying regulatory elements, applicable to the design of synthetic regulatory elements or the interpretation of human genetic variation.


Subject(s)
Enhancer Elements, Genetic/genetics , Gene Expression Regulation , Genomics/methods , Transcription Factors/metabolism , 3T3-L1 Cells , Animals , Binding Sites/genetics , Mice , Mutation , Nucleotide Motifs/genetics , PPAR gamma/metabolism , Protein Binding
5.
Nat Biotechnol ; 34(11): 1180-1190, 2016 Nov.
Article in English | MEDLINE | ID: mdl-27701403

ABSTRACT

Massively parallel reporter assays (MPRAs) enable nucleotide-resolution dissection of transcriptional regulatory regions, such as enhancers, but only few regions at a time. Here we present a combined experimental and computational approach, Systematic high-resolution activation and repression profiling with reporter tiling using MPRA (Sharpr-MPRA), that allows high-resolution analysis of thousands of regions simultaneously. Sharpr-MPRA combines dense tiling of overlapping MPRA constructs with a probabilistic graphical model to recognize functional regulatory nucleotides, and to distinguish activating and repressive nucleotides, using their inferred contribution to reporter gene expression. We used Sharpr-MPRA to test 4.6 million nucleotides spanning 15,000 putative regulatory regions tiled at 5-nucleotide resolution in two human cell types. Our results recovered known cell-type-specific regulatory motifs and evolutionarily conserved nucleotides, and distinguished known activating and repressive motifs. Our results also showed that endogenous chromatin state and DNA accessibility are both predictive of regulatory function in reporter assays, identified retroviral elements with activating roles, and uncovered 'attenuator' motifs with repressive roles in active chromatin.


Subject(s)
Chromosome Mapping/methods , Epigenetic Repression/genetics , Genome, Human/genetics , High-Throughput Nucleotide Sequencing/methods , Nucleotides/genetics , Regulatory Sequences, Nucleic Acid/genetics , Conserved Sequence/genetics , Humans , Reproducibility of Results , Sensitivity and Specificity
6.
Cell ; 165(6): 1530-1545, 2016 Jun 02.
Article in English | MEDLINE | ID: mdl-27259154

ABSTRACT

Genome-wide association studies (GWAS) have successfully identified thousands of associations between common genetic variants and human disease phenotypes, but the majority of these variants are non-coding, often requiring genetic fine-mapping, epigenomic profiling, and individual reporter assays to delineate potential causal variants. We employ a massively parallel reporter assay (MPRA) to simultaneously screen 2,756 variants in strong linkage disequilibrium with 75 sentinel variants associated with red blood cell traits. We show that this assay identifies elements with endogenous erythroid regulatory activity. Across 23 sentinel variants, we conservatively identified 32 MPRA functional variants (MFVs). We used targeted genome editing to demonstrate endogenous enhancer activity across 3 MFVs that predominantly affect the transcription of SMIM1, RBM38, and CD164. Functional follow-up of RBM38 delineates a key role for this gene in the alternative splicing program occurring during terminal erythropoiesis. Finally, we provide evidence for how common GWAS-nominated variants can disrupt cell-type-specific transcriptional regulatory pathways.


Subject(s)
Erythrocytes , Genetic Techniques , Genetic Variation , Alternative Splicing , Cell Line , Cell Lineage/genetics , Erythropoiesis/genetics , Gene Library , Genes, Reporter , Humans , Regulatory Sequences, Nucleic Acid , Transcription, Genetic
7.
Nat Genet ; 48(8): 953-8, 2016 08.
Article in English | MEDLINE | ID: mdl-27348298

ABSTRACT

Plasmodium vivax is a major public health burden, responsible for the majority of malaria infections outside Africa. We explored the impact of demographic history and selective pressures on the P. vivax genome by sequencing 182 clinical isolates sampled from 11 countries across the globe, using hybrid selection to overcome human DNA contamination. We confirmed previous reports of high genomic diversity in P. vivax relative to the more virulent Plasmodium falciparum species; regional populations of P. vivax exhibited greater diversity than the global P. falciparum population, indicating a large and/or stable population. Signals of natural selection suggest that P. vivax is evolving in response to antimalarial drugs and is adapting to regional differences in the human host and the mosquito vector. These findings underline the variable epidemiology of this parasite species and highlight the breadth of approaches that may be required to eliminate P. vivax globally.


Subject(s)
Drug Resistance/genetics , Genetic Markers/genetics , Malaria, Vivax/parasitology , Metagenomics/methods , Plasmodium vivax/genetics , Selection, Genetic/genetics , Transcriptome/genetics , Antimalarials/pharmacology , Humans , Malaria, Vivax/drug therapy , Malaria, Vivax/genetics , Plasmodium vivax/drug effects , Plasmodium vivax/pathogenicity , Selection, Genetic/drug effects
8.
Nat Commun ; 7: 10740, 2016 Feb 22.
Article in English | MEDLINE | ID: mdl-26899007

ABSTRACT

Pneumocystis jirovecii is a major cause of life-threatening pneumonia in immunosuppressed patients including transplant recipients and those with HIV/AIDS, yet surprisingly little is known about the biology of this fungal pathogen. Here we report near complete genome assemblies for three Pneumocystis species that infect humans, rats and mice. Pneumocystis genomes are highly compact relative to other fungi, with substantial reductions of ribosomal RNA genes, transporters, transcription factors and many metabolic pathways, but contain expansions of surface proteins, especially a unique and complex surface glycoprotein superfamily, as well as proteases and RNA processing proteins. Unexpectedly, the key fungal cell wall components chitin and outer chain N-mannans are absent, based on genome content and experimental validation. Our findings suggest that Pneumocystis has developed unique mechanisms of adaptation to life exclusively in mammalian hosts, including dependence on the lungs for gas and nutrients and highly efficient strategies to escape both host innate and acquired immune defenses.


Subject(s)
Adaptation, Biological , Genome, Fungal , Host-Pathogen Interactions/genetics , Pneumocystis carinii/genetics , Animals , Cell Wall/metabolism , Humans , Lung/microbiology , Metabolic Networks and Pathways/genetics , Mice , Multigene Family , Pneumocystis carinii/metabolism , Rats , Synteny
9.
Proc Natl Acad Sci U S A ; 112(47): E6456-65, 2015 Nov 24.
Article in English | MEDLINE | ID: mdl-26499245

ABSTRACT

We recently used in situ Hi-C to create kilobase-resolution 3D maps of mammalian genomes. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. We find that the observed contact domains are inconsistent with the equilibrium state for an ordinary condensed polymer. Combining Hi-C data and novel mathematical theorems, we show that contact domains are also not consistent with a fractal globule. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during polymer condensation leads to formation of an anisotropic "tension globule." In the other, CCCTC-binding factor (CTCF) and cohesin act together to extrude unknotted loops during interphase. Both models are consistent with the observed contact domains and with the observation that contact domains tend to form inside loops. However, the extrusion model explains a far wider array of observations, such as why loops tend not to overlap and why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The convergent rule correctly predicts the affected loops in every case. Moreover, the extrusion model accurately predicts in silico the 3D maps resulting from each experiment using only the location of CTCF-binding sites in the WT. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair.


Subject(s)
Chromatin/chemistry , Chromatin/genetics , Genetic Engineering , Genome/genetics , Nucleic Acid Conformation , Anisotropy , Base Pairing , CCCTC-Binding Factor , Cell Cycle Proteins/metabolism , Chromosomal Proteins, Non-Histone/metabolism , Computer Simulation , Diffusion , Fractals , Humans , In Situ Hybridization, Fluorescence , Models, Molecular , Nucleotide Motifs/genetics , Polymers/chemistry , Probability , Repressor Proteins/metabolism , Cohesins
10.
PLoS Negl Trop Dis ; 9(3): e0003539, 2015 Mar.
Article in English | MEDLINE | ID: mdl-25781890

ABSTRACT

Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/µl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.


Subject(s)
DNA Barcoding, Taxonomic/methods , DNA, Protozoan/genetics , Malaria, Vivax/parasitology , Plasmodium vivax/isolation & purification , Africa/epidemiology , Asia/epidemiology , Base Sequence , Chromosome Mapping , Genetic Markers/genetics , Humans , Malaria, Vivax/epidemiology , Plasmodium falciparum/classification , Plasmodium falciparum/genetics , Plasmodium falciparum/isolation & purification , Plasmodium vivax/classification , Plasmodium vivax/genetics , Polymorphism, Single Nucleotide , South America/epidemiology
11.
J Vis Exp ; (90)2014 Aug 17.
Article in English | MEDLINE | ID: mdl-25177895

ABSTRACT

The genetic reporter assay is a well-established and powerful tool for dissecting the relationship between DNA sequences and their gene regulatory activities. The potential throughput of this assay has, however, been limited by the need to individually clone and assay the activity of each sequence on interest using protein fluorescence or enzymatic activity as a proxy for regulatory activity. Advances in high-throughput DNA synthesis and sequencing technologies have recently made it possible to overcome these limitations by multiplexing the construction and interrogation of large libraries of reporter constructs. This protocol describes implementation of a Massively Parallel Reporter Assay (MPRA) that allows direct comparison of hundreds of thousands of putative regulatory sequences in a single cell culture dish.


Subject(s)
Sequence Analysis, DNA/methods , Cell Culture Techniques , Genes, Reporter , HEK293 Cells , Humans , Oligonucleotide Probes , Polymerase Chain Reaction/methods , Promoter Regions, Genetic , Regulatory Sequences, Nucleic Acid
12.
Nucleic Acids Res ; 42(14): e112, 2014 Aug.
Article in English | MEDLINE | ID: mdl-24914046

ABSTRACT

Deep mutational scanning has emerged as a promising tool for mapping sequence-activity relationships in proteins, ribonucleic acid and deoxyribonucleic acid. In this approach, diverse variants of a sequence of interest are first ranked according to their activities in a relevant assay, and this ranking is then used to infer the shape of the fitness landscape around the wild-type sequence. Little is currently known, however, about the degree to which such fitness landscapes are dependent on the specific assay conditions from which they are inferred. To explore this issue, we performed comprehensive single-substitution mutational scanning of APH(3')II, a Tn5 transposon-derived kinase that confers resistance to aminoglycoside antibiotics, in Escherichia coli under selection with each of six structurally diverse antibiotics at a range of inhibitory concentrations. We found that the resulting local fitness landscapes showed significant dependence on both antibiotic structure and concentration, and that this dependence can be exploited to guide protein engineering. Specifically, we found that differential analysis of fitness landscapes allowed us to generate synthetic APH(3')II variants with orthogonal substrate specificities.


Subject(s)
Kanamycin Kinase/genetics , Mutagenesis, Site-Directed/methods , Amino Acid Substitution , Aminoglycosides/pharmacology , Anti-Bacterial Agents/pharmacology , Escherichia coli/drug effects , Kanamycin Kinase/chemistry , Kanamycin Kinase/metabolism , Mutation , Substrate Specificity
13.
Genome Res ; 23(5): 800-11, 2013 May.
Article in English | MEDLINE | ID: mdl-23512712

ABSTRACT

Genome-wide chromatin annotations have permitted the mapping of putative regulatory elements across multiple human cell types. However, their experimental dissection by directed regulatory motif disruption has remained unfeasible at the genome scale. Here, we use a massively parallel reporter assay (MPRA) to measure the transcriptional levels induced by 145-bp DNA segments centered on evolutionarily conserved regulatory motif instances within enhancer chromatin states. We select five predicted activators (HNF1, HNF4, FOXA, GATA, NFE2L2) and two predicted repressors (GFI1, ZFP161) and measure reporter expression in erythroleukemia (K562) and liver carcinoma (HepG2) cell lines. We test 2104 wild-type sequences and 3314 engineered enhancer variants containing targeted motif disruptions, each using 10 barcode tags and two replicates. The resulting data strongly confirm the enhancer activity and cell-type specificity of enhancer chromatin states, the ability of 145-bp segments to recapitulate both, the necessary role of regulatory motifs in enhancer function, and the complementary roles of activator and repressor motifs. We find statistically robust evidence that (1) disrupting the predicted activator motifs abolishes enhancer function, while silent or motif-improving changes maintain enhancer activity; (2) evolutionary conservation, nucleosome exclusion, binding of other factors, and strength of the motif match are predictive of enhancer activity; (3) scrambling repressor motifs leads to aberrant reporter expression in cell lines where the enhancers are usually inactive. Our results suggest a general strategy for deciphering cis-regulatory elements by systematic large-scale manipulation and provide quantitative enhancer activity measurements across thousands of constructs that can be mined to develop predictive models of gene expression.


Subject(s)
Chromatin/genetics , Enhancer Elements, Genetic , Nucleotide Motifs/genetics , Transcription, Genetic , Base Sequence , Binding Sites , Cells/classification , Cells/metabolism , Chromosome Mapping , Conserved Sequence , Gene Expression Regulation , Genes, Reporter , Genome, Human , Hep G2 Cells , Humans , Promoter Regions, Genetic
14.
Nat Biotechnol ; 30(3): 271-7, 2012 Feb 26.
Article in English | MEDLINE | ID: mdl-22371084

ABSTRACT

Learning to read and write the transcriptional regulatory code is of central importance to progress in genetic analysis and engineering. Here we describe a massively parallel reporter assay (MPRA) that facilitates the systematic dissection of transcriptional regulatory elements. In MPRA, microarray-synthesized DNA regulatory elements and unique sequence tags are cloned into plasmids to generate a library of reporter constructs. These constructs are transfected into cells and tag expression is assayed by high-throughput sequencing. We apply MPRA to compare >27,000 variants of two inducible enhancers in human cells: a synthetic cAMP-regulated enhancer and the virus-inducible interferon-ß enhancer. We first show that the resulting data define accurate maps of functional transcription factor binding sites in both enhancers at single-nucleotide resolution. We then use the data to train quantitative sequence-activity models (QSAMs) of the two enhancers. We show that QSAMs from two cellular states can be combined to design enhancer variants that optimize potentially conflicting objectives, such as maximizing induced activity while minimizing basal activity.


Subject(s)
Biological Assay/methods , Enhancer Elements, Genetic , Genes, Reporter , Transcription Factors/genetics , Base Sequence , Binding Sites , Humans , Models, Genetic , Molecular Sequence Data , Mutagenesis , Sequence Alignment , Transcription Factors/metabolism , Transcription, Genetic
15.
Genome Biol ; 12(8): R73, 2011 Aug 11.
Article in English | MEDLINE | ID: mdl-21835008

ABSTRACT

We have adapted a solution hybrid selection protocol to enrich pathogen DNA in clinical samples dominated by human genetic material. Using mock mixtures of human and Plasmodium falciparum malaria parasite DNA as well as clinical samples from infected patients, we demonstrate an average of approximately 40-fold enrichment of parasite DNA after hybrid selection. This approach will enable efficient genome sequencing of pathogens from clinical samples, as well as sequencing of endosymbiotic organisms such as Wolbachia that live inside diverse metazoan phyla.


Subject(s)
DNA, Protozoan/genetics , Genome, Protozoan , Malaria, Falciparum/parasitology , Plasmodium falciparum/genetics , Sequence Analysis, DNA/methods , Chromosome Mapping/methods , Humans , Nucleic Acid Hybridization/methods
16.
Curr Protoc Hum Genet ; Chapter 18: Unit 18.4, 2010 Jul.
Article in English | MEDLINE | ID: mdl-20582916

ABSTRACT

This unit describes a protocol for the targeted enrichment of exons from randomly sheared genomic DNA libraries using an in-solution hybrid selection approach for sequencing on an Illumina Genome Analyzer II. The steps for designing and ordering a hybrid selection oligo pool are reviewed, as are critical steps for performing the preparation and hybrid selection of an Illumina paired-end library. Critical parameters, performance metrics, and analysis workflow are discussed.


Subject(s)
Exons/genetics , Nucleic Acid Hybridization/methods , Sequence Analysis, DNA/methods , Humans , Solutions
17.
Genome Biol ; 10(10): R115, 2009.
Article in English | MEDLINE | ID: mdl-19835606

ABSTRACT

Targeted RNA-Seq combines next-generation sequencing with capture of sequences from a relevant subset of a transcriptome. When testing by capturing sequences from a tumor cDNA library by hybridization to oligonucleotide probes specific for 467 cancer-related genes, this method showed high selectivity, improved mutation detection enabling discovery of novel chimeric transcripts, and provided RNA expression data. Thus, targeted RNA-Seq produces an enhanced view of the molecular state of a set of "high interest" genes.


Subject(s)
Gene Expression Profiling/methods , Mutation/genetics , Neoplasms/genetics , Oncogene Proteins, Fusion/genetics , Sequence Analysis, DNA/methods , Amino Acid Sequence , Base Sequence , DNA, Complementary/genetics , Gene Expression Regulation, Neoplastic/genetics , Humans , K562 Cells , Molecular Sequence Data , Oncogene Proteins, Fusion/chemistry , Oncogene Proteins, Fusion/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism
18.
Nat Biotechnol ; 27(2): 182-9, 2009 Feb.
Article in English | MEDLINE | ID: mdl-19182786

ABSTRACT

Targeting genomic loci by massively parallel sequencing requires new methods to enrich templates to be sequenced. We developed a capture method that uses biotinylated RNA 'baits' to fish targets out of a 'pond' of DNA fragments. The RNA is transcribed from PCR-amplified oligodeoxynucleotides originally synthesized on a microarray, generating sufficient bait for multiple captures at concentrations high enough to drive the hybridization. We tested this method with 170-mer baits that target >15,000 coding exons (2.5 Mb) and four regions (1.7 Mb total) using Illumina sequencing as read-out. About 90% of uniquely aligning bases fell on or near bait sequence; up to 50% lay on exons proper. The uniformity was such that approximately 60% of target bases in the exonic 'catch', and approximately 80% in the regional catch, had at least half the mean coverage. One lane of Illumina sequence was sufficient to call high-confidence genotypes for 89% of the targeted exon space.


Subject(s)
Genomics/methods , Oligonucleotides/metabolism , Sequence Analysis, DNA/methods , Base Composition/genetics , Bayes Theorem , Biotinylation , Exons/genetics , Humans , Nucleic Acid Hybridization , Oligonucleotide Array Sequence Analysis , Polymerase Chain Reaction , Polymorphism, Single Nucleotide/genetics , Reproducibility of Results , Sensitivity and Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...