Search | VHL Regional Portal

High-intensity sequencing reveals the sources of plasma circulating cell-free DNA variants.

Razavi, Pedram; Li, Bob T; Brown, David N; Jung, Byoungsok; Hubbell, Earl; Shen, Ronglai; Abida, Wassim; Juluru, Krishna; De Bruijn, Ino; Hou, Chenlu; Venn, Oliver; Lim, Raymond; Anand, Aseem; Maddala, Tara; Gnerre, Sante; Vijaya Satya, Ravi; Liu, Qinwen; Shen, Ling; Eattock, Nicholas; Yue, Jeanne; Blocker, Alexander W; Lee, Mark; Sehnert, Amy; Xu, Hui; Hall, Megan P; Santiago-Zayas, Angie; Novotny, William F; Isbell, James M; Rusch, Valerie W; Plitas, George; Heerdt, Alexandra S; Ladanyi, Marc; Hyman, David M; Jones, David R; Morrow, Monica; Riely, Gregory J; Scher, Howard I; Rudin, Charles M; Robson, Mark E; Diaz, Luis A; Solit, David B; Aravanis, Alexander M; Reis-Filho, Jorge S.

Nat Med ; 25(12): 1928-1937, 2019 12.

Article in English | MEDLINE | ID: mdl-31768066

ABSTRACT

Accurate identification of tumor-derived somatic variants in plasma circulating cell-free DNA (cfDNA) requires understanding of the various biological compartments contributing to the cfDNA pool. We sought to define the technical feasibility of a high-intensity sequencing assay of cfDNA and matched white blood cell DNA covering a large genomic region (508 genes; 2 megabases; >60,000× raw depth) in a prospective study of 124 patients with metastatic cancer, with contemporaneous matched tumor tissue biopsies, and 47 controls without cancer. The assay displayed high sensitivity and specificity, allowing for de novo detection of tumor-derived mutations and inference of tumor mutational burden, microsatellite instability, mutational signatures and sources of somatic mutations identified in cfDNA. The vast majority of cfDNA mutations (81.6% in controls and 53.2% in patients with cancer) had features consistent with clonal hematopoiesis. This cfDNA sequencing approach revealed that clonal hematopoiesis constitutes a pervasive biological phenomenon, emphasizing the importance of matched cfDNA-white blood cell sequencing for accurate variant interpretation.

Subject(s)

Cell-Free Nucleic Acids/blood , Circulating Tumor DNA/blood , Genomics , Neoplasms/blood , Adult , Biomarkers, Tumor/blood , Circulating Tumor DNA/genetics , DNA Mutational Analysis , DNA, Neoplasm/blood , Female , Gene Expression Regulation, Neoplastic , High-Throughput Nucleotide Sequencing , Humans , Male , Microsatellite Instability , Middle Aged , Mutation , Neoplasms/genetics , Neoplasms/pathology

Reducing amplification artifacts in high multiplex amplicon sequencing by using molecular barcodes.

Peng, Quan; Vijaya Satya, Ravi; Lewis, Marcus; Randad, Pranay; Wang, Yexun.

BMC Genomics ; 16: 589, 2015 Aug 07.

Article in English | MEDLINE | ID: mdl-26248467

ABSTRACT

BACKGROUND: PCR amplicon sequencing has been widely used as a targeted approach for both DNA and RNA sequence analysis. High multiplex PCR has further enabled the enrichment of hundreds of amplicons in one simple reaction. At the same time, the performance of PCR amplicon sequencing can be negatively affected by issues such as high duplicate reads, polymerase artifacts and PCR amplification bias. Recently researchers have made some good progress in addressing these shortcomings by incorporating molecular barcodes into PCR primer design. So far, most work has been demonstrated using one to a few pairs of primers, which limits the size of the region one can analyze. RESULTS: We developed a simple protocol, which enables the use of molecular barcodes in high multiplex PCR with hundreds of amplicons. Using this protocol and reference materials, we demonstrated the applications in accurate variant calling at very low fraction over a large region and in targeted RNA quantification. We also evaluated the protocol's utility in profiling FFPE samples. CONCLUSIONS: We demonstrated the successful implementation of molecular barcodes in high multiplex PCR, with multiplex scale many times higher than earlier work. We showed that the new protocol combines the benefits of both high multiplex PCR and molecular barcodes, i.e. the analysis of a very large region, low DNA input requirement, very good reproducibility and the ability to detect as low as 1% mutations with minimal false positives (FP).

Subject(s)

DNA Barcoding, Taxonomic/methods , High-Throughput Nucleotide Sequencing/methods , Multiplex Polymerase Chain Reaction/methods , Artifacts , DNA Primers/genetics , Humans , RNA/genetics , Reproducibility of Results , Sequence Analysis/methods

Thermodynamic basis for the emergence of genomes during prebiotic evolution.

Woo, Hyung-June; Vijaya Satya, Ravi; Reifman, Jaques.

PLoS Comput Biol ; 8(5): e1002534, 2012 May.

Article in English | MEDLINE | ID: mdl-22693440

ABSTRACT

The RNA world hypothesis views modern organisms as descendants of RNA molecules. The earliest RNA molecules must have been random sequences, from which the first genomes that coded for polymerase ribozymes emerged. The quasispecies theory by Eigen predicts the existence of an error threshold limiting genomic stability during such transitions, but does not address the spontaneity of changes. Following a recent theoretical approach, we applied the quasispecies theory combined with kinetic/thermodynamic descriptions of RNA replication to analyze the collective behavior of RNA replicators based on known experimental kinetics data. We find that, with increasing fidelity (relative rate of base-extension for Watson-Crick versus mismatched base pairs), replications without enzymes, with ribozymes, and with protein-based polymerases are above, near, and below a critical point, respectively. The prebiotic evolution therefore must have crossed this critical region. Over large regions of the phase diagram, fitness increases with increasing fidelity, biasing random drifts in sequence space toward 'crystallization.' This region encloses the experimental nonenzymatic fidelity value, favoring evolutions toward polymerase sequences with ever higher fidelity, despite error rates above the error catastrophe threshold. Our work shows that experimentally characterized kinetics and thermodynamics of RNA replication allow us to determine the physicochemical conditions required for the spontaneous crystallization of biological information. Our findings also suggest that among many potential oligomers capable of templated replication, RNAs may have evolved to form prebiotic genomes due to the value of their nonenzymatic fidelity.

Subject(s)

Evolution, Molecular , Genome , Origin of Life , RNA/metabolism , DNA-Directed DNA Polymerase/metabolism , Models, Genetic , RNA, Catalytic/metabolism , Stochastic Processes , Thermodynamics

SNIT: SNP identification for strain typing.

Vijaya Satya, Ravi; Zavaljevski, Nela; Reifman, Jaques.

Source Code Biol Med ; 6: 14, 2011 Sep 08.

Article in English | MEDLINE | ID: mdl-21902825

ABSTRACT

With ever-increasing numbers of microbial genomes being sequenced, efficient tools are needed to perform strain-level identification of any newly sequenced genome. Here, we present the SNP identification for strain typing (SNIT) pipeline, a fast and accurate software system that compares a newly sequenced bacterial genome with other genomes of the same species to identify single nucleotide polymorphisms (SNPs) and small insertions/deletions (indels). Based on this information, the pipeline analyzes the polymorphic loci present in all input genomes to identify the genome that has the fewest differences with the newly sequenced genome. Similarly, for each of the other genomes, SNIT identifies the input genome with the fewest differences. Results from five bacterial species show that the SNIT pipeline identifies the correct closest neighbor with 75% to 100% accuracy. The SNIT pipeline is available for download at http://www.bhsai.org/snit.html.

A high-throughput pipeline for the design of real-time PCR signatures.

Vijaya Satya, Ravi; Kumar, Kamal; Zavaljevski, Nela; Reifman, Jaques.

BMC Bioinformatics ; 11: 340, 2010 Jun 23.

Article in English | MEDLINE | ID: mdl-20573238

ABSTRACT

BACKGROUND: Pathogen diagnostic assays based on polymerase chain reaction (PCR) technology provide high sensitivity and specificity. However, the design of these diagnostic assays is computationally intensive, requiring high-throughput methods to identify unique PCR signatures in the presence of an ever increasing availability of sequenced genomes. RESULTS: We present the Tool for PCR Signature Identification (TOPSI), a high-performance computing pipeline for the design of PCR-based pathogen diagnostic assays. The TOPSI pipeline efficiently designs PCR signatures common to multiple bacterial genomes by obtaining the shared regions through pairwise alignments between the input genomes. TOPSI successfully designed PCR signatures common to 18 Staphylococcus aureus genomes in less than 14 hours using 98 cores on a high-performance computing system. CONCLUSIONS: TOPSI is a computationally efficient, fully integrated tool for high-throughput design of PCR signatures common to multiple bacterial genomes. TOPSI is freely available for download at http://www.bhsai.org/downloads/topsi.tar.gz.

Subject(s)

Computational Biology/methods , Genome, Bacterial , Polymerase Chain Reaction/methods , Staphylococcus aureus/genetics , Base Sequence , Burkholderia mallei/genetics , Burkholderia pseudomallei/genetics , Chromosome Mapping , Sensitivity and Specificity , Staphylococcus aureus/classification

In silico microarray probe design for diagnosis of multiple pathogens.

Vijaya Satya, Ravi; Zavaljevski, Nela; Kumar, Kamal; Bode, Elizabeth; Padilla, Susana; Wasieloski, Leonard; Geyer, Jeanne; Reifman, Jaques.

BMC Genomics ; 9: 496, 2008 Oct 21.

Article in English | MEDLINE | ID: mdl-18940003

ABSTRACT

BACKGROUND: With multiple strains of various pathogens being sequenced, it is necessary to develop high-throughput methods that can simultaneously process multiple bacterial or viral genomes to find common fingerprints as well as fingerprints that are unique to each individual genome. We present algorithmic enhancements to an existing single-genome pipeline that allows for efficient design of microarray probes common to groups of target genomes. The enhanced pipeline takes advantage of the similarities in the input genomes to narrow the search to short, nonredundant regions of the target genomes and, thereby, significantly reduces the computation time. The pipeline also computes a three-state hybridization matrix, which gives the expected hybridization of each probe with each target. RESULTS: Design of microarray probes for eight pathogenic Burkholderia genomes shows that the multiple-genome pipeline is nearly four-times faster than the single-genome pipeline for this application. The probes designed for these eight genomes were experimentally tested with one non-target and three target genomes. Hybridization experiments show that less than 10% of the designed probes cross hybridize with non-targets. Also, more than 65% of the probes designed to identify all Burkholderia mallei and B. pseudomallei strains successfully hybridize with a B. pseudomallei strain not used for probe design. CONCLUSION: The savings in runtime suggest that the enhanced pipeline can be used to design fingerprints for tens or even hundreds of related genomes in a single run. Hybridization results with an unsequenced B. pseudomallei strain indicate that the designed probes might be useful in identifying unsequenced strains of B. mallei and B. pseudomallei.

Subject(s)

Burkholderia/genetics , DNA Fingerprinting/methods , DNA Probes , Genome, Bacterial , Oligonucleotide Array Sequence Analysis/methods , Algorithms , Bacterial Typing Techniques , Burkholderia/classification , Computational Biology , DNA, Bacterial/genetics , Sensitivity and Specificity , Sequence Analysis, DNA

A high-throughput pipeline for designing microarray-based pathogen diagnostic assays.

Vijaya Satya, Ravi; Zavaljevski, Nela; Kumar, Kamal; Reifman, Jaques.

BMC Bioinformatics ; 9: 185, 2008 Apr 10.

Article in English | MEDLINE | ID: mdl-18402679

ABSTRACT

BACKGROUND: We present a methodology for high-throughput design of oligonucleotide fingerprints for microarray-based pathogen diagnostic assays. The oligonucleotide fingerprints, or DNA microarray probes, are designed for identifying target organisms in environmental or clinical samples. The design process is implemented in a high-performance computing software pipeline that incorporates major algorithmic improvements over a previous version to both reduce computation time and improve specificity assessment. RESULTS: The algorithmic improvements result in significant reduction in runtimes, with the updated pipeline being nearly up to five-times faster than the previous version. The improvements in specificity assessment, based on multiple specificity criteria, result in robust and consistent evaluation of cross-hybridization with nontarget sequences. In addition, the multiple criteria provide finer control on the number of resulting fingerprints, which helps in obtaining a larger number of fingerprints with high specificity. Simulation tests for Francisella tularensis and Yersinia pestis, using a well-established hybridization model to estimate cross-hybridization with nontarget sequences, show that the improved specificity criteria yield a larger number of fingerprints as compared to using a single specificity criterion. CONCLUSION: The faster runtimes, achieved as the result of algorithmic improvements, are critical for extending the pipeline to process multiple target genomes. The larger numbers of identified fingerprints, obtained by considering broader specificity criteria, are essential for designing probes for hard-to-distinguish target sequences.

Subject(s)

Bacteria/genetics , Bacteria/isolation & purification , DNA Fingerprinting/methods , DNA Probes/genetics , DNA, Bacterial/genetics , Genomic Islands/genetics , Oligonucleotide Array Sequence Analysis/methods , Algorithms , Chromosome Mapping/methods

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL