Search | VHL Regional Portal

LUMI-PCR: an Illumina platform ligation-mediated PCR protocol for integration site cloning, provides molecular quantitation of integration sites.

Dawes, Joanna C; Webster, Philip; Iadarola, Barbara; Garcia-Diaz, Claudia; Dore, Marian; Bolt, Bruce J; Dewchand, Hamlata; Dharmalingam, Gopuraja; McLatchie, Alex P; Kaczor, Jakub; Caceres, Juan J; Paccanaro, Alberto; Game, Laurence; Parrinello, Simona; Uren, Anthony G.

Mob DNA ; 11: 7, 2020.

Article in English | MEDLINE | ID: mdl-32042315

ABSTRACT

BACKGROUND: Ligation-mediated PCR protocols have diverse uses including the identification of integration sites of insertional mutagens, integrating vectors and naturally occurring mobile genetic elements. For approaches that employ NGS sequencing, the relative abundance of integrations within a complex mixture is typically determined through the use of read counts or unique fragment lengths from a ligation of sheared DNA; however, these estimates may be skewed by PCR amplification biases and saturation of sequencing coverage. RESULTS: Here we describe a modification of our previous splinkerette based ligation-mediated PCR using a novel Illumina-compatible adapter design that prevents amplification of non-target DNA and incorporates unique molecular identifiers. This design reduces the number of PCR cycles required and improves relative quantitation of integration abundance for saturating sequencing coverage. By inverting the forked adapter strands from a standard orientation, the integration-genome junction can be sequenced without affecting the sequence diversity required for cluster generation on the flow cell. Replicate libraries of murine leukemia virus-infected spleen samples yielded highly reproducible quantitation of clonal integrations as well as a deep coverage of subclonal integrations. A dilution series of DNAs bearing integrations of MuLV or piggyBac transposon shows linearity of the quantitation over a range of concentrations. CONCLUSIONS: Merging ligation and library generation steps can reduce total PCR amplification cycles without sacrificing coverage or fidelity. The protocol is robust enough for use in a 96 well format using an automated liquid handler and we include programs for use of a Beckman Biomek liquid handling workstation. We also include an informatics pipeline that maps reads, builds integration contigs and quantitates integration abundance using both fragment lengths and unique molecular identifiers. Suggestions for optimizing the protocol to other target DNA sequences are included. The reproducible distinction of clonal and subclonal integration sites from each other allows for analysis of populations of cells undergoing selection, such as those found in insertional mutagenesis screens.

Author Correction: Subclonal mutation selection in mouse lymphomagenesis identifies known cancer loci and suggests novel candidates.

Webster, Philip; Dawes, Joanna C; Dewchand, Hamlata; Takacs, Katalin; Iadarola, Barbara; Bolt, Bruce J; Caceres, Juan J; Kaczor, Jakub; Dharmalingam, Gopuraja; Dore, Marian; Game, Laurence; Adejumo, Thomas; Elliott, James; Naresh, Kikkeri; Karimi, Mohammad; Rekopoulou, Katerina; Tan, Ge; Paccanaro, Alberto; Uren, Anthony G.

Nat Commun ; 10(1): 1167, 2019 03 06.

Article in English | MEDLINE | ID: mdl-30842421

ABSTRACT

The original version of this Article contained an error in the hyperlink for the online repository http://mulvdb.org which was incorrectly given as http://mulv.lms.mrc.ac.uk. This has been corrected in both the PDF and HTML versions of the Article.

Subclonal mutation selection in mouse lymphomagenesis identifies known cancer loci and suggests novel candidates.

Nat Commun ; 9(1): 2649, 2018 07 09.

Article in English | MEDLINE | ID: mdl-29985390

ABSTRACT

Determining whether recurrent but rare cancer mutations are bona fide driver mutations remains a bottleneck in cancer research. Here we present the most comprehensive analysis of murine leukemia virus-driven lymphomagenesis produced to date, sequencing 700,000 mutations from >500 malignancies collected at time points throughout tumor development. This scale of data allows novel statistical approaches for identifying selected mutations and yields a high-resolution, genome-wide map of the selective forces surrounding cancer gene loci. We also demonstrate negative selection of mutations that may be deleterious to tumor development indicating novel avenues for therapy. Screening of two BCL2 transgenic models confirmed known drivers of human non-Hodgkin lymphoma, and implicates novel candidates including modifiers of immunosurveillance and MHC loci. Correlating mutations with genotypic and phenotypic features independently of local variance in mutation density also provides support for weakly evidenced cancer genes. An online resource http://mulv.lms.mrc.ac.uk allows customized queries of the entire dataset.

Subject(s)

Genetic Loci/genetics , Genetic Predisposition to Disease/genetics , Lymphoma/genetics , Mutation , Animals , Genetic Association Studies , Genome-Wide Association Study , HEK293 Cells , High-Throughput Nucleotide Sequencing , Humans , Leukemia Virus, Murine/genetics , Leukemia Virus, Murine/physiology , Mice, Inbred BALB C , Mice, Inbred C57BL , Mice, Transgenic , Mutagenesis, Insertional

Using WormBase ParaSite: An Integrated Platform for Exploring Helminth Genomic Data.

Bolt, Bruce J; Rodgers, Faye H; Shafie, Myriam; Kersey, Paul J; Berriman, Matthew; Howe, Kevin L.

Methods Mol Biol ; 1757: 471-491, 2018.

Article in English | MEDLINE | ID: mdl-29761467

ABSTRACT

WormBase ParaSite ( parasite.wormbase.org ) is a comprehensive resource for the genomes of parasitic nematodes and flatworms (helminths). It currently includes genomic data for over 100 helminth species, adding value by way of consistent functional annotation, gene comparative analysis and gene expression analysis. We provide several ways of exploring the data including a choice of genome browsers, genome and gene summary pages, text and sequence searching, a query wizard, bulk downloads, and programmatic interfaces. WormBase ParaSite is released three to six times per year, and is developed in collaboration with WormBase ( www.wormbase.org ) and Ensembl Genomes ( www.ensemblgenomes.org ).

Subject(s)

Computational Biology , Databases, Genetic , Genome, Helminth , Genomics , Computational Biology/methods , Epistasis, Genetic , Gene Expression Profiling , Gene Ontology , Helminthiasis/parasitology , Phenotype , Software , Transcriptome , Web Browser

Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

Kersey, Paul Julian; Allen, James E; Allot, Alexis; Barba, Matthieu; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Grabmueller, Christoph; Kumar, Navin; Liu, Zicheng; Maurel, Thomas; Moore, Ben; McDowall, Mark D; Maheswari, Uma; Naamati, Guy; Newman, Victoria; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Perry, Emily; Russell, Matthew; Sparrow, Helen; Tapanari, Electra; Taylor, Kieron; Vullo, Alessandro; Williams, Gareth; Zadissia, Amonida; Olson, Andrew; Stein, Joshua; Wei, Sharon; Tello-Ruiz, Marcela; Ware, Doreen; Luciani, Aurelien; Potter, Simon; Finn, Robert D; Urban, Martin; Hammond-Kosack, Kim E; Bolser, Dan M; De Silva, Nishadi; Howe, Kevin L; Langridge, Nicholas; Maslen, Gareth; Staines, Daniel Michael; Yates, Andrew.

Nucleic Acids Res ; 46(D1): D802-D808, 2018 01 04.

Article in English | MEDLINE | ID: mdl-29092050

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.

Subject(s)

Archaea/genetics , Bacteria/genetics , Databases, Genetic , Databases, Protein , Eukaryota/genetics , Genomics , Amino Acid Sequence , Animals , Base Sequence , Data Mining , Forecasting , Genome , Molecular Sequence Annotation , RNA/genetics , User-Computer Interface

WormBase ParaSite - a comprehensive resource for helminth genomics.

Howe, Kevin L; Bolt, Bruce J; Shafie, Myriam; Kersey, Paul; Berriman, Matthew.

Mol Biochem Parasitol ; 215: 2-10, 2017 07.

Article in English | MEDLINE | ID: mdl-27899279

ABSTRACT

The number of publicly available parasitic worm genome sequences has increased dramatically in the past three years, and research interest in helminth functional genomics is now quickly gathering pace in response to the foundation that has been laid by these collective efforts. A systematic approach to the organisation, curation, analysis and presentation of these data is clearly vital for maximising the utility of these data to researchers. We have developed a portal called WormBase ParaSite (http://parasite.wormbase.org) for interrogating helminth genomes on a large scale. Data from over 100 nematode and platyhelminth species are integrated, adding value by way of systematic and consistent functional annotation (e.g. protein domains and Gene Ontology terms), gene expression analysis (e.g. alignment of life-stage specific transcriptome data sets), and comparative analysis (e.g. orthologues and paralogues). We provide several ways of exploring the data, including genome browsers, genome and gene summary pages, text search, sequence search, a query wizard, bulk downloads, and programmatic interfaces. In this review, we provide an overview of the back-end infrastructure and analysis behind WormBase ParaSite, and the displays and tools available to users for interrogating helminth genomic data.

Subject(s)

Databases, Genetic , Genome, Helminth , Genomics/methods , Helminths/genetics , Animals , Computational Biology/methods , Helminths/classification

WormBase 2016: expanding to enable helminth genomic research.

Howe, Kevin L; Bolt, Bruce J; Cain, Scott; Chan, Juancarlos; Chen, Wen J; Davis, Paul; Done, James; Down, Thomas; Gao, Sibyl; Grove, Christian; Harris, Todd W; Kishore, Ranjana; Lee, Raymond; Lomax, Jane; Li, Yuling; Muller, Hans-Michael; Nakamura, Cecilia; Nuin, Paulo; Paulini, Michael; Raciti, Daniela; Schindelman, Gary; Stanley, Eleanor; Tuli, Mary Ann; Van Auken, Kimberly; Wang, Daniel; Wang, Xiaodong; Williams, Gary; Wright, Adam; Yook, Karen; Berriman, Matthew; Kersey, Paul; Schedl, Tim; Stein, Lincoln; Sternberg, Paul W.

Nucleic Acids Res ; 44(D1): D774-80, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26578572

ABSTRACT

WormBase (www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite (parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research.

Subject(s)

Caenorhabditis elegans/genetics , Databases, Genetic , Genome, Helminth , Genomics , Nematoda/genetics , Animals , Genes, Helminth , Molecular Sequence Annotation , Platyhelminths/genetics , Software

Ensembl Genomes 2016: more genomes, more complexity.

Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

Nucleic Acids Res ; 44(D1): D574-80, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26578574

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.

Subject(s)

Databases, Genetic , Genome, Bacterial , Genome, Fungal , Genome, Plant , Invertebrates/genetics , Animals , Diploidy , Eukaryota/genetics , Genetic Variation , Genome , Polyploidy , Sequence Alignment

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL