Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 28(12): i172-8, 2012 Jun 15.
Article in English | MEDLINE | ID: mdl-22689758

ABSTRACT

MOTIVATION: Shotgun sequence read data derived from xenograft material contains a mixture of reads arising from the host and reads arising from the graft. Classifying the read mixture to separate the two allows for more precise analysis to be performed. RESULTS: We present a technique, with an associated tool Xenome, which performs fast, accurate and specific classification of xenograft-derived sequence read data. We have evaluated it on RNA-Seq data from human, mouse and human-in-mouse xenograft datasets. AVAILABILITY: Xenome is available for non-commercial use from http://www.nicta.com.au/bioinformatics.


Subject(s)
Sequence Analysis, RNA/methods , Transplantation, Heterologous/classification , Algorithms , Animals , DNA, Complementary/genetics , Genome/genetics , Humans , Mice , Software
2.
Bioinformatics ; 28(14): 1937-8, 2012 Jul 15.
Article in English | MEDLINE | ID: mdl-22611131

ABSTRACT

MOTIVATION: The de novo assembly of short read high-throughput sequencing data poses significant computational challenges. The volume of data is huge; the reads are tiny compared to the underlying sequence, and there are significant numbers of sequencing errors. There are numerous software packages that allow users to assemble short reads, but most are either limited to relatively small genomes (e.g. bacteria) or require large computing infrastructure or employ greedy algorithms and thus often do not yield high-quality results. RESULTS: We have developed Gossamer, an implementation of the de Bruijn approach to assembly that requires close to the theoretical minimum of memory, but still allows efficient processing. Our results show that it is space efficient and produces high-quality assemblies. AVAILABILITY: Gossamer is available for non-commercial use from http://www.genomics.csse.unimelb.edu.au/product-gossamer.php.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Software , Algorithms , Computational Biology/methods
3.
Bioinformatics ; 27(4): 479-86, 2011 Feb 15.
Article in English | MEDLINE | ID: mdl-21245053

ABSTRACT

MOTIVATION: Second-generation sequencing technology makes it feasible for many researches to obtain enough sequence reads to attempt the de novo assembly of higher eukaryotes (including mammals). De novo assembly not only provides a tool for understanding wide scale biological variation, but within human biomedicine, it offers a direct way of observing both large-scale structural variation and fine-scale sequence variation. Unfortunately, improvements in the computational feasibility for de novo assembly have not matched the improvements in the gathering of sequence data. This is for two reasons: the inherent computational complexity of the problem and the in-practice memory requirements of tools. RESULTS: In this article, we use entropy compressed or succinct data structures to create a practical representation of the de Bruijn assembly graph, which requires at least a factor of 10 less storage than the kinds of structures used by deployed methods. Moreover, because our representation is entropy compressed, in the presence of sequencing errors it has better scaling behaviour asymptotically than conventional approaches. We present results of a proof-of-concept assembly of a human genome performed on a modest commodity server.


Subject(s)
Genomics/methods , Sequence Analysis, DNA/methods , Software , Computational Biology/methods , Genome, Human , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...