Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
2.
Genome Res ; 11(2): 274-80, 2001 Feb.
Article in English | MEDLINE | ID: mdl-11157790

ABSTRACT

Theory is developed for the process of sequencing randomly selected large-insert clones. Genome size, library depth, clone size, and clone distribution are considered relevant properties and perfect overlap detection for contig assembly is assumed. Genome-specific and nonrandom effects are neglected. Order of magnitude analysis indicates library depth is of secondary importance compared to the other variables, especially as clone size diminishes. In such cases, the well-known Poisson coverage law is a good approximation. Parameters derived from these models are used to examine performance for the specific case of sequencing random human BAC clones. We compare coverage and redundancy rates for libraries possessing uniform and nonuniform clone distributions. Results are measured against data from map-based human-chromosome-2 sequencing. We conclude that the map-based approach outperforms random clone sequencing, except early in a project. However, simultaneous use of both strategies can be beneficial if a performance-based estimate for halting random clone sequencing is made. Results further show that the random approach yields maximum effectiveness using nonbiased rather than biased libraries.


Subject(s)
Cloning, Molecular/methods , Sequence Analysis, DNA/methods , Contig Mapping , Genomic Library , Humans , Models, Genetic , Poisson Distribution , Sequence Analysis, DNA/statistics & numerical data
3.
Article in English | MEDLINE | ID: mdl-11970531

ABSTRACT

A general solution for the Couette velocity profile is reported. Taylor's classical one-dimensional profile is shown to be a special case of this solution for configurations whose aspect ratio is large. Numerical evaluation indicates that error between the two profiles is a logarithmic function of the aspect ratio and provides data to estimate when Taylor's profile should be replaced with the present solution.

4.
Genome Res ; 8(9): 975-84, 1998 Sep.
Article in English | MEDLINE | ID: mdl-9750196

ABSTRACT

A software system for transforming fragments from four-color fluorescence-based gel electrophoresis experiments into assembled sequence is described. It has been developed for large-scale processing of all trace data, including shotgun and finishing reads, regardless of clone origin. Design considerations are discussed in detail, as are programming implementation and graphic tools. The importance of input validation, record tracking, and use of base quality values is emphasized. Several quality analysis metrics are proposed and applied to sample results from recently sequenced clones. Such quantities prove to be a valuable aid in evaluating modifications of sequencing protocol. The system is in full production use at both the Genome Sequencing Center and the Sanger Centre, for which combined weekly production is approximately 100, 000 sequencing reads per week.


Subject(s)
Sequence Analysis, DNA/instrumentation , Sequence Analysis, DNA/methods , Software , Automation , Base Sequence , Computational Biology , Computer Graphics , Human Genome Project , Humans , Sequence Analysis, DNA/statistics & numerical data , User-Computer Interface
5.
Genome Res ; 8(3): 175-85, 1998 Mar.
Article in English | MEDLINE | ID: mdl-9521921

ABSTRACT

The availability of massive amounts of DNA sequence information has begun to revolutionize the practice of biology. As a result, current large-scale sequencing output, while impressive, is not adequate to keep pace with growing demand and, in particular, is far short of what will be required to obtain the 3-billion-base human genome sequence by the target date of 2005. To reach this goal, improved automation will be essential, and it is particularly important that human involvement in sequence data processing be significantly reduced or eliminated. Progress in this respect will require both improved accuracy of the data processing software and reliable accuracy measures to reduce the need for human involvement in error correction and make human review more efficient. Here, we describe one step toward that goal: a base-calling program for automated sequencer traces, phred, with improved accuracy. phred appears to be the first base-calling program to achieve a lower error rate than the ABI software, averaging 40%-50% fewer errors in the data sets examined independent of position in read, machine running conditions, or sequencing chemistry.


Subject(s)
Sequence Analysis, DNA , Software/standards , Algorithms , Base Sequence , Human Genome Project , Humans , Reproducibility of Results , Sensitivity and Specificity , Sequence Alignment , Sequence Analysis, DNA/instrumentation , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/standards
SELECTION OF CITATIONS
SEARCH DETAIL
...