Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
BMC Bioinformatics ; 16: 191, 2015 Jun 10.
Article in English | MEDLINE | ID: mdl-26056047

ABSTRACT

BACKGROUND: Forward Time Population Genetic Simulations offer a flexible framework for modeling the various evolutionary processes occurring in nature. Often this model expressibility is countered by an increased memory usage or computational overhead. With the complexity of simulation scenarios continuing to increase, addressing the scalability of the underlying simulation framework is a growing consideration. RESULTS: We propose a general method for representing in silico genetic sequences using implicit data structures. We provide a generalized implementation as a C++ template library called Clotho. We compare the performance and scalability of our approach with those taken in other simulation frameworks, namely: FWDPP and simuPOP. CONCLUSIONS: We show that this technique offers a 4x reduction in memory utilization. Additionally, with larger scale simulation scenarios we are able to offer a speedup of 6x-46x.


Subject(s)
Algorithms , Genetics, Population/methods , Software , Biological Evolution , Computer Simulation , Humans , Mutation Rate , Time Factors
2.
BMC Bioinformatics ; 14: 369, 2013 Dec 21.
Article in English | MEDLINE | ID: mdl-24359123

ABSTRACT

BACKGROUND: In recent years genetic data analysis has seen a rapid increase in the scale of data to be analyzed. Schadt et al (NRG 11:647-657, 2010) offered that with data sets approaching the petabyte scale, data related challenges such as formatting, management, and transfer are increasingly important topics which need to be addressed. The use of succinct data structures is one method of reducing physical size of a data set without the use of expensive compression techniques. In this work, we consider the use of 2- and 3-bit encoding schemes for genotype data. We compare the computational performance of allele or genotype counting algorithms utilizing genotype data encoded in both schemes. RESULTS: We perform a comparison of 2- and 3-bit genotype encoding schemes for use in genotype counting algorithms. We find that there is a 20% overhead when building simple frequency tables from 2-bit encoded genotypes. However, building pairwise count tables for genome-wide epistasis is 1.0% more efficient. CONCLUSIONS: In this work, we were concerned with comparing the performance benefits and disadvantages of using more densely packed genotype data representations in Genome Wide Associations Studies (GWAS). We implemented a 2-bit encoding for genotype data, and compared it against a more commonly used 3-bit encoding scheme. We also developed a C++ library, libgwaspp, which offers these data structures, and implementations of several common GWAS algorithms. In general, the 2-bit encoding consumes less memory, and is slightly more efficient in some algorithms than the 3-bit encoding.


Subject(s)
Genome-Wide Association Study/methods , Algorithms , Data Compression , Genome, Human , Genotype , Humans
3.
PLoS One ; 6(7): e21979, 2011.
Article in English | MEDLINE | ID: mdl-21760938

ABSTRACT

Using a small scale ENU mutagenesis approach we identified a recessive germline mutant, designated Lampe1 that exhibited growth retardation and spontaneous hepatosteatosis. Low resolution mapping based on 20 intercrossed Lampe1 mice revealed linkage to a ∼14 Mb interval on the distal site of chromosome 11 containing a total of 285 genes. Exons and 50 bp flanking sequences within the critical region were enriched with sequence capture microarrays and subsequently analyzed by next-generation sequencing. Using this approach 98.1 percent of the targeted DNA was covered with a depth of 10 or more reads per nucleotide and 3 homozygote mutations were identified. Two mutations represented intronic nucleotide changes whereas one mutation affected a splice donor site in intron 11-12 of Palmitoyl Acetyl-coenzyme A oxygenase-1 (Acox1), causing skipping of exon 12. Phenotyping of Acox1(Lampe1) mutants revealed a progression from hepatosteatosis to steatohepatitis, and ultimately hepatocellular carcinoma. The current approach provides a highly efficient and affordable method to identify causative mutations induced by ENU mutagenesis and animal models relevant to human pathology.


Subject(s)
Ethylnitrosourea/metabolism , Exons/genetics , Fatty Liver/genetics , Gene Targeting , Germ-Line Mutation/genetics , Sequence Analysis, DNA/methods , Animals , Fatty Liver/complications , Fatty Liver/pathology , Humans , Lipid Metabolism , Liver/abnormalities , Liver/growth & development , Liver/pathology , Liver/ultrastructure , Liver Neoplasms/complications , Liver Neoplasms/pathology , Mice , Mice, Inbred C57BL , Mice, Mutant Strains , Phenotype , RNA Splice Sites/genetics , Sequence Deletion/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...