Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Gene ; 516(2): 328-34, 2013 Mar 10.
Article in English | MEDLINE | ID: mdl-23274653

ABSTRACT

Using our microsatellite specific genotyping method, we analyzed tandem repeats, which are known to be highly variable with some recognized as biomarkers causative of disease, in over 500 individuals who were exon sequenced in a 1000 Genomes Project pilot study. We were able to genotype over 97% of the microsatellite loci in the targeted regions. A total of 25,115 variations were observed, including repeat length and single nucleotide polymorphisms, corresponding to an average of 45.6 variations per individual and a density of 1.1 variations per kilobase. Standard variant detection did not report 94.2% of the exonic repeat length variations in part because the alignment techniques are not ideal for repetitive regions. Additionally some standard variation detection tools rely on a database of known variations, making them less likely to call repeat length variations as only a small percent of these loci (~6000) have been accurately characterized. A subset of the hundreds of non-synonymous variations we identified was experimentally validated, indicating an accuracy of 96.5% for our microsatellite-based genotyping method, with some novel variants identified in genes associated with cancer. We propose that microsatellite-based genotyping be used as a part of large scale sequencing studies to identify novel variants.


Subject(s)
Exons/genetics , Genetic Variation , Microsatellite Repeats/genetics , Base Sequence , Genetic Variation/physiology , Genetics, Population , Genome, Human/genetics , Genotype , Humans , Molecular Sequence Data , Pilot Projects , Polymorphism, Single Nucleotide/physiology , Sequence Analysis, DNA , Validation Studies as Topic
2.
Genomics ; 97(4): 193-9, 2011 Apr.
Article in English | MEDLINE | ID: mdl-21223998

ABSTRACT

We performed an analysis of global microsatellite variation on the two kindreds sequenced at high depth (~20×-60×) in the 1000 Genomes Project pilot studies because alterations in these highly mutable repetitive sequences have been linked with many phenotypes and disease risks. The standard alignment technique performs poorly in microsatellite regions as a consequence of low effective coverage (~1×-5×) resulting in 79% of the informative loci exhibiting non-Mendelian inheritance patterns. We used a more stringent approach in computing robust allelotypes resulting in 94.4% of the 1095 informative repeats conforming to traditional inheritance. The high-confidence allelotypes were analyzed to obtain an estimate of the minimum polymorphism rate as a function of motif length, motif sequence, and distribution within the genome.


Subject(s)
Genome, Human/genetics , Microsatellite Repeats/genetics , Female , Genetic Variation , Humans , Male , Pedigree , Pilot Projects , Sequence Alignment/methods , Sequence Analysis, DNA/standards
3.
Genome Biol ; 2(10): REVIEWS2002, 2001.
Article in English | MEDLINE | ID: mdl-11597340

ABSTRACT

Searching for similarities between biological sequences is the principal means by which bioinformatics contributes to our understanding of biology. Of the various informatics tools developed to accomplish this task, the most widely used is BLAST, the basic local alignment search tool. This article discusses the principles, workings, applications and potential pitfalls of BLAST, focusing on the implementation developed at the National Center for Biotechnology Information.


Subject(s)
Computational Biology/methods , Sequence Alignment/methods , Algorithms , Animals , Humans , Internet , Mutagenesis, Insertional , Sensitivity and Specificity , Sequence Analysis , Sequence Deletion , Sequence Homology , Time Factors
4.
Am J Hum Genet ; 67(2): 345-56, 2000 Aug.
Article in English | MEDLINE | ID: mdl-10889045

ABSTRACT

We have developed an algorithm that predicted 11,265 potentially polymorphic tandem repeats within transcribed sequences. We estimate that 22% (2,207/9,717) of the annotated clusters within UniGene contain at least one potentially polymorphic locus. Our predictions were tested by allelotyping a panel of approximately 30 individuals for 5% of these regions, confirming polymorphism for more than half the loci tested. Our study indicates that tandem-repeat polymorphisms in genes are more common than is generally believed. Approximately 8% of these loci are within coding sequences and, if polymorphic, would result in frameshifts. Our catalogue of putative polymorphic repeats within transcribed sequences comprises a large set of potentially phenotypic or disease-causing loci. In addition, from the anomalous character of the repetitive sequences within unannotated clusters, we also conclude that the UniGene cluster count substantially overestimates the number of genes in the human genome. We hypothesize that polymorphisms in repeated sequences occur with some baseline distribution, on the basis of repeat homogeneity, size, and sequence composition, and that deviations from that distribution are indicative of the nature of selection pressure at that locus. We find evidence of selective maintenance of the ability of some genes to respond very rapidly, perhaps even on intragenerational timescales, to fluctuating selective pressures.


Subject(s)
Evolution, Molecular , Genes , Polymorphism, Genetic/genetics , Repetitive Sequences, Amino Acid/genetics , Repetitive Sequences, Nucleic Acid/genetics , Algorithms , Alleles , Genome, Human , Genotype , Humans , Molecular Sequence Data , Phenotype , Sensitivity and Specificity , Software
5.
Cancer Res ; 60(7): 1949-60, 2000 Apr 01.
Article in English | MEDLINE | ID: mdl-10766185

ABSTRACT

Allele loss involving chromosome arm 3p is one of the most frequent and earliest known genetic events in lung cancer pathogenesis and may affect several potential tumor suppressor gene regions. To further study the role of chromosome 3p allele loss in the pathogenesis of lung cancer, we performed high resolution loss of heterozygosity (LOH) studies on 97 lung cancer and 54 preneoplastic/preinvasive microdissected respiratory epithelial samples using a panel of 28 3p markers. Allelic losses of 3p were detected in 96% of the lung cancers and in 78% of the preneoplastic/preinvasive lesions. The allele losses were often multiple and discontinuous, with areas of LOH interspersed with areas of retention of heterozygosity. Most small cell lung carcinomas (91%) and squamous cell carcinomas (95%) demonstrated larger 3p segments of allele loss, whereas most (71%) of the adenocarcinomas and preneoplastic/preinvasive lesions had smaller chromosome areas of 3p allele loss. There was a progressive increase in the frequency and size of 3p allele loss regions with increasing severity of histopathological preneoplastic/preinvasive changes. In analyses of the specific parental allele lost comparing 42 preneoplastic/preinvasive foci with those lost in the lung cancer in the same patient (n = 10), the same parental allele was lost in 88% of 244 comparisons for 28 3p markers (P = 1.2 x 10(-36) for this occurring by chance). This indicates the occurrence of allele-specific loss in these foci similar to that seen in the tumor by a currently unknown mechanism. Analysis of all of the data indicated multiple regions of localized 3p allele loss including telomere-D3S1597, D3S1111-D3S2432, D3S2432-D3S1537, D3S1537, D3S1537-D3S1612, D3S4604/Luca19.1-D3S4622/Luca4.1, D3S4624/Luca2.1, D3S4624/Luca2.1-D3S1582, D3S1766, D3S1234-D3S1300 (FHIT/FRA3B region centered on D3S1300), D3S1284-D3S1577 (U2020/DUTT1 region centered on D3S1274), and D3S1511-centromere. A panel of six markers in the 600-kb 3p21.3 deletion region showed loss in 77% of the lung cancers, 70% of normal or preneoplastic/preinvasive lesions associated with lung cancer, and 49% of 47 normal, mildly abnormal, or preneoplastic/preinvasive lesions found in smokers without lung cancer; however, loss was seen in 0% of 18 epithelial samples from seven never smokers. The 600-kb 3p21.3 region and the 3p14.2 (FHIT/FRA3B) and 3p12 (U2020/DUTT1) regions were common, independent sites of breakpoints (retention of heterozygosity by some markers and LOH by other markers in the immediate region). We conclude that 3p allele loss is nearly universal in lung cancer pathogenesis; involves multiple, discrete, 3p LOH sites that often show a "discontinuous LOH" pattern in individual tumors; occurs in preneoplastic/preinvasive lesions in smokers with and without lung cancer (multiple lesions often lose the same parental allele); frequently involves breakpoints in at least three very small defined genomic regions; and appears to have allele loss and breakpoints first occurring in the 600-kb 3p21.3 region. These findings are consistent with previously reported LOH studies in a variety of tumors showing allele loss occurring by mitotic recombination and induced by oxidative damage.


Subject(s)
Bronchi/pathology , Chromosome Breakage , Chromosomes, Human, Pair 3 , Lung Neoplasms/genetics , Precancerous Conditions/genetics , Respiratory Mucosa/pathology , Adult , Aged , Aged, 80 and over , Alleles , Carcinoma, Non-Small-Cell Lung/genetics , Carcinoma, Non-Small-Cell Lung/pathology , Carcinoma, Small Cell/genetics , Carcinoma, Small Cell/pathology , Chromosome Deletion , Chromosome Mapping , Female , Genetic Markers , Humans , Lung Neoplasms/pathology , Male , Middle Aged , Neoplasm Invasiveness , Precancerous Conditions/pathology , Tumor Cells, Cultured
6.
Proc Natl Acad Sci U S A ; 95(13): 7514-9, 1998 Jun 23.
Article in English | MEDLINE | ID: mdl-9636181

ABSTRACT

A computational system for the prediction of polymorphic loci directly and efficiently from human genomic sequence was developed and verified. A suite of programs, collectively called POMPOUS (polymorphic marker prediction of ubiquitous simple sequences) detects tandem repeats ranging from dinucleotides up to 250 mers, scores them according to predicted level of polymorphism, and designs appropriate flanking primers for PCR amplification. This approach was validated on an approximately 750-kilobase region of human chromosome 3p21.3, involved in lung and breast carcinoma homozygous deletions. Target DNA from 36 paired B lymphoblastoid and lung cancer lines was amplified and allelotyped for 33 loci predicted by POMPOUS to be variable in repeat size. We found that among those 36 predominately Caucasian individuals 22 of the 33 (67%) predicted loci were polymorphic with an average heterozygosity of 0.42. Allele loss in this region was found in 27/36 (75%) of the tumor lines using these markers. POMPOUS provides the genetic researcher with an additional tool for the rapid and efficient identification of polymorphic markers, and through a World Wide Web site, investigators can use POMPOUS to identify polymorphic markers for their research. A catalog of 13,261 potential polymorphic markers and associated primer sets has been created from the analysis of 141,779,504 base pairs of human genomic sequence in GenBank. This data is available on our Web site (pompous.swmed.edu) and will be updated periodically as GenBank is expanded and algorithm accuracy is improved.


Subject(s)
Genetic Markers , Polymorphism, Genetic , Computer Communication Networks , Human Genome Project , Humans , Numerical Analysis, Computer-Assisted , Repetitive Sequences, Nucleic Acid , Software , Tumor Cells, Cultured
7.
Biotechniques ; 22(6): 1162-3, 1997 Jun.
Article in English | MEDLINE | ID: mdl-9187770

ABSTRACT

Automated DNA sequencing requires the intensive use of computers to handle the large amount of data taken. When a computer failure occurs and the data are no longer accessible, all the expense and effort that went into the sequencing experiment is lost. By using the data storage architecture of Macintosh computers to our advantage, we may prevent this loss in the case of automatic sequencers from PE Applied Biosystems. The software required to allow the experimenter to do this has been written and is available free of charge.


Subject(s)
Computer Storage Devices , Information Storage and Retrieval , Sequence Analysis, DNA , Software , Automation , Computer Communication Networks , Sequence Analysis, DNA/instrumentation
8.
Curr Genet ; 27(1): 62-9, 1994 Dec.
Article in English | MEDLINE | ID: mdl-7750148

ABSTRACT

Conidiation in the filamentous ascomycete Aspergillus nidulans requires activation of brlA, a well-characterized transcriptional regulator of genes that are induced specifically during asexual development. We have isolated and characterized developmental mutations in six loci, designated fluG, flbA, flbB, flbC, flbD, and flbE, that result in defective development and reduced brlA expression. These mutants grow indeterminately to produce masses of aerial hyphae resulting in the formation of cotton-like colonies with a "fluffy" morphology. The results of growth and epistasis tests involving all pairwise combinations of fluffy mutations indicate complex hierarchical relationships among these loci. We discuss these genetic interactions and propose that there are multiple mechanisms for activating brlA.


Subject(s)
Aspergillus nidulans/genetics , Fungal Proteins/physiology , Gene Expression Regulation, Developmental , Gene Expression Regulation, Fungal , Genes, Fungal , Reproduction, Asexual/genetics , Transcription Factors , Alleles , Aspergillus nidulans/growth & development , Aspergillus nidulans/ultrastructure , Cloning, Molecular , Epistasis, Genetic , Fungal Proteins/biosynthesis , Fungal Proteins/genetics , Genetic Complementation Test , Genetic Linkage , Phenotype , Promoter Regions, Genetic , Recombinant Fusion Proteins/biosynthesis
SELECTION OF CITATIONS
SEARCH DETAIL
...