Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
2.
Hum Genomics ; 11(1): 23, 2017 10 11.
Article in English | MEDLINE | ID: mdl-29020978

ABSTRACT

There are bioethical, institutional, economic, legal, and cultural obstacles to creating the robust-precompetitive-data resource that will be required to advance the vision of "precision medicine," the ability to use molecular data to target therapies to patients for whom they offer the most benefit at the least risk. Creation of such an "information commons" was the central recommendation of the 2011 report Toward Precision Medicine issued by a committee of the National Research Council of the USA (Committee on a Framework for Development of a New Taxonomy of Disease; National Research Council. Toward precision medicine: building a knowledge network for biomedical research and a new taxonomy of disease. 2011). In this commentary, I review the rationale for creating an information commons and the obstacles to doing so; then, I endorse a path forward based on the dynamic consent of research subjects interacting with researchers through trusted mediators. I assert that the advantages of the proposed system overwhelm alternative ways of handling data on the phenotypes, genotypes, and environmental exposures of individual humans; hence, I argue that its creation should be the central policy objective of early efforts to make precision medicine a reality.


Subject(s)
Delivery of Health Care/organization & administration , Human Genetics/trends , Precision Medicine/trends , Genome, Human , Human Genetics/ethics , Human Genetics/organization & administration , Humans , Information Technology , Phenotype , Precision Medicine/methods , Privacy
3.
Genomics Proteomics Bioinformatics ; 15(1): 3-10, 2017 02.
Article in English | MEDLINE | ID: mdl-28188874
4.
Article in English | MEDLINE | ID: mdl-22657391

ABSTRACT

The central preoccupation of human genetics is an effort to understand the genotypic basis of human phenotypic diversity. Although recent progress in identifying the genes that, when mutated, underlie major genetic diseases has been rapid, knowledge of the genetic influences on the vast range of variable, and at least partially heritable, traits that constitute the "normal" range of human phenotypic variation lags. Spectacular advances in our knowledge of human genetic variation have laid the groundwork for a synthesis of insights from medical genetics, population genetics, molecular evolution, and the study of human origins that places basic constraints on models of human genetic individuality. Balancing selection, local adaptation, mutation-selection balance, and founder effects have all extensively shaped contemporary genetic variation. Long-term-balancing selection appears largely to reflect the consequences of host-pathogen arms races. Local adaptation has been widespread-and involved responses to a plethora of selective pressures, some identifiable but most unknown. However, it appears to be a combination of mutation-selection balance and founder effects that largely accounts for genetic individuality. If true, this inference has major implications for future research programs in human genetics.


Subject(s)
Genetic Variation , Animals , Evolution, Molecular , Genetics, Medical/history , Genome, Human , Haplotypes , History, 20th Century , Humans , Models, Genetic , Mutation , Selection, Genetic , Sequence Analysis, DNA
5.
J Bacteriol ; 193(14): 3668-9, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21571998

ABSTRACT

The genome sequence of the aceticlastic methanoarchaeon Methanosaeta concilii GP6, comprised of a 3,008,626-bp chromosome and an 18,019-bp episome, has been determined and exhibits considerable differences in gene content from that of Methanosaeta thermophila.


Subject(s)
Genome, Archaeal , Methane/metabolism , Methanosarcinaceae/genetics , Methanosarcinaceae/metabolism , Base Sequence , DNA, Archaeal/genetics , Methanosarcinaceae/classification , Molecular Sequence Data , Sequence Analysis, DNA
7.
Am J Hum Genet ; 86(1): 34-44, 2010 Jan.
Article in English | MEDLINE | ID: mdl-20085712

ABSTRACT

It is well known that average levels of population structure are higher on the X chromosome compared to autosomes in humans. However, there have been surprisingly few analyses on the spatial distribution of population structure along the X chromosome. With publicly available data from the HapMap Project and Perlegen Sciences, we show a strikingly punctuated pattern of X chromosome population structure. Specifically, 87% of X-linked HapMap SNPs within the top 1% of F(ST) values cluster into five distinct loci. The largest of these regions spans 5.4 Mb and contains 66% of the most highly differentiated HapMap SNPs on the X chromosome. We demonstrate that the extreme clustering of highly differentiated SNPs on the X chromosome is not an artifact of ascertainment bias, nor is it specific to the populations genotyped in the HapMap Project. Rather, additional analyses and resequencing data suggest that these five regions have been substrates of recent and strong adaptive evolution. Finally, we discuss the implications that patterns of X-linked population structure have on the evolutionary history of African populations.


Subject(s)
Black People/genetics , Chromosomes, Human, X , Evolution, Molecular , Genetics, Population , Population Groups/genetics , Alleles , Databases, Genetic , Gene Frequency , Genome, Human , Genotype , Humans , Models, Genetic , Polymorphism, Single Nucleotide , Sequence Analysis, DNA , United States
8.
Am J Hum Genet ; 84(5): 692-7, 2009 May.
Article in English | MEDLINE | ID: mdl-19409521

ABSTRACT

We have established strong linkage evidence that supports mapping autosomal-dominant sensory/motor neuropathy with ataxia (SMNA) to chromosome 7q22-q32. SMNA is a rare neurological disorder whose phenotype encompasses both the central and the peripheral nervous system. In order to identify a gene responsible for SMNA, we have undertaken a comprehensive genomic evaluation of the region of linkage, including evaluation for repeat expansion and small deletions or duplications, capillary sequencing of candidate genes, and massively parallel sequencing of all coding exons. We excluded repeat expansion and small deletions or duplications as causative, and through microarray-based hybrid capture and massively parallel short-read sequencing, we identified a nonsynonymous variant in the human interferon-related developmental regulator gene 1 (IFRD1) as a disease-causing candidate. Sequence conservation, animal models, and protein structure evaluation support the involvement of IFRD1 in SMNA. Mutation analysis of IFRD1 in additional patients with similar phenotypes is needed for demonstration of causality and further evaluation of its importance in neurological diseases.


Subject(s)
Ataxia/genetics , Chromosomes, Human, Pair 7/genetics , Genetic Predisposition to Disease , Hereditary Sensory and Motor Neuropathy/genetics , Immediate-Early Proteins/genetics , Humans , Mutation , Pedigree
9.
PLoS One ; 4(5): e5584, 2009.
Article in English | MEDLINE | ID: mdl-19440302

ABSTRACT

BACKGROUND: Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared. METHODOLOGY/PRINCIPAL FINDINGS: The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm) gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau) gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17) that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name "island integration determinant" (iid). CONCLUSION/SIGNIFICANCE: These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the adaptive mechanisms which allow bacterial lineages to acquire methylotrophic lifestyles.


Subject(s)
Genome, Bacterial/genetics , Methylobacterium/genetics , Methylobacterium/metabolism , Acyl Coenzyme A/metabolism , Formaldehyde/metabolism , Genome, Bacterial/physiology , Methanol/metabolism , Methylamines/metabolism , Models, Biological , Models, Genetic
10.
Genomics ; 91(6): 530-7, 2008 Jun.
Article in English | MEDLINE | ID: mdl-18445516

ABSTRACT

Large-insert genome analysis (LIGAN) is a broadly applicable, high-throughput technology designed to characterize genome-scale structural variation. Fosmid paired-end sequences and DNA fingerprints from a query genome are compared to a reference sequence using the Genomic Variation Analysis (GenVal) suite of software tools to pinpoint locations of insertions, deletions, and rearrangements. Fosmids spanning regions that contain new structural variants can then be sequenced. Clonal pairs of Pseudomonas aeruginosa isolates from four cystic fibrosis patients were used to validate the LIGAN technology. Approximately 1.5 Mb of inserted sequences were identified, including 743 kb containing 615 ORFs that are absent from published P. aeruginosa genomes. Six rearrangement breakpoints and 220 kb of deleted sequences were also identified. Our study expands the "genome universe" of P. aeruginosa and validates a technology that complements emerging, short-read sequencing methods that are better suited to characterizing single-nucleotide polymorphisms than structural variation.


Subject(s)
Cystic Fibrosis/microbiology , DNA Fingerprinting/methods , DNA Mutational Analysis/methods , Genome, Bacterial , Pseudomonas aeruginosa/genetics , Base Sequence , Genetic Variation , Humans , Molecular Sequence Data , Mutagenesis, Insertional , Pseudomonas aeruginosa/isolation & purification , Recombination, Genetic , Sequence Deletion
11.
Nature ; 453(7191): 56-64, 2008 May 01.
Article in English | MEDLINE | ID: mdl-18451855

ABSTRACT

Genetic variation among individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single nucleotide changes. Here we explore variation on an intermediate scale--particularly insertions, deletions and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1,695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number between individuals. Complete sequencing of 261 structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence map of human structural variation--a standard for genotyping platforms and a prelude to future individual genome sequencing projects.


Subject(s)
Genetic Variation/genetics , Genome, Human/genetics , Physical Chromosome Mapping , Sequence Analysis, DNA , Chromosome Inversion/genetics , Euchromatin/genetics , Gene Deletion , Geography , Haplotypes , Humans , Mutagenesis, Insertional/genetics , Polymorphism, Single Nucleotide/genetics , Racial Groups/genetics , Reproducibility of Results
13.
Nat Genet ; 40(1): 96-101, 2008 Jan.
Article in English | MEDLINE | ID: mdl-18157130

ABSTRACT

The human genome sequence has been finished to very high standards; however, more than 340 gaps remained when the finished genome was published by the International Human Genome Sequencing Consortium in 2004. Using fosmid resources generated from multiple individuals, we targeted gaps in the euchromatic part of the human genome. Here we report 2,488,842 bp of previously unknown euchromatic sequence, 363,114 bp of which close 26 of 250 euchromatic gaps, or 10%, including two remaining euchromatic gaps on chromosome 19. Eight (30.7%) of the closed gaps were found to be polymorphic. These sequences allow complete annotation of several human genes as well as the assignment of mRNAs. The gap sequences are 2.3-fold enriched in segmentally duplicated sequences compared to the whole genome. Our analysis confirms that not all gaps within 'finished' genomes are recalcitrant to subcloning and suggests that the paired-end-sequenced fosmid libraries could prove to be a rich resource for completion of the human euchromatic genome.


Subject(s)
Chromosomes, Human, Pair 19 , Genome, Human , Base Sequence , Cloning, Molecular , Euchromatin , Gene Library , Genetic Vectors , Human Genome Project , Humans , Molecular Sequence Data , Polymorphism, Genetic
14.
Genome Biol ; 8(6): R102, 2007.
Article in English | MEDLINE | ID: mdl-17550600

ABSTRACT

BACKGROUND: Francisella tularensis subspecies tularensis and holarctica are pathogenic to humans, whereas the two other subspecies, novicida and mediasiatica, rarely cause disease. To uncover the factors that allow subspecies tularensis and holarctica to be pathogenic to humans, we compared their genome sequences with the genome sequence of Francisella tularensis subspecies novicida U112, which is nonpathogenic to humans. RESULTS: Comparison of the genomes of human pathogenic Francisella strains with the genome of U112 identifies genes specific to the human pathogenic strains and reveals pseudogenes that previously were unidentified. In addition, this analysis provides a coarse chronology of the evolutionary events that took place during the emergence of the human pathogenic strains. Genomic rearrangements at the level of insertion sequences (IS elements), point mutations, and small indels took place in the human pathogenic strains during and after differentiation from the nonpathogenic strain, resulting in gene inactivation. CONCLUSION: The chronology of events suggests a substantial role for genetic drift in the formation of pseudogenes in Francisella genomes. Mutations that occurred early in the evolution, however, might have been fixed in the population either because of evolutionary bottlenecks or because they were pathoadaptive (beneficial in the context of infection). Because the structure of Francisella genomes is similar to that of the genomes of other emerging or highly pathogenic bacteria, this evolutionary scenario may be shared by pathogens from other species.


Subject(s)
Francisella tularensis/genetics , Francisella tularensis/pathogenicity , DNA Transposable Elements , Evolution, Molecular , Francisella tularensis/classification , Genome, Bacterial , Humans , Mutation , Pseudogenes , Virulence
15.
Am J Hum Genet ; 79(5): 958-64, 2006 Nov.
Article in English | MEDLINE | ID: mdl-17033972

ABSTRACT

Comparisons between haplotypes from affected patients and the human reference genome are frequently used to identify candidates for disease-causing mutations, even though these alignments are expected to reveal a high level of background neutral polymorphism. This limits the scope of genetic studies to relatively small genomic intervals, because current methods for distinguishing potential causal mutations from neutral variation are inefficient. Here we describe a new strategy for detecting mutations that is based on comparing affected haplotypes with closely matched control sequences from healthy individuals, rather than with the human reference genome. We use theory, simulation, and a real data set to show that this approach is expected to reduce the number of sequence variants that must be subjected to follow-up analysis by at least a factor of 20 when closely matched control sequences are selected from a reference panel with as few as 100 control genomes. We also define a reference data resource that would allow efficient application of this strategy to large critical intervals across the genome.


Subject(s)
Genetic Diseases, Inborn/genetics , Genome, Human , Haplotypes , Mutation , Alleles , Case-Control Studies , Computer Simulation , Databases, Genetic , Gene Frequency , Genomics/methods , Genomics/statistics & numerical data , Humans , Models, Genetic , Polymorphism, Genetic , Polymorphism, Single Nucleotide , Recombination, Genetic
16.
Proc Natl Acad Sci U S A ; 103(22): 8487-92, 2006 May 30.
Article in English | MEDLINE | ID: mdl-16687478

ABSTRACT

In many human infections, hosts and pathogens coexist for years or decades. Important examples include HIV, herpes viruses, tuberculosis, leprosy, and malaria. With the exception of intensively studied viral infections such as HIV/AIDs, little is known about the extent to which the clonal expansion that occurs during long-term infection by pathogens involves important genetic adaptations. We report here a detailed, whole-genome analysis of one such infection, that of a cystic fibrosis (CF) patient by the opportunistic bacterial pathogen Pseudomonas aeruginosa. The bacteria underwent numerous genetic adaptations during 8 years of infection, as evidenced by a positive-selection signal across the genome and an overwhelming signal in specific genes, several of which are mutated during the course of most CF infections. Of particular interest is our finding that virulence factors that are required for the initiation of acute infections are often selected against during chronic infections. It is apparent that the genotypes of the P. aeruginosa strains present in advanced CF infections differ systematically from those of "wild-type" P. aeruginosa and that these differences may offer new opportunities for treatment of this chronic disease.


Subject(s)
Adaptation, Physiological/genetics , Cystic Fibrosis/complications , Cystic Fibrosis/microbiology , Pseudomonas Infections/complications , Pseudomonas Infections/microbiology , Pseudomonas aeruginosa/genetics , Pseudomonas aeruginosa/physiology , Bacterial Proteins/genetics , Chronic Disease , Cystic Fibrosis/pathology , DNA-Binding Proteins/genetics , Genome, Bacterial/genetics , Humans , Molecular Sequence Data , Mutation/genetics , Pseudomonas Infections/pathology , Pseudomonas aeruginosa/isolation & purification , Selection, Genetic , Time Factors , Trans-Activators/genetics
17.
Hum Mol Genet ; 15(7): 1159-67, 2006 Apr 01.
Article in English | MEDLINE | ID: mdl-16497726

ABSTRACT

The contribution of large-scale and intermediate-size structural variation (ISV) to human genetic disease and disease susceptibility is only beginning to be understood. The development of high-throughput genotyping technologies is one of the most critical aspects for future studies of linkage disequilibrium (LD) and disease association. Using a simple PCR-based method designed to assay the junctions of the breakpoints, we genotyped seven simple insertion and deletion polymorphisms ranging in size from 6.3 to 24.7 kb among 90 CEPH individuals. We then extended this analysis to a larger collection of samples (n=460) by application of an oligonucleotide extension-ligation genotyping assay. The analysis showed a high level of concordance ( approximately 99%) when compared with PCR/sequence-validated genotypes. Using the available HapMap data, we observed significant LD (r2=0.74-0.95) between each ISV and flanking single nucleotide polymorphisms, but this observation is likely to hold only for similar simple insertion/deletion events. The approach we describe may be used to characterize a large number of individuals in a cost-effective manner once the sequence organization of ISVs is known.


Subject(s)
Genetic Testing/methods , Genotype , Cohort Studies , Female , Genetic Variation , Humans , Linkage Disequilibrium , Male , Microarray Analysis , Models, Genetic , Polymorphism, Single Nucleotide
18.
Genomics ; 86(6): 759-66, 2005 Dec.
Article in English | MEDLINE | ID: mdl-16249066

ABSTRACT

Currently, challenges exist to acquire long-range (hundreds of kilobase pairs) phase-discriminated sequence across substantial numbers of individuals. We have developed a straightforward method for isolating and characterizing specific genomic regions in a haplospecific manner. Real-time PCR is carried out to STS content map and genotype pools of fosmid clones arrayed in 384-well microtiter plates. Single-nucleotide polymorphisms, microsatellite markers, and insertion-deletion polymorphisms are used to differentiate the target region into haplotype-specific tiling paths. DNA of clones from these tiling paths is retrieved from the library and either sequenced by standard shotgun methods or amplified in vitro and sequenced by a primer-based, directed method. This approach provides convenient access to complete, haplotype-resolved resequencing data from multiple individuals across tens to hundreds of thousands of basepairs. We illustrate its implementation with a detailed example of more than 400 kbp from the human CFTR region, across 15 individuals, and summarize our experience applying it to many other human loci.


Subject(s)
Genome, Human/genetics , Haplotypes/genetics , Sequence Analysis, DNA/methods , Base Sequence , Cloning, Molecular/methods , Genotype , Humans , Microsatellite Repeats/genetics , Molecular Sequence Data , Polymorphism, Single Nucleotide/genetics
19.
Genome Res ; 15(9): 1250-7, 2005 Sep.
Article in English | MEDLINE | ID: mdl-16140993

ABSTRACT

Allelic variation in codons that specify amino acids that line the peptide-binding pockets of HLA's Class II antigen-presenting proteins is superimposed on strikingly few deeply diverged haplotypes. These haplotypes appear to have been evolving almost independently for tens of millions of years. By complete resequencing of 20 haplotypes across the approximately 100-kbp region that spans the HLA-DQA1, -DQB1, and -DRB1 genes, we provide a detailed view of the way in which the genome structure at this locus has been shaped by the interplay of selection, gene-gene interaction, and recombination.


Subject(s)
Genes, MHC Class II , Alleles , Animals , Evolution, Molecular , Genetic Variation , Genome, Human , Gorilla gorilla/genetics , Gorilla gorilla/immunology , HLA-DQ Antigens/genetics , HLA-DQ alpha-Chains , HLA-DQ beta-Chains , HLA-DR Antigens/genetics , HLA-DRB1 Chains , Haplotypes , Humans , Linkage Disequilibrium , Models, Genetic , Molecular Sequence Data , Pan troglodytes/genetics , Pan troglodytes/immunology , Phylogeny , Polymorphism, Single Nucleotide , Recombination, Genetic , Selection, Genetic
20.
Nat Genet ; 37(7): 727-32, 2005 Jul.
Article in English | MEDLINE | ID: mdl-15895083

ABSTRACT

Inversions, deletions and insertions are important mediators of disease and disease susceptibility. We systematically compared the human genome reference sequence with a second genome (represented by fosmid paired-end sequences) to detect intermediate-sized structural variants >8 kb in length. We identified 297 sites of structural variation: 139 insertions, 102 deletions and 56 inversion breakpoints. Using combined literature, sequence and experimental analyses, we validated 112 of the structural variants, including several that are of biomedical relevance. These data provide a fine-scale structural variation map of the human genome and the requisite sequence precision for subsequent genetic studies of human disease.


Subject(s)
Genome, Human , Genomic Instability , Mutation , Polymorphism, Genetic , Base Pairing , Cell Line, Tumor , Computational Biology , Humans , Oligonucleotide Array Sequence Analysis , Reference Values , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL
...