Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 30
Filter
1.
Sci Adv ; 8(36): eabn4030, 2022 09 09.
Article in English | MEDLINE | ID: mdl-36083902

ABSTRACT

Genome-wide analysis of cell-free DNA methylation profile is a promising approach for sensitive and specific detection of many cancers. However, scaling such assays for clinical translation is impractical because of the high cost of whole-genome bisulfite sequencing. We show that the small fraction of GC-rich genome is highly enriched in CpG sites and disproportionately harbors most of the cancer-specific methylation signature. Here, we report on the simple and effective heat enrichment of CpG-rich regions for bisulfite sequencing (Heatrich-BS) platform that allows for focused methylation profiling in these highly informative regions. Our novel method and bioinformatics algorithm enable accurate tumor burden estimation and quantitative tracking of colorectal cancer patient's response to treatment at much reduced sequencing cost suitable for frequent monitoring. We also show tumor epigenetic subtyping using Heatrich-BS, which could enable patient stratification. Heatrich-BS holds great potential for highly scalable screening and monitoring of cancer using liquid biopsy.


Subject(s)
Cell-Free Nucleic Acids , Neoplasms , Cell-Free Nucleic Acids/genetics , DNA Methylation , Epigenome , Hot Temperature , Humans , Neoplasms/diagnosis , Neoplasms/genetics , Sequence Analysis, DNA/methods
2.
Nat Biotechnol ; 39(11): 1394-1402, 2021 11.
Article in English | MEDLINE | ID: mdl-34282325

ABSTRACT

RNA modifications, such as N6-methyladenosine (m6A), modulate functions of cellular RNA species. However, quantifying differences in RNA modifications has been challenging. Here we develop a computational method, xPore, to identify differential RNA modifications from nanopore direct RNA sequencing (RNA-seq) data. We evaluate our method on transcriptome-wide m6A profiling data, demonstrating that xPore identifies positions of m6A sites at single-base resolution, estimates the fraction of modified RNA species in the cell and quantifies the differential modification rate across conditions. We apply xPore to direct RNA-seq data from six cell lines and multiple myeloma patient samples without a matched control sample and find that many m6A sites are preserved across cell types, whereas a subset exhibit significant differences in their modification rates. Our results show that RNA modifications can be identified from direct RNA-seq data with high accuracy, enabling analysis of differential modifications and expression from a single high-throughput experiment.


Subject(s)
Nanopore Sequencing , Nanopores , High-Throughput Nucleotide Sequencing , Humans , RNA/genetics , RNA/metabolism , RNA Processing, Post-Transcriptional/genetics , Sequence Analysis, RNA/methods , Transcriptome/genetics
4.
Gut ; 70(10): 1823-1832, 2021 10.
Article in English | MEDLINE | ID: mdl-33229445

ABSTRACT

OBJECTIVE: Endoscopic mucosal biopsies of primary gastric cancers (GCs) are used to guide diagnosis, biomarker testing and treatment. Spatial intratumoural heterogeneity (ITH) may influence biopsy-derived information. We aimed to study ITH of primary GCs and matched lymph node metastasis (LNmet). DESIGN: GC resection samples were annotated to identify primary tumour superficial (PTsup), primary tumour deep (PTdeep) and LNmet subregions. For each subregion, we determined (1) transcriptomic profiles (NanoString 'PanCancer Progression Panel', 770 genes); (2) next-generation sequencing (NGS, 225 gastrointestinal cancer-related genes); (3) DNA copy number profiles by multiplex ligation-dependent probe amplification (MLPA, 16 genes); and (4) histomorphological phenotypes. RESULTS: NanoString profiling of 64 GCs revealed no differences between PTsup1 and PTsup2, while 43% of genes were differentially expressed between PTsup versus PTdeep and 38% in PTsup versus LNmet. Only 16% of genes were differently expressed between PTdeep and LNmet. Several genes with therapeutic potential (eg IGF1, PIK3CD and TGFB1) were overexpressed in LNmet and PTdeep compared with PTsup. NGS data revealed orthogonal support of NanoString results with 40% mutations present in PTdeep and/or LNmet, but not in PTsup. Conversely, only 6% of mutations were present in PTsup and were absent in PTdeep and LNmet. MLPA demonstrated significant ITH between subregions and progressive genomic changes from PTsup to PTdeep/LNmet. CONCLUSION: In GC, regional lymph node metastases are likely to originate from deeper subregions of the primary tumour. Future clinical trials of novel targeted therapies must consider assessment of deeper subregions of the primary tumour and/or metastases as several therapeutically relevant genes are only mutated, overexpressed or amplified in these regions.


Subject(s)
Lymphatic Metastasis/genetics , Lymphatic Metastasis/pathology , Neoplasm Proteins/genetics , Stomach Neoplasms/genetics , Stomach Neoplasms/pathology , Adenocarcinoma/genetics , Adenocarcinoma/pathology , DNA Copy Number Variations , Genes, Neoplasm , Genomics , High-Throughput Nucleotide Sequencing , Humans , Phenotype , Registries
6.
Nat Biotechnol ; 39(3): 336-346, 2021 03.
Article in English | MEDLINE | ID: mdl-33106685

ABSTRACT

Current methods for determining RNA structure with short-read sequencing cannot capture most differences between distinct transcript isoforms. Here we present RNA structure analysis using nanopore sequencing (PORE-cupine), which combines structure probing using chemical modifications with direct long-read RNA sequencing and machine learning to detect secondary structures in cellular RNAs. PORE-cupine also captures global structural features, such as RNA-binding-protein binding sites and reactivity differences at single-nucleotide variants. We show that shared sequences in different transcript isoforms of the same gene can fold into different structures, highlighting the importance of long-read sequencing for obtaining phase information. We also demonstrate that structural differences between transcript isoforms of the same gene lead to differences in translation efficiency. By revealing isoform-specific RNA structure, PORE-cupine will deepen understanding of the role of structures in controlling gene regulation.


Subject(s)
Nanopore Sequencing/methods , Nucleic Acid Conformation , RNA/chemistry , Sequence Analysis, RNA/methods , Human Embryonic Stem Cells/metabolism , Humans , Isomerism , RNA/genetics , Tetrahymena/genetics , Transcriptome
7.
Arch Dis Child ; 106(1): 31-37, 2021 01.
Article in English | MEDLINE | ID: mdl-32819910

ABSTRACT

OBJECTIVE: Use next-generation sequencing (NGS) technology to improve our diagnostic yield in patients with suspected genetic disorders in the Asian setting. DESIGN: A diagnostic study conducted between 2014 and 2019 (and ongoing) under the Singapore Undiagnosed Disease Program. Date of last analysis was 1 July 2019. SETTING: Inpatient and outpatient genetics service at two large academic centres in Singapore. PATIENTS: Inclusion criteria: patients suspected of genetic disorders, based on abnormal antenatal ultrasound, multiple congenital anomalies and developmental delay. EXCLUSION CRITERIA: patients with known genetic disorders, either after clinical assessment or investigations (such as karyotype or chromosomal microarray). INTERVENTIONS: Use of NGS technology-whole exome sequencing (WES) or whole genome sequencing (WGS). MAIN OUTCOME MEASURES: (1) Diagnostic yield by sequencing type, (2) diagnostic yield by phenotypical categories, (3) reduction in time to diagnosis and (4) change in clinical outcomes and management. RESULTS: We demonstrate a 37.8% diagnostic yield for WES (n=172) and a 33.3% yield for WGS (n=24). The yield was higher when sequencing was conducted on trios (40.2%), as well as for certain phenotypes (neuromuscular, 54%, and skeletal dysplasia, 50%). In addition to aiding genetic counselling in 100% of the families, a positive result led to a change in treatment in 27% of patients. CONCLUSION: Genomic sequencing is an effective method for diagnosing rare disease or previous 'undiagnosed' disease. The clinical utility of WES/WGS is seen in the shortened time to diagnosis and the discovery of novel variants. Additionally, reaching a diagnosis significantly impacts families and leads to alteration in management of these patients.


Subject(s)
Abnormalities, Multiple/genetics , Developmental Disabilities/genetics , High-Throughput Nucleotide Sequencing , Undiagnosed Diseases/genetics , Abnormalities, Multiple/diagnosis , Adolescent , Adult , Child , Child, Preschool , Developmental Disabilities/diagnosis , Female , Humans , Infant , Male , Singapore , Undiagnosed Diseases/diagnosis , Young Adult
8.
Sci Rep ; 10(1): 16774, 2020 10 08.
Article in English | MEDLINE | ID: mdl-33033274

ABSTRACT

Analysis of circulating cell-free DNA (cfDNA) has opened new opportunities for characterizing tumour mutational landscapes with many applications in genomic-driven oncology. We developed a customized targeted cfDNA sequencing approach for breast cancer (BC) using unique molecular identifiers (UMIs) for error correction. Our assay, spanning a 284.5 kb target region, is combined with a novel freely-licensed bioinformatics pipeline that provides detection of low-frequency variants, and reliable identification of copy number variations (CNVs) directly from plasma DNA. We first evaluated our pipeline on reference samples. Then in a cohort of 35 BC patients our approach detected actionable driver and clonal variants at low variant frequency levels in cfDNA that were concordant (77%) with sequencing of primary and/or metastatic solid tumour sites. We also detected ERRB2 gene CNVs used for HER2 subtype classification with 80% precision compared to immunohistochemistry. Further, we evaluated fragmentation profiles of cfDNA in BC and observed distinct differences compared to data from healthy individuals. Our results show that the developed assay addresses the majority of tumour associated aberrations directly from plasma DNA, and thus may be used to elucidate genomic alterations in liquid biopsy studies.


Subject(s)
Breast Neoplasms/genetics , Circulating Tumor DNA/genetics , DNA Copy Number Variations , Adult , Aged , Biomarkers, Tumor/genetics , Breast Neoplasms/pathology , Female , High-Throughput Nucleotide Sequencing , Humans , Middle Aged , Mutation , Sequence Analysis, DNA
9.
Nucleic Acids Res ; 46(22): 11659-11670, 2018 12 14.
Article in English | MEDLINE | ID: mdl-30412255

ABSTRACT

N6-methyldeoxyadenosine (6mA) is a well-characterized DNA modification in prokaryotes but reports on its presence and function in mammals have been controversial. To address this issue, we established the capacity of 6mA-Crosslinking-Exonuclease-sequencing (6mACE-seq) to detect genome-wide 6mA at single-nucleotide-resolution, demonstrating this by accurately mapping 6mA in synthesized DNA and bacterial genomes. Using 6mACE-seq, we generated a human-genome-wide 6mA map that accurately reproduced known 6mA enrichment at active retrotransposons and revealed mitochondrial chromosome-wide 6mA clusters asymmetrically enriched on the heavy-strand. We identified a novel putative 6mA-binding protein in single-stranded DNA-binding protein 1 (SSBP1), a mitochondrial DNA (mtDNA) replication factor known to coat the heavy-strand, linking 6mA with the regulation of mtDNA replication. Finally, we characterized AlkB homologue 1 (ALKBH1) as a mitochondrial protein with 6mA demethylase activity and showed that its loss decreases mitochondrial oxidative phosphorylation. Our results show that 6mA clusters play a previously unappreciated role in regulating human mitochondrial function, despite 6mA being an uncommon DNA modification in the human genome.


Subject(s)
DNA, Mitochondrial/genetics , DNA-Binding Proteins/genetics , DNA/genetics , Deoxyadenosines/genetics , Genome, Mitochondrial , Mitochondrial Proteins/genetics , AlkB Homolog 1, Histone H2a Dioxygenase/genetics , AlkB Homolog 1, Histone H2a Dioxygenase/metabolism , Bacteriophage lambda/genetics , Bacteriophage lambda/metabolism , Base Sequence , Chromosome Mapping , DNA/metabolism , DNA, Mitochondrial/metabolism , DNA-Binding Proteins/metabolism , Deoxyadenosines/metabolism , Escherichia coli/genetics , Escherichia coli/metabolism , Exodeoxyribonucleases , HEK293 Cells , Humans , Mitochondrial Proteins/metabolism , Oxidative Phosphorylation , Salmonella typhimurium/genetics , Salmonella typhimurium/metabolism , Sequence Analysis, DNA , Viral Proteins/chemistry , Viral Proteins/metabolism
10.
Breast Cancer Res Treat ; 171(3): 719-735, 2018 Oct.
Article in English | MEDLINE | ID: mdl-29926297

ABSTRACT

PURPOSE: The purpose of the study was to improve the understanding of NF1-associated breast cancer, given the increased risk of breast cancer in this tumour predisposition syndrome and the limited data. METHODS: We identified 18 women with NF1 and breast cancer at our institution. Clinical and pathologic characteristics of NF1-associated breast cancers were compared with 7132 breast cancers in patients without NF1 from our institutional database. Next generation sequencing was performed on DNA from blood and breast cancer specimens available. Blood specimens negative for NF1 mutation were subjected to multiplex ligation-dependent probe amplification (MLPA) to identify complete/partial deletions or duplications. Expression of neurofibromin in the NF1-associated breast cancers was evaluated using immunohistochemistry. RESULTS: There was a higher frequency of grade 3 (83.3% vs 45.4%, p = 0.005), oestrogen receptor (ER) negative (66.7% vs 26.3%, p < 0.001) and human epidermal growth factor receptor 2 (HER2)-positive (66.7% vs 23.4%, p < 0.001) tumours among NF1 patients compared to non-NF1 breast cancers. Overall survival was inferior in NF1 patients in multivariable analysis (hazard ratio 2.25, 95% CI 1.11-4.60; p = 0.025). Apart from germline NF1 mutations (11/16; 69%), somatic mutations in TP53 (8/10; 80%), second-hit NF1 (2/10; 20%), KMT2C (4/10; 40%), KMT2D (2/10; 20%), and PIK3CA (2/10; 20%) were observed. Immunohistochemical expression of neurofibromin was seen in the nuclei and/or cytoplasm of all specimens, but without any consistent pattern in the intensity or extent. CONCLUSIONS: This comprehensive series of NF1-associated breast cancers suggests that their aggressive features are related to germline NF1 mutations in cooperation with somatic mutations in TP53, KMT2C and other genes.


Subject(s)
Genes, Neurofibromatosis 1 , Neurofibromatosis 1/diagnosis , Neurofibromatosis 1/genetics , Adult , Aged , Biomarkers, Tumor , DNA Mutational Analysis , Female , Gene Amplification , Gene Expression Regulation, Neoplastic , High-Throughput Nucleotide Sequencing , Humans , Immunohistochemistry , Kaplan-Meier Estimate , Middle Aged , Mutation , Neoplasm Grading , Neoplasm Staging , Neurofibromatosis 1/epidemiology , Neurofibromatosis 1/mortality
11.
Sci Rep ; 7: 40737, 2017 01 19.
Article in English | MEDLINE | ID: mdl-28102343

ABSTRACT

Circulating tumour DNA (ctDNA) has the potential to be a specific biomarker for the monitoring of tumours in patients with colorectal cancer (CRC). Here, our aim was to develop a personalised surveillance strategy to monitor the clinical course of CRC after surgery. We developed patient-specific ctDNA assays based on multiplexed detection of somatic mutations identified from patient primary tumours, and applied them to detect ctDNA in 44 CRC patients, analysing a total of 260 plasma samples. We found that ctDNA detection correlated with clinical events - it is detectable in pre-operative but not post-operative plasma, and also in patients with recurrent CRC. We also detected ctDNA in 11 out of 15 cases at or before clinical or radiological recurrence of CRC, indicating the potential of our assay for early detection of metastasis. We further present data from a patient with multiple primary cancers to demonstrate the specificity of our assays to distinguish between CRC recurrence and a second primary cancer. Our approach can complement current methods for surveillance of CRC by adding an individualised biological component, allowing us not only to point to the presence of residual or recurrent disease, but also attribute it to the original cancer.


Subject(s)
Biomarkers, Tumor , Circulating Tumor DNA , Colorectal Neoplasms/genetics , DNA, Neoplasm , Colorectal Neoplasms/blood , Colorectal Neoplasms/diagnosis , Colorectal Neoplasms/surgery , Humans , Multiplex Polymerase Chain Reaction , Mutation , Postoperative Period , Recurrence , Reproducibility of Results , Sensitivity and Specificity , Treatment Outcome , Workflow
12.
J Mol Biol ; 426(12): 2283-99, 2014 Jun 12.
Article in English | MEDLINE | ID: mdl-24680668

ABSTRACT

The mixed lineage leukemia-1 (MLL1) enzyme is a histone H3 lysine 4 (H3K4) monomethyltransferase and has served as a paradigm for understanding the mechanism of action of the human SET1 family of enzymes that include MLL1-MLL4 and SETd1a,b. Dimethylation of H3K4 requires a sub-complex including WRAD (WDR5, RbBP5, Ash2L, and DPY-30), which binds to each SET1 family member forming a minimal core complex that is required for multiple lysine methylation. We recently demonstrated that WRAD is a novel histone methyltransferase that preferentially catalyzes H3K4 dimethylation in a manner that is dependent on an unknown non-active-site surface from the MLL1 SET domain. Recent genome sequencing studies have identified a number of human disease-associated missense mutations that localize to the SET domains of several MLL family members. In this investigation, we mapped many of these mutations onto the three-dimensional structure of the SET domain and noticed that a subset of MLL2 (KMT2D, ALR, MLL4)-associated Kabuki syndrome missense mutations map to a common solvent-exposed surface that is not expected to alter enzymatic activity. We introduced these mutations into the MLL1 SET domain and observed that all are defective for H3K4 dimethylation by the MLL1 core complex, which is associated with a loss of the ability of MLL1 to interact with WRAD or with the RbBP5/Ash2L heterodimer. Our results suggest that amino acids from this surface, which we term the Kabuki interaction surface or KIS, are required for formation of a second active site within SET1 family core complexes.


Subject(s)
DNA-Binding Proteins/metabolism , Myeloid-Lymphoid Leukemia Protein/metabolism , Nuclear Proteins/metabolism , Protein Interaction Domains and Motifs , Transcription Factors/metabolism , Amino Acid Sequence , Catalytic Domain , Histone-Lysine N-Methyltransferase , Humans , Models, Molecular , Molecular Sequence Data , Mutant Proteins/chemistry , Mutant Proteins/genetics , Mutant Proteins/metabolism , Mutation, Missense , Myeloid-Lymphoid Leukemia Protein/chemistry , Myeloid-Lymphoid Leukemia Protein/genetics , Protein Binding , Protein Conformation , Protein Interaction Mapping
13.
Nat Rev Genet ; 12(11): 745-55, 2011 Sep 27.
Article in English | MEDLINE | ID: mdl-21946919

ABSTRACT

Exome sequencing - the targeted sequencing of the subset of the human genome that is protein coding - is a powerful and cost-effective new tool for dissecting the genetic basis of diseases and traits that have proved to be intractable to conventional gene-discovery strategies. Over the past 2 years, experimental and analytical approaches relating to exome sequencing have established a rich framework for discovering the genes underlying unsolved Mendelian disorders. Additionally, exome sequencing is being adapted to explore the extent to which rare alleles explain the heritability of complex diseases and health-related traits. These advances also set the stage for applying exome and whole-genome sequencing to facilitate clinical diagnosis and personalized disease-risk profiling.


Subject(s)
Exome/genetics , Genome, Human , Sequence Analysis, DNA/methods , Alleles , Base Sequence , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Molecular Sequence Data , Pedigree , Phenotype
14.
Proc Natl Acad Sci U S A ; 108(41): 17087-92, 2011 Oct 11.
Article in English | MEDLINE | ID: mdl-21949389

ABSTRACT

To catalog protein-altering mutations that may drive the development of prostate cancers and their progression to metastatic disease systematically, we performed whole-exome sequencing of 23 prostate cancers derived from 16 different lethal metastatic tumors and three high-grade primary carcinomas. All tumors were propagated in mice as xenografts, designated the LuCaP series, to model phenotypic variation, such as responses to cancer-directed therapeutics. Although corresponding normal tissue was not available for most tumors, we were able to take advantage of increasingly deep catalogs of human genetic variation to remove most germline variants. On average, each tumor genome contained ~200 novel nonsynonymous variants, of which the vast majority was specific to individual carcinomas. A subset of genes was recurrently altered across tumors derived from different individuals, including TP53, DLK2, GPC6, and SDF4. Unexpectedly, three prostate cancer genomes exhibited substantially higher mutation frequencies, with 2,000-4,000 novel coding variants per exome. A comparison of castration-resistant and castration-sensitive pairs of tumor lines derived from the same prostate cancer highlights mutations in the Wnt pathway as potentially contributing to the development of castration resistance. Collectively, our results indicate that point mutations arising in coding regions of advanced prostate cancers are common but, with notable exceptions, very few genes are mutated in a substantial fraction of tumors. We also report a previously undescribed subtype of prostate cancers exhibiting "hypermutated" genomes, with potential implications for resistance to cancer therapeutics. Our results also suggest that increasingly deep catalogs of human germline variation may challenge the necessity of sequencing matched tumor-normal pairs.


Subject(s)
Exome , Mutation , Prostatic Neoplasms/genetics , Androgen Antagonists/therapeutic use , Animals , Calcium-Binding Proteins/genetics , Cell Line, Tumor , Genes, p53 , Genetic Variation , Glycoproteins/genetics , Glypicans/genetics , Humans , Male , Mice , Neoplasm Metastasis/genetics , Neoplasm Transplantation , Orchiectomy , Point Mutation , Prostatic Neoplasms/therapy , Transplantation, Heterologous , Wnt Signaling Pathway/genetics
15.
BMC Genomics ; 12: 347, 2011 Jul 05.
Article in English | MEDLINE | ID: mdl-21729323

ABSTRACT

BACKGROUND: Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. RESULTS: We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. CONCLUSIONS: This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.


Subject(s)
Exons , Sequence Analysis, DNA/methods , Animals , Cattle , Female , Genome , Genotype , Polymorphism, Single Nucleotide , Software , X Chromosome
16.
Genome Res ; 21(10): 1686-94, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21795384

ABSTRACT

Comparison of protein-coding DNA sequences from diverse primates can provide insight into these species' evolutionary history and uncover the molecular basis for their phenotypic differences. Currently, the number of available primate reference genomes limits these genome-wide comparisons. Here we use targeted capture methods designed for human to sequence the protein-coding regions, or exomes, of four non-human primate species (three Old World monkeys and one New World monkey). Despite average sequence divergence of up to 4% from the human sequence probes, we are able to capture ~96% of coding sequences. Using a combination of mapping and assembly techniques, we generated high-quality full-length coding sequences for each species. Both the number of nucleotide differences and the distribution of insertion and deletion (indel) lengths indicate that the quality of the assembled sequences is very high and exceeds that of most reference genomes. Using this expanded set of primate coding sequences, we performed a genome-wide scan for genes experiencing positive selection and identified a novel class of adaptively evolving genes involved in the conversion of epithelial cells in skin, hair, and nails to keratin. Interestingly, the genes we identify under positive selection also exhibit significantly increased allele frequency differences among human populations, suggesting that they play a role in both recent and long-term adaptation. We also identify several genes that have been lost on specific primate lineages, which illustrate the broad utility of this data set for other evolutionary analyses. These results demonstrate the power of second-generation sequencing in comparative genomics and greatly expand the repertoire of available primate coding sequences.


Subject(s)
Chlorocebus aethiops/genetics , Colobus/genetics , Exome , Macaca mulatta/genetics , Saguinus/genetics , Animals , Evolution, Molecular , Gene Deletion , Humans , INDEL Mutation , Metabolic Networks and Pathways/genetics , Phylogeny , Selection, Genetic , Sequence Alignment , Sequence Analysis, DNA
17.
Am J Med Genet A ; 155A(7): 1511-6, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21671394

ABSTRACT

Kabuki syndrome is a rare, multiple malformation disorder characterized by a distinctive facial appearance, cardiac anomalies, skeletal abnormalities, and mild to moderate intellectual disability. Simplex cases make up the vast majority of the reported cases with Kabuki syndrome, but parent-to-child transmission in more than a half-dozen instances indicates that it is an autosomal dominant disorder. We recently reported that Kabuki syndrome is caused by mutations in MLL2, a gene that encodes a Trithorax-group histone methyltransferase, a protein important in the epigenetic control of active chromatin states. Here, we report on the screening of 110 families with Kabuki syndrome. MLL2 mutations were found in 81/110 (74%) of families. In simplex cases for which DNA was available from both parents, 25 mutations were confirmed to be de novo, while a transmitted MLL2 mutation was found in two of three familial cases. The majority of variants found to cause Kabuki syndrome were novel nonsense or frameshift mutations that are predicted to result in haploinsufficiency. The clinical characteristics of MLL2 mutation-positive cases did not differ significantly from MLL2 mutation-negative cases with the exception that renal anomalies were more common in MLL2 mutation-positive cases. These results are important for understanding the phenotypic consequences of MLL2 mutations for individuals and their families as well as for providing a basis for the identification of additional genes for Kabuki syndrome.


Subject(s)
Abnormalities, Multiple/genetics , DNA-Binding Proteins/genetics , Hematologic Diseases/genetics , Mutation/genetics , Neoplasm Proteins/genetics , Vestibular Diseases/genetics , Abnormalities, Multiple/diagnosis , Alleles , Face/abnormalities , Gene Order , Genetic Testing , Genotype , Hematologic Diseases/diagnosis , Humans , Phenotype , Prognosis , Vestibular Diseases/diagnosis
18.
Nat Genet ; 43(6): 585-9, 2011 Jun.
Article in English | MEDLINE | ID: mdl-21572417

ABSTRACT

Evidence for the etiology of autism spectrum disorders (ASDs) has consistently pointed to a strong genetic component complicated by substantial locus heterogeneity. We sequenced the exomes of 20 individuals with sporadic ASD (cases) and their parents, reasoning that these families would be enriched for de novo mutations of major effect. We identified 21 de novo mutations, 11 of which were protein altering. Protein-altering mutations were significantly enriched for changes at highly conserved residues. We identified potentially causative de novo events in 4 out of 20 probands, particularly among more severely affected individuals, in FOXP1, GRIN2B, SCN1A and LAMC3. In the FOXP1 mutation carrier, we also observed a rare inherited CNTNAP2 missense variant, and we provide functional support for a multi-hit model for disease risk. Our results show that trio-based exome sequencing is a powerful approach for identifying new candidate genes for ASDs and suggest that de novo mutations may contribute substantially to the genetic etiology of ASDs.


Subject(s)
Child Development Disorders, Pervasive/genetics , Genetic Predisposition to Disease , Mutation , Adult , Child , Exons , Female , Humans , Male , Middle Aged , Pedigree , Sequence Analysis, DNA
19.
Nat Biotechnol ; 29(1): 59-63, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21170042

ABSTRACT

Haplotype information is essential to the complete description and interpretation of genomes, genetic diversity and genetic ancestry. Although individual human genome sequencing is increasingly routine, nearly all such genomes are unresolved with respect to haplotype. Here we combine the throughput of massively parallel sequencing with the contiguity information provided by large-insert cloning to experimentally determine the haplotype-resolved genome of a South Asian individual. A single fosmid library was split into a modest number of pools, each providing ∼3% physical coverage of the diploid genome. Sequencing of each pool yielded reads overwhelmingly derived from only one homologous chromosome at any given location. These data were combined with whole-genome shotgun sequence to directly phase 94% of ascertained heterozygous single nucleotide polymorphisms (SNPs) into long haplotype blocks (N50 of 386 kilobases (kbp)). This method also facilitates the analysis of structural variation, for example, to anchor novel insertions to specific locations and haplotypes.


Subject(s)
Asian People/genetics , Genome, Human/genetics , Haplotypes/genetics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Base Sequence , Cell Line , Heterozygote , Humans , Models, Molecular , Polymorphism, Single Nucleotide/genetics
20.
Hum Mol Genet ; 19(R2): R119-24, 2010 Oct 15.
Article in English | MEDLINE | ID: mdl-20846941

ABSTRACT

Massively parallel sequencing has enabled the rapid, systematic identification of variants on a large scale. This has, in turn, accelerated the pace of gene discovery and disease diagnosis on a molecular level and has the potential to revolutionize methods particularly for the analysis of Mendelian disease. Using massively parallel sequencing has enabled investigators to interrogate variants both in the context of linkage intervals and also on a genome-wide scale, in the absence of linkage information entirely. The primary challenge now is to distinguish between background polymorphisms and pathogenic mutations. Recently developed strategies for rare monogenic disorders have met with some early success. These strategies include filtering for potential causal variants based on frequency and function, and also ranking variants based on conservation scores and predicted deleteriousness to protein structure. Here, we review the recent literature in the use of high-throughput sequence data and its analysis in the discovery of causal mutations for rare disorders.


Subject(s)
Genome, Human/genetics , High-Throughput Nucleotide Sequencing , Rare Diseases/genetics , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...