Search | VHL Regional Portal

1.

Genome sequencing and comprehensive rare-variant analysis of 465 families with neurodevelopmental disorders.

Sanchis-Juan, Alba; Megy, Karyn; Stephens, Jonathan; Armirola Ricaurte, Camila; Dewhurst, Eleanor; Low, Kayyi; French, Courtney E; Grozeva, Detelina; Stirrups, Kathleen; Erwood, Marie; McTague, Amy; Penkett, Christopher J; Shamardina, Olga; Tuna, Salih; Daugherty, Louise C; Gleadall, Nicholas; Duarte, Sofia T; Hedrera-Fernández, Antonio; Vogt, Julie; Ambegaonkar, Gautam; Chitre, Manali; Josifova, Dragana; Kurian, Manju A; Parker, Alasdair; Rankin, Julia; Reid, Evan; Wakeling, Emma; Wassmer, Evangeline; Woods, C Geoffrey; Raymond, F Lucy; Carss, Keren J.

Am J Hum Genet ; 110(8): 1343-1355, 2023 08 03.

Article in English | MEDLINE | ID: mdl-37541188

ABSTRACT

Despite significant progress in unraveling the genetic causes of neurodevelopmental disorders (NDDs), a substantial proportion of individuals with NDDs remain without a genetic diagnosis after microarray and/or exome sequencing. Here, we aimed to assess the power of short-read genome sequencing (GS), complemented with long-read GS, to identify causal variants in participants with NDD from the National Institute for Health and Care Research (NIHR) BioResource project. Short-read GS was conducted on 692 individuals (489 affected and 203 unaffected relatives) from 465 families. Additionally, long-read GS was performed on five affected individuals who had structural variants (SVs) in technically challenging regions, had complex SVs, or required distal variant phasing. Causal variants were identified in 36% of affected individuals (177/489), and a further 23% (112/489) had a variant of uncertain significance after multiple rounds of re-analysis. Among all reported variants, 88% (333/380) were coding nuclear SNVs or insertions and deletions (indels), and the remainder were SVs, non-coding variants, and mitochondrial variants. Furthermore, long-read GS facilitated the resolution of challenging SVs and invalidated variants of difficult interpretation from short-read GS. This study demonstrates the value of short-read GS, complemented with long-read GS, in investigating the genetic causes of NDDs. GS provides a comprehensive and unbiased method of identifying all types of variants throughout the nuclear and mitochondrial genomes in individuals with NDD.

Subject(s)

Genome, Human , Neurodevelopmental Disorders , Humans , Genome, Human/genetics , Chromosome Mapping , Base Sequence , INDEL Mutation , Neurodevelopmental Disorders/genetics

2.

The Gene Curation Coalition: A global effort to harmonize gene-disease evidence resources.

DiStefano, Marina T; Goehringer, Scott; Babb, Lawrence; Alkuraya, Fowzan S; Amberger, Joanna; Amin, Mutaz; Austin-Tse, Christina; Balzotti, Marie; Berg, Jonathan S; Birney, Ewan; Bocchini, Carol; Bruford, Elspeth A; Coffey, Alison J; Collins, Heather; Cunningham, Fiona; Daugherty, Louise C; Einhorn, Yaron; Firth, Helen V; Fitzpatrick, David R; Foulger, Rebecca E; Goldstein, Jennifer; Hamosh, Ada; Hurles, Matthew R; Leigh, Sarah E; Leong, Ivone U S; Maddirevula, Sateesh; Martin, Christa L; McDonagh, Ellen M; Olry, Annie; Puzriakova, Arina; Radtke, Kelly; Ramos, Erin M; Rath, Ana; Riggs, Erin Rooney; Roberts, Angharad M; Rodwell, Charlotte; Snow, Catherine; Stark, Zornitza; Tahiliani, Jackie; Tweedie, Susan; Ware, James S; Weller, Phillip; Williams, Eleanor; Wright, Caroline F; Yates, Thabo Michael; Rehm, Heidi L.

Genet Med ; 24(8): 1732-1742, 2022 08.

Article in English | MEDLINE | ID: mdl-35507016

ABSTRACT

PURPOSE: Several groups and resources provide information that pertains to the validity of gene-disease relationships used in genomic medicine and research; however, universal standards and terminologies to define the evidence base for the role of a gene in disease and a single harmonized resource were lacking. To tackle this issue, the Gene Curation Coalition (GenCC) was formed. METHODS: The GenCC drafted harmonized definitions for differing levels of gene-disease validity on the basis of existing resources, and performed a modified Delphi survey with 3 rounds to narrow the list of terms. The GenCC also developed a unified database to display curated gene-disease validity assertions from its members. RESULTS: On the basis of 241 survey responses from the genetics community, a consensus term set was chosen for grading gene-disease validity and database submissions. As of December 2021, the database contained 15,241 gene-disease assertions on 4569 unique genes from 12 submitters. When comparing submissions to the database from distinct sources, conflicts in assertions of gene-disease validity ranged from 5.3% to 13.4%. CONCLUSION: Terminology standardization, sharing of gene-disease validity classifications, and resolution of curation conflicts will facilitate collaborations across international curation efforts and in turn, improve consistency in genetic testing and variant interpretation.

Subject(s)

Databases, Genetic , Genomics , Genetic Testing , Genetic Variation , Humans

3.

Whole genome sequencing for the diagnosis of neurological repeat expansion disorders in the UK: a retrospective diagnostic accuracy and prospective clinical validation study.

Ibañez, Kristina; Polke, James; Hagelstrom, R Tanner; Dolzhenko, Egor; Pasko, Dorota; Thomas, Ellen Rachel Amy; Daugherty, Louise C; Kasperaviciute, Dalia; Smith, Katherine R; Deans, Zandra C; Hill, Sue; Fowler, Tom; Scott, Richard H; Hardy, John; Chinnery, Patrick F; Houlden, Henry; Rendon, Augusto; Caulfield, Mark J; Eberle, Michael A; Taft, Ryan J; Tucci, Arianna.

Lancet Neurol ; 21(3): 234-245, 2022 03.

Article in English | MEDLINE | ID: mdl-35182509

ABSTRACT

BACKGROUND: Repeat expansion disorders affect about 1 in 3000 individuals and are clinically heterogeneous diseases caused by expansions of short tandem DNA repeats. Genetic testing is often locus-specific, resulting in underdiagnosis of people who have atypical clinical presentations, especially in paediatric patients without a previous positive family history. Whole genome sequencing is increasingly used as a first-line test for other rare genetic disorders, and we aimed to assess its performance in the diagnosis of patients with neurological repeat expansion disorders. METHODS: We retrospectively assessed the diagnostic accuracy of whole genome sequencing to detect the most common repeat expansion loci associated with neurological outcomes (AR, ATN1, ATXN1, ATXN2, ATXN3, ATXN7, C9orf72, CACNA1A, DMPK, FMR1, FXN, HTT, and TBP) using samples obtained within the National Health Service in England from patients who were suspected of having neurological disorders; previous PCR test results were used as the reference standard. The clinical accuracy of whole genome sequencing to detect repeat expansions was prospectively examined in previously genetically tested and undiagnosed patients recruited in 2013-17 to the 100â000 Genomes Project in the UK, who were suspected of having a genetic neurological disorder (familial or early-onset forms of ataxia, neuropathy, spastic paraplegia, dementia, motor neuron disease, parkinsonian movement disorders, intellectual disability, or neuromuscular disorders). If a repeat expansion call was made using whole genome sequencing, PCR was used to confirm the result. FINDINGS: The diagnostic accuracy of whole genome sequencing to detect repeat expansions was evaluated against 793 PCR tests previously performed within the NHS from 404 patients. Whole genome sequencing correctly classified 215 of 221 expanded alleles and 1316 of 1321 non-expanded alleles, showing 97·3% sensitivity (95% CI 94·2-99·0) and 99·6% specificity (99·1-99·9) across the 13 disease-associated loci when compared with PCR test results. In samples from 11â631 patients in the 100â000 Genomes Project, whole genome sequencing identified 81 repeat expansions, which were also tested by PCR: 68 were confirmed as repeat expansions in the full pathogenic range, 11 were non-pathogenic intermediate expansions or premutations, and two were non-expanded repeats (16% false discovery rate). INTERPRETATION: In our study, whole genome sequencing for the detection of repeat expansions showed high sensitivity and specificity, and it led to identification of neurological repeat expansion disorders in previously undiagnosed patients. These findings support implementation of whole genome sequencing in clinical laboratories for diagnosis of patients who have a neurological presentation consistent with a repeat expansion disorder. FUNDING: Medical Research Council, Department of Health and Social Care, National Health Service England, National Institute for Health Research, and Illumina.

Subject(s)

DNA Repeat Expansion , State Medicine , Child , Fragile X Mental Retardation Protein/genetics , Humans , Prospective Studies , Retrospective Studies , United Kingdom , Whole Genome Sequencing/methods

4.

Scaling national and international improvement in virtual gene panel curation via a collaborative approach to discordance resolution.

Stark, Zornitza; Foulger, Rebecca E; Williams, Eleanor; Thompson, Bryony A; Patel, Chirag; Lunke, Sebastian; Snow, Catherine; Leong, Ivone U S; Puzriakova, Arina; Daugherty, Louise C; Leigh, Sarah; Boustred, Christopher; Niblock, Olivia; Rueda-Martin, Antonio; Gerasimenko, Oleg; Savage, Kevin; Bellamy, William; Lin, Victor San Kho; Valls, Roman; Gordon, Lavinia; Brittain, Helen K; Thomas, Ellen R A; Taylor Tavares, Ana Lisa; McEntagart, Meriel; White, Susan M; Tan, Tiong Y; Yeung, Alison; Downie, Lilian; Macciocca, Ivan; Savva, Elena; Lee, Crystle; Roesley, Ain; De Fazio, Paul; Deller, Jane; Deans, Zandra C; Hill, Sue L; Caulfield, Mark J; North, Kathryn N; Scott, Richard H; Rendon, Augusto; Hofmann, Oliver; McDonagh, Ellen M.

Am J Hum Genet ; 108(9): 1551-1557, 2021 09 02.

Article in English | MEDLINE | ID: mdl-34329581

ABSTRACT

Clinical validity assessments of gene-disease associations underpin analysis and reporting in diagnostic genomics, and yet wide variability exists in practice, particularly in use of these assessments for virtual gene panel design and maintenance. Harmonization efforts are hampered by the lack of agreed terminology, agreed gene curation standards, and platforms that can be used to identify and resolve discrepancies at scale. We undertook a systematic comparison of the content of 80 virtual gene panels used in two healthcare systems by multiple diagnostic providers in the United Kingdom and Australia. The process was enabled by a shared curation platform, PanelApp, and resulted in the identification and review of 2,144 discordant gene ratings, demonstrating the utility of sharing structured gene-disease validity assessments and collaborative discordance resolution in establishing national and international consensus.

Subject(s)

Consensus , Data Curation/standards , Genetic Diseases, Inborn/genetics , Genomics/standards , Molecular Sequence Annotation/standards , Australia , Biomarkers/metabolism , Data Curation/methods , Delivery of Health Care , Gene Expression , Gene Ontology , Genetic Diseases, Inborn/diagnosis , Genetic Diseases, Inborn/pathology , Genomics/methods , Humans , Mobile Applications/supply & distribution , Terminology as Topic , United Kingdom

5.

Characterization of GDF2 Mutations and Levels of BMP9 and BMP10 in Pulmonary Arterial Hypertension.

Hodgson, Joshua; Swietlik, Emilia M; Salmon, Richard M; Hadinnapola, Charaka; Nikolic, Ivana; Wharton, John; Guo, Jingxu; Liley, James; Haimel, Matthias; Bleda, Marta; Southgate, Laura; Machado, Rajiv D; Martin, Jennifer M; Treacy, Carmen M; Yates, Katherine; Daugherty, Louise C; Shamardina, Olga; Whitehorn, Deborah; Holden, Simon; Bogaard, Harm J; Church, Colin; Coghlan, Gerry; Condliffe, Robin; Corris, Paul A; Danesino, Cesare; Eyries, Mélanie; Gall, Henning; Ghio, Stefano; Ghofrani, Hossein-Ardeschir; Gibbs, J Simon R; Girerd, Barbara; Houweling, Arjan C; Howard, Luke; Humbert, Marc; Kiely, David G; Kovacs, Gabor; Lawrie, Allan; MacKenzie Ross, Robert V; Moledina, Shahin; Montani, David; Olschewski, Andrea; Olschewski, Horst; Ouwehand, Willem H; Peacock, Andrew J; Pepke-Zaba, Joanna; Prokopenko, Inga; Rhodes, Christopher J; Scelsi, Laura; Seeger, Werner; Soubrier, Florent.

Am J Respir Crit Care Med ; 201(5): 575-585, 2020 03 01.

Article in English | MEDLINE | ID: mdl-31661308

ABSTRACT

Rationale: Recently, rare heterozygous mutations in GDF2 were identified in patients with pulmonary arterial hypertension (PAH). GDF2 encodes the circulating BMP (bone morphogenetic protein) type 9, which is a ligand for the BMP2 receptor.Objectives: Here we determined the functional impact of GDF2 mutations and characterized plasma BMP9 and BMP10 levels in patients with idiopathic PAH.Methods: Missense BMP9 mutant proteins were expressed in vitro and the impact on BMP9 protein processing and secretion, endothelial signaling, and functional activity was assessed. Plasma BMP9 and BMP10 levels and activity were assayed in patients with PAH with GDF2 variants and in control subjects. Levels were also measured in a larger cohort of control subjects (n = 120) and patients with idiopathic PAH (n = 260).Measurements and Main Results: We identified a novel rare variation at the GDF2 and BMP10 loci, including copy number variation. In vitro, BMP9 missense proteins demonstrated impaired cellular processing and secretion. Patients with PAH who carried these mutations exhibited reduced plasma levels of BMP9 and reduced BMP activity. Unexpectedly, plasma BMP10 levels were also markedly reduced in these individuals. Although overall BMP9 and BMP10 levels did not differ between patients with PAH and control subjects, BMP10 levels were lower in PAH females. A subset of patients with PAH had markedly reduced plasma levels of BMP9 and BMP10 in the absence of GDF2 mutations.Conclusions: Our findings demonstrate that GDF2 mutations result in BMP9 loss of function and are likely causal. These mutations lead to reduced circulating levels of both BMP9 and BMP10. These findings support therapeutic strategies to enhance BMP9 or BMP10 signaling in PAH.

Subject(s)

Bone Morphogenetic Proteins/genetics , Growth Differentiation Factor 2/genetics , Pulmonary Arterial Hypertension/genetics , Adult , Bone Morphogenetic Proteins/metabolism , Case-Control Studies , DNA Copy Number Variations , Female , Growth Differentiation Factor 2/metabolism , Heterozygote , Humans , Male , Middle Aged , Mutation, Missense , Protein Transport , Pulmonary Arterial Hypertension/metabolism , Sex Factors

6.

PanelApp crowdsources expert knowledge to establish consensus diagnostic gene panels.

Martin, Antonio Rueda; Williams, Eleanor; Foulger, Rebecca E; Leigh, Sarah; Daugherty, Louise C; Niblock, Olivia; Leong, Ivone U S; Smith, Katherine R; Gerasimenko, Oleg; Haraldsdottir, Eik; Thomas, Ellen; Scott, Richard H; Baple, Emma; Tucci, Arianna; Brittain, Helen; de Burca, Anna; Ibañez, Kristina; Kasperaviciute, Dalia; Smedley, Damian; Caulfield, Mark; Rendon, Augusto; McDonagh, Ellen M.

Nat Genet ; 51(11): 1560-1565, 2019 11.

Article in English | MEDLINE | ID: mdl-31676867

Subject(s)

Computational Biology/methods , Crowdsourcing , Genetic Markers , Genetic Testing/methods , Rare Diseases/diagnosis , Rare Diseases/genetics , Software , Consensus , England , Expert Testimony , High-Throughput Nucleotide Sequencing , Humans

7.

Diagnostic high-throughput sequencing of 2396 patients with bleeding, thrombotic, and platelet disorders.

Downes, Kate; Megy, Karyn; Duarte, Daniel; Vries, Minka; Gebhart, Johanna; Hofer, Stefanie; Shamardina, Olga; Deevi, Sri V V; Stephens, Jonathan; Mapeta, Rutendo; Tuna, Salih; Al Hasso, Namir; Besser, Martin W; Cooper, Nichola; Daugherty, Louise; Gleadall, Nick; Greene, Daniel; Haimel, Matthias; Martin, Howard; Papadia, Sofia; Revel-Vilk, Shoshana; Sivapalaratnam, Suthesh; Symington, Emily; Thomas, Will; Thys, Chantal; Tolios, Alexander; Penkett, Christopher J; Ouwehand, Willem H; Abbs, Stephen; Laffan, Michael A; Turro, Ernest; Simeoni, Ilenia; Mumford, Andrew D; Henskens, Yvonne M C; Pabinger, Ingrid; Gomez, Keith; Freson, Kathleen.

Blood ; 134(23): 2082-2091, 2019 12 05.

Article in English | MEDLINE | ID: mdl-31064749

ABSTRACT

A targeted high-throughput sequencing (HTS) panel test for clinical diagnostics requires careful consideration of the inclusion of appropriate diagnostic-grade genes, the ability to detect multiple types of genomic variation with high levels of analytic sensitivity and reproducibility, and variant interpretation by a multidisciplinary team (MDT) in the context of the clinical phenotype. We have sequenced 2396 index patients using the ThromboGenomics HTS panel test of diagnostic-grade genes known to harbor variants associated with rare bleeding, thrombotic, or platelet disorders (BTPDs). The molecular diagnostic rate was determined by the clinical phenotype, with an overall rate of 49.2% for all thrombotic, coagulation, platelet count, and function disorder patients and a rate of 3.2% for patients with unexplained bleeding disorders characterized by normal hemostasis test results. The MDT classified 745 unique variants, including copy number variants (CNVs) and intronic variants, as pathogenic, likely pathogenic, or variants of uncertain significance. Half of these variants (50.9%) are novel and 41 unique variants were identified in 7 genes recently found to be implicated in BTPDs. Inspection of canonical hemostasis pathways identified 29 patients with evidence of oligogenic inheritance. A molecular diagnosis has been reported for 894 index patients providing evidence that introducing an HTS genetic test is a valuable addition to laboratory diagnostics in patients with a high likelihood of having an inherited BTPD.

Subject(s)

Blood Platelet Disorders , Hemorrhage , High-Throughput Nucleotide Sequencing , Thrombosis , Blood Platelet Disorders/diagnosis , Blood Platelet Disorders/genetics , Female , Gene Dosage , Hemorrhage/diagnosis , Hemorrhage/genetics , Hemostasis/genetics , Humans , Male , Thrombosis/diagnosis , Thrombosis/genetics

8.

Identification of rare sequence variation underlying heritable pulmonary arterial hypertension.

Gräf, Stefan; Haimel, Matthias; Bleda, Marta; Hadinnapola, Charaka; Southgate, Laura; Li, Wei; Hodgson, Joshua; Liu, Bin; Salmon, Richard M; Southwood, Mark; Machado, Rajiv D; Martin, Jennifer M; Treacy, Carmen M; Yates, Katherine; Daugherty, Louise C; Shamardina, Olga; Whitehorn, Deborah; Holden, Simon; Aldred, Micheala; Bogaard, Harm J; Church, Colin; Coghlan, Gerry; Condliffe, Robin; Corris, Paul A; Danesino, Cesare; Eyries, Mélanie; Gall, Henning; Ghio, Stefano; Ghofrani, Hossein-Ardeschir; Gibbs, J Simon R; Girerd, Barbara; Houweling, Arjan C; Howard, Luke; Humbert, Marc; Kiely, David G; Kovacs, Gabor; MacKenzie Ross, Robert V; Moledina, Shahin; Montani, David; Newnham, Michael; Olschewski, Andrea; Olschewski, Horst; Peacock, Andrew J; Pepke-Zaba, Joanna; Prokopenko, Inga; Rhodes, Christopher J; Scelsi, Laura; Seeger, Werner; Soubrier, Florent; Stein, Dan F.

Nat Commun ; 9(1): 1416, 2018 04 12.

Article in English | MEDLINE | ID: mdl-29650961

ABSTRACT

Pulmonary arterial hypertension (PAH) is a rare disorder with a poor prognosis. Deleterious variation within components of the transforming growth factor-ß pathway, particularly the bone morphogenetic protein type 2 receptor (BMPR2), underlies most heritable forms of PAH. To identify the missing heritability we perform whole-genome sequencing in 1038 PAH index cases and 6385 PAH-negative control subjects. Case-control analyses reveal significant overrepresentation of rare variants in ATP13A3, AQP1 and SOX17, and provide independent validation of a critical role for GDF2 in PAH. We demonstrate familial segregation of mutations in SOX17 and AQP1 with PAH. Mutations in GDF2, encoding a BMPR2 ligand, lead to reduced secretion from transfected cells. In addition, we identify pathogenic mutations in the majority of previously reported PAH genes, and provide evidence for further putative genes. Taken together these findings contribute new insights into the molecular basis of PAH and indicate unexplored pathways for therapeutic intervention.

Subject(s)

Adenosine Triphosphatases/chemistry , Aquaporin 1/chemistry , Familial Primary Pulmonary Hypertension/genetics , Growth Differentiation Factors/chemistry , Membrane Transport Proteins/chemistry , Mutation , SOXF Transcription Factors/chemistry , Adenosine Triphosphatases/genetics , Adenosine Triphosphatases/metabolism , Adult , Aquaporin 1/genetics , Aquaporin 1/metabolism , Base Sequence , Bone Morphogenetic Protein Receptors, Type II/genetics , Bone Morphogenetic Protein Receptors, Type II/metabolism , Case-Control Studies , Familial Primary Pulmonary Hypertension/diagnosis , Familial Primary Pulmonary Hypertension/metabolism , Familial Primary Pulmonary Hypertension/pathology , Female , Gene Expression Regulation , Genetic Predisposition to Disease , Growth Differentiation Factor 2 , Growth Differentiation Factors/genetics , Growth Differentiation Factors/metabolism , HEK293 Cells , Humans , Male , Membrane Transport Proteins/genetics , Membrane Transport Proteins/metabolism , Models, Molecular , Prognosis , SOXF Transcription Factors/genetics , SOXF Transcription Factors/metabolism , Signal Transduction , Transforming Growth Factor beta/genetics , Transforming Growth Factor beta/metabolism , Whole Genome Sequencing

9.

The Allelic Landscape of Human Blood Cell Trait Variation and Links to Common Complex Disease.

Astle, William J; Elding, Heather; Jiang, Tao; Allen, Dave; Ruklisa, Dace; Mann, Alice L; Mead, Daniel; Bouman, Heleen; Riveros-Mckay, Fernando; Kostadima, Myrto A; Lambourne, John J; Sivapalaratnam, Suthesh; Downes, Kate; Kundu, Kousik; Bomba, Lorenzo; Berentsen, Kim; Bradley, John R; Daugherty, Louise C; Delaneau, Olivier; Freson, Kathleen; Garner, Stephen F; Grassi, Luigi; Guerrero, Jose; Haimel, Matthias; Janssen-Megens, Eva M; Kaan, Anita; Kamat, Mihir; Kim, Bowon; Mandoli, Amit; Marchini, Jonathan; Martens, Joost H A; Meacham, Stuart; Megy, Karyn; O'Connell, Jared; Petersen, Romina; Sharifi, Nilofar; Sheard, Simon M; Staley, James R; Tuna, Salih; van der Ent, Martijn; Walter, Klaudia; Wang, Shuang-Yin; Wheeler, Eleanor; Wilder, Steven P; Iotchkova, Valentina; Moore, Carmel; Sambrook, Jennifer; Stunnenberg, Hendrik G; Di Angelantonio, Emanuele; Kaptoge, Stephen.

Cell ; 167(5): 1415-1429.e19, 2016 11 17.

Article in English | MEDLINE | ID: mdl-27863252

ABSTRACT

Many common variants have been associated with hematological traits, but identification of causal genes and pathways has proven challenging. We performed a genome-wide association analysis in the UK Biobank and INTERVAL studies, testing 29.5 million genetic variants for association with 36 red cell, white cell, and platelet properties in 173,480 European-ancestry participants. This effort yielded hundreds of low frequency (<5%) and rare (<1%) variants with a strong impact on blood cell phenotypes. Our data highlight general properties of the allelic architecture of complex traits, including the proportion of the heritable component of each blood trait explained by the polygenic signal across different genome regulatory domains. Finally, through Mendelian randomization, we provide evidence of shared genetic pathways linking blood cell indices with complex pathologies, including autoimmune diseases, schizophrenia, and coronary heart disease and evidence suggesting previously reported population associations between blood cell indices and cardiovascular disease may be non-causal.

Subject(s)

Genetic Variation , Genome-Wide Association Study , Hematopoietic Stem Cells/metabolism , Immune System Diseases/genetics , Alleles , Cell Differentiation , Genetic Predisposition to Disease , Hematopoietic Stem Cells/pathology , Humans , Immune System Diseases/pathology , Polymorphism, Single Nucleotide , Quantitative Trait Loci , White People/genetics

10.

A high-throughput sequencing test for diagnosing inherited bleeding, thrombotic, and platelet disorders.

Simeoni, Ilenia; Stephens, Jonathan C; Hu, Fengyuan; Deevi, Sri V V; Megy, Karyn; Bariana, Tadbir K; Lentaigne, Claire; Schulman, Sol; Sivapalaratnam, Suthesh; Vries, Minka J A; Westbury, Sarah K; Greene, Daniel; Papadia, Sofia; Alessi, Marie-Christine; Attwood, Antony P; Ballmaier, Matthias; Baynam, Gareth; Bermejo, Emilse; Bertoli, Marta; Bray, Paul F; Bury, Loredana; Cattaneo, Marco; Collins, Peter; Daugherty, Louise C; Favier, Rémi; French, Deborah L; Furie, Bruce; Gattens, Michael; Germeshausen, Manuela; Ghevaert, Cedric; Goodeve, Anne C; Guerrero, Jose A; Hampshire, Daniel J; Hart, Daniel P; Heemskerk, Johan W M; Henskens, Yvonne M C; Hill, Marian; Hogg, Nancy; Jolley, Jennifer D; Kahr, Walter H; Kelly, Anne M; Kerr, Ron; Kostadima, Myrto; Kunishima, Shinji; Lambert, Michele P; Liesner, Ri; López, José A; Mapeta, Rutendo P; Mathias, Mary; Millar, Carolyn M.

Blood ; 127(23): 2791-803, 2016 06 09.

Article in English | MEDLINE | ID: mdl-27084890

ABSTRACT

Inherited bleeding, thrombotic, and platelet disorders (BPDs) are diseases that affect â¼300 individuals per million births. With the exception of hemophilia and von Willebrand disease patients, a molecular analysis for patients with a BPD is often unavailable. Many specialized tests are usually required to reach a putative diagnosis and they are typically performed in a step-wise manner to control costs. This approach causes delays and a conclusive molecular diagnosis is often never reached, which can compromise treatment and impede rapid identification of affected relatives. To address this unmet diagnostic need, we designed a high-throughput sequencing platform targeting 63 genes relevant for BPDs. The platform can call single nucleotide variants, short insertions/deletions, and large copy number variants (though not inversions) which are subjected to automated filtering for diagnostic prioritization, resulting in an average of 5.34 candidate variants per individual. We sequenced 159 and 137 samples, respectively, from cases with and without previously known causal variants. Among the latter group, 61 cases had clinical and laboratory phenotypes indicative of a particular molecular etiology, whereas the remainder had an a priori highly uncertain etiology. All previously detected variants were recapitulated and, when the etiology was suspected but unknown or uncertain, a molecular diagnosis was reached in 56 of 61 and only 8 of 76 cases, respectively. The latter category highlights the need for further research into novel causes of BPDs. The ThromboGenomics platform thus provides an affordable DNA-based test to diagnose patients suspected of having a known inherited BPD.

Subject(s)

Blood Platelet Disorders/genetics , Genetic Predisposition to Disease , Hemorrhage/genetics , High-Throughput Nucleotide Sequencing/methods , Thrombosis/genetics , Case-Control Studies , DNA Copy Number Variations , Female , Genetic Association Studies/methods , Humans , Male , Mutation , Polymorphism, Single Nucleotide , Sequence Analysis, DNA/methods

11.

The InterPro protein families database: the classification resource after 15 years.

Mitchell, Alex; Chang, Hsin-Yu; Daugherty, Louise; Fraser, Matthew; Hunter, Sarah; Lopez, Rodrigo; McAnulla, Craig; McMenamin, Conor; Nuka, Gift; Pesseat, Sebastien; Sangrador-Vegas, Amaia; Scheremetjew, Maxim; Rato, Claudia; Yong, Siew-Yit; Bateman, Alex; Punta, Marco; Attwood, Teresa K; Sigrist, Christian J A; Redaschi, Nicole; Rivoire, Catherine; Xenarios, Ioannis; Kahn, Daniel; Guyot, Dominique; Bork, Peer; Letunic, Ivica; Gough, Julian; Oates, Matt; Haft, Daniel; Huang, Hongzhan; Natale, Darren A; Wu, Cathy H; Orengo, Christine; Sillitoe, Ian; Mi, Huaiyu; Thomas, Paul D; Finn, Robert D.

Nucleic Acids Res ; 43(Database issue): D213-21, 2015 Jan.

Article in English | MEDLINE | ID: mdl-25428371

ABSTRACT

The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36,766 member database signatures integrated into 26,238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012.

Subject(s)

Databases, Protein , Proteins/classification , Bacteria/metabolism , Gene Ontology , Protein Structure, Tertiary , Proteins/genetics , Sequence Analysis, Protein , Software

12.

Genenames.org: the HGNC resources in 2013.

Gray, Kristian A; Daugherty, Louise C; Gordon, Susan M; Seal, Ruth L; Wright, Mathew W; Bruford, Elspeth A.

Nucleic Acids Res ; 41(Database issue): D545-52, 2013 Jan.

Article in English | MEDLINE | ID: mdl-23161694

ABSTRACT

The HUGO Gene Nomenclature Committee situated at the European Bioinformatics Institute assigns unique symbols and names to human genes. Since 2011, the data within our database has expanded largely owing to an increase in naming pseudogenes and non-coding RNA genes, and we now have >33,500 approved symbols. Our gene families and groups have also increased to nearly 500, with â¼45% of our gene entries associated to at least one family or group. We have also redesigned the HUGO Gene Nomenclature Committee website http://www.genenames.org creating a constant look and feel across the site and improving usability and readability for our users. The site provides a public access portal to our database with no restrictions imposed on access or the use of the data. Within this article, we review our online resources and data with particular emphasis on the updates to our website.

Subject(s)

Databases, Genetic , Genes , Terminology as Topic , Humans , Internet , Proteins/genetics

13.

Gene family matters: expanding the HGNC resource.

Daugherty, Louise C; Seal, Ruth L; Wright, Mathew W; Bruford, Elspeth A.

Hum Genomics ; 6: 4, 2012 Jul 05.

Article in English | MEDLINE | ID: mdl-23245209

ABSTRACT

The HUGO Gene Nomenclature Committee (HGNC) assigns approved gene symbols to human loci. There are currently over 33,000 approved gene symbols, the majority of which represent protein-coding genes, but we also name other locus types such as non-coding RNAs, pseudogenes and phenotypic loci. Where relevant, the HGNC organise these genes into gene families and groups. The HGNC website http://www.genenames.org/ is an online repository of HGNC-approved gene nomenclature and associated resources for human genes, and includes links to genomic, proteomic and phenotypic information. In addition to this, we also have dedicated gene family web pages and are currently expanding and generating more of these pages using data curated by the HGNC and from information derived from external resources that focus on particular gene families. Here, we review our current online resources with a particular focus on our gene family data, using it to highlight our new Gene Symbol Report and gene family data downloads.

Subject(s)

Databases, Genetic , Genetic Loci/genetics , Multigene Family/genetics , Proteins/genetics , Terminology as Topic , Genetic Variation , Genomics/methods , Humans , Proteins/classification , Proteins/metabolism , Proteomics/methods , Web Browser

14.

InterPro in 2011: new developments in the family and domain prediction database.

Hunter, Sarah; Jones, Philip; Mitchell, Alex; Apweiler, Rolf; Attwood, Teresa K; Bateman, Alex; Bernard, Thomas; Binns, David; Bork, Peer; Burge, Sarah; de Castro, Edouard; Coggill, Penny; Corbett, Matthew; Das, Ujjwal; Daugherty, Louise; Duquenne, Lauranne; Finn, Robert D; Fraser, Matthew; Gough, Julian; Haft, Daniel; Hulo, Nicolas; Kahn, Daniel; Kelly, Elizabeth; Letunic, Ivica; Lonsdale, David; Lopez, Rodrigo; Madera, Martin; Maslen, John; McAnulla, Craig; McDowall, Jennifer; McMenamin, Conor; Mi, Huaiyu; Mutowo-Muellenet, Prudence; Mulder, Nicola; Natale, Darren; Orengo, Christine; Pesseat, Sebastien; Punta, Marco; Quinn, Antony F; Rivoire, Catherine; Sangrador-Vegas, Amaia; Selengut, Jeremy D; Sigrist, Christian J A; Scheremetjew, Maxim; Tate, John; Thimmajanarthanan, Manjulapramila; Thomas, Paul D; Wu, Cathy H; Yeats, Corin; Yong, Siew-Yit.

Nucleic Acids Res ; 40(Database issue): D306-12, 2012 Jan.

Article in English | MEDLINE | ID: mdl-22096229

ABSTRACT

InterPro (http://www.ebi.ac.uk/interpro/) is a database that integrates diverse information about protein families, domains and functional sites, and makes it freely available to the public via Web-based interfaces and services. Central to the database are diagnostic models, known as signatures, against which protein sequences can be searched to determine their potential function. InterPro has utility in the large-scale analysis of whole genomes and meta-genomes, as well as in characterizing individual protein sequences. Herein we give an overview of new developments in the database and its associated software since 2009, including updates to database content, curation processes and Web and programmatic interfaces.

Subject(s)

Databases, Protein , Protein Structure, Tertiary , Proteins/classification , Proteins/physiology , Sequence Analysis, Protein , Software , Terminology as Topic , User-Computer Interface

15.

InterPro: the integrative protein signature database.

Hunter, Sarah; Apweiler, Rolf; Attwood, Teresa K; Bairoch, Amos; Bateman, Alex; Binns, David; Bork, Peer; Das, Ujjwal; Daugherty, Louise; Duquenne, Lauranne; Finn, Robert D; Gough, Julian; Haft, Daniel; Hulo, Nicolas; Kahn, Daniel; Kelly, Elizabeth; Laugraud, Aurélie; Letunic, Ivica; Lonsdale, David; Lopez, Rodrigo; Madera, Martin; Maslen, John; McAnulla, Craig; McDowall, Jennifer; Mistry, Jaina; Mitchell, Alex; Mulder, Nicola; Natale, Darren; Orengo, Christine; Quinn, Antony F; Selengut, Jeremy D; Sigrist, Christian J A; Thimma, Manjula; Thomas, Paul D; Valentin, Franck; Wilson, Derek; Wu, Cathy H; Yeats, Corin.

Nucleic Acids Res ; 37(Database issue): D211-5, 2009 Jan.

Article in English | MEDLINE | ID: mdl-18940856

ABSTRACT

The InterPro database (http://www.ebi.ac.uk/interpro/) integrates together predictive models or 'signatures' representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Integration is performed manually and approximately half of the total approximately 58,000 signatures available in the source databases belong to an InterPro entry. Recently, we have started to also display the remaining un-integrated signatures via our web interface. Other developments include the provision of non-signature data, such as structural data, in new XML files on our FTP site, as well as the inclusion of matchless UniProtKB proteins in the existing match XML files. The web interface has been extended and now links out to the ADAN predicted protein-protein interaction database and the SPICE and Dasty viewers. The latest public release (v18.0) covers 79.8% of UniProtKB (v14.1) and consists of 16 549 entries. InterPro data may be accessed either via the web address above, via web services, by downloading files by anonymous FTP or by using the InterProScan search software (http://www.ebi.ac.uk/Tools/InterProScan/).

Subject(s)

Databases, Protein , Sequence Analysis, Protein , Proteins/chemistry , Proteins/classification , Systems Integration

16.

New developments in the InterPro database.

Mulder, Nicola J; Apweiler, Rolf; Attwood, Teresa K; Bairoch, Amos; Bateman, Alex; Binns, David; Bork, Peer; Buillard, Virginie; Cerutti, Lorenzo; Copley, Richard; Courcelle, Emmanuel; Das, Ujjwal; Daugherty, Louise; Dibley, Mark; Finn, Robert; Fleischmann, Wolfgang; Gough, Julian; Haft, Daniel; Hulo, Nicolas; Hunter, Sarah; Kahn, Daniel; Kanapin, Alexander; Kejariwal, Anish; Labarga, Alberto; Langendijk-Genevaux, Petra S; Lonsdale, David; Lopez, Rodrigo; Letunic, Ivica; Madera, Martin; Maslen, John; McAnulla, Craig; McDowall, Jennifer; Mistry, Jaina; Mitchell, Alex; Nikolskaya, Anastasia N; Orchard, Sandra; Orengo, Christine; Petryszak, Robert; Selengut, Jeremy D; Sigrist, Christian J A; Thomas, Paul D; Valentin, Franck; Wilson, Derek; Wu, Cathy H; Yeats, Corin.

Nucleic Acids Res ; 35(Database issue): D224-8, 2007 Jan.

Article in English | MEDLINE | ID: mdl-17202162

ABSTRACT

InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro). The InterProScan search tool is now also available via a web service at http://www.ebi.ac.uk/Tools/webservices/WSInterProScan.html.

Subject(s)

Databases, Protein , Internet , Protein Structure, Tertiary , Proteins/chemistry , Proteins/classification , Proteins/physiology , Sequence Analysis, Protein , Systems Integration , User-Computer Interface

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL