High-quality peptide evidence for annotating non-canonical open reading frames as human proteins.

Deutsch, Eric W; Kok, Leron W; Mudge, Jonathan M; Ruiz-Orera, Jorge; Fierro-Monti, Ivo; Sun, Zhi; Abelin, Jennifer G; Alba, M Mar; Aspden, Julie L; Bazzini, Ariel A; Bruford, Elspeth A; Brunet, Marie A; Calviello, Lorenzo; Carr, Steven A; Carvunis, Anne-Ruxandra; Chothani, Sonia; Clauwaert, Jim; Dean, Kellie; Faridi, Pouya; Frankish, Adam; Hubner, Norbert; Ingolia, Nicholas T; Magrane, Michele; Martin, Maria Jesus; Martinez, Thomas F; Menschaert, Gerben; Ohler, Uwe; Orchard, Sandra; Rackham, Owen; Roucou, Xavier; Slavoff, Sarah A; Valen, Eivind; Wacholder, Aaron; Weissman, Jonathan S; Wu, Wei; Xie, Zhi; Choudhary, Jyoti; Bassani-Sternberg, Michal; Vizcaíno, Juan Antonio; Ternette, Nicola; Moritz, Robert L; Prensner, John R; van Heesch, Sebastiaan

Deutsch, Eric W; Kok, Leron W; Mudge, Jonathan M; Ruiz-Orera, Jorge; Fierro-Monti, Ivo; Sun, Zhi; Abelin, Jennifer G; Alba, M Mar; Aspden, Julie L; Bazzini, Ariel A; Bruford, Elspeth A; Brunet, Marie A; Calviello, Lorenzo; Carr, Steven A; Carvunis, Anne-Ruxandra; Chothani, Sonia; Clauwaert, Jim; Dean, Kellie; Faridi, Pouya; Frankish, Adam; Hubner, Norbert; Ingolia, Nicholas T; Magrane, Michele; Martin, Maria Jesus; Martinez, Thomas F; Menschaert, Gerben; Ohler, Uwe; Orchard, Sandra; Rackham, Owen; Roucou, Xavier; Slavoff, Sarah A; Valen, Eivind; Wacholder, Aaron; Weissman, Jonathan S; Wu, Wei; Xie, Zhi; Choudhary, Jyoti; Bassani-Sternberg, Michal; Vizcaíno, Juan Antonio; Ternette, Nicola; Moritz, Robert L; Prensner, John R; van Heesch, Sebastiaan.

Afiliación

Deutsch EW; Institute for Systems Biology (ISB), Seattle, WA, 98109, USA.
Kok LW; Princess Máxima Center for Pediatric Oncology, Utrecht, 3584 CS, The Netherlands.
Mudge JM; Oncode Institute, Utrecht, The Netherlands.
Ruiz-Orera J; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
Fierro-Monti I; Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, 13125, Germany.
Sun Z; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
Abelin JG; Institute for Systems Biology (ISB), Seattle, WA, 98109, USA.
Alba MM; Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
Aspden JL; Hospital del Mar Research Institute, Barcelona, Spain.
Bazzini AA; Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain.
Bruford EA; School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, UK.
Brunet MA; Stowers Institute for Medical Research, Kansas City, MO, 64110, USA.
Calviello L; Department of Molecular and Integrative Physiology, University of Kansas Medical Center, Kansas City, KS, 66160, USA.
Carr SA; HUGO Gene Nomenclature Committee (HGNC), Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK.
Carvunis AR; Pediatrics Department, University of Sherbrooke, Sherbrooke, Québec, Canada.
Chothani S; Centre de Recherche du Centre hospitalier universitaire de Sherbrooke (CRCHUS), Sherbrooke, Québec, Canada.
Clauwaert J; Human Technopole, Milan, 20157, Italy.
Dean K; Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
Faridi P; Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA.
Frankish A; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA.
Hubner N; Centre for Computational Biology and Program in Cardiovascular and Metabolic Disorders, Duke-NUS (National University of Singapore) Medical School, Singapore.
Ingolia NT; Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
Magrane M; Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
Martin MJ; School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland.
Martinez TF; Centre for Cancer Research, Hudson Institute of Medical Research, Clayton, VIC, Australia.
Menschaert G; Monash Proteomics & Metabolomics Platform, Department of Medicine, School of Clinical Sciences, Monash University, Clayton, VIC, Australia.
Ohler U; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
Orchard S; Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, 13125, Germany.
Rackham O; Charité-Universitätsmedizin Berlin, Berlin, 10117, Germany.
Roucou X; Helmholtz-Institute for Translational AngioCardioScience (HI-TAC) of the Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC) at Heidelberg University, Heidelberg, 69117, Germany.
Slavoff SA; DZHK (German Center for Cardiovascular Research), Partner Site Berlin, Berlin, 13347, Germany.
Valen E; Department of Molecular and Cell Biology, Center for Computational Biology, University of California, Berkeley, Berkeley, CA, 94720-3202, USA.
Wacholder A; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
Weissman JS; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
Wu W; Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, 92617, USA.
Xie Z; Department of Biological Chemistry, University of California, Irvine, Irvine, CA, 92617, USA.
Choudhary J; Chao Family Comprehensive Cancer Center, University of California, Irvine, Irvine, CA, 92617, USA.
Bassani-Sternberg M; Biobix, Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modelling, Statistics and Bioinformatics, Ghent University, Ghent, Belgium.
Vizcaíno JA; Department of Biology, Humboldt University Berlin, Berlin, 10117, Germany.
Ternette N; Berlin Institute of Medical Systems Biology (BIMSB), Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, 10115, Germany.
Moritz RL; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
Prensner JR; University of Southampton, Southampton, UK.
van Heesch S; Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Québec, Canada.

bioRxiv ; 2024 Sep 09.

Article en En | MEDLINE | ID: mdl-39314370

ABSTRACT

ABSTRACT

A major scientific drive is to characterize the protein-coding genome as it provides the primary basis for the study of human health. But the fundamental question remains what has been missed in prior genomic analyses? Over the past decade, the translation of non-canonical open reading frames (ncORFs) has been observed across human cell types and disease states, with major implications for proteomics, genomics, and clinical science. However, the impact of ncORFs has been limited by the absence of a large-scale understanding of their contribution to the human proteome. Here, we report the collaborative efforts of stakeholders in proteomics, immunopeptidomics, Ribo-seq ORF discovery, and gene annotation, to produce a consensus landscape of protein-level evidence for ncORFs. We show that at least 25% of a set of 7,264 ncORFs give rise to translated gene products, yielding over 3,000 peptides in a pan-proteome analysis encompassing 3.8 billion mass spectra from 95,520 experiments. With these data, we developed an annotation framework for ncORFs and created public tools for researchers through GENCODE and PeptideAtlas. This work will provide a platform to advance ncORF-derived proteins in biomedical discovery and, beyond humans, diverse animals and plants where ncORFs are similarly observed.

Palabras clave

GENCODE; Human Proteome Project; Ribo-seq; immunopeptidomics; mass spectrometry; microproteins; non-canonical ORFs; proteomics; translation

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google