Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
1.
Sci Data ; 10(1): 697, 2023 10 13.
Article in English | MEDLINE | ID: mdl-37833331

ABSTRACT

Data-Independent Acquisition (DIA) is a mass spectrometry-based method to reliably identify and reproducibly quantify large fractions of a target proteome. The peptide-centric data analysis strategy employed in DIA requires a priori generated spectral assay libraries. Such assay libraries allow to extract quantitative data in a targeted approach and have been generated for human, mouse, zebrafish, E. coli and few other organisms. However, a spectral assay library for the extreme halophilic archaeon Halobacterium salinarum NRC-1, a model organism that contributed to several notable discoveries, is not publicly available yet. Here, we report a comprehensive spectral assay library to measure 2,563 of 2,646 annotated H. salinarum NRC-1 proteins. We demonstrate the utility of this library by measuring global protein abundances over time under standard growth conditions. The H. salinarum NRC-1 library includes 21,074 distinct peptides representing 97% of the predicted proteome and provides a new, valuable resource to confidently measure and quantify any protein of this archaeon. Data and spectral assay libraries are available via ProteomeXchange (PXD042770, PXD042774) and SWATHAtlas (SAL00312-SAL00319).


Subject(s)
Halobacterium salinarum , Proteome , Halobacterium salinarum/metabolism , Peptides/analysis , Proteome/analysis , Proteomics/methods
2.
Sci Data ; 7(1): 389, 2020 11 12.
Article in English | MEDLINE | ID: mdl-33184295

ABSTRACT

Data-Independent Acquisition (DIA) is a method to improve consistent identification and precise quantitation of peptides and proteins by mass spectrometry (MS). The targeted data analysis strategy in DIA relies on spectral assay libraries that are generally derived from a priori measurements of peptides for each species. Although Escherichia coli (E. coli) is among the best studied model organisms, so far there is no spectral assay library for the bacterium publicly available. Here, we generated a spectral assay library for 4,014 of the 4,389 annotated E. coli proteins using one- and two-dimensional fractionated samples, and ion mobility separation enabling deep proteome coverage. We demonstrate the utility of this high-quality library with robustness in quantitation of the E. coli proteome and with rapid-chromatography to enhance throughput by targeted DIA-MS. The spectral assay library supports the detection and quantification of 91.5% of all E. coli proteins at high-confidence with 56,182 proteotypic peptides, making it a valuable resource for the scientific community. Data and spectral libraries are available via ProteomeXchange (PXD020761, PXD020785) and SWATHAtlas (SAL00222-28).


Subject(s)
Escherichia coli Proteins/analysis , Escherichia coli/metabolism , Mass Spectrometry , Proteome/analysis , Peptide Library , Peptides/analysis
3.
Nat Commun ; 11(1): 5251, 2020 10 16.
Article in English | MEDLINE | ID: mdl-33067471

ABSTRACT

Data-independent acquisition (DIA) mass spectrometry, also known as Sequential Window Acquisition of all Theoretical Mass Spectra (SWATH), is a popular label-free proteomics strategy to comprehensively quantify peptides/proteins utilizing mass spectral libraries to decipher inherently multiplexed spectra collected linearly across a mass range. Although there are many spectral libraries produced worldwide, the quality control of these libraries is lacking. We present the DIALib-QC (DIA library quality control) software tool for the systematic evaluation of a library's characteristics, completeness and correctness across 62 parameters of compliance, and further provide the option to improve its quality. We demonstrate its utility in assessing and repairing spectral libraries for correctness, accuracy and sensitivity.


Subject(s)
Mass Spectrometry/methods , Proteomics/methods , Software , Humans , Mass Spectrometry/standards , Peptide Library , Peptides/chemistry , Peptides/genetics , Proteins/chemistry , Proteins/genetics , Proteomics/standards
4.
J Proteome Res ; 18(12): 4262-4272, 2019 12 06.
Article in English | MEDLINE | ID: mdl-31290668

ABSTRACT

Spectral matching sequence database search engines commonly used on mass spectrometry-based proteomics experiments excel at identifying peptide sequence ions, and in addition, possible sequence ions carrying post-translational modifications (PTMs), but most do not provide confidence metrics for the exact localization of those PTMs when several possible sites are available. Localization is absolutely required for downstream molecular cell biology analysis of PTM function in vitro and in vivo. Therefore, we developed PTMProphet, a free and open-source software tool integrated into the Trans-Proteomic Pipeline, which reanalyzes identified spectra from any search engine for which pepXML output is available to provide localization confidence to enable appropriate further characterization of biologic events. Localization of any type of mass modification (e.g., phosphorylation) is supported. PTMProphet applies Bayesian mixture models to compute probabilities for each site/peptide spectrum match where a PTM has been identified. These probabilities can be combined to compute a global false localization rate at any threshold to guide downstream analysis. We describe the PTMProphet tool, its underlying algorithms, and demonstrate its performance on ground-truth synthetic peptide reference data sets, one previously published small data set, one new larger data set, and also on a previously published phosphoenriched data set where the correct sites of modification are unknown. Data have been deposited to ProteomeXchange with identifier PXD013210.


Subject(s)
Protein Processing, Post-Translational , Proteomics/methods , Software , Algorithms , Bayes Theorem , Databases, Protein , Humans , Phosphopeptides/metabolism , User-Computer Interface
5.
J Proteome Res ; 17(12): 4337-4344, 2018 12 07.
Article in English | MEDLINE | ID: mdl-30230343

ABSTRACT

Bottom-up proteomics relies on the proteolytic or chemical cleavage of proteins into peptides, the identification of those peptides via mass spectrometry, and the mapping of the identified peptides back to the reference proteome to infer which possible proteins are identified. Reliable mapping of peptides to proteins still poses substantial challenges when considering similar proteins, protein families, splice isoforms, sequence variation, and possible residue mass modifications, combined with an imperfect and incomplete understanding of the proteome. The ProteoMapper tool enables a comprehensive and rapid mapping of peptides to a reference proteome. The indexer component creates a segmented index for an input proteome from a FASTA or PEFF file. The ProMaST component provides ultrafast mapping of one or more input peptides against the index. ProteoMapper allows searches that take into account known sequence variation encoded in PEFF files. It also enables fuzzy searches to find highly similar peptides with residue order changes or other isobaric or near-isobaric substitutions within a specified mass tolerance. We demonstrate an example of a one-hit-wonder identification in PeptideAtlas that may be better explained by a combination of catalogued and uncatalogued sequence variation in another highly observed protein. ProteoMapper is a free and open source, available for local use after downloading, embedding in other applications, as an online web tool at http://www.peptideatlas.org/map , and as a web service.


Subject(s)
Peptide Mapping/methods , Proteome , Software , Amino Acid Sequence , Animals , Genetic Variation , Humans , Mass Spectrometry , Proteins
6.
Sci Data ; 5: 180157, 2018 08 07.
Article in English | MEDLINE | ID: mdl-30084848

ABSTRACT

The large array of peptides presented to CD8+ T cells by major histocompatibility complex (MHC) class I molecules is referred to as the MHC class I immunopeptidome. Although the MHC class I immunopeptidome is ubiquitous in mammals and represents a critical component of the immune system, very little is known, in any species, about its composition across most tissues and organs in vivo. We applied mass spectrometry (MS) technologies to draft the first tissue-based atlas of the murine MHC class I immunopeptidome in health. Peptides were extracted from 19 normal tissues from C57BL/6 mice and prepared for MS injections, resulting in a total number of 28,448 high-confidence H2Db/Kb-associated peptides identified and annotated in the atlas. This atlas provides initial qualitative data to explore the tissue-specificity of the immunopeptidome and serves as a guide to identify potential tumor-associated antigens from various cancer models. Our data were shared via PRIDE (PXD008733), SysteMHC Atlas (SYSMHC00018) and SWATH Atlas. We anticipate that this unique dataset will be expanded in the future and will find wide applications in basic and translational immunology.


Subject(s)
Histocompatibility Antigens Class I , Organ Specificity , Animals , Histocompatibility Antigens Class I/analysis , Histocompatibility Antigens Class I/immunology , Mass Spectrometry , Mice , Mice, Inbred C57BL , Peptides
7.
Nucleic Acids Res ; 46(D1): D1237-D1247, 2018 01 04.
Article in English | MEDLINE | ID: mdl-28985418

ABSTRACT

Mass spectrometry (MS)-based immunopeptidomics investigates the repertoire of peptides presented at the cell surface by major histocompatibility complex (MHC) molecules. The broad clinical relevance of MHC-associated peptides, e.g. in precision medicine, provides a strong rationale for the large-scale generation of immunopeptidomic datasets and recent developments in MS-based peptide analysis technologies now support the generation of the required data. Importantly, the availability of diverse immunopeptidomic datasets has resulted in an increasing need to standardize, store and exchange this type of data to enable better collaborations among researchers, to advance the field more efficiently and to establish quality measures required for the meaningful comparison of datasets. Here we present the SysteMHC Atlas (https://systemhcatlas.org), a public database that aims at collecting, organizing, sharing, visualizing and exploring immunopeptidomic data generated by MS. The Atlas includes raw mass spectrometer output files collected from several laboratories around the globe, a catalog of context-specific datasets of MHC class I and class II peptides, standardized MHC allele-specific peptide spectral libraries consisting of consensus spectra calculated from repeat measurements of the same peptide sequence, and links to other proteomics and immunology databases. The SysteMHC Atlas project was created and will be further expanded using a uniform and open computational pipeline that controls the quality of peptide identifications and peptide annotations. Thus, the SysteMHC Atlas disseminates quality controlled immunopeptidomic information to the public domain and serves as a community resource toward the generation of a high-quality comprehensive map of the human immunopeptidome and the support of consistent measurement of immunopeptidomic sample cohorts.


Subject(s)
Databases, Factual , HLA Antigens , Histocompatibility Antigens , Mass Spectrometry , Alleles , HLA Antigens/chemistry , HLA Antigens/immunology , Histocompatibility Antigens/chemistry , Histocompatibility Antigens/immunology , Humans , Internet , Tandem Mass Spectrometry , User-Computer Interface
8.
J Proteome Res ; 16(12): 4299-4310, 2017 12 01.
Article in English | MEDLINE | ID: mdl-28938075

ABSTRACT

Human blood plasma provides a highly accessible window to the proteome of any individual in health and disease. Since its inception in 2002, the Human Proteome Organization's Human Plasma Proteome Project (HPPP) has been promoting advances in the study and understanding of the full protein complement of human plasma and on determining the abundance and modifications of its components. In 2017, we review the history of the HPPP and the advances of human plasma proteomics in general, including several recent achievements. We then present the latest 2017-04 build of Human Plasma PeptideAtlas, which yields ∼43 million peptide-spectrum matches and 122,730 distinct peptide sequences from 178 individual experiments at a 1% protein-level FDR globally across all experiments. Applying the latest Human Proteome Project Data Interpretation Guidelines, we catalog 3509 proteins that have at least two non-nested uniquely mapping peptides of nine amino acids or more and >1300 additional proteins with ambiguous evidence. We apply the same two-peptide guideline to historical PeptideAtlas builds going back to 2006 and examine the progress made in the past ten years in plasma proteome coverage. We also compare the distribution of proteins in historical PeptideAtlas builds in various RNA abundance and cellular localization categories. We then discuss advances in plasma proteomics based on targeted mass spectrometry as well as affinity assays, which during early 2017 target ∼2000 proteins. Finally, we describe considerations about sample handling and study design, concluding with an outlook for future advances in deciphering the human plasma proteome.


Subject(s)
Plasma/chemistry , Proteome/analysis , Blood Proteins/analysis , Blood Proteins/history , Databases, Protein/history , History, 21st Century , Humans , Mass Spectrometry , Proteome/history , Proteomics/methods , Proteomics/trends
10.
DNA Res ; 24(2): 143-157, 2017 Apr 01.
Article in English | MEDLINE | ID: mdl-28065881

ABSTRACT

Differential next-generation-omics approaches aid in the visualization of biological processes and pave the way for divulging important events and/or interactions leading to a functional output at cellular or systems level. To this end, we undertook an integrated Nextgen transcriptomics and proteomics approach to divulge differential gene expression of infant and pubertal rat Sertoli cells (Sc).Unlike, pubertal Sc, infant Sc are immature and fail to support spermatogenesis. We found exclusive association of 14 and 19 transcription factor binding sites to infantile and pubertal states of Sc, respectively, using differential transcriptomics-guided genome-wide computational analysis of relevant promoters employing 220 Positional Weight Matrices from the TRANSFAC database. Proteomic SWATH-MS analysis provided extensive quantification of nuclear and cytoplasmic protein fractions revealing 1,670 proteins differentially located between the nucleus and cytoplasm of infant Sc and 890 proteins differentially located within those of pubertal Sc. Based on our multi-omics approach, the transcription factor YY1 was identified as one of the lead candidates regulating differentiation of Sc.YY1 was found to have abundant binding sites on promoters of genes upregulated during puberty. To determine its significance, we generated transgenic rats with Sc specific knockdown of YY1 that led to compromised spermatogenesis.


Subject(s)
Cell Differentiation , Gene Expression Regulation, Developmental , Promoter Regions, Genetic , Sertoli Cells/physiology , Testis/physiology , YY1 Transcription Factor/metabolism , Animals , Gene Expression Profiling , Male , Proteomics , Rats , Rats, Wistar , Sertoli Cells/metabolism , Spermatogenesis , Testis/metabolism , YY1 Transcription Factor/physiology
11.
Nucleic Acids Res ; 45(D1): D1100-D1106, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27924013

ABSTRACT

The ProteomeXchange (PX) Consortium of proteomics resources (http://www.proteomexchange.org) was formally started in 2011 to standardize data submission and dissemination of mass spectrometry proteomics data worldwide. We give an overview of the current consortium activities and describe the advances of the past few years. Augmenting the PX founding members (PRIDE and PeptideAtlas, including the PASSEL resource), two new members have joined the consortium: MassIVE and jPOST. ProteomeCentral remains as the common data access portal, providing the ability to search for data sets in all participating PX resources, now with enhanced data visualization components.We describe the updated submission guidelines, now expanded to include four members instead of two. As demonstrated by data submission statistics, PX is supporting a change in culture of the proteomics field: public data sharing is now an accepted standard, supported by requirements for journal submissions resulting in public data release becoming the norm. More than 4500 data sets have been submitted to the various PX resources since 2012. Human is the most represented species with approximately half of the data sets, followed by some of the main model organisms and a growing list of more than 900 diverse species. Data reprocessing activities are becoming more prominent, with both MassIVE and PeptideAtlas releasing the results of reprocessed data sets. Finally, we outline the upcoming advances for ProteomeXchange.


Subject(s)
Databases, Protein , Proteome , Proteomics , Search Engine , Computational Biology/methods , Humans , Mass Spectrometry , Proteomics/methods , Software , Web Browser , Workflow
12.
J Proteome Res ; 15(11): 4091-4100, 2016 11 04.
Article in English | MEDLINE | ID: mdl-27577934

ABSTRACT

The results of analysis of shotgun proteomics mass spectrometry data can be greatly affected by the selection of the reference protein sequence database against which the spectra are matched. For many species there are multiple sources from which somewhat different sequence sets can be obtained. This can lead to confusion about which database is best in which circumstances-a problem especially acute in human sample analysis. All sequence databases are genome-based, with sequences for the predicted gene and their protein translation products compiled. Our goal is to create a set of primary sequence databases that comprise the union of sequences from many of the different available sources and make the result easily available to the community. We have compiled a set of four sequence databases of varying sizes, from a small database consisting of only the ∼20,000 primary isoforms plus contaminants to a very large database that includes almost all nonredundant protein sequences from several sources. This set of tiered, increasingly complete human protein sequence databases suitable for mass spectrometry proteomics sequence database searching is called the Tiered Human Integrated Search Proteome set. In order to evaluate the utility of these databases, we have analyzed two different data sets, one from the HeLa cell line and the other from normal human liver tissue, with each of the four tiers of database complexity. The result is that approximately 0.8%, 1.1%, and 1.5% additional peptides can be identified for Tiers 2, 3, and 4, respectively, as compared with the Tier 1 database, at substantially increasing computational cost. This increase in computational cost may be worth bearing if the identification of sequence variants or the discovery of sequences that are not present in the reviewed knowledge base entries is an important goal of the study. We find that it is useful to search a data set against a simpler database, and then check the uniqueness of the discovered peptides against a more complex database. We have set up an automated system that downloads all the source databases on the first of each month and automatically generates a new set of search databases and makes them available for download at http://www.peptideatlas.org/thisp/ .


Subject(s)
Databases, Protein/trends , Proteomics/methods , Computational Biology/methods , HeLa Cells , Humans , Liver/chemistry , Liver/cytology , Mass Spectrometry , Protein Isoforms/analysis , Proteins/analysis
13.
Cell ; 166(3): 766-778, 2016 Jul 28.
Article in English | MEDLINE | ID: mdl-27453469

ABSTRACT

The ability to reliably and reproducibly measure any protein of the human proteome in any tissue or cell type would be transformative for understanding systems-level properties as well as specific pathways in physiology and disease. Here, we describe the generation and verification of a compendium of highly specific assays that enable quantification of 99.7% of the 20,277 annotated human proteins by the widely accessible, sensitive, and robust targeted mass spectrometric method selected reaction monitoring, SRM. This human SRMAtlas provides definitive coordinates that conclusively identify the respective peptide in biological samples. We report data on 166,174 proteotypic peptides providing multiple, independent assays to quantify any human protein and numerous spliced variants, non-synonymous mutations, and post-translational modifications. The data are freely accessible as a resource at http://www.srmatlas.org/, and we demonstrate its utility by examining the network response to inhibition of cholesterol synthesis in liver cells and to docetaxel in prostate cancer lines.


Subject(s)
Databases, Protein , Proteome , Access to Information , Antineoplastic Agents/therapeutic use , Cell Line, Tumor , Cholesterol/biosynthesis , Docetaxel , Female , Humans , Internet , Liver/drug effects , Male , Mutation , Prostatic Neoplasms/drug therapy , RNA Splicing , Taxoids/therapeutic use
14.
Elife ; 42015 Jul 08.
Article in English | MEDLINE | ID: mdl-26154972

ABSTRACT

We present a novel mass spectrometry-based high-throughput workflow and an open-source computational and data resource to reproducibly identify and quantify HLA-associated peptides. Collectively, the resources support the generation of HLA allele-specific peptide assay libraries consisting of consensus fragment ion spectra, and the analysis of quantitative digital maps of HLA peptidomes generated from a range of biological sources by SWATH mass spectrometry (MS). This study represents the first community-based effort to develop a robust platform for the reproducible and quantitative measurement of the entire repertoire of peptides presented by HLA molecules, an essential step towards the design of efficient immunotherapies.


Subject(s)
Antigens/chemistry , Antigens/immunology , Computational Biology/methods , Databases, Factual , Peptides/chemistry , Peptides/immunology , Antigen Presentation , HLA Antigens/metabolism , High-Throughput Screening Assays/methods , Mass Spectrometry/methods
15.
Mol Cell Proteomics ; 13(10): 2618-31, 2014 Oct.
Article in English | MEDLINE | ID: mdl-24997998

ABSTRACT

Glioblastoma multiforme is a highly invasive and aggressive brain tumor with an invariably poor prognosis. The overexpression of epidermal growth factor receptor (EGFR) is a primary influencer of invasion and proliferation in tumor cells and the constitutively active EGFRvIII mutant, found in 30-65% of Glioblastoma multiforme, confers more aggressive invasion. To better understand how EGFR contributes to tumor aggressiveness, we investigated the effect of EGFR on the secreted levels of 65 rationally selected proteins involved in invasion. We employed selected reaction monitoring targeted mass spectrometry using stable isotope labeled internal peptide standards to quantity proteins in the secretome from five GBM (U87) isogenic cell lines in which EGFR, EGFRvIII, and/or PTEN were expressed. Our results show that cell lines with EGFR overexpression and constitutive EGFRvIII expression differ remarkably in the expression profiles for both secreted and intracellular signaling proteins, and alterations in EGFR signaling result in reproducible changes in concentrations of secreted proteins. Furthermore, the EGFRvIII-expressing mutant cell line secretes the majority of the selected invasion-promoting proteins at higher levels than other cell lines tested. Additionally, the intracellular and extracellular protein measurements indicate elevated oxidative stress in the EGFRvIII-expressing cell line. In conclusion, the results of our study demonstrate that EGFR signaling has a significant effect on the levels of secreted invasion-promoting proteins, likely contributing to the aggressiveness of Glioblastoma multiforme. Further characterization of these proteins may provide candidates for new therapeutic strategies and targets as well as biomarkers for this aggressive disease.


Subject(s)
ErbB Receptors/metabolism , Glioblastoma/metabolism , Glioblastoma/pathology , Intracellular Signaling Peptides and Proteins/metabolism , Proteomics/methods , Cell Line, Tumor , Gene Expression Regulation, Neoplastic , Humans , Neoplasm Invasiveness , PTEN Phosphohydrolase/metabolism , Signal Transduction
16.
Curr Protoc Bioinformatics ; 46: 13.25.1-13.25.28, 2014 Jun 17.
Article in English | MEDLINE | ID: mdl-24939129

ABSTRACT

PeptideAtlas, SRMAtlas, and PASSEL are Web-accessible resources to support discovery and targeted proteomics research. PeptideAtlas is a multi-species compendium of shotgun proteomic data provided by the scientific community; SRMAtlas is a resource of high-quality, complete proteome SRM assays generated in a consistent manner for the targeted identification and quantification of proteins; and PASSEL is a repository that compiles and represents selected reaction monitoring data, all in an easy-to-use interface. The databases are generated from native mass spectrometry data files that are analyzed in a standardized manner including statistical validation of the results. Each resource offers search functionalities and can be queried by user-defined constraints; the query results are provided in tables or are graphically displayed. PeptideAtlas, SRMAtlas, and PASSEL are publicly available freely via the Web site http://www.peptideatlas.org. In this protocol, we describe the use of these resources, we highlight how to submit, search, collate and download data.


Subject(s)
Atlases as Topic , Peptides/chemistry , Proteomics , Amino Acid Sequence , Information Storage and Retrieval , Internet , Molecular Sequence Data , Sequence Homology, Amino Acid , Tandem Mass Spectrometry
17.
Sci Data ; 1: 140031, 2014.
Article in English | MEDLINE | ID: mdl-25977788

ABSTRACT

Mass spectrometry is the method of choice for deep and reliable exploration of the (human) proteome. Targeted mass spectrometry reliably detects and quantifies pre-determined sets of proteins in a complex biological matrix and is used in studies that rely on the quantitatively accurate and reproducible measurement of proteins across multiple samples. It requires the one-time, a priori generation of a specific measurement assay for each targeted protein. SWATH-MS is a mass spectrometric method that combines data-independent acquisition (DIA) and targeted data analysis and vastly extends the throughput of proteins that can be targeted in a sample compared to selected reaction monitoring (SRM). Here we present a compendium of highly specific assays covering more than 10,000 human proteins and enabling their targeted analysis in SWATH-MS datasets acquired from research or clinical specimens. This resource supports the confident detection and quantification of 50.9% of all human proteins annotated by UniProtKB/Swiss-Prot and is therefore expected to find wide application in basic and clinical research. Data are available via ProteomeXchange (PXD000953-954) and SWATHAtlas (SAL00016-35).


Subject(s)
Databases, Protein , Mass Spectrometry/methods , Proteins/chemistry , Proteome , Humans , Proteome/chemistry , Proteomics/methods
18.
Cell Host Microbe ; 13(5): 602-612, 2013 May 15.
Article in English | MEDLINE | ID: mdl-23684311

ABSTRACT

Research advancing our understanding of Mycobacterium tuberculosis (Mtb) biology and complex host-Mtb interactions requires consistent and precise quantitative measurements of Mtb proteins. We describe the generation and validation of a compendium of assays to quantify 97% of the 4,012 annotated Mtb proteins by the targeted mass spectrometric method selected reaction monitoring (SRM). Furthermore, we estimate the absolute abundance for 55% of all Mtb proteins, revealing a dynamic range within the Mtb proteome of over four orders of magnitude, and identify previously unannotated proteins. As an example of the assay library utility, we monitored the entire Mtb dormancy survival regulon (DosR), which is linked to anaerobic survival and Mtb persistence, and show its dynamic protein-level regulation during hypoxia. In conclusion, we present a publicly available research resource that supports the sensitive, precise, and reproducible quantification of virtually any Mtb protein by a robust and widely accessible mass spectrometric method.


Subject(s)
Bacterial Proteins/analysis , Mycobacterium tuberculosis/chemistry , Proteome/analysis , Proteomics/methods
19.
Nature ; 494(7436): 266-70, 2013 Feb 14.
Article in English | MEDLINE | ID: mdl-23334424

ABSTRACT

Experience from different fields of life sciences suggests that accessible, complete reference maps of the components of the system under study are highly beneficial research tools. Examples of such maps include libraries of the spectroscopic properties of molecules, or databases of drug structures in analytical or forensic chemistry. Such maps, and methods to navigate them, constitute reliable assays to probe any sample for the presence and amount of molecules contained in the map. So far, attempts to generate such maps for any proteome have failed to reach complete proteome coverage. Here we use a strategy based on high-throughput peptide synthesis and mass spectrometry to generate an almost complete reference map (97% of the genome-predicted proteins) of the Saccharomyces cerevisiae proteome. We generated two versions of this mass-spectrometric map, one supporting discovery-driven (shotgun) and the other supporting hypothesis-driven (targeted) proteomic measurements. Together, the two versions of the map constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. To show the utility of the maps, we applied them to a protein quantitative trait locus (QTL) analysis, which requires precise measurement of the same set of peptides over a large number of samples. Protein measurements over 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, influencing the levels of related proteins. Our results suggest that selective pressure favours the acquisition of sets of polymorphisms that adapt protein levels but also maintain the stoichiometry of functionally related pathway members.


Subject(s)
Mass Spectrometry , Proteome/analysis , Proteomics/methods , Quantitative Trait Loci/genetics , Saccharomyces cerevisiae Proteins/analysis , Saccharomyces cerevisiae/chemistry , Saccharomyces cerevisiae/genetics , Peptide Library , Polymorphism, Genetic , Proteome/genetics , Reference Values , Saccharomyces cerevisiae Proteins/genetics , Selection, Genetic
20.
Proteomics ; 12(8): 1170-5, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22318887

ABSTRACT

Public repositories for proteomics data have accelerated proteomics research by enabling more efficient cross-analyses of datasets, supporting the creation of protein and peptide compendia of experimental results, supporting the development and testing of new software tools, and facilitating the manuscript review process. The repositories available to date have been designed to accommodate either shotgun experiments or generic proteomic data files. Here, we describe a new kind of proteomic data repository for the collection and representation of data from selected reaction monitoring (SRM) measurements. The PeptideAtlas SRM Experiment Library (PASSEL) allows researchers to easily submit proteomic data sets generated by SRM. The raw data are automatically processed in a uniform manner and the results are stored in a database, where they may be downloaded or browsed via a web interface that includes a chromatogram viewer. PASSELenables cross-analysis of SRMdata, supports optimization of SRMdata collection, and facilitates the review process of SRMdata. Further, PASSELwill help in the assessment of proteotypic peptide performance in a wide array of samples containing the same peptide, as well as across multiple experimental protocols.


Subject(s)
Chromatography, Liquid/methods , Databases, Protein/standards , Peptides/analysis , Proteomics/methods , Software , Tandem Mass Spectrometry/methods , Algorithms , Electronic Data Processing , Humans , Internet , Peptide Library , Proteomics/standards , Tandem Mass Spectrometry/standards
SELECTION OF CITATIONS
SEARCH DETAIL
...