Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 51
Filter
1.
Methods Mol Biol ; 2044: 81-110, 2019.
Article in English | MEDLINE | ID: mdl-31432408

ABSTRACT

Proteomics is an indispensable tool for disease biomarker discovery. It is widely used for the analysis of biological fluids such as cerebrospinal fluid (CSF), blood, and saliva, which further aids in our understanding of disease incidence and progression. CSF is often the biospecimen of choice in case of intracranial tumors, as rapid changes in the tumor microenvironment can be easily assessed due to its close proximity to the brain. On the contrary studies comprising of serum or plasma samples do not truly reflect the underlying molecular alterations due to the presence of protective blood-brain barrier. We have described in here the detailed workflows for two advanced proteomics techniques, namely, 2D-DIGE (two-dimensional difference in-gel electrophoresis) and iTRAQ (isobaric tag for relative and absolute quantitation), for CSF analysis. Both of these techniques are very sensitive and widely used for quantitative proteomics analysis.


Subject(s)
Brain Neoplasms/cerebrospinal fluid , Cerebrospinal Fluid Proteins/analysis , Cerebrospinal Fluid Proteins/isolation & purification , Chemical Fractionation/methods , Glioma/cerebrospinal fluid , Proteomics/methods , Brain Neoplasms/chemistry , Cerebrospinal Fluid Proteins/chemistry , Chemical Fractionation/instrumentation , Glioma/chemistry , Humans , Mass Spectrometry , Proteome/chemistry , Proteome/metabolism , Proteome/standards , Proteomics/standards , Software , Staining and Labeling/methods , Tumor Microenvironment/genetics , Two-Dimensional Difference Gel Electrophoresis/methods , Workflow
2.
J Transl Med ; 17(1): 184, 2019 05 31.
Article in English | MEDLINE | ID: mdl-31151397

ABSTRACT

BACKGROUND: SWATH-MS has emerged as the strategy of choice for biomarker discovery due to the proteome coverage achieved in acquisition and provision to re-interrogate the data. However, in quantitative analysis using SWATH, each sample from the comparison group is run individually in mass spectrometer and the resulting inter-run variation may influence relative quantification and identification of biomarkers. Normalization of data to diminish this variation thereby becomes an essential step in SWATH data processing. In most reported studies, data normalization methods used are those provided in instrument-based data analysis software or those used for microarray data. This study, for the first time provides an experimental evidence for selection of normalization method optimal for biomarker identification. METHODS: The efficiency of 12 normalization methods to normalize SWATH-MS data was evaluated based on statistical criteria in 'Normalyzer'-a tool which provides comparative evaluation of normalization by different methods. Further, the suitability of normalized data for biomarker discovery was assessed by evaluating the clustering efficiency of differentiators, identified from the normalized data based on p-value, fold change and both, by hierarchical clustering in Genesis software v.1.8.1. RESULTS: Conventional statistical criteria identified VSN-G as the optimal method for normalization of SWATH data. However, differentiators identified from VSN-G normalized data failed to segregate test and control groups. We thus assessed data normalized by eleven other methods for their ability to yield differentiators which segregate the study groups. Datasets in our study demonstrated that differentiators identified based on p-value from data normalized with Loess-R stratified the study groups optimally. CONCLUSION: This is the first report of experimentally tested strategy for SWATH-MS data processing with an emphasis on identification of clinically relevant biomarkers. Normalization of SWATH-MS data by Loess-R method and identification of differentiators based on p-value were found to be optimal for biomarker discovery in this study. The study also demonstrates the need to base the choice of normalization method on the application of the data.


Subject(s)
Biomarkers/analysis , Mass Spectrometry , Proteome/analysis , Proteomics , Case-Control Studies , Datasets as Topic , Diagnosis, Differential , Escherichia coli , Evaluation Studies as Topic , HeLa Cells , Humans , K562 Cells , Mass Spectrometry/methods , Mass Spectrometry/standards , Peptide Fragments/analysis , Peptide Fragments/chemistry , Proteome/standards , Proteomics/methods , Proteomics/standards , Reference Standards , Reference Values , Software , Staining and Labeling , Yeasts
3.
Anal Chem ; 90(21): 13112-13117, 2018 11 06.
Article in English | MEDLINE | ID: mdl-30350613

ABSTRACT

Mass spectrometry (MS) measurements are not inherently calibrated. Researchers use various calibration methods to assign meaning to arbitrary signal intensities and improve precision. Internal calibration (IC) methods use internal standards (IS) such as synthesized or recombinant proteins or peptides to calibrate MS measurements by comparing endogenous analyte signal to the signal from known IS concentrations spiked into the same sample. However, recent work suggests that using IS as IC introduces quantitative biases that affect comparison across studies because of the inability of IS to capture all sources of variation present throughout an MS workflow. Here, we describe a single-point external calibration strategy to calibrate signal intensity measurements to a common reference material, placing MS measurements on the same scale and harmonizing signal intensities between instruments, acquisition methods, and sites. We demonstrate data harmonization between laboratories and methodologies using this generalizable approach.


Subject(s)
Mass Spectrometry/standards , Proteome/standards , Proteomics/standards , Calibration , Reference Standards , Saccharomyces cerevisiae/chemistry
4.
J Proteome Res ; 17(6): 2205-2215, 2018 06 01.
Article in English | MEDLINE | ID: mdl-29718670

ABSTRACT

Reference materials are vital to benchmarking the reproducibility of clinical tests and essential for monitoring laboratory performance for clinical proteomics. The reference material utilized for mass spectrometric analysis of the human proteome would ideally contain enough proteins to be suitably representative of the human proteome, as well as exhibit a stable protein composition in different batches of sample regeneration. Previously, The Clinical Proteomic Tumor Analysis Consortium (CPTAC) utilized a PDX-derived comparative reference (CompRef) materials for the longitudinal assessment of proteomic performance; however, inherent drawbacks of PDX-derived material, including extended time needed to grow tumors and high level of expertise needed, have resulted in efforts to identify a new source of CompRef material. In this study, we examined the utility of using a panel of seven cancer cell lines, NCI-7 Cell Line Panel, as a reference material for mass spectrometric analysis of human proteome. Our results showed that not only is the NCI-7 material suitable for benchmarking laboratory sample preparation methods, but also NCI-7 sample generation is highly reproducible at both the global and phosphoprotein levels. In addition, the predicted genomic and experimental coverage of the NCI-7 proteome suggests the NCI-7 material may also have applications as a universal standard proteomic reference.


Subject(s)
Proteome/standards , Proteomics/standards , Benchmarking , Cell Line, Tumor , Humans , Mass Spectrometry/methods , Proteomics/methods , Reproducibility of Results
5.
Mass Spectrom Rev ; 37(6): 715-737, 2018 11.
Article in English | MEDLINE | ID: mdl-28758227

ABSTRACT

Mass spectrometry-based approaches have enabled important breakthroughs in quantitative proteomics in the last decades. This development is reflected in the better quantitative assessment of protein levels as well as to understand post-translational modifications and protein complexes and networks. Nowadays, the focus of quantitative proteomics shifted from the relative determination of proteins (ie, differential expression between two or more cellular states) to absolute quantity determination, required for a more-thorough characterization of biological models and comprehension of the proteome dynamism, as well as for the search and validation of novel protein biomarkers. However, the physico-chemical environment of the analyte species affects strongly the ionization efficiency in most mass spectrometry (MS) types, which thereby require the use of specially designed standardization approaches to provide absolute quantifications. Most common of such approaches nowadays include (i) the use of stable isotope-labeled peptide standards, isotopologues to the target proteotypic peptides expected after tryptic digestion of the target protein; (ii) use of stable isotope-labeled protein standards to compensate for sample preparation, sample loss, and proteolysis steps; (iii) isobaric reagents, which after fragmentation in the MS/MS analysis provide a final detectable mass shift, can be used to tag both analyte and standard samples; (iv) label-free approaches in which the absolute quantitative data are not obtained through the use of any kind of labeling, but from computational normalization of the raw data and adequate standards; (v) elemental mass spectrometry-based workflows able to provide directly absolute quantification of peptides/proteins that contain an ICP-detectable element. A critical insight from the Analytical Chemistry perspective of the different standardization approaches and their combinations used so far for absolute quantitative MS-based (molecular and elemental) proteomics is provided in this review.


Subject(s)
Mass Spectrometry/standards , Proteome/analysis , Proteomics/standards , Animals , Humans , Indicators and Reagents/standards , Isotope Labeling/methods , Isotope Labeling/standards , Mass Spectrometry/methods , Peptides/analysis , Peptides/standards , Proteome/standards , Proteomics/methods , Reference Standards , Workflow
6.
J Proteome Res ; 16(12): 4531-4535, 2017 12 01.
Article in English | MEDLINE | ID: mdl-28895742

ABSTRACT

The evidence that any protein exists in the Human Proteome Project (HPP; protein evidence 1 or PE1) has revolved primarily (although not exclusively) around mass spectrometry (MS) (93% of PE1 proteins have MS evidence in the latest neXtProt release), with robust and stringent, well-curated metrics that have served the community well. This has led to a significant number of proteins still considered "missing" (i.e., PE2-4). Many PE2-4 proteins have MS evidence of unacceptable quality (small or not enough unitypic peptides and unacceptably high protein/peptide FDRs), transcriptomic, or antibody evidence. Here we use a Chromosome 7 PE2 example called Prestin to demonstrate that clear and robust criteria/metrics need to be developed for proteins that may not or cannot produce clear-cut MS evidence while possessing significant non-MS evidence, including disease-association data. Many of the PE2-4 proteins are inaccessible, spatiotemporally expressed in a limited way, or expressed at such a very low copy number as to be unable to be detected by current MS methodologies. We propose that the HPP community consider and lead a communal initiative to accelerate the discovery and characterization of these types of "missing" proteins.


Subject(s)
Anion Transport Proteins/analysis , Mass Spectrometry , Humans , Proteome/analysis , Proteome/standards , Sulfate Transporters
7.
Sci Rep ; 7: 45570, 2017 04 03.
Article in English | MEDLINE | ID: mdl-28368040

ABSTRACT

The two most common techniques for absolute protein quantification are based on either mass spectrometry (MS) or on immunochemical techniques, such as western blotting (WB). Western blotting is most often used for protein identification or relative quantification, but can also be deployed for absolute quantification if appropriate calibration standards are used. MS based techniques offer superior data quality and reproducibility, but WB offers greater sensitivity and accessibility to most researchers. It would be advantageous to apply both techniques for orthogonal quantification, but workflows rarely overlap. We describe DOSCATs (DOuble Standard conCATamers), novel calibration standards based on QconCAT technology, to unite these platforms. DOSCATs combine a series of epitope sequences concatenated with tryptic peptides in a single artificial protein to create internal tryptic peptide standards for MS as well as an intact protein bearing multiple linear epitopes. A DOSCAT protein was designed and constructed to quantify five proteins of the NF-κB pathway. For three target proteins, protein fold change and absolute copy per cell values measured by MS and WB were in excellent agreement. This demonstrates that DOSCATs can be used as multiplexed, dual purpose standards, readily deployed in a single workflow, supporting seamless quantitative transition from MS to WB.


Subject(s)
Proteins/analysis , Proteins/standards , Proteome/analysis , Proteome/standards , Proteomics/methods , Humans , Peptide Fragments/analysis , Peptide Fragments/standards , Reference Standards
9.
EBioMedicine ; 18: 300-310, 2017 Apr.
Article in English | MEDLINE | ID: mdl-28396014

ABSTRACT

Urine as a true non-invasive sampling source holds great potential for biomarker discovery. While approximately 2000 proteins can be detected by mass spectrometry in urine from healthy people, the amount of these proteins vary considerably. A systematic evaluation of a large number of samples is needed to determine the range of the variations. Current biomarker studies often measure limited number of urine samples in the discovery phase, which makes it difficult to determine whether proteins differentially expressed between control and disease groups represent actual difference, or are just physiological variations among the individuals, leads to failures in the validation phase with the increased sample numbers. Here, we report a streamlined workflow with capacity of measuring 8 urine proteomes per day at the coverage of >1500 proteins. With this workflow, we evaluated variations in 497 urine proteomes from 167 healthy donors, establishing reference intervals (RIs) that covered urine protein variations. We demonstrated that RIs could be used to monitor physiological changes by detecting transient outlier proteins. Furthermore, we provided a RIs-based algorithm for biomarker discovery and validation to screen for diseases such as cancer. This study provided a proof-of-principle workflow for the use of urine proteome for health monitoring and disease screening.


Subject(s)
Biomarkers/urine , Proteome/analysis , Algorithms , Area Under Curve , Chromatography, High Pressure Liquid/standards , False Negative Reactions , False Positive Reactions , Humans , Mass Spectrometry/standards , Monitoring, Physiologic , Nanotechnology/standards , Neoplasms/diagnosis , Proteome/metabolism , Proteome/standards , ROC Curve , Reference Values
10.
J Proteome Res ; 16(5): 1831-1838, 2017 05 05.
Article in English | MEDLINE | ID: mdl-28418254

ABSTRACT

Multiplexed quantification with isobaric chemical tags (e.g., TMT, iTRAQ) provides a robust and efficient means to comparatively examine proteome dynamics between several biological states using a mass spectrometer (MS). The quantitative nature of isobaric tags necessitates strict validation of the observed ion signals in the chosen MS detector before differential patterns are extracted between biological states. We present an in-depth analysis of isobaric tag data acquired on current generation Orbitrap MS hardware to illustrate pitfalls in acquisition settings that can negatively impact results. We establish, for the first time, the presence of a notch, a region of no observed values, in the reporter ion distributions from isobaric-labeled peptide mixtures acquired on these instruments. We determine that this notch is present in published data across a wide range of instruments of the same or different type and is isolated to the Orbitrap mass analyzer. We demonstrate that the impact of the notch can be minimized using manipulations of Orbitrap scan parameters and on-column injection amounts. Lastly, using a mixture of synthetic standard peptides we investigated the impact on identification rates and quantification precision. Together, these data highlight an important phenomenon that negatively impacts peptide identification and quantification in the Orbitrap analyzer as well as outlining guidelines to follow to ensure minimization of MS-induced artifacts in isobaric tag experiments resulting from the notch.


Subject(s)
Mass Spectrometry/methods , Proteome/analysis , Proteomics/methods , Ions , Mass Spectrometry/instrumentation , Peptides/analysis , Peptides/standards , Proteome/standards , Proteomics/standards , Staining and Labeling
11.
Anal Chem ; 89(8): 4474-4479, 2017 04 18.
Article in English | MEDLINE | ID: mdl-28318237

ABSTRACT

To have confidence in results acquired during biological mass spectrometry experiments, a systematic approach to quality control is of vital importance. Nonetheless, until now, only scattered initiatives have been undertaken to this end, and these individual efforts have often not been complementary. To address this issue, the Human Proteome Organization-Proteomics Standards Initiative has established a new working group on quality control at its meeting in the spring of 2016. The goal of this working group is to provide a unifying framework for quality control data. The initial focus will be on providing a community-driven standardized file format for quality control. For this purpose, the previously proposed qcML format will be adapted to support a variety of use cases for both proteomics and metabolomics applications, and it will be established as an official PSI format. An important consideration is to avoid enforcing restrictive requirements on quality control but instead provide the basic technical necessities required to support extensive quality control for any type of mass spectrometry-based workflow. We want to emphasize that this is an open community effort, and we seek participation from all scientists with an interest in this field.


Subject(s)
Proteome/analysis , Proteomics , Databases, Protein , Humans , Mass Spectrometry/standards , Proteome/standards , Proteomics/standards , Quality Control
12.
J Proteome Res ; 16(2): 619-634, 2017 02 03.
Article in English | MEDLINE | ID: mdl-27977202

ABSTRACT

Normalization is a fundamental step in data processing to account for the sample-to-sample variation observed in biological samples. However, data structure is affected by normalization. In this paper, we show how, and to what extent, the correlation structure is affected by the application of 11 different normalization procedures. We also discuss the consequences for data analysis and interpretation, including principal component analysis, partial least-squares discrimination, and the inference of metabolite-metabolite association networks.


Subject(s)
Metabolome/genetics , Principal Component Analysis , Proteome/standards , Proteomics/statistics & numerical data , Animals , Least-Squares Analysis , Proteome/chemistry , Proteome/genetics , Proteomics/standards , Swine , Urine/chemistry
13.
J Proteome Res ; 16(2): 945-957, 2017 02 03.
Article in English | MEDLINE | ID: mdl-27990823

ABSTRACT

Detection of differentially abundant proteins in label-free quantitative shotgun liquid chromatography-tandem mass spectrometry (LC-MS/MS) experiments requires a series of computational steps that identify and quantify LC-MS features. It also requires statistical analyses that distinguish systematic changes in abundance between conditions from artifacts of biological and technical variation. The 2015 study of the Proteome Informatics Research Group (iPRG) of the Association of Biomolecular Resource Facilities (ABRF) aimed to evaluate the effects of the statistical analysis on the accuracy of the results. The study used LC-tandem mass spectra acquired from a controlled mixture, and made the data available to anonymous volunteer participants. The participants used methods of their choice to detect differentially abundant proteins, estimate the associated fold changes, and characterize the uncertainty of the results. The study found that multiple strategies (including the use of spectral counts versus peak intensities, and various software tools) could lead to accurate results, and that the performance was primarily determined by the analysts' expertise. This manuscript summarizes the outcome of the study, and provides representative examples of good computational and statistical practice. The data set generated as part of this study is publicly available.


Subject(s)
Chromatography, Liquid/standards , Laboratory Proficiency Testing , Proteome/isolation & purification , Proteomics/standards , Tandem Mass Spectrometry/standards , Data Interpretation, Statistical , Humans , Professional Competence , Proteome/standards , Proteomics/instrumentation , Proteomics/methods , Reproducibility of Results , Uncertainty
14.
J Proteome Res ; 15(8): 2634-42, 2016 08 05.
Article in English | MEDLINE | ID: mdl-27376408

ABSTRACT

Membrane proteins are underrepresented in proteome analysis platforms because of their hydrophobic character, contributing to decreased solubility. Sodium dodecyl sulfate is a favored denaturant in proteomic workflows, facilitating cell lysis and protein dissolution; however, SDS impedes MS detection and therefore must be removed prior to analysis. Although strategies exist for SDS removal, they provide low recovery, purity, or reproducibility. Here we present a simple automated device, termed transmembrane electrophoresis (TME), incorporating the principles of membrane filtration, but with an applied electric current to ensure near-complete (99.9%) removal of the surfactant, including protein-bound SDS. Intact proteins are recovered in solution phase in high yield (90-100%) within 1 h of operation. The strategy is applied to protein standards and proteome mixtures, including an enriched membrane fraction from E. coli, resulting in quality MS spectra free of SDS adducts. The TME platform is applicable to both bottom-up MS/MS as well as LC-ESI-MS analysis of intact proteins. SDS-depleted fractions reveal a similar number of protein identifications (285) compared wit a non-SDS control (280), being highly correlated in terms of protein spectral counts. This fully automated approach to SDS removal presents a viable tool for proteome sample processing ahead of MS analysis. Data are available via ProteomeXchange, identifier PXD003941.


Subject(s)
Electrophoresis, Polyacrylamide Gel/methods , Membrane Proteins/analysis , Proteome/analysis , Proteomics/methods , Sodium Dodecyl Sulfate/isolation & purification , Automation , Chromatography, Liquid , Escherichia coli , Escherichia coli Proteins/analysis , Mass Spectrometry/methods , Proteome/standards , Solubility , Tandem Mass Spectrometry
15.
J Proteome Res ; 15(8): 2537-47, 2016 08 05.
Article in English | MEDLINE | ID: mdl-27345528

ABSTRACT

The multiplexing capabilities of isobaric mass tag-based protein quantification, such as Tandem Mass Tags or Isobaric Tag for Relative and Absolute Quantitation have dramatically increased the scope of mass spectrometry-based proteomics studies. Not only does the technology allow for the simultaneous quantification of multiple samples in a single MS injection, but its seamless compatibility with extensive sample prefractionation methods allows for comprehensive studies of complex proteomes. However, reporter ion-based quantification has often been criticized for limited quantification accuracy due to interference from coeluting peptides and peptide fragments. In this study, we investigate the extent of this problem and propose an effective and easy-to-implement remedy that relies on spiking a 6-protein calibration mixture to the samples. We evaluated our ratio adjustment approach using two large scale TMT 10-plex data sets derived from a human cancer and noncancer cell line as well as E. coli cells grown at two different conditions. Furthermore, we analyzed a complex 2-proteome artificial sample mixture and investigated the precision of TMT and precursor ion intensity-based label free quantification. Studying the protein set identified by both methods, we found that differentially abundant proteins were assigned dramatically higher statistical significance when quantified using TMT. Data are available via ProteomeXchange with identifier PXD003346.


Subject(s)
Proteome/analysis , Proteomics/methods , Cell Line , Cell Line, Tumor , Data Interpretation, Statistical , Escherichia coli , Humans , Proteome/standards , Proteomics/standards , Tandem Mass Spectrometry/methods
16.
J Proteomics ; 132: 51-62, 2016 Jan 30.
Article in English | MEDLINE | ID: mdl-26585461

ABSTRACT

Proteomic workflows based on nanoLC-MS/MS data-dependent-acquisition analysis have progressed tremendously in recent years. High-resolution and fast sequencing instruments have enabled the use of label-free quantitative methods, based either on spectral counting or on MS signal analysis, which appear as an attractive way to analyze differential protein expression in complex biological samples. However, the computational processing of the data for label-free quantification still remains a challenge. Here, we used a proteomic standard composed of an equimolar mixture of 48 human proteins (Sigma UPS1) spiked at different concentrations into a background of yeast cell lysate to benchmark several label-free quantitative workflows, involving different software packages developed in recent years. This experimental design allowed to finely assess their performances in terms of sensitivity and false discovery rate, by measuring the number of true and false-positive (respectively UPS1 or yeast background proteins found as differential). The spiked standard dataset has been deposited to the ProteomeXchange repository with the identifier PXD001819 and can be used to benchmark other label-free workflows, adjust software parameter settings, improve algorithms for extraction of the quantitative metrics from raw MS data, or evaluate downstream statistical methods. BIOLOGICAL SIGNIFICANCE: Bioinformatic pipelines for label-free quantitative analysis must be objectively evaluated in their ability to detect variant proteins with good sensitivity and low false discovery rate in large-scale proteomic studies. This can be done through the use of complex spiked samples, for which the "ground truth" of variant proteins is known, allowing a statistical evaluation of the performances of the data processing workflow. We provide here such a controlled standard dataset and used it to evaluate the performances of several label-free bioinformatics tools (including MaxQuant, Skyline, MFPaQ, IRMa-hEIDI and Scaffold) in different workflows, for detection of variant proteins with different absolute expression levels and fold change values. The dataset presented here can be useful for tuning software tool parameters, and also testing new algorithms for label-free quantitative analysis, or for evaluation of downstream statistical methods.


Subject(s)
Benchmarking/standards , Chromatography, Liquid/standards , Mass Spectrometry/standards , Proteome/analysis , Proteome/standards , Workflow , Benchmarking/methods , Reproducibility of Results , Sensitivity and Specificity , Software , Software Validation , Staining and Labeling
17.
Proteomics ; 15(15): 2592-6, 2015 Aug.
Article in English | MEDLINE | ID: mdl-25884107

ABSTRACT

The mzQuantML data standard was designed to capture the output of quantitative software in proteomics, to support submissions to public repositories, development of visualization software and pipeline/modular approaches. The standard is designed around a common core that can be extended to support particular types of technique through the release of semantic rules that are checked by validation software. The first release of mzQuantML supported four quantitative proteomics techniques via four sets of semantic rules: (i) intensity-based (MS(1) ) label free, (ii) MS(1) label-based (such as SILAC or N(15) ), (iii) MS(2) tag-based (iTRAQ or tandem mass tags), and (iv) spectral counting. We present an update to mzQuantML for supporting SRM techniques. The update includes representing the quantitative measurements, and associated meta-data, for SRM transitions, the mechanism for inferring peptide-level or protein-level quantitative values, and support for both label-based or label-free SRM protocols, through the creation of semantic rules and controlled vocabulary terms. We have updated the specification document for mzQuantML (version 1.0.1) and the mzQuantML validator to ensure that consistent files are produced by different exporters. We also report the capabilities for production of mzQuantML files from popular SRM software packages, such as Skyline and Anubis.


Subject(s)
Computational Biology/methods , Mass Spectrometry/methods , Proteome/analysis , Proteomics/methods , Software , Computational Biology/standards , Isotope Labeling/methods , Isotope Labeling/standards , Mass Spectrometry/standards , Proteome/metabolism , Proteome/standards , Proteomics/standards , Reproducibility of Results
18.
Environ Microbiol ; 17(1): 4-9, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25339269

ABSTRACT

We present the Proteome Quality Index (PQI; http://pqi-list.org), a much-needed resource for users of bacterial and eukaryotic proteomes. Completely sequenced genomes for which there is an available set of protein sequences (the proteome) are given a one- to five-star rating supported by 11 different metrics of quality. The database indexes over 3000 proteomes at the time of writing and is provided via a website for browsing, filtering and downloading. Previous to this work, there was no systematic way to account for the large variability in quality of the thousands of proteomes, and this is likely to have profoundly influenced the outcome of many published studies, in particular large-scale comparative analyses. The lack of a measure of proteome quality is likely due to the difficulty in producing one, a problem that we have approached by integrating multiple metrics. The continued development and improvement of the index will require the contribution of additional metrics by us and by others; the PQI provides a useful point of reference for the scientific community, but it is only the first step towards a 'standard' for the field.


Subject(s)
Databases, Protein , Proteome/standards , Genome , Internet
19.
J Proteome Res ; 13(12): 5888-97, 2014 Dec 05.
Article in English | MEDLINE | ID: mdl-25285707

ABSTRACT

The rapidly expanding availability of high-resolution mass spectrometry has substantially enhanced the ion-current-based relative quantification techniques. Despite the increasing interest in ion-current-based methods, quantitative sensitivity, accuracy, and false discovery rate remain the major concerns; consequently, comprehensive evaluation and development in these regards are urgently needed. Here we describe an integrated, new procedure for data normalization and protein ratio estimation, termed ICan, for improved ion-current-based analysis of data generated by high-resolution mass spectrometry (MS). ICan achieved significantly better accuracy and precision, and lower false-positive rate for discovering altered proteins, over current popular pipelines. A spiked-in experiment was used to evaluate the performance of ICan to detect small changes. In this study E. coli extracts were spiked with moderate-abundance proteins from human plasma (MAP, enriched by IgY14-SuperMix procedure) at two different levels to set a small change of 1.5-fold. Forty-five (92%, with an average ratio of 1.71 ± 0.13) of 49 identified MAP protein (i.e., the true positives) and none of the reference proteins (1.0-fold) were determined as significantly altered proteins, with cutoff thresholds of ≥ 1.3-fold change and p ≤ 0.05. This is the first study to evaluate and prove competitive performance of the ion-current-based approach for assigning significance to proteins with small changes. By comparison, other methods showed remarkably inferior performance. ICan can be broadly applicable to reliable and sensitive proteomic survey of multiple biological samples with the use of high-resolution MS. Moreover, many key features evaluated and optimized here such as normalization, protein ratio determination, and statistical analyses are also valuable for data analysis by isotope-labeling methods.


Subject(s)
Proteome/metabolism , Biomarkers/chemistry , Biomarkers/metabolism , Escherichia coli Proteins/chemistry , Humans , Mass Spectrometry/standards , Proteome/chemistry , Proteome/standards , Reference Standards , Sensitivity and Specificity , Serum Albumin, Bovine/chemistry
20.
Proteomics ; 12(18): 2767-72, 2012 Sep.
Article in English | MEDLINE | ID: mdl-22969026

ABSTRACT

The Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) was established in 2002 with the aim of defining community standards for data representation in proteomics and facilitating data comparison, exchange and verification. Over the last 10 years significant advances have been made, with common data standards now published and implemented in the field of both mass spectrometry and molecular interactions. The 2012 meeting further advanced this work, with the mass spectrometry groups finalising approaches to capturing the output from recent developments in the field, such as quantitative proteomics and SRM. The molecular interaction group focused on improving the integration of data from multiple resources. Both groups united with a guest work track, organized by the HUPO Technology/Standards Committee, to formulate proposals for data submissions from the HUPO Human Proteome Project and to start an initiative to collect standard experimental protocols.


Subject(s)
Proteome/standards , Proteomics/education , Proteomics/standards , Guidelines as Topic , History, 21st Century , Humans , Mass Spectrometry/history , Mass Spectrometry/standards , Proteome/history , Proteomics/history , United States
SELECTION OF CITATIONS
SEARCH DETAIL
...