Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
Mol Cell Proteomics ; 12(6): 1741-51, 2013 Jun.
Article in English | MEDLINE | ID: mdl-23462206

ABSTRACT

We report a high quality and system-wide proteome catalogue covering 71% (3,542 proteins) of the predicted genes of fission yeast, Schizosaccharomyces pombe, presenting the largest protein dataset to date for this important model organism. We obtained this high proteome and peptide (11.4 peptides/protein) coverage by a combination of extensive sample fractionation, high resolution Orbitrap mass spectrometry, and combined database searching using the iProphet software as part of the Trans-Proteomics Pipeline. All raw and processed data are made accessible in the S. pombe PeptideAtlas. The identified proteins showed no biases in functional properties and allowed global estimation of protein abundances. The high coverage of the PeptideAtlas allowed correlation with transcriptomic data in a system-wide manner indicating that post-transcriptional processes control the levels of at least half of all identified proteins. Interestingly, the correlation was not equally tight for all functional categories ranging from r(s) >0.80 for proteins involved in translation to r(s) <0.45 for signal transduction proteins. Moreover, many proteins involved in DNA damage repair could not be detected in the PeptideAtlas despite their high mRNA levels, strengthening the translation-on-demand hypothesis for members of this protein class. In summary, the extensive and publicly available S. pombe PeptideAtlas together with the generated proteotypic peptide spectral library will be a useful resource for future targeted, in-depth, and quantitative proteomic studies on this microorganism.


Subject(s)
Gene Expression Regulation, Fungal , Peptides/isolation & purification , Protein Processing, Post-Translational , Proteome/metabolism , RNA, Messenger/metabolism , Schizosaccharomyces pombe Proteins/metabolism , Schizosaccharomyces/metabolism , Databases, Protein , Mass Spectrometry , Multigene Family , Peptide Mapping , Proteome/chemistry , Proteome/genetics , RNA, Messenger/genetics , Schizosaccharomyces/chemistry , Schizosaccharomyces/genetics , Schizosaccharomyces pombe Proteins/chemistry , Schizosaccharomyces pombe Proteins/genetics , Signal Transduction
2.
J Proteome Res ; 12(4): 1628-44, 2013 Apr 05.
Article in English | MEDLINE | ID: mdl-23391308

ABSTRACT

We present a computational pipeline for the quantification of peptides and proteins in label-free LC-MS/MS data sets. The pipeline is composed of tools from the OpenMS software framework and is applicable to the processing of large experiments (50+ samples). We describe several enhancements that we have introduced to OpenMS to realize the implementation of this pipeline. They include new algorithms for centroiding of raw data, for feature detection, for the alignment of multiple related measurements, and a new tool for the calculation of peptide and protein abundances. Where possible, we compare the performance of the new algorithms to that of their established counterparts in OpenMS. We validate the pipeline on the basis of two small data sets that provide ground truths for the quantification. There, we also compare our results to those of MaxQuant and Progenesis LC-MS, two popular alternatives for the analysis of label-free data. We then show how our software can be applied to a large heterogeneous data set of 58 LC-MS/MS runs.


Subject(s)
Algorithms , Proteins/analysis , Proteomics/methods , Tandem Mass Spectrometry/methods , Automation , Chromatography, Liquid/methods , High-Throughput Screening Assays/methods , Humans , Leptospira interrogans , Reproducibility of Results , Software , Streptococcus pyogenes
3.
J Biol Chem ; 287(2): 1415-25, 2012 Jan 06.
Article in English | MEDLINE | ID: mdl-22117078

ABSTRACT

Streptococcus pyogenes is a major bacterial pathogen and a potent inducer of inflammation causing plasma leakage at the site of infection. A combination of label-free quantitative mass spectrometry-based proteomics strategies were used to measure how the intracellular proteome homeostasis of S. pyogenes is influenced by the presence of human plasma, identifying and quantifying 842 proteins. In plasma the bacterium modifies its production of 213 proteins, and the most pronounced change was the complete down-regulation of proteins required for fatty acid biosynthesis. Fatty acids are transported by albumin (HSA) in plasma. S. pyogenes expresses HSA-binding surface proteins, and HSA carrying fatty acids reduced the amount of fatty acid biosynthesis proteins to the same extent as plasma. The results clarify the function of HSA-binding proteins in S. pyogenes and underline the power of the quantitative mass spectrometry strategy used here to investigate bacterial adaptation to a given environment.


Subject(s)
Adaptation, Physiological , Bacterial Proteins/metabolism , Plasma , Proteome/metabolism , Streptococcus pyogenes/metabolism , Humans , Mass Spectrometry/methods , Proteomics/methods , Serum Albumin/metabolism
4.
BMC Bioinformatics ; 12: 468, 2011 Dec 08.
Article in English | MEDLINE | ID: mdl-22151573

ABSTRACT

BACKGROUND: Modern data generation techniques used in distributed systems biology research projects often create datasets of enormous size and diversity. We argue that in order to overcome the challenge of managing those large quantitative datasets and maximise the biological information extracted from them, a sound information system is required. Ease of integration with data analysis pipelines and other computational tools is a key requirement for it. RESULTS: We have developed openBIS, an open source software framework for constructing user-friendly, scalable and powerful information systems for data and metadata acquired in biological experiments. openBIS enables users to collect, integrate, share, publish data and to connect to data processing pipelines. This framework can be extended and has been customized for different data types acquired by a range of technologies. CONCLUSIONS: openBIS is currently being used by several SystemsX.ch and EU projects applying mass spectrometric measurements of metabolites and proteins, High Content Screening, or Next Generation Sequencing technologies. The attributes that make it interesting to a large research community involved in systems biology projects include versatility, simplicity in deployment, scalability to very large data, flexibility to handle any biological data type and extensibility to the needs of any research domain.


Subject(s)
Information Management/methods , Systems Biology/methods , Genomics , Mass Spectrometry/methods , Metabolomics , Software , Statistics as Topic
5.
Proteomics ; 9(10): 2648-55, 2009 May.
Article in English | MEDLINE | ID: mdl-19391179

ABSTRACT

The identification and characterization of peptides from MS/MS data represents a critical aspect of proteomics. It has been the subject of extensive research in bioinformatics resulting in the generation of a fair number of identification software tools. Most often, only one program with a specific and unvarying set of parameters is selected for identifying proteins. Hence, a significant proportion of the experimental spectra do not match the peptide sequences in the screened database due to inappropriate parameters or scoring schemes. The Swiss protein identification toolbox (swissPIT) project provides the scientific community with an expandable multitool platform for automated in-depth analysis of MS data also able to handle data from high-throughput experiments. With swissPIT many problems have been solved: The missing standards for input and output formats (A), creation of analysis workflows (B), unified result visualization (C), and simplicity of the user interface (D). Currently, swissPIT supports four different programs implementing two different search strategies to identify MS/MS spectra. Conceived to handle the calculation-intensive needs of each of the programs, swissPIT uses the distributed resources of a Swiss-wide computer Grid (http://www.swing-grid.ch).


Subject(s)
Proteins/analysis , Proteomics/methods , Software , Tandem Mass Spectrometry , Computer Communication Networks , Protein Processing, Post-Translational , Sequence Analysis, Protein
6.
Bioinformatics ; 24(11): 1416-7, 2008 Jun 01.
Article in English | MEDLINE | ID: mdl-18436540

ABSTRACT

The identification and characterization of peptides from tandem mass spectrometry (MS/MS) data represents a critical aspect of proteomics. Today, tandem MS analysis is often performed by only using a single identification program achieving identification rates between 10-50% (Elias and Gygi, 2007). Beside the development of new analysis tools, recent publications describe also the pipelining of different search programs to increase the identification rate (Hartler et al., 2007; Keller et al., 2005). The Swiss Protein Identification Toolbox (swissPIT) follows this approach, but goes a step further by providing the user an expandable multi-tool platform capable of executing workflows to analyze tandem MS-based data. One of the major problems in proteomics is the absent of standardized workflows to analyze the produced data. This includes the pre-processing part as well as the final identification of peptides and proteins. The main idea of swissPIT is not only the usage of different identification tool in parallel, but also the meaningful concatenation of different identification strategies at the same time. The swissPIT is open source software but we also provide a user-friendly web platform, which demonstrates the capabilities of our software and which is available at http://swisspit.cscs.ch upon request for account.


Subject(s)
Algorithms , Mass Spectrometry/methods , Peptide Mapping/methods , Proteins/chemistry , Sequence Analysis, Protein/methods , Software , Amino Acid Sequence , Molecular Sequence Data
7.
Stud Health Technol Inform ; 126: 13-22, 2007.
Article in English | MEDLINE | ID: mdl-17476043

ABSTRACT

Biomarker detection is one of the greatest challenges in Clinical Proteomics. Today, great hopes are placed into tandem mass spectrometry (MS/MS) to discover potential biomarkers. MS/MS is a technique that allows large scale data analysis, including the identification, characterization, and quantification of molecules. Especially the identification process, that implies to compare experimental spectra with theoretical amino acid sequences stored in specialized databases, has been subject for extensive research in bioinformatics since many years. Dozens of identification programs have been developed addressing different aspects of the identification process but in general, clinicians are only using a single tools for their data analysis along with a single set of specific parameters. Hence, a significant proportion of the experimental spectra do not lead to a confident identification score due to inappropriate parameters or scoring schemes of the applied analysis software. The swissPIT (Swiss Protein Identification Toolbox) project was initiated to provide the scientific community with an expandable multi-tool platform for automated and in-depth analysis of mass spectrometry data. The swissPIT uses multiple identification tools to automatic analyze mass spectra. The tools are concatenated as analysis workflows. In order to realize these calculation-intensive workflows we are using the Swiss Bio Grid infrastructure. A first version of the web-based front-end is available (http://www.swisspit.cscs.ch) and can be freely accessed after requesting an account. The source code of the project will be also made available in near future.


Subject(s)
Medical Informatics , Proteomics/methods , Tandem Mass Spectrometry/methods , Biomarkers/analysis , Databases, Protein , Humans , Sequence Analysis, Protein , Software , Statistics as Topic/methods , Switzerland
8.
Proteomics ; 6(23): 6124-33, 2006 Dec.
Article in English | MEDLINE | ID: mdl-17072907

ABSTRACT

Peak detection is a key step in the analysis of SELDI-TOF-MS spectra, but the current default method has low specificity and poor peak annotation. To improve data quality, scientists still have to validate the identified peaks visually, a tedious and time-consuming process, especially for large data sets. Hence, there is a genuine need for methods that minimize manual validation. We have previously reported a multi-spectral signal detection method, called RS for 'region of significance', with improved specificity. Here we extend it to include a peak quantification algorithm based on annotated regions of significance (ARS). For each spectral region flagged as significant by RS, we first identify a dominant spectrum for determining the number of peaks and the m/z region of these peaks. From each m/z region of peaks, a peak template is extracted from all spectra via the principal component analysis. Finally, with the template, we estimate the amplitude and location of the peak in each spectrum with the least-squares method and refine the estimation of the amplitude via the mixture model. We have evaluated the ARS algorithm on patient samples from a clinical study. Comparison with the standard method shows that ARS (i) inherits the superior specificity of RS, and (ii) gives more accurate peak annotations than the standard method. In conclusion, we find that ARS alleviates the main problems in the preprocessing of SELDI-TOF spectra. The R-package ProSpect that implements ARS is freely available for academic use at http://www.meb.ki.se/ yudpaw.


Subject(s)
Biomarkers/chemistry , Proteins/chemistry , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Adenocarcinoma/blood , Algorithms , Carcinoma, Squamous Cell/blood , Humans , Lung Neoplasms/blood , Neoplasm Proteins/chemistry , Sensitivity and Specificity
9.
Bioinformatics ; 22(12): 1515-23, 2006 Jun 15.
Article in English | MEDLINE | ID: mdl-16567365

ABSTRACT

MOTIVATION: There is a well-recognized potential of protein expression profiling using the surface-enhanced laser desorption and ionization technology for discovering biomarkers that can be applied in clinical diagnosis, prognosis and therapy prediction. The pre-processing of the raw data, however, is still problematic. METHODS: We focus on the peak detection step, where the standard method is marked by poor specificity. Currently, scientists need to inspect individual spectra visually and laboriously in order to verify that spectral peaks identified by the standard method are real. Motivated by this multi-spectral process, we investigate an analytical approach-called RS for 'regions of significance'-that reduces the data to a single spectrum of F-statistics capturing significant variability between spectra. To account for multiple testing, we use a false discovery rate criterion for identifying potentially interesting proteins. RESULTS: We show that RS has better operating characteristics than several existing methods and demonstrate routine applications on a number of large datasets.


Subject(s)
Biomarkers, Tumor/genetics , Biomarkers/chemistry , Proteins/chemistry , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Algorithms , Animals , Cell Line, Tumor , Humans , Models, Statistical , Prognosis , Programming Languages , Proteomics , ROC Curve , Reproducibility of Results , Software
SELECTION OF CITATIONS
SEARCH DETAIL
...