Search | VHL Regional Portal

Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics.

Blank-Landeshammer, Bernhard; Kollipara, Laxmikanth; Biß, Karsten; Pfenninger, Markus; Malchow, Sebastian; Shuvaev, Konstantin; Zahedi, René P; Sickmann, Albert.

J Proteome Res ; 16(9): 3209-3218, 2017 09 01.

Article in English | MEDLINE | ID: mdl-28741358

ABSTRACT

Complex mass spectrometry based proteomics data sets are mostly analyzed by protein database searches. While this approach performs considerably well for sequenced organisms, direct inference of peptide sequences from tandem mass spectra, i.e., de novo peptide sequencing, oftentimes is the only way to obtain information when protein databases are absent. However, available algorithms suffer from drawbacks such as lack of validation and often high rates of false positive hits (FP). Here we present a simple method of combining results from commonly available de novo peptide sequencing algorithms, which in conjunction with minor tweaks in data acquisition ensues lower empirical FDR compared to the analysis using single algorithms. Results were validated using state-of-the art database search algorithms as well specifically synthesized reference peptides. Thus, we could increase the number of PSMs meeting a stringent FDR of 5% more than 3-fold compared to the single best de novo sequencing algorithm alone, accounting for an average of 11â¯120 PSMs (combined) instead of 3476 PSMs (alone) in triplicate 2 h LC-MS runs of tryptic HeLa digestion.

Subject(s)

Algorithms , Peptides/analysis , Proteomics/methods , Sequence Analysis, Protein/methods , Amino Acid Sequence , Animals , Cell Line , Chromatography, Liquid , Databases, Protein , HeLa Cells , Humans , Mice , Myoblasts/chemistry , Myoblasts/metabolism , Proteolysis , Proteomics/instrumentation , Saccharomyces cerevisiae/chemistry , Saccharomyces cerevisiae/metabolism , Snails/chemistry , Snails/metabolism , Tandem Mass Spectrometry , Trypsin/chemistry

DISMS2: A flexible algorithm for direct proteome- wide distance calculation of LC-MS/MS runs.

Rieder, Vera; Blank-Landeshammer, Bernhard; Stuhr, Marleen; Schell, Tilman; Biß, Karsten; Kollipara, Laxmikanth; Meyer, Achim; Pfenninger, Markus; Westphal, Hildegard; Sickmann, Albert; Rahnenführer, Jörg.

BMC Bioinformatics ; 18(1): 148, 2017 Mar 03.

Article in English | MEDLINE | ID: mdl-28253837

ABSTRACT

BACKGROUND: The classification of samples on a molecular level has manifold applications, from patient classification regarding cancer treatment to phylogenetics for identifying evolutionary relationships between species. Modern methods employ the alignment of DNA or amino acid sequences, mostly not genome-wide but only on selected parts of the genome. Recently proteomics-based approaches have become popular. An established method for the identification of peptides and proteins is liquid chromatography-tandem mass spectrometry (LC-MS/MS). First, protein sequences from MS/MS spectra are identified by means of database searches, given samples with known genome-wide sequence information, then sequence based methods are applied. Alternatively, de novo peptide sequencing algorithms annotate MS/MS spectra and deduce peptide/protein information without a database. A newer approach independent of additional information is to directly compare unidentified tandem mass spectra. The challenge then is to compute the distance between pairwise MS/MS runs consisting of thousands of spectra. METHODS: We present DISMS2, a new algorithm to calculate proteome-wide distances directly from MS/MS data, extending the algorithm compareMS2, an approach that also uses a spectral comparison pipeline. RESULTS: Our new more flexible algorithm, DISMS2, allows for the choice of the spectrum distance measure and includes different spectra preprocessing and filtering steps that can be tailored to specific situations by parameter optimization. CONCLUSIONS: DISMS2 performs well for samples from species with and without database annotation and thus has clear advantages over methods that are purely based on database search.

Subject(s)

Algorithms , Chromatography, Liquid/methods , Peptides/analysis , Proteome/chemistry , Proteomics/methods , Tandem Mass Spectrometry/methods , Amino Acid Sequence , Databases, Protein , Humans

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL