RESUMO
Identification of unknown compounds is of critical importance in GC/MS applications (metabolomics, environmental toxin identification, sports doping, petroleomics, and biofuel analysis, among many others) and remains a technological challenge. Derivation of elemental composition is the first step to determining the identity of an unknown compound by MS, for which high accuracy mass and isotopomer distribution measurements are critical. Here, we report on the development of a dedicated, applications-grade GC/MS employing an Orbitrap mass analyzer, the GC/Quadrupole-Orbitrap. Built from the basis of the benchtop Orbitrap LC/MS, the GC/Quadrupole-Orbitrap maintains the performance characteristics of the Orbitrap, enables quadrupole-based isolation for sensitive analyte detection, and includes numerous analysis modalities to facilitate structural elucidation. We detail the design and construction of the instrument, discuss its key figures-of-merit, and demonstrate its performance for the characterization of unknown compounds and environmental toxins.
Assuntos
Espectrometria de Massas/instrumentação , Desenho de EquipamentoRESUMO
BACKGROUND: Mascot is a commonly used protein identification program for MS as well as for tandem MS data. When analyzing huge shotgun proteomics datasets with Mascot's native tools, limits of computing resources are easily reached. Up to now no application has been available as open source that is capable of converting the full content of Mascot result files from the original MIME format into a database-compatible tabular format, allowing direct import into database management systems and efficient handling of huge datasets analyzed by Mascot. RESULTS: A program called mres2x is presented, which reads Mascot result files, analyzes them and extracts either selected or all information in order to store it in a single file or multiple files in formats which are easier to handle downstream of Mascot. It generates different output formats. The output of mres2x in tab format is especially designed for direct high-performance import into relational database management systems using native tools of these systems. Having the data available in database management systems allows complex queries and extensive analysis. In addition, the original peak lists can be extracted in DTA format suitable for protein identification using the Sequest program, and the Mascot files can be split, preserving the original data format. During conversion, several consistency checks are performed. mres2x is designed to provide high throughput processing combined with the possibility to be driven by other computer programs. The source code including supplement material and precompiled binaries is available via http://www.protein-ms.de and http://sourceforge.net/projects/protms/. CONCLUSION: The database upload allows regrouping of the MS/MS results using a database management system and complex analyzing queries using SQL without the need to run new Mascot searches when changing grouping parameters.
Assuntos
Bases de Dados de Proteínas , Espectrometria de Massas/métodos , Bases de Dados Factuais , Espectrometria de Massas/instrumentação , Proteínas/química , SoftwareRESUMO
UNLABELLED: Scientists usually want to verify the ion matching process of algorithms that look up peptide sequences in DNA or protein databases. The verification step is often done numerically or visually. Not all search algorithms present the appropriate theoretical spectrum information within their results. Thus, the theoretical spectrum for each result should be calculated from the sequence of the matched peptide. We present an operating-system-independent command line tool for this purpose that can be integrated easily into complex as well as existing environments, and can be used to present the theoretical spectrum to the user in either graphical or tabular format by third party products. AVAILABILITY: The code is available via the website http://www.protein-ms.de