ABSTRACT
Development of biopharmaceutical production cell lines requires efficient screening methods to select the host cell line and final production clone. This is often complicated by an incomplete understanding of the relationship between protein heterogeneity and function at early stages of product development. LC-MS/MS peptide mapping is well suited to the discovery and quantitation of protein heterogeneity; however, the intense hands-on time required to generate and analyze LC-MS/MS data typically accommodates only smaller sample sets at later stages of clone selection. Here we describe a simple approach to peptide mapping designed for large sample sets that includes higher-throughput sample preparation and automated data analysis. This approach allows for the inclusion of orthogonal protease digestions and multiple replicates of an assay control that encode an assessment of accuracy and precision into the data, significantly simplifying the identification of true-positive annotations in the LC-MS/MS results. This methodology was used to comprehensively identify and quantify glycosylation, degradation, unexpected post-translational modifications, and three types of sequence variants in a previously uncharacterized non-mAb protein therapeutic expressed in approximately 100 clones from three host cell lines. Several product quality risks were identified allowing for a more informed selection of the production clone. Moreover, the variability inherent in this unique sample set provides important structure/function information to support quality attribute identification and criticality assessments, two key components of Quality by Design.