Search | VHL Regional Portal

Identification and visualization of multidimensional antigen-specific T-cell populations in polychromatic cytometry data.

Lin, Lin; Frelinger, Jacob; Jiang, Wenxin; Finak, Greg; Seshadri, Chetan; Bart, Pierre-Alexandre; Pantaleo, Giuseppe; McElrath, Julie; DeRosa, Steve; Gottardo, Raphael.

Cytometry A ; 87(7): 675-82, 2015 Jul.

Article in English | MEDLINE | ID: mdl-25908275

ABSTRACT

An important aspect of immune monitoring for vaccine development, clinical trials, and research is the detection, measurement, and comparison of antigen-specific T-cells from subject samples under different conditions. Antigen-specific T-cells compose a very small fraction of total T-cells. Developments in cytometry technology over the past five years have enabled the measurement of single-cells in a multivariate and high-throughput manner. This growth in both dimensionality and quantity of data continues to pose a challenge for effective identification and visualization of rare cell subsets, such as antigen-specific T-cells. Dimension reduction and feature extraction play pivotal role in both identifying and visualizing cell populations of interest in large, multi-dimensional cytometry datasets. However, the automated identification and visualization of rare, high-dimensional cell subsets remains challenging. Here we demonstrate how a systematic and integrated approach combining targeted feature extraction with dimension reduction can be used to identify and visualize biological differences in rare, antigen-specific cell populations. By using OpenCyto to perform semi-automated gating and features extraction of flow cytometry data, followed by dimensionality reduction with t-SNE we are able to identify polyfunctional subpopulations of antigen-specific T-cells and visualize treatment-specific differences between them.

Subject(s)

Antigens/immunology , Cytokines/analysis , Epitopes/immunology , Flow Cytometry/methods , T-Lymphocytes/immunology , Adolescent , Algorithms , Computational Biology/methods , Humans , Leukocytes, Mononuclear , Staining and Labeling , T-Lymphocytes/classification

OpenCyto: an open source infrastructure for scalable, robust, reproducible, and automated, end-to-end flow cytometry data analysis.

Finak, Greg; Frelinger, Jacob; Jiang, Wenxin; Newell, Evan W; Ramey, John; Davis, Mark M; Kalams, Spyros A; De Rosa, Stephen C; Gottardo, Raphael.

PLoS Comput Biol ; 10(8): e1003806, 2014 Aug.

Article in English | MEDLINE | ID: mdl-25167361

ABSTRACT

Flow cytometry is used increasingly in clinical research for cancer, immunology and vaccines. Technological advances in cytometry instrumentation are increasing the size and dimensionality of data sets, posing a challenge for traditional data management and analysis. Automated analysis methods, despite a general consensus of their importance to the future of the field, have been slow to gain widespread adoption. Here we present OpenCyto, a new BioConductor infrastructure and data analysis framework designed to lower the barrier of entry to automated flow data analysis algorithms by addressing key areas that we believe have held back wider adoption of automated approaches. OpenCyto supports end-to-end data analysis that is robust and reproducible while generating results that are easy to interpret. We have improved the existing, widely used core BioConductor flow cytometry infrastructure by allowing analysis to scale in a memory efficient manner to the large flow data sets that arise in clinical trials, and integrating domain-specific knowledge as part of the pipeline through the hierarchical relationships among cell populations. Pipelines are defined through a text-based csv file, limiting the need to write data-specific code, and are data agnostic to simplify repetitive analysis for core facilities. We demonstrate how to analyze two large cytometry data sets: an intracellular cytokine staining (ICS) data set from a published HIV vaccine trial focused on detecting rare, antigen-specific T-cell populations, where we identify a new subset of CD8 T-cells with a vaccine-regimen specific response that could not be identified through manual analysis, and a CyTOF T-cell phenotyping data set where a large staining panel and many cell populations are a challenge for traditional analysis. The substantial improvements to the core BioConductor flow cytometry packages give OpenCyto the potential for wide adoption. It can rapidly leverage new developments in computational cytometry and facilitate reproducible analysis in a unified environment.

Subject(s)

Computational Biology/methods , Flow Cytometry/methods , Software , CD8-Positive T-Lymphocytes , Databases, Factual , Humans , Reproducibility of Results

Setting objective thresholds for rare event detection in flow cytometry.

Richards, Adam J; Staats, Janet; Enzor, Jennifer; McKinnon, Katherine; Frelinger, Jacob; Denny, Thomas N; Weinhold, Kent J; Chan, Cliburn.

J Immunol Methods ; 409: 54-61, 2014 Jul.

Article in English | MEDLINE | ID: mdl-24727143

ABSTRACT

The accurate identification of rare antigen-specific cytokine positive cells from peripheral blood mononuclear cells (PBMC) after antigenic stimulation in an intracellular staining (ICS) flow cytometry assay is challenging, as cytokine positive events may be fairly diffusely distributed and lack an obvious separation from the negative population. Traditionally, the approach by flow operators has been to manually set a positivity threshold to partition events into cytokine-positive and cytokine-negative. This approach suffers from subjectivity and inconsistency across different flow operators. The use of statistical clustering methods does not remove the need to find an objective threshold between between positive and negative events since consistent identification of rare event subsets is highly challenging for automated algorithms, especially when there is distributional overlap between the positive and negative events ("smear"). We present a new approach, based on the Fß measure, that is similar to manual thresholding in providing a hard cutoff, but has the advantage of being determined objectively. The performance of this algorithm is compared with results obtained by expert visual gating. Several ICS data sets from the External Quality Assurance Program Oversight Laboratory (EQAPOL) proficiency program were used to make the comparisons. We first show that visually determined thresholds are difficult to reproduce and pose a problem when comparing results across operators or laboratories, as well as problems that occur with the use of commonly employed clustering algorithms. In contrast, a single parameterization for the Fß method performs consistently across different centers, samples, and instruments because it optimizes the precision/recall tradeoff by using both negative and positive controls.

Subject(s)

Cytokines/blood , Flow Cytometry/standards , Laboratories/standards , Laboratory Proficiency Testing/standards , Leukocytes, Mononuclear/immunology , Monitoring, Immunologic/standards , Algorithms , Automation, Laboratory/standards , Biomarkers/blood , Guideline Adherence/standards , Humans , Observer Variation , Practice Guidelines as Topic/standards , Predictive Value of Tests , Program Development , Quality Control , Quality Indicators, Health Care/standards , Reproducibility of Results , Specimen Handling/standards

Hierarchical modeling for rare event detection and cell subset alignment across flow cytometry samples.

Cron, Andrew; Gouttefangeas, Cécile; Frelinger, Jacob; Lin, Lin; Singh, Satwinder K; Britten, Cedrik M; Welters, Marij J P; van der Burg, Sjoerd H; West, Mike; Chan, Cliburn.

PLoS Comput Biol ; 9(7): e1003130, 2013.

Article in English | MEDLINE | ID: mdl-23874174

ABSTRACT

Flow cytometry is the prototypical assay for multi-parameter single cell analysis, and is essential in vaccine and biomarker research for the enumeration of antigen-specific lymphocytes that are often found in extremely low frequencies (0.1% or less). Standard analysis of flow cytometry data relies on visual identification of cell subsets by experts, a process that is subjective and often difficult to reproduce. An alternative and more objective approach is the use of statistical models to identify cell subsets of interest in an automated fashion. Two specific challenges for automated analysis are to detect extremely low frequency event subsets without biasing the estimate by pre-processing enrichment, and the ability to align cell subsets across multiple data samples for comparative analysis. In this manuscript, we develop hierarchical modeling extensions to the Dirichlet Process Gaussian Mixture Model (DPGMM) approach we have previously described for cell subset identification, and show that the hierarchical DPGMM (HDPGMM) naturally generates an aligned data model that captures both commonalities and variations across multiple samples. HDPGMM also increases the sensitivity to extremely low frequency events by sharing information across multiple samples analyzed simultaneously. We validate the accuracy and reproducibility of HDPGMM estimates of antigen-specific T cells on clinically relevant reference peripheral blood mononuclear cell (PBMC) samples with known frequencies of antigen-specific T cells. These cell samples take advantage of retrovirally TCR-transduced T cells spiked into autologous PBMC samples to give a defined number of antigen-specific T cells detectable by HLA-peptide multimer binding. We provide open source software that can take advantage of both multiple processors and GPU-acceleration to perform the numerically-demanding computations. We show that hierarchical modeling is a useful probabilistic approach that can provide a consistent labeling of cell subsets and increase the sensitivity of rare event detection in the context of quantifying antigen-specific immune responses.

Subject(s)

Flow Cytometry/methods , Lymphocyte Subsets , Models, Biological , Humans , Reproducibility of Results

Optimization of a highly standardized carboxyfluorescein succinimidyl ester flow cytometry panel and gating strategy design using discriminative information measure evaluation.

Chan, Cliburn; Lin, Lin; Frelinger, Jacob; Hérbert, Valérie; Gagnon, Dominic; Landry, Claire; Sékaly, Rafick-Pierre; Enzor, Jennifer; Staats, Janet; Weinhold, Kent J; Jaimes, Maria; West, Mike.

Cytometry A ; 77(12): 1126-36, 2010 Dec.

Article in English | MEDLINE | ID: mdl-21053294

ABSTRACT

The design of a panel to identify target cell subsets in flow cytometry can be difficult when specific markers unique to each cell subset do not exist, and a combination of parameters must be used to identify target cells of interest and exclude irrelevant events. Thus, the ability to objectively measure the contribution of a parameter or group of parameters toward target cell identification independent of any gating strategy could be very helpful for both panel design and gating strategy design. In this article, we propose a discriminative information measure evaluation (DIME) based on statistical mixture modeling; DIME is a numerical measure of the contribution of different parameters towards discriminating a target cell subset from all the others derived from the fitted posterior distribution of a Gaussian mixture model. Informally, DIME measures the "usefulness" of each parameter for identifying a target cell subset. We show how DIME provides an objective basis for inclusion or exclusion of specific parameters in a panel, and how ranked sets of such parameters can be used to optimize gating strategies. An illustrative example of the application of DIME to streamline the gating strategy for a highly standardized carboxyfluorescein succinimidyl ester (CFSE) assay is described.

Subject(s)

Flow Cytometry/methods , Flow Cytometry/standards , CD4-Positive T-Lymphocytes/cytology , CD8-Positive T-Lymphocytes/cytology , Canada , Cell Proliferation , Data Interpretation, Statistical , Fluoresceins , Humans , Normal Distribution , Pilot Projects , Succinimides , United States

Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures.

Suchard, Marc A; Wang, Quanli; Chan, Cliburn; Frelinger, Jacob; Cron, Andrew; West, Mike.

J Comput Graph Stat ; 19(2): 419-438, 2010 Jun 01.

Article in English | MEDLINE | ID: mdl-20877443

ABSTRACT

This article describes advances in statistical computation for large-scale data analysis in structured Bayesian mixture models via graphics processing unit (GPU) programming. The developments are partly motivated by computational challenges arising in fitting models of increasing heterogeneity to increasingly large datasets. An example context concerns common biological studies using high-throughput technologies generating many, very large datasets and requiring increasingly high-dimensional mixture models with large numbers of mixture components. We outline important strategies and processes for GPU computation in Bayesian simulation and optimization approaches, give examples of the benefits of GPU implementations in terms of processing speed and scale-up in ability to analyze large datasets, and provide a detailed, tutorial-style exposition that will benefit readers interested in developing GPU-based approaches in other statistical models. Novel, GPU-oriented approaches to modifying existing algorithms software design can lead to vast speed-up and, critically, enable statistical analyses that presently will not be performed due to compute time limitations in traditional computational environments. Supplemental materials are provided with all source code, example data, and details that will enable readers to implement and explore the GPU approach in this mixture modeling context.

Modeling flow cytometry data for cancer vaccine immune monitoring.

Frelinger, Jacob; Ottinger, Janet; Gouttefangeas, Cécile; Chan, Cliburn.

Cancer Immunol Immunother ; 59(9): 1435-41, 2010 Sep.

Article in English | MEDLINE | ID: mdl-20563720

ABSTRACT

Flow cytometry (FCM) is widely used in cancer research for diagnosis, detection of minimal residual disease, as well as immune monitoring and profiling following immunotherapy. In all these applications, the challenge is to detect extremely rare cell subsets while avoiding spurious positive events. To achieve this objective, it helps to be able to analyze FCM data using multiple markers simultaneously, since the additional information provided often helps to minimize the number of false positive and false negative events, hence increasing both sensitivity and specificity. However, with manual gating, at most two markers can be examined in a single dot plot, and a sequential strategy is often used. As the sequential strategy discards events that fall outside preceding gates at each stage, the effectiveness of the strategy is difficult to evaluate without laborious and painstaking back-gating. Model-based analysis is a promising computational technique that works using information from all marker dimensions simultaneously, and offers an alternative approach to flow analysis that can usefully complement manual gating in the design of optimal gating strategies. Results from model-based analysis will be illustrated with examples from FCM assays commonly used in cancer immunotherapy laboratories.

Subject(s)

Cancer Vaccines , Flow Cytometry , Monitoring, Immunologic/methods , Animals , Cell Separation , Computational Biology/methods , Diagnosis, Computer-Assisted , Humans , Sensitivity and Specificity , Statistics as Topic

Flow: Statistics, visualization and informatics for flow cytometry.

Frelinger, Jacob; Kepler, Thomas B; Chan, Cliburn.

Source Code Biol Med ; 3: 10, 2008 Jun 17.

Article in English | MEDLINE | ID: mdl-18559108

ABSTRACT

Flow is an open source software application for clinical and experimental researchers to perform exploratory data analysis, clustering and annotation of flow cytometric data. Flow is an extensible system that offers the ease of use commonly found in commercial flow cytometry software packages and the statistical power of academic packages like the R BioConductor project.

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL