Your browser doesn't support javascript.
Novel Ensemble Feature Selection Approach and Application in Repertoire Sequencing Data.
He, Tao; Baik, Jason Min; Kato, Chiemi; Yang, Hai; Fan, Zenghua; Cham, Jason; Zhang, Li.
  • He T; Department of Mathematics, San Francisco State University, San Francisco, CA, United States.
  • Baik JM; Department of Mathematics, San Francisco State University, San Francisco, CA, United States.
  • Kato C; Department of Mathematics, San Francisco State University, San Francisco, CA, United States.
  • Yang H; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States.
  • Fan Z; Department of Medicine, University of California, San Francisco, San Francisco, CA, United States.
  • Cham J; Department of Medicine, Scripps Green Hospital, La Jolla, CA, United States.
  • Zhang L; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States.
Front Genet ; 13: 821832, 2022.
Article in English | MEDLINE | ID: covidwho-1952303
ABSTRACT
The T and B cell repertoire make up the adaptive immune system and is mainly generated through somatic V(D)J gene recombination. Thus, the VJ gene usage may be a potential prognostic or predictive biomarker. However, analysis of the adaptive immune system is challenging due to the heterogeneity of the clonotypes that make up the repertoire. To address the heterogeneity of the T and B cell repertoire, we proposed a novel ensemble feature selection approach and customized statistical learning algorithm focusing on the VJ gene usage. We applied the proposed approach to T cell receptor sequences from recovered COVID-19 patients and healthy donors, as well as a group of lung cancer patients who received immunotherapy. Our approach identified distinct VJ genes used in the COVID-19 recovered patients comparing to the healthy donors and the VJ genes associated with the clinical response in the lung cancer patients. Simulation studies show that the ensemble feature selection approach outperformed other state-of-the-art feature selection methods based on both efficiency and accuracy. It consistently yielded higher stability and sensitivity with lower false discovery rates. When integrated with different classification methods, the ensemble feature selection approach had the best prediction accuracy. In conclusion, the proposed novel approach and the integration procedure is an effective feature selection technique to aid in correctly classifying different subtypes to better understand the signatures in the adaptive immune response associated with disease or the treatment in order to improve treatment strategies.
Keywords

Full text: Available Collection: International databases Database: MEDLINE Type of study: Prognostic study Language: English Journal: Front Genet Year: 2022 Document Type: Article Affiliation country: Fgene.2022.821832

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Type of study: Prognostic study Language: English Journal: Front Genet Year: 2022 Document Type: Article Affiliation country: Fgene.2022.821832