Your browser doesn't support javascript.
loading
Human-augmented large language model-driven selection of glutathione peroxidase 4 as a candidate blood transcriptional biomarker for circulating erythroid cells.
Subba, Bishesh; Toufiq, Mohammed; Omi, Fuadur; Yurieva, Marina; Khan, Taushif; Rinchai, Darawan; Palucka, Karolina; Chaussabel, Damien.
Afiliação
  • Subba B; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
  • Toufiq M; Williams College, Williamstown, MA, USA.
  • Omi F; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
  • Yurieva M; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
  • Khan T; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
  • Rinchai D; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
  • Palucka K; St Jude Children's Research Hospital, Memphis, TN, USA.
  • Chaussabel D; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
Sci Rep ; 14(1): 23225, 2024 10 05.
Article em En | MEDLINE | ID: mdl-39369090
ABSTRACT
The identification of optimal candidate genes from large-scale blood transcriptomic data is crucial for developing targeted assays to monitor immune responses. Here, we introduce a novel, optimized large language model (LLM)-based approach for prioritizing candidate biomarkers from blood transcriptional modules. Focusing on module M14.51 from the BloodGen3 repertoire, we implemented a multi-step LLM-driven workflow. Initial high-throughput screening used GPT-4, Claude 3, and Claude 3.5 Sonnet to score and rank the module's constituent genes across six criteria. Top candidates then underwent high-resolution scoring using Consensus GPT, with concurrent manual fact-checking and, when needed, iterative refinement of the scores based on user feedback. Qualitative assessment of literature-based narratives and analysis of reference transcriptome data further refined the selection process. This novel multi-tiered approach consistently identified Glutathione Peroxidase 4 (GPX4) as the top candidate gene for module M14.51. GPX4's role in oxidative stress regulation, its potential as a future drug target, and its expression pattern across diverse cell types supported its selection. The incorporation of reference transcriptome data further validated GPX4 as the most suitable candidate for this module. This study presents an advanced LLM-driven workflow with a novel optimized scoring strategy for candidate gene prioritization, incorporating human-in-the-loop augmentation. The approach identified GPX4 as a key gene in the erythroid cell-associated module M14.51, suggesting its potential utility for biomarker discovery and targeted assay development. By combining AI-driven literature analysis with iterative human expert validation, this method leverages the strengths of both artificial and human intelligence, potentially contributing to the development of biologically relevant and clinically informative targeted assays. Further validation studies are needed to confirm the broader applicability of this human-augmented AI approach.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Biomarcadores / Células Eritroides / Fosfolipídeo Hidroperóxido Glutationa Peroxidase Limite: Humans Idioma: En Revista: Sci Rep / Sci. rep. (Nat. Publ. Group) / Scientific reports (Nature Publishing Group) Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos País de publicação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Biomarcadores / Células Eritroides / Fosfolipídeo Hidroperóxido Glutationa Peroxidase Limite: Humans Idioma: En Revista: Sci Rep / Sci. rep. (Nat. Publ. Group) / Scientific reports (Nature Publishing Group) Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos País de publicação: Reino Unido