Your browser doesn't support javascript.
loading
Leveraging immuno-fluorescence data to reduce pathologist annotation requirements in lung tumor segmentation using deep learning.
Mehrabian, Hatef; Brodbeck, Jens; Lyu, Peipei; Vaquero, Edith; Aggarwal, Abhishek; Diehl, Lauri.
Affiliation
  • Mehrabian H; Non-Clinical Safety and Pathobiology, Gilead Sciences, Foster City, CA, USA. hatef.mehrabian@gilead.com.
  • Brodbeck J; Non-Clinical Safety and Pathobiology, Gilead Sciences, Foster City, CA, USA.
  • Lyu P; Non-Clinical Safety and Pathobiology, Gilead Sciences, Foster City, CA, USA.
  • Vaquero E; Non-Clinical Safety and Pathobiology, Gilead Sciences, Foster City, CA, USA.
  • Aggarwal A; Non-Clinical Safety and Pathobiology, Gilead Sciences, Foster City, CA, USA.
  • Diehl L; Non-Clinical Safety and Pathobiology, Gilead Sciences, Foster City, CA, USA.
Sci Rep ; 14(1): 21643, 2024 09 16.
Article in En | MEDLINE | ID: mdl-39284813
ABSTRACT
The main bottleneck in training a robust tumor segmentation algorithm for non-small cell lung cancer (NSCLC) on H&E is generating sufficient ground truth annotations. Various approaches for generating tumor labels to train a tumor segmentation model was explored. A large dataset of low-cost low-accuracy panCK-based annotations was used to pre-train the model and determine the minimum required size of the expensive but highly accurate pathologist annotations dataset. PanCK pre-training was compared to foundation models and various architectures were explored for model backbone. Proper study design and sample procurement for training a generalizable model that captured variations in NSCLC H&E was studied. H&E imaging was performed on 112 samples (three centers, two scanner types, different staining and imaging protocols). Attention U-Net architecture was trained using the large panCK-based annotations dataset (68 samples, total area 10,326 [mm2]) followed by fine-tuning using a small pathologist annotations dataset (80 samples, total area 246 [mm2]). This approach resulted in mean intersection over union (mIoU) of 82% [77 87]. Using panCK pretraining provided better performance compared to foundation models and allowed for 70% reduction in pathologist annotations with no drop in performance. Study design ensured model generalizability over variations on H&E where performance was consistent across centers, scanners, and subtypes.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Carcinoma, Non-Small-Cell Lung / Pathologists / Deep Learning / Lung Neoplasms Limits: Humans Language: En Journal: Sci Rep Year: 2024 Document type: Article Affiliation country: United States Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Carcinoma, Non-Small-Cell Lung / Pathologists / Deep Learning / Lung Neoplasms Limits: Humans Language: En Journal: Sci Rep Year: 2024 Document type: Article Affiliation country: United States Country of publication: United kingdom