Search | VHL Regional Portal

A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

Buda, Mateusz; Saha, Ashirbani; Walsh, Ruth; Ghate, Sujata; Li, Nianyi; Swiecicki, Albert; Lo, Joseph Y; Mazurowski, Maciej A.

JAMA Netw Open ; 4(8): e2119100, 2021 08 02.

Article in English | MEDLINE | ID: mdl-34398205

ABSTRACT

Importance: Breast cancer screening is among the most common radiological tasks, with more than 39 million examinations performed each year. While it has been among the most studied medical imaging applications of artificial intelligence, the development and evaluation of algorithms are hindered by the lack of well-annotated, large-scale publicly available data sets. Objectives: To curate, annotate, and make publicly available a large-scale data set of digital breast tomosynthesis (DBT) images to facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening; to develop a baseline deep learning model for breast cancer detection; and to test this model using the data set to serve as a baseline for future research. Design, Setting, and Participants: In this diagnostic study, 16â¯802 DBT examinations with at least 1 reconstruction view available, performed between August 26, 2014, and January 29, 2018, were obtained from Duke Health System and analyzed. From the initial cohort, examinations were divided into 4 groups and split into training and test sets for the development and evaluation of a deep learning model. Images with foreign objects or spot compression views were excluded. Data analysis was conducted from January 2018 to October 2020. Exposures: Screening DBT. Main Outcomes and Measures: The detection algorithm was evaluated with breast-based free-response receiver operating characteristic curve and sensitivity at 2 false positives per volume. Results: The curated data set contained 22â¯032 reconstructed DBT volumes that belonged to 5610 studies from 5060 patients with a mean (SD) age of 55 (11) years and 5059 (100.0%) women. This included 4 groups of studies: (1) 5129 (91.4%) normal studies; (2) 280 (5.0%) actionable studies, for which where additional imaging was needed but no biopsy was performed; (3) 112 (2.0%) benign biopsied studies; and (4) 89 studies (1.6%) with cancer. Our data set included masses and architectural distortions that were annotated by 2 experienced radiologists. Our deep learning model reached breast-based sensitivity of 65% (39 of 60; 95% CI, 56%-74%) at 2 false positives per DBT volume on a test set of 460 examinations from 418 patients. Conclusions and Relevance: The large, diverse, and curated data set presented in this study could facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening by providing data for training as well as a common set of cases for model validation. The performance of the model developed in this study showed that the task remains challenging; its performance could serve as a baseline for future model development.

Subject(s)

Breast Neoplasms/diagnosis , Datasets as Topic , Deep Learning , Early Detection of Cancer/methods , Mammography , Aged , Breast/diagnostic imaging , False Positive Reactions , Female , Humans , Middle Aged , ROC Curve , Reproducibility of Results

A generative adversarial network-based abnormality detection using only normal images for model training with application to digital breast tomosynthesis.

Swiecicki, Albert; Konz, Nicholas; Buda, Mateusz; Mazurowski, Maciej A.

Sci Rep ; 11(1): 10276, 2021 05 13.

Article in English | MEDLINE | ID: mdl-33986361

ABSTRACT

Deep learning has shown tremendous potential in the task of object detection in images. However, a common challenge with this task is when only a limited number of images containing the object of interest are available. This is a particular issue in cancer screening, such as digital breast tomosynthesis (DBT), where less than 1% of cases contain cancer. In this study, we propose a method to train an inpainting generative adversarial network to be used for cancer detection using only images that do not contain cancer. During inference, we removed a part of the image and used the network to complete the removed part. A significant error in completing an image part was considered an indication that such location is unexpected and thus abnormal. A large dataset of DBT images used in this study was collected at Duke University. It consisted of 19,230 reconstructed volumes from 4348 patients. Cancerous masses and architectural distortions were marked with bounding boxes by radiologists. Our experiments showed that the locations containing cancer were associated with a notably higher completion error than the non-cancer locations (mean error ratio of 2.77). All data used in this study has been made publicly available by the authors.

Subject(s)

Breast Neoplasms/diagnostic imaging , Breast/diagnostic imaging , Computer Simulation , Mammography/methods , Neural Networks, Computer , Female , Humans , Middle Aged , Radiographic Image Enhancement/methods , Radiographic Image Interpretation, Computer-Assisted/methods

Deep learning-based algorithm for assessment of knee osteoarthritis severity in radiographs matches performance of radiologists.

Swiecicki, Albert; Li, Nianyi; O'Donnell, Jonathan; Said, Nicholas; Yang, Jichen; Mather, Richard C; Jiranek, William A; Mazurowski, Maciej A.

Comput Biol Med ; 133: 104334, 2021 06.

Article in English | MEDLINE | ID: mdl-33823398

ABSTRACT

A fully-automated deep learning algorithm matched performance of radiologists in assessment of knee osteoarthritis severity in radiographs using the Kellgren-Lawrence grading system. PURPOSE: To develop an automated deep learning-based algorithm that jointly uses Posterior-Anterior (PA) and Lateral (LAT) views of knee radiographs to assess knee osteoarthritis severity according to the Kellgren-Lawrence grading system. MATERIALS AND METHODS: We used a dataset of 9739 exams from 2802 patients from Multicenter Osteoarthritis Study (MOST). The dataset was divided into a training set of 2040 patients, a validation set of 259 patients and a test set of 503 patients. A novel deep learning-based method was utilized for assessment of knee OA in two steps: (1) localization of knee joints in the images, (2) classification according to the KL grading system. Our method used both PA and LAT views as the input to the model. The scores generated by the algorithm were compared to the grades provided in the MOST dataset for the entire test set as well as grades provided by 5 radiologists at our institution for a subset of the test set. RESULTS: The model obtained a multi-class accuracy of 71.90% on the entire test set when compared to the ratings provided in the MOST dataset. The quadratic weighted Kappa coefficient for this set was 0.9066. The average quadratic weighted Kappa between all pairs of radiologists from our institution who took part in the study was 0.748. The average quadratic-weighted Kappa between the algorithm and the radiologists at our institution was 0.769. CONCLUSION: The proposed model performed demonstrated equivalency of KL classification to MSK radiologists, but clearly superior reproducibility. Our model also agreed with radiologists at our institution to the same extent as the radiologists with each other. The algorithm could be used to provide reproducible assessment of knee osteoarthritis severity.

Subject(s)

Deep Learning , Osteoarthritis, Knee , Algorithms , Humans , Osteoarthritis, Knee/diagnostic imaging , Radiologists , Reproducibility of Results

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL