Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
2.
Jpn J Radiol ; 2024 May 11.
Article in English | MEDLINE | ID: mdl-38733472

ABSTRACT

PURPOSE: To assess the performance of GPT-4 Turbo with Vision (GPT-4TV), OpenAI's latest multimodal large language model, by comparing its ability to process both text and image inputs with that of the text-only GPT-4 Turbo (GPT-4 T) in the context of the Japan Diagnostic Radiology Board Examination (JDRBE). MATERIALS AND METHODS: The dataset comprised questions from JDRBE 2021 and 2023. A total of six board-certified diagnostic radiologists discussed the questions and provided ground-truth answers by consulting relevant literature as necessary. The following questions were excluded: those lacking associated images, those with no unanimous agreement on answers, and those including images rejected by the OpenAI application programming interface. The inputs for GPT-4TV included both text and images, whereas those for GPT-4 T were entirely text. Both models were deployed on the dataset, and their performance was compared using McNemar's exact test. The radiological credibility of the responses was assessed by two diagnostic radiologists through the assignment of legitimacy scores on a five-point Likert scale. These scores were subsequently used to compare model performance using Wilcoxon's signed-rank test. RESULTS: The dataset comprised 139 questions. GPT-4TV correctly answered 62 questions (45%), whereas GPT-4 T correctly answered 57 questions (41%). A statistical analysis found no significant performance difference between the two models (P = 0.44). The GPT-4TV responses received significantly lower legitimacy scores from both radiologists than the GPT-4 T responses. CONCLUSION: No significant enhancement in accuracy was observed when using GPT-4TV with image input compared with that of using text-only GPT-4 T for JDRBE questions.

3.
Article in English | MEDLINE | ID: mdl-38719605

ABSTRACT

BACKGROUND AND PURPOSE: The rise of large language models such as generative pre-trained transformers (GPTs) has sparked significant interest in radiology, especially in interpreting radiological reports and image findings. While existing research has focused on GPTs estimating diagnoses from radiological descriptions, exploring alternative diagnostic information sources is also crucial. This study introduces the use of GPTs (GPT-3.5 Turbo and GPT-4) for information retrieval and summarization, searching relevant case reports via PubMed, and investigates their potential to aid diagnosis. MATERIALS AND METHODS: From October 2021 to December 2023, we selected 115 cases from the "Case of the Week" series on the American Journal of Neuroradiology website. Their Description and Legend sections were presented to the GPTs for the two tasks. For the Direct Diagnosis task, the models provided three differential diagnoses that were considered correct if they matched the diagnosis in the diagnosis section. For the Case Report Search task, the models generated two keywords per case, creating PubMed search queries to extract up to three relevant reports. A response was considered correct if reports containing the disease name stated in the diagnosis section were extracted. McNemar's test was employed to evaluate whether adding a Case Report Search to Direct Diagnosis improved overall accuracy. RESULTS: In the Direct Diagnosis task, GPT-3.5 Turbo achieved a correct response rate of 26% (30/115 cases), whereas GPT-4 achieved 41% (47/115). For the Case Report Search task, GPT-3.5 Turbo scored 10% (11/115), and GPT-4 scored 7% (8/115). Correct responses totaled 32% (37/115) with three overlapping cases for GPT-3.5 Turbo, whereas GPT-4 had 43% (50/115) of correct responses with five overlapping cases. Adding Case Report Search improved GPT-3.5 Turbo's performance (p = 0.023) but not that of GPT-4 (p = 0.248). CONCLUSIONS: The effectiveness of adding Case Report Search to GPT-3.5 Turbo was particularly pronounced, suggesting its potential as an alternative diagnostic approach to GPTs, particularly in scenarios where direct diagnoses from GPTs are not obtainable. Nevertheless, the overall performance of GPT models in both direct diagnosis and case report retrieval tasks remains not optimal, and users should be aware of their limitations.ABBREVIATIONS: AI = Artificial Intelligence, GPT = generative pretrained transformer, LLM = large language model.

4.
Insights Imaging ; 15(1): 102, 2024 Apr 05.
Article in English | MEDLINE | ID: mdl-38578554

ABSTRACT

OBJECTIVES: To investigate the relationship between low kidney volume and subsequent estimated glomerular filtration rate (eGFR) decline in eGFR category G2 (60-89 mL/min/1.73 m2) population. METHODS: In this retrospective study, we evaluated 5531 individuals with eGFR category G2 who underwent medical checkups at our institution between November 2006 and October 2017. Exclusion criteria were absent for follow-up visit, missing data, prior renal surgery, current renal disease under treatment, large renal masses, and horseshoe kidney. We developed a 3D U-net-based automated system for renal volumetry on CT images. Participants were grouped by sex-specific kidney volume deviations set at mean minus one standard deviation. After 1:1 propensity score matching, we obtained 397 pairs of individuals in the low kidney volume (LKV) and control groups. The primary endpoint was progression of eGFR categories within 5 years, assessed using Cox regression analysis. RESULTS: This study included 3220 individuals (mean age, 60.0 ± 9.7 years; men, n = 2209). The kidney volume was 404.6 ± 67.1 and 376.8 ± 68.0 cm3 in men and women, respectively. The low kidney volume (LKV) cutoff was 337.5 and 308.8 cm3 for men and women, respectively. LKV was a significant risk factor for the endpoint with an adjusted hazard ratio of 1.64 (95% confidence interval: 1.09-2.45; p = 0.02). CONCLUSION: Low kidney volume may adversely affect subsequent eGFR maintenance; hence, the use of imaging metrics may help predict eGFR decline. CRITICAL RELEVANCE STATEMENT: Low kidney volume is a significant predictor of reduced kidney function over time; thus, kidney volume measurements could aid in early identification of individuals at risk for declining kidney health. KEY POINTS: • This study explores how kidney volume affects subsequent kidney function maintenance. • Low kidney volume was associated with estimated glomerular filtration rate decreases. • Low kidney volume is a prognostic indicator of estimated glomerular filtration rate decline.

5.
Article in English | MEDLINE | ID: mdl-38625446

ABSTRACT

PURPOSE: The quality and bias of annotations by annotators (e.g., radiologists) affect the performance changes in computer-aided detection (CAD) software using machine learning. We hypothesized that the difference in the years of experience in image interpretation among radiologists contributes to annotation variability. In this study, we focused on how the performance of CAD software changes with retraining by incorporating cases annotated by radiologists with varying experience. METHODS: We used two types of CAD software for lung nodule detection in chest computed tomography images and cerebral aneurysm detection in magnetic resonance angiography images. Twelve radiologists with different years of experience independently annotated the lesions, and the performance changes were investigated by repeating the retraining of the CAD software twice, with the addition of cases annotated by each radiologist. Additionally, we investigated the effects of retraining using integrated annotations from multiple radiologists. RESULTS: The performance of the CAD software after retraining differed among annotating radiologists. In some cases, the performance was degraded compared to that of the initial software. Retraining using integrated annotations showed different performance trends depending on the target CAD software, notably in cerebral aneurysm detection, where the performance decreased compared to using annotations from a single radiologist. CONCLUSIONS: Although the performance of the CAD software after retraining varied among the annotating radiologists, no direct correlation with their experience was found. The performance trends differed according to the type of CAD software used when integrated annotations from multiple radiologists were used.

6.
JMIR Med Educ ; 10: e54393, 2024 Mar 12.
Article in English | MEDLINE | ID: mdl-38470459

ABSTRACT

BACKGROUND: Previous research applying large language models (LLMs) to medicine was focused on text-based information. Recently, multimodal variants of LLMs acquired the capability of recognizing images. OBJECTIVE: We aim to evaluate the image recognition capability of generative pretrained transformer (GPT)-4V, a recent multimodal LLM developed by OpenAI, in the medical field by testing how visual information affects its performance to answer questions in the 117th Japanese National Medical Licensing Examination. METHODS: We focused on 108 questions that had 1 or more images as part of a question and presented GPT-4V with the same questions under two conditions: (1) with both the question text and associated images and (2) with the question text only. We then compared the difference in accuracy between the 2 conditions using the exact McNemar test. RESULTS: Among the 108 questions with images, GPT-4V's accuracy was 68% (73/108) when presented with images and 72% (78/108) when presented without images (P=.36). For the 2 question categories, clinical and general, the accuracies with and those without images were 71% (70/98) versus 78% (76/98; P=.21) and 30% (3/10) versus 20% (2/10; P≥.99), respectively. CONCLUSIONS: The additional information from the images did not significantly improve the performance of GPT-4V in the Japanese National Medical Licensing Examination.


Subject(s)
Licensure , Medicine , Japan , Language
7.
Magn Reson Med Sci ; 2024 Feb 07.
Article in English | MEDLINE | ID: mdl-38325833

ABSTRACT

PURPOSE: The purpose of this study was to investigate the longitudinal MRI characteristic of COVID-19-vaccination-related axillary lymphadenopathy by evaluating the size, T2-weighted signal intensity, and apparent diffusion coefficient (ADC) values. METHODS: COVID-19-vaccination-related axillary lymphadenopathy was observed in 90 of 433 health screening program participants on the chest region of whole-body axial MRIs in 2021, as reported in our previous study. Follow-up MRI was performed at an interval of approximately 1 year after the second vaccination dose from 2022 to 2023. The diameter, signal intensity on T2-weighted images, and ADC of the largest enlarged lymph nodes were measured on chest MRI. The values were compared between the post-vaccination MRI and the follow-up MRI, and statistically analyzed. RESULTS: Out of the 90 participants who had enlarged lymph nodes of 5 mm or larger in short axis after the second vaccination dose, 76 participants (45 men and 31 women, mean age: 61 years) were enrolled in the present study. The median short- and long-axis diameter of the enlarged lymph nodes was 7 mm and 9 mm for post-vaccination MRI and 4 mm and 6 mm for follow-up MRI, respectively. The median signal intensity relative to the muscle on T2-weighted images decreased (5.1 for the initial post-vaccination MRI and 3.6 for the follow-up MRI, P < .0001). The ADC values did not show a notable change and remained in a normal range. CONCLUSION: The enlarged axillary lymph nodes decreased both in size and in signal intensity on T2-weighted images of follow-up MRI. The ADC remained unchanged. Our findings may provide important information to establish evidence-based guidelines for conducting proper assessment and management of post-vaccination lymphadenopathy.

8.
J Imaging Inform Med ; 37(3): 1217-1227, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38351224

ABSTRACT

To generate synthetic medical data incorporating image-tabular hybrid data by merging an image encoding/decoding model with a table-compatible generative model and assess their utility. We used 1342 cases from the Stony Brook University Covid-19-positive cases, comprising chest X-ray radiographs (CXRs) and tabular clinical data as a private dataset (pDS). We generated a synthetic dataset (sDS) through the following steps: (I) dimensionally reducing CXRs in the pDS using a pretrained encoder of the auto-encoding generative adversarial networks (αGAN) and integrating them with the correspondent tabular clinical data; (II) training the conditional tabular GAN (CTGAN) on this combined data to generate synthetic records, encompassing encoded image features and clinical data; and (III) reconstructing synthetic images from these encoded image features in the sDS using a pretrained decoder of the αGAN. The utility of sDS was assessed by the performance of the prediction models for patient outcomes (deceased or discharged). For the pDS test set, the area under the receiver operating characteristic (AUC) curve was calculated to compare the performance of prediction models trained separately with pDS, sDS, or a combination of both. We created an sDS comprising CXRs with a resolution of 256 × 256 pixels and tabular data containing 13 variables. The AUC for the outcome was 0.83 when the model was trained with the pDS, 0.74 with the sDS, and 0.87 when combining pDS and sDS for training. Our method is effective for generating synthetic records consisting of both images and tabular clinical data.


Subject(s)
COVID-19 , Radiography, Thoracic , SARS-CoV-2 , Humans , COVID-19/diagnostic imaging , Radiography, Thoracic/methods , Female , Male , Middle Aged , Aged , ROC Curve , Adult
9.
Int J Comput Assist Radiol Surg ; 19(3): 581-590, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38180621

ABSTRACT

PURPOSE: Standardized uptake values (SUVs) derived from 18F-fluoro-2-deoxy-D-glucose positron emission tomography/computed tomography are a crucial parameter for identifying tumors or abnormalities in an organ. Moreover, exploring ways to improve the identification of tumors or abnormalities using a statistical measurement tool is important in clinical research. Therefore, we developed a fully automatic method to create a personally normalized Z-score map of the liver SUV. METHODS: The normalized Z-score map for each patient was created using the SUV mean and standard deviation estimated from blood-test-derived variables, such as alanine aminotransferase and aspartate aminotransferase, as well as other demographic information. This was performed using the least absolute shrinkage and selection operator (LASSO)-based estimation formula. We also used receiver operating characteristic (ROC) to analyze the results of people with and without hepatic tumors and compared them to the ROC curve of normal SUV. RESULTS: A total of 7757 people were selected for this study. Of these, 7744 were healthy, while 13 had abnormalities. The area under the ROC curve results indicated that the anomaly detection approach (0.91) outperformed only the maximum SUV (0.89). To build the LASSO regression, sets of covariates, including sex, weight, body mass index, blood glucose level, triglyceride, total cholesterol, γ-glutamyl transpeptidase, total protein, creatinine, insulin, albumin, and cholinesterase, were used to determine the SUV mean, whereas weight was used to determine the SUV standard deviation. CONCLUSION: The Z-score normalizes the mean and standard deviation. It is effective in ROC curve analysis and increases the clarity of the abnormality. This normalization is a key technique for effective measurement of maximum glucose consumption by tumors in the liver.


Subject(s)
Fluorodeoxyglucose F18 , Neoplasms , Humans , Radiopharmaceuticals , Positron-Emission Tomography/methods , Neoplasms/diagnostic imaging , Liver/diagnostic imaging
10.
Radiol Phys Technol ; 17(1): 103-111, 2024 Mar.
Article in English | MEDLINE | ID: mdl-37917288

ABSTRACT

The purpose of the study was to develop a liver nodule diagnostic method that accurately localizes and classifies focal liver lesions and identifies the specific liver segments in which they reside by integrating a liver segment division algorithm using a four-dimensional (4D) fully convolutional residual network (FC-ResNet) with a localization and classification model. We retrospectively collected data and divided 106 gadolinium-ethoxybenzyl-diethylenetriamine pentaacetic acid-enhanced magnetic resonance examinations into Case-sets 1, 2, and 3. A liver segment division algorithm was developed using a 4D FC-ResNet and trained with semi-automatically created silver-standard annotations; performance was evaluated using manually created gold-standard annotations by calculating the Dice scores for each liver segment. The performance of the liver nodule diagnostic method was assessed by comparing the results with those of the original radiology reports. The mean Dice score between the output of the liver segment division model and the gold standard was 0.643 for Case-set 2 (normal liver contours) and 0.534 for Case-set 1 (deformed liver contours). Among the 64 lesions in Case-set 3, the diagnostic method localized 37 lesions, classified 33 lesions, and identified the liver segments for 30 lesions. A total of 28 lesions were true positives, matching the original radiology reports. The liver nodule diagnostic method, which integrates a liver segment division algorithm with a lesion localization and classification model, exhibits great potential for localizing and classifying focal liver lesions and identifying the liver segments in which they reside. Further improvements and validation using larger sample sizes will enhance its performance and clinical applicability.


Subject(s)
Contrast Media , Liver Neoplasms , Humans , Liver Neoplasms/diagnostic imaging , Liver Neoplasms/pathology , Retrospective Studies , Liver/diagnostic imaging , Gadolinium DTPA , Magnetic Resonance Imaging/methods
11.
Life (Basel) ; 13(12)2023 Dec 06.
Article in English | MEDLINE | ID: mdl-38137904

ABSTRACT

This study aimed to explore the relationship between thyroid-stimulating hormone (TSH) elevation and the baseline computed tomography (CT) density and volume of the thyroid. We examined 86 cases with new-onset hypothyroidism (TSH > 4.5 IU/mL) and 1071 controls from a medical check-up database over 5 years. A deep learning-based thyroid segmentation method was used to assess CT density and volume. Statistical tests and logistic regression were employed to determine differences and odds ratios. Initially, the case group showed a higher CT density (89.8 vs. 81.7 Hounsfield units (HUs)) and smaller volume (13.0 vs. 15.3 mL) than those in the control group. For every +10 HU in CT density and -3 mL in volume, the odds of developing hypothyroidism increased by 1.40 and 1.35, respectively. Over the course of the study, the case group showed a notable CT density reduction (median: -8.9 HU), whereas the control group had a minor decrease (-2.9 HU). Thyroid volume remained relatively stable for both groups. Higher CT density and smaller thyroid volume at baseline are correlated with future TSH elevation. Over time, there was a substantial and minor decrease in CT density in the case and control groups, respectively. Thyroid volumes remained consistent in both cohorts.

12.
JAMA Netw Open ; 6(6): e2318153, 2023 06 01.
Article in English | MEDLINE | ID: mdl-37378985

ABSTRACT

Importance: Characterizing longitudinal patterns of regional brain volume changes in a population with normal cognition at the individual level could improve understanding of the brain aging process and may aid in the prevention of age-related neurodegenerative diseases. Objective: To investigate age-related trajectories of the volumes and volume change rates of brain structures in participants without dementia. Design, Setting, and Participants: This cohort study was conducted from November 1, 2006, to April 30, 2021, at a single academic health-checkup center among 653 individuals who participated in a health screening program with more than 10 years of serial visits. Exposure: Serial magnetic resonance imaging, Mini-Mental State Examination, health checkup. Main Outcomes and Measures: Volumes and volume change rates across brain tissue types and regions. Results: The study sample included 653 healthy control individuals (mean [SD] age at baseline, 55.1 [9.3] years; median age, 55 years [IQR, 47-62 years]; 447 men [69%]), who were followed up annually for up to 15 years (mean [SD], 11.5 [1.8] years; mean [SD] number of scans, 12.1 [1.9]; total visits, 7915). Each brain structure showed characteristic age-dependent volume and atrophy change rates. In particular, the cortical gray matter showed a consistent pattern of volume loss in each brain lobe with aging. The white matter showed an age-related decrease in volume and an accelerated atrophy rate (regression coefficient, -0.016 [95% CI, -0.012 to -0.011]; P < .001). An accelerated age-related volume increase in the cerebrospinal fluid-filled spaces, particularly in the inferior lateral ventricle and the Sylvian fissure, was also observed (ventricle regression coefficient, 0.042 [95% CI, 0.037-0.047]; P < .001; sulcus regression coefficient, 0.021 [95% CI, 0.018-0.023]; P < .001). The temporal lobe atrophy rate accelerated from approximately 70 years of age, preceded by acceleration of atrophy in the hippocampus and amygdala. Conclusions and Relevance: In this cohort study of adults without dementia, age-dependent brain structure volumes and volume change rates in various brain structures were characterized using serial magnetic resonance imaging scans. These findings clarified the normal distributions in the aging brain, which are essential for understanding the process of age-related neurodegenerative diseases.


Subject(s)
Brain , Dementia , Male , Adult , Humans , Middle Aged , Child , Cohort Studies , Brain/diagnostic imaging , Brain/pathology , Aging/pathology , Magnetic Resonance Imaging , Cognition , Atrophy , Dementia/pathology
13.
Abdom Radiol (NY) ; 48(8): 2469-2476, 2023 08.
Article in English | MEDLINE | ID: mdl-37074475

ABSTRACT

PURPOSE: Biliary adenofibroma is a solid microcystic epithelial neoplasm in the liver, comprising microcystic and tubuloacinar glandular tissues lined by a non-mucin secreting biliary epithelium and supported by a fibrous stroma. It is an extremely rare benign tumor with potential for malignant transformation. Herein, we report the case of a 64-year-old woman diagnosed with intrahepatic cholangiocarcinoma arising from biliary adenofibroma. METHODS: Imaging studies revealed a tumor of 50 mm diameter, consisting of two components in S1 of the liver. The ventral portion of the tumor showed an ill-defined mass with early peripheral and gradual centripetal enhancement invading to the middle hepatic vein on computed tomography (CT), diffusion restriction on magnetic resonance images, and high fluorine-18-2-deoxy-D-glucose (FDG) uptake on positron emission tomography, like conventional intrahepatic cholangiocarcinoma. The dorsal portion showed a well-defined and low-attenuated mass with heterogeneous early enhancement and partial wash-out on CT, marked hyperintensity on heavily T2-weighted images, and low FDG uptake. The patient subsequently underwent extended left hepatectomy. RESULTS: Pathologically, the former was diagnosed as cholangiocarcinoma and the latter as biliary adenofibroma. We discuss the radiological-pathological correlation of the tumor with a literature review. CONCLUSION: Preoperative diagnosis of biliary adenofibroma is extremely challenging; however, clinically, it is crucial not to miss the presence of malignant findings.


Subject(s)
Adenofibroma , Bile Duct Neoplasms , Cholangiocarcinoma , Gastrointestinal Neoplasms , Female , Humans , Middle Aged , Fluorodeoxyglucose F18 , Bile Duct Neoplasms/diagnostic imaging , Bile Duct Neoplasms/surgery , Cholangiocarcinoma/diagnostic imaging , Cholangiocarcinoma/surgery , Cholangiocarcinoma/pathology , Gastrointestinal Neoplasms/pathology , Multimodal Imaging , Bile Ducts, Intrahepatic/diagnostic imaging , Bile Ducts, Intrahepatic/pathology , Adenofibroma/diagnostic imaging , Adenofibroma/surgery
14.
Radiol Phys Technol ; 16(1): 28-38, 2023 Mar.
Article in English | MEDLINE | ID: mdl-36344662

ABSTRACT

The purpose of this study was to realize an automated volume measurement of abdominal adipose tissue from the entire abdominal cavity in Dixon magnetic resonance (MR) images using deep learning. Our algorithm involves a combination of extraction of the abdominal cavity and body trunk regions using deep learning and extraction of a fat region based on automatic thresholding. To evaluate the proposed method, we calculated the Dice coefficient (DC) between the extracted regions using deep learning and labeled images. We also compared the visceral adipose tissue (VAT) and subcutaneous adipose tissue volumes calculated by employing the proposed method with those calculated from computed tomography (CT) images scanned on the same day using the automatic calculation method previously developed by our group. We implemented our method as a plug-in in a web-based medical image processing platform. The DCs of the abdominal cavity and body trunk regions were 0.952 ± 0.014 and 0.995 ± 0.002, respectively. The VAT volume measured from MR images using the proposed method was almost equivalent to that measured from CT images. The time required for our plug-in to process the test set was 118.9 ± 28.0 s. Using our proposed method, the VAT volume measured from MR images can be an alternative to that measured from CT images.


Subject(s)
Abdominal Cavity , Deep Learning , Reproducibility of Results , Abdominal Fat/diagnostic imaging , Image Processing, Computer-Assisted/methods , Magnetic Resonance Imaging/methods , Adipose Tissue
15.
Eur Thyroid J ; 12(1)2023 02 01.
Article in English | MEDLINE | ID: mdl-36562641

ABSTRACT

Objective: This study aimed to determine a standardized cut-off value for abnormal 18F-fluorodeoxyglucose (FDG) accumulation in the thyroid gland. Methods: Herein, 7013 FDG-PET/CT scans were included. An automatic thyroid segmentation method using two U-nets (2D- and 3D-U-net) was constructed; mean FDG standardized uptake value (SUV), CT value, and volume of the thyroid gland were obtained from each participant. The values were categorized by thyroid function into three groups based on serum thyroid-stimulating hormone levels. Thyroid function and mean SUV with increments of 1 were analyzed, and risk for thyroid dysfunction was calculated. Thyroid dysfunction detection ability was examined using a machine learning method (LightGBM, Microsoft) with age, sex, height, weight, CT value, volume, and mean SUV as explanatory variables. Results: Mean SUV was significantly higher in females with hypothyroidism. Almost 98.9% of participants in the normal group had mean SUV < 2 and 93.8% participants with mean SUV < 2 had normal thyroid function. The hypothyroidism group had more cases with mean SUV ≥ 2. The relative risk of having abnormal thyroid function was 4.6 with mean SUV ≥ 2. The sensitivity and specificity for detecting thyroid dysfunction using LightGBM (Microsoft) were 14.5 and 99%, respectively. Conclusions: Mean SUV ≥ 2 was strongly associated with abnormal thyroid function in this large cohort, indicating that mean SUV with FDG-PET/CT can be used as a criterion for thyroid evaluation. Preliminarily, this study shows the potential utility of detecting thyroid dysfunction based on imaging findings.


Subject(s)
Hypothyroidism , Thyroid Diseases , Female , Humans , Fluorodeoxyglucose F18 , Positron Emission Tomography Computed Tomography , Tomography, X-Ray Computed/methods , Thyroid Diseases/diagnostic imaging
16.
Radiology ; 306(1): 270-278, 2023 01.
Article in English | MEDLINE | ID: mdl-36098641

ABSTRACT

Background COVID-19 vaccination-related axillary lymphadenopathy has become an important problem in cancer imaging. Data are needed to update or support imaging guidelines for conducting appropriate follow-up. Purpose To investigate the prevalence, predisposing factors, and MRI characteristics of COVID-19 vaccination-related axillary lymphadenopathy. Materials and Methods Prospectively collected prevaccination and postvaccination chest MRI scans were secondarily analyzed. Participants who underwent two doses of either the Pfizer-BioNTech or Moderna COVID-19 vaccine and chest MRI from June to October 2021 were included. Enlarged axillary lymph nodes were identified on postvaccination MRI scans compared with prevaccination scans. The lymph node diameter, signal intensity with T2-weighted imaging, and apparent diffusion coefficient (ADC) of the largest enlarged lymph nodes were measured. These values were compared between prevaccination and postvaccination MRI by using the Wilcoxon signed-rank test. Results Overall, 433 participants (mean age, 65 years ± 11 [SD]; 300 men and 133 women) were included. The prevalence of axillary lymphadenopathy in participants 1-14 days after vaccination was 65% (30 of 46). Participants with lymphadenopathy were younger than those without lymphadenopathy (P < .001). Female sex and the Moderna vaccine were predisposing factors (P = .005 and P = .003, respectively). Five or more enlarged lymph nodes were noted in 2% (eight of 433) of participants. Enlarged lymph nodes greater than or equal to 10 mm in the short axis were noted in 1% (four of 433) of participants. The median signal intensity relative to the muscle on T2-weighted images was 4.0; enlarged lymph nodes demonstrated a higher signal intensity (P = .002). The median ADC of enlarged lymph nodes after vaccination in 90 participants was 1.1 × 10-3 mm2/sec (range, 0.6-2.0 × 10-3 mm2/sec), thus ADC values remained normal. Conclusion Axillary lymphadenopathy after the second dose of the Pfizer-BioNTech or Moderna COVID-19 vaccines was frequent within 2 weeks after vaccination, was typically less than 10 mm in size, and had a normal apparent diffusion coefficient. © RSNA, 2022.


Subject(s)
COVID-19 , Lymphadenopathy , Male , Female , Humans , Aged , COVID-19 Vaccines , 2019-nCoV Vaccine mRNA-1273 , Sensitivity and Specificity , COVID-19/pathology , Magnetic Resonance Imaging/methods , Lymph Nodes/pathology , Vaccination
17.
Tomography ; 8(5): 2129-2152, 2022 08 24.
Article in English | MEDLINE | ID: mdl-36136875

ABSTRACT

Ultra-sparse-view computed tomography (CT) algorithms can reduce radiation exposure for patients, but these algorithms lack an explicit cycle consistency loss minimization and an explicit log-likelihood maximization in testing. Here, we propose X2CT-FLOW for the maximum a posteriori (MAP) reconstruction of a three-dimensional (3D) chest CT image from a single or a few two-dimensional (2D) projection images using a progressive flow-based deep generative model, especially for ultra-low-dose protocols. The MAP reconstruction can simultaneously optimize the cycle consistency loss and the log-likelihood. We applied X2CT-FLOW for the reconstruction of 3D chest CT images from biplanar projection images without noise contamination (assuming a standard-dose protocol) and with strong noise contamination (assuming an ultra-low-dose protocol). We simulated an ultra-low-dose protocol. With the standard-dose protocol, our images reconstructed from 2D projected images and 3D ground-truth CT images showed good agreement in terms of structural similarity (SSIM, 0.7675 on average), peak signal-to-noise ratio (PSNR, 25.89 dB on average), mean absolute error (MAE, 0.02364 on average), and normalized root mean square error (NRMSE, 0.05731 on average). Moreover, with the ultra-low-dose protocol, our images reconstructed from 2D projected images and the 3D ground-truth CT images also showed good agreement in terms of SSIM (0.7008 on average), PSNR (23.58 dB on average), MAE (0.02991 on average), and NRMSE (0.07349 on average).


Subject(s)
Algorithms , Tomography, X-Ray Computed , Humans , Imaging, Three-Dimensional/methods , Radiation Dosage , Signal-To-Noise Ratio , Tomography, X-Ray Computed/methods
18.
Jpn J Radiol ; 40(7): 730-739, 2022 Jul.
Article in English | MEDLINE | ID: mdl-35094221

ABSTRACT

PURPOSE: To develop an anomaly detection system in PET/CT with the tracer 18F-fluorodeoxyglucose (FDG) that requires only normal PET/CT images for training and can detect abnormal FDG uptake at any location in the chest region. MATERIALS AND METHODS: We trained our model based on a Bayesian deep learning framework using 1878 PET/CT scans with no abnormal findings. Our model learns the distribution of standard uptake values in these normal training images and detects out-of-normal uptake regions. We evaluated this model using 34 scans showing focal abnormal FDG uptake in the chest region. This evaluation dataset includes 28 pulmonary and 17 extrapulmonary abnormal FDG uptake foci. We performed per-voxel and per-slice receiver operating characteristic (ROC) analyses and per-lesion free-response receiver operating characteristic analysis. RESULTS: Our model showed an area under the ROC curve of 0.992 on discriminating abnormal voxels and 0.852 on abnormal slices. Our model detected 41 of 45 (91.1%) of the abnormal FDG uptake foci with 12.8 false positives per scan (FPs/scan), which include 26 of 28 pulmonary and 15 of 17 extrapulmonary abnormalities. The sensitivity at 3.0 FPs/scan was 82.2% (37/45). CONCLUSION: Our model trained only with normal PET/CT images successfully detected both pulmonary and extrapulmonary abnormal FDG uptake in the chest region.


Subject(s)
Deep Learning , Fluorodeoxyglucose F18 , Bayes Theorem , Humans , Positron Emission Tomography Computed Tomography/methods , Positron-Emission Tomography/methods , Radiopharmaceuticals
19.
Int J Comput Assist Radiol Surg ; 16(11): 1901-1913, 2021 Nov.
Article in English | MEDLINE | ID: mdl-34652606

ABSTRACT

PURPOSE: The three-dimensional (3D) voxel labeling of lesions requires significant radiologists' effort in the development of computer-aided detection software. To reduce the time required for the 3D voxel labeling, we aimed to develop a generalized semiautomatic segmentation method based on deep learning via a data augmentation-based domain generalization framework. In this study, we investigated whether a generalized semiautomatic segmentation model trained using two types of lesion can segment previously unseen types of lesion. METHODS: We targeted lung nodules in chest CT images, liver lesions in hepatobiliary-phase images of Gd-EOB-DTPA-enhanced MR imaging, and brain metastases in contrast-enhanced MR images. For each lesion, the 32 × 32 × 32 isotropic volume of interest (VOI) around the center of gravity of the lesion was extracted. The VOI was input into a 3D U-Net model to define the label of the lesion. For each type of target lesion, we compared five types of data augmentation and two types of input data. RESULTS: For all considered target lesions, the highest dice coefficients among the training patterns were obtained when using a combination of the existing data augmentation-based domain generalization framework and random monochrome inversion and when using the resized VOI as the input image. The dice coefficients were 0.639 ± 0.124 for the lung nodules, 0.660 ± 0.137 for the liver lesions, and 0.727 ± 0.115 for the brain metastases. CONCLUSIONS: Our generalized semiautomatic segmentation model could label unseen three types of lesion with different contrasts from the surroundings. In addition, the resized VOI as the input image enables the adaptation to the various sizes of lesions even when the size distribution differed between the training set and the test set.


Subject(s)
Deep Learning , Humans , Liver , Magnetic Resonance Imaging , Thorax , Tomography, X-Ray Computed
20.
BMC Med Inform Decis Mak ; 21(1): 262, 2021 09 11.
Article in English | MEDLINE | ID: mdl-34511100

ABSTRACT

BACKGROUND: It is essential for radiologists to communicate actionable findings to the referring clinicians reliably. Natural language processing (NLP) has been shown to help identify free-text radiology reports including actionable findings. However, the application of recent deep learning techniques to radiology reports, which can improve the detection performance, has not been thoroughly examined. Moreover, free-text that clinicians input in the ordering form (order information) has seldom been used to identify actionable reports. This study aims to evaluate the benefits of two new approaches: (1) bidirectional encoder representations from transformers (BERT), a recent deep learning architecture in NLP, and (2) using order information in addition to radiology reports. METHODS: We performed a binary classification to distinguish actionable reports (i.e., radiology reports tagged as actionable in actual radiological practice) from non-actionable ones (those without an actionable tag). 90,923 Japanese radiology reports in our hospital were used, of which 788 (0.87%) were actionable. We evaluated four methods, statistical machine learning with logistic regression (LR) and with gradient boosting decision tree (GBDT), and deep learning with a bidirectional long short-term memory (LSTM) model and a publicly available Japanese BERT model. Each method was used with two different inputs, radiology reports alone and pairs of order information and radiology reports. Thus, eight experiments were conducted to examine the performance. RESULTS: Without order information, BERT achieved the highest area under the precision-recall curve (AUPRC) of 0.5138, which showed a statistically significant improvement over LR, GBDT, and LSTM, and the highest area under the receiver operating characteristic curve (AUROC) of 0.9516. Simply coupling the order information with the radiology reports slightly increased the AUPRC of BERT but did not lead to a statistically significant improvement. This may be due to the complexity of clinical decisions made by radiologists. CONCLUSIONS: BERT was assumed to be useful to detect actionable reports. More sophisticated methods are required to use order information effectively.


Subject(s)
Natural Language Processing , Radiology , Humans , Logistic Models , Machine Learning , Radiography
SELECTION OF CITATIONS
SEARCH DETAIL
...