Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 321
Filtrar
1.
Acta Radiol ; : 2841851241266369, 2024 Jul 23.
Artigo em Inglês | MEDLINE | ID: mdl-39043232

RESUMO

Radiographic measurements play a crucial role in evaluating the alignment of distal radius fractures (DRFs). Various manual methods have been used to perform the measurements, but they are susceptible to inaccuracies. Recently, computer-aided methods have become available. This review explores the methods commonly used to assess DRFs. The review introduces the different measurement techniques, discusses the sources of measurement errors and measurement reliability, and provides a recommendation for their use. Radiographic measurements used in the evaluation of DRFs are not reliable. Standardizing the measurement techniques is crucial to address this and automated image analysis could help improve accuracy and reliability.

2.
BMC Prim Care ; 25(1): 267, 2024 Jul 20.
Artigo em Inglês | MEDLINE | ID: mdl-39033295

RESUMO

BACKGROUND: Atrial fibrillation (AF) is highly correlated with heart failure, stroke and death. Screening increases AF detection and facilitates the early adoption of comprehensive intervention. Long-term wearable devices have become increasingly popular for AF screening in primary care. However, interpreting data obtained by long-term wearable ECG devices is a problem in primary care. To diagnose the disease quickly and accurately, we aimed to build AF episode detection model based on a nonlinear Lorenz scattergram (LS) and deep learning. METHODS: The MIT-BIH Normal Sinus Rhythm Database, MIT-BIH Arrhythmia Database and the Long-Term AF Database were extracted to construct the MIT-BIH Ambulatory Electrocardiograph (MIT-BIH AE) dataset. We converted the long-term ECG into a two-dimensional LSs. The LSs from MIT-BIH AE dataset was randomly divided into training and internal validation sets in a 9:1 ratio, which was used to develop and internally validated model. We built a MOBILE-SCREEN-AF (MS-AF) dataset from a single-lead wearable ECG device in primary care for external validation. Performance was quantified using a confusion matrix and standard classification metrics. RESULTS: During the evaluation of model performance based on the LS, the sensitivity, specificity and accuracy of the model in diagnosing AF were 0.992, 0.973, and 0.983 in the internal validation set respectively. In the external validation set, these metrics were 0.989, 0.956, and 0.967, respectively. Furthermore, when evaluating the model's performance based on ECG records in the MS-AF dataset, the sensitivity, specificity and accuracy of model diagnosis paroxysmal AF were 1.000, 0.870 and 0.876 respectively, and 0.927, 1.000 and 0.973 for the persistent AF. CONCLUSIONS: The model based on the nonlinear LS and deep learning has high accuracy, making it promising for AF screening in primary care. It has potential for generalization and practical application.


Assuntos
Fibrilação Atrial , Aprendizado Profundo , Atenção Primária à Saúde , Humanos , Fibrilação Atrial/diagnóstico , Fibrilação Atrial/fisiopatologia , Eletrocardiografia Ambulatorial/instrumentação , Eletrocardiografia Ambulatorial/métodos , Dispositivos Eletrônicos Vestíveis , Eletrocardiografia/instrumentação , Eletrocardiografia/métodos , Masculino , Sensibilidade e Especificidade , Feminino
3.
Asian Spine J ; 18(3): 407-414, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38917858

RESUMO

STUDY DESIGN: An experimental study. PURPOSE: This study aimed to investigate the potential use of artificial neural networks (ANNs) in the detection of odontoid fractures using the Konstanz Information Miner (KNIME) Analytics Platform that provides a technique for computer-assisted diagnosis using radiographic X-ray imaging. OVERVIEW OF LITERATURE: In medical image processing, computer-assisted diagnosis with ANNs from radiographic X-ray imaging is becoming increasingly popular. Odontoid fractures are a common fracture of the axis and account for 10%-15% of all cervical fractures. However, a literature review of computer-assisted diagnosis with ANNs has not been made. METHODS: This study analyzed 432 open-mouth (odontoid) radiographic views of cervical spine X-ray images obtained from dataset repositories, which were used in developing ANN models based on the convolutional neural network theory. All the images contained diagnostic information, including 216 radiographic images of individuals with normal odontoid processes and 216 images of patients with acute odontoid fractures. The model classified each image as either showing an odontoid fracture or not. Specifically, 70% of the images were training datasets used for model training, and 30% were used for testing. KNIME's graphic user interface-based programming enabled class label annotation, data preprocessing, model training, and performance evaluation. RESULTS: The graphic user interface program by KNIME was used to report all radiographic X-ray imaging features. The ANN model performed 50 epochs of training. The performance indices in detecting odontoid fractures included sensitivity, specificity, F-measure, and prediction error of 100%, 95.4%, 97.77%, and 2.3%, respectively. The model's accuracy accounted for 97% of the area under the receiver operating characteristic curve for the diagnosis of odontoid fractures. CONCLUSIONS: The ANN models with the KNIME Analytics Platform were successfully used in the computer-assisted diagnosis of odontoid fractures using radiographic X-ray images. This approach can help radiologists in the screening, detection, and diagnosis of acute odontoid fractures.

4.
Eur Radiol ; 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38861161

RESUMO

PURPOSE: This work aims to assess standard evaluation practices used by the research community for evaluating medical imaging classifiers, with a specific focus on the implications of class imbalance. The analysis is performed on chest X-rays as a case study and encompasses a comprehensive model performance definition, considering both discriminative capabilities and model calibration. MATERIALS AND METHODS: We conduct a concise literature review to examine prevailing scientific practices used when evaluating X-ray classifiers. Then, we perform a systematic experiment on two major chest X-ray datasets to showcase a didactic example of the behavior of several performance metrics under different class ratios and highlight how widely adopted metrics can conceal performance in the minority class. RESULTS: Our literature study confirms that: (1) even when dealing with highly imbalanced datasets, the community tends to use metrics that are dominated by the majority class; and (2) it is still uncommon to include calibration studies for chest X-ray classifiers, albeit its importance in the context of healthcare. Moreover, our systematic experiments confirm that current evaluation practices may not reflect model performance in real clinical scenarios and suggest complementary metrics to better reflect the performance of the system in such scenarios. CONCLUSION: Our analysis underscores the need for enhanced evaluation practices, particularly in the context of class-imbalanced chest X-ray classifiers. We recommend the inclusion of complementary metrics such as the area under the precision-recall curve (AUC-PR), adjusted AUC-PR, and balanced Brier score, to offer a more accurate depiction of system performance in real clinical scenarios, considering metrics that reflect both, discrimination and calibration performance. CLINICAL RELEVANCE STATEMENT: This study underscores the critical need for refined evaluation metrics in medical imaging classifiers, emphasizing that prevalent metrics may mask poor performance in minority classes, potentially impacting clinical diagnoses and healthcare outcomes. KEY POINTS: Common scientific practices in papers dealing with X-ray computer-assisted diagnosis (CAD) systems may be misleading. We highlight limitations in reporting of evaluation metrics for X-ray CAD systems in highly imbalanced scenarios. We propose adopting alternative metrics based on experimental evaluation on large-scale datasets.

5.
Eur Radiol ; 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38842692

RESUMO

OBJECTIVES: To develop an automated pipeline for extracting prostate cancer-related information from clinical notes. MATERIALS AND METHODS: This retrospective study included 23,225 patients who underwent prostate MRI between 2017 and 2022. Cancer risk factors (family history of cancer and digital rectal exam findings), pre-MRI prostate pathology, and treatment history of prostate cancer were extracted from free-text clinical notes in English as binary or multi-class classification tasks. Any sentence containing pre-defined keywords was extracted from clinical notes within one year before the MRI. After manually creating sentence-level datasets with ground truth, Bidirectional Encoder Representations from Transformers (BERT)-based sentence-level models were fine-tuned using the extracted sentence as input and the category as output. The patient-level output was determined by compilation of multiple sentence-level outputs using tree-based models. Sentence-level classification performance was evaluated using the area under the receiver operating characteristic curve (AUC) on 15% of the sentence-level dataset (sentence-level test set). The patient-level classification performance was evaluated on the patient-level test set created by radiologists by reviewing the clinical notes of 603 patients. Accuracy and sensitivity were compared between the pipeline and radiologists. RESULTS: Sentence-level AUCs were ≥ 0.94. The pipeline showed higher patient-level sensitivity for extracting cancer risk factors (e.g., family history of prostate cancer, 96.5% vs. 77.9%, p < 0.001), but lower accuracy in classifying pre-MRI prostate pathology (92.5% vs. 95.9%, p = 0.002) and treatment history of prostate cancer (95.5% vs. 97.7%, p = 0.03) than radiologists, respectively. CONCLUSION: The proposed pipeline showed promising performance, especially for extracting cancer risk factors from patient's clinical notes. CLINICAL RELEVANCE STATEMENT: The natural language processing pipeline showed a higher sensitivity for extracting prostate cancer risk factors than radiologists and may help efficiently gather relevant text information when interpreting prostate MRI. KEY POINTS: When interpreting prostate MRI, it is necessary to extract prostate cancer-related information from clinical notes. This pipeline extracted the presence of prostate cancer risk factors with higher sensitivity than radiologists. Natural language processing may help radiologists efficiently gather relevant prostate cancer-related text information.

6.
Int J Comput Vis ; 132(7): 2567-2584, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38911323

RESUMO

Pulmonary hypertension (PH) in newborns and infants is a complex condition associated with several pulmonary, cardiac, and systemic diseases contributing to morbidity and mortality. Thus, accurate and early detection of PH and the classification of its severity is crucial for appropriate and successful management. Using echocardiography, the primary diagnostic tool in pediatrics, human assessment is both time-consuming and expertise-demanding, raising the need for an automated approach. Little effort has been directed towards automatic assessment of PH using echocardiography, and the few proposed methods only focus on binary PH classification on the adult population. In this work, we present an explainable multi-view video-based deep learning approach to predict and classify the severity of PH for a cohort of 270 newborns using echocardiograms. We use spatio-temporal convolutional architectures for the prediction of PH from each view, and aggregate the predictions of the different views using majority voting. Our results show a mean F1-score of 0.84 for severity prediction and 0.92 for binary detection using 10-fold cross-validation and 0.63 for severity prediction and 0.78 for binary detection on the held-out test set. We complement our predictions with saliency maps and show that the learned model focuses on clinically relevant cardiac structures, motivating its usage in clinical practice. To the best of our knowledge, this is the first work for an automated assessment of PH in newborns using echocardiograms.

7.
Abdom Radiol (NY) ; 2024 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-38916618

RESUMO

OBJECTIVES: To identify lymphatic vascular space invasion (LVSI) and lymphatic node metastasis (LNM) status of endometrial cancer (EC) patients, using radiomics based on MRI images. METHODS: Five hundred and ninety-eight EC patients between January 2015 and September 2020 from two institutions were retrospectively included. Tumoral regions on DWI, T1CE, and T2W images were manually outlined. Radiomics features were extracted from tumor region and peri-tumor region of different thicknesses. We established sub-models to select features from each smaller category. Using this method, we separately constructed radiomic signatures for intra-tumoral and peri-tumoral images using different sequences. We constructed intra-tumoral and peri-tumoral models by combining their features, and a multi-sequence model by combining logits. Models were trained with 397 patients and validated with 170 internal and 31 external patients. RESULTS: For LVSI positive/LNM positive status identification, the multi-parameter MRI radiomics model achieved the area under curve (AUC) values of 0.771 (95%CI: [0.692-0.849])/0.801 (95%CI: [0.704, 0.898]) and 0.864 (95%CI: [0.728-1.000])/0.976 (95%CI: [0.919, 1.000]) in internal and external test cohorts, respectively. CONCLUSIONS: Intra-tumoral and peri-tumoral radiomics signatures based on mpMRI can both be used to identify LVSI or LNM status in EC patients non-invasively. Further studies on LVSI and LNM should pay attention to both of them.

8.
Ultrasound Med Biol ; 50(8): 1194-1202, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-38734528

RESUMO

OBJECTIVES: To assess the value of 3D multiparametric ultrasound imaging, combining hemodynamic and tissue stiffness quantifications by machine learning, for the prediction of prostate biopsy outcomes. METHODS: After signing informed consent, 54 biopsy-naïve patients underwent a 3D dynamic contrast-enhanced ultrasound (DCE-US) recording, a multi-plane 2D shear-wave elastography (SWE) scan with manual sweeping from base to apex of the prostate, and received 12-core systematic biopsies (SBx). 3D maps of 18 hemodynamic parameters were extracted from the 3D DCE-US quantification and a 3D SWE elasticity map was reconstructed based on the multi-plane 2D SWE acquisitions. Subsequently, all the 3D maps were segmented and subdivided into 12 regions corresponding to the SBx locations. Per region, the set of 19 computed parameters was further extended by derivation of eight radiomic features per parameter. Based on this feature set, a multiparametric ultrasound approach was implemented using five different classifiers together with a sequential floating forward selection method and hyperparameter tuning. The classification accuracy with respect to the biopsy reference was assessed by a group-k-fold cross-validation procedure, and the performance was evaluated by the Area Under the Receiver Operating Characteristics Curve (AUC). RESULTS: Of the 54 patients, 20 were found with clinically significant prostate cancer (csPCa) based on SBx. The 18 hemodynamic parameters showed mean AUC values varying from 0.63 to 0.75, and SWE elasticity showed an AUC of 0.66. The multiparametric approach using radiomic features derived from hemodynamic parameters only produced an AUC of 0.81, while the combination of hemodynamic and tissue-stiffness quantifications yielded a significantly improved AUC of 0.85 for csPCa detection (p-value < 0.05) using the Gradient Boosting classifier. CONCLUSIONS: Our results suggest 3D multiparametric ultrasound imaging combining hemodynamic and tissue-stiffness features to represent a promising diagnostic tool for biopsy outcome prediction, aiding in csPCa localization.


Assuntos
Técnicas de Imagem por Elasticidade , Imageamento Tridimensional , Próstata , Neoplasias da Próstata , Ultrassonografia , Humanos , Masculino , Próstata/diagnóstico por imagem , Próstata/patologia , Pessoa de Meia-Idade , Neoplasias da Próstata/diagnóstico por imagem , Neoplasias da Próstata/patologia , Idoso , Imageamento Tridimensional/métodos , Técnicas de Imagem por Elasticidade/métodos , Ultrassonografia/métodos , Valor Preditivo dos Testes , Biópsia
9.
Artigo em Inglês | MEDLINE | ID: mdl-38795148

RESUMO

PURPOSE: This study evaluates the efficacy of two advanced Large Language Models (LLMs), OpenAI's ChatGPT 4 and Google's Gemini Advanced, in providing treatment recommendations for head and neck oncology cases. The aim is to assess their utility in supporting multidisciplinary oncological evaluations and decision-making processes. METHODS: This comparative analysis examined the responses of ChatGPT 4 and Gemini Advanced to five hypothetical cases of head and neck cancer, each representing a different anatomical subsite. The responses were evaluated against the latest National Comprehensive Cancer Network (NCCN) guidelines by two blinded panels using the total disagreement score (TDS) and the artificial intelligence performance instrument (AIPI). Statistical assessments were performed using the Wilcoxon signed-rank test and the Friedman test. RESULTS: Both LLMs produced relevant treatment recommendations with ChatGPT 4 generally outperforming Gemini Advanced regarding adherence to guidelines and comprehensive treatment planning. ChatGPT 4 showed higher AIPI scores (median 3 [2-4]) compared to Gemini Advanced (median 2 [2-3]), indicating better overall performance. Notably, inconsistencies were observed in the management of induction chemotherapy and surgical decisions, such as neck dissection. CONCLUSIONS: While both LLMs demonstrated the potential to aid in the multidisciplinary management of head and neck oncology, discrepancies in certain critical areas highlight the need for further refinement. The study supports the growing role of AI in enhancing clinical decision-making but also emphasizes the necessity for continuous updates and validation against current clinical standards to integrate AI into healthcare practices fully.

11.
Neuroradiology ; 66(7): 1153-1160, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38619571

RESUMO

PURPOSE: To evaluate the impact of an AI-based software trained to detect cerebral aneurysms on TOF-MRA on the diagnostic performance and reading times across readers with varying experience levels. METHODS: One hundred eighty-six MRI studies were reviewed by six readers to detect cerebral aneurysms. Initially, readings were assisted by the CNN-based software mdbrain. After 6 weeks, a second reading was conducted without software assistance. The results were compared to the consensus reading of two neuroradiological specialists and sensitivity (lesion and patient level), specificity (patient level), and false positives per case were calculated for the group of all readers, for the subgroup of physicians, and for each individual reader. Also, reading times for each reader were measured. RESULTS: The dataset contained 54 aneurysms. The readers had no experience (three medical students), 2 years experience (resident in neuroradiology), 6 years experience (radiologist), and 12 years (neuroradiologist). Significant improvements of overall specificity and the overall number of false positives per case were observed in the reading with AI support. For the physicians, we found significant improvements of sensitivity on lesion and patient level and false positives per case. Four readers experienced reduced reading times with the software, while two encountered increased times. CONCLUSION: In the reading with the AI-based software, we observed significant improvements in terms of specificity and false positives per case for the group of all readers and significant improvements of sensitivity and false positives per case for the physicians. Further studies are needed to investigate the effects of the AI-based software in a prospective setting.


Assuntos
Aneurisma Intracraniano , Angiografia por Ressonância Magnética , Sensibilidade e Especificidade , Software , Humanos , Aneurisma Intracraniano/diagnóstico por imagem , Angiografia por Ressonância Magnética/métodos , Masculino , Feminino , Pessoa de Meia-Idade , Competência Clínica , Interpretação de Imagem Assistida por Computador/métodos , Inteligência Artificial , Idoso , Adulto
12.
J Dent Sci ; 19(2): 937-944, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38618087

RESUMO

Background/purpose: Recently, an artificial intelligence-based computer-assisted diagnosis (AI-CAD) for panoramic radiography was developed to scan the inferior margin of the mandible and automatically evaluate mandibular cortical morphology. The aim of this study was to analyze quantitatively the mandibular cortical morphology using the AI-CAD, especially focusing on underlying diseases and dental status in women over 20 years of age. Materials and methods: 419 patients in women over 20 years of age who underwent panoramic radiography were included in this study. The mandibular cortical morphology was analyzed with an AI-CAD that evaluated the degree of deformation of the mandibular inferior cortex (MIC) and mandibular cortical index (MCI) automatically. Those were analyzed in relation to underlying diseases, such as diabetes, hypertension, dyslipidemia, rheumatism and osteoporosis, and dental status, such as the number of teeth present in the maxilla and mandible. Results: The degree of deformation of MIC in women under 51 years of age (21-50 years; n = 229, 16.0 ± 12.7) was significantly lower than those of over 50 years of age (51-90 years; n = 190, 45.1 ± 23.0), and the MCI was a significant difference for the different age group. Regarding the degree of deformation of MIC and MCI in women over 50 years of age, osteoporosis and number of total teeth present in the maxilla and mandible were significant differences. Conclusion: The results of this study indicated that the mandibular cortical morphology using the AI-CAD is significantly related to osteoporosis and dental status in women over 50 years of age.

13.
J Alzheimers Dis ; 98(3): 793-823, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38489188

RESUMO

Background: The growing number of older adults in recent decades has led to more prevalent geriatric diseases, such as strokes and dementia. Therefore, Alzheimer's disease (AD), as the most common type of dementia, has become more frequent too. Background: Objective: The goals of this work are to present state-of-the-art studies focused on the automatic diagnosis and prognosis of AD and its early stages, mainly mild cognitive impairment, and predicting how the research on this topic may change in the future. Methods: Articles found in the existing literature needed to fulfill several selection criteria. Among others, their classification methods were based on artificial neural networks (ANNs), including deep learning, and data not from brain signals or neuroimaging techniques were used. Considering our selection criteria, 42 articles published in the last decade were finally selected. Results: The most medically significant results are shown. Similar quantities of articles based on shallow and deep ANNs were found. Recurrent neural networks and transformers were common with speech or in longitudinal studies. Convolutional neural networks (CNNs) were popular with gait or combined with others in modular approaches. Above one third of the cross-sectional studies utilized multimodal data. Non-public datasets were frequently used in cross-sectional studies, whereas the opposite in longitudinal ones. The most popular databases were indicated, which will be helpful for future researchers in this field. Conclusions: The introduction of CNNs in the last decade and their superb results with neuroimaging data did not negatively affect the usage of other modalities. In fact, new ones emerged.

14.
15.
Ultrasound Med Biol ; 50(6): 882-887, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38494413

RESUMO

OBJECTIVE: Deep learning algorithms have commonly been used for the differential diagnosis between benign and malignant thyroid nodules. The aim of the study described here was to develop an integrated system that combines a deep learning model and a clinical standard Thyroid Imaging Reporting and Data System (TI-RADS) for the simultaneous segmentation and risk stratification of thyroid nodules. METHODS: Three hundred four ultrasound images from two independent sites with TI-RADS 4 thyroid nodules were collected. The edge connection and Criminisi algorithm were used to remove manually induced markers in ultrasound images. An integrated system based on TI-RADS and a mask region-based convolution neural network (Mask R-CNN) was proposed to stratify subclasses of TI-RADS 4 thyroid nodules and to segment thyroid nodules in the ultrasound images. Accuracy and the precision-recall curve were used to evaluate stratification performance, and the Dice similarity coefficient (DSC) between the segmentation of Mask R-CNN and the radiologist's contour was used to evaluate the segmentation performance of the model. RESULTS: The combined approach could significantly enhance the performance of the proposed integrated system. Overall stratification accuracy of TI-RADS 4 thyroid nodules, mean average precision and mean DSC of the proposed model in the independent test set was 90.79%, 0.8579 and 0.83, respectively. Specifically, stratification accuracy values for TI-RADS 4a, 4b and 4c thyroid nodules were 95.83%, 84.21% and 77.78%, respectively. CONCLUSION: An integrated system combining TI-RADS and a deep learning model was developed. The system can provide clinicians with not only diagnostic assistance from TI-RADS but also accurate segmentation of thyroid nodules, which improves the applicability of the system in clinical practice.


Assuntos
Aprendizado Profundo , Nódulo da Glândula Tireoide , Ultrassonografia , Nódulo da Glândula Tireoide/diagnóstico por imagem , Humanos , Ultrassonografia/métodos , Medição de Risco , Masculino , Feminino , Glândula Tireoide/diagnóstico por imagem , Pessoa de Meia-Idade , Adulto , Idoso
16.
Med Eng Phys ; 123: 104077, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-38365344

RESUMO

The process of feature selection (FS) is vital aspect of machine learning (ML) model's performance enhancement where the objective is the selection of the most influential subset of features. This paper suggests the Gravitational search optimization algorithm (GSOA) technique for metaheuristic-based FS. Glaucoma disease is selected as the subject of investigation as this disease is spreading worldwide at a very fast pace; 111 million instances of glaucoma are expected by 2040, up from 64 million in 2015. It causes widespread vision impairment. Optic nerve fibres can be degraded and cannot be replaced later in this disease. As a starting point, the retinal fundus images of glaucoma infected persons and healthy persons are used, and 36 features were retrieved from these images of public benchmark datasets and private dataset. Six ML models are trained for classification on the basis of the GSOA's returned subset of features. The suggested FS technique enhances classification performance with selection of most influential features. The eight statistical performance evaluating parameters along with execution time are calculated. The training and testing have been performed using a split approach (70:30), 5-fold cross validation (CV), as well as 10-fold CV. The suggested approach achieved 95.36 % accuracy. Due to its auspicious performance, doctors might use the suggested method to receive a second opinion, which would also help overburdened skilled medical practitioners and save patients from vision loss.


Assuntos
Sistemas de Apoio a Decisões Clínicas , Glaucoma , Humanos , Glaucoma/diagnóstico por imagem , Fundo de Olho , Aprendizado de Máquina , Algoritmos
17.
Cancers (Basel) ; 16(2)2024 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-38254765

RESUMO

Ultrasonography is the preferred modality for detailed evaluation of enlarged lymph nodes (LNs) identified on computed tomography and/or magnetic resonance imaging, owing to its high spatial resolution. However, the diagnostic performance of ultrasonography depends on the examiner's expertise. To support the ultrasonographic diagnosis, we developed YOLOv7-based deep learning models for metastatic LN detection on ultrasonography and compared their detection performance with that of highly experienced radiologists and less experienced residents. We enrolled 462 B- and D-mode ultrasound images of 261 metastatic and 279 non-metastatic histopathologically confirmed LNs from 126 patients with head and neck squamous cell carcinoma. The YOLOv7-based B- and D-mode models were optimized using B- and D-mode training and validation images and their detection performance for metastatic LNs was evaluated using B- and D-mode testing images, respectively. The D-mode model's performance was comparable to that of radiologists and superior to that of residents' reading of D-mode images, whereas the B-mode model's performance was higher than that of residents but lower than that of radiologists on B-mode images. Thus, YOLOv7-based B- and D-mode models can assist less experienced residents in ultrasonographic diagnoses. The D-mode model could raise the diagnostic performance of residents to the same level as experienced radiologists.

18.
Stud Health Technol Inform ; 310: 1051-1055, 2024 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-38269975

RESUMO

A clinical decision support system based on different methods of artificial intelligence (AI) can support the diagnosis of patients with unclear diseases by providing tentative diagnoses as well as proposals for further steps. In a user-centred-design process, we aim to find out how general practitioners envision the user interface of an AI-based clinical decision support system for primary care. A first user-interface prototype was developed using the task model based on user requirements from preliminary work. Five general practitioners evaluated the prototype in two workshops. The discussion of the prototype resulted in categorized suggestions with key messages for further development of the AI-based clinical decision support system, such as the integration of intelligent parameter requests. The early inclusion of different user feedback facilitated the implementation of a user interface for a user-friendly decision support system.


Assuntos
Sistemas de Apoio a Decisões Clínicas , Clínicos Gerais , Humanos , Inteligência Artificial , Inteligência , Atenção Primária à Saúde
19.
Eur Arch Otorhinolaryngol ; 281(4): 1835-1841, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38189967

RESUMO

PURPOSE: This study aimed to evaluate the utility of large language model (LLM) artificial intelligence tools, Chat Generative Pre-Trained Transformer (ChatGPT) versions 3.5 and 4, in managing complex otolaryngological clinical scenarios, specifically for the multidisciplinary management of odontogenic sinusitis (ODS). METHODS: A prospective, structured multidisciplinary specialist evaluation was conducted using five ad hoc designed ODS-related clinical scenarios. LLM responses to these scenarios were critically reviewed by a multidisciplinary panel of eight specialist evaluators (2 ODS experts, 2 rhinologists, 2 general otolaryngologists, and 2 maxillofacial surgeons). Based on the level of disagreement from panel members, a Total Disagreement Score (TDS) was calculated for each LLM response, and TDS comparisons were made between ChatGPT3.5 and ChatGPT4, as well as between different evaluators. RESULTS: While disagreement to some degree was demonstrated in 73/80 evaluator reviews of LLMs' responses, TDSs were significantly lower for ChatGPT4 compared to ChatGPT3.5. Highest TDSs were found in the case of complicated ODS with orbital abscess, presumably due to increased case complexity with dental, rhinologic, and orbital factors affecting diagnostic and therapeutic options. There were no statistically significant differences in TDSs between evaluators' specialties, though ODS experts and maxillofacial surgeons tended to assign higher TDSs. CONCLUSIONS: LLMs like ChatGPT, especially newer versions, showed potential for complimenting evidence-based clinical decision-making, but substantial disagreement was still demonstrated between LLMs and clinical specialists across most case examples, suggesting they are not yet optimal in aiding clinical management decisions. Future studies will be important to analyze LLMs' performance as they evolve over time.


Assuntos
Inteligência Artificial , Sinusite , Humanos , Estudos Prospectivos , Reprodutibilidade dos Testes , Idioma
20.
Clin Endosc ; 57(1): 18-23, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38178329

RESUMO

Computer-assisted polyp characterization (computer-aided diagnosis, CADx) facilitates optical diagnosis during colonoscopy. Several studies have demonstrated high sensitivity and specificity of CADx tools in identifying neoplastic changes in colorectal polyps. To implement CADx tools in colonoscopy, there is a need to confirm whether these tools satisfy the threshold levels that are required to introduce optical diagnosis strategies such as "diagnose-and-leave," "resect-and-discard" or "DISCARD-lite." In this article, we review the available data from prospective trials regarding the effect of multiple CADx tools and discuss whether they meet these thresholds.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...