Gemini AI vs. ChatGPT: A comprehensive examination alongside ophthalmology residents in medical knowledge.

Bahir, Daniel; Zur, Omri; Attal, Leah; Nujeidat, Zaki; Knaanie, Ariela; Pikkel, Joseph; Mimouni, Michael; Plopsky, Gilad

Bahir, Daniel; Zur, Omri; Attal, Leah; Nujeidat, Zaki; Knaanie, Ariela; Pikkel, Joseph; Mimouni, Michael; Plopsky, Gilad.

Afiliação

Bahir D; Department of Ophthalmology, Tzafon Medical Center, Poriya, Israel. bahirdaniel@gmail.com.
Zur O; Azrieli Faculty of Medicine, Bar Ilan University, Safed, Israel. bahirdaniel@gmail.com.
Attal L; Azrieli Faculty of Medicine, Bar Ilan University, Safed, Israel.
Nujeidat Z; Azrieli Faculty of Medicine, Bar Ilan University, Safed, Israel.
Knaanie A; Department of Ophthalmology, Tzafon Medical Center, Poriya, Israel.
Pikkel J; Azrieli Faculty of Medicine, Bar Ilan University, Safed, Israel.
Mimouni M; Department of Ophthalmology, Samson Assuta Ashdod Hospital, Ashdod, Israel.
Plopsky G; Department of Ophthalmology, Samson Assuta Ashdod Hospital, Ashdod, Israel.

Graefes Arch Clin Exp Ophthalmol ; 2024 Sep 15.

Article em En | MEDLINE | ID: mdl-39277830

ABSTRACT

ABSTRACT

INTRODUCTION:

The rapid advancement of artificial intelligence (AI), particularly in large language models like ChatGPT and Google's Gemini AI, marks a transformative era in technological innovation. This study explores the potential of AI in ophthalmology, focusing on the capabilities of ChatGPT and Gemini AI. While these models hold promise for medical education and clinical support, their integration requires comprehensive evaluation. This research aims to bridge a gap in the literature by comparing Gemini AI and ChatGPT, assessing their performance against ophthalmology residents using a dataset derived from ophthalmology board exams.

METHODS:

A dataset comprising 600 questions across 12 subspecialties was curated from Israeli ophthalmology residency exams, encompassing text and image-based formats. Four AI models - ChatGPT-3.5, ChatGPT-4, Gemini, and Gemini Advanced - underwent testing on this dataset. The study includes a comparative analysis with Israeli ophthalmology residents, employing specific metrics for performance assessment.

RESULTS:

Gemini Advanced demonstrated superior performance with a 66% accuracy rate. Notably, ChatGPT-4 exhibited improvement at 62%, Gemini at 58%, and ChatGPT-3.5 served as the reference at 46%. Comparative analysis with residents offered insights into AI models' performance relative to human-level medical knowledge. Further analysis delved into yearly performance trends, topic-specific variations, and the impact of images on chatbot accuracy.

CONCLUSION:

The study unveils nuanced AI model capabilities in ophthalmology, emphasizing domain-specific variations. The superior performance of Gemini Advanced superior performance indicates significant advancements, while ChatGPT-4's improvement is noteworthy. Both Gemini and ChatGPT-3.5 demonstrated commendable performance. The comparative analysis underscores AI's evolving role as a supplementary tool in medical education. This research contributes vital insights into AI effectiveness in ophthalmology, highlighting areas for refinement. As AI models evolve, targeted improvements can enhance adaptability across subspecialties, making them valuable tools for medical professionals and enriching patient care. KEY MESSAGES What is known AI breakthroughs, like ChatGPT and Google's Gemini AI, are reshaping healthcare. In ophthalmology, AI integration has overhauled clinical workflows, particularly in analyzing images for diseases like diabetic retinopathy and glaucoma. What is new This study presents a pioneering comparison between Gemini AI and ChatGPT, evaluating their performance against ophthalmology residents using a meticulously curated dataset derived from real-world ophthalmology board exams. Notably, Gemini Advanced demonstrates superior performance, showcasing substantial advancements, while the evolution of ChatGPT-4 also merits attention. Both models exhibit commendable capabilities. These findings offer crucial insights into the efficacy of AI in ophthalmology, shedding light on areas ripe for further enhancement and optimization.

Palavras-chave

Artificial Intelligence (AI); ChatGPT; Chatbots; Gemini AI; Ophthalmology; Residents

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Graefe's archive for clinical and experimental ophthalmology / Graefes Arch Clin Exp Ophthalmol / Graefes arch. clin. exp. ophthalmol Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Israel País de publicação: Alemanha

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google