Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study.

Ben-Shabat, Niv; Sharvit, Gal; Meimis, Ben; Ben Joya, Daniel; Sloma, Ariel; Kiderman, David; Shabat, Aviv; Tsur, Avishai M; Watad, Abdulla; Amital, Howard

Ben-Shabat, Niv; Sharvit, Gal; Meimis, Ben; Ben Joya, Daniel; Sloma, Ariel; Kiderman, David; Shabat, Aviv; Tsur, Avishai M; Watad, Abdulla; Amital, Howard.

Ben-Shabat N; Sackler Faculty of Medicine, Tel-Aviv University, Israel; Department of Medicine 'B', Sheba Medical Centre, Ramat-Gan, Israel; Zabludowicz Center for Autoimmune Diseases, Sheba Medical Centre, Ramat-Gan, Israel. Electronic address: nivben7@gmail.com.
Sharvit G; Sackler Faculty of Medicine, Tel-Aviv University, Israel.
Meimis B; Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.
Ben Joya D; Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.
Sloma A; Sackler Faculty of Medicine, Tel-Aviv University, Israel.
Kiderman D; Kaplan Medical Center, Rehovot, Israel.
Shabat A; Department of Pediatrics A, Edmond and Lily Safra Children's Hospital, Sheba Medical Center, Ramat-Gan, Israel.
Tsur AM; Sackler Faculty of Medicine, Tel-Aviv University, Israel; Department of Medicine 'B', Sheba Medical Centre, Ramat-Gan, Israel; Zabludowicz Center for Autoimmune Diseases, Sheba Medical Centre, Ramat-Gan, Israel; Israel Defence Forces, Medical Corps, Tel Hashomer, Ramat Gan, Israel.
Watad A; Sackler Faculty of Medicine, Tel-Aviv University, Israel; Department of Medicine 'B', Sheba Medical Centre, Ramat-Gan, Israel; Zabludowicz Center for Autoimmune Diseases, Sheba Medical Centre, Ramat-Gan, Israel; Section of Musculoskeletal Disease, NIHR Leeds Musculoskeletal Biomedical Research Unit,
Amital H; Sackler Faculty of Medicine, Tel-Aviv University, Israel; Department of Medicine 'B', Sheba Medical Centre, Ramat-Gan, Israel; Zabludowicz Center for Autoimmune Diseases, Sheba Medical Centre, Ramat-Gan, Israel.

Int J Med Inform ; 168: 104897, 2022 12.

Article in English | MEDLINE | ID: covidwho-2082412

ABSTRACT

ABSTRACT

BACKGROUND:

The burden on healthcare systems is mounting continuously owing to population growth and aging, overuse of medical services, and the recent COVID-19 pandemic. This overload is also causing reduced healthcare quality and outcomes. One solution gaining momentum is the integration of intelligent self-assessment tools, known as symptom-checkers, into healthcare-providers' systems. To the best of our knowledge, no study so far has investigated the data-gathering capabilities of these tools, which represent a crucial resource for simulating doctors' skills in medical-interviews.

OBJECTIVES:

The goal of this study was to evaluate the data-gathering function of currently available chatbot symptom-checkers.

METHODS:

We evaluated 8 symptom-checkers using 28 clinical vignettes from the repository of MSD-Manual case studies. The mean number of predefined pertinent findings for each case was 31.8 ± 6.8. The vignettes were entered into the platforms by 3 medical students who simulated the role of the patient. For each conversation, we obtained the number of pertinent findings retrieved and the number of questions asked. We then calculated the recall-rates (pertinent-findings retrieved out of all predefined pertinent-findings), and efficiency-rates (pertinent-findings retrieved out of the number of questions asked) of data-gathering, and compared them between the platforms.

RESULTS:

The overall recall rate for all symptom-checkers was 0.32(2,280/7,112;95 %CI 0.31-0.33) for all pertinent findings, 0.37(1,110/2,992;95 %CI 0.35-0.39) for present findings, and 0.28(1140/4120;95 %CI 0.26-0.29) for absent findings. Among the symptom-checkers, Kahun platform had the highest recall rate with 0.51(450/889;95 %CI 0.47-0.54). Out of 4,877 questions asked overall, 2,280 findings were gathered, yielding an efficiency rate of 0.46(95 %CI 0.45-0.48) across all platforms. Kahun was the most efficient tool 0.74 (95 %CI 0.70-0.77) without a statistically significant difference from Your.MD 0.69(95 %CI 0.65-0.73).

CONCLUSION:

The data-gathering performance of currently available symptom checkers is questionable. From among the tools available, Kahun demonstrated the best overall performance.

Subject(s)

COVID-19; Humans; COVID-19/diagnosis; COVID-19/epidemiology; Pandemics; Quality of Health Care; Software

Keywords

Artificial intelligence; Chatbots; Computer-assisted diagnosis; Data-gathering; Diagnosis; Medical interview; Symptom checker; Telemedicine; Triage

Fulltext

XML

PubMed Links

Search on Google

Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 Type of study: Diagnostic study / Experimental Studies / Observational study / Prognostic study Limits: Humans Language: English Journal: Int J Med Inform Journal subject: Medical Informatics Year: 2022 Document Type: Article

Similar

MEDLINE

LILACS

LIS

Fulltext

XML

PubMed Links

Search on Google