Your browser doesn't support javascript.
Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis.
Lennon, Robert P; Fraleigh, Robbie; Van Scoy, Lauren J; Keshaviah, Aparna; Hu, Xindi C; Snyder, Bethany L; Miller, Erin L; Calo, William A; Zgierska, Aleksandra E; Griffin, Christopher.
  • Lennon RP; Family and Community Medicine, Penn State Health Milton S. Hershey Medical Center, Hershey, Pennsylvania, USA rlennon@pennstatehealth.psu.edu.
  • Fraleigh R; Applied Research Laboratory, Pennsylvania State University, University Park, Pennsylvania, USA.
  • Van Scoy LJ; Internal Medicine, Penn State Health Milton S. Hershey Medical Center, Hershey, Pennsylvania, USA.
  • Keshaviah A; Mathematica Policy Research Inc, Princeton, New Jersey, USA.
  • Hu XC; Mathematica Policy Research Inc, Princeton, New Jersey, USA.
  • Snyder BL; Center for Community Health Integration, Case Western Reserve University, Cleveland, Ohio, USA.
  • Miller EL; Family and Community Medicine, Penn State Health Milton S. Hershey Medical Center, Hershey, Pennsylvania, USA.
  • Calo WA; Public Health Services, Penn State Health Milton S. Hershey Medical Center, Hershey, Pennsylvania, USA.
  • Zgierska AE; Family and Community Medicine, Penn State Health Milton S. Hershey Medical Center, Hershey, Pennsylvania, USA.
  • Griffin C; Applied Research Laboratory, Pennsylvania State University, University Park, Pennsylvania, USA.
Fam Med Community Health ; 9(Suppl 1)2021 11.
Article in English | MEDLINE | ID: covidwho-1537968
ABSTRACT
Qualitative research remains underused, in part due to the time and cost of annotating qualitative data (coding). Artificial intelligence (AI) has been suggested as a means to reduce those burdens, and has been used in exploratory studies to reduce the burden of coding. However, methods to date use AI analytical techniques that lack transparency, potentially limiting acceptance of results. We developed an automated qualitative assistant (AQUA) using a semiclassical approach, replacing Latent Semantic Indexing/Latent Dirichlet Allocation with a more transparent graph-theoretic topic extraction and clustering method. Applied to a large dataset of free-text survey responses, AQUA generated unsupervised topic categories and circle hierarchical representations of free-text responses, enabling rapid interpretation of data. When tasked with coding a subset of free-text data into user-defined qualitative categories, AQUA demonstrated intercoder reliability in several multicategory combinations with a Cohen's kappa comparable to human coders (0.62-0.72), enabling researchers to automate coding on those categories for the entire dataset. The aim of this manuscript is to describe pertinent components of best practices of AI/machine learning (ML)-assisted qualitative methods, illustrating how primary care researchers may use AQUA to rapidly and accurately code large text datasets. The contribution of this article is providing guidance that should increase AI/ML transparency and reproducibility.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Artificial Intelligence / Machine Learning Type of study: Observational study / Qualitative research Limits: Humans Language: English Year: 2021 Document Type: Article Affiliation country: Fmch-2021-001287

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Artificial Intelligence / Machine Learning Type of study: Observational study / Qualitative research Limits: Humans Language: English Year: 2021 Document Type: Article Affiliation country: Fmch-2021-001287