Automated Speech Recognition in Adult Stroke Survivors: Comparing Human and Computer Transcriptions.

Jacks, Adam; Haley, Katarina L; Bishop, Gary; Harmon, Tyson G

Jacks, Adam; Haley, Katarina L; Bishop, Gary; Harmon, Tyson G.

Afiliação

Jacks A; Division of Speech and Hearing Sciences, Department of Allied Health Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA, adam_jacks@med.unc.edu.
Haley KL; Division of Speech and Hearing Sciences, Department of Allied Health Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.
Bishop G; Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.
Harmon TG; Department of Communication Disorders, Brigham Young University, Provo, Utah, USA.

Folia Phoniatr Logop ; 71(5-6): 286-296, 2019.

Article em En | MEDLINE | ID: mdl-31117105

RESUMO

OBJECTIVE: Speech sound errors are common in people with a variety of communication disorders and can result in impaired message transmission to listeners. Valid and reliable metrics exist to quantify this problem, but they are rarely used in clinical settings due to the time-intensive nature of speech transcription by humans. Automated speech recognition (ASR) technologies have advanced substantially in recent years, enabling them to serve as realistic proxies for human listeners. This study aimed to determine how closely transcription scores from human listeners correspond to scores from an ASR system. PATIENTS AND METHODS: Sentence recordings from 10 stroke survivors with aphasia and apraxia of speech were transcribed orthographically by 3 listeners and a web-based ASR service. Adjusted transcription scores were calculated for all samples based on accuracy of transcribed content words. RESULTS: As expected, transcription scores were significantly higher for the humans than for ASR. However, intraclass correlations revealed excellent agreement among the humans and ASR systems, and the systematically lower scores for computer speech recognition were effectively equalized simply by adding the regression intercept. CONCLUSIONS: The results suggest the clinical feasibility of supplementing or substituting human transcriptions with computer-generated scores, though extension to other speech disorders requires further research.

Assuntos

Afasia/reabilitação; Apraxias/reabilitação; Interface para o Reconhecimento da Fala; Reabilitação do Acidente Vascular Cerebral/métodos; Sobreviventes; Adulto; Idoso; Feminino; Humanos; Masculino; Pessoa de Meia-Idade; Inteligibilidade da Fala

Palavras-chave

Aphasia; Assessment; Automated speech recognition; Intelligibility; Speech transcription; Stroke

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Afasia / Apraxias / Sobreviventes / Interface para o Reconhecimento da Fala / Reabilitação do Acidente Vascular Cerebral Limite: Adult / Aged / Female / Humans / Male / Middle aged Idioma: En Revista: Folia Phoniatr Logop Assunto da revista: PATOLOGIA DA FALA E LINGUAGEM Ano de publicação: 2019 Tipo de documento: Article País de publicação: Suíça

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google