A multiscale feature extraction algorithm for dysarthric speech recognition

Jianxing ZHAO; Peiyun XUE; Jing BAI; Chenkang SHI; Bo YUAN; Tongtong SHI

A multiscale feature extraction algorithm for dysarthric speech recognition / 生物医学工程学杂志

Jianxing ZHAO; Peiyun XUE; Jing BAI; Chenkang SHI; Bo YUAN; Tongtong SHI.

Journal of Biomedical Engineering ; (6): 44-50, 2023.

Artigo em Chinês | WPRIM | ID: wpr-970672

ABSTRACT

ABSTRACT

In this paper, we propose a multi-scale mel domain feature map extraction algorithm to solve the problem that the speech recognition rate of dysarthria is difficult to improve. We used the empirical mode decomposition method to decompose speech signals and extracted Fbank features and their first-order differences for each of the three effective components to construct a new feature map, which could capture details in the frequency domain. Secondly, due to the problems of effective feature loss and high computational complexity in the training process of single channel neural network, we proposed a speech recognition network model in this paper. Finally, training and decoding were performed on the public UA-Speech dataset. The experimental results showed that the accuracy of the speech recognition model of this method reached 92.77%. Therefore, the algorithm proposed in this paper can effectively improve the speech recognition rate of dysarthria.

Assuntos

Humanos; Disartria/diagnóstico; Fala; Percepção da Fala; Algoritmos; Redes Neurais de Computação

Dysarthric; Empirical mode decomposition; Fbank characteristics; Speech recognition

Texto completo

Imprimir

XML

Buscar no Google

Texto completo: DisponíveL Índice: WPRIM (Pacífico Ocidental) Assunto principal: Fala / Percepção da Fala / Algoritmos / Redes Neurais de Computação / Disartria Limite: Humanos Idioma: Chinês Revista: Journal of Biomedical Engineering Ano de publicação: 2023 Tipo de documento: Artigo

Similares

MEDLINE

LILACS

LIS

Texto completo

Imprimir

XML

Buscar no Google