MFCC-based deep convolutional neural network for audio depression recognition

Wang, Y.; Lu, X.; Shi, D.

Wang, Y.; Lu, X.; Shi, D..

2022 International Conference on Asian Language Processing, IALP 2022 ; : 162-166, 2022.

Article in English | Scopus | ID: covidwho-2191795

ABSTRACT

ABSTRACT

Emotions are a key factor affecting online learning, which has become the new normal due to the COVID-19 pandemic. Identifying students' emotions has important educational implications. Long-term negative emotions can lead to depression, and early identification can help reduce stress and improve learning efficiency. For depressed students in desperate need, current diagnoses are expensive and highly subjective. In response to the above problems, this paper proposes a method for automatic diagnosis of depression with the help of deep convolutional neural networks (Convolutional Neural Network, CNNs). This method inputs the Mel-frequency cepstral coefficients (MFCC) feature maps generated by the preprocessed speech into residual CNNs for training, and adjusts the number of layers and nodes of the network to achieve the highest recognition accuracy. The experimental results are achieved through different residual depths. In ResNet-34, the highest accuracy rate can reach 77%. This will effectively promote the process of emotion recognition research and promote the efficient development of online learning. © 2022 IEEE.

Keywords

convolutional neural network; depression recognition; MFCC; online learning; ResNet; Convolution; Deep neural networks; E-learning; Emotion Recognition; Speech recognition; Students; 'current; Key factors; Learning efficiency; Mel frequenc

Fulltext

XML

Search on Google

Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 2022 International Conference on Asian Language Processing, IALP 2022 Year: 2022 Document Type: Article

Similar

MEDLINE

LILACS

LIS

Fulltext

XML

Search on Google