Title : ( Automatic speech emotion recognition based on hybrid features with ANN, LDA and K_NN classifiers )
Authors: Mohammed Jawad Al Dujaili , Abbas Ebrahimi Moghadam ,Access to full-text not allowed by authors
Abstract
Despite many efforts in Speech Emotion Recognition, there is still a big gap between natural human feelings and computer perception. In this article, the recognition of the speaker’s emotions in Persian and German has been examined. For this purpose, Persian emotional speech utterances have been expressed, including 748 sentences with seven feelings of Neutral, Disgust, Fear, Anger, Sadness, Boredom and Happiness. German emotional speech utterances consist of 536 sentences created by professional actors in a laboratory environment, 16 of which with seven different feelings of Happiness, hatred, naturalness, fear, Sadness, Anger, and fatigue. After extracting widely used properties such as MFCC Mel Frequency Cepstral Coefficients and its derivatives, local frequency perturbation coefficient (Jitter), and local perturbation coefficient (Shimmer), various features of this database are extracted separately because of the vast number of options. Reducing feature space is required before applying the principal component classification (PCA) algorithm. Also, three classifications of Artificial neural network (ANN), Linear Discriminant Analysis (LDA), and K_Nearest Neighbor (K_NN) have been used to classify emotions. For the German database, the top results were obtained by fusing the MFCC + Shimmer properties and LDA classification with a precision detection of 91.26% and a runtime execution of 0.43 s, and the best results for the Persian database were obtained by fusing the Jitter + Shimmer properties and K_NN classification with a precision detection of 91.5% and a runtime execution of 0.65 s. The results show that the ability to distinguish attribute vectors is quite different for each emotional state. Expression of emotions and their effect on speech differ in Persian and German.
Keywords
, Speech emotion recognition (SER), MFCC, Jitter, Shimmer, PCA, ANN, LDA, KNN@article{paperid:1094302,
author = {Mohammed Jawad Al Dujaili and Ebrahimi Moghadam, Abbas},
title = {Automatic speech emotion recognition based on hybrid features with ANN, LDA and K_NN classifiers},
journal = {Multimedia Tools and Applications},
year = {2023},
volume = {82},
number = {27},
month = {November},
issn = {1380-7501},
pages = {42783--42801},
numpages = {18},
keywords = {Speech emotion recognition (SER); MFCC; Jitter; Shimmer; PCA; ANN; LDA; KNN},
}
%0 Journal Article
%T Automatic speech emotion recognition based on hybrid features with ANN, LDA and K_NN classifiers
%A Mohammed Jawad Al Dujaili
%A Ebrahimi Moghadam, Abbas
%J Multimedia Tools and Applications
%@ 1380-7501
%D 2023