Arabic speaker emotion classification using rhythm metrics and neural networks

Arabic speaker emotion classification using rhythm metrics and neural networks. Mefiah, A., Alotaibi, Y. A., & Selouani, S. In 2015 23rd European Signal Processing Conference (EUSIPCO), pages 1426-1430, Aug, 2015.

Paper doi abstract bibtex

In this paper, rhythm metrics are calculated and used to classify five Arabic speech emotions; namely, neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions in their speech by speaking three selected sentences two times each. A human perception test was conducted using nine listeners (male and female). The results of a neural network-based automatic emotion recognition system using rhythm metrics were similar to the human perception test results, although less accurate. Anger was the most recognized speaker emotion and happiness was the least. One of our findings is that the emotions of male speakers are easier to recognize than those of female speakers. In addition, we found that the neural networks and rhythm metrics can be used for speaker emotion recognition using speech signals, but only when the dataset size is large enough.

@InProceedings{7362619,
  author = {A. Mefiah and Y. A. Alotaibi and S. Selouani},
  booktitle = {2015 23rd European Signal Processing Conference (EUSIPCO)},
  title = {Arabic speaker emotion classification using rhythm metrics and neural networks},
  year = {2015},
  pages = {1426-1430},
  abstract = {In this paper, rhythm metrics are calculated and used to classify five Arabic speech emotions; namely, neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions in their speech by speaking three selected sentences two times each. A human perception test was conducted using nine listeners (male and female). The results of a neural network-based automatic emotion recognition system using rhythm metrics were similar to the human perception test results, although less accurate. Anger was the most recognized speaker emotion and happiness was the least. One of our findings is that the emotions of male speakers are easier to recognize than those of female speakers. In addition, we found that the neural networks and rhythm metrics can be used for speaker emotion recognition using speech signals, but only when the dataset size is large enough.},
  keywords = {emotion recognition;natural language processing;neural nets;signal classification;speaker recognition;Arabic speaker emotion classification;rhythm metrics;Arabic speech emotions classification;neutral;sad;happy;surprised;angry;human perception test;neural network-based automatic emotion recognition system;female speakers;speaker emotion recognition;speech signals;Speech;Rhythm;Measurement;Speech processing;Speech recognition;Emotion recognition;Feature extraction;Emotion;Arabic;corpus;classification},
  doi = {10.1109/EUSIPCO.2015.7362619},
  issn = {2076-1465},
  month = {Aug},
  url = {https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570104855.pdf},
}

Downloads: 0

{"_id":"ATmRwYQSxix5mCzD8","bibbaseid":"mefiah-alotaibi-selouani-arabicspeakeremotionclassificationusingrhythmmetricsandneuralnetworks-2015","downloads":0,"creationDate":"2016-02-17T10:30:06.361Z","title":"Arabic speaker emotion classification using rhythm metrics and neural networks","author_short":["Mefiah, A.","Alotaibi, Y. A.","Selouani, S."],"year":2015,"bibtype":"inproceedings","biburl":"https://raw.githubusercontent.com/Roznn/EUSIPCO/main/eusipco2015url.bib","bibdata":{"bibtype":"inproceedings","type":"inproceedings","author":[{"firstnames":["A."],"propositions":[],"lastnames":["Mefiah"],"suffixes":[]},{"firstnames":["Y.","A."],"propositions":[],"lastnames":["Alotaibi"],"suffixes":[]},{"firstnames":["S."],"propositions":[],"lastnames":["Selouani"],"suffixes":[]}],"booktitle":"2015 23rd European Signal Processing Conference (EUSIPCO)","title":"Arabic speaker emotion classification using rhythm metrics and neural networks","year":"2015","pages":"1426-1430","abstract":"In this paper, rhythm metrics are calculated and used to classify five Arabic speech emotions; namely, neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions in their speech by speaking three selected sentences two times each. A human perception test was conducted using nine listeners (male and female). The results of a neural network-based automatic emotion recognition system using rhythm metrics were similar to the human perception test results, although less accurate. Anger was the most recognized speaker emotion and happiness was the least. One of our findings is that the emotions of male speakers are easier to recognize than those of female speakers. In addition, we found that the neural networks and rhythm metrics can be used for speaker emotion recognition using speech signals, but only when the dataset size is large enough.","keywords":"emotion recognition;natural language processing;neural nets;signal classification;speaker recognition;Arabic speaker emotion classification;rhythm metrics;Arabic speech emotions classification;neutral;sad;happy;surprised;angry;human perception test;neural network-based automatic emotion recognition system;female speakers;speaker emotion recognition;speech signals;Speech;Rhythm;Measurement;Speech processing;Speech recognition;Emotion recognition;Feature extraction;Emotion;Arabic;corpus;classification","doi":"10.1109/EUSIPCO.2015.7362619","issn":"2076-1465","month":"Aug","url":"https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570104855.pdf","bibtex":"@InProceedings{7362619,\n author = {A. Mefiah and Y. A. Alotaibi and S. Selouani},\n booktitle = {2015 23rd European Signal Processing Conference (EUSIPCO)},\n title = {Arabic speaker emotion classification using rhythm metrics and neural networks},\n year = {2015},\n pages = {1426-1430},\n abstract = {In this paper, rhythm metrics are calculated and used to classify five Arabic speech emotions; namely, neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions in their speech by speaking three selected sentences two times each. A human perception test was conducted using nine listeners (male and female). The results of a neural network-based automatic emotion recognition system using rhythm metrics were similar to the human perception test results, although less accurate. Anger was the most recognized speaker emotion and happiness was the least. One of our findings is that the emotions of male speakers are easier to recognize than those of female speakers. In addition, we found that the neural networks and rhythm metrics can be used for speaker emotion recognition using speech signals, but only when the dataset size is large enough.},\n keywords = {emotion recognition;natural language processing;neural nets;signal classification;speaker recognition;Arabic speaker emotion classification;rhythm metrics;Arabic speech emotions classification;neutral;sad;happy;surprised;angry;human perception test;neural network-based automatic emotion recognition system;female speakers;speaker emotion recognition;speech signals;Speech;Rhythm;Measurement;Speech processing;Speech recognition;Emotion recognition;Feature extraction;Emotion;Arabic;corpus;classification},\n doi = {10.1109/EUSIPCO.2015.7362619},\n issn = {2076-1465},\n month = {Aug},\n url = {https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570104855.pdf},\n}\n\n","author_short":["Mefiah, A.","Alotaibi, Y. A.","Selouani, S."],"key":"7362619","id":"7362619","bibbaseid":"mefiah-alotaibi-selouani-arabicspeakeremotionclassificationusingrhythmmetricsandneuralnetworks-2015","role":"author","urls":{"Paper":"https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570104855.pdf"},"keyword":["emotion recognition;natural language processing;neural nets;signal classification;speaker recognition;Arabic speaker emotion classification;rhythm metrics;Arabic speech emotions classification;neutral;sad;happy;surprised;angry;human perception test;neural network-based automatic emotion recognition system;female speakers;speaker emotion recognition;speech signals;Speech;Rhythm;Measurement;Speech processing;Speech recognition;Emotion recognition;Feature extraction;Emotion;Arabic;corpus;classification"],"metadata":{"authorlinks":{}},"downloads":0},"search_terms":["arabic","speaker","emotion","classification","using","rhythm","metrics","neural","networks","mefiah","alotaibi","selouani"],"keywords":["emotion recognition;natural language processing;neural nets;signal classification;speaker recognition;arabic speaker emotion classification;rhythm metrics;arabic speech emotions classification;neutral;sad;happy;surprised;angry;human perception test;neural network-based automatic emotion recognition system;female speakers;speaker emotion recognition;speech signals;speech;rhythm;measurement;speech processing;speech recognition;emotion recognition;feature extraction;emotion;arabic;corpus;classification"],"authorIDs":[],"dataSources":["eov4vbT6mnAiTpKji","knrZsDjSNHWtA9WNT"]}