10-06-2014, 04:30 PM
Emotional speech recognition: Resources, features, and methods
Abstract
In this paper, we overview emotional speech recognition with three goals in mind. The first goal is to provide an up-to-date
record of the available emotional speech data collections. The number of emotional states, the language, the number
of speakers, and the kind of speech are briefly addressed. The second goal is to present the most frequent acoustic features
used for emotional speech recognition and to assess how the emotion affects them. Typical features are the pitch, the
formants, the vocal tract cross-section areas, the mel-frequency cepstral coefficients, the Teager energy operator-based features,
the intensity of the speech signal, and the speech rate. The third goal is to review techniques suitable for classifying
speech into emotional states. We examine classification techniques that exploit timing information separately from
those that ignore it. Classification techniques based on hidden Markov models, artificial neural networks, linear discriminant
analysis, k-nearest neighbors, and support vector machines are reviewed.
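As an illustration of the short-term acoustic features listed above, the sketch below extracts per-frame log-energy, zero-crossing rate, and an autocorrelation-based pitch estimate. It is a minimal NumPy-only sketch, not the paper's method; the frame length (25 ms), hop (10 ms), and pitch search range (60-400 Hz) are illustrative assumptions.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a signal into overlapping frames for short-term analysis."""
    n = 1 + max(0, (len(x) - frame_len) // hop)
    return np.stack([x[i * hop : i * hop + frame_len] for i in range(n)])

def short_term_features(x, sr, frame_len=400, hop=160):
    """Per-frame log-energy, zero-crossing rate, and autocorrelation pitch."""
    frames = frame_signal(x, frame_len, hop)
    energy = np.log(np.sum(frames ** 2, axis=1) + 1e-10)
    # Zero-crossing rate: fraction of adjacent samples whose sign differs.
    zcr = np.mean(np.abs(np.diff(np.sign(frames), axis=1)) > 0, axis=1)
    pitch = []
    for f in frames:
        f = f - f.mean()
        # Autocorrelation; keep non-negative lags only.
        ac = np.correlate(f, f, mode="full")[frame_len - 1:]
        lo, hi = sr // 400, sr // 60          # search lags for 60-400 Hz
        lag = lo + np.argmax(ac[lo:hi])
        pitch.append(sr / lag)
    return energy, zcr, np.array(pitch)

# Synthetic example: a 200 Hz vowel-like tone sampled at 16 kHz.
sr = 16000
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 200 * t)
energy, zcr, pitch = short_term_features(x, sr)
```

On the pure tone, the autocorrelation peak falls at the 200 Hz period, so the recovered pitch contour is flat at roughly 200 Hz; on real emotional speech, these contours vary with the speaker's state.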
Introduction
Emotional speech recognition aims at automatically
identifying the emotional or physical state of
a human being from his or her voice. The emotional
and physical states of a speaker are known as
emotional aspects of speech and are included in
the so-called paralinguistic aspects. Although the
emotional state does not alter the linguistic content,
it is an important factor in human communication,
because it provides feedback information in many
applications, as outlined next.
Outline
In Section 2, a corpus of 64 data collections is
reviewed, with emphasis on the data collection
procedures, the kind of speech (natural, simulated,
or elicited), the content, and other physiological
signals that may accompany the emotional speech.
In Section 3, short-term features (i.e., features
extracted on a per-frame basis) that are related
to the emotional content of speech are discussed. In
addition to short-term features, their contours are
of fundamental importance for emotional speech
recognition. Emotions affect the contour characteristics,
such as statistics and trends, as summarized
in Section 4. Emotion classification techniques
that exploit timing information and other techniques
that ignore it are surveyed in Section 5.
Therefore, Sections 3 and 4 aim at describing the
appropriate features to be used with the emotion
classification techniques reviewed in Section 5.
Finally, Section 6 concludes the tutorial by indicating
future research directions.
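The split between classifiers that exploit timing (e.g., hidden Markov models over frame sequences) and those that ignore it can be illustrated with a minimal sketch of the second kind: utterance-level statistics of a pitch contour (timing discarded) fed to a k-nearest-neighbor classifier. This is a NumPy-only illustration under assumed data; the contours, class labels, and statistic choices below are synthetic examples, not material from the paper.

```python
import numpy as np

def contour_stats(contour):
    """Utterance-level statistics of a feature contour (timing discarded):
    mean, standard deviation, range, and linear trend slope."""
    slope = np.polyfit(np.arange(len(contour)), contour, 1)[0]
    return np.array([contour.mean(), contour.std(),
                     contour.max() - contour.min(), slope])

def knn_predict(train_X, train_y, x, k=1):
    """Majority vote among the k nearest utterance-level feature vectors."""
    d = np.linalg.norm(train_X - x, axis=1)
    idx = np.argsort(d)[:k]
    vals, counts = np.unique(train_y[idx], return_counts=True)
    return vals[np.argmax(counts)]

# Hypothetical pitch contours (Hz): class 1 has a raised, rising pitch,
# class 0 a low, flat one -- purely synthetic toy data.
rng = np.random.default_rng(0)
def fake_contour(base, slope, n=100):
    return base + slope * np.arange(n) + rng.normal(0, 5, n)

X = np.stack([contour_stats(fake_contour(b, s))
              for b, s in [(220, 0.5), (230, 0.4), (120, 0.0), (125, 0.05)]])
y = np.array([1, 1, 0, 0])          # 1 = raised/rising, 0 = low/flat
query = contour_stats(fake_contour(225, 0.45))
pred = knn_predict(X, y, query, k=3)
```

Because only the summary statistics reach the classifier, two contours with the same mean, range, and trend are indistinguishable here, whereas a timing-aware model such as an HMM could still separate them by their temporal evolution.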
Data collections
A record of emotional speech data collections is
undoubtedly useful for researchers interested in
emotional speech analysis. An overview of 64 emotional
speech data collections is presented in Table
1. For each data collection, additional information
is also given, such as the speech language, the
number and profession of the subjects, other
physiological signals possibly recorded simultaneously
with speech, the purpose of the data collection
(emotional speech recognition or expressive synthesis),
the emotional states recorded, and the kind of
emotions (natural, simulated, or elicited).