Emotion corpora

6 Nov

One of the common ways that phoneticians and other researchers have looked at emotion-in-language is by studying acted affect. That is, you get a bunch of people to read number lists or the alphabet in “angry” voice, “happy” voice, etc. Then you see if other people can reliably guess the emotion and then you go and look for the acoustic correlates.

If you’re interested in this sort of thing, you could try the Emotional Prosody Speech and Transcripts corpus (if you’re at Stanford and you’ve gotten corpus access, you’ll find it at /afs/ir/data/linguistic-data/EmotionalProsodySpeechAndTranscripts).

Now, there are a number of known issues with acted data–which is that it is stereotyped in particular ways. And if you wanted to detect what’s going on in a call center, “angry actors” wouldn’t help you nearly as much as “actual callers who are annoyed/disappointed/etc”. If you’re curious about more naturalistic corpora/research, here are some resources you might find useful (they’re all on my web page about emotions and language: http://www.stanford.edu/~tylers/emotions.shtml).

My talk at Nuance (the Dragon Naturally Speaking and Siri folks): http://www.stanford.edu/~tylers/notes/papers/emotion/Nuance_emotion_detection_11-17-10_final.pptx. This is basically an intro for dealing with naturalistic emotional data for speech scientists and others interested in detection/recongition.
Notes on Clavel and Devillers (2011): http://www.stanford.edu/~tylers/notes/emotion/Comp_speech_special_issue_2011_reading_notes_Schnoebelen.pdf
Notes on Cowie and Cornelius (2003): http://www.stanford.edu/~tylers/notes/emotion/Cowie_Cornelius_2003_reading_notes_Schnoebelen.pdf
Maybe my notes on Amir and Cohen (2007) and a few others: http://www.stanford.edu/~tylers/notes/emotion/Various_detection_articles_reading_notes_Schnoebelen.pdf
You might poke around http://emotion-research.net/ for some more naturalistic corpora that are being used by people interested in emotion research. (And let me know what you find that’s useful.)

11/7/2011 post-script: If acted data suits your needs, you can also consider something other than English–for example, the Mandarin Affective Speech corpus will get you Chinese.

Tags: emotion, english, Mandarin, phonetics, phonology, prosody

Comments 1 Comment
Categories Uncategorized

One Response to “Emotion corpora”

Trackbacks/Pingbacks

Prosodically annotated corpora « Corpus linguistics - March 8, 2012
[…] my previous posts on emotion here and here for other resources–note that the two above are both […]

Some favorites

Intro to corpus linguistics

Here’s my presentation to Stanford undergrads about corpus linguistics. You’ll find it full of examples and resources. And even some findings. http://www.stanford.edu/~tylers/notes/presentations/IntroductionToCorpusLinguistics.pptx
Chat room corpus

Went hunting around for some chat room corpora today–I though I’d find tons and tons but really just turned up one resource. But it’s a big one: over 30 billion words across 47,860 English language news groups from Oct 2005 to Jan 2011. Posts that are not in English are pulled out and the people […]
African language corpora

There are over two thousand African languages, spoken (in situ) by 15% of the world’s population. In density of linguistic diversity it is rivaled only by New Guinea (which probably exceeds it to be honest). And yet it is the Electronic Dark Continent. The LRE Map will give you 663 corpora/computational tools on English. But (almost) […]
COCA: What a fantastic source of data!

Intro 425 million words from 1990-2011. I believe that one of the best resources out there for linguists (or anyone interested in language) is the Corpus of Contemporary American English (COCA). Mark Davies has put together a bunch of corpora and put together an easy-to-use interface so you can make sophisticated queries on vast amounts […]
What were the cultural keywords when you were born?

Raymond Williams published a fascinating (and often-cited) book called Keywords (first in the 70s, then an update in the 80s). It’s full of really interesting stuff (my notes are here). But Williams’ words were just sort of the ones he saw flying around and took an interest in. This post gives you something a little more […]

Search

Emotion corpora

One Response to “Emotion corpora”

Trackbacks/Pingbacks

Leave a comment Cancel reply

Recent Posts

Archives

Meta

On Twitter…

Some favorites

Intro to corpus linguistics

Chat room corpus

African language corpora

COCA: What a fantastic source of data!

What were the cultural keywords when you were born?

Search

Emotion corpora

Share this:

Related

One Response to “Emotion corpora”

Trackbacks/Pingbacks

Leave a comment Cancel reply

Recent Posts

Archives

Meta

On Twitter…

Some favorites

Intro to corpus linguistics

Chat room corpus

African language corpora

COCA: What a fantastic source of data!

What were the cultural keywords when you were born?