Name
|
|
Description
|
-
Hempels Sofa is a collection of more than 3900 spontaneous speech items recorded as extra material during the German SpeechDat-II project. Speakers were asked to report what they had been doing during the last hour: "Was haben Sie in der letzten Stunde gemacht?". This item was recorded as the last item of the recording session. Speakers had become acquainted with the recording procedure and they were quite relaxed because they knew that this item was the last to be recorded. This resulted in quite natural, colloquial speech, sometimes with marked regional accent. The corpus collection is described in more detail in the LREC2002 paper "Three New Corpora at the Bavarian Archive for Speech Signals - and a First Step Towards Distributed Web-Based Recording" by C. Draxler and F. Schiel. This paper is contained in this database in file DOC/BASCORPO.PDF; it also contains links to related SpeechDat documents. Note: the name of the corpus refers to the German proverbial phrase "wie bei Hempels unter'm Sofa". This phrase is often used to indicate that something is not well cleaned-up -- not dirty, just in its everyday state when one is not expecting visitors. I thought the phrase to be appropriate for this data collection because quite often when listening to the recordings one gets the impression of sitting next to the speaker on the sofa in a common living room. Note: Starting from version 2.0 (CLARIN Repository Version 3), HEMPEL is distributed as an emuR compatible emuDB. Version 2.1 (CLARIN Repository 4) is distributed without the MAU (phonetic segmentation) tier, as it was found to lack in accuracy.
-
Hempels Sofa is a collection of more than 3900 spontaneous speech items
recorded as extra material during the German SpeechDat-II project. Speakers
were asked to report what they had been doing during the last hour: "Was
haben Sie in der letzten Stunde gemacht?". This item was recorded as the
last item of the recording session. Speakers had become acquainted with
the recording procedure and they were quite relaxed because they knew
that this item was the last to be recorded. This resulted in quite
natural, colloquial speech, sometimes with marked regional accent.
The corpus collection is described in more detail in the LREC2002 paper
"Three New Corpora at the Bavarian Archive for Speech Signals - and a First
Step Towards Distributed Web-Based Recording" by C. Draxler and F. Schiel.
This paper is contained in this database in file DOC/BASCORPO.PDF; it also
contains links to related SpeechDat documents.
Note: the name of the corpus refers to the German proverbial phrase: "wie
bei Hempels unter'm Sofa". This phrase is often used to indicate that something
is not well cleaned-up -- not dirty, just in its everyday state when one is not
expecting visitors. I thought the phrase to be appropriate for this data collection
because quite often when listening to the recordings one gets the impression of
sitting next to the speaker on the sofa in a common living room.
|
Collection
|
-
Bavarian Archive for Speech Signals (BAS)
|
Language
|
|
Modality
|
|
Continent
|
|
Country
|
|
Genre
|
-
reporting about what the speaker had been doing during the last hour
|
Subject
|
-
reporting about what the speaker had been doing during the last hour
|
Organisation
|
-
Bavarian Archive for Speech Signals, Ludwig-Maximilians-Universität München
|
Keyword
|
-
SpeechDat II - Hempels Sofa
|
National project
|
|
Resource type
|
|
Data provider
|
-
Bayerisches Archiv für Sprachsignale
|
Temporal coverage
|
|
Record identifier
|
|