aGender

Name	aGender
Description	The aGender speech corpus contains recordings of read and (semi-)spontaneous speech made over public telephone lines. Native German speakers called a voice portal from their private phones, read text, and answered some open questions. The purpose of the corpus is the automatic detection of gender and/or age (7 mixed classes covering ages 7 to 80). The corpus contains the voices of 945 German speakers (a minimum of approximately 100 speakers per class), each delivering 18 speech items in up to six different sessions. Neither the time and date of the individual recording sessions nor the total number of sessions per speaker was controlled. The audio signal was recorded over standard cell phones (GSM standard) and landline connections at 8000 Hz in 8-bit A-law format. The data were then expanded to 8000 Hz, 16-bit PCM (only 13 bits are valid). The selection of speakers is approximately evenly distributed over the seven target classes, with class 1 also being balanced for gender. The read material is an altered version of the SpeechDat text material, containing short fixed and free text typical of automated call centers. A typical utterance is about 2 seconds long, but some utterances last between 3 and 6 seconds. In total, the corpus consists of 47 hours of speech. Two sets were defined on the data: a training set (81.5%) and a test set (175 speakers, 25 per class, 18.5%), with disjoint speaker sets. No class information is given for the test set in this corpus. Refer to the section 'Evaluation' on how to receive an evaluation from Telekom Labs. Users of this speech corpus are required to report any scientific publications based on these data to Felix Burkhardt (Felix.Burkhardt@telekom.de).
The aGender project aims at automatic speaker classification into 7 age and gender classes over public telephone connections, including cellular connections. aGender was created under the supervision and funding of German Telekom, Berlin, Germany.
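The description notes that the 8-bit A-law data were expanded to 16-bit PCM with only 13 valid bits. The sketch below illustrates why that is the case, assuming the expansion followed the standard ITU-T G.711 A-law decoding rule (the record does not state which tool was used). The function names are illustrative and not part of the corpus distribution.

```python
def alaw_to_pcm16(code: int) -> int:
    """Expand one 8-bit A-law code to a 16-bit linear PCM sample (G.711 rule)."""
    code ^= 0x55                      # undo the even-bit inversion applied by the encoder
    mantissa = (code & 0x0F) << 4     # 4-bit quantisation step within the segment
    segment = (code & 0x70) >> 4      # 3-bit segment (chord) number
    if segment == 0:
        magnitude = mantissa + 8
    else:
        magnitude = (mantissa + 0x108) << (segment - 1)
    # After the XOR, a set sign bit marks a positive sample.
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(data: bytes) -> list[int]:
    """Expand a buffer of A-law bytes to a list of 16-bit PCM sample values."""
    return [alaw_to_pcm16(b) for b in data]


if __name__ == "__main__":
    samples = decode_alaw(bytes(range(256)))
    # Every decoded value is a multiple of 8, so the three lowest bits of the
    # 16-bit word carry no information: only 13 bits are significant.
    print(all(s % 8 == 0 for s in samples))
    print(min(samples), max(samples))
```

Under this assumption, the decoded samples span the 16-bit range only coarsely, which matters when normalising or comparing levels against natively recorded 16-bit audio.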
Collection	Bavarian Archive for Speech Signals (BAS)
Language	English German
Modality	spoken
Continent	Europe
Country	Germany
Genre	SpeechDat-style recordings for the purpose of automatic speaker classification according to age and gender over telephone
Subject	SpeechDat-style recordings for the purpose of automatic speaker classification according to age and gender over telephone
Organisation	Deutsche Telekom, Bonn, Germany; German Telekom, Berlin; Bavarian Archive for Speech Signals, Ludwig-Maximilians-Universität München
Keyword	Age and Gender Speech Corpus
National project	CLARIN-D
Resource type	audio text
Data provider	Bayerisches Archiv für Sprachsignale
Temporal coverage	[2008 TO 2008]
Record identifier	HDL11022/1009-0000-0001-1500-7@format=cmdi

Linked resource	CLARINDocumentation.zip

The following records may also interest you: 144001, 602301, 740703, 396304, 501001

Each of these records describes a speaker of the speech corpus BAS aGender. If the gender is given as unspecified, the speaker belongs to the aGender speaker class 1 (age 7 - 14), for which no gender information is given in the corpus.