VLO | CLARIN ERIC

SmartKom Audio

(Part of Bavarian Archive for Speech Signals (BAS))

447
1

This corpus contains the audio recordings of all actors who use the SmartKom system; it covers the audio recordings (no …

This corpus contains the audio recordings of all actors who use the SmartKom system; it covers the audio recordings (no video) and annotations of all three original SmartKom corpora Public, Mobile and Home. Naive users were asked to test a 'prototype' for a market study not knowing that the system was in fact controlle…

German English

Landing page for this record

VCR

MultiCHannel Articulatory database: English

(Part of Bavarian Archive for Speech Signals (BAS))

9
1
1

The MOCHA database was compiled as part of the Engineering and Physical Sciences Research Council grant number:GR/L78680…

The MOCHA database was compiled as part of the Engineering and Physical Sciences Research Council grant number:GR/L78680 : "Speech recognition using articulatory data." It features a set of 460 short sentences designed to include the main connected speech processes in English (e.g. assimilations, weak forms ...). All r…

English

Landing page for this record

VCR

The Zurich Tangram Corpus - BAS Edition

(Part of Bavarian Archive for Speech Signals (BAS))

86
1
1

This corpus contains tasks, where one subject (the instructor) describes different Tangram figures to another subject (t…

This corpus contains tasks, where one subject (the instructor) describes different Tangram figures to another subject (the receiver) so that the receiver can recreate the same order of figures that the instructor has in front of them. The subjects initially don't know each other and work together to solve these tasks i…

English German

Landing page for this record

VCR

The Zurich Tangram Corpus - UZH Edition

(Part of Bavarian Archive for Speech Signals (BAS))

86
1

This corpus contains tasks, where one subject (the instructor) describes different Tangram figures to another subject (t…

This corpus contains tasks, where one subject (the instructor) describes different Tangram figures to another subject (the receiver) so that the receiver can recreate the same order of figures that the instructor has in front of them. The subjects initially don't know each other and work together to solve these tasks i…

English German

Landing page for this record

VCR

BAS SI100

(Part of Bavarian Archive for Speech Signals (BAS))

The corpus contains read speech of 101 different speakers (50 female, 50 male, 1 unknown). Each speaker has read approx.…

The corpus contains read speech of 101 different speakers (50 female, 50 male, 1 unknown). Each speaker has read approx. 100 sentences from either the SZ subcorpus or the CeBit subcorpus. The language is German. The subcorpus SZ contains 544 sentences from newspaper articles ("Sueddeutsche Zeitung"). The subcorpus Ce…

English German

Landing page for this record

VCR

BAS TAXI

(Part of Bavarian Archive for Speech Signals (BAS))

86
1
1

The TAXI dialog database was created in June 2001 in collaboration with the DFKI, Saarbruecken. TAXI contains 86 recorde…

The TAXI dialog database was created in June 2001 in collaboration with the DFKI, Saarbruecken. TAXI contains 86 recorded dialogues between a cab dispatcher and a client recorded over public phone lines (network and GSM). The dispatcher always speaks German, while the clients always speaks English. Starting from versio…

English German

Landing page for this record

VCR

aGender

(Part of Bavarian Archive for Speech Signals (BAS))

3614
1

The speech corpus aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous…

The speech corpus aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, and read text + answered some open questions. The purpose of the corpus is the automatic detection of gender and/or age …

English German

Landing page for this record

VCR

BAS HEMPEL

(Part of Bavarian Archive for Speech Signals (BAS))

Hempels Sofa is a collection of more than 3900 spontaneous speech items recorded as extra material during the German Spe…

Hempels Sofa is a collection of more than 3900 spontaneous speech items recorded as extra material during the German SpeechDat-II project. Speakers were asked to report what they had been doing during the last hour: "Was haben Sie in der letzten Stunde gemacht?". This item was recorded as the last item of the recording…

English German

Landing page for this record

VCR

Nautilus Speaker Characterization

(Part of Bavarian Archive for Speech Signals (BAS))

300
1

NSC contains scripted, semi-spontaneous, and spontaneous human-human dialogs. In total, 300 speakers of German without n…

NSC contains scripted, semi-spontaneous, and spontaneous human-human dialogs. In total, 300 speakers of German without noticeable accent participated and were recorded in an acoustically-isolated room. Interactions between speakers and their interlocutor are provided in separate mono files, accompanied by timestamps an…

English German

Landing page for this record

VCR

BAS RVG1_CLARIN

(Part of Bavarian Archive for Speech Signals (BAS))

The corpus is a collection of more than 500 speakers of different dialect regions of Germany. The recordings were made u…

The corpus is a collection of more than 500 speakers of different dialect regions of Germany. The recordings were made using four different microphones (two in low and two in high quality) and consist of single digits, connected digits, phone numbers, phonetically balanced sentences, computer command phrases prompted o…

English German

Landing page for this record

VCR

CLARIN Virtual Language Observatory

Facets

Language

Collection

Resource type

Modality

Format

Keyword

Genre

Subject

Country

Organisation

Data provider

National project

Search options

Temporal Coverage

Availability

Search options

Search results

SmartKom Audio

MultiCHannel Articulatory database: English

The Zurich Tangram Corpus - BAS Edition

The Zurich Tangram Corpus - UZH Edition

BAS SI100

BAS TAXI

aGender

BAS HEMPEL

Nautilus Speaker Characterization

BAS RVG1_CLARIN