Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
Use the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
This corpus contains the audio recordings of all actors who use the SmartKom system; it covers the audio recordings (no video) and annotations of all three original SmartKom corpora Public, Mobile and Home. Naive users were asked to test a 'prototype' for a market study not knowing that the system was in fact controlle…
The Stichting Nederlands Dagboekarchief collects and manages (unpublished) diaries, travel journals, memoirs, letters and poetry albums from the entire Dutch-speaking region and makes them accessible for research and education and for private study. The collection is owned and managed by the Stichting Ned…
The MOCHA database was compiled as part of the Engineering and Physical Sciences Research Council grant number GR/L78680: "Speech recognition using articulatory data." It features a set of 460 short sentences designed to include the main connected speech processes in English (e.g. assimilations, weak forms, ...). All r…
The songs in this collection were recorded and annotated as part of the project 'Metre and Melody in Dinka Speech and Song', a project carried out by researchers from the University of Edinburgh and the School of Oriental and African Studies in London, and funded by the UK Arts and Humanities Research Council as part o…
The RVG-J Corpus (Regional Variants of German - Junior) was recorded in 2001 at the Institute of Phonetics and Speech Communication at the University of Munich, Germany. The corpus contains both read and non-scripted German utterances. It comprises the original RVG prompts (telephone numbers, sentences, commands, digit…
The TAXI dialog database was created in June 2001 in collaboration with the DFKI, Saarbruecken. TAXI contains 86 recorded dialogues between a cab dispatcher and a client, recorded over public phone lines (network and GSM). The dispatcher always speaks German, while the clients always speak English. Starting from versio…
The CI_2 corpora contain German speech recordings of 48 cochlear implant users (CI) and 48 speakers without hearing impairment (control group, KG). The data were analyzed in Veronika Neumeyer's dissertation "Akustische Analysen der Sprachproduktion von CI-Trägern" (2015). CI_2_Vowels contains recordings used for the an…
The speech corpus aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phones, read text aloud and answered some open questions. The purpose of the corpus is the automatic detection of gender and/or age …
Hempels Sofa is a collection of more than 3900 spontaneous speech items recorded as extra material during the German SpeechDat-II project. Speakers were asked to report what they had been doing during the last hour: "Was haben Sie in der letzten Stunde gemacht?". This item was recorded as the last item of the recording…