Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
Show more facetsThese levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
This database contains speech signals of dialogues in which a subject was recorded during a conversation via a spontaneo…
This database contains speech signals of dialogues in which a subject was recorded during a conversation via a spontaneous speech translation system. The response of the system was designed to invoke emotions (e.g. anger) in the subjects. It is part of the larger Verbmobil 2 speech data collection. Starting from BAS Cl…
The KEC contains 79 speakers of Southern German. Two speakers, usually acquainted with each other, had an one hour long …
The KEC contains 79 speakers of Southern German. Two speakers, usually acquainted with each other, had an one hour long conversation in separate booths. Each speaker is recorded in a separate channel (acoustically isolated). Manual annotation at the word level is provided, automatic annotation at the segment level as w…
The SMARTWEB UMTS data collection, of which the SMC corpus is a part, was created within the publicly funded German Smar…
The SMARTWEB UMTS data collection, of which the SMC corpus is a part, was created within the publicly funded German SmartWeb project in the years 2004 - 2006. It comprises a collection of user queries to a naturally spoken Web interface with the main focus on the soccer world series in 2006. The SMC corpus itself conta…
De Stichting Nederlands Dagboekarchief verzamelt en beheert (ongepubliceerde) dagboeken, reisdagboeken, memoires, brieve…
De Stichting Nederlands Dagboekarchief verzamelt en beheert (ongepubliceerde) dagboeken, reisdagboeken, memoires, brieven en poëzie-albums uit het hele Nederlandse taalgebied en maakt deze toegankelijk voor wetenschap en onderwijs en voor particulier onderzoek. De collectie is in eigendom en beheer van de Stichting Ned…
The SIGNUM Database contains both isolated and continuous utterances of various signers. Since we use a vision-based app…
The SIGNUM Database contains both isolated and continuous utterances of various signers. Since we use a vision-based approach for sign language recognition the corpus was recorded on video. For quick random access to individual frames, each video clip is stored as a sequence of images. The vocabulary comprises 450 basi…
WaSeP contains recordings of one female and one male speaker, both professional actors, uttering single German nouns and…
WaSeP contains recordings of one female and one male speaker, both professional actors, uttering single German nouns and pseudowords in multiple emotional prosodies. This edition improves the segmentation of the phonetic annotation, adds Praat TextGrid files and removes a few irregular items.
This corpus contains speech recordings of normal hearing speakers and speakers equipped with Cochlear Implants (CI), as …
This corpus contains speech recordings of normal hearing speakers and speakers equipped with Cochlear Implants (CI), as used for analysis in the Master thesis of Veronika Neumeyer (2009, LMU München). Speech data were collected with the software SpeechRecorder, for each recording a BPF file was generated (*.par), on wh…
The songs in this collection were recorded and annotated as part of the project 'Metre and Melody in Dinka Speech and So…
The songs in this collection were recorded and annotated as part of the project 'Metre and Melody in Dinka Speech and Song', a project carried out by researchers from the University of Edinburgh and the School of Oriental and African Studies in London, and funded by the UK Arts and Humanities Research Council as part o…
The corpus SC2 contains read speech of 10 different speakers with screen prompted 'automobil diagnosis phrases' recorded…
The corpus SC2 contains read speech of 10 different speakers with screen prompted 'automobil diagnosis phrases' recorded under real conditions in two different car maintenance halls. The language is German. All speakers are male native Germans and have never participated in such a task before. They are all experts in t…