Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
Use the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The resource NOJU is a terminological database containing terms, definitions and other conceptual information in Norwegian and German within legal domains.
Recording equipment: The recordings were made with a cassette recorder (Sony TC-D5M) and Sony lavalier microphones. The recordings took place in the speakers’ homes or in a hotel room. Sigurd Nordlie was recorded in his office at the University of Oslo. The tapes were digitized in the 1990s. The speakers …
The Norwegian Spanish Parallel Corpus (NSPC) was created at the University of Bergen, Norway. The corpus is primarily constructed for research in Translation Studies, and is built to be roughly comparable to the Spanish-English P-ACTRES corpus. The NSPC is a parallel, unidirectional translation corpus of contemporary N…
The resource is available via Kielipankki – The Language Bank of Finland. This parallel dataset can be used for training simplification models and/or studying simplification strategies that experts apply for Finnish news articles. The languages of the dataset are Finnish and Easy-to-read Finnish. The articles of which…
The corpus, containing articles from YLE (https://yle.fi) from 2011-2018, is available in the download service of Kielipankki, the Language Bank of Finland, at korp.csc.fi/download. Finnish description, translated: Articles from the Yle news archive at YLE (https://yle.fi) from the years 2011-2018 will be published in the Kielipankki download service at korp.csc.fi/download.
This resource is available via Korp in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .wav files in four languages pertaining to the Finnish Christmas Gospel verses, Luke 2:1–20. The four languages include Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) and Olonets-Karelian (olo, aka Livv…
This audiovisual dataset contains
* audio files, subtitles and ground truth transcripts, speaker diarizations and NER annotations of 16 factual programs in Finnish and Swedish
* video files, subtitles, metadata and annotations for 8 factual programs that have been used for demonstration and test purposes in the MeMAD …
The Helsinki Corpus of Scottish Correspondence comprises circa 0.4 million words (0.5 million tokens) of early Scottish correspondence by male and female writers dating from the period 1540-1750. Unlike the majority of digital resources available for historical linguistics at present, the corpus consists of transcripts…
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). This dataset consists of the Yle Selkokieliset uutiset in Finnish (Yle Easy-to-read Finnish News). The dataset was created from the contents of the Yle News Archive for the language code "fi" for each month from the year 2011 to the y…
The resource, containing entire newspaper and magazine articles, has been made available for download in Kielipankki - the Language Bank of Finland (http://urn.fi/urn:nbn:fi:lb-201712201). The data consists of source data in PDF form or as plain text and is not annotated. An annotated version (lehdet90ff-vrt-v2) is av…