Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
Show more facetsThese levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The corpus is available for Download at Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-201803…
The corpus is available for Download at Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-201803272 This is the downloadable version of Helsinki Corpus of Swahili 2.0 (HCS 2.0) Annotated Version, see http://urn.fi/urn:nbn:fi:lb-2016011301 for more information. The Helsinki Corpus of Swahili 2.…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Se…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Selkokieliset uutiset in Finnish (Yle Easy-to-read Finnish News). The dataset was created from the contents of the Yle News Archive for the language code "fi" for each month from the year 2011 to the ye…
The corpus, which is the downloadable version of the years 2013-2016 of the Aalto University DSP Course Conversation Cor…
The corpus, which is the downloadable version of the years 2013-2016 of the Aalto University DSP Course Conversation Corpus 2013-, is available in Kielipankki - the Language Bank of Finland.
The Helsinki Corpus of English Texts is a structured multi-genre diachronic corpus, which includes periodically organize…
The Helsinki Corpus of English Texts is a structured multi-genre diachronic corpus, which includes periodically organized text samples from Old, Middle and Early Modern English. This subcorpus contains the samples from Early Modern English only. This subcorpus is available for download in Kielipankki – the Language Ba…
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi), http://urn.fi/urn:nbn:fi:lb-2014052…
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi), http://urn.fi/urn:nbn:fi:lb-201405273. All the known letters, manuscripts and published works by Finnish author Aleksis Kivi (1834–1872). Most of the texts were written in Finnish while some of the letters and manuscripts are in Swedi…
This resource is available for download in Kielipankki – the Language Bank of Finland. This resource consists of .txt a…
This resource is available for download in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .wav files in four languages pertaining to the Finnish Christmas Gospel verses Luke 2. 1–20 The four languages include Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) and Olonets-Karelian (olo, aka …
This dataset contains browse-quality video files, accompanied by parallel multilingual subtitles and program metadata su…
This dataset contains browse-quality video files, accompanied by parallel multilingual subtitles and program metadata such as production years, genre classifications and topical segmentation timecodes from Yle production systems for 113 news, current affairs and factual programs. This dataset is split into these subti…
Until November 2020, this corpus is available via the LAT platform in the Language Bank of Finland, see Access location.…
Until November 2020, this corpus is available via the LAT platform in the Language Bank of Finland, see Access location. This corpus can no longer be accessed via the LAT interface, since The LAT service of the Language Bank of Finland was discontinued in November 2020. However, the same content is available for downl…
This content is available in Kielipankki. This collection contains two downloadable sets of Suomi24 data: "The Suomi24…
This content is available in Kielipankki. This collection contains two downloadable sets of Suomi24 data: "The Suomi24 Corpus 2001-2017, VRT version" and "The Suomi24 Corpus 2018-2020, VRT version". Together, the two corpora cover all the discussion forums of the Suomi24 online social networking website from 1st Jan…
The corpus is available in Kielipankki - the Language Bank of Finland's download service. NB: The data was reorganized i…
The corpus is available in Kielipankki - the Language Bank of Finland's download service. NB: The data was reorganized in smaller, more userfriendly packages on 15.3.2022. This subcorpus is part of the Corpus of Finnish Sign Language collected in the CFINSL project. The subcorpus comprises elicited narratives from 21 …