Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
Show more facetsThese levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The resource contains the vrt data of the Ylilauta Corpus (http://urn.fi/urn:nbn:fi:lb-2015031802).
The resource contains the vrt data of the Ylilauta Corpus (http://urn.fi/urn:nbn:fi:lb-2015031802).
The resource, containing entire newspaper and magazine articles, has been made available in Kielipankki - the Language B…
The resource, containing entire newspaper and magazine articles, has been made available in Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-2016112315 Reference instructions for this older version: University of Helsinki (2016). Corpus of Finnish Magazines and Newspapers from the 1990s and 20…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains the data, where the OCR (…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains the data, where the OCR (optical character recognition) has been checked. The size of this sub-corpus is 670 000 tokens. It contains one 1935 issue from 'Historiallinen Aikakauskirja', 'Lakimies' and 'Suomi', as well as 4 iss…
The corpus consists of 953 articles (193,742 word tokens) with six named entity classes (organization, location, person,…
The corpus consists of 953 articles (193,742 word tokens) with six named entity classes (organization, location, person, product, event,and date). The articles are extracted from the archives of Digitoday, a Finnish online technology news source. The data sets are available at https://github.com/mpsilfve/finer-data a…
The resource is available for download in Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-2017…
The resource is available for download in Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-2017021501 (see there the Suomi24-2015-04-02_VRT zip file). For more information see http://urn.fi/urn:nbn:fi:lb-2015120101
The corpus contains all the discussion forums of the Suomi24 online social networking website from 1st January 2001 to 3…
The corpus contains all the discussion forums of the Suomi24 online social networking website from 1st January 2001 to 31st December 2017 available in the Suomi24 API. Researchers can download the entire corpus (see http://urn.fi/urn:nbn:fi:lb-2020021801). Updates: 2021-04-21: In the updated version 1.2, some new a…
This resource is available via Korp in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .…
This resource is available via Korp in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .wav files in four languages pertaining to the Finnish Christmas Gospel verses Luke 2. 1–20 The four languages include Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) and Olonets-Karelian (olo, aka Livv…
The Terminological Vocabulary of Kela – Benefit-related Concepts, 4th edition (TSK 49) contains information on more than…
The Terminological Vocabulary of Kela – Benefit-related Concepts, 4th edition (TSK 49) contains information on more than 500 concepts in term records and concept diagrams. The concepts have been given definitions and term recommendations in Finnish and Swedish. The relations between the concepts are illustrated with th…
This corpus is available in Kielipankki, Korp service. The corpus contains all the discussion forums of the Suomi24 on…
This corpus is available in Kielipankki, Korp service. The corpus contains all the discussion forums of the Suomi24 online social networking website from 1st January 2018 to 31st December 2020 obtained via the Suomi24 API. Researchers can also download the entire corpus (for downloadable versions, see the resource g…
The Helsinki Corpus of Scottish Correspondence comprises circa 0.4 million words (0.5 million tokens) of early Scottish …
The Helsinki Corpus of Scottish Correspondence comprises circa 0.4 million words (0.5 million tokens) of early Scottish correspondence by male and female writers dating from the period 1540-1750. Unlike the majority of digital resources available for historical linguistics at present, the corpus consists of transcripts…