Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
Show more facetsThese levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The corpus is available in Kielipankki - the Language Bank of Finland's download service. NB: The Finnish acronym for th…
The corpus is available in Kielipankki - the Language Bank of Finland's download service. NB: The Finnish acronym for this corpus used to be "Digilib", but the acronym "klk" and the short name klk-fi-1920-dl are recommended instead of Digilib-1920-dl. This corpus consists of the OCR results of the material published f…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains the data, where the OCR (…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains the data, where the OCR (optical character recognition) has been checked. The size of this sub-corpus is 670 000 tokens. It contains one 1935 issue from 'Historiallinen Aikakauskirja', 'Lakimies' and 'Suomi', as well as 4 iss…
The Terminological Vocabulary of Kela – Benefit-related Concepts, 4th edition (TSK 49) contains information on more than…
The Terminological Vocabulary of Kela – Benefit-related Concepts, 4th edition (TSK 49) contains information on more than 500 concepts in term records and concept diagrams. The concepts have been given definitions and term recommendations in Finnish and Swedish. The relations between the concepts are illustrated with th…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Se…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Selkokieliset uutiset in Finnish (Yle Easy-to-read Finnish News). The dataset was created from the contents of the Yle News Archive for the language code "fi" for each month from the year 2011 to the ye…
This resource is available for download in Kielipankki – the Language Bank of Finland. This is a parallel corpus create…
This resource is available for download in Kielipankki – the Language Bank of Finland. This is a parallel corpus created of the Yle news articles from 2014-2018 by aligning the standard Finnish versions with the easy-language versions. The dataset, created by Anna Dmitrieva and available in CSV format, is aligned on t…
The corpus contains issues of the Karjalan Sanomat newspaper published in 2012-2014. The corpus is available in Kielipa…
The corpus contains issues of the Karjalan Sanomat newspaper published in 2012-2014. The corpus is available in Kielipankki - the Language Bank of Finland (http://urn.fi/urn:nbn:fi:lb-2016112501). In case you are not a member of an academic institution please read the access rights instructions at https://www.kielipa…
This resource is available for download in Kielipankki – the Language Bank of Finland. This resource consists of .txt a…
This resource is available for download in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .wav files in four languages pertaining to the Finnish Christmas Gospel verses Luke 2. 1–20 The four languages include Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) and Olonets-Karelian (olo, aka …
Until November 2020, this corpus is available via the LAT platform in Kielipankki - the Language Bank of Finland (see Ac…
Until November 2020, this corpus is available via the LAT platform in Kielipankki - the Language Bank of Finland (see Access location). IMPORTANT NOTICE: The LAT service of the Language Bank of Finland will be discontinued in November 2020, after which this corpus version can no longer be used. However, a downloadable…
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Global Web-Based Eng…
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Global Web-Based English (GloWbE) contains about 1.8 billion words and 1 800 000 texts from web pages in United States, Great Britain, Australia, India, and 16 other countries. About 60 % of the texts come from blogs. A…
The corpus, containing the articles from YLE https://yle.fi from 2011-2018, is available at Korp.
The corpus, containing the articles from YLE https://yle.fi from 2011-2018, is available at Korp.