Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
Use the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The resource, containing entire newspaper and magazine articles, has been made available for download in Kielipankki - the Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-201712201. The data consists of source data in PDF form or as plain text and is not annotated. An annotated version (lehdet90ff-vrt-v2) is av…
The corpus is available via Korp in Kielipankki - the Language Bank of Finland (korp.csc.fi). This most recent version of the Corpus of Contemporary American English (COCA), released in March 2020, contains about 1 billion words in 485,000 texts from the years 1990-2019. The corpus is evenly divided into spoken, fiction, …
This resource is available for download in Kielipankki – the Language Bank of Finland. This is a parallel corpus created from Yle news articles from 2014-2020 by aligning the standard Finnish versions with the easy-language versions. The dataset, created by Anna Dmitrieva and available in CSV format, is aligned on t…
This corpus is available for download in Kielipankki – The Language Bank of Finland. NB: When processing personal data in this resource, the user must comply with the resource-specific data protection terms and conditions; see http://urn.fi/urn:nbn:fi:lb-2021050624. The topic of the interview data is the personal me…
This resource is available for download in Kielipankki – the Language Bank of Finland. The FinChat corpus consists of 85 Finnish chat dialogs collected in 2019-2020. The participants (N=62) were native speakers of Finnish in three age-based user groups: high school students (16-19 years), university students (20-25 ye…
This version of the Magazine Corpus of the Institute for the Languages of Finland contains the data in which the OCR (optical character recognition) has been checked. The size of this sub-corpus is 670 000 tokens. It contains one 1935 issue from 'Historiallinen Aikakauskirja', 'Lakimies' and 'Suomi', as well as 4 iss…
This resource contains answers to the matriculation exam in Swedish (B syllabus). The corpus will be made available via Kielipankki – The Language Bank of Finland. For the time being, the corpus can only be accessed by the Digisvenska project team at the University of Helsinki, but when the preparation of the material…
The corpus, containing articles published by YLE (https://yle.fi) from 2011-2021, is available via Korp in Kielipankki – the Language Bank of Finland. This collection contains two sets of the Yle Finnish News Archive: "Yle Finnish News Archive 2011-2018, Korp" and "Yle Finnish News Archive 2019-2021, Korp". Together, the tw…
The corpus, containing the articles from YLE https://yle.fi from 2019-2021, is available for download in Kielipankki – the Language Bank of Finland.
This content is available in Kielipankki. This collection contains two sets of Suomi24 data: "The Suomi24 Sentences Corpus 2001-2017, Korp version" and "The Suomi24 Sentences Corpus 2018-2020, Korp version". Together, the two corpora cover all the discussion forums of the Suomi24 online social networking website fro…