Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
Show more facetsThese levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is a…
The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is available at korp.csc.fi/download
The resource is a sub-corpus of Classics of English and American Literature in Finnish (http://urn.fi/urn:nbn:fi:lb-2016…
The resource is a sub-corpus of Classics of English and American Literature in Finnish (http://urn.fi/urn:nbn:fi:lb-2016110901). The resource is available in Kielipankki - The Language Bank of Finland at http://urn.fi/urn:nbn:fi:lb-2017022202 and contains the following texts with paragraphs scrambled: suomentanut K…
The corpus is available in Kielipankki - the Language Bank of Finland, download. License details: http://urn.fi/urn:nbn:…
The corpus is available in Kielipankki - the Language Bank of Finland, download. License details: http://urn.fi/urn:nbn:fi:lb-20150304151 The corpus contains all the texts available in the Suomi24 API from the discussion forums of the Suomi24 online social networking website from 1.1.2001 to 31.12.2017. The tokenized…
The corpus is available via Korp in Kielipankki – the Language Bank of Finland. The Finnish News Agency Archive corpus …
The corpus is available via Korp in Kielipankki – the Language Bank of Finland. The Finnish News Agency Archive corpus comprises newswire articles in Finnish sent to media outlets by the Finnish News Agency (STT) between 1992-2018. The corpus includes about 2,8 million items in total. Most of the material is news arti…
The corpus is available in the Language Bank's Korp service (http://urn.fi/urn:nbn:fi:lb-2015050503). The HS.fi News an…
The corpus is available in the Language Bank's Korp service (http://urn.fi/urn:nbn:fi:lb-2015050503). The HS.fi News and Comments Corpus contains the domestic news of the Helsingin Sanomat website and their comments from 5.9.2011 to 4.9.2012. The corpus starts with the first news of 5.9.2011 and ends with a news publ…
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Historical American …
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi). The Corpus of Historical American English (COHA) contains about 385 million words and 115 000 texts from the years 1810-2009. Each decade has roughly the same balance of fiction, popular magazine, newspaper, and non-fiction books. Ac…
This dataset contains semi-automatically cleaned, parallel professional subtitles from 44 programs, containing 10.3k ali…
This dataset contains semi-automatically cleaned, parallel professional subtitles from 44 programs, containing 10.3k aligned sentence pairs for these language pairs: FIN-SWE, FIN-ENG, SWE-ENG. This dataset does not contain video or audio, but the total content length covered by the subtitles is 22,46 hours. --- Yle h…
The Helsinki Corpus of Swahili 2.0 Annotated Version containing about 25 million words is available in Kielipankki - the…
The Helsinki Corpus of Swahili 2.0 Annotated Version containing about 25 million words is available in Kielipankki - the Language Bank of Finland in Korp (https://korp.csc.fi/) for academic use. This means that students and staff of universities can use the corpus by simply logging in with their university credentials.…
The corpus is available in Kielipankki - the Language Bank of Finland, download: http://urn.fi/urn:nbn:fi:lb-2015040801 …
The corpus is available in Kielipankki - the Language Bank of Finland, download: http://urn.fi/urn:nbn:fi:lb-2015040801 (see there Suomi24-2015-10-29_VRT). License details: http://urn.fi/urn:nbn:fi:lb-20150304151 The corpus contains all the texts available in the Suomi24 API from the discussion forums of the Suomi24 o…