Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
Show more facetsThese levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
NOT-basen is a TBX-export of a terminology database developed by Norsk termbank. This termbase is to be considered an hi…
NOT-basen is a TBX-export of a terminology database developed by Norsk termbank. This termbase is to be considered an historical resource, and has not been updated for a while.
The Norwegian Spanish Parallel Corpus (NSPC) was created at the University of Bergen, Norway. The corpus is primarily co…
The Norwegian Spanish Parallel Corpus (NSPC) was created at the University of Bergen, Norway. The corpus is primarily constructed for research in Translation Studies, and is built to be roughly comparable to the Spanish-English P-ACTRES corpus. The NSPC is a parallel, unidirectional translation corpus of contemporary N…
Recording equipment The recordings were done by means of a digital recorder (Fostex FR-2LE) and two AKG C451 B microph…
Recording equipment The recordings were done by means of a digital recorder (Fostex FR-2LE) and two AKG C451 B microphones placed on the table in front of the speakers. The recording took place in one of the participants’ home, speaker 04. The speakers The set consists of four speakers, two women, born in 1929 a…
Recording equipment The recordings were done by means of a cassette recorder (Sony TC-D5M) and Sony lavaliere micropho…
Recording equipment The recordings were done by means of a cassette recorder (Sony TC-D5M) and Sony lavaliere microphones. The recordings took place in the speakers’ homes or in a hotel room. Sigurd Nordlie was recorded in his office at the University of Oslo. The tapes were digitized in the 1990s. The speakers …
The corpus, containing the articles from YLE https://yle.fi from 2011-2018, is available at korp.csc.fi/download
The corpus, containing the articles from YLE https://yle.fi from 2011-2018, is available at korp.csc.fi/download
Until November 2020, this corpus version is available via the LAT platform in Kielipankki - the Language Bank of Finland…
Until November 2020, this corpus version is available via the LAT platform in Kielipankki - the Language Bank of Finland (see Access location). IMPORTANT NOTICE: The LAT service of the Language Bank of Finland will be discontinued in November 2020, after which this corpus version can no longer be used. However, a down…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Se…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Selkokieliset uutiset in Finnish (Yle Easy-to-read Finnish News). The dataset was created from the contents of the Yle News Archive for the language code "fi" for each month from the year 2011 to the ye…
The corpus is available for download in Kielipankki - the Language Bank of Finland. License details: http://urn.fi/urn:n…
The corpus is available for download in Kielipankki - the Language Bank of Finland. License details: http://urn.fi/urn:nbn:fi:lb-20150304151 The corpus contains all the texts available in the Suomi24 API from the discussion forums of the Suomi24 online social networking website from 1.1.2018 to 31.12.2020. The tokeniz…
The corpus is available via Korp in Kielipankki - the Language Bank of Finland (korp.csc.fi). This most recent version …
The corpus is available via Korp in Kielipankki - the Language Bank of Finland (korp.csc.fi). This most recent version of Corpus of Contemporary American English (COCA), released in March 2020, contains about 1 billion words in 485,000 texts from the years 1990-2019. The corpus is evenly divided into spoken, fiction, …
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains the data, where the OCR (…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains the data, where the OCR (optical character recognition) has been checked. The size of this sub-corpus is 670 000 tokens. It contains one 1935 issue from 'Historiallinen Aikakauskirja', 'Lakimies' and 'Suomi', as well as 4 iss…