What's New
corpus

Description:
Dataset for Embedding Model Fine-Tuning has been created within the framework of the National Research Program project "Analysis of the applicability of artificial intelligence methods in the field of EU fund projects". ...
This item contains 4 files (161.23
MB).
Publicly Available
corpus

Description:
The Procurement Validation Dataset was created within the framework of the State Research Programme project "Analysis of the Applicability of Artificial Intelligence Methods in the Field of European Union Fund Projects". ...
This item contains 4 files (10.19
MB).
Publicly Available
lexicalConceptualResource

Description:
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language institute of University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. ...
This item contains 1 file (62.56
MB).
Publicly Available
Most Viewed Items
Top Last Week
corpus

Description:
The corpus consists of audio recordings and their transcripts. It documents natural, spontaneous speech, including field research recordings, interviews, TV and radio broadcasts.
This item contains 180 files (14.27
GB).
Academic Use

lexicalConceptualResource

Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic ...
This item contains 5 files (330.96
MB).
Publicly Available
corpus

Description:
The corpus includes almost 1000 texts created by foreign students studying at a Latvian higher education institution who are learning Latvian as a foreign language in the first or second semester. The morphologically ...
This item contains no files.