What's New

corpus
Description:
The Dataset for Embedding Model Fine-Tuning was created within the framework of the National Research Program project "Analysis of the applicability of artificial intelligence methods in the field of EU fund projects". ...
 This item contains 4 files (161.23 MB).
 
Publicly Available
corpus
Description:
The Procurement Validation Dataset was created within the framework of the State Research Programme project "Analysis of the Applicability of Artificial Intelligence Methods in the Field of European Union Fund Projects". ...
 This item contains 4 files (10.19 MB).
 
Publicly Available
lexicalConceptualResource
Description:
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. ...
 This item contains 1 file (62.56 MB).
 
Publicly Available

Most Viewed Items

Top Last Week
corpus
Description:
The corpus consists of audio recordings and their transcripts. It documents natural, spontaneous speech, including field research recordings, interviews, TV and radio broadcasts.
 This item contains 180 files (14.27 GB).
 
Academic Use Noncommercial
lexicalConceptualResource
Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic ...
 This item contains 5 files (330.96 MB).
 
Publicly Available
corpus
Description:
The corpus includes almost 1,000 texts written by foreign students at a Latvian higher education institution who are learning Latvian as a foreign language in their first or second semester. The morphologically ...
 This item contains no files.