What's New
corpus
Description:
Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
This item contains 3 files (24.77
MB).
Publicly Available
corpus
ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities
Description:
ConLoan-LV is a multi-purpose contrastive dataset designed for the classification and analysis of Latvian language loanwords, code-switching, and named entities. Replicating and extending the ConLoan methodology, the dataset ...
This item contains 3 files (1.95
MB).
Publicly Available
lexicalConceptualResource
Description:
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials ...
This item contains 1 file (7.82
MB).
Publicly Available
Most Viewed Items
Top Last Week
lexicalConceptualResource
Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic ...
This item contains 5 files (290.43
MB).
Publicly Available
corpus
Description:
Corpus contains recordings of informal conversations, interviews and public speeches and their transcripts in orthographic transcription. Metadata has been added to each audio recording: gender and age group of the speaker, ...
This item contains 2 files (4.12
GB).
Academic Use
corpus
Description:
The Corpus of early written Latvian 'SENIE' provides access to the texts and facsimiles of written Latvian of the 16th–18th century. Its aim is to facilitate studies of early Latvian in general and to serve as the basis ...
This item contains 4 files (23.64
MB).
Publicly Available