What's New
corpus
Description:
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
This item contains 3 files (24.77 MB).
Publicly Available
corpus
ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities
Description:
ConLoan-LV is a multi-purpose contrastive dataset designed for the classification and analysis of Latvian language loanwords, code-switching, and named entities. Replicating and extending the ConLoan methodology, the dataset ...
This item contains 3 files (1.95 MB).
Publicly Available
lexicalConceptualResource
Description:
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials ...
This item contains 1 file (7.82 MB).
Publicly Available
Most Viewed Items
Top Last Week
lexicalConceptualResource
Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic ...
This item contains 5 files (290.43 MB).
Publicly Available
corpus
Description:
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
This item contains 3 files (24.77 MB).
Publicly Available
corpus
Description:
The corpus contains recordings of informal conversations, interviews, and public speeches, together with their orthographic transcripts. Metadata has been added to each audio recording: gender and age group of the speaker, ...
This item contains 2 files (4.12 GB).
Academic Use