What's New
corpus
Description:
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
This item contains 3 files (24.77 MB).
Publicly Available
corpus
ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities
Description:
ConLoan-LV is a multi-purpose contrastive dataset designed for the classification and analysis of Latvian language loanwords, code-switching, and named entities. Replicating and extending the ConLoan methodology, the dataset ...
This item contains 3 files (1.95 MB).
Publicly Available
lexicalConceptualResource
Description:
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials ...
This item contains 1 file (7.82 MB).
Publicly Available
Most Viewed Items
Top Last Week
lexicalConceptualResource
Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic ...
This item contains 5 files (290.43 MB).
Publicly Available
corpus
Description:
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
This item contains 3 files (24.77 MB).
Publicly Available
corpus
Description:
The Latvian Communist Leaflet Corpus (1934–1940) is a structured digital corpus of underground political leaflets produced by illegal communist organizations in Latvia between January 1934 and July 1940, covering the final ...
This item contains 7 files (2.48 MB).
Publicly Available