What's New
corpus

Description:
Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
This item contains 3 files (24.69
MB).
Publicly Available
corpus

Description:
The evaluation and development data sets for speech translation for meetings were created within the microproject "Multi-layer evaluation sets for speech translation of web-based meetings" of the project "HumanE AI ...
This item contains 7 files (5.57
GB).
Publicly Available
corpus

Description:
The corpus consists of certain proportions of various Latgalian published texts (1988–2021) with accompanying metadata about the author, place and year of the publication, as well as information about the type and genre ...
This item contains no files.
Most Viewed Items
Top Last Week
lexicalConceptualResource

Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains nearly 390,000 entries compiled from more than 330 sources. The dictionary is enriched with phonetic, morphological, semantic ...
This item contains 1 file (23.21
MB).
Publicly Available
lexicalConceptualResource

Description:
Tezaurs is a machine-readable lexicon and an online dictionary for Latvian. The initial human-oriented version of this resource was made publicly in 2009, comprising more than 125,000 entries. Since then, Tezaurs has been ...
This item contains 1 file (14.36
MB).
Publicly Available
corpus

Description:
A specialized corpus containing 468 students' essays for the 12th grade Latvian language exam.
This item contains no files.