What's New

 corpus 
Description:
The Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version ...
 This item contains 3 files (24.69 MB).
 
Publicly Available
 corpus 
Description:
The evaluation and development data sets for speech translation for meetings were created within the microproject "Multi-layer evaluation sets for speech translation of web-based meetings" of the project "HumanE AI ...
 This item contains 7 files (5.57 GB).
 
Publicly Available
 corpus 
Description:
The corpus consists of certain proportions of various published Latgalian texts (1988–2021), with accompanying metadata about the author, place, and year of publication, as well as information about the type and genre ...
 This item contains no files.

Most Viewed Items

Top Last Week
 lexicalConceptualResource 
Description:
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains nearly 390,000 entries compiled from more than 330 sources. The dictionary is enriched with phonetic, morphological, semantic ...
 This item contains 1 file (23.21 MB).
 
Publicly Available
 lexicalConceptualResource 
Description:
Tezaurs is a machine-readable lexicon and an online dictionary for Latvian. The initial human-oriented version of this resource was made publicly available in 2009, comprising more than 125,000 entries. Since then, Tezaurs has been ...
 This item contains 1 file (14.36 MB).
 
Publicly Available
 corpus 
Description:
A specialized corpus containing 468 student essays written for the 12th-grade Latvian language exam.
 This item contains no files.