Language resources and tools of AiLab IMCS UL

Language resources and tools of AiLab IMCS UL http://hdl.handle.net/20.500.12574/2 2026-06-26T11:48:27Z 2026-06-26T11:48:27Z Dictionary of Contemporary Latvian Language (MLVV) (2026-06-21) Zuicena, Ieva Auziņa, Ieva Briede, Santa Jansone, Irēna Ilga Kuplā, Ieva Lejniece, Gunta Migla, Ilga Oldere, Laimdota Ozola, Ārija Požarnova, Vija Rapa, Sanda Roze, Anitra Šmidebergs, Imants Šnē, Dorisa Šnē, Māra Timuška, Agris Grasmanis, Mikus Pretkalniņa, Lauma Znotiņš, Artūrs http://hdl.handle.net/20.500.12574/161 2026-06-26T08:09:10Z 2026-06-21T00:00:00Z

Dictionary of Contemporary Latvian Language (MLVV) (2026-06-21) Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs “Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and dictionaries. Some of the dictionary content is machine-readable.

2026-06-21T00:00:00Z Tēzaurs.lv 2026 (Summer Edition) Spektors, Andrejs Pretkalniņa, Lauma Grūzītis, Normunds Paikens, Pēteris Rituma, Laura Saulīte, Baiba Nešpore-Bērzkalne, Gunta Lokmane, Ilze Klints, Agute Stāde, Madara Grasmanis, Mikus Auziņa, Ilze Znotiņš, Artūrs Darģis, Roberts Bārzdiņš, Guntis http://hdl.handle.net/20.500.12574/160 2026-06-26T07:50:02Z 2026-06-21T00:00:00Z

Tēzaurs.lv 2026 (Summer Edition) Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is available as open data in TEI/XML and LMF/XML formats, as well as PostgreSQL database dump.

2026-06-21T00:00:00Z LVTB - Latvian Treebank v2.18 Rituma, Laura Pretkalniņa, Lauma Saulīte, Baiba Nešpore-Bērzkalne, Gunta Grūzītis, Normunds Znotiņš, Artūrs http://hdl.handle.net/20.500.12574/159 2026-05-28T09:43:41Z 2026-05-15T00:00:00Z

LVTB - Latvian Treebank v2.18 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds; Znotiņš, Artūrs Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB).

2026-05-15T00:00:00Z ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities Štekeļs, Jorens http://hdl.handle.net/20.500.12574/158 2026-05-12T12:35:02Z 2026-05-11T00:00:00Z

ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities Štekeļs, Jorens ConLoan-LV is a multi-purpose contrastive dataset designed for the classification and analysis of Latvian language loanwords, code-switching, and named entities. Replicating and extending the ConLoan methodology, the dataset contains 353 manually validated sentences in the baseline version and 676 in the extended version, with all sentences sourced from the LVK2022 corpus. Each entry is enriched with labels for material borrowings (LOAN), while the extended version adds labels for code-switching (CS) and named entities (NE). Furthermore, the dataset includes native-language semantic equivalents for loanwords and English translations, providing a parallel structure for comparative analysis. This resource is intended for training and benchmarking language models in identifying non-native lexical elements within Latvian language texts.

2026-05-11T00:00:00Z Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08) Zuicena, Ieva Auziņa, Ieva Briede, Santa Jansone, Irēna Ilga Kuplā, Ieva Lejniece, Gunta Migla, Ilga Oldere, Laimdota Ozola, Ārija Požarnova, Vija Rapa, Sanda Roze, Anitra Šmidebergs, Imants Šnē, Dorisa Šnē, Māra Timuška, Agris Grasmanis, Mikus Pretkalniņa, Lauma Znotiņš, Artūrs http://hdl.handle.net/20.500.12574/157 2026-06-26T08:09:10Z 2026-04-08T00:00:00Z

Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08) Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs “Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and dictionaries. Some of the dictionary content is machine-readable.

2026-04-08T00:00:00Z Tēzaurs.lv 2026 (Spring Edition) Spektors, Andrejs Pretkalniņa, Lauma Grūzītis, Normunds Paikens, Pēteris Rituma, Laura Saulīte, Baiba Nešpore-Bērzkalne, Gunta Lokmane, Ilze Klints, Agute Stāde, Madara Grasmanis, Mikus Auziņa, Ilze Znotiņš, Artūrs Darģis, Roberts Bārzdiņš, Guntis http://hdl.handle.net/20.500.12574/156 2026-06-26T07:50:02Z 2026-04-08T00:00:00Z

Tēzaurs.lv 2026 (Spring Edition) Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is available as open data in TEI/XML and LMF/XML formats, as well as PostgreSQL database dump.

2026-04-08T00:00:00Z Latvian Communist Leaflet Corpus (1934–1940) Babaņins, Vladislavs http://hdl.handle.net/20.500.12574/154 2026-04-07T07:39:09Z 2026-03-30T00:00:00Z

Latvian Communist Leaflet Corpus (1934–1940) Babaņins, Vladislavs The Latvian Communist Leaflet Corpus (1934–1940) is a structured digital corpus of underground political leaflets produced by illegal communist organizations in Latvia between January 1934 and July 1940, covering the final months of the parliamentary period and the authoritarian regime of Kārlis Ulmanis. The corpus contains 251 unique leaflet texts. In total, there are 458 records, of which 273 include transcribed text (including textual variants) and the remainder are metadata-only records for leaflets not reproduced in the source edition. The transcribed texts have been manually reviewed and corrected to reduce transcription errors. Each record includes structured metadata fields such as title, author, date, print run, typography name, production method, original language, and text language. The corpus also includes manually compiled topic annotations and inferred location data as additional research annotations.

2026-03-30T00:00:00Z Historical Dictionary of Latvian Given Names Siliņa-Piņķe, Renāte Rapa, Sanda Jansone, Ilga Kazakevičs, Ņikita http://hdl.handle.net/20.500.12574/152 2026-02-18T18:10:00Z 2026-01-01T00:00:00Z

Historical Dictionary of Latvian Given Names Siliņa-Piņķe, Renāte; Rapa, Sanda; Jansone, Ilga; Kazakevičs, Ņikita "Historical Dictionary of Latvian Given Names" (LPVV) is an online scientific dictionary that collects and describes Latvian given names documented in written sources spanning more than eight centuries. This dictionary focuses on names that entered the Latvian given name system before the end of the 19th century.

2026-01-01T00:00:00Z Tēzaurs.lv 2026 (Winter Edition) Spektors, Andrejs Pretkalniņa, Lauma Grūzītis, Normunds Paikens, Pēteris Rituma, Laura Saulīte, Baiba Nešpore-Bērzkalne, Gunta Lokmane, Ilze Klints, Agute Stāde, Madara Grasmanis, Mikus Auziņa, Ilze Znotiņš, Artūrs Darģis, Roberts Bārzdiņš, Guntis http://hdl.handle.net/20.500.12574/151 2026-04-20T15:05:36Z 2025-12-21T00:00:00Z

Tēzaurs.lv 2026 (Winter Edition) Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is available as open data in TEI/XML and LMF/XML formats, as well as PostgreSQL database dump.

2025-12-21T00:00:00Z Dictionary of Contemporary Latvian Language (MLVV) (2025-12-21) Zuicena, Ieva Auziņa, Ieva Briede, Santa Jansone, Irēna Ilga Kuplā, Ieva Lejniece, Gunta Migla, Ilga Oldere, Laimdota Ozola, Ārija Požarnova, Vija Rapa, Sanda Roze, Anitra Šmidebergs, Imants Šnē, Dorisa Šnē, Māra Timuška, Agris Grasmanis, Mikus Pretkalniņa, Lauma Znotiņš, Artūrs http://hdl.handle.net/20.500.12574/150 2026-04-20T15:07:04Z 2025-12-21T00:00:00Z

Dictionary of Contemporary Latvian Language (MLVV) (2025-12-21) Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs “Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and dictionaries. Some of the dictionary content is machine-readable.

2025-12-21T00:00:00Z