<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
<channel>
<title>CLARIN-LV digital library at IMCS, University of Latvia</title>
<link>https://repository.clarin.lv:443/repository/xmlui</link>
<description>The CLARIN-LV digital repository system captures, stores, indexes, preserves, and distributes digital research material.</description>
<pubDate xmlns="http://apache.org/cocoon/i18n/2.1">Fri, 29 May 2026 06:17:50 GMT</pubDate>
<dc:date>2026-05-29T06:17:50Z</dc:date>
<item>
<title>LVTB - Latvian Treebank v2.18</title>
<link>http://hdl.handle.net/20.500.12574/159</link>
<description>LVTB - Latvian Treebank v2.18
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds; Znotiņš, Artūrs
The Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains the data used to derive the corresponding version of the Latvian UD Treebank (UDLV-LVTB).
</description>
<pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/159</guid>
<dc:date>2026-05-15T00:00:00Z</dc:date>
</item>
<item>
<title>ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities</title>
<link>http://hdl.handle.net/20.500.12574/158</link>
<description>ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities
Štekeļs, Jorens
ConLoan-LV is a multi-purpose contrastive dataset designed for the classification and analysis of Latvian language loanwords, code-switching, and named entities. Replicating and extending the ConLoan methodology, the dataset contains 353 manually validated sentences in the baseline version and 676 in the extended version, with all sentences sourced from the LVK2022 corpus. Each entry is enriched with labels for material borrowings (LOAN), while the extended version adds labels for code-switching (CS) and named entities (NE). Furthermore, the dataset includes native-language semantic equivalents for loanwords and English translations, providing a parallel structure for comparative analysis. This resource is intended for training and benchmarking language models in identifying non-native lexical elements within Latvian language texts.
</description>
<pubDate>Mon, 11 May 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/158</guid>
<dc:date>2026-05-11T00:00:00Z</dc:date>
</item>
<item>
<title>Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08)</title>
<link>http://hdl.handle.net/20.500.12574/157</link>
<description>Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08)
Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs
The “Dictionary of Contemporary Latvian Language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files and internet sources, as well as on encyclopaedias and dictionaries from the last decade. Some of the dictionary content is machine-readable.
</description>
<pubDate>Wed, 08 Apr 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/157</guid>
<dc:date>2026-04-08T00:00:00Z</dc:date>
</item>
<item>
<title>Tēzaurs.lv 2026 (Spring Edition)</title>
<link>http://hdl.handle.net/20.500.12574/156</link>
<description>Tēzaurs.lv 2026 (Spring Edition)
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, and corpus examples, and is integrated with the Latvian WordNet data.

This dataset is available as open data in TEI/XML and LMF/XML formats, as well as a PostgreSQL database dump.
</description>
<pubDate>Wed, 08 Apr 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/156</guid>
<dc:date>2026-04-08T00:00:00Z</dc:date>
</item>
<item>
<title>Database of Latvian Morphemes and Derivational Models (DLMDM)</title>
<link>http://hdl.handle.net/20.500.12574/155</link>
<description>Database of Latvian Morphemes and Derivational Models (DLMDM)
Kalnača, Andra; Pakalne, Tatjana; Auziņa, Ieva; Balmane, Vanesa; Butāne, Anita; Hoplíček, Milan; Horiguchi, Daiki; Jansone, Laura Paula; Levāne‑Petrova, Kristīne; Lokmane, Ilze; Miķelsone, Paula; Otomers, Oskars; Ozola, Paula; Urbanoviča, Inta
The Database of Latvian Morphemes and Derivational Models (DLMDM) is a corpus-based derivational morphology resource developed at the Department of Latvian and Baltic Studies, Faculty of Humanities, University of Latvia. The core of the database consists of lemmas imported from the Balanced Corpus of Modern Latvian (LVK2018), with additional lemmas from other sources added to improve coverage of Latvian derivational morphology. The morphemic segmentation, part-of-speech information, morphological features, and derivational data have been manually validated at the lemma level. DLMDM provides four cross-indexed linguistic registers: the lemma register, the root register, the affix register, and the source register. Each register captures a different layer of derivational morphology information and is distributed as a UTF-8 tab-separated file.
</description>
<pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/155</guid>
<dc:date>2026-03-01T00:00:00Z</dc:date>
</item>
<item>
<title>Latvian Communist Leaflet Corpus (1934–1940)</title>
<link>http://hdl.handle.net/20.500.12574/154</link>
<description>Latvian Communist Leaflet Corpus (1934–1940)
Babaņins, Vladislavs
The Latvian Communist Leaflet Corpus (1934–1940) is a structured digital corpus of underground political leaflets produced by illegal communist organizations in Latvia between January 1934 and July 1940, covering the final months of the parliamentary period and the authoritarian regime of Kārlis Ulmanis. The corpus contains 251 unique leaflet texts. In total, there are 458 records, of which 273 include transcribed text (including textual variants) and the remainder are metadata-only records for leaflets not reproduced in the source edition. The transcribed texts have been manually reviewed and corrected to reduce transcription errors. Each record includes structured metadata fields such as title, author, date, print run, printing house name, production method, original language, and text language. The corpus also includes manually compiled topic annotations and inferred location data as additional research annotations.
</description>
<pubDate>Mon, 30 Mar 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/154</guid>
<dc:date>2026-03-30T00:00:00Z</dc:date>
</item>
<item>
<title>Corpus of Contemporary Latgalian Speech (MuLaR) (2026-03-02)</title>
<link>http://hdl.handle.net/20.500.12574/153</link>
<description>Corpus of Contemporary Latgalian Speech (MuLaR) (2026-03-02)
Martena, Sanita; Nau, Nicole; Kļavinska, Antra; Juško-Štekele, Angelika; Kociņš-Kūceņš, Armands; Sprukte, Ausma; Briška, Anna; Gusāns, Ingars; Mazure, Laura
The corpus consists of audio recordings and their transcripts. It documents natural, spontaneous speech, including field research recordings, interviews, and TV and radio broadcasts.
</description>
<pubDate>Mon, 02 Mar 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/153</guid>
<dc:date>2026-03-02T00:00:00Z</dc:date>
</item>
<item>
<title>Historical Dictionary of Latvian Given Names</title>
<link>http://hdl.handle.net/20.500.12574/152</link>
<description>Historical Dictionary of Latvian Given Names
Siliņa-Piņķe, Renāte; Rapa, Sanda; Jansone, Ilga; Kazakevičs, Ņikita
The Historical Dictionary of Latvian Given Names (LPVV) is an online scholarly dictionary that collects and describes Latvian given names documented in written sources spanning more than eight centuries. The dictionary focuses on names that entered the Latvian given name system before the end of the 19th century.
</description>
<pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/152</guid>
<dc:date>2026-01-01T00:00:00Z</dc:date>
</item>
<item>
<title>Tēzaurs.lv 2026 (Winter Edition)</title>
<link>http://hdl.handle.net/20.500.12574/151</link>
<description>Tēzaurs.lv 2026 (Winter Edition)
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, and corpus examples, and is integrated with the Latvian WordNet data.

This dataset is available as open data in TEI/XML and LMF/XML formats, as well as a PostgreSQL database dump.
</description>
<pubDate>Sun, 21 Dec 2025 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/151</guid>
<dc:date>2025-12-21T00:00:00Z</dc:date>
</item>
<item>
<title>Dictionary of Contemporary Latvian Language (MLVV) (2025-12-21)</title>
<link>http://hdl.handle.net/20.500.12574/150</link>
<description>Dictionary of Contemporary Latvian Language (MLVV) (2025-12-21)
Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs
The “Dictionary of Contemporary Latvian Language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files and internet sources, as well as on encyclopaedias and dictionaries from the last decade. Some of the dictionary content is machine-readable.
</description>
<pubDate>Sun, 21 Dec 2025 00:00:00 GMT</pubDate>
<guid isPermaLink="false">http://hdl.handle.net/20.500.12574/150</guid>
<dc:date>2025-12-21T00:00:00Z</dc:date>
</item>
</channel>
</rss>
