• Repository
  • Corpus Search
  • About
  • CLARIN
  •  Login
  • English Latviešu
  • CLARIN-LV Repository Home
  • View Item
  •  
  • CLARIN-LV logo
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   Statistics  
    •    StatisticsBETA
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 
 

Database of Latvian Morphemes and Derivational Models (DLMDM)

 
CLARIN Centre of Latvian language resources and tools
  Authors
Kalnača, Andra ; et al.show everyone Kalnača, Andra ; Pakalne, Tatjana ; Auziņa, Ieva ; Balmane, Vanesa ; Butāne, Anita ; Hoplíček, Milan ; Horiguchi, Daiki ; Jansone, Laura Paula ; Levāne‑Petrova, Kristīne ; Lokmane, Ilze ; Miķelsone, Paula ; Otomers, Oskars ; Ozola, Paula ; Urbanoviča, Inta
  Item identifier
http://hdl.handle.net/20.500.12574/155
 Project URL
https://github.com/MorphLatLang/DLMDM
 Demo URL
https://github.com/MorphLatLang/DLMDM/tree/main/data
 Referenced by
https://aclanthology.org/2025.nodalida-1.29/
 Date issued
2026-03
 Type
lexicalConceptualResource, text
 Size
10214 lexicalTypes
 Language(s)
Standard Latvian
 Description
"The Database of Latvian Morphemes and Derivational Models (DLMDM)" is a corpus-based derivational morphology resource developed at the Department of Latvian and Baltic Studies, Faculty of Humanities, University of Latvia. The core of the database consists of lemmas imported from the Balanced Corpus of Modern Latvian (LVK2018), with additional lemmas from other sources added to improve coverage of Latvian derivational morphology. The morphemic segmentation, part-of-speech information, morphological features, and derivational data have been manually validated at the lemma level. DLMDM provides four cross-indexed linguistic registers: the lemma register, the root register, the affix register, and the source register. Each register captures a different layer of derivational morphology information and is distributed as a UTF-8 tab-separated file.
 Publisher
University of Latvia
 Acknowledgement

The Latvian Council of Science

Project code: lzp-2022/1-0013

Project name: The “Database of Latvian Morphemes and Derivational Models (DLMDM)”

 Subject(s)
derivational morphology morphemes morphemic segmentation word formation lexical roots affixes derivational models derivational relations derivatives compounds word families derivational trees
 Collection(s)
Language and Cultural Heritage Data of the UL Faculty of Humanities
Show full item record
 
 

Partners, Coordination, Funding

  • Institute of Mathematics and Computer Science of the University of Latvia
  • Institute of Literature, Folklore and Art of the University of Latvia
  • University of Latvia
  • Rīga Stradiņš University
  • RTU Liepaja
  • Rezekne Academy of Technologies
  • National Library of Latvia

Repository

  • Main page
  • Contact
  • Submission Lifecycle
  • FAQ
  • About and Policies

More

  • CLARIN
  • How to Sign in

This platform runs under the software developed for the LINDAT/CLARIN repository for linguistics , available on GitHub