Rādīt vienkāršu vienuma ierakstu

 
dc.contributor.author Spektors, Andrejs
dc.contributor.author Pretkalniņa, Lauma
dc.contributor.author Grūzītis, Normunds
dc.contributor.author Paikens, Pēteris
dc.contributor.author Rituma, Laura
dc.contributor.author Saulīte, Baiba
dc.contributor.author Nešpore-Bērzkalne, Gunta
dc.contributor.author Lokmane, Ilze
dc.contributor.author Klints, Agute
dc.contributor.author Stāde, Madara
dc.contributor.author Grasmanis, Mikus
dc.contributor.author Auziņa, Ilze
dc.contributor.author Znotiņš, Artūrs
dc.contributor.author Darģis, Roberts
dc.contributor.author Bārzdiņš, Guntis
dc.date.accessioned 2026-06-26T07:50:02Z
dc.date.available 2026-06-26T07:50:02Z
dc.date.issued 2026-06-21
dc.identifier.uri http://hdl.handle.net/20.500.12574/160
dc.description Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is available as open data in TEI/XML and LMF/XML formats, as well as PostgreSQL database dump.
dc.language.iso lav
dc.publisher AiLab IMCS UL
dc.relation.isreferencedby http://www.lrec-conf.org/proceedings/lrec2016/pdf/1095_Paper.pdf
dc.relation.isreferencedby http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.300.pdf
dc.relation.isreferencedby https://elex.link/elex2023/wp-content/uploads/89.pdf
dc.relation.replaces http://hdl.handle.net/20.500.12574/156
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://tezaurs.lv
dc.subject thesaurus
dc.subject dictionary
dc.subject lexicon
dc.title Tēzaurs.lv 2026 (Summer Edition)
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType computationalLexicon
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN Centre of Latvian language resources and tools
demo.uri https://tezaurs.lv
contact.person Normunds Grūzītis normundsg@ailab.lv Normunds Grūzītis
sponsor Ministry of Education and Science VPP-IZM-DH-2020/1-0001 Digital Resources for Humanities: Integration and Development nationalFunds
sponsor Latvian Council of Science lzp-2019/1-0464 Latvian WordNet and Word Sense Disambiguation nationalFunds
sponsor Ministry of Education and Science VPP-IZM-2018/2-0002 Latvian Language nationalFunds
sponsor Ministry of Education and Science VPP-LETONIKA-2021/1-0006 Research on Modern Latvian Language and Development of Language Technology nationalFunds
sponsor Latvian Council of Science lzp-2022/1-0443 Advancing Latvian computational lexical resources for natural language understanding and generation nationalFunds
sponsor Ministry of Education and Science VPP-IZM-LETONIKA-2025/1-0004 Digital Resources and AI Technologies for the Sustainability of the Latvian Language nationalFunds
sponsor Latvian Council of Science lzp-2025/1-0685 Contemporary Methods for the Development of Latvian Lexical Resources nationalFunds
size.info 414347 entries
files.size 344238290
files.count 5


 Faili šajā vienumā

 Lejupielādēt visus vienuma failus (328.29 MB)
Šis vienums ir
Publicly Available
un ir licencēts saskaņā ar:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Icon
Vārds
tezaurs_2026_3_tei.xml.zip
Lielums
42.79 MB
Formāts
application/zip
Apraksts
Tezaurs.lv open data in the TEI/XML format (https://tei-c.org/release/doc/tei-p5-doc/en/html/DI.html)
MD5
4827c634501dfa1bd39d7ffafb992b36
 Lejupielādēt failu  Priekšskatījums
 Faila priekšskatījums  
    • tezaurs_2026_3_tei.xml384 MB
Icon
Vārds
tezaurs_2026_3_wordforms_tei.xml.zip
Lielums
205.34 MB
Formāts
application/zip
Apraksts
Tezaurs.lv open data (appendix: wordforms) in the TEI/XML format (https://tei-c.org/release/doc/tei-p5-doc/en/html/DI.html)
MD5
78976560a4502780e0c594589c6befa8
 Lejupielādēt failu  Priekšskatījums
 Faila priekšskatījums  
    • tezaurs_2026_3_wordforms_tei.xml12 GB
Icon
Vārds
tezaurs_2026_3_lmf.xml.zip
Lielums
7.56 MB
Formāts
application/zip
Apraksts
Latvian WordNet open data in the LMF/XML format (https://globalwordnet.github.io/schemas/#xml)
MD5
6971c5b6e69f3884b131db9172fb52fc
 Lejupielādēt failu  Priekšskatījums
 Faila priekšskatījums  
    • tezaurs_2026_3_lmf.xml30 MB
Icon
Vārds
tezaurs_2026_03.ispell.zip
Lielums
8.15 MB
Formāts
application/zip
Apraksts
Newline separated filtered wordform list.
MD5
fc338e051d01a2a3be5b8b16754accaa
 Lejupielādēt failu  Priekšskatījums
 Faila priekšskatījums  
    • tezaurs_2026_03.ispell64 MB
Icon
Vārds
tezaurs_2026_03-public.pgsql.gz
Lielums
64.45 MB
Formāts
application/gzip
Apraksts
PostgreSQL DB dump.
MD5
08a32aa82ed981569be5d39fc8aa2978
 Lejupielādēt failu

Rādīt vienkāršu vienuma ierakstu