dc.contributor.author | Spektors, Andrejs |
dc.contributor.author | Pretkalniņa, Lauma |
dc.contributor.author | Grūzītis, Normunds |
dc.contributor.author | Paikens, Pēteris |
dc.contributor.author | Rituma, Laura |
dc.contributor.author | Saulīte, Baiba |
dc.contributor.author | Nešpore-Bērzkalne, Gunta |
dc.contributor.author | Lokmane, Ilze |
dc.contributor.author | Klints, Agute |
dc.contributor.author | Stāde, Madara |
dc.contributor.author | Grasmanis, Mikus |
dc.contributor.author | Auziņa, Ilze |
dc.contributor.author | Znotiņš, Artūrs |
dc.contributor.author | Darģis, Roberts |
dc.contributor.author | Bārzdiņš, Guntis |
dc.date.accessioned | 2024-04-05T06:15:15Z |
dc.date.available | 2024-04-05T06:15:15Z |
dc.date.issued | 2023-12 |
dc.identifier.uri | http://hdl.handle.net/20.500.12574/103 |
dc.description | Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 397,000 entries based on 346 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and it is integrated with the Latvian WordNet data. This dataset is available as open data in TEI/XML and LMF/XML formats. If you are interested in acquiring the corresponding PostgreSQL database dump, please, send a request to info@tezaurs.lv. |
dc.language.iso | lav |
dc.publisher | AiLab IMCS UL |
dc.relation.isreferencedby | http://www.lrec-conf.org/proceedings/lrec2016/pdf/1095_Paper.pdf |
dc.relation.isreferencedby | http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.300.pdf |
dc.relation.isreferencedby | https://elex.link/elex2023/wp-content/uploads/89.pdf |
dc.relation.replaces | http://hdl.handle.net/20.500.12574/92 |
dc.relation.isreplacedby | http://hdl.handle.net/20.500.12574/104 |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://tezaurs.lv |
dc.subject | thesaurus |
dc.subject | dictionary |
dc.subject | lexicon |
dc.title | Tēzaurs.lv 2024 (Winter Edition) |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType | computationalLexicon |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN Centre of Latvian language resources and tools |
demo.uri | https://tezaurs.lv |
contact.person | Normunds Grūzītis normundsg@ailab.lv Normunds Grūzītis |
sponsor | Ministry of Education and Science VPP-IZM-DH-2020/1-0001 Digital Resources for Humanities: Integration and Development nationalFunds |
sponsor | Latvian Council of Science lzp-2019/1-0464 Latvian WordNet and Word Sense Disambiguation nationalFunds |
sponsor | Ministry of Education and Science VPP-IZM-2018/2-0002 Latvian Language nationalFunds |
sponsor | Ministry of Education and Science VPP-LETONIKA-2021/1-0006 Research on Modern Latvian Language and Development of Language Technology nationalFunds |
sponsor | Latvian Council of Science lzp-2022/1-0443 Advancing Latvian computational lexical resources for natural language understanding and generation nationalFunds |
size.info | 399012 entries |
files.size | 305399308 |
files.count | 2 |
Files in this item
Download all files in item (291.25 MB)This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- tezaurs_2024_1_tei.xml
- Size
- 283.53 MB
- Format
- XML
- Description
- Tezaurs.lv open data in the TEI/XML format (https://tei-c.org/release/doc/tei-p5-doc/en/html/DI.html)
- MD5
- cd4c6895a62226eea8d9515c046395e0
- Name
- tezaurs_2024_1_lmf.xml
- Size
- 7.72 MB
- Format
- XML
- Description
- Latvian WordNet open data in the LMF/XML format (https://globalwordnet.github.io/schemas/#xml)
- MD5
- 27d214c2e5a484404ba5953fb2b37cf1