dc.contributor.author |
Nešpore, Gunta |
dc.contributor.author |
Rituma, Laura |
dc.date.accessioned |
2023-03-01T10:27:07Z |
dc.date.available |
2023-03-01T10:27:07Z |
dc.date.issued |
2023-02-24 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12574/80 |
dc.description |
Word sense annotation of a running-text corpus of 1200 tokens (the beginning of The Little Prince by Antoine de Saint-Exupéry), intended as an evaluation corpus for Latvian word sense disambiguation (WSD) systems.
The data is provided in a tab-separated format similar to CoNLL; senses are indexed by the word sense IDs of the Tēzaurs.lv 2022 database release (http://hdl.handle.net/20.500.12574/66). |
dc.language.iso |
lav |
dc.publisher |
AiLab IMCS UL |
dc.rights |
CLARIN ACA |
dc.rights.uri |
https://www.kielipankki.fi/wp-content/uploads/CLARIN_ACA_AFFIL-EDU_NC_NORED_en.html |
dc.rights.label |
ACA |
dc.source.uri |
https://wordnet.ailab.lv |
dc.subject |
word sense disambiguation |
dc.title |
Word sense annotated "The Little Prince" fragments in Latvian 1.0 |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
has.files |
yes |
branding |
CLARIN Centre of Latvian language resources and tools |
contact.person |
Pēteris Paikens, peteris@ailab.lv, AiLab IMCS UL |
sponsor |
Latvian Council of Science, project lzp-2019/1-0464 "Latvian WordNet and Word Sense Disambiguation" (nationalFunds) |
size.info |
1208 tokens |
files.size |
22442 |
files.count |
1 |
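
The description above says only that the data is tab-separated and similar to CoNLL. It does not state the column layout. A minimal reading sketch in Python follows. It assumes, as in CoNLL-style files generally, that blank lines separate sentences and that lines starting with "#" are comments. The function name read_conll_like and the sample rows are hypothetical placeholders, not content from the corpus.

```python
import io

def read_conll_like(stream):
    """Yield sentences as lists of rows; each row is a list of tab-separated fields.

    Assumptions (not confirmed by the record): blank lines separate
    sentences and lines starting with '#' are comments.
    """
    sentence = []
    for raw in stream:
        line = raw.rstrip("\r\n")
        if not line.strip():
            # A blank line closes the current sentence, if any.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Skip comment lines.
            continue
        sentence.append(line.split("\t"))
    # Emit the last sentence when the file does not end with a blank line.
    if sentence:
        yield sentence

# Hypothetical placeholder rows; the real column layout may differ.
sample = "1\ttok_a\tSENSE_1\n2\ttok_b\t_\n\n1\ttok_c\tSENSE_2\n"
sentences = list(read_conll_like(io.StringIO(sample)))
```

The rows are returned as plain lists so that no column meaning is imposed; once the actual layout of the file is known, the fields can be mapped to names.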