dc.contributor.author Nešpore, Gunta
dc.contributor.author Rituma, Laura
dc.date.accessioned 2023-03-01T10:27:07Z
dc.date.available 2023-03-01T10:27:07Z
dc.date.issued 2023-02-24
dc.identifier.uri http://hdl.handle.net/20.500.12574/80
dc.description Word sense annotation of a running-text corpus of 1200 tokens (the beginning of The Little Prince by Antoine de Saint-Exupéry), intended as an evaluation corpus for Latvian word sense disambiguation (WSD) systems. The data is provided in a tab-separated format similar to CoNLL, with senses indexed to the Tēzaurs.lv word sense IDs as of the Tēzaurs.lv 2022 database release (http://hdl.handle.net/20.500.12574/66).
dc.language.iso lav
dc.publisher AiLab IMCS UL
dc.rights CLARIN ACA
dc.rights.uri https://www.kielipankki.fi/wp-content/uploads/CLARIN_ACA_AFFIL-EDU_NC_NORED_en.html
dc.rights.label ACA
dc.source.uri https://wordnet.ailab.lv
dc.subject word sense disambiguation
dc.title Word sense annotated "The Little Prince" fragments in Latvian 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN Centre of Latvian language resources and tools
contact.person Pēteris Paikens peteris@ailab.lv AiLab IMCS UL
sponsor Latvian Council of Science lzp-2019/1-0464 Latvian WordNet and Word Sense Disambiguation nationalFunds
size.info 1208 tokens
files.size 22442
files.count 1
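
The description above says only that the data is tab-separated and "similar to CoNLL"; the record does not document the column layout. A minimal Python sketch for loading such a file without committing to particular columns might look like the following. The blank-line sentence separator, the `#` comment convention, and the UTF-8 encoding are assumptions borrowed from common CoNLL practice, not facts stated in this record, and the function names are hypothetical.

```python
from typing import Iterable, Iterator, List


def read_conll_like(lines: Iterable[str]) -> Iterator[List[List[str]]]:
    """Yield sentences; each sentence is a list of rows, each row a list of fields."""
    sentence: List[List[str]] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            # Assumption: a blank line ends the current sentence, as in CoNLL files.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#") and "\t" not in line:
            # Assumption: tab-free lines starting with '#' are comments (CoNLL-U style).
            continue
        # No column meanings are assumed; fields are kept as raw strings.
        sentence.append(line.split("\t"))
    if sentence:
        yield sentence


def load_corpus(path: str) -> List[List[List[str]]]:
    """Read a whole file into memory. UTF-8 is an assumption, not documented here."""
    with open(path, encoding="utf-8") as handle:
        return list(read_conll_like(handle))
```

Calling `load_corpus("princis2.conll")` on the downloaded file and printing the first few rows is a reasonable way to inspect the actual columns before mapping any of them to Tēzaurs.lv sense IDs.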


 Files in this item

This item is Academic Use and licensed under: CLARIN ACA, Noncommercial
Name princis2.conll
Size 21.92 KB
Format Unknown
Description Word sense annotated corpus in a tab-separated format similar to CoNLL, with senses indexed to the Tēzaurs.lv word sense IDs as of the Tēzaurs.lv 2022 database release (http://hdl.handle.net/20.500.12574/66).
MD5 b87ed0ae9627bffd51883fd1a2428a52
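
The MD5 value and the files.size figure listed in this record can be used to check a downloaded copy. The sketch below uses only the Python standard library; the function names are hypothetical, and it assumes that files.size is given in bytes (22442 bytes is consistent with the 21.92 KB shown).

```python
import hashlib
import os

# Values copied from this item record.
EXPECTED_MD5 = "b87ed0ae9627bffd51883fd1a2428a52"
EXPECTED_SIZE = 22442  # assumption: files.size is a byte count


def md5_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so the whole file never has to sit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path: str) -> bool:
    """True if the file's size and MD5 both match the values in the record."""
    return os.path.getsize(path) == EXPECTED_SIZE and md5_of_file(path) == EXPECTED_MD5
```

`matches_record("princis2.conll")` should return `True` for an intact download. MD5 detects accidental corruption only; it is not a safeguard against deliberate tampering.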