Rādīt vienkāršu vienuma ierakstu
dc.contributor.author | Nešpore, Gunta |
dc.contributor.author | Rituma, Laura |
dc.date.accessioned | 2023-03-01T10:27:07Z |
dc.date.available | 2023-03-01T10:27:07Z |
dc.date.issued | 2023-02-24 |
dc.identifier.uri | http://hdl.handle.net/20.500.12574/80 |
dc.description | Annotation of word senses for a running text corpus of 1200 tokens (beginning of The Little Prince by Antoine de Saint-Exupéry) as an evaluation corpus for Latvian WSD systems. Data is provided in a tab-separated format similar to CoNLL, indexing senses to the Tēzaurs.lv word sense IDs as of Tēzaurs.lv 2022 (http://hdl.handle.net/20.500.12574/66) database release. |
dc.language.iso | lav |
dc.publisher | AiLab IMCS UL |
dc.rights | CLARIN ACA |
dc.rights.uri | https://www.kielipankki.fi/wp-content/uploads/CLARIN_ACA_AFFIL-EDU_NC_NORED_en.html |
dc.rights.label | ACA |
dc.source.uri | https://wordnet.ailab.lv |
dc.subject | word sense disambiguation |
dc.title | Word sense annotated "The Little Prince" fragments in Latvian 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN Centre of Latvian language resources and tools |
contact.person | Pēteris Paikens peteris@ailab.lv AiLab IMCS UL |
sponsor | Latvian Council of Science lzp-2019/1-0464 Latvian WordNet and Word Sense Disambiguation nationalFunds |
size.info | 1208 tokens |
files.size | 22442 |
files.count | 1 |
Faili šajā vienumā
- Vārds
- princis2.conll
- Lielums
- 21.92 KB
- Formāts
- Nezināms
- Apraksts
- Word sense annotated corpus in a tab-separated format similar to CoNLL, indexing senses to the Tēzaurs.lv word sense IDs as of Tēzaurs.lv 2022 (http://hdl.handle.net/20.500.12574/66) database release.
- MD5
- b87ed0ae9627bffd51883fd1a2428a52