Rādīt vienkāršu vienuma ierakstu
| dc.contributor.author | Nešpore, Gunta |
| dc.contributor.author | Rituma, Laura |
| dc.date.accessioned | 2023-03-01T10:27:07Z |
| dc.date.available | 2023-03-01T10:27:07Z |
| dc.date.issued | 2023-02-24 |
| dc.identifier.uri | http://hdl.handle.net/20.500.12574/80 |
| dc.description | Annotation of word senses for a running text corpus of 1200 tokens (beginning of The Little Prince by Antoine de Saint-Exupéry) as an evaluation corpus for Latvian WSD systems. Data is provided in a tab-separated format similar to CoNLL, indexing senses to the Tēzaurs.lv word sense IDs as of Tēzaurs.lv 2022 (http://hdl.handle.net/20.500.12574/66) database release. |
| dc.language.iso | lav |
| dc.publisher | AiLab IMCS UL |
| dc.rights | CLARIN ACA |
| dc.rights.uri | https://www.kielipankki.fi/wp-content/uploads/CLARIN_ACA_AFFIL-EDU_NC_NORED_en.html |
| dc.rights.label | ACA |
| dc.source.uri | https://wordnet.ailab.lv |
| dc.subject | word sense disambiguation |
| dc.title | Word sense annotated "The Little Prince" fragments in Latvian 1.0 |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN Centre of Latvian language resources and tools |
| contact.person | Pēteris Paikens peteris@ailab.lv AiLab IMCS UL |
| sponsor | Latvian Council of Science lzp-2019/1-0464 Latvian WordNet and Word Sense Disambiguation nationalFunds |
| size.info | 1208 tokens |
| files.size | 22442 |
| files.count | 1 |
Faili šajā vienumā
- Vārds
- princis2.conll
- Lielums
- 21.92 KB
- Formāts
- Nezināms
- Apraksts
- Word sense annotated corpus in a tab-separated format similar to CoNLL, indexing senses to the Tēzaurs.lv word sense IDs as of Tēzaurs.lv 2022 (http://hdl.handle.net/20.500.12574/66) database release.
- MD5
- b87ed0ae9627bffd51883fd1a2428a52