| dc.contributor.author | Pretkalniņa, Lauma |
| dc.contributor.author | Nešpore-Bērzkalne, Gunta |
| dc.contributor.author | Pokratniece, Kristīne |
| dc.contributor.author | Rituma, Laura |
| dc.date.accessioned | 2025-11-26T15:23:28Z |
| dc.date.available | 2025-11-26T15:23:28Z |
| dc.date.issued | 2025-11-15 |
| dc.identifier.uri | http://hdl.handle.net/20.500.12574/143 |
| dc.description | This corpus contains 20 Latvian and Latgalian sample sentences annotated with the same hybrid annotation model used in the Latvian Treebank. The sentences are the same ones used in the "Cairo" sample corpora, which showcase annotation choices for Universal Dependencies treebanks, and this corpus serves as the basis for both the UD-Latvian_Cairo and UD-Latgalian_Cairo corpora. Based on the experience with these sentences, preliminary UD annotation documentation for Latgalian was also prepared. This work allows the Latgalian UD data to be used to assess how multilingual tools perform on a language that has no training data, and to serve as a base for further treebank development. |
| dc.language.iso | lav |
| dc.language.iso | ltg |
| dc.publisher | AiLab IMCS UL |
| dc.relation.isreferencedby | https://dspace.lu.lv/handle/7/63034 |
| dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
| dc.rights.uri | http://creativecommons.org/licenses/by-sa/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | https://sintakse.korpuss.lv/ |
| dc.subject | treebank |
| dc.subject | parallel corpus |
| dc.subject | syntax |
| dc.subject | hybrid dependency-constituency grammar |
| dc.subject | dependency |
| dc.subject | constituency |
| dc.subject | manual annotation |
| dc.title | Latvian and Latgalian Parallel Sample Treebank (Cairo) |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN Centre of Latvian language resources and tools |
| contact.person | Lauma Pretkalniņa lauma@ailab.lv AiLab IMCS UL |
| size.info | 20 sentences |
| files.size | 1171889 |
| files.count | 4 |
Files in this item
This item is Publicly Available and is licensed under: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name: LTG-Cairo-PML.zip
- Size: 10.89 KB
- Format: application/zip
- Description: Latgalian sentences in PML format
- MD5: 80cae170fb1b6d497c98be2ccb5854a6
- Name: LV-Cairo-PML.zip
- Size: 10.77 KB
- Format: application/zip
- Description: Latvian sentences in PML format
- MD5: 86a5dff67b70b85bf53bddb1fba66ce7
- Name: lv-treebank.zip
- Size: 37.07 KB
- Format: application/zip
- Description: Extension module for the Treex toolkit and TrEd
- MD5: 9ce6c5a626dd8cea96a23f2a93a0e74d
- icons
- ailab.gif (990 B)
- contrib
- pmllv
- LV_A.mak (5 kB)
- contrib.mac (244 B)
- LV_A_Edit.mak (21 kB)
- LV_M.mak (4 kB)
- LV_A_View.mak (1002 B)
- pmllv
- libs
- PMLLVHelpers.pm (525 B)
- FormChecker.pm (879 B)
- LemmaChecker.pm (2 kB)
- MorphoTags.pm (12 kB)
- SyntaxChecker.pm (11 kB)
- resources
- lvaschema.xml (8 kB)
- lvmschema.xml (3 kB)
- lvwschema.xml (2 kB)
- stylesheets
- lv-m (1 kB)
- lv-a-full-ord (2 kB)
- lv-a-full-compact-ord (2 kB)
- lv-a-edit-ord (2 kB)
- lv-m-full-vert (1 kB)
- lv-a-ord (2 kB)
- lv-a-edit-full-ord (2 kB)
- lv-a-full-compact-light-ord (2 kB)
- lv-a-edit (2 kB)
- lv-a-full-vert-ord (2 kB)
- lv-a-compact-ord (2 kB)
- lv-m-vert (1 kB)
- lv-a-vert-ord (2 kB)
- lv-a-edit-full (2 kB)
- lv-a-full-compact-old-ord (2 kB)
- package.xml (653 B)
- Name: v2.17-docs.zip
- Size: 1.06 MB
- Format: application/zip
- Description: Documentation for the Latvian part of the corpus
- MD5: a6c9030fb7ca375e4ff8600736c80a4e
- docs
- SemTi-Kamols_morphotags.ods (47 kB)
- phrasetags.ods (86 kB)
- roles+phrasetypes.ods (32 kB)
- tok_morph_manual.pdf (118 kB)
- synt_manual.pdf (885 kB)
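
The MD5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch, not part of the repository itself: it assumes the four archives were saved under their listed names in a single local directory, and the helper names `md5_of` and `verify_downloads` are hypothetical ones introduced here for illustration.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the file list of this item.
EXPECTED_MD5 = {
    "LTG-Cairo-PML.zip": "80cae170fb1b6d497c98be2ccb5854a6",
    "LV-Cairo-PML.zip": "86a5dff67b70b85bf53bddb1fba66ce7",
    "lv-treebank.zip": "9ce6c5a626dd8cea96a23f2a93a0e74d",
    "v2.17-docs.zip": "a6c9030fb7ca375e4ff8600736c80a4e",
}


def md5_of(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of(path) == checksum.lower():
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # Assumption: the archives were downloaded into the current directory.
    for name, status in verify_downloads(".").items():
        print(f"{name}: {status}")
```

A `mismatch` result usually means the download was truncated or altered and should be repeated. MD5 is adequate for detecting accidental corruption but is not a safeguard against deliberate tampering.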