Show simple item record

 
dc.contributor.author Pretkalniņa, Lauma
dc.contributor.author Nešpore-Bērzkalne, Gunta
dc.contributor.author Pokratniece, Kristīne
dc.contributor.author Rituma, Laura
dc.date.accessioned 2025-11-26T15:23:28Z
dc.date.available 2025-11-26T15:23:28Z
dc.date.issued 2025-11-15
dc.identifier.uri http://hdl.handle.net/20.500.12574/143
dc.description This corpus contains 20 Latvian and Latgalian sample sentences annotated in the same hybrid annotation model used in Latvian Treebank. Sentences used in this corpora are the same sentences that are used in "Cairo" sample corpora that showcase anntoation choices for Universal Dependency treebanks, and this corpus serves as a basis for both UD-Latvian_Cairo and UD-Latgalian_Cairo corpora. Based on the experience with these sentences, preliminary UD annotation documentation for Latgalian was also prepared. This work allows Latgalian UD data to be used to assess how multilingual tools perform on a language that has no training data and to serve as a base for further treebank development later.
dc.language.iso lav
dc.language.iso ltg
dc.publisher AiLab IMCS UL
dc.relation.isreferencedby https://dspace.lu.lv/handle/7/63034
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://sintakse.korpuss.lv/
dc.subject treebank
dc.subject parallel corpus
dc.subject syntax
dc.subject hybrid dependency-constituency grammar
dc.subject dependency
dc.subject constituency
dc.subject manual annotation
dc.title Latvian and Latgalian Parallel Sample Treebank (Cairo)
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN Centre of Latvian language resources and tools
contact.person Lauma Pretkalniņa lauma@ailab.lv AiLab IMCS UL
size.info 20 sentences
files.size 1171889
files.count 4


 Files in this item

 Download all files in item (1.12 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Icon
Name
LTG-Cairo-PML.zip
Size
10.89 KB
Format
application/zip
Description
Latgalian sentences in PML format
MD5
80cae170fb1b6d497c98be2ccb5854a6
 Download file  Preview
 File Preview  
    • ltg-Cairo.m34 kB
    • ltg-Cairo.w15 kB
    • ltg-Cairo.a70 kB
    • ltg-Cairo.txt936 B
Icon
Name
LV-Cairo-PML.zip
Size
10.77 KB
Format
application/zip
Description
Latvian sentences in PML format
MD5
86a5dff67b70b85bf53bddb1fba66ce7
 Download file  Preview
 File Preview  
    • c70-Cairo.txt963 B
    • c70-Cairo.m34 kB
    • c70-Cairo.w14 kB
    • c70-Cairo.a69 kB
Icon
Name
lv-treebank.zip
Size
37.07 KB
Format
application/zip
Description
Extension module for Treex toolkit and TrEd
MD5
9ce6c5a626dd8cea96a23f2a93a0e74d
 Download file  Preview
 File Preview  
  • icons
    • ailab.gif990 B
  • contrib
    • pmllv
      • LV_A.mak5 kB
      • contrib.mac244 B
      • LV_A_Edit.mak21 kB
      • LV_M.mak4 kB
      • LV_A_View.mak1002 B
  • libs
    • PMLLVHelpers.pm525 B
    • FormChecker.pm879 B
    • LemmaChecker.pm2 kB
    • MorphoTags.pm12 kB
    • SyntaxChecker.pm11 kB
  • resources
    • lvaschema.xml8 kB
    • lvmschema.xml3 kB
    • lvwschema.xml2 kB
  • stylesheets
    • lv-m1 kB
    • lv-a-full-ord2 kB
    • lv-a-full-compact-ord2 kB
    • lv-a-edit-ord2 kB
    • lv-m-full-vert1 kB
    • lv-a-ord2 kB
    • lv-a-edit-full-ord2 kB
    • lv-a-full-compact-light-ord2 kB
    • lv-a-edit2 kB
    • lv-a-full-vert-ord2 kB
    • lv-a-compact-ord2 kB
    • lv-m-vert1 kB
    • lv-a-vert-ord2 kB
    • lv-a-edit-full2 kB
    • lv-a-full-compact-old-ord2 kB
    • package.xml653 B
Icon
Name
v2.17-docs.zip
Size
1.06 MB
Format
application/zip
Description
Documentation for Latvian part of the corpus
MD5
a6c9030fb7ca375e4ff8600736c80a4e
 Download file  Preview
 File Preview  
  • docs
    • SemTi-Kamols_morphotags.ods47 kB
    • phrasetags.ods86 kB
    • roles+phrasetypes.ods32 kB
    • tok_morph_manual.pdf118 kB
    • synt_manual.pdf885 kB

Show simple item record