dc.contributor.author Andronova, Everita
dc.contributor.author Baltiņa, Maija
dc.contributor.author Frīdenberga, Anna
dc.contributor.author Grūzītis, Normunds
dc.contributor.author Ķauķīte, Sintija
dc.contributor.author Pokratniece, Kristīne
dc.contributor.author Pretkalniņa, Lauma
dc.contributor.author Siliņa-Piņķe, Renāte
dc.contributor.author Skrūzmane, Elga
dc.contributor.author Spektors, Andrejs
dc.contributor.author Spektors, Mārtiņš
dc.contributor.author Štrausa, Ilze
dc.contributor.author Trumpa, Anta
dc.contributor.author Trumpa, Edmunds
dc.contributor.author Vanags, Pēteris
dc.date.accessioned 2025-11-20T16:14:51Z
dc.date.available 2025-11-20T16:14:51Z
dc.date.issued 2025-11-27
dc.identifier.uri http://hdl.handle.net/20.500.12574/141
dc.description The Corpus of early written Latvian 'SENIE' provides access to the texts and facsimiles of written Latvian of the 16th–18th centuries. Its aim is to facilitate studies of early Latvian in general and to serve as the basis for 'The Historical dictionary of Latvian (16th–17th cc.)'. The Corpus serves as a unique digital repository of early Latvian texts, whose physical copies are distributed all over the world. The Corpus was first launched in January 2003, and in 2017 it was converted to Unicode. Work on the Corpus continues in various directions, including the addition of new sources. This version contains 102 sources.
dc.language.iso lav
dc.language.iso mul
dc.publisher AiLab IMCS UL
dc.publisher Latvian Language Institute, Faculty of Humanities, University of Latvia
dc.relation.isreferencedby https://www.bjmc.lu.lv/fileadmin/user_upload/lu_portal/projekti/bjmc/Contents/12_4_18_Andronova.pdf
dc.relation.isreferencedby http://ceur-ws.org/Vol-2612/short1.pdf
dc.relation.replaces http://hdl.handle.net/20.500.12574/90
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri http://senie.korpuss.lv/
dc.subject diachronic corpus
dc.subject early written Latvian
dc.subject historical corpus
dc.title The Corpus of Early Written Latvian (2025)
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN Centre of Latvian language resources and tools
demo.uri http://senie.korpuss.lv/
contact.person Everita Andronova everita@ailab.lv AiLab IMCS UL
sponsor The State Research programme VPP-IZM-2018/2-0002 The Latvian Language nationalFunds
sponsor The State Research programme VPP-IZM-DH-2020/1-0001 'Digital resources of humanities: integration and development' nationalFunds
sponsor EU Recovery and Resilience Facility 2.3.1.1.i.0/1/22/I/CFLA/002 Language Technology Initiative euFunds
sponsor Ministry of Education and Science VPP-IZM-DH-2022/1-0002 Towards Development of Open and FAIR Digital Humanities Ecosystem in Latvia (DHELI) nationalFunds
size.info 3236079 tokens
size.info 2363483 words
files.size 24786296
files.count 4


Files in this item

Total size of all files: 23.64 MB
This item is publicly available and licensed under Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0).

Name: Senie-DSL-plaintext.zip
Size: 6.13 MB
Format: application/zip
Description: SENIE plaintext data with @-codes (DSL).
MD5: 4808b4e00b0eec4e9c55dac22f6371d1

Name: SENIE_Unicode.tei.xml.zip
Size: 8.81 MB
Format: application/zip
Description: Corpus data in TEI 5 XML format.
MD5: 1e06063000b041fcd25a1c49ff14fff7
Contents: SENIE_Unicode.tei.xml (61 MB)

Name: SENIE_Unicode_unhyphened.tei.xml.zip
Size: 8.64 MB
Format: application/zip
Description: Corpus data in TEI 5 XML format, hyphenated words contracted.
MD5: 700a878465ae84ba1bbed6faa87ce6be
Contents: SENIE_Unicode_unhyphened.tei.xml (61 MB)

Name: docs.zip
Size: 58.22 KB
Format: application/zip
Description: Metadata and @-code documentation.
MD5: 0ce5df5d74cec3492b96416833f9c4e3
Contents: codes-explained.ods (14 kB), metadata.ods (46 kB)
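
Each file in this item is listed with an MD5 checksum, so a downloaded copy can be compared against the record. The following is a minimal sketch of such a check in Python, not part of the record itself. It assumes the four archives were saved under their listed names in a single directory; the function names `md5_of_file` and `verify_downloads` and the choice of the current directory are illustrative assumptions.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing of this record.
EXPECTED_MD5 = {
    "Senie-DSL-plaintext.zip": "4808b4e00b0eec4e9c55dac22f6371d1",
    "SENIE_Unicode.tei.xml.zip": "1e06063000b041fcd25a1c49ff14fff7",
    "SENIE_Unicode_unhyphened.tei.xml.zip": "700a878465ae84ba1bbed6faa87ce6be",
    "docs.zip": "0ce5df5d74cec3492b96416833f9c4e3",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each listed file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, expected_md5 in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == expected_md5:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # Hypothetical download location: the current working directory.
    for name, status in verify_downloads(".").items():
        print(f"{name}: {status}")
```

A "mismatch" most likely indicates an incomplete or corrupted download; MD5 is suitable for this kind of integrity check but not as a guarantee against deliberate tampering.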
