dc.contributor.author |
Levāne-Petrova, Kristīne |
dc.contributor.author |
Darģis, Roberts |
dc.date.accessioned |
2020-07-24T17:34:09Z |
dc.date.available |
2020-07-24T17:34:09Z |
dc.date.issued |
2018 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12574/11 |
dc.description |
LVK2018 is a balanced and representative 10 million word text corpus of modern Latvian. It represents five different genres: journalism (60%), fiction (20%), scientific (10%), legal (8%), transcriptions (2%). LVK2018 is an extended version of LVK2013. |
dc.language.iso |
lav |
dc.publisher |
AiLab IMCS UL |
dc.relation.isreferencedby |
https://doi.org/10.22364/vnf.10.12 |
dc.source.uri |
http://www.korpuss.lv/id/LVK2018 |
dc.subject |
text |
dc.subject |
corpus |
dc.subject |
general |
dc.subject |
representative |
dc.subject |
morphology |
dc.subject |
reference corpus |
dc.title |
Balanced Corpus of Modern Latvian (LVK2018) |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
hidden |
false |
hasMetadata |
false |
has.files |
no |
branding |
CLARIN Centre of Latvian language resources and tools |
demo.uri |
http://nosketch.korpuss.lv/#dashboard?corpname=LVK2018 |
contact.person |
Kristīne Levāne-Petrova kristine.levane@gmail.com IMCS UL |
sponsor |
European Regional Development Fund 1.1.1.1/16/A/219 Full Stack of Language Resources for Natural Language Understanding and Generation in Latvian euFunds |
size.info |
12289240 tokens |
size.info |
9813014 words |
size.info |
20864 documents |
files.size |
0 |
files.count |
0 |
featuredService.nosketch |
search|https://nosketch.korpuss.lv/#dashboard?corpname=LVK2018 |