Show simple item record

 
dc.contributor.author Laizāns, Mārtiņš
dc.contributor.author Pretkalniņa, Lauma
dc.date.accessioned 2023-02-22T15:26:46Z
dc.date.available 2023-02-22T15:26:46Z
dc.date.issued 2015
dc.identifier.uri http://hdl.handle.net/20.500.12574/79
dc.description Authomaticaly harvested Latvian blog corpus.
dc.language.iso lav
dc.publisher AiLab IMCS UL
dc.rights CLARIN ACA
dc.rights.uri https://www.kielipankki.fi/wp-content/uploads/CLARIN_ACA_AFFIL-EDU_NC_NORED_en.html
dc.rights.label ACA
dc.source.uri http://www.korpuss.lv/id/Emu%C4%81ri
dc.subject text
dc.subject specialized
dc.subject morphology
dc.title Latvian Blog Corpus 2015
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN Centre of Latvian language resources and tools
demo.uri http://nosketch.korpuss.lv/#dashboard?corpname=emuari
contact.person Normunds Grūzītis normundsg@ailab.lv Normunds Grūzītis
size.info 8000000 tokens
size.info 6600000 words
files.size 60481383
files.count 2
featuredService.nosketch search|http://nosketch.korpuss.lv/#dashboard?corpname=emuari


 Files in this item

 Download all files in item (57.68 MB)
This item is
Academic Use
and licensed under:
CLARIN ACA
Noncommercial
Icon
Name
80_percent.txt.zip
Size
19.04 MB
Format
application/zip
Description
Latvian blog corpus in plain text format. Metadata include title and source
MD5
ce6d78ba63560ac92ac434ec414d3f94
 Download file
Icon
Name
80_percent.vert.fixed.txt.zip
Size
38.64 MB
Format
application/zip
Description
Annotated version of the corpus. Contains morphologycal annotation and lemma in tab separated vertical format.
MD5
291dd889a2b0a236ae01896083a468f5
 Download file

Show simple item record