dc.contributor.author | Laizāns, Mārtiņš |
dc.contributor.author | Pretkalniņa, Lauma |
dc.date.accessioned | 2023-02-22T15:26:46Z |
dc.date.available | 2023-02-22T15:26:46Z |
dc.date.issued | 2015 |
dc.identifier.uri | http://hdl.handle.net/20.500.12574/79 |
dc.description | Automatically harvested Latvian blog corpus. |
dc.language.iso | lav |
dc.publisher | AiLab IMCS UL |
dc.rights | CLARIN ACA |
dc.rights.uri | https://www.kielipankki.fi/wp-content/uploads/CLARIN_ACA_AFFIL-EDU_NC_NORED_en.html |
dc.rights.label | ACA |
dc.source.uri | http://www.korpuss.lv/id/Emu%C4%81ri |
dc.subject | text |
dc.subject | specialized |
dc.subject | morphology |
dc.title | Latvian Blog Corpus 2015 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN Centre of Latvian language resources and tools |
demo.uri | http://nosketch.korpuss.lv/#dashboard?corpname=emuari |
contact.person | Normunds Grūzītis normundsg@ailab.lv Normunds Grūzītis |
size.info | 8000000 tokens |
size.info | 6600000 words |
files.size | 60481383 |
files.count | 2 |
featuredService.nosketch | search|http://nosketch.korpuss.lv/#dashboard?corpname=emuari |
Files in this item
Download all files in this item (57.68 MB)
- Name
- 80_percent.txt.zip
- Size
- 19.04 MB
- Format
- application/zip
- Description
- Latvian blog corpus in plain text format. Metadata include the title and source.
- MD5
- ce6d78ba63560ac92ac434ec414d3f94
- Name
- 80_percent.vert.fixed.txt.zip
- Size
- 38.64 MB
- Format
- application/zip
- Description
- Annotated version of the corpus. Contains morphological annotation and lemmas in tab-separated vertical format.
- MD5
- 291dd889a2b0a236ae01896083a468f5
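
The listed MD5 values can be used to check a download, and the vertical file can then be read token by token. The following is a minimal Python sketch, not an official tool of the repository. It assumes several things the record does not state: that each archive holds a single UTF-8 text file, that structural lines are markup such as `<doc ...>`, and that every other line is one token with tab-separated fields. The order of the fields (word form, morphological tag, lemma) is not given here and should be checked against the actual file. The function names are hypothetical.

```python
import hashlib
import io
import zipfile

# MD5 values copied from the file listing in this record.
EXPECTED_MD5 = {
    "80_percent.txt.zip": "ce6d78ba63560ac92ac434ec414d3f94",
    "80_percent.vert.fixed.txt.zip": "291dd889a2b0a236ae01896083a468f5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def iter_vertical(lines):
    """Yield ("tag", text) for structural lines and ("token", fields) otherwise.

    Assumption: a structural line starts with "<", ends with ">" and
    contains no tab; any other non-empty line is one token whose
    fields are separated by tabs.
    """
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line:
            continue
        if line.startswith("<") and line.endswith(">") and "\t" not in line:
            yield ("tag", line)
        else:
            yield ("token", line.split("\t"))


def read_vertical_from_zip(zip_path, encoding="utf-8"):
    """Stream parsed lines from the first member of a zip archive.

    Assumption: the archive contains one text file in the given encoding.
    """
    with zipfile.ZipFile(zip_path) as archive:
        member = archive.namelist()[0]
        with archive.open(member) as binary:
            text = io.TextIOWrapper(binary, encoding=encoding)
            yield from iter_vertical(text)
```

A typical use would be to compare `md5_of_file("80_percent.vert.fixed.txt.zip")` with the matching entry in `EXPECTED_MD5` before iterating over `read_vertical_from_zip(...)`. Streaming from the archive avoids unpacking the whole file to disk first.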