Files in this item
Download all files in item (57.68 MB)- Name
- 80_percent.txt.zip
- Size
- 19.04 MB
- Format
- application/zip
- Description
- Latvian blog corpus in plain text format. Metadata include title and source
- MD5
- ce6d78ba63560ac92ac434ec414d3f94
- Name
- 80_percent.vert.fixed.txt.zip
- Size
- 38.64 MB
- Format
- application/zip
- Description
- Annotated version of the corpus. Contains morphologycal annotation and lemma in tab separated vertical format.
- MD5
- 291dd889a2b0a236ae01896083a468f5