Faili šajā vienumā
Lejupielādēt visus vienuma failus (57.68 MB)- Vārds
- 80_percent.txt.zip
- Lielums
- 19.04 MB
- Formāts
- application/zip
- Apraksts
- Latvian blog corpus in plain text format. Metadata include title and source
- MD5
- ce6d78ba63560ac92ac434ec414d3f94
- Vārds
- 80_percent.vert.fixed.txt.zip
- Lielums
- 38.64 MB
- Formāts
- application/zip
- Apraksts
- Annotated version of the corpus. Contains morphologycal annotation and lemma in tab separated vertical format.
- MD5
- 291dd889a2b0a236ae01896083a468f5