dc.contributor.author | Gunta, Nešpore-Bērzkalne |
dc.contributor.author | Skadiņa, Inguna |
dc.contributor.author | Grūzītis, Normunds |
dc.contributor.author | Znotiņš, Artūrs |
dc.contributor.author | Goško, Didzis |
dc.date.accessioned | 2021-06-30T19:10:06Z |
dc.date.available | 2021-06-30T19:10:06Z |
dc.date.issued | 2021 |
dc.identifier.uri | http://hdl.handle.net/20.500.12574/47 |
dc.description | This multi-targeted dataset contains several datasets that allow to train goal-oriented dialogue systems for student service domain in Latvian. The dataset contains a manually annotated dataset of domain-specific dialog intents, a manually created and annotated dataset of generalised and formalised dialog scenarios based on corpus evidence, dataset for FAQ module training. |
dc.language.iso | lav |
dc.publisher | AiLab IMCS UL |
dc.relation.isreferencedby | https://ebooks.iospress.nl/volumearticle/55530 |
dc.relation.isreferencedby | https://www.aclweb.org/anthology/2021.eacl-demos.35/ |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-sa/4.0/ |
dc.rights.label | PUB |
dc.source.uri | http://bots.ailab.lv/ |
dc.subject | dialogue |
dc.subject | named entities |
dc.subject | intents |
dc.subject | FrameNet |
dc.title | LUIS: data collection for task oriented dialogue system creation |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
hidden | false |
hasMetadata | false |
has.files | yes |
branding | CLARIN Centre of Latvian language resources and tools |
demo.uri | http://bots.ailab.lv/ |
contact.person | Inguna Skadiņa inguna.skadina@lumii.lv IMCS UL |
sponsor | Latvian Council of Science lzp2018/2-0216 Latvian Language Understanding and Generation in Human-Computer Interaction nationalFunds |
sponsor | COST Action CA18231 Multi3Generation: Multi-task, Multilingual, Multi-modal Language Generation euFunds |
files.size | 453402 |
files.count | 5 |
Files in this item
Download all files in item (442.78 KB)This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- intents_faq.zip
- Size
- 5.06 KB
- Format
- application/zip
- Description
- Training, evaluation and test sets for FAQ module training. CSV format.
- MD5
- 1bf3bb394100d5875b71aa43a2f4d0cf
- intents_faq
- train.csv8 kB
- intents_all.csv14 kB
- test.csv2 kB
- valid.csv2 kB
- Name
- intents_forum.zip
- Size
- 6.12 KB
- Format
- application/zip
- Description
- Training, validation and test sets for intent detection.
- MD5
- c3c4460944f0684c20a2bb09151589fc
- intents_forum
- train.csv12 kB
- test.csv3 kB
- valid.csv1 kB
- Name
- dialogs.json
- Size
- 160.93 KB
- Format
- Unknown
- Description
- Manually annotated dataset of dialogs in Json format.
- MD5
- 4e3a7026fdd21157396ad4a4b56b3bc6
- Name
- dialogs.yaml
- Size
- 91.92 KB
- Format
- Unknown
- Description
- Manually annotated dataset of dialogs in YAML format.
- MD5
- 1ae2033006c2b48cdf0a623e63cb9428
- Name
- output-Luiss.json
- Size
- 178.74 KB
- Format
- Unknown
- Description
- The student service domain dataset consisting of approx. 300 written conversations representing 3 frequent intent classes: working hours, document submission, academic leave.
- MD5
- 10386b5e577fbcbdfb1b8656c7f1ee76