| dc.contributor.author | Gunta, Nešpore-Bērzkalne |
| dc.contributor.author | Skadiņa, Inguna |
| dc.contributor.author | Grūzītis, Normunds |
| dc.contributor.author | Znotiņš, Artūrs |
| dc.contributor.author | Goško, Didzis |
| dc.date.accessioned | 2021-06-30T19:10:06Z |
| dc.date.available | 2021-06-30T19:10:06Z |
| dc.date.issued | 2021 |
| dc.identifier.uri | http://hdl.handle.net/20.500.12574/47 |
| dc.description | This multi-targeted dataset contains several datasets that allow to train goal-oriented dialogue systems for student service domain in Latvian. The dataset contains a manually annotated dataset of domain-specific dialog intents, a manually created and annotated dataset of generalised and formalised dialog scenarios based on corpus evidence, dataset for FAQ module training. |
| dc.language.iso | lav |
| dc.publisher | AiLab IMCS UL |
| dc.relation.isreferencedby | https://ebooks.iospress.nl/volumearticle/55530 |
| dc.relation.isreferencedby | https://www.aclweb.org/anthology/2021.eacl-demos.35/ |
| dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
| dc.rights.uri | http://creativecommons.org/licenses/by-sa/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | http://bots.ailab.lv/ |
| dc.subject | dialogue |
| dc.subject | named entities |
| dc.subject | intents |
| dc.subject | FrameNet |
| dc.title | LUIS: data collection for task oriented dialogue system creation |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| hidden | false |
| hasMetadata | false |
| has.files | yes |
| branding | CLARIN Centre of Latvian language resources and tools |
| demo.uri | http://bots.ailab.lv/ |
| contact.person | Inguna Skadiņa inguna.skadina@lumii.lv IMCS UL |
| sponsor | Latvian Council of Science lzp2018/2-0216 Latvian Language Understanding and Generation in Human-Computer Interaction nationalFunds |
| sponsor | COST Action CA18231 Multi3Generation: Multi-task, Multilingual, Multi-modal Language Generation euFunds |
| files.size | 453402 |
| files.count | 5 |
Files in this item
Download all files in item (442.78 KB)This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- intents_faq.zip
- Size
- 5.06 KB
- Format
- application/zip
- Description
- Training, evaluation and test sets for FAQ module training. CSV format.
- MD5
- 1bf3bb394100d5875b71aa43a2f4d0cf
- intents_faq
- train.csv8 kB
- intents_all.csv14 kB
- test.csv2 kB
- valid.csv2 kB
- Name
- intents_forum.zip
- Size
- 6.12 KB
- Format
- application/zip
- Description
- Training, validation and test sets for intent detection.
- MD5
- c3c4460944f0684c20a2bb09151589fc
- intents_forum
- train.csv12 kB
- test.csv3 kB
- valid.csv1 kB
- Name
- dialogs.json
- Size
- 160.93 KB
- Format
- Unknown
- Description
- Manually annotated dataset of dialogs in Json format.
- MD5
- 4e3a7026fdd21157396ad4a4b56b3bc6
- Name
- dialogs.yaml
- Size
- 91.92 KB
- Format
- Unknown
- Description
- Manually annotated dataset of dialogs in YAML format.
- MD5
- 1ae2033006c2b48cdf0a623e63cb9428
- Name
- output-Luiss.json
- Size
- 178.74 KB
- Format
- Unknown
- Description
- The student service domain dataset consisting of approx. 300 written conversations representing 3 frequent intent classes: working hours, document submission, academic leave.
- MD5
- 10386b5e577fbcbdfb1b8656c7f1ee76