Show simple item record

 
dc.contributor.author Gunta, Nešpore-Bērzkalne
dc.contributor.author Skadiņa, Inguna
dc.contributor.author Grūzītis, Normunds
dc.contributor.author Znotiņš, Artūrs
dc.contributor.author Goško, Didzis
dc.date.accessioned 2021-06-30T19:10:06Z
dc.date.available 2021-06-30T19:10:06Z
dc.date.issued 2021
dc.identifier.uri http://hdl.handle.net/20.500.12574/47
dc.description This multi-targeted dataset contains several datasets that allow to train goal-oriented dialogue systems for student service domain in Latvian. The dataset contains a manually annotated dataset of domain-specific dialog intents, a manually created and annotated dataset of generalised and formalised dialog scenarios based on corpus evidence, dataset for FAQ module training.
dc.language.iso lav
dc.publisher AiLab IMCS UL
dc.relation.isreferencedby https://ebooks.iospress.nl/volumearticle/55530
dc.relation.isreferencedby https://www.aclweb.org/anthology/2021.eacl-demos.35/
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri http://bots.ailab.lv/
dc.subject dialogue
dc.subject named entities
dc.subject intents
dc.subject FrameNet
dc.title LUIS: data collection for task oriented dialogue system creation
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN Centre of Latvian language resources and tools
demo.uri http://bots.ailab.lv/
contact.person Inguna Skadiņa inguna.skadina@lumii.lv IMCS UL
sponsor Latvian Council of Science lzp2018/2-0216 Latvian Language Understanding and Generation in Human-Computer Interaction nationalFunds
sponsor COST Action CA18231 Multi3Generation: Multi-task, Multilingual, Multi-modal Language Generation euFunds
files.size 453402
files.count 5


 Files in this item

 Download all files in item (442.78 KB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Icon
Name
intents_faq.zip
Size
5.06 KB
Format
application/zip
Description
Training, evaluation and test sets for FAQ module training. CSV format.
MD5
1bf3bb394100d5875b71aa43a2f4d0cf
 Download file  Preview
 File Preview  
  • intents_faq
    • train.csv8 kB
    • intents_all.csv14 kB
    • test.csv2 kB
    • valid.csv2 kB
Icon
Name
intents_forum.zip
Size
6.12 KB
Format
application/zip
Description
Training, validation and test sets for intent detection.
MD5
c3c4460944f0684c20a2bb09151589fc
 Download file  Preview
 File Preview  
Icon
Name
dialogs.json
Size
160.93 KB
Format
Unknown
Description
Manually annotated dataset of dialogs in Json format.
MD5
4e3a7026fdd21157396ad4a4b56b3bc6
 Download file
Icon
Name
dialogs.yaml
Size
91.92 KB
Format
Unknown
Description
Manually annotated dataset of dialogs in YAML format.
MD5
1ae2033006c2b48cdf0a623e63cb9428
 Download file
Icon
Name
output-Luiss.json
Size
178.74 KB
Format
Unknown
Description
The student service domain dataset consisting of approx. 300 written conversations representing 3 frequent intent classes: working hours, document submission, academic leave.
MD5
10386b5e577fbcbdfb1b8656c7f1ee76
 Download file

Show simple item record