README

This project is a collection of three corpora which can be used for evaluating chatbots or other conversational interfaces. Two of the corpora were extracted from StackExchange, one from a Telegram chatbot.

If you use the data and publish please let us know and cite our SIGdial 2017 paper:

@InProceedings{braun-EtAl:2017:SIGDIAL,
  author    = {Braun, Daniel  and  Hernandez-Mendez, Adrian  and  Matthes, Florian  and  Langen, Manfred},
  title     = {Evaluating Natural Language Understanding Services for Conversational Question Answering Systems},
  booktitle = {Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue},
  month     = {August},
  year      = {2017},
  address   = {Saarbrücken, Germany},
  publisher = {Association for Computational Linguistics},
  pages     = {174--185},
  url       = {http://www.aclweb.org/anthology/W17-3622}
}

Errata

There is an error in Table 5 of the paper. In the "true +" column, the overall sum should be 573, not 820, and accordingly precision, recall, and f-score are 0.92, 0.85, and 0.88.

[The reason for this error is in the Excel evaluation sheet, the total number of "true +" (573) was stored as number of "true +" for the chatbot corpus. Added up with the result for the other corpora (77, 170) we end up with 820.]

License

All three corpora are released under the CC BY-SA 3.0 license.

Content

Ask Ubuntu Corpus

162 questions and answers from https://askubuntu.com.

Five intents (MakeUpdate, SetupPrinter, ShutdownComputer, SoftwareRecommendation, None) and three entity types (Printer, Software, Version).

Web Applications Corpus

89 questions and answers from https://webapps.stackexchange.com.

Eight intents (ChangePassword, DeleteAccount, DownloadVideo, ExportData, FilterSpam, FindAlternative, SyncAccounts, None) and three entity types (WebService, OS, Browser).

Chatbot Corpus

206 questions from a Telegram chatbot for public transport in Munich.

Two intents (Departure Time, Find Connection) and five entity types (StationStart, StationDest, Criterion, Vehicle, Line).

Evaluation Scripts

Python scripts for automated evaluation are provided here.

Contact Information

If you have any questions, please contact:

Daniel Braun (Technical University of Munich) [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
original		original
scripts/benchmark		scripts/benchmark
utils		utils
.gitignore		.gitignore
.replrc.js		.replrc.js
AskUbuntuCorpus.json		AskUbuntuCorpus.json
ChatbotCorpus.json		ChatbotCorpus.json
LICENSE		LICENSE
README.md		README.md
WebApplicationsCorpus.json		WebApplicationsCorpus.json
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

Errata

License

Content

Ask Ubuntu Corpus

Web Applications Corpus

Chatbot Corpus

Evaluation Scripts

Contact Information

About

Releases

Packages

Languages

License

mauna-ai/NLU-Evaluation-Corpora

Folders and files

Latest commit

History

Repository files navigation

README

Errata

License

Content

Ask Ubuntu Corpus

Web Applications Corpus

Chatbot Corpus

Evaluation Scripts

Contact Information

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages