Create process to allow editors to update the confidence level of documents #107

Open
markovial opened this issue Jul 25, 2023 · 1 comment

Comments

@markovial

While going through the GPT-3.5 vs. GPT-4 comparison, the editors found that certain sources should not be in the dataset at all. This could be partly mitigated by simply filtering out everything that does not carry an AI or AI safety tag.

It is still worth thinking through a long-term process that gives certain trusted people an automated way to either blacklist sources or add new ones.

@mruwnik
Collaborator

mruwnik commented Jul 30, 2023

There is a confidence column for each article, the idea being that editors can set it to a value between 0 and 1. Then either the chatbot would only use items with a confidence above some threshold, e.g. > 0.8, or items with low confidence could simply be removed from the dataset. Though of course that requires a decent interface for manipulating the data...
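
To make the confidence idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `Article` class, the `set_confidence` and `usable_articles` helpers, and the default confidence of 1.0 for unreviewed items are hypothetical and do not reflect the project's actual schema or code. Only the 0 to 1 range and the example threshold of 0.8 come from the comment above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Article:
    """Hypothetical stand-in for a dataset row with a confidence column."""
    title: str
    # Assumption: unreviewed articles start fully trusted (1.0).
    confidence: float = 1.0


def set_confidence(article: Article, value: float) -> None:
    """Editor action: set the confidence, enforcing the 0..1 range."""
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"confidence must be between 0 and 1, got {value}")
    article.confidence = value


def usable_articles(articles: List[Article], threshold: float = 0.8) -> List[Article]:
    """Return only the articles whose confidence is strictly above the threshold."""
    return [a for a in articles if a.confidence > threshold]


if __name__ == "__main__":
    articles = [Article("trusted post"), Article("off-topic post")]
    # An editor downgrades a source that should not be used.
    set_confidence(articles[1], 0.2)
    for article in usable_articles(articles):
        print(article.title)
```

The same filter could back either option mentioned above: applied at query time it restricts what the chatbot uses, and applied once to the whole dataset it removes the low-confidence items.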

@ccstan99 changed the title from "Create a dataset curation process" to "Create process to allow editors to update the confidence level of documents" on Aug 25, 2023