Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How could we expand this to create human language QA automation scripts? #9

Open
stefangrotz opened this issue Nov 9, 2023 · 4 comments

Comments

@stefangrotz
Copy link

This is a very interesting project. I wonder if there is a easy way to create human language QA scripts that are still reliable. What are your thoughts?

I will experiment with it and document the results here.

@stefangrotz
Copy link
Author

stefangrotz commented Nov 9, 2023

Ideas I am testing today:

  • Basic smoke test (clicking through all pages of a website and search for obvious errors)
  • Use the output to write test reports.

More advanced stuff for later:

  • Optical accessibility testing (does with website work for a color blind person?)
  • In general: more analysis during browsing
  • Convert user stories into test plans and execute them
  • compare the behavior between different versions of a website

@stefangrotz
Copy link
Author

stefangrotz commented Nov 9, 2023

grafik
simple clicking through all pages doesn't seem to work right now. I will try a simpler website next.

I selected the German Wikipedia because it still uses the old design and the menu doesn't has to be opened with the burger icon, which didn't work well in previous attempts.

EDIT: it seems to be a prompting issue. "one after another" doesn't work, but If I explicitly say click the first link on the left menu and then the second, then it works most of the time

EDIT2: I am on the end of my 100 request rate limit for vision for today, but this was fun. I believe if you add an analysis field to the JSON response this could become a useful tool for QA for some use cases

@ishan0102
Copy link
Owner

This is really interesting, thanks for working on this! I think QA automation is possible but somewhat challenging, as you said the prompt design is really important and GPT-4V is finnicky. I think it's possible that passing the accessibility tree could make this easier to solve, but not sure.

@stefangrotz
Copy link
Author

Small update: I think this has been solved much better already by the self-operating-computer project.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants