-
Notifications
You must be signed in to change notification settings - Fork 781
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Task/jailbreak adv sim #3455
base: main
Are you sure you want to change the base?
Task/jailbreak adv sim #3455
Conversation
promptflow-evals test result 9 files - 3 9 suites - 3 2h 38m 36s ⏱️ + 2h 17m 55s Results for commit c5f142f. ± Comparison against base commit ac574fa. This pull request removes 84 and adds 54 tests. Note that renamed tests count towards both.
♻️ This comment has been updated with latest results. |
src/promptflow-evals/promptflow/evals/synthetic/adversarial_simulator.py
Outdated
Show resolved
Hide resolved
src/promptflow-evals/promptflow/evals/synthetic/adversarial_simulator.py
Outdated
Show resolved
Hide resolved
src/promptflow-evals/promptflow/evals/synthetic/_conversation/__init__.py
Outdated
Show resolved
Hide resolved
src/promptflow-evals/promptflow/evals/synthetic/jailbreak_adversarial_simulator.py
Outdated
Show resolved
Hide resolved
* "subscription_id": Azure subscription ID. | ||
* "resource_group_name": Name of the Azure resource group. | ||
* "project_name": Name of the Azure Machine Learning workspace. | ||
* "credential": Azure credentials object for authentication. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be a separate param ?
target: Callable, | ||
max_conversation_turns: int = 1, | ||
max_simulation_results: int = 3, | ||
api_call_retry_limit: int = 3, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we think about them being some kind of retry config param to make it cleaner ?
Description
Adding an additional way to utilize the jailbreak feature of the adversarial simulator.
Our customers who use the Adversarial Simulator with the
jailbreak=True
option, always try to run the simulator twice, to simulate with and without jailbreak.This PR introduces a new
JailbreakAdversarialSimulator
which runs the simulator twice. Once withjailbreak=True
and once withjailbreak=False
to accomplish our customers' goals of running the simulator for an adversarial dataset.All Promptflow Contribution checklist:
General Guidelines and Best Practices
Testing Guidelines