PII evaluation functionality #141
@@ -21,6 +21,7 @@ rerankers[flashrank]
 datasets
 torch
 gradio
+huggingface-hub
Not using this in this PR, but will in a separate one for the HF deployment.
@@ -45,7 +45,7 @@ def generate_synthetic_queries(

     with Pipeline(name="generate_embedding_queries") as pipeline:
         load_dataset = LoadDataFromHub(
-            # num_examples=20,  # use this for demo purposes
+            num_examples=40,  # use this for demo purposes
Feel free to leave this commented out if you'd like the full run.
This looks super nice :)
…ml-projects into feature/add-more-evaluation
🦭
Added PII detection as part of the synthetic data generation step.
Short video overview: https://www.loom.com/share/7b68ea1d069b42a4872791a64895c19c?sid=42c7995d-caf5-4c6a-8497-ea7a213f15ee
Pipeline run where I ran this: https://cloud.zenml.io/organizations/fc992c14-d960-4db7-812e-8f070c99c6f0/tenants/8a462fb6-b1fe-48df-9677-edc76bc8352d/runs/e3e4d459-d7e6-4a93-8586-b78ea803069b?tab=overview
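For readers who want a rough idea of what a PII check over generated queries can look like, here is a minimal regex-based sketch. It is not the code from this PR: the names `PII_PATTERNS`, `detect_pii`, and `pii_summary` are hypothetical, and the three patterns are simplified assumptions that will miss many real-world PII formats.

```python
from __future__ import annotations

import re

# Hypothetical, simplified patterns for illustration only (not taken from the PR).
PII_PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "ipv4": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def detect_pii(text: str) -> dict[str, list[str]]:
    """Return the matches found in `text`, keyed by PII category.

    Categories with no match are omitted from the result.
    """
    found: dict[str, list[str]] = {}
    for label, pattern in PII_PATTERNS.items():
        matches = pattern.findall(text)
        if matches:
            found[label] = matches
    return found


def pii_summary(queries: list[str]) -> dict[str, int]:
    """Count how many queries contain at least one match per category."""
    counts = {label: 0 for label in PII_PATTERNS}
    for query in queries:
        for label in detect_pii(query):
            counts[label] += 1
    return counts
```

A summary like this could be computed over the synthetic queries after generation and logged alongside the dataset, so a run with unexpected PII hits is easy to spot.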