Assess the existing Clarifai tags #4647

Open
AetherUnbound opened this issue Jul 22, 2024 · 0 comments
Labels
🗄️ aspect: data Concerns the data in our catalog and/or databases 🌟 goal: addition Addition of new feature 🟨 priority: medium Not blocking but should be addressed soon 🧱 stack: catalog Related to the catalog and Airflow DAGs 🧱 stack: ingestion server Related to the ingestion/data refresh server

Comments

@AetherUnbound
Collaborator

Description

Once the filtering is in place (#4646, though this issue is not necessarily blocked by it), we can construct an exhaustive set of Clarifai labels and determine exclusions for that provider using the approach described in the IP. The Clarifai label exclusions can then be added to #4541 in the same way Rekognition’s are added, and the blanket exclusion for all tags from that provider can be lifted. These exclusion lists could be combined into a single filtering step, or we could keep individual filter lists for each label provider. My preference is the former, since a single combined list serves as a more exhaustive exclusion list.
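
To make the two steps above concrete, here is a rough sketch of what gathering the distinct labels for one provider and then applying a single combined exclusion set might look like. Everything in it is an assumption for illustration: the tag shape (`name` and `provider` keys), the function names, and the placeholder labels are hypothetical and are not taken from the catalog code or from #4541.

```python
from collections import Counter


def normalize(label):
    """Normalize a label so casing and stray whitespace do not create duplicates."""
    return label.strip().lower()


def collect_labels(records, provider):
    """Count every distinct label emitted by one provider.

    `records` is assumed to be an iterable of tag lists, where each tag is a
    dict with hypothetical "name" and "provider" keys.
    """
    counts = Counter()
    for tags in records:
        for tag in tags:
            if tag.get("provider") == provider:
                counts[normalize(tag["name"])] += 1
    return counts


def filter_tags(tags, excluded):
    """Drop any tag whose normalized name is in the combined exclusion set.

    The provider is deliberately ignored here: one list applies to all tags.
    """
    return [tag for tag in tags if normalize(tag["name"]) not in excluded]


# Placeholder exclusion sets; real entries would come from the review process.
REKOGNITION_EXCLUDED = {"label-a"}
CLARIFAI_EXCLUDED = {"label-x"}
COMBINED_EXCLUDED = REKOGNITION_EXCLUDED | CLARIFAI_EXCLUDED

sample_records = [
    [{"name": "Cat", "provider": "clarifai"}, {"name": "tree", "provider": "flickr"}],
    [{"name": "cat ", "provider": "clarifai"}, {"name": "Label-X", "provider": "clarifai"}],
]

clarifai_labels = collect_labels(sample_records, "clarifai")
kept = filter_tags(sample_records[1], COMBINED_EXCLUDED)
```

With a single combined set, a label excluded for one provider is also dropped if another provider happens to emit it, which is the sense in which the combined list is more exhaustive. Per-provider lists would instead need a lookup keyed on the tag's provider before filtering.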

Additional context

See this section of the IP.

@AetherUnbound AetherUnbound added 🌟 goal: addition Addition of new feature 🗄️ aspect: data Concerns the data in our catalog and/or databases 🟨 priority: medium Not blocking but should be addressed soon 🧱 stack: catalog Related to the catalog and Airflow DAGs 🧱 stack: ingestion server Related to the ingestion/data refresh server labels Jul 22, 2024
Projects
Status: 📅 To Do