
Relevancy Experimentation Framework #421

Open
2 tasks
obulat opened this issue Feb 18, 2023 · 0 comments
Labels

  • 🌟 goal: addition (Addition of new feature)
  • 🧭 project: thread (An issue used to track a project and its progress)
  • 🧱 stack: analytics (Related to the analytics setup)
  • 🧱 stack: api (Related to the Django API)
  • 🧱 stack: catalog (Related to the catalog and Airflow DAGs)
  • 🧱 stack: frontend (Related to the Nuxt frontend)
  • 🧱 stack: infra (Related to the Terraform config and other infrastructure)

Comments

@obulat
Contributor

obulat commented Feb 18, 2023

Summary

Develop an experimentation framework for assessing how relevancy is affected, positively or negatively, by changes to our search algorithms and to the data itself.

Description

The search relevancy sandbox project outlines the infrastructure pieces necessary for making rapid changes to our search algorithms and components. The next step in leveraging this toolbox is to develop an experimentation framework for assessing how relevancy shifts when certain changes are made. This could be done in two parts:

  • A hybrid automated + manual process which takes snapshots of specific result sets in staging, then prompts a maintainer to compare the relevancy between the two snapshots. This is intended to be a blunt approach: all that's necessary is asking whether the results still feel relevant. If the results are wildly irrelevant where they previously were not, then we do not roll the proposed change out to production.

  • An automated assessment of relevancy based on A/B experimentation, paired with the results from our analytics. This would use metrics like SELECT_SEARCH_RESULT, LOAD_MORE_RESULTS, and any new metrics we require to assess whether users are clicking through results at a higher or lower rate given any changes.

The former would act as more of a "gut check" with little nuance, while the latter would serve as a longer-running assessment of result relevancy based on actual user behavior.
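As a sketch of how the automated assessment might work: assuming we can aggregate event counts per experiment arm (the event name SELECT_SEARCH_RESULT is from this issue, but the aggregation and the choice of a two-proportion z-test are illustrative assumptions, not a project decision), we could flag whether the experiment arm's click-through rate differs meaningfully from the control arm's.

```python
# Hedged sketch: the counts here would come from aggregating analytics
# events (e.g. SELECT_SEARCH_RESULT per search) for each A/B arm; the
# statistical test and threshold are illustrative, not decided.
from math import sqrt


def ctr_delta_significant(clicks_a, searches_a, clicks_b, searches_b, z_crit=1.96):
    """Two-proportion z-test on click-through rates.

    Arm "a" is the control, arm "b" is the experiment. Returns the z
    statistic and whether the difference clears the critical value
    (1.96 corresponds to a two-sided test at p < 0.05).
    """
    p_a = clicks_a / searches_a
    p_b = clicks_b / searches_b
    # Pooled proportion under the null hypothesis of equal CTRs.
    p_pool = (clicks_a + clicks_b) / (searches_a + searches_b)
    se = sqrt(p_pool * (1 - p_pool) * (1 / searches_a + 1 / searches_b))
    z = (p_b - p_a) / se
    return z, abs(z) > z_crit


# Example: the experiment arm clicks through noticeably more often.
z, significant = ctr_delta_significant(480, 1000, 560, 1000)
```

A positive, significant z would suggest the change improved click-through; a negative one would be a signal to investigate before rolling out.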

Best guess at list of implementation plans:

  • Develop algorithm for assessing relevancy given analytics
  • Staging snapshot automation & manual assessment protocols
  • Infrastructure changes necessary for performing A/B experimentation
  • Framework for combining relevancy assessment metric and infrastructure changes to accurately run an experiment
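For the snapshot half of the plan, the comparison step could be as simple as measuring how much the top results changed between two staging snapshots and only prompting a maintainer when the change is large. A minimal sketch (the overlap metric, `top_k`, and threshold are illustrative assumptions, not project decisions):

```python
# Hedged sketch of the staging snapshot "gut check" trigger. Snapshots
# are represented as ordered lists of result IDs; how they are captured
# and stored is out of scope here.


def snapshot_overlap(before, after, top_k=20):
    """Jaccard overlap of the top-k result IDs from two snapshots.

    1.0 means the top results are identical (ignoring order); 0.0 means
    they share nothing.
    """
    a, b = set(before[:top_k]), set(after[:top_k])
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def needs_manual_review(before, after, threshold=0.5):
    """Flag a result set for manual relevancy review when the top
    results changed substantially. The 0.5 threshold is hypothetical."""
    return snapshot_overlap(before, after) < threshold


# Example: 2 shared IDs out of 6 distinct ones is below the threshold,
# so a maintainer would be prompted to compare the two snapshots.
flagged = needs_manual_review(["a", "b", "c", "d"], ["a", "b", "x", "y"])
```

A rank-aware metric (e.g. rank-biased overlap) could replace plain Jaccard if position changes within the top results turn out to matter.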

Documents

  • Project Proposal
  • Implementation Plan

Issues

Prior Art

@obulat obulat added the 🧭 project: thread An issue used to track a project and its progress label Feb 18, 2023
dhruvkb pushed a commit that referenced this issue Apr 14, 2023
* Enable xcom pickling

* Update duration reporting
@AetherUnbound AetherUnbound changed the title Relevancy Metrics and Reporting Relevancy Experimentation Framework Dec 19, 2023
@AetherUnbound AetherUnbound added 🌟 goal: addition Addition of new feature 🧱 stack: api Related to the Django API 🧱 stack: frontend Related to the Nuxt frontend labels Dec 19, 2023
@zackkrida zackkrida added 🧱 stack: catalog Related to the catalog and Airflow DAGs 🧱 stack: analytics Related to the analytics setup 🧱 stack: infra Related to the Terraform config and other infrastructure labels Jul 31, 2024
Projects
Status: ⌛ Todo
Development

No branches or pull requests

3 participants