Testing current Score in our Schema #79

JolanThomassin · 2024-03-11T19:19:58Z

Test our schema scoring to see if they are here/accurate.

Schema tested: Louis_v005

Pick a list of random chunk/crawl

Test our score weights

Evaluation Criteria

Similarity: ?
Recency: Compare dates and scores to ensure alignment.
Traffic: ?
Current: Confirm that archived documents receive a score of 0.
Typicality: Compare the average number of site references within the dataset and verify if documents with more references receive higher scores.

JolanThomassin · 2024-03-14T17:21:02Z

For now, when I check Similarity and Traffic, I'm having trouble understanding how they have been implemented in the schema and, therefore, how to evaluate them. Do you have a solution, @rngadam?

Similarity: Represents the primary signal in our scoring mechanism. It denotes the measurement of how closely a document aligns with the user's query, reflecting the relevance of the document to the search criteria. It is not a static precomputed score but a dynamic metric that is computed on the fly from the search results. This ensures that the relevance is tied to the specific query, going beyond simple keyword matching. Documents with higher similarity scores are considered more relevant. By prioritizing similarity as the first signal in our scoring process, we aim to deliver search results that are more accurate.

Traffic: The frequency with which users consult a document influences its score. Popular or frequently accessed documents are given higher scores with the help of web traffic logs, indicating their relevance and importance to users. Warning: The home page is rated really high since it's where every user land at first.

rngadam · 2024-03-21T15:13:03Z

Similarity is dynamic, it cannot by definition be precomputed. we the 1 - cosine distance between chunk vector and query vector for this:

ailab-db/sql/2024-03-20-weighted-search-updated.sql

Line 75 in ce18b09

    
           select s.crawl_id as id, s.chunk_id, 'similarity'::score_type as score_type, s.score as score

Traffic comes from the webserver logs. I'm imported and computed a score per page in the past already:

https://github.com/ai-cfia/ailab-db/blob/main/sql/compute-traffic-score.sql

JolanThomassin · 2024-04-25T13:25:49Z

After testing the score seems to be accurate.

JolanThomassin added this to Database Mar 11, 2024

JolanThomassin moved this to In Progress in Database Mar 11, 2024

JolanThomassin self-assigned this Mar 11, 2024

JolanThomassin added this to the louis_v005 milestone Mar 11, 2024

JolanThomassin added a commit that referenced this issue Mar 14, 2024

Fixes #79, SQL query getting random crawl and score

1836081

JolanThomassin added a commit that referenced this issue Mar 14, 2024

Fixes #79, New script printing random crawl

1510b8d

JolanThomassin linked a pull request Mar 14, 2024 that will close this issue

Issue #79, Testing Current Score #84

Draft

6 tasks

JolanThomassin added this to Finesse Mar 14, 2024

JolanThomassin moved this to In Progress in Finesse Mar 14, 2024

rngadam mentioned this issue Mar 21, 2024

Issue #79, Testing Current Score #84

Draft

6 tasks

JolanThomassin closed this as completed Apr 25, 2024

github-project-automation bot moved this from In Progress to Done in Finesse Apr 25, 2024

github-project-automation bot moved this from In Progress to Done in Database Apr 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Testing current Score in our Schema #79

Testing current Score in our Schema #79

JolanThomassin commented Mar 11, 2024 •

edited

Loading

JolanThomassin commented Mar 14, 2024

rngadam commented Mar 21, 2024

JolanThomassin commented Apr 25, 2024

Testing current Score in our Schema #79

Testing current Score in our Schema #79

Comments

JolanThomassin commented Mar 11, 2024 • edited Loading

Test our schema scoring to see if they are here/accurate.

Test our score weights

Evaluation Criteria

JolanThomassin commented Mar 14, 2024

rngadam commented Mar 21, 2024

JolanThomassin commented Apr 25, 2024

JolanThomassin commented Mar 11, 2024 •

edited

Loading