Issue #79, Testing Current Score #84

JolanThomassin · 2024-03-14T18:44:18Z

issue #79

Test our schema scoring to see if they are here/accurate.

Schema tested: Louis_v005

Pick a list of random chunk/crawl

Test our score weights

Evaluation Criteria

Similarity: ?
Recency: Compare dates and scores to ensure alignment.
Traffic: ?
Current: Confirm that archived documents receive a score of 0.
Typicality: Compare the average number of site references within the dataset and verify if documents with more references receive higher scores.

rngadam · 2024-03-21T15:22:04Z

ailab/db/finesse/test_queries/__init__.py

+    """
+
+    cursor.execute(query)
+    return cursor.fetchall()


EOF newline

(this should be validated by the repo standards check)

rngadam · 2024-03-21T15:22:20Z

ailab/db/finesse/test_queries/__init__.py

+        WITH random_crawl AS (  
+            SELECT id  
+            FROM crawl  
+            ORDER BY


this came up before; ordering randomly the table is expensive means you are fetching every row and then picking up one.

it's cheaper to pick a row between 1 and count(*) and use that with OFFSET combined with LIMIT:

https://www.postgresql.org/docs/current/queries-limit.html

Yes you told me about that, when I copy paste the query I forgot to change ORDER by OFFSET. It will be changed soon.

rngadam · 2024-03-21T15:34:31Z

the issue reference issue #79 should be in the description to make it clickable (edited)
this PR is open but no reviewers; did you mean to set it at Draft?
branch name is not following our naming conventions: issue#79-testing-current-score

https://github.com/ai-cfia/.github/blob/main/profile/CONTRIBUTING.md

JolanThomassin · 2024-03-21T17:54:53Z

Yes sorry, I forgot to put it as draft, I'm working on different PR at the same time for testing purpose, so everything is probably going to change.

JolanThomassin added 2 commits March 14, 2024 18:42

Fixes #79, SQL query getting random crawl and score

1836081

Fixes #79, New script printing random crawl

1510b8d

JolanThomassin added this to the louis_v005 milestone Mar 14, 2024

JolanThomassin self-assigned this Mar 14, 2024

JolanThomassin linked an issue Mar 14, 2024 that may be closed by this pull request

Testing current Score in our Schema #79

Closed

6 tasks

rngadam suggested changes Mar 21, 2024

View reviewed changes

JolanThomassin marked this pull request as draft March 21, 2024 17:54

JolanThomassin removed their assignment Jun 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue #79, Testing Current Score #84

Issue #79, Testing Current Score #84

JolanThomassin commented Mar 14, 2024 •

edited by rngadam

Loading

rngadam Mar 21, 2024

rngadam Mar 21, 2024

rngadam Mar 21, 2024

JolanThomassin Mar 21, 2024

rngadam commented Mar 21, 2024

JolanThomassin commented Mar 21, 2024

Issue #79, Testing Current Score #84

Are you sure you want to change the base?

Issue #79, Testing Current Score #84

Conversation

JolanThomassin commented Mar 14, 2024 • edited by rngadam Loading

Test our schema scoring to see if they are here/accurate.

Test our score weights

Evaluation Criteria

rngadam Mar 21, 2024

Choose a reason for hiding this comment

rngadam Mar 21, 2024

Choose a reason for hiding this comment

rngadam Mar 21, 2024

Choose a reason for hiding this comment

JolanThomassin Mar 21, 2024

Choose a reason for hiding this comment

rngadam commented Mar 21, 2024

JolanThomassin commented Mar 21, 2024

JolanThomassin commented Mar 14, 2024 •

edited by rngadam

Loading