issue #9, Developing a Search Function Test Utilizing LLM #47

JolanThomassin · 2023-11-14T14:47:04Z

Content Analysis and Scoring System for QA Using OpenAI and Azure

Issue Description:

We aim to create a comprehensive system that extracts URLs from a web page chunk, leverages the OpenAI API through Azure to generate relevant questions and answers, and subsequently employs our search function to score the accuracy of the first ten search results in relation to the generated questions and answers.

The generated questions and answers provide a benchmark for the quality of the content analysis. When we use our search function to retrieve information related to these questions, we can assess the accuracy of the search results. This quality assurance step is crucial for ensuring the reliability of the information we collect.

https://gist.github.com/JolanThomassin/1cba745ddb3aae7f433dfb4d85298fd9

Steps to Implement:

URL Extraction: Extract chunk using query on the database.
Question-Answer Generation: Leverage the Azure OpenAI API to generate questions and corresponding answers based on the content. Store this question-answer pairs along with their associated URLs for future reference.

Chunk Extraction: Implement a method to extract chunks of text from the collected data.
Generated Q&A: Create a dataset of Q&A pairs from the conversational agent's responses.

rngadam · 2024-02-09T20:00:47Z

Does it look as it should? It's my first time doing unit tests, so I would appreciate feedback.

There's many aspects to make good unit tests which impact both your test code and your code itself.

Right now, the test only covers the "happy path"; tests should cover both success and failures cases.
if necessary, you may have to have to change your code to retrofit testability (which is why we prefer to TDD - test-driven development, to ensure we construct testable code in the first place)
- minimize side-effects of your functions; in this case, you are saving to a file; move this out as a separate step and validate the return of the function
  - Reading material: https://jerrynsh.com/how-to-write-testable-code-in-python/
  - you can test save to file separately
- don't sys.exit(1) in functions; raise custom exceptions (which you should trigger in a unit test)
a test is made up of three parts: setup, execute, assert.
- You want to use assert to check the results, not output to console (which is I/0)

ibrahim-kabir · 2024-02-14T21:58:21Z

This PR needs to be merged to start working on the QnA Documents Diverge from Azure Index Document Source issue.

rngadam · 2024-02-16T12:51:35Z

This PR needs to be merged to start working on the QnA Documents Diverge from Azure Index Document Source issue.

that's not correct, can branch off this PR and work on a PR/branch to this PR/branch to help @JolanThomassin advance his work while letting him review changes before integrating them.

We won't just merge nilly-willy without tests or checks passing.

JolanThomassin · 2024-02-22T15:50:54Z

Does it look as it should? It's my first time doing unit tests, so I would appreciate feedback.

There's many aspects to make good unit tests which impact both your test code and your code itself.

Right now, the test only covers the "happy path"; tests should cover both success and failures cases.

if necessary, you may have to have to change your code to retrofit testability (which is why we prefer to TDD - test-driven development, to ensure we construct testable code in the first place)

minimize side-effects of your functions; in this case, you are saving to a file; move this out as a separate step and validate the return of the function

Reading material: https://jerrynsh.com/how-to-write-testable-code-in-python/

you can test save to file separately

don't sys.exit(1) in functions; raise custom exceptions (which you should trigger in a unit test)

a test is made up of three parts: setup, execute, assert.

You want to use assert to check the results, not output to console (which is I/0)

For now I'm testing one case if it work, and a failure one.

test_generate_question
test_generate_question_db_connection_fail

I had to change the code a little so the test fit properly (b86fbe9).

I removed sys.exit(1) (b7e5b56)

For the three parts, setup, execute, assert.

Setup: In the setUp method, I'm setting up the testing environment.
Execute: In my test methods test_generate_question and test_generate_question_db_connection_fail, I'm calling the method generate_question which is the part of the code we are actually testing.
Assert: The assertIsNone and assertIsNotNone methods are the assertions.

For the saving part, I'll change the code so the saving is not done by the same function.

rngadam · 2024-02-26T19:11:50Z

ailab/db/finesse/test_queries/__init__.py

+            WHERE sc.entity_id = ch.id
+            AND sc.score_type = 'current' 
+            AND sc.score > 0.0
+        )
        ORDER BY RANDOM()


still not addressed @JolanThomassin

ailab/db/finesse/prompt/qna_user_prompt.txt

rngadam

Let's merge this because it's been pending for way too long AFTER we've linked to a new issue outlining the work to be done.

JolanThomassin · 2024-03-04T18:29:40Z

@rngadam , before I merge this PR, do you agree with this issue to continue the work? If yes, I would add the assignee, the project, etc...

Issue: #73

rngadam · 2024-03-04T19:44:18Z

@rngadam , before I merge this PR, do you agree with this issue to continue the work? If yes, I would add the assignee, the project, etc...

that issue is ok, but really very specific to one aspect of pending work.

we want issues to track the following conversations that have not been addressed yet:

JolanThomassin · 2024-03-07T15:04:00Z

@rngadam , before I merge this PR, do you agree with this issue to continue the work? If yes, I would add the assignee, the project, etc...

that issue is ok, but really very specific to one aspect of pending work.

we want issues to track the following conversations that have not been addressed yet:

On the 5 unresolved conversations only the first one deserve an issue, the 3 ones with Guy is about testing the function, and everything is already here and working. And the 2nd one from you was to change the query so we don't go trough all the documents and the modification is done and working too.

I'll add the issue about question quality here

melanie-fressard and others added 8 commits November 2, 2023 19:03

issue #41 - creation of the sql view

f3db963

issue #44: fix package configuration

9362d33

issue #44: fix module import

acb8b5d

Merge remote-tracking branch 'origin/main' into 41-individual-scoring

17c7877

issue #41 - adding avg score

da2ad52

modification of .env.template to match standards

8ba87c8

imports modification

9adc2d8

Fixes #9, new script

31df218

JolanThomassin self-assigned this Nov 14, 2023

JolanThomassin marked this pull request as draft November 14, 2023 14:47

JolanThomassin changed the title ~~issue #9, Developing a Search Function Test Utilizing LLM~~ WIP - issue #9, Developing a Search Function Test Utilizing LLM Nov 14, 2023

JolanThomassin and others added 15 commits November 14, 2023 19:47

issue#24, change louis.db for ailab.db

4f94a84

issue #41 - creation of init in ailab/db/finesse

c90769a

issue #20 - moving search.py

e668662

issue #41 - script to use the code

30f6069

issue #41 - debbug connexion db

67eca6e

issue #41 - debbug sql function

c97cd7d

issue #41 - formatting

21c6614

issue #41 - separating creation and select

403f782

issue #41 - csv file output + correction sql

32bc58a

issue #41 - suppressing similarity column

a1d7ab3

issue #41 - reworking output making

a3d10d2

changes in template

4924fc7

issue #49 - modification from changing repo name

68cf448

issue #49 - results file by query name

3c38ab7

Adding base script

ecfc757

JolanThomassin linked an issue Nov 21, 2023 that may be closed by this pull request

Developing a Search Function Test Utilizing LLM #9

Closed

4 tasks

melanie-fressard and others added 3 commits November 21, 2023 14:56

adressing issue #48 and final line

a687fdc

issue #49 - minor correction + script to launch

92f327a

Fixes #9, Query and LLM Q&A generation

08401f8

JolanThomassin added 3 commits February 8, 2024 19:22

Fixes #9, add black formatter extension

b721451

Fixes #9, add semver to requirements

418399b

Fixes #9, changes semver version

39e159d

JolanThomassin added 2 commits February 12, 2024 18:26

Fixes #9, test for db failure

b86fbe9

Fixes #9, replace sys.exit(1)

b7e5b56

Fixes #9, import removed

2046aa7

JolanThomassin added 3 commits February 22, 2024 20:09

Fixes #9, separate save for test

a40b7d5

Fixes #9, import at the top

7ecb190

Fixes #9, import at the top

252d3b3

JolanThomassin requested a review from rngadam February 26, 2024 15:00

rngadam suggested changes Feb 26, 2024

View reviewed changes

JolanThomassin added 5 commits February 26, 2024 19:18

Fixes #9, fixed number of generated question

d19bb55

Fixes #9, adding "question_quality" variable

f1b016a

Fixes #9, random query new method

ca58322

Fixes #9, adding parameter into call

8cd1664

Fixes #9, user_prompt more example

fa67e42

JolanThomassin requested a review from rngadam February 29, 2024 17:57

rngadam suggested changes Mar 1, 2024

View reviewed changes

ailab/db/finesse/prompt/qna_user_prompt.txt Outdated Show resolved Hide resolved

rngadam approved these changes Mar 4, 2024

View reviewed changes

Fixes #9, remove "question_quality" variable

71a436f

JolanThomassin mentioned this pull request Mar 7, 2024

Evaluate a Question Quality #74

Open

k-allagbe approved these changes Mar 7, 2024

View reviewed changes

JolanThomassin merged commit fd08308 into main Mar 7, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue #9, Developing a Search Function Test Utilizing LLM #47

issue #9, Developing a Search Function Test Utilizing LLM #47

JolanThomassin commented Nov 14, 2023 •

edited

Loading

rngadam commented Feb 9, 2024

ibrahim-kabir commented Feb 14, 2024

rngadam commented Feb 16, 2024

JolanThomassin commented Feb 22, 2024

rngadam Feb 26, 2024

rngadam left a comment

JolanThomassin commented Mar 4, 2024

rngadam commented Mar 4, 2024

JolanThomassin commented Mar 7, 2024 •

edited

Loading

issue #9, Developing a Search Function Test Utilizing LLM #47

issue #9, Developing a Search Function Test Utilizing LLM #47

Conversation

JolanThomassin commented Nov 14, 2023 • edited Loading

Content Analysis and Scoring System for QA Using OpenAI and Azure

Issue Description:

Steps to Implement:

rngadam commented Feb 9, 2024

ibrahim-kabir commented Feb 14, 2024

rngadam commented Feb 16, 2024

JolanThomassin commented Feb 22, 2024

rngadam Feb 26, 2024

Choose a reason for hiding this comment

rngadam left a comment

Choose a reason for hiding this comment

JolanThomassin commented Mar 4, 2024

rngadam commented Mar 4, 2024

JolanThomassin commented Mar 7, 2024 • edited Loading

JolanThomassin commented Nov 14, 2023 •

edited

Loading

JolanThomassin commented Mar 7, 2024 •

edited

Loading