QnA Documents Diverge from Azure Index Document Source #72

ibrahim-kabir · 2024-02-14T21:47:14Z

Summary:

The current QnA generation process relies on different documents than the Azure Indexes. QnAs are generated from the 'crawl' table, but Finesse use finesse-public-guidance index is generated from the 'public-guidance' blob storage. This causes issues with generating scores of 0 in the Finesse Benchmark tool located in the repository finesse-backend. The script needs modification to generate documents from the blob storage instead of the 'louis_crawl' postgreSQL databse.

Tasks:

Modify the script to generate QnAs from the Index blob storage
Test the new QnAs with the Finesse Benchmark tool
Update finesse-data with the new QnAs

Acceptance Criteria:

The script successfully connects to the Azure Blob Storage account.
Documents are retrieved from the 'public-guidance' container in the Azure Blob Storage.
QnAs are generated using documents from the blob storage.
Running the Finesse Benchmark tool with the modified script does not result in scores of 0.

rngadam · 2024-02-16T12:49:11Z

AFAIK @k-allagbe imported guidance documents from the same crawl table into the blob storage?

ibrahim-kabir · 2024-02-21T17:46:19Z

AFAIK @k-allagbe imported guidance documents from the same crawl table into the blob storage?

@rngadam After the testing I noticed that the 2 null scores (0%) are supposed to be in the bad_questions repository. This means that all the questions returned a score. Moreover, the imported guidance documents are indeed from the same crawl table than the blob storage. This lead me to think that the questions are perfectly generated and we only need more of them and to better classify them (into the good or bad questions repository). So before trying to fix this issue, I propose to generate more questions and test them on Finesse to verify that all documents return a score.

ibrahim-kabir · 2024-03-05T17:06:25Z

AFAIK @k-allagbe imported guidance documents from the same crawl table into the blob storage?

@rngadam , @k-allagbe confirmed me they were imported from the crawl table. I will close this issue if you agree.

ibrahim-kabir self-assigned this Feb 14, 2024

ibrahim-kabir added this to Finesse Feb 14, 2024

ibrahim-kabir mentioned this issue Feb 14, 2024

issue #9, Developing a Search Function Test Utilizing LLM #47

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QnA Documents Diverge from Azure Index Document Source #72

QnA Documents Diverge from Azure Index Document Source #72

ibrahim-kabir commented Feb 14, 2024

rngadam commented Feb 16, 2024

ibrahim-kabir commented Feb 21, 2024

ibrahim-kabir commented Mar 5, 2024 •

edited

Loading

QnA Documents Diverge from Azure Index Document Source #72

QnA Documents Diverge from Azure Index Document Source #72

Comments

ibrahim-kabir commented Feb 14, 2024

Summary:

Tasks:

Acceptance Criteria:

rngadam commented Feb 16, 2024

ibrahim-kabir commented Feb 21, 2024

ibrahim-kabir commented Mar 5, 2024 • edited Loading

ibrahim-kabir commented Mar 5, 2024 •

edited

Loading