-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Obs AI Assistant] Improve LLM evaluation framework #204574
base: main
Are you sure you want to change the base?
Conversation
57b281e
to
e82e4e6
Compare
Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant) |
…rch credentials are provided
b60786e
to
5d7fe68
Compare
🤖 GitHub commentsExpand to view the GitHub comments
Just comment with:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great @viduni94! just a few nits
|
||
#### Connector | ||
- Run Elasticsearch locally: `yarn es snapshot --license trial` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
--license trial
is unnecessary no?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
.../solutions/observability/plugins/observability_ai_assistant_app/scripts/evaluation/README.md
Outdated
Show resolved
Hide resolved
.../observability/plugins/observability_ai_assistant_app/scripts/evaluation/get_service_urls.ts
Outdated
Show resolved
Hide resolved
.../observability/plugins/observability_ai_assistant_app/scripts/evaluation/get_service_urls.ts
Outdated
Show resolved
Hide resolved
@@ -124,13 +124,13 @@ export class KibanaClient { | |||
return this.axios<T>({ | |||
method, | |||
url, | |||
data: data || {}, | |||
...(method.toLowerCase() !== 'delete' ? { data: data || {} } : {}), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is this needed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Without this condition, deleting ruleIds
fails here - https://github.com/elastic/kibana/pull/204574/files#diff-23cc9139c91a064a3ca574552ad823023c579cc2c68ff7f277c392102a0d526aL139
Because the DELETE
method doesn't allow an undefined
or empty body.
...ons/observability/plugins/observability_ai_assistant_app/scripts/evaluation/kibana_client.ts
Outdated
Show resolved
Hide resolved
...gins/observability_ai_assistant_app/scripts/evaluation/scenarios/elasticsearch/index.spec.ts
Outdated
Show resolved
Hide resolved
Results after the updates:
|
💚 Build Succeeded
Metrics [docs]
History
cc @viduni94 |
Closes #203122
Summary
Problem
The Obs AI Assistant LLM evaluation framework cannot successfully run in the current state in the
main
branch and has missing scenarios.Problems identified:
/internal
APIsSolution
As a part of spacetime, worked on fixing the current issues in the LLM evaluation framework and working on improving and enhancing the framework.
Fixes
kibana.dev.yml
after
hooks were not running successfully because of an error in thecallKibana
method/internal
APIsEnhancements / Improvements
retrieve_elastic_doc
functionalerts
) to produce the best result. At present all scenarios use the scopeall
which is not ideal and doesn't align with the actual functionality of the AI AssistantChecklist
release_note:*
label is applied per the guidelines