Skip to content

Commit

Permalink
defer resolving helper models in self_critique() and `model_graded_…
Browse files Browse the repository at this point in the history
…qa()`
  • Loading branch information
jjallaire committed Oct 10, 2024
1 parent 6c6a03a commit 6a0d180
Show file tree
Hide file tree
Showing 5 changed files with 88 additions and 72 deletions.
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,6 +14,7 @@
- Improve mime type detection for image content encoding (fixes issues w/ webp images).
- Fix memory leak in Inspect View worker-based JSON parsing.
- Add `fail_on_error` option for `eval_retry()` and `inspect eval-retry`.
- Defer resolving helper models in `self_critique()` and `model_graded_qa()`.

## v0.3.40 (6 October 2024)

Expand Down
6 changes: 5 additions & 1 deletion examples/theory_of_mind.py
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
from inspect_ai import Task, task
from inspect_ai import Task, eval, task
from inspect_ai.dataset import example_dataset
from inspect_ai.scorer import model_graded_fact
from inspect_ai.solver import chain_of_thought, generate, self_critique
Expand All @@ -16,3 +16,7 @@ def theory_of_mind(critique=False):
solver=solver,
scorer=model_graded_fact(),
)


if __name__ == "__main__":
eval(theory_of_mind(critique=True), model="openai/gpt-4o")
Loading

0 comments on commit 6a0d180

Please sign in to comment.