14 repositories
vivaria (Public): Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
viv-task-dev (Public)
autonomy-evals-guide (Public)
task-assets (Public)
task-legacy-verifier (Public)
task-protected-scoring (Public)
task-aux-vm-helpers (Public)
public-tasks (Public)
task-standard (Public)
worktest-sw-eng-deps (Public)
pyhooks (Public archive)
vivaria-mentat (Public)
task-template (Public template)