Conversation

finbarrtimbers
Copy link
Collaborator

@finbarrtimbers finbarrtimbers commented Aug 8, 2025

Here, I add two tests:

  1. There was a pre-existing test for ToolUseLLM embedded in open_instruct/tool_utils/tool_vllm.py. This PR moves it into test_tool_vllm.py and adds a mocked vllm instance so that the tool tests run automatically as part of our CI (a sketch of the mocking pattern follows this list). The tests pass on both GPU and CPU.
  2. I have updated the Beaker integration test so that it runs with 2 vLLM engines and uses a smaller Qwen model (0.6B). This configuration would have caught the timeout issue (example repro), and the test otherwise passes (example run).

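A minimal sketch of the mocking pattern described in item 1, under the assumption that the test replaces vLLM's `LLM` entry point with `unittest.mock`; the patched target, the fake output shape, and the model name are illustrative, not the actual code in test_tool_vllm.py.

```python
# Hypothetical sketch (not the actual test added in this PR): replace the
# heavyweight vLLM engine with a MagicMock so tool-calling tests can run on
# CPU-only CI machines.
from unittest import mock


def test_generate_with_mocked_vllm_engine():
    # Fake completion shaped like vLLM's RequestOutput: a list of outputs,
    # each with a .text attribute containing a tool call.
    fake_completion = mock.MagicMock()
    fake_completion.outputs = [mock.MagicMock(text="<tool>search(2 + 2)</tool>")]

    fake_engine = mock.MagicMock()
    fake_engine.generate.return_value = [fake_completion]

    # Patch vllm.LLM so constructing an engine returns the mock instead of
    # loading model weights onto a GPU. The real test may patch a different
    # seam (e.g. inside open_instruct.tool_utils.tool_vllm).
    with mock.patch("vllm.LLM", return_value=fake_engine):
        import vllm

        llm = vllm.LLM(model="Qwen/Qwen3-0.6B")  # model name is illustrative
        outputs = llm.generate(["What is 2 + 2?"])

    assert outputs[0].outputs[0].text.startswith("<tool>")
    fake_engine.generate.assert_called_once()
```
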
finbarrtimbers marked this pull request as ready for review on August 8, 2025, 19:31
finbarrtimbers changed the title from "Adds a test for ToolUseLLM." to "Adds more tests!" on Aug 11, 2025