Removes the generate thread #1054
Conversation
Looks good to me generally, but I worry about error handling now.
total_processed = 0
iteration_count = 0

while not self._should_exit():
Similar question to above: what happens if some other process dies and the vLLM worker should stop?
Then it gets killed when Ray shuts down:
open-instruct/open_instruct/grpo_fast.py, line 2708 in c3f79a3:
ray.shutdown()
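For illustration, here is a minimal sketch (not the open-instruct code; the worker class and loop body are hypothetical) of why the loop does not need its own cross-process failure detection: the worker runs as a Ray actor, and when the driver exits or calls ray.shutdown(), Ray tears the actor process down with it.

import time

import ray

ray.init()


@ray.remote
class VLLMWorkerSketch:
    """Hypothetical stand-in for the vLLM worker actor."""

    def run_forever(self):
        iteration_count = 0
        while True:  # stand-in for `while not self._should_exit():`
            iteration_count += 1
            time.sleep(0.1)  # stand-in for pulling and processing requests


worker = VLLMWorkerSketch.remote()
worker.run_forever.remote()  # the loop runs inside the actor's own process

time.sleep(1)

# When the driver shuts Ray down (as grpo_fast.py does with ray.shutdown()),
# every actor process is terminated along with it, so the worker loop above
# is cleaned up even if it never observes an exit condition itself.
ray.shutdown()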
output.request_id, output, complete_output, current_time
)

if self.verbose and iteration_count % 100 == 0:
I never really used this logging, so I'm fine with removing it, but was there any other impetus for doing so?
Truthfully, I asked Claude to remove all the debug logging (I had a bunch of debug logging in an earlier version of this PR) and it also removed this. I thought it was fine so I kept it. Open to changing this if you prefer.
No, all good! I was just wondering about the reason for the change.
Tested and seems good to me!
The only reason we had the generate thread previously was to fix a locking issue when updating the weights on the actor. It turns out that, with vLLM v1, the locking issue goes away!
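As a rough before/after sketch of the structural change (all class and method names here are hypothetical, not taken from the PR): with a dedicated generate thread, weight updates on the actor have to be serialized against generation with a lock; with generation inlined into the worker loop, there is nothing to race against, so the lock and the thread can both go.

import threading


class EngineWithGenerateThread:
    """Old shape (sketch): a background thread generates while weight updates take a lock."""

    def __init__(self, engine):
        self.engine = engine
        self._lock = threading.Lock()
        self._thread = threading.Thread(target=self._generate_loop, daemon=True)
        self._thread.start()

    def _generate_loop(self):
        while True:
            with self._lock:            # serialize against update_weights
                self.engine.step()      # hypothetical generation step

    def update_weights(self, weights):
        with self._lock:                # block generation while weights change
            self.engine.load_weights(weights)


class EngineInline:
    """New shape (sketch): no thread, so generation and weight updates never race."""

    def __init__(self, engine):
        self.engine = engine

    def run(self, should_exit, next_weights):
        while not should_exit():
            weights = next_weights()    # returns None if no update is pending
            if weights is not None:
                self.engine.load_weights(weights)
            self.engine.step()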
Step time decreases significantly on the runs with inflight_updates=True (wandb):

Runs with inflight_updates=True:

Runs with inflight_updates=False: