I am trying to run `examples/ToT/blocksworld/test_tot_v1_dfs.sh` on step 2, step 4, and step 6. However, I noticed that Llama 8B returns empty strings as actions.
Example: ['unstack the yellow block from on top of the red block', 'unstack the blue block from on top of the yellow block', '', 'pick up the blue block']
Upon inspection, I see that the prompt being passed is just ` the block` and nothing else. I believe the input is being trimmed inside the `generate` function, which results in this issue. I also observed that increasing `max_depth` produces more empty strings as actions.
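To illustrate the failure mode I suspect, here is a minimal sketch, not code from the repository. It assumes the prompt is truncated from the left when it exceeds some length limit, so only the tail survives. The names `truncate_left` and `check_prompt` are hypothetical, and a whitespace split stands in for the real tokenizer. If this is what happens, it would also be consistent with deeper searches (longer prompts) producing more empty actions.

```python
def truncate_left(tokens, max_len):
    """Keep only the last max_len tokens, dropping the start of the prompt."""
    # Guard max_len <= 0, because tokens[-0:] would return the whole list.
    return tokens[-max_len:] if max_len > 0 else []


def check_prompt(prompt, max_len):
    """Report whether a prompt would be cut and what the model would still see."""
    tokens = prompt.split()  # assumption: whitespace split instead of a real tokenizer
    kept = truncate_left(tokens, max_len)
    return {
        "n_tokens": len(tokens),
        "n_kept": len(kept),
        "truncated": len(tokens) > max_len,
        "visible_prompt": " ".join(kept),
    }


# Hypothetical prompt text, for illustration only.
prompt = (
    "I am playing with a set of blocks . [PLAN] "
    "unstack the yellow block from on top of the red block "
    "pick up the block"
)
report = check_prompt(prompt, max_len=2)
print(report)
```

Logging a report like this right before the model call would show whether the prompt is already longer than the limit at the depth where the empty actions start to appear.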
Is there an easy fix for this? Maybe a hyperparameter that can be tuned, or a code fix?
sumedhpendurkar changed the title from "Blocksworld+ToT with Llama3.1" to "Actions are Empty Strings: Blocksworld+ToT with Llama3.1" on Nov 17, 2024