Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Actions are Empty Strings: Blocksworld+ToT with Llama3.1 #111

Open
sumedhpendurkar opened this issue Nov 17, 2024 · 4 comments
Open

Actions are Empty Strings: Blocksworld+ToT with Llama3.1 #111

sumedhpendurkar opened this issue Nov 17, 2024 · 4 comments

Comments

@sumedhpendurkar
Copy link

I am trying to run ``examples/ToT/blocksworld/test_tot_v1_dfs.sh'' on step 2, step 4, step 6. However, I noticed that Llama 8B returns empty strings as actions.

example: ['unstack the yellow block from on top of the red block', 'unstack the blue block from on top of the yellow block', '', 'pick up the blue block']

Upon inspection I see that prompt that is being passed is `` the block'' and thats it. I believe `generate' function the input is being trimmed resulting in this issue. I also observed that increasing max_depth is causing more empty strings as actions.

Is there any easy fix to this? Maybe some hyperparameter that can be tuned/code fix?

@sumedhpendurkar sumedhpendurkar changed the title Blocksworld+ToT with Llama3.1 Actions are Empty Strings: Blocksworld+ToT with Llama3.1 Nov 17, 2024
@sumedhpendurkar
Copy link
Author

This can be reproduced by running step 4 with the default parameters provided, or running step 2 with increased depth.

@Ber666
Copy link
Collaborator

Ber666 commented Nov 27, 2024

Hi! Did you use the instruct model or base model?

@sumedhpendurkar
Copy link
Author

I used the base model, should I be using the instruct model for the prompts?

@Ber666
Copy link
Collaborator

Ber666 commented Dec 3, 2024

Sorry for late reply.. No, the prompt was written for base models. We will look into the problem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants