Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pretraining via Behavioral Cloning for BuildVillageHouse #22

Open
lauritowal opened this issue Aug 26, 2022 · 0 comments
Open

Pretraining via Behavioral Cloning for BuildVillageHouse #22

lauritowal opened this issue Aug 26, 2022 · 0 comments
Assignees

Comments

@lauritowal
Copy link
Contributor

The base model needs to be fine-tuned to be able to perform with at least average quality on one of the four competition’s tasks (Find Cave, Build Waterfall, etc.) [4]. Average Quality means, here, that the agent should at least do something which somehow resembles what the actual task looks like instead of doing completely random actions. This is important for collecting feedback from a human later, since giving feedback on completely random trajectories would not lead to much information gain.

To fine-tune the base model, we use some of the provided demonstrations

Task:
Train four models, each, on the officially provided demonstrations:

Find out how many demonstrations are needed to reach an average quality. Do we need more (e.g. MineDojo https://minedojo.org/knowledge_base)?
Can we further improve the quality by defining Subtasks --> See own Issue for that

@lauritowal lauritowal self-assigned this Aug 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant