Hikaru Shindo, Quentin Delfosse, Devendra Singh Dhami, Kristian Kersting
We propose a framework that jointly learns symbolic and neural policies for reinforcement learning.
Follow INSTALLATION.md to install dependencies.
Download the trained agents:
```
wget https://hessenbox.tu-darmstadt.de/dl/fiCNznPuWkALH8JaCJWHeeAV/models.zip
unzip models.zip
rm models.zip
```
Then you can run the play script:
```
python play_gui.py --env-name kangaroo --agent-path models/kangaroo_demo
python play_gui.py --env-name seaquest --agent-path models/seaquest_demo
```
Note that a checkpoint is required to run the play script.
You can run the training script:
```
python train_blenderl.py --env-name seaquest --joint-training --num-steps 128 --num-envs 5 --gamma 0.99
```
- `--joint-training`: train the neural and logic modules jointly
- `--num-steps`: the number of steps per policy rollout
- `--num-envs`: the number of parallel environments used for training
- `--gamma`: the discount factor for future rewards
Inside `in/envs/[env_name]/logic/[ruleset_name]/`, you find the logic rules that are used as a starting point for training. You can change them or create new rule sets. The ruleset to use is specified with the hyperparameter `rules`.
You add a new environment inside `in/envs/[new_env_name]/`. There, you need to define a `NudgeEnv` class that wraps the original environment in order to do:

- logic state extraction: translates raw env states into logic representations
- valuation: each relation (like `closeby`) has a corresponding valuation function which maps the (logic) game state to a probability that the relation is true. Each valuation function is defined as a simple Python function, and the function's name must match the name of the corresponding relation (see the sketch below this list).
- action mapping: action-predicates predicted by the agent need to be mapped to the actual env actions
See the `freeway` env for an example of how this is done.
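To illustrate the valuation part, here is a minimal, hypothetical sketch of a valuation function for the `closeby` relation. The signature, the object-state layout, and the distance threshold are assumptions made for illustration; the actual conventions are defined by the existing envs such as `freeway`:

```python
import torch

# Hypothetical sketch: the framework associates this function with the
# relation of the same name, `closeby`. The object-state layout
# (x, y in the first two entries) is an assumption for illustration.
def closeby(obj1: torch.Tensor, obj2: torch.Tensor) -> torch.Tensor:
    # Euclidean distance between the (x, y) positions of the two objects
    dist = torch.linalg.vector_norm(obj1[..., :2] - obj2[..., :2], dim=-1)
    # Smoothly map the distance to a probability in [0, 1]:
    # small distances give values near 1, large distances near 0
    return torch.sigmoid(10.0 - dist)
```

The key point is the naming convention: because the function is named `closeby`, it is used to evaluate the `closeby` relation on the extracted logic state.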