[IROS 2024] Public code and model for IR2: Implicit Rendezvous for Robotic Exploration Teams under Sparse Intermittent Connectivity.
We present IR2, a deep reinforcement learning approach to information sharing for multi-robot exporation under communication constraints. Leveraging attention-based neural networks and hierarchical graph formulation, robots can effectively balance the longer-term trade-offs between disconnecting for solo exploration and reconnecting for information sharing.
We show that our model trained in this python simulation can generalize well to complex large-scale environments, such as those in Gazebo simulation and real world experiments. We hope that our work will be a useful learning-based benchmark for future research.
This demonstration showcases 4 robots exploring in an unknown Complex
map under line-of-sight signal strength communication constraints. The top gif illustrates the global map and robot positions assuming no communication constraints. Conversely, the bottom 4 gifs illustrates the individual robots' map and position beliefs subjected to communication constraints.
If this GIF is taking too long to load, you may view the demonstration here.
This repository was tested using the following dependencies. Newer version of these packages may work as well.
python == 3.8
pytorch == 1.10.0
ray == 1.10.0
scikit-image == 0.19.3
scikit-learn == 1.2.1
scipy == 1.10.0
matplotlib == 3.6.3
tensorboard == 2.8.0
- Set training parameters in
parameters.py
. - Run python
driver.py
.
- Set inference parameters in
test_parameters.py
. - Run
test_driver.py
.
parameter.py
Training parameters.driver.py
Driver of training program, maintain & update the global network.runner.py
Wrapper of the local network.multi_robot_worker.py
Interact with environment and collect episode experience.model.py
Define attention-based network.env.py
Autonomous exploration environment.graph_generator.py
Generate and update the collision-free graph.graph.py
Graph definition and utilities.node.py
Initialize and update nodes in the coliision-free graph.sensor.py
Simulate the sensor model of Lidar.robot.py
Acts as a replay buffer.ss_realistic_model.py
Realistic signal strength communication model./model
Trained model./DungeonMaps
Maps of training environments.
If you intend to use our work in your research, please cite the following publication:
@INPROCEEDINGS{derek2024IR2,
author={Tan, Derek Ming Siang and Ma, Yixiao and Liang, Jingsong and Cao, Yuhong and Sartoretti, Guillaume},
booktitle={2024 IEEE International Conference on Intelligent Robots and Systems (IROS)},
title={IR2: Implicit Rendezvous for Robotic Exploration Teams under Sparse Intermittent Connectivity},
year={2024},
volume={},
number={},
pages={},
doi={10.48550/arXiv.2409.04730}}
Derek Ming Siang Tan
Yixiao Ma
Jingsong Liang
Yi Cheng Chng
Yuhong Cao
Guillaume Sartoretti