(Demo video: demo.mp4)
We present OmniDrive, a holistic LLM-agent framework for end-to-end autonomous driving. Our main contributions include novel solutions in both the model (OmniDrive-Agent) and the benchmark (OmniDrive-nuScenes). The former features a novel 3D multimodal LLM design that uses sparse queries to lift and compress visual representations into 3D. The latter comprises comprehensive VQA tasks for reasoning and planning, including scene description, traffic regulation, 3D grounding, counterfactual reasoning, decision making, and planning.
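The sparse-query design can be illustrated with a toy cross-attention step: a small set of learnable queries attends to the flattened multi-view image tokens, compressing them into a compact set of 3D-aware tokens for the LLM. This is a minimal NumPy sketch, not the actual OmniDrive implementation; all shapes, names, and the single-head attention form are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def lift_with_sparse_queries(img_feats, queries, w_q, w_k, w_v):
    """One cross-attention step: sparse queries (num_queries, d) attend to
    flattened multi-view image tokens (num_tokens, d). Hypothetical sketch."""
    q = queries @ w_q                                # project queries
    k = img_feats @ w_k                              # project image tokens
    v = img_feats @ w_v
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))   # (num_queries, num_tokens)
    return attn @ v                                  # compressed 3D-aware tokens

rng = np.random.default_rng(0)
d = 64
num_views, tokens_per_view = 6, 100                  # e.g. 6 surround cameras
img_feats = rng.standard_normal((num_views * tokens_per_view, d))
queries = rng.standard_normal((256, d))              # sparse learnable queries
w_q, w_k, w_v = (rng.standard_normal((d, d)) * 0.05 for _ in range(3))

out = lift_with_sparse_queries(img_feats, queries, w_q, w_k, w_v)
print(out.shape)  # (256, 64): 600 image tokens compressed into 256 query tokens
```

The point of the sketch is the compression: the LLM only ever sees the fixed, small query set rather than the full multi-view token grid.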
- [2025/04/16] Adding TensorRT support. [Link]
- [2025/02/26] OmniDrive is accepted to CVPR 2025.
- [2024/07/18] OmniDrive-nuScenes model release. [HF]
- [2024/05/02] OmniDrive-nuScenes dataset release. [Data]
- [2024/05/02] Technical report release. [arXiv]
Please follow Environment Setup step by step.
- OmniDrive Training Framework
- OmniDrive Dataset
- OmniDrive Checkpoint
- Evaluation
- Data Generation
- TensorRT Inference
- Tiny LLM
Joint End-to-end Planning and Reasoning
Interactive Conversation with Ego Vehicle
Counterfactual Reasoning of Planning Behaviors
If this work is helpful for your research, please consider citing:

```bibtex
@inproceedings{wang2025omnidrive,
  title={{OmniDrive}: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning},
  author={Shihao Wang and Zhiding Yu and Xiaohui Jiang and Shiyi Lan and Min Shi and Nadine Chang and Jan Kautz and Ying Li and Jose M. Alvarez},
  booktitle={CVPR},
  year={2025}
}
```
The team would like to give special thanks to the NVIDIA TSE Team, including Le An, Chengzhe Xu, Yuchao Jin, and Josh Park, for their exceptional work on the TensorRT deployment of OmniDrive.