A path planning system that uses monocular depth estimation to generate navigable paths for autonomous rovers, implemented on a Jetson Nano platform.
This project implements a vision-based path planning system using a single camera input, optimized for deployment on the NVIDIA Jetson Nano. The system processes video frames to generate 3D point clouds, analyzes terrain traversability, and plans optimal navigation paths while avoiding obstacles.
- Real-time depth estimation from a monocular camera on the Jetson Nano
- 3D point cloud generation and processing
- Cost-based terrain analysis
- A* path planning algorithm implementation
- Optimized for embedded deployment
The system consists of three main components:

1. **Depth Estimation and Point Cloud Generation**
   - Uses DepthAnything's ViT-S model for depth inference
   - Converts 2D depth maps to 3D point clouds using camera intrinsics
   - Filters and optimizes the point cloud for quality
   - Handles coordinate transformations and spatial mapping

2. **Terrain Analysis**
   - Implements grid-based terrain segmentation
   - Calculates traversability costs based on:
     - Slope climbing requirements
     - Obstacle height assessment
   - Combines both metrics into a single cost for path optimization
   - Generates comprehensive terrain accessibility maps

3. **Path Planning**
   - A* algorithm implementation for optimal path finding (see the sketch after this list)
   - Integrates terrain cost analysis for path selection
   - Avoids obstacles while minimizing energy costs
   - Provides efficient route planning in real-world environments
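As a rough illustration of how A* can combine geometric step length with the terrain costs described above, here is a minimal, self-contained sketch; the 8-connected grid, the cost convention, and the function name are illustrative assumptions, not the actual interface of `path_planner.py`:

```python
import heapq
import itertools
import math

def astar(cost_grid, start, goal):
    """A* search over a 2D terrain-cost grid (illustrative sketch).

    cost_grid[r][c] is the traversal cost of a cell (math.inf marks an
    impassable cell); start and goal are (row, col) tuples. Returns the
    path as a list of cells, or None if the goal is unreachable.
    """
    rows, cols = len(cost_grid), len(cost_grid[0])
    # 8-connected moves, each paired with its geometric step length.
    moves = [(dr, dc, math.hypot(dr, dc))
             for dr in (-1, 0, 1) for dc in (-1, 0, 1) if (dr, dc) != (0, 0)]

    def h(cell):
        # Euclidean distance is admissible here: every step costs at least
        # its geometric length, since terrain costs are non-negative.
        return math.hypot(cell[0] - goal[0], cell[1] - goal[1])

    tie = itertools.count()  # tiebreaker so the heap never compares cells
    frontier = [(h(start), next(tie), 0.0, start, None)]
    best_g = {start: 0.0}
    parents = {}
    while frontier:
        _, _, g, cell, parent = heapq.heappop(frontier)
        if cell in parents:
            continue  # already expanded via a cheaper route
        parents[cell] = parent
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parents[cell]
            return path[::-1]
        for dr, dc, length in moves:
            nxt = (cell[0] + dr, cell[1] + dc)
            if not (0 <= nxt[0] < rows and 0 <= nxt[1] < cols):
                continue
            # Step cost = geometric length + terrain cost of the next cell.
            ng = g + length + cost_grid[nxt[0]][nxt[1]]
            if math.isfinite(ng) and ng < best_g.get(nxt, math.inf):
                best_g[nxt] = ng
                heapq.heappush(frontier, (ng + h(nxt), next(tie), ng, nxt, cell))
    return None  # no traversable path exists
```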
The project is currently implemented as two separate processes:

1. **Point Cloud Generation** (`image_to_pointcloud.py`)
   - Handles image capture and processing
   - Performs depth estimation
   - Generates and saves point cloud data

2. **Path Planning** (`path_planner.py`)
   - Loads the processed point cloud data
   - Performs terrain analysis
   - Executes the path planning algorithm

Future work includes integrating these processes into a single pipeline for real-time operation.
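A rough sketch of how such a file-based handoff between the two scripts might look; the `.ply` filename and the use of Open3D here are assumptions for illustration, not necessarily what the scripts actually do:

```python
import numpy as np
import open3d as o3d

# --- end of image_to_pointcloud.py: save the generated cloud ---
points = np.random.rand(1000, 3)  # placeholder for the real XYZ points
cloud = o3d.geometry.PointCloud()
cloud.points = o3d.utility.Vector3dVector(points)
o3d.io.write_point_cloud("pointcloud.ply", cloud)  # assumed filename

# --- start of path_planner.py: load it back for terrain analysis ---
cloud = o3d.io.read_point_cloud("pointcloud.ply")
points = np.asarray(cloud.points)
```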
The system processes each video frame through these steps:

1. Depth estimation using the DepthAnything model
2. Point cloud generation using camera intrinsics (a back-projection sketch follows below)
3. Filtering and optimization of the point cloud data
*Figure: comparison between the original camera input and the generated depth map.*
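A minimal sketch of the back-projection step, using the standard pinhole camera model; the function name is hypothetical and the intrinsics in the usage note are illustrative placeholders, not this project's calibration. Note that monocular models such as DepthAnything predict relative depth, so a metric scale must be recovered before the resulting cloud is metrically meaningful:

```python
import numpy as np

def depth_to_pointcloud(depth, fx, fy, cx, cy):
    """Back-project an HxW depth map into an (N, 3) array of XYZ points."""
    h, w = depth.shape
    # Pixel coordinate grids: u runs along columns, v along rows.
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth / fx   # pinhole model: X = (u - cx) * Z / fx
    y = (v - cy) * depth / fy   # pinhole model: Y = (v - cy) * Z / fy
    points = np.stack([x, y, depth], axis=-1).reshape(-1, 3)
    return points[points[:, 2] > 0]  # drop invalid zero-depth pixels

# Illustrative intrinsics for a 640x480 camera (not this project's values):
# cloud = depth_to_pointcloud(depth_map, fx=525.0, fy=525.0, cx=319.5, cy=239.5)
```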
The terrain analysis module implements two key cost functions to evaluate traversability (a code sketch follows the list):

**Slope cost**
- Fits a plane to each grid cell using least-squares regression
- Calculates the slope angle θ from the fitted plane
- If θ > θ_max (the maximum climbable angle), cost = infinity
- Otherwise, cost = M * g * L * sin(θ), where:
  - M = rover mass
  - g = gravitational acceleration
  - L = length of the fitted plane

**Obstacle cost**
- Calculates l_obs (the maximum obstacle height) in each cell
- If l_obs > l_max (the maximum traversable height), cost = infinity
- Otherwise, cost = M * g * l_obs
- Helps identify impassable obstacles while allowing traversal of minor terrain variations

The total cost for each cell is the sum of these two functions, creating a comprehensive traversability map.
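A compact sketch of how these two costs can be computed per grid cell. The mass and threshold values are placeholders, and reading l_obs as the tallest residual above the fitted plane is one plausible interpretation, not necessarily the project's exact definition:

```python
import numpy as np

G = 9.81                       # gravitational acceleration (m/s^2)
M = 10.0                       # rover mass in kg (assumed value)
THETA_MAX = np.radians(30.0)   # maximum climbable slope (assumed)
L_OBS_MAX = 0.15               # max traversable obstacle height in m (assumed)

def cell_cost(points):
    """Traversability cost of one grid cell from its (N, 3) XYZ points."""
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    # Least-squares plane fit: z = a*x + b*y + c.
    A = np.column_stack([x, y, np.ones_like(x)])
    coeffs, *_ = np.linalg.lstsq(A, z, rcond=None)
    a, b = coeffs[:2]

    # Slope cost: the plane's tilt from horizontal is arctan(sqrt(a^2 + b^2)).
    theta = np.arctan(np.hypot(a, b))
    if theta > THETA_MAX:
        return np.inf                        # steeper than the rover can climb
    L = np.ptp(points[:, :2], axis=0).max()  # horizontal extent of the cell
    slope_cost = M * G * L * np.sin(theta)

    # Obstacle cost: height of the tallest point above the fitted plane.
    l_obs = np.max(z - A @ coeffs)
    if l_obs > L_OBS_MAX:
        return np.inf                        # obstacle too tall to drive over
    return slope_cost + M * G * l_obs
```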
The system generates detailed 3D point clouds from the depth data:

*Figure: 3D point cloud visualization showing the spatial mapping of the environment.*

*Figure: overhead view of the point cloud.*

*Figure: labeled point cloud visualization.*
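Visualizations like these can be reproduced with a few lines of Open3D, assuming the point cloud was saved to a `.ply` file as in the handoff sketch above:

```python
import open3d as o3d

# "pointcloud.ply" is the assumed filename from the earlier sketch.
cloud = o3d.io.read_point_cloud("pointcloud.ply")
o3d.visualization.draw_geometries([cloud])  # opens an interactive viewer
```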
The system was evaluated using:
- KITTI dataset for depth estimation accuracy
- Real-world testing on various terrain types
- Performance benchmarking on Jetson Nano
Performance:

- Optimized for real-time processing on embedded hardware
- Balances accuracy with computational efficiency

Limitations:

- Limited by monocular depth estimation accuracy
- Processing speed constraints on the Jetson Nano
- Potential for improvement in extreme lighting conditions
- Based on research by Chen et al. (2023)
- Uses DepthAnything's ViT-S model
- Developed during a research internship at [Institution Name]