Reversible Image Transforms 🖼️

A few basics image transforms to help manipulate images into the shape, size and orientation needed for perception or visualization. Our main use case is converting camera images to the shape needed for our keypoint detection models. We also provide functionality to reverse transfrom the coordinates of the detected keypoints to the original image space. This is not available in image transform libraries such as Albumentations but is important for robotics applications: you often need to reproject the points using the camera intrinsics, which are known for the original but not for the transformed image.

Implemented transforms:

Crop
Resize
Rotation90
ComposedTransform

Usage

See the tutorial notebook.

Quick overview:

from airo_camera_toolkit.image_transforms import Crop, Resize, Rotation90, ComposedTransform

image = cv2.imread("path/to/image.jpg")

# Available transforms
crop = Crop(image.shape, x=160, y=120, h=200, w=200)
resize = Resize(crop.shape, h=100, w=100, round_transformed_points=False)
rotation_clockwise = Rotation90(resize.shape, -1)

image_cropped = crop(image)
image_resized = resize(image_cropped)
image_rotated = rotation_clockwise(image_resized)

# Composing transforms
transform = ComposedTransform([crop, resize, rotation_clockwise])
image_transformed = transform(image) # Equivalent to image_rotated

# Transforming a point
point = (200, 200)
point_transformed = transform.transform_point(point)
point_reversed = transform.reverse_transform_point(point_transformed)

point_reversed_int = tuple(map(int, point_reversed))

point == point_reversed_int

Image Coordinate System

The origin (0,0) of an image is generally chosen at the top left corner, and we also use this convention. The x-axis is positive to the right and the y-axis is positive downwards.

ℹ️ Sometimes a difference is made between integer pixel coordinates and float image coordinates, illustrated here. TODO check: when projecting from 3D to 2D, in which coordinate system are the 2D coordinates expressed?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Reversible Image Transforms 🖼️

Usage

Image Coordinate System

Files

README.md

Latest commit

History

README.md

File metadata and controls

Reversible Image Transforms 🖼️

Usage

Image Coordinate System