📸 Distributed Image-processing Pipeline

🧾 Summary

This project benchmarks several strategies for distributed image processing using Go and WebAssembly (WASM). The pipeline processes batches of images by splitting each image into tiles, applying a filter to each tile via a WASM module, and reassembling the filtered tiles.

We compare a naive disk-based approach with increasingly optimized in-memory implementations. The performance gains come from reducing disk I/O and minimizing the overhead of WASM interop.

⚙️ How It Works

Each version of the pipeline follows this general structure:

1. Load an image (or a set of images).
2. Split each image into tiles.
3. Apply a WASM-based filter to each tile.
4. Reassemble the tiles into a final image.

🧱 Naive Implementation

Written in Go.
Uses the standard image and io packages.
Splits and saves tiles to disk.
Loads each tile back into memory for WASM processing.
Saves the processed tile again, and reassembles from disk.

➡️ Major bottleneck: Excessive disk reads/writes.

🧠 Optimized Implementation (In-Memory)

Still in Go.
Tiles are created and kept in memory (slices of image.Image).
Tiles are piped directly into the WASM module's stdin, and the processed output is collected from its stdout.
Disk is accessed only twice: one read to load the input and one write for the final output.

➡️ Benefit: Avoids intermediate disk I/O.

⚡ Super Optimized Implementation (Zero-Copy + Raw Pointers)

Go handles orchestration; the filter modules are written in Rust and compiled to WASM.
WASM filter module uses raw pointers and manual memory management for direct buffer access.
Avoids wasm-bindgen and serialization overhead.
Tile data is passed directly using WASI stdin/stdout streams in binary form.

➡️ Result: Best performance due to minimal syscall and memory copy overhead.

📚 Dataset Credit

This project uses the Kodak image dataset, a set of 24 uncompressed 768×512 RGB images widely used for evaluating compression algorithms.

To run the benchmarks with the Go CLI, first download the dataset:

curl -L -o ~/Downloads/kodak-dataset.zip \
  https://www.kaggle.com/api/v1/datasets/download/sherylmehta/kodak-dataset

Unzip the dataset into the input/ directory.

🧪 Benchmark Results

Implementation	Mode	Time (sec)	Description
Naive	Single-threaded	30.3	Disk I/O heavy, basic file-based processing
Naive	Multi-threaded (8 workers)	9.8	Parallelized disk-based processing (~3× speedup)
Optimized (in-memory)	Single-threaded	29.4	Reduced I/O, all tiles kept in memory
Optimized (in-memory)	Multi-threaded (8 workers)	8.7	Faster parallel in-memory processing
Super Optimized (zero-copy, raw pointers)	Single-threaded	24.73	Raw pointers, no serialization overhead, unsafe Rust sorcery
Super Optimized (zero-copy, raw pointers)	Multi-threaded (8 workers)	7.72	Best performance: minimal I/O plus zero-copy, unsafe Rust sorcery

🧪 All benchmarks were run on an M3 MacBook Air with 16 GB RAM.
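
The multi-threaded rows use 8 workers. One common way to structure that in Go is a fixed-size worker pool, sketched below. The ProcessAll name and the byte-slice tile representation are assumptions, and the doubling function is only a placeholder for the per-tile filter; the project's own scheduling may differ.

```go
package main

import (
	"fmt"
	"sync"
)

// ProcessAll runs fn over every tile using a fixed number of workers.
// Results are stored by index, so output order matches input order.
func ProcessAll(tiles [][]byte, workers int, fn func([]byte) []byte) [][]byte {
	if workers < 1 {
		workers = 1
	}
	out := make([][]byte, len(tiles))
	jobs := make(chan int)

	var wg sync.WaitGroup
	for w := 0; w < workers; w++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for i := range jobs {
				out[i] = fn(tiles[i]) // each index is written by one worker only
			}
		}()
	}

	for i := range tiles {
		jobs <- i
	}
	close(jobs)
	wg.Wait()
	return out
}

func main() {
	tiles := [][]byte{{10}, {20}, {30}}
	double := func(b []byte) []byte { return []byte{b[0] * 2} }
	fmt.Println(ProcessAll(tiles, 8, double)) // [[20] [40] [60]]
}
```

Sending indices rather than tiles over the channel keeps results in input order, so reassembly does not need a separate sorting step.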

PhantomInTheWire/wasm-image-pipeline
