
[v3] benchmarks and performance tools #2034

Open
d-v-b opened this issue Jul 13, 2024 · 1 comment

Comments

@d-v-b
Contributor

d-v-b commented Jul 13, 2024

It would be very useful to record and publish benchmarks of how zarr-python performs in various workloads. Especially with the addition of sharding, I think people working with Zarr will benefit from some guidance on how to avoid performance problems. And without benchmarks, we can't optimize the performance of zarr-python itself.

So we should write some benchmarking code, tracking things like duration and memory usage for a few core workloads, like:

  • writing chunks to an array
  • reading chunks from an array
  • creating arrays and groups
  • deleting chunks from an existing array

As a reach goal, the benchmark code itself should be useful to people who want to check zarr-python performance on different compute / storage backends.
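To make the idea concrete, here is a minimal sketch of what such a benchmark harness could look like, using only the standard library (`time.perf_counter` for duration, `tracemalloc` for peak memory). The `write_workload` function is a hypothetical placeholder; a real benchmark would call zarr-python's array read/write APIs instead.

```python
import time
import tracemalloc

def benchmark(fn, *args, repeats=5):
    """Run fn several times, returning best wall-clock time and peak memory."""
    best_time = float("inf")
    peak_mem = 0
    for _ in range(repeats):
        tracemalloc.start()
        t0 = time.perf_counter()
        fn(*args)
        elapsed = time.perf_counter() - t0
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
        best_time = min(best_time, elapsed)
        peak_mem = max(peak_mem, peak)
    return best_time, peak_mem

# Placeholder workload; in practice this would write chunks to a zarr array.
def write_workload():
    data = bytes(1024 * 1024)  # 1 MiB of zeros standing in for one chunk
    return len(data)

duration, peak = benchmark(write_workload)
print(f"best time: {duration:.6f}s, peak memory: {peak} bytes")
```

Taking the best of several repeats (rather than the mean) is a common way to reduce noise from background processes when the workload itself is deterministic.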

It looks like @JackKelly already started work in this direction at https://github.com/zarr-developers/zarr-benchmark. @JackKelly, does the direction I'm proposing align with your vision for that repo?

@JackKelly
Contributor

Sounds good!

> does the direction i'm proposing align with your vision for that repo?

Absolutely!

Although I'm afraid I'm swamped with work at the moment, so I'm not sure when I'll next be able to work on that zarr-benchmark code. You're more than welcome to take that repo and do whatever you want with it!

Although, you might be better served by writing some super-simple scripts to benchmark zarr-python. I perhaps fell foul of the classic "computer science syndrome" of trying to build something general-purpose which inevitably ends up more complex than a set of special-purpose scripts 🙂

That said, as I'm sure you know, IO benchmarking is surprisingly tricky.

When reading from local IO, at a minimum you have to clear the operating system's cache (which zarr-benchmark does).
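On Linux, dropping the page cache between benchmark runs can be done like this (a minimal sketch; it requires root, and the `/proc/sys/vm/drop_caches` path is Linux-specific):

```shell
# Flush dirty pages to disk first, then drop the page cache so the
# next read actually hits the storage device rather than memory.
sync
echo 3 | sudo tee /proc/sys/vm/drop_caches > /dev/null
```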

Also, if you're really trying to be as fair and reproducible as possible then you need to consider if you're going to warm things up or not (both SSDs and cloud object storage perform a little differently for "cold" reads vs "warm" reads, even if "warm" reads aren't cached locally). zarr-benchmark doesn't handle this. TBH, if we're just interested in whether one zarr-python PR is 10x faster than another, then we can ignore these details. I think these details only become important when we're trying to measure performance differences of a few percent.
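The warm-vs-cold effect is easy to see even without touching the OS cache: the first read of a file usually behaves differently from repeated reads. A hypothetical illustration (note that without dropping the page cache first, even the "first" read may already be warm, so treat the numbers as a sketch):

```python
import os
import tempfile
import time

def timed_read(path):
    """Return the wall-clock time to read the whole file once."""
    t0 = time.perf_counter()
    with open(path, "rb") as f:
        f.read()
    return time.perf_counter() - t0

# Write a 4 MiB test file.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(os.urandom(4 * 1024 * 1024))
    path = f.name

first = timed_read(path)                         # first read after writing
warm = min(timed_read(path) for _ in range(5))   # best of several cached reads
print(f"first read: {first:.6f}s, warm read: {warm:.6f}s")
os.unlink(path)
```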

These plots from the excellent "AnyBlob" paper are informative:

[Image: throughput plots from the AnyBlob paper]
