add benchmarks using pytest-benchmark and codspeed #3562
Conversation
@zarr-developers/steering-council I don't have permission to register this repo with codspeed. I submitted a request to register it; could someone approve it?
done
does anyone have opinions about benchmarks? feel free to suggest something concrete. Otherwise, I think we should take this as-is and handle later benchmarks (like partial shard reads / writes) in a subsequent PR.
codspeed-hqbot commented Oct 30, 2025 (edited)
.github/workflows/codspeed.yml (outdated)

```yaml
- uses: CodSpeedHQ/action@v4
  with:
    mode: instrumentation
    run: hatch run test.py3.11-1.26-minimal:run-benchmark
```
can we test the latest instead? seems more appropriate...
The latest version of python? What's the reasoning? I'd rather update this file when we drop a supported version vs when a new version of python comes out.
Because we'd want to catch a perf regression from upstream changes too? I'm suggesting the latest versions of released libraries: py=3.13, np=2.2.
we don't have an upper bound on numpy versions, so I don't think this particular workflow will help us catch regressions from upstream changes -- we would need to update this workflow every time a new version of numpy is released. IMO that's something we should do in a separate benchmark workflow. This workflow here will run on every PR, and in that case the oldest version of numpy we support seems better.
we also don't have to use a pre-baked hatch environment here; we could define a dependency set specific to benchmarking. but my feeling is that benchmarking against older versions of stuff gives us a better measure of what users will actually experience.
indexing please. that'll exercise the codec pipeline too. a peakmem metric would be good to track also, if possible.
I don't think codspeed or pytest-benchmark do memory profiling. we would need https://pytest-memray.readthedocs.io/en/latest/ or something equivalent for that. and an indexing benchmark sounds like a great idea, but I don't think I have the bandwidth for it in this PR right now.
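For whoever picks up the indexing benchmark later, one possible shape for it is sketched below; the use of `arr.oindex`, the array size, and the selection sizes are assumptions rather than an agreed design.

```python
# Hypothetical orthogonal-indexing benchmark sketch; not part of this PR.
import numpy as np
import zarr
from zarr.storage import MemoryStore  # assumed in-memory store (zarr-python 3.x)


def test_orthogonal_integer_indexing(benchmark) -> None:
    arr = zarr.create_array(
        store=MemoryStore(), shape=(4096, 4096), chunks=(512, 512), dtype="float32"
    )
    arr[:] = 0.0
    rng = np.random.default_rng(0)
    rows = np.sort(rng.choice(arr.shape[0], size=1024, replace=False))
    cols = np.sort(rng.choice(arr.shape[1], size=1024, replace=False))
    # An outer (orthogonal) selection touches many chunks, so most of the time
    # should be spent in the codec pipeline rather than in indexing bookkeeping.
    benchmark(lambda: arr.oindex[rows, cols])
```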
…into chore/benchmarks
…into chore/benchmarks
I added a benchmark that clearly reveals the performance improvement of #3561.
I added some slice-based benchmarks based on the examples from #3524, and I updated the contributing docs with a section about the benchmarks. assuming we can resolve the discussion about which python / numpy version to use in the CI job, I think this is ready.
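As a rough illustration of the shape such slice benchmarks can take, here is a hedged sketch; the specific selections, array size, and the `zarr.create_array` / `MemoryStore` calls are assumptions, not the code from #3524 or from this PR.

```python
# Hypothetical slice-read benchmark sketch; selections and sizes are assumptions.
import numpy as np
import pytest
import zarr
from zarr.storage import MemoryStore

SELECTIONS = {
    "single-chunk": np.s_[:256, :256],
    "row-of-chunks": np.s_[:256, :],
    "strided": np.s_[::2, ::2],
}


@pytest.mark.parametrize("selection", list(SELECTIONS.values()), ids=list(SELECTIONS.keys()))
def test_read_slice(benchmark, selection) -> None:
    arr = zarr.create_array(
        store=MemoryStore(), shape=(2048, 2048), chunks=(256, 256), dtype="uint8"
    )
    arr[:] = 1
    # Each parametrized case reads a differently shaped selection, so the
    # timings show how the slicing pattern interacts with the chunk grid.
    benchmark(lambda: arr[selection])
```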
new problem: the codspeed CI benchmarks are way too slow! the benchmark suite runs in 90s locally, and it's taking over 40m to run in CI. Help would be appreciated in speeding this up.
owing to the large number of syscalls in our benchmark code, codspeed recommended using the walltime instrument instead of their virtual CPU instrument. But to turn on the walltime benchmarks, we would need to run our benchmarking code on codspeed's servers, which is a security risk. Given that codspeed is not turning out to be particularly simple, I am inclined to defer the codspeed CI setup to later work. But if someone can help get the test runtime down, and / or we are OK running our benchmarks on codspeed's servers, then maybe we can get that sorted in this PR.
Co-authored-by: Max Jones <14077947+maxrjones@users.noreply.github.com>
looks like the walltime instrument is working! I think this is g2g
Since #3554 was an unpopular direction, I'm going instead with codspeed + pytest-benchmark. Opening as a draft because I haven't looked into how codspeed works at all, but I'd like people to weigh in on whether these initial benchmarks make sense. Naturally we can add more specific ones later, but I figured just some bulk array read / write workloads would be a good start.
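For readers unfamiliar with pytest-benchmark, a minimal sketch of what a bulk read / write benchmark of this kind could look like is shown below. The `zarr.create_array` and `zarr.storage.MemoryStore` calls, the array shape, and the chunk size are illustrative assumptions, not the benchmarks actually added in this PR.

```python
# Illustrative sketch only -- not the benchmark code added in this PR.
import numpy as np
import zarr
from zarr.storage import MemoryStore  # assumed in-memory store (zarr-python 3.x)


def make_array() -> zarr.Array:
    # Small in-memory array; the real benchmarks may use other shapes and stores.
    return zarr.create_array(
        store=MemoryStore(), shape=(1024, 1024), chunks=(256, 256), dtype="uint8"
    )


def test_bulk_write(benchmark) -> None:
    arr = make_array()
    data = np.ones(arr.shape, dtype=arr.dtype)

    def write() -> None:
        arr[...] = data

    # pytest-benchmark's `benchmark` fixture calls the function repeatedly
    # and records timing statistics (min, mean, stddev, ...).
    benchmark(write)


def test_bulk_read(benchmark) -> None:
    arr = make_array()
    arr[:] = 1
    benchmark(lambda: arr[...])
```

Locally, `pytest --benchmark-only` prints a timing table for tests like these; CodSpeed's pytest plugin is meant to reuse the same `benchmark` fixture when the CI workflow runs them.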