All-or-nothing generation of multiple Groups/Arrays/.. with zarr: Possible Approaches? #3094
Hi everyone,

Context: I'm working on a data format built on top of zarr, using zarr for array and metadata storage.

Question: I was wondering how to approach that in zarr? Options I considered:

Other potential issues? I guess in all cases I might run into issues with the async approach of v3, or have to force synchronization for the transactions.

Thank you and best regards,
Replies: 1 comment 1 reply
Icechunk, which builds on top of zarr, provides exactly this.
This might not be an option, depending on what file / object storage system you're using. Object stores like S3 don't provide multi-object, atomic updates so Zarr alone isn't enough. Consolidated metadata can help with a subset of use cases where you only ever append new data (since the update to the arrays can be done ahead of time and the update to the consolidated metadata file is atomic). But icechunk is probably the way to go.
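To make the append-only idea a bit more concrete, here is a rough sketch of the "write the data first, then publish with one atomic update" pattern using only the Python standard library. It is not zarr's consolidated-metadata API and not how Icechunk works internally; `publish_atomically`, `visible_chunks` and `manifest.json` are hypothetical names made up for illustration, and the atomic step assumes a local filesystem where `os.replace` swaps a file atomically.

```python
import json
import os
import tempfile
from pathlib import Path

MANIFEST = "manifest.json"  # hypothetical stand-in for a single metadata file


def visible_chunks(root: Path) -> list[str]:
    """Return the names a reader may use: only what the manifest lists."""
    manifest_path = root / MANIFEST
    if not manifest_path.exists():
        return []
    return sorted(json.loads(manifest_path.read_text()))


def publish_atomically(root: Path, chunks: dict[str, bytes]) -> None:
    """Write all new data files, then expose them with one atomic manifest swap."""
    root.mkdir(parents=True, exist_ok=True)
    manifest_path = root / MANIFEST
    published = json.loads(manifest_path.read_text()) if manifest_path.exists() else {}

    # Append-only: refuse to touch anything readers can already see.
    clashes = [name for name in chunks if name in published]
    if clashes:
        raise ValueError(f"already published: {clashes}")

    # Step 1: write the data ahead of time. Readers ignore these files
    # because the manifest does not mention them yet.
    for name, payload in chunks.items():
        (root / name).write_bytes(payload)

    # Step 2: write the new manifest next to the old one, then swap it in.
    # The swap is the only moment at which the new data becomes visible.
    new_manifest = {**published, **{name: len(p) for name, p in chunks.items()}}
    fd, tmp_name = tempfile.mkstemp(dir=root, suffix=".tmp")
    with os.fdopen(fd, "w") as tmp:
        json.dump(new_manifest, tmp)
    os.replace(tmp_name, manifest_path)
```

If the process dies during step 1, readers still see the old manifest and the half-written files are just garbage to clean up later. This only covers appends, though; anything that has to modify or delete already-visible data needs real transactions.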
Oh, icechunk seems really cool and very fitting for that, thank you!