Allow seeking back to read skipped chunks #657
Conversation
197g left a comment
I like it in principle; seeking is a capability that will be highly useful in the streaming decoder. Since all the seeks are relative, we should make sure that intermittent interrupts work correctly with the position tracking. I don't foresee many problems with this, but a few tests should still be added.
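As a sketch of the kind of helper such a test might use (the `Interrupting` wrapper below is purely illustrative and not part of this PR):

```rust
use std::io::{self, Read, Seek, SeekFrom};

/// Illustrative test helper: wraps a reader and returns `ErrorKind::Interrupted`
/// on every other `read` call, so a test can check that the recorded chunk
/// positions survive intermittent interrupts.
struct Interrupting<R> {
    inner: R,
    interrupt_next: bool,
}

impl<R: Read> Read for Interrupting<R> {
    fn read(&mut self, buf: &mut [u8]) -> io::Result<usize> {
        if std::mem::replace(&mut self.interrupt_next, false) {
            return Err(io::Error::new(io::ErrorKind::Interrupted, "injected interrupt"));
        }
        self.interrupt_next = true;
        self.inner.read(buf)
    }
}

impl<R: Seek> Seek for Interrupting<R> {
    fn seek(&mut self, pos: SeekFrom) -> io::Result<u64> {
        self.inner.seek(pos)
    }
}
```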
```rust
if self.exif_position.is_none() {
    self.exif_position = Some(chunk_start);
}
```
The error treatment here differs from parsing the eXIf and iCCP chunks. In the latter case we error first with a Format error and then later, probably, downgrade that to BadAncillaryChunk in most cases. Here, however, we do nothing.
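For comparison, a minimal self-contained sketch of what aligning the two paths might look like; the `ChunkError` and `PositionTracker` names below are made up for illustration and are not the crate's types:

```rust
// Illustration only: a stand-in for the crate's Format / BadAncillaryChunk
// distinction; none of these names exist in the PR.
#[derive(Debug)]
enum ChunkError {
    DuplicateExif,
}

#[derive(Default)]
struct PositionTracker {
    exif_position: Option<u64>,
}

impl PositionTracker {
    // Error on a duplicate eXIf chunk instead of silently keeping the first
    // recorded position, mirroring the eager-parsing path's behaviour.
    fn record_exif(&mut self, chunk_start: u64) -> Result<(), ChunkError> {
        if self.exif_position.is_some() {
            return Err(ChunkError::DuplicateExif);
        }
        self.exif_position = Some(chunk_start);
        Ok(())
    }
}
```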
Also, do we consider the option changing through set_ignore_exif_chunk while it is being read? I.e. first reading one chunk fully and then only recording a position for a later one would still amount to a duplicate chunk. That may not happen in practice, but without the interface really assuring it, we should know whether that is in scope for this implementation or not.
```rust
let chunk_start = stream_position - 8 - self.current_chunk.raw_bytes.len() as u64;
match self.current_chunk.type_ {
    chunk::tEXt | chunk::zTXt | chunk::iTXt => {
```
Wouldn't we still want to know the exact kind of text chunk? zTXt is handled very differently from the other chunks. We can of course parse it back when we seek back and re-read the chunk anew, but it's fixed-size metadata, so we might as well record it.
Yeah, I wasn't sure whether to save the chunk size/type or just re-read it when requested. Easy enough to track it alongside the chunk start position.
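For illustration, tracking the kind alongside the position could look something like this (the names here are hypothetical, not the PR's actual types):

```rust
/// Hypothetical sketch: keep the text chunk kind next to the recorded start
/// position so zTXt can be handled differently when the chunk is re-read.
#[derive(Clone, Copy, Debug)]
enum TextChunkKind {
    Text, // tEXt
    Ztxt, // zTXt
    Itxt, // iTXt
}

#[derive(Clone, Copy, Debug)]
struct SkippedTextChunk {
    kind: TextChunkKind,
    /// Byte offset of the chunk's length field from the start of the stream.
    start: u64,
}
```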
With #630 in place, we now have most of the pieces to load PNG metadata without consuming an unbounded amount of space.
The key here is that we track just the start position of each unbounded-size chunk we might care about, which means that we can track thousands of text chunks even with a tiny space limit. When it comes time to read the chunks, we can seek back to them and provide the desired info. But if the chunk info is never requested, we don't pay the overhead of storing it. When in recording mode, I believe the only non-constant memory usage should be the recorded text chunk positions and the two rows of space reserved for the unfiltering buffer.
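As a rough, self-contained illustration of the seek-back step (this is not the crate's API; the helper below is made up for this description):

```rust
use std::io::{Read, Seek, SeekFrom};

// Toy version of the deferred read: given the recorded start offset of a
// chunk, seek back, read its payload, and restore the decoder's position.
fn read_chunk_at<R: Read + Seek>(r: &mut R, start: u64) -> std::io::Result<Vec<u8>> {
    let resume = r.stream_position()?;   // remember where decoding left off
    r.seek(SeekFrom::Start(start))?;
    let mut header = [0u8; 8];           // 4-byte length + 4-byte chunk type
    r.read_exact(&mut header)?;
    let len = u32::from_be_bytes(header[..4].try_into().unwrap()) as usize;
    let mut data = vec![0u8; len];
    r.read_exact(&mut data)?;            // chunk payload (CRC left unread here)
    r.seek(SeekFrom::Start(resume))?;    // put the stream back
    Ok(data)
}
```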
This is particularly useful for the image crate, which needs to know whether there's a tRNS chunk (to determine whether the color type is RGB or RGBA) before it knows whether the EXIF or ICC profile will be requested. By doing things this way, it neatly sidesteps the question of setting decoding limits before reading metadata. The main thing decoding limits were being used for was preventing zip bombs in compressed iCCP chunks and, to a lesser degree, preventing unreasonably sized EXIF or text chunks.