>>>fromcompression.zstdimportcompress,decompress>>>invalid=compress(b'xxx')+b'yyy'>>>decompress(invalid)Traceback (mostrecentcalllast):File"<python-input-2>",line1,in<module>decompress(invalid)~~~~~~~~~~^^^^^^^^^File"/redacted/Lib/compression/zstd/__init__.py",line157,indecompressresults.append(decomp.decompress(data))~~~~~~~~~~~~~~~~~^^^^^^_zstd.ZstdError:Unabletodecompresszstddata:Unknownframedescriptor

Indeed,the Zstandard specification says “Zstandard compressed data is made of one or more frames”, and it does not say that random data can be added at the end.

However, this is not the case inZstdFile /zstd.open:

>>>fromcompression.zstdimportZstdFile>>>fromioimportBytesIO>>>ZstdFile(BytesIO(invalid)).read()b'xxx'

After this PR, the last call becomes:

>>>ZstdFile(BytesIO(invalid)).read()Traceback (mostrecentcalllast):File"<python-input-5>",line1,in<module>ZstdFile(BytesIO(invalid)).read()~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^File"/redacted/Lib/compression/zstd/_zstdfile.py",line176,inreadreturnself._buffer.read(size)~~~~~~~~~~~~~~~~~^^^^^^File"/redacted/Lib/compression/_common/_streams.py",line118,inreadallwhiledata:=self.read(sys.maxsize):~~~~~~~~~^^^^^^^^^^^^^File"/redacted/Lib/compression/_common/_streams.py",line91,inreaddata=self._decompressor.decompress(rawblock,size)_zstd.ZstdError:Unabletodecompresszstddata:Unknownframedescriptor

Issue:Implement PEP 784 - Adding Zstandard to the Python standard library #132983

ZstdFile: don't allow trailer data

98d2b86

bedevere-appbot mentioned this pull request

May 9, 2025

Implement PEP 784 - Adding Zstandard to the Python standard library#132983

Closed

13 tasks

Rogdham marked this pull request as ready for review

May 9, 2025 09:36

bedevere-appbot added the awaiting review label

May 9, 2025

AA-Turner added the needs backport to 3.14bugs and security fixes label

May 9, 2025

Copy link

Member

emmatyping commentedMay 9, 2025

The current behavior matches LZMA. I think unlikedecompress which is handed what is necessarily a zstd stream of one or more frames, withZstdFile, a user may be parsing a format which has additional information after a zstd stream.

>>>from lzmaimport LZMAFile, compress>>>from ioimport BytesIO>>> invalid= compress(b'foo')+b'bar'>>> LZMAFile(BytesIO(invalid)).read()b'foo'>>>

Copy link

ContributorAuthor

Rogdham commentedMay 9, 2025

You are right this is the case forLZMAFile with formatFORMAT_AUTO (which is the default) and also forBZ2File.

However,LZMAFile with formatFORMAT_XZ as well asGzipFile raise an exception in that case.

>>>fromlzmaimportLZMAFile,compress,FORMAT_XZ>>>fromioimportBytesIO>>>invalid=compress(b'foo')+b'bar'>>>LZMAFile(BytesIO(invalid),format=FORMAT_XZ).read()Traceback (mostrecentcalllast):File"<python-input-3>",line1,in<module>LZMAFile(BytesIO(invalid),format=FORMAT_XZ).read()~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^File"/redacted/lzma.py",line208,inreadreturnself._buffer.read(size)~~~~~~~~~~~~~~~~~^^^^^^File"/redacted/_compression.py",line118,inreadallwhiledata:=self.read(sys.maxsize):~~~~~~~~~^^^^^^^^^^^^^File"/redacted/_compression.py",line99,inreadraiseEOFError("Compressed file ended before the ""end-of-stream marker was reached")EOFError:Compressedfileendedbeforetheend-of-streammarkerwasreached

emmatyping approved these changes

May 9, 2025

View reviewed changes

Copy link

Member

emmatyping left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Okay this looks good then!

bedevere-appbot added awaiting core review and removed awaiting review labels

May 9, 2025

emmatyping added the skip news label

May 9, 2025

Copy link

ContributorAuthor

Rogdham commentedMay 9, 2025•
edited
Loading

In addition, considerdecompress(compress(b"xxx") + b"yyy"):

returnsb"xxx" on:lzma (formatFORMAT_AUTO),bz2
raises an exception on:lzma (formatFORMAT_XZ),gzip

Since forzstd we raise an exception on that, I would say to do the same forZstdFile to be consistent.

AA-Turner merged commit50b5370 intopython:main

May 10, 2025

48 checks passed

Copy link

miss-islington-appbot commentedMay 10, 2025

Thanks@Rogdham for the PR, and@AA-Turner for merging it 🌮🎉.. I'm working now to backport this PR to: 3.14.
🐍🍒⛏🤖

bedevere-appbot removed the awaiting core review label

May 10, 2025

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request

May 10, 2025

pythongh-132983: Don't allow trailer data in ZstdFile (pythonGH-133736)

0c55b02

(cherry picked from commit50b5370)Co-authored-by: Rogdham <3994389+Rogdham@users.noreply.github.com>

Copy link

bedevere-appbot commentedMay 10, 2025

GH-133799 is a backport of this pull request to the3.14 branch.

bedevere-appbot removed the needs backport to 3.14bugs and security fixes label

May 10, 2025

AA-Turner pushed a commit that referenced this pull request

May 10, 2025

[3.14]gh-132983: Don't allow trailer data in ZstdFile (GH-133736) (#…

99ca086

…133799)gh-132983: Don't allow trailer data in ZstdFile (GH-133736)(cherry picked from commit50b5370)Co-authored-by: Rogdham <3994389+Rogdham@users.noreply.github.com>

Rogdham deleted the zstdfile-trailer-exception branch

May 10, 2025 06:45

Labels

skip news

3 participants

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-132983: Don't allow trailer data in ZstdFile#133736

gh-132983: Don't allow trailer data in ZstdFile#133736

Uh oh!

Conversation

Rogdham commentedMay 9, 2025•
edited
Loading

Uh oh!

Uh oh!

emmatyping commentedMay 9, 2025

Uh oh!

Rogdham commentedMay 9, 2025

Uh oh!

emmatyping left a comment

Choose a reason for hiding this comment

Uh oh!

Rogdham commentedMay 9, 2025•
edited
Loading

Uh oh!

Uh oh!

Uh oh!

miss-islington-appbot commentedMay 10, 2025

Uh oh!

bedevere-appbot commentedMay 10, 2025

Uh oh!

Uh oh!

Movatterモバイル変換

Uh oh!

gh-132983: Don't allow trailer data in ZstdFile#133736

gh-132983: Don't allow trailer data in ZstdFile#133736

Uh oh!

Conversation

Rogdham commentedMay 9, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

emmatyping commentedMay 9, 2025

Uh oh!

Rogdham commentedMay 9, 2025

Uh oh!

emmatyping left a comment

Choose a reason for hiding this comment

Uh oh!

Rogdham commentedMay 9, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

Uh oh!

miss-islington-appbot commentedMay 10, 2025

Uh oh!

bedevere-appbot commentedMay 10, 2025

Uh oh!

Uh oh!

Rogdham commentedMay 9, 2025•
edited
Loading

Rogdham commentedMay 9, 2025•
edited
Loading