python/cpythonPublic

NotificationsYou must be signed in to change notification settings
Fork34.1k
Star71.7k

Heap buffer overflow in`_PyTokenizer_ensure_utf8` #144872

New issue

Open

Heap buffer overflow in_PyTokenizer_ensure_utf8#144872

Labels

interpreter-core(Objects, Python, Grammar, and Parser dirs)topic-parsertype-bugAn unexpected behavior, bug, or error

Description

AdamKorcz

opened

on Feb 16, 2026

Bug report

Bug description:

OSS-Fuzz has found a heap buffer overflow in_PyTokenizer_ensure_utf8.Link to OSS-Fuzz bug report.

The root cause is thatvalid_utf8() inParser/tokenizer/helpers.c checks continuation bytes in reverse order thus readers[expected] befores[1] on these lines:

cpython/Parser/tokenizer/helpers.c

Lines 497 to 499 in8b7b5a9

	for (;expected;expected--)
	if (s[expected]<0x80\|\|s[expected] >=0xC0)
	return0;

When a multi-byte UTF-8 sequence is truncated - such as a 3-byte lead\xEA followed immediately by a null terminator - the backward loop reads past the end of the valid data before encountering the null byte that would stop it.

This is not a security-critical issue.

CPython versions tested on:

CPython main branch

Operating systems tested on:

No response

Linked PRs

Metadata

Assignees

No one assigned

Labels

interpreter-core(Objects, Python, Grammar, and Parser dirs)topic-parsertype-bugAn unexpected behavior, bug, or error

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Heap buffer overflow in`_PyTokenizer_ensure_utf8` #144872

Description

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Movatterモバイル変換

Uh oh!

Heap buffer overflow in_PyTokenizer_ensure_utf8 #144872

Description

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Heap buffer overflow in`_PyTokenizer_ensure_utf8` #144872