This removes any unknown opcodes. Instead the deopt table should just recognize unknown opcodes as deopting to themselves, allowing the extensible interpreter loop to consume unknown opcodes.

Issue:code objects remove unknown opcodes from the instruction stream when accessingco_code #128045

DinoV added the skip news label

Dec 17, 2024

DinoV changed the title~~Mark unknown opcodes as deopting to themselves~~gh-128045: Mark unknown opcodes as deopting to themselves

Dec 17, 2024

bedevere-appbot mentioned this pull request

Dec 17, 2024

code objects remove unknown opcodes from the instruction stream when accessingco_code#128045

Closed

DinoV marked this pull request as ready for review

December 17, 2024 20:12

DinoV requested a review frommarkshannon as acode owner

December 17, 2024 20:12

bedevere-appbot added the awaiting core review label

Dec 17, 2024

iritkatriel reviewed

Dec 17, 2024

View reviewed changes

Tools/cases_generator/opcode_metadata_generator.py Outdated

Comment on lines 251 to 256

		for name, deopt in sorted(deopts):
		out.emit(f"[{name}] = {deopt},\n")
		defined = set(analysis.opmap.values())
		for i in range(256):
		if i not in defined:
		out.emit(f"[{i}] = {i},\n")

Copy link

Member

iritkatrielDec 17, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Since we're not testing this at all in cpython, I'd suggest we at least add a couple of assertions to make sure this is correctly covering the range of opcodes:

Suggested change

	forname,deoptinsorted(deopts):
	out.emit(f"[{name}] ={deopt},\n")
	defined=set(analysis.opmap.values())
	foriinrange(256):
	ifinotindefined:
	out.emit(f"[{i}] ={i},\n")
	defined=set(analysis.opmap.values())
	foriinrange(256):
	ifinotindefined:
	deopts.append((f'{i}',f'{i}'))

	assertlen(deopts)==256
	assertlen(set(x[0]forxindeopts))==256
	forname,deoptinsorted(deopts):
	out.emit(f"[{name}] ={deopt},\n")

facebook-github-bot pushed a commit to facebookincubator/cinder that referenced this pull request

Dec 18, 2024

Mark undefined instructions as deopting to themselves

3d1b81e

Summary:When CPython hands out the bytecode it will first do a de-opt on it:https://www.internalfb.com/code/fbsource/[241e0d0760d4c107a3c0ac2b4914524120b0c909]/third-party/python/3.12/Objects/codeobject.c?lines=1540&reveal=1540This uses the de-opt table which only defines the known opcodes, meaning unknown opcodes get turned into 0's. We need CPython to at least define unknown opcodes to at least de-opt to themselves.Upstream PR:python/cpython#128044Reviewed By: jbower-fbDifferential Revision: D67350914fbshipit-source-id: 0073efab52da1be775272e7dd9ae5a46468ccb10

Copy link

Member

markshannon commentedJan 7, 2025

I think we regard undefined instructions an error.
If you want to pass custom bytecodes to your custom interpreter, a more robust approach might be needed.
We can do the deopt thing for now, but it seems fragile.

For example, one thing I have considered is storing bytecodes in a compact format on disk: Instructions without an oparg would take 1 byte, those with an oparg would take 2. We would then combine the unmarshalling and quickening steps to create the full in-memory form in a single pass. Custom instructions would not survive this process.

Mark unknown opcodes as deopting to themselves

e37182c

DinoV force-pushed thedeopt_unknown_ops branch from7048634 toa675bfaCompare

April 2, 2025 17:28

Copy link

ContributorAuthor

DinoV commentedApr 2, 2025

Took me a while to get back to this but I'm finally back :) I applied the changes suggested by@iritkatriel.

I think we regard undefined instructions an error. If you want to pass custom bytecodes to your custom interpreter, a more robust approach might be needed. We can do the deopt thing for now, but it seems fragile.
For example, one thing I have considered is storing bytecodes in a compact format on disk: Instructions without an oparg would take 1 byte, those with an oparg would take 2. We would then combine the unmarshalling and quickening steps to create the full in-memory form in a single pass. Custom instructions would not survive this process.

I think as long as we could still have some way to construct a code object ourselves that would be fine, we'd just may need to implement our own custom unmarshaling logic that could support our opcodes. Obviously that doesn't cover all the ways that things could change in the future but we can try and figure out how to adapt to other potential changes :)

Add extra assertions on number of opcodes emitted

efe6990

DinoV force-pushed thedeopt_unknown_ops branch froma675bfa toefe6990Compare

April 2, 2025 17:36

markshannon approved these changes

May 19, 2025

View reviewed changes

Copy link

Member

markshannon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Looks good

bedevere-appbot added awaiting merge and removed awaiting core review labels

May 19, 2025

DinoV added awaiting core review needs backport to 3.14bugs and security fixes and removed awaiting merge labels

May 19, 2025

DinoV merged commitcc9add6 intopython:main

May 19, 2025

59 checks passed

Copy link

miss-islington-appbot commentedMay 19, 2025

Thanks@DinoV for the PR 🌮🎉.. I'm working now to backport this PR to: 3.14.
🐍🍒⛏🤖

bedevere-appbot removed the awaiting core review label

May 19, 2025

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request

May 19, 2025

pythongh-128045: Mark unknown opcodes as deopting to themselves (pyth…

215207e

…onGH-128044)* Mark unknown opcodes as deopting to themselves(cherry picked from commitcc9add6)Co-authored-by: Dino Viehland <dinoviehland@meta.com>

Copy link

bedevere-appbot commentedMay 19, 2025

GH-134228 is a backport of this pull request to the3.14 branch.

bedevere-appbot removed the needs backport to 3.14bugs and security fixes label

May 19, 2025

DinoV pushed a commit that referenced this pull request

May 19, 2025

[3.14]gh-128045: Mark unknown opcodes as deopting to themselves (GH-…

c869898

…128044) (#134228)*gh-128045: Mark unknown opcodes as deopting to themselves (GH-128044)

Pranjal095 pushed a commit to Pranjal095/cpython that referenced this pull request

Jul 12, 2025

pythongh-128045: Mark unknown opcodes as deopting to themselves (pyth…

76d7013

…on#128044)* Mark unknown opcodes as deopting to themselves

Labels

skip news

3 participants

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-128045: Mark unknown opcodes as deopting to themselves#128044

gh-128045: Mark unknown opcodes as deopting to themselves#128044

Uh oh!

Conversation

DinoV commentedDec 17, 2024•
edited by bedevere-appbot
Loading

Uh oh!

Uh oh!

iritkatrielDec 17, 2024

Choose a reason for hiding this comment

Uh oh!

markshannon commentedJan 7, 2025

Uh oh!

DinoV commentedApr 2, 2025

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

miss-islington-appbot commentedMay 19, 2025

Uh oh!

bedevere-appbot commentedMay 19, 2025

Uh oh!

Uh oh!

Movatterモバイル変換

Uh oh!

gh-128045: Mark unknown opcodes as deopting to themselves#128044

gh-128045: Mark unknown opcodes as deopting to themselves#128044

Uh oh!

Conversation

DinoV commentedDec 17, 2024• edited by bedevere-appbotLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

iritkatrielDec 17, 2024

Choose a reason for hiding this comment

Uh oh!

markshannon commentedJan 7, 2025

Uh oh!

DinoV commentedApr 2, 2025

Uh oh!

markshannon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

miss-islington-appbot commentedMay 19, 2025

Uh oh!

bedevere-appbot commentedMay 19, 2025

Uh oh!

Uh oh!

DinoV commentedDec 17, 2024•
edited by bedevere-appbot
Loading