Movatterモバイル変換

Use dedicated opcodes to speed up calls/attribute lookups with super() as receiver#87729

Closed

bedevere-bot added the awaiting core review label

carljm mentioned this pull request

bpo-43563 : Introduce dedicated opcodes for super calls#24936

Closed

carljm added4 commits

April 12, 2023 21:32

document LOAD_ZERO_SUPER_ATTR

5625e73

fix gdb test_wrapper_call

0775efe

optimize 2-arg super also

f229b5b

fix incompatible assignment

626999d

carljm changed the title~~gh-87729: add instruction for faster zero-arg super()~~gh-87729: add LOAD_SUPER_ATTR instruction for faster super()

carljm added2 commits

April 13, 2023 09:52

Merge branch 'main' into superopt

9384106

* main:pythongh-103479: [Enum] require __new__ to be considered a data type (pythonGH-103495)pythongh-103365: [Enum] STRICT boundary corrections (pythonGH-103494)pythonGH-103488: Use return-offset, not yield-offset. (pythonGH-103502)pythongh-103088: Fix test_venv error message to avoid bytes/str warning (pythonGH-103500)pythonGH-103082: Turn on branch events for FOR_ITER instructions. (python#103507)pythongh-102978: Fix mock.patch function signatures for class and staticmethod decorators (python#103228)pythongh-103462: Ensure SelectorSocketTransport.writelines registers a writer when data is still pending (python#103463)pythongh-95299: Rework test_cppext.py to not invoke setup.py directly (python#103316)

fix bad first arg

92c943b

Copy link

Member

corona10 commentedApr 13, 2023

cc@Fidget-Spinner

Copy link

MemberAuthor

carljm commentedApr 14, 2023

https://github.com/carljm/cpython/compare/superopt...carljm:cpython:superopt_spec?expand=1 has a draft of the first specialization ofLOAD_SUPER_ATTR built on top of this, specializing for the method case.

With that specialization, the./python -m pyperf timeit -s 'from superbench import b' 'b.meth()' microbenchmark shown above now runs in 56ns, down from 130ns originally and 70ns without the specialization. That's 2.3x better than the current main-branch speed. For reference, a version of the same benchmark that usesreturn A.meth(self) in place ofreturn super().meth() runs in 48ns. So we are getting pretty close to zero-costsuper method calls.

(If reviewers would prefer to just have the specialization(s) included directly in this PR and all reviewed together, let me know and I can push everything here.)

corona10 added the 🔨 test-with-refleak-buildbotsTest PR w/ refleak buildbots; report in status section label

Copy link

bedevere-bot commentedApr 14, 2023

🤖 New build scheduled with the buildbot fleet by@corona10 for commit92c943b 🤖

If you want to schedule another build, you need to add the🔨 test-with-refleak-buildbots label again.

bedevere-bot removed the 🔨 test-with-refleak-buildbotsTest PR w/ refleak buildbots; report in status section label

corona10 reviewed

Objects/typeobject.c OutdatedShow resolvedHide resolved

corona10 reviewed

Python/bytecodes.c OutdatedShow resolvedHide resolved

corona10 reviewed

Objects/typeobject.c OutdatedShow resolvedHide resolved

carljmand others added2 commits

April 14, 2023 08:35

Apply suggestions from code review

775ed0f

Co-authored-by: Dong-hee Na <donghee.na92@gmail.com>

don't unnecessarily re-find args in error case

2077f1a

update generated cases with new comment

3a3cb74

Copy link

Member

markshannon commentedApr 18, 2023

Is the microbenchmark code correct? It doesn't look like you callmeth()

Copy link

MemberAuthor

carljm commentedApr 18, 2023

Is the microbenchmark code correct? It doesn't look like you callmeth()

The call tob.meth() happens in the actual invocation ofpyperf timeit:./python -m pyperf timeit -s 'from superbench import b' 'b.meth()'

markshannon reviewed

Apr 18, 2023

Copy link

Member

markshannon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

There is a fair bit of branchingLOAD_SUPER_ATTR which suggest that either it needs reworking or splitting up.

I've made a few suggestions as to how it can be made less branchy.
We'll see if that is sufficient.

The compiler code looks OK to me, but I'll leave it@iritkatriel to review it properly.

Lib/test/test_super.pyShow resolvedHide resolved

Objects/typeobject.c OutdatedShow resolvedHide resolved

Python/bytecodes.c OutdatedShow resolvedHide resolved

Objects/typeobject.cShow resolvedHide resolved

Copy link

MemberAuthor

carljm commentedApr 19, 2023•
edited
Loading

@markshannon

Thanks for the review!

There is a fair bit of branching LOAD_SUPER_ATTR which suggest that either it needs reworking or splitting up.

The causes of branching are these:

Has built-insuper been replaced or shadowed? This branching is unavoidable, since it can happen dynamically; we have to check at runtime.
Are we loading a method that will be called?
Is this zero-arg or two-argsuper? (We only need to know this ifsuper has been shadowed, so we can reconstruct the right call to whatever it is now.)

(2) and (3) are both known at compile time, so we could split the opcode in two along either axis (i.e.LOAD_SUPER_ATTR vsLOAD_SUPER_METHOD, orLOAD_ZERO_SUPER_ATTR vsLOAD_TWO_SUPER_ATTR). I considered both splits, and decided neither made sense: the second split would result in two separate opcodes that we'll later want to specialize to the same opcode, which is awkward, and the first split loses the parallel to howLOAD_ATTR works. (Both splits would result in code duplication.)

Your suggestion above about how to handleoparg & 2 eliminates the branching for zero-arg vs two-argsuper in the shadowing case; hopefully that's enough.

iritkatriel reviewed

Apr 19, 2023

Python/compile.cShow resolvedHide resolved

carljm added4 commits

April 19, 2023 15:58

simplify oparg & 2 handling

e4466a7

Merge branch 'main' into superopt

5c0a21c

* main: (24 commits)pythongh-98040: Move the Single-Phase Init Tests Out of test_imp (pythongh-102561)pythongh-83861: Fix datetime.astimezone() method (pythonGH-101545)pythongh-102856: Clean some of the PEP 701 tokenizer implementation (python#103634)pythongh-102856: Skip test_mismatched_parens in WASI builds (python#103633)pythongh-102856: Initial implementation of PEP 701 (python#102855)pythongh-103583: Add ref. dependency between multibytecodec modules (python#103589)pythongh-83004: Harden msvcrt further (python#103420)pythonGH-88342: clarify that `asyncio.as_completed` accepts generators yielding tasks (python#103626)pythongh-102778: IDLE - make sys.last_exc available in Shell after traceback (python#103314)pythongh-103582: Remove last references to `argparse.REMAINDER` from docs (python#103586)pythongh-103583: Always pass multibyte codec structs as const (python#103588)pythongh-103617: Fix compiler warning in _iomodule.c (python#103618)pythongh-103596: [Enum] do not shadow mixed-in methods/attributes (pythonGH-103600)pythonGH-100530: Change the error message for non-class class patterns (pythonGH-103576)pythongh-95299: Remove lingering setuptools reference in installer scripts (pythonGH-103613)  [Doc] Fix a typo in optparse.rst (python#103504)pythongh-101100: Fix broken reference `__format__` in `string.rst` (python#103531)pythongh-95299: Stop installing setuptools as a part of ensurepip and venv (python#101039)pythonGH-103484: Docs: add linkcheck allowed redirects entries for most cases (python#103569)pythongh-67230: update whatsnew note for csv changes (python#103598)  ...

cleanup and clarification

f161268

move __class__ special case out of the fast path

df442c0

Copy link

MemberAuthor

carljm commentedApr 20, 2023

@markshannon I've now addressed or replied to all comments, if you want to take another look.

Merge branch 'main' into superopt

19b8025

* main: (53 commits)pythongh-102498 Clean up unused variables and imports in the email module  (python#102482)pythongh-99184: Bypass instance attribute access in `repr` of `weakref.ref` (python#99244)pythongh-99032: datetime docs: Encoding is no longer relevant (python#93365)pythongh-94300: Update datetime.strptime documentation (python#95318)pythongh-103776: Remove explicit uses of $(SHELL) from Makefile (pythonGH-103778)pythongh-87092: fix a few cases of incorrect error handling in compiler (python#103456)pythonGH-103727: Avoid advancing tokenizer too far in f-string mode (pythonGH-103775)  Revert "Add tests for empty range equality (python#103751)" (python#103770)pythongh-94518: Port 23-argument `_posixsubprocess.fork_exec` to Argument Clinic (python#94519)pythonGH-65022: Fix description of copyreg.pickle function (python#102656)pythongh-103323: Get the "Current" Thread State from a Thread-Local Variable (pythongh-103324)pythongh-91687: modernize dataclass example typing (python#103773)pythongh-103746: Test `types.UnionType` and `Literal` types together (python#103747)pythongh-103765: Fix 'Warning: py:class reference target not found: ModuleSpec' (pythonGH-103769)pythongh-87452: Improve the Popen.returncode docs  Removed unnecessary escaping of asterisks (python#103714)pythonGH-102973: Slim down Fedora packages in the dev container (python#103283)pythongh-103091: Add PyUnstable_Type_AssignVersionTag (python#103095)  Add tests for empty range equality (python#103751)pythongh-103712: Increase the length of the type name in AttributeError messages (python#103713)  ...

markshannon self-requested a review

April 24, 2023 21:12

markshannon approved these changes

Merge branch 'main' into superopt

0de5bc6

* main:pythongh-101517: fix line number propagation in code generated for except* (python#103550)pythongh-103780: Use patch instead of mock in asyncio unix events test (python#103782)

carljmenabled auto-merge (squash)

April 24, 2023 21:27

carljm added the 🔨 test-with-refleak-buildbotsTest PR w/ refleak buildbots; report in status section label

Copy link

bedevere-bot commentedApr 24, 2023

🤖 New build scheduled with the buildbot fleet by@carljm for commit0de5bc6 🤖

If you want to schedule another build, you need to add the🔨 test-with-refleak-buildbots label again.

bedevere-bot removed the 🔨 test-with-refleak-buildbotsTest PR w/ refleak buildbots; report in status section label

Merge branch 'main' into superopt

dbe1665

* main:pythongh-100227: Only Use deepfreeze for the Main Interpreter (pythongh-103794)pythongh-103492: Clarify SyntaxWarning with literal comparison (python#103493)pythongh-101100: Fix Sphinx warnings in `argparse` module (python#103289)

carljm added the 🔨 test-with-refleak-buildbotsTest PR w/ refleak buildbots; report in status section label

Copy link

bedevere-bot commentedApr 24, 2023

🤖 New build scheduled with the buildbot fleet by@carljm for commitdbe1665 🤖

If you want to schedule another build, you need to add the🔨 test-with-refleak-buildbots label again.

bedevere-bot removed the 🔨 test-with-refleak-buildbotsTest PR w/ refleak buildbots; report in status section label

carljm merged commit0dc8b50 intopython:main

bedevere-bot removed the awaiting merge label

carljm added a commit to carljm/cpython that referenced this pull request