
GH-117581: Specialize binary operators by refcount as well as type. #117627


Conversation

@markshannon (Member) commented on Apr 8, 2024 (edited):

This PR changes the specialization of binary operators from the current, fairly ad-hoc approach to a more principled approach: specializing by refcount and using table lookup to specialize by type.

Tier 1 performance is in the noise, maybe showing a slight slowdown.

I expect the advantages of this approach to show up when applied to BINARY_SUBSCR and COMPARE_OP, and with the JIT.

It also opens the possibility of specializing binary operators for numpy arrays and the Decimal class.
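
For illustration, here is a minimal sketch in plain C of the "table lookup to specialize by type" idea described above. All identifiers below (add_table, known_type_index, binary_add_specialized, NUM_KNOWN_TYPES) are hypothetical, not this PR's actual names; the real implementation lives in the interpreter's specialization machinery.

```c
/* Minimal sketch of specializing a binary operator by type via table lookup.
 * All identifiers here are hypothetical, not taken from the PR. */
#include <Python.h>

typedef PyObject *(*binaryfunc2)(PyObject *left, PyObject *right);

#define NUM_KNOWN_TYPES 3 /* int, float, str in this sketch */

/* One table per operator; a NULL entry means "no fast path for this pair". */
static binaryfunc2 add_table[NUM_KNOWN_TYPES][NUM_KNOWN_TYPES];

/* Map a type to its row/column index, or -1 if the type is not specialized. */
static inline int
known_type_index(PyObject *obj)
{
    if (PyLong_CheckExact(obj)) return 0;
    if (PyFloat_CheckExact(obj)) return 1;
    if (PyUnicode_CheckExact(obj)) return 2;
    return -1;
}

static PyObject *
binary_add_specialized(PyObject *left, PyObject *right)
{
    int l = known_type_index(left);
    int r = known_type_index(right);
    if (l < 0 || r < 0 || add_table[l][r] == NULL) {
        /* No specialized entry: fall back to the generic protocol. */
        return PyNumber_Add(left, right);
    }
    return add_table[l][r](left, right);
}
```

The appeal of such a table is that adding a fast path for, say, numpy arrays or Decimal becomes a matter of registering another entry rather than introducing yet another ad-hoc specialized instruction.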

unused/1 +
_GUARD_NOS_REFCNT1 +
_GUARD_TOS_IMMORTAL +
_BINARY_OP_TABLE_NN;


@Fidget-Spinner (review comment): Can you please document in the bytecodes.c file what these suffixes mean? Some of them are easy to see, but I don't immediately know what NN vs ND means.

@markshannon (Member, Author) replied:

Done.
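
As a rough interpretation of the guard micro-ops in the sequence quoted above (a sketch only; the real definitions live in bytecodes.c, and the helper names below are made up): _GUARD_NOS_REFCNT1 plausibly deoptimizes unless the next-on-stack operand has a reference count of exactly 1, and _GUARD_TOS_IMMORTAL unless the top-of-stack operand is immortal, which lets the table-based op skip or reuse reference-count operations.

```c
/* Sketch of what the refcount guards plausibly check, written as plain C
 * helpers rather than the bytecodes.c DSL. Function names are hypothetical. */
#include <Python.h>
#include <stdbool.h>

/* NOS (next-on-stack, the left operand) must hold the only reference, so a
 * specialized op could reuse or free it without affecting other owners. */
static inline bool
guard_nos_refcnt1(PyObject *left)
{
    return Py_REFCNT(left) == 1;
}

/* TOS (top-of-stack, the right operand) must be immortal (e.g. small ints or
 * interned strings), so its reference count never needs adjusting. */
static inline bool
guard_tos_immortal(PyObject *right)
{
    return _Py_IsImmortal(right);
}
```

If either check fails at run time, the interpreter would deoptimize back to the generic BINARY_OP path instead of taking the table-based fast path.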

@markshannon (Member, Author) commented:

Performance on both tier 1 and tier 2 is about 1% slower.
With deferred reference counting, there will be no need for specializing on refcount, so I'm deferring this until deferred reference counting is done.

@markshannon (Member, Author) commented:

Earlier results had an issue with the telco benchmark, probably due to the decimal module falling back to the pure Python version.

Benchmarking on tier 1 shows the slowdown to be in the noise.
The significant results are a speedup of 7% for spectral_norm and a slowdown of 12% for nbody.
This is presumably due to more complete specialization for spectral_norm and additional overhead for the already well specialized nbody code.
We expect the advantage of better specialization to be important in tier 2, but the additional tier 1 overhead to be largely irrelevant there.

@brandtbucher self-requested a review on June 6, 2024.
@brettcannon removed their request for review on June 11, 2024.
@markshannon (Member, Author) commented:

Abandoning this, as #128722 is handling the same problem and we have better ideas for reducing refcounting overhead (faster-cpython/ideas#700).


Reviewers

@Fidget-Spinner left review comments.

Awaiting requested review from: @gvanrossum, @ericsnowcurrently (code owner), @ncoghlan (code owner), @warsaw (code owner), @methane (code owner), @brandtbucher


2 participants: @markshannon, @Fidget-Spinner
