
GH-93533: Shrink the `LOAD_ATTR` caches #103014


Closed

brandtbucher wants to merge 7 commits into python:main from brandtbucher:type-cache-fixed


Conversation


Member

brandtbucher commented Mar 24, 2023 (edited by bedevere-bot)

This adds a fixed array of cached methods to the `PyTypeObject` struct. This approach saves memory if there are ~23x more `LOAD_ATTR` sites than there are types.
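The break-even condition above can be written as a small calculation. This is only a sketch: the function names are hypothetical, and the byte counts are expressed in arbitrary units where each `LOAD_ATTR` site saves 1 unit. A ~23x break-even then corresponds to a per-type cost of about 23 units. The real sizes are not stated in this description.

```python
def break_even_ratio(saved_per_site, added_per_type):
    """Sites-per-type ratio above which the per-type array pays for itself."""
    return added_per_type / saved_per_site


def saves_memory(num_sites, num_types, saved_per_site, added_per_type):
    """True if the total savings at sites exceed the total cost added to types."""
    return num_sites * saved_per_site > num_types * added_per_type


# Assumed units: 1 unit saved per site, 23 units added per type.
print(break_even_ratio(1, 23))
print(saves_memory(num_sites=3000, num_types=100, saved_per_site=1, added_per_type=23))
print(saves_memory(num_sites=2000, num_types=100, saved_per_site=1, added_per_type=23))
```

With 100 types, the assumed units give a net saving at 3000 sites (30 sites per type) but not at 2000 sites (20 sites per type).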

A size of 16 was chosen because:

A size of 8 is 1% slower and has an 81.0% `LOAD_ATTR` hit rate on `pyperformance`, with 32.5% of specialization failures due to a too-small cache.
A size of 16 is 0% faster and has an 82.2% `LOAD_ATTR` hit rate on `pyperformance`, with 8.7% of specialization failures due to a too-small cache.
A size of 32 is 0% faster and has an 82.6% `LOAD_ATTR` hit rate on `pyperformance`, with 3.3% of specialization failures due to a too-small cache.

A flexible buffer was also considered, but that was 1% slower, likely due to the additional indirection and management of the buffer.
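As a rough illustration of the fixed-array idea, here is a toy Python model. It is not the C implementation from this PR: the class name `TypeMethodCache` and the method `index_for` are hypothetical, and the model assumes that a site stores a small slot index and that a full cache simply refuses new entries.

```python
CACHE_SIZE = 16  # the size proposed in the description above


class TypeMethodCache:
    """Toy stand-in for a fixed array of cached methods stored on a type."""

    def __init__(self, size=CACHE_SIZE):
        self.entries = [None] * size  # fixed capacity, never resized
        self.used = 0

    def index_for(self, method):
        """Return the slot index for `method`, or None if the cache is full."""
        for i in range(self.used):
            if self.entries[i] is method:
                return i  # already cached: sites share the slot
        if self.used == len(self.entries):
            return None  # too-small cache: specialization would fail
        self.entries[self.used] = method
        self.used += 1
        return self.used - 1


def f(): pass
def g(): pass
def h(): pass


cache = TypeMethodCache(size=2)  # tiny size to make the failure visible
print(cache.index_for(f), cache.index_for(g), cache.index_for(f), cache.index_for(h))
```

In this model the third lookup reuses the slot claimed by the first, and the fourth finds no free slot, which corresponds to a specialization failure due to a too-small cache.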

Issue: The inline cache for `LOAD_ATTR` is too large. #93533

brandtbucher added 6 commits

March 22, 2023 18:29

Stick a small, resizable cache on all types

c87e8bd

Try a bigger starting cache size (8 instead of 1)

0e21f47

Try a fixed cache size

bde6bec

Bump the size to 16

212046c

Catch up with main

1ee7d47

Blurb add

cdc3189

brandtbucher added the performance (Performance or resource usage) label

Mar 24, 2023

brandtbucher self-assigned this

Mar 24, 2023

brandtbucher requested a review from markshannon as a code owner

March 24, 2023 20:18

bedevere-bot mentioned this pull request

Mar 24, 2023

The inline cache for `LOAD_ATTR` is too large. #93533

Open

bedevere-bot added the awaiting core review label

Mar 24, 2023

brandtbucher changed the title ~~GH-93533: Shrink the `LOAD_ATTR` caches.~~ GH-93533: Shrink the `LOAD_ATTR` caches

Mar 24, 2023

TeamSpen210 commented Mar 24, 2023

So if I'm understanding this correctly, for each class only 16 methods/attributes can be specialised, with everything after just failing? If so, 16 seems fairly small, especially for larger libraries/applications. Something like `ndarray` or `flask.Flask`, for instance, has 50+ methods. I could imagine a larger application using different sets of methods in different areas, filling up the cache. That wouldn't be the sort of thing showing up on benchmarks.

Member, Author

brandtbucher commented Mar 24, 2023

> So if I'm understanding this correctly, for each class only 16 methods/attributes can be specialised, with everything after just failing?

Correct (not including instance attributes). I definitely prefer a growable cache to this approach, except that the numbers we have are slower.
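To make the "everything after just fails" behaviour concrete, here is a small simulation. It assumes slots are handed out first come, first served, with no eviction, as confirmed above. The function name and the workload (50 distinct method names on one class, each loaded at two sites) are hypothetical and are not taken from any benchmark.

```python
def specializable_sites(site_names, cache_size):
    """Count sites whose attribute name gets, or already has, a cache slot."""
    slots = []  # names that own a slot, in the order they claimed it
    ok = 0
    for name in site_names:
        if name in slots:
            ok += 1  # slot already exists for this name
        elif len(slots) < cache_size:
            slots.append(name)  # claim a free slot
            ok += 1
        # otherwise the cache is full and this site stays unspecialized
    return ok


# Hypothetical workload: 50 method names, each used at 2 different sites.
sites = [f"method_{i}" for i in range(50)] * 2
for size in (8, 16, 32, 64):
    print(size, specializable_sites(sites, size) / len(sites))
```

Under this made-up workload, only the names that arrive before the cache fills can ever be specialized, so the fraction of specializable sites scales directly with the cache size until it covers all 50 names.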

I'm also open to using a larger number of entries, like 32 or 64, if others feel similarly.

Member, Author

brandtbucher commented Mar 24, 2023

I'm also not sure if this causes problems for interpreter isolation. Maybe putting this sort of state on static built-in types is a bad idea, whether it's resizable or not.

CC @ericsnowcurrently

Merge branch 'main' into type-cache-fixed

6934d55

Member

hugovk commented Mar 25, 2023

(Docs now passing now that #103019 is merged and this is updated with `main`. Thanks for flagging!)

Member

markshannon commented Mar 25, 2023 (edited)

From a correctness point of view, adding the cache to static classes should be fine. All the attributes of superclasses of static classes must also be static.
If they aren't, that's a bug, and this isn't the place to fix it.
No harm in adding some asserts, though.
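A Python-level approximation of such an assert might look like the sketch below. It checks something narrower than the claim above: that every class in a static type's MRO is itself static. It does not inspect individual attributes. The helper names are hypothetical; `Py_TPFLAGS_HEAPTYPE` is the CPython type flag (bit 9) set on heap-allocated types, read here through `type.__flags__`, so this applies to CPython only.

```python
Py_TPFLAGS_HEAPTYPE = 1 << 9  # CPython flag bit set on heap-allocated types


def is_static_type(tp):
    """True if `tp` is not a heap type (CPython-specific check)."""
    return not (tp.__flags__ & Py_TPFLAGS_HEAPTYPE)


def check_static_mro(tp):
    """Assert that a static type only inherits from static types."""
    if is_static_type(tp):
        for base in tp.__mro__:
            assert is_static_type(base), (
                f"{tp.__name__} inherits from heap type {base.__name__}"
            )


class UserClass:  # classes created by a `class` statement are heap types
    pass


check_static_mro(int)
check_static_mro(bool)
print(is_static_type(int), is_static_type(UserClass))
```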

However, we would like static classes to be `const` so that they can be properly shared, which would mean that the cache would need to be pre-populated with all attributes of the class and all its superclasses.
The caches would be a bit big, but not that bad: `len(object.__dict__ | int.__dict__) == 74`; much smaller if we exclude special methods.
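The count quoted above can be reproduced, and the special methods excluded, with a few lines of Python. This is a sketch: the helper names are hypothetical, and the exact numbers depend on the Python version, so no expected output is given.

```python
def merged_attributes(tp):
    """Merge the __dict__ of every class in the MRO, most-derived class last."""
    merged = {}
    for klass in reversed(tp.__mro__):
        merged.update(klass.__dict__)
    return merged


def is_special(name):
    """True for dunder names such as __add__ or __repr__."""
    return name.startswith("__") and name.endswith("__")


all_attrs = merged_attributes(int)
regular = [name for name in all_attrs if not is_special(name)]

# Total entries a pre-populated cache for `int` would need, with and
# without special methods.
print(len(all_attrs), len(regular))
```

For `int`, whose MRO is just `int` and `object`, `merged_attributes(int)` covers the same keys as `object.__dict__ | int.__dict__`.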

Member, Author

brandtbucher commented Mar 27, 2023

Closing because of the various issues outlined above.

brandtbucher closed this

Mar 27, 2023

Labels

awaiting core review, performance (Performance or resource usage)

5 participants
