NotificationsYou must be signed in to change notification settings
Fork33.3k
Star69.7k

gh-112529: Make the GC scheduling thread-safe#114880

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Jump to bottom

Merged

colesbury merged 5 commits intopython:mainfromcolesbury:gh-112529-gc-schedule

Feb 16, 2024

Merged

gh-112529: Make the GC scheduling thread-safe#114880

colesbury merged 5 commits intopython:mainfromcolesbury:gh-112529-gc-schedule

Feb 16, 2024

Conversation

Copy link

Contributor

colesbury commentedFeb 1, 2024•
edited by bedevere-appbot
Loading

The GC keeps track of the number of allocations (less deallocations) since the last GC. This change buffers the allocation count in thread-local state and uses atomic operations to modify the per-interpreter count. The thread-local buffering avoids contention on shared state.

A consequence is that the GC scheduling is not as precise, so "test_sneaky_frame_object" is skipped because it requires that the GC be run exactly after allocating a frame object.

Issue:Make the garbage collector thread-safe in--disable-gil builds #112529

pythongh-112529: Make the GC scheduling thread-safe

5f53ebd

The GC keeps track of the number of allocations (less deallocations)since the last GC. This buffers the count in thread-local state and usesatomic operations to modify the per-interpreter count. The thread-localbuffering avoids contention on shared state.A consequence is that the GC scheduling is not as precise, so"test_sneaky_frame_object" is skipped because it requires that the GC berun exactly after allocating a frame object.

colesbury requested review fromDinoV,nascheme andpablogsal

February 1, 2024 21:21

colesbury requested review fromericsnowcurrently andmarkshannon ascode owners

February 1, 2024 21:21

bedevere-appbot added the awaiting review label

Feb 1, 2024

bedevere-appbot mentioned this pull request

Feb 1, 2024

Make the garbage collector thread-safe in--disable-gil builds#112529

Closed

3 tasks

colesbury added the skip news label

Feb 1, 2024

ericsnowcurrently reviewed

Feb 2, 2024

View reviewed changes

Copy link

Member

ericsnowcurrently left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

The GC is one of the runtime components with which I am least familiar, so I mostly have questions for you. 😄

Otherwise, the PR mostly makes sense.

Objects/typeobject.cShow resolvedHide resolved

Python/gc_free_threading.cShow resolvedHide resolved

Fix warning

e832dd9

Copy link

Member

ericsnowcurrently commentedFeb 2, 2024

The change here seems okay to me, but I'd feel better if one of the GC experts reviewed this before it's merged.

CC@markshannon @pablogsal @nascheme @DinoV @nanjekyejoannah

Copy link

Member

nascheme commentedFeb 4, 2024

I've not looked at the code but the idea of the change sounds fine to me. I suspect there are some users who require that the GC threshold is precise, like thetest_sneaky_frame_object case. However, I don't think that's a reasonable thing and I think it's okay to break them. We are pretty likely to adjust how the thresholds work anyhow. Using atomic operations to count allocations/dellocations will be too expensive.

colesbury added2 commits

February 6, 2024 15:38

Merge branch 'main' intopythongh-112529-gc-schedule

483b37e

Skip test_gc.test_get_count() in builds

456b778

colesbury mentioned this pull request

Feb 9, 2024

gh-112175: Addeval_breaker toPyThreadState#115194

Merged

Merge branch 'main' intopythongh-112529-gc-schedule

f99d14e

DinoV reviewed

Feb 14, 2024

View reviewed changes

Python/gc_free_threading.c

		// We buffer the allocation count to avoid the overhead of atomic
		// operations for every allocation.
		gc->alloc_count++;
		if (gc->alloc_count >=LOCAL_ALLOC_COUNT_THRESHOLD) {

Copy link

Contributor

DinoVFeb 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I wonder if this could be tied to the configurable GC threshold and therefore the tests could continue to pass but maybe it doesn't matter enough and the extra read isn't worth it.

Copy link

ContributorAuthor

colesburyFeb 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Yeah, I considered making it a configurable runtime threshold, but decided it wasn't worth it, at least for now.

I think there's a decent chance we change how we count allocations in the future. In thenogil forks, for example, I accounted for allocations inmi_page_to_full and_mi_page_unfull, which provides some natural batching and avoids the thread-local that's done in every allocation here, but wouldn't allow for a configurable threshold. I haven't attempted that yet because I'd like some performance measurements to justify it first.

DinoV approved these changes

Feb 14, 2024

View reviewed changes

Copy link

Contributor

DinoV left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

LGTM!

bedevere-appbot added awaiting merge and removed awaiting review labels

Feb 14, 2024

colesbury added the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

Feb 14, 2024

Copy link

bedevere-bot commentedFeb 14, 2024

🤖 New build scheduled with the buildbot fleet by@colesbury for commitf99d14e 🤖

If you want to schedule another build, you need to add the🔨 test-with-buildbots label again.

bedevere-bot removed the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

Feb 14, 2024

colesbury merged commitb24c916 intopython:main

Feb 16, 2024

colesbury deleted the gh-112529-gc-schedule branch

February 16, 2024 16:22

bedevere-appbot removed the awaiting merge label

Feb 16, 2024

woodruffw pushed a commit to woodruffw-forks/cpython that referenced this pull request

Mar 4, 2024

pythongh-112529: Make the GC scheduling thread-safe (python#114880)

a2a0dd9

The GC keeps track of the number of allocations (less deallocations)since the last GC. This buffers the count in thread-local state and usesatomic operations to modify the per-interpreter count. The thread-localbuffering avoids contention on shared state.A consequence is that the GC scheduling is not as precise, so"test_sneaky_frame_object" is skipped because it requires that the GC berun exactly after allocating a frame object.

diegorusso pushed a commit to diegorusso/cpython that referenced this pull request

Apr 17, 2024

pythongh-112529: Make the GC scheduling thread-safe (python#114880)

52df553

The GC keeps track of the number of allocations (less deallocations)since the last GC. This buffers the count in thread-local state and usesatomic operations to modify the per-interpreter count. The thread-localbuffering avoids contention on shared state.A consequence is that the GC scheduling is not as precise, so"test_sneaky_frame_object" is skipped because it requires that the GC berun exactly after allocating a frame object.

LukasWoodtli pushed a commit to LukasWoodtli/cpython that referenced this pull request

Jan 22, 2025

pythongh-112529: Make the GC scheduling thread-safe (python#114880)

df74dc2

The GC keeps track of the number of allocations (less deallocations)since the last GC. This buffers the count in thread-local state and usesatomic operations to modify the per-interpreter count. The thread-localbuffering avoids contention on shared state.A consequence is that the GC scheduling is not as precise, so"test_sneaky_frame_object" is skipped because it requires that the GC berun exactly after allocating a frame object.