NotificationsYou must be signed in to change notification settings
Fork33.3k
Star69.7k

gh-128807: Add marking phase for free-threaded cyclic GC#128808

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Jump to bottom

Merged

nascheme merged 20 commits intopython:mainfromnascheme:nogil-gc-mark-alive

Jan 15, 2025

Merged

gh-128807: Add marking phase for free-threaded cyclic GC#128808

nascheme merged 20 commits intopython:mainfromnascheme:nogil-gc-mark-alive

Jan 15, 2025

Conversation

Copy link

Member

nascheme commentedJan 14, 2025•
edited
Loading

This is conceptually similar to the phase that was added to the non-free-threaded GC. Start with a set of known root objects, like sysdict and mark all objects reachable from those (revealed by the tp_traverse method) as "alive". We know anything marked alive cannot be garbage and can be excluded from the regular cyclic GC process. For most programs, this saves a moderate amount of computation since the marking pass is relatively cheaper per object.

Ifgc.freeze() is used, it's unlikely that this marking phase will be a win since it's expected that the majority of objects will be frozen. Disable the marking phase if freeze is used.

Seegh-126491 for the non-free-threaded version of this technique.

pyperformanceresults vs merge base. I suspect the slowdown on some benchmarks is not real. For example, regex_v8 should not be slower.

Here are the pyperformance results for a bare-metal AMD Ryzen machine. It does not show a slowdown on regex_v8, for example.

To better show the expected improvement, I ran a "sphinx build" benchmark, like ingh-124567. Results are:

	old	new
GC collections	38	38
long-lived objects	586,986	585,730
total run time	3.24 s	3.03 s
mark phase time	0.00 s	0.16 s
total gc time	0.72 s	0.57 s

Issue:Add marking phase to free-threaded cyclic GC #128807

wip: gc mark alive for nogil

a111bff

Copy link

ghost commentedJan 14, 2025•
edited by ghost
Loading

All commit authors signed the Contributor License Agreement.

bedevere-appbot mentioned this pull request

Jan 14, 2025

Add marking phase to free-threaded cyclic GC#128807

Closed

nascheme added topic-free-threading performancePerformance or resource usage labels

Jan 14, 2025

nascheme added14 commits

January 13, 2025 17:09

wip: mark more roots

200b69b

wip: log gc timing to temp file

713689a

wip: mark stack refs as alive

2635fe0

wip: include # of collected in debug log

72128e2

wip: move freeze_used to gc state

942f77d

wip: fix OOM error handling, add ifdef toggles

a3089e5

wip: fix reversion on mark_heap_visitor()

4f99de6

If the object is already marked as reachable, we shouldn't traverse itagain.

wip: revert changes to gc_should_collect()

4f6e539

These are not strictly needed, simplify PR.

wip: removing timing code

b99f2df

wip: code cleanup, small bug fix

5f6ab4c

More code cleanup (better names, comments, simplify error handling).Fix bug in that "alive" bit must be checked in mark_alive_stack_push()to avoid visiting already visited objects.

wip: untrack tuples in "mark alive" pass

bd46b5f

Make sure we still do this optimization.  There is also a unit test thatchecks for this.

wip: enable stacks and extra roots

d25cb4a

wip: add helper gc_maybe_untrack()

347db45

Reduces duplicate code.

Add NEWS entry.

680f80f

nascheme force-pushed thenogil-gc-mark-alive branch fromccc5a11 to680f80fCompare

January 14, 2025 01:10

nascheme added2 commits

January 13, 2025 17:22

wip: spelling fixes, minor code cleanup

0c173c9

Merge branch 'origin/main' into nogil-gc-mark-alive

6654424

nascheme marked this pull request as ready for review

January 14, 2025 18:12

bedevere-appbot added the awaiting core review label

Jan 14, 2025

colesbury reviewed

Jan 14, 2025

View reviewed changes

Copy link

Contributor

colesbury left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Looks great! A few comments below

Python/gc_free_threading.cShow resolvedHide resolved

nascheme added3 commits

January 14, 2025 14:45

wip: use -1 for error convention for functions

79ea47e

wip: improve error handling for OOM

0090365

Use gc_abort_mark_alive() helper in case of OOM.  In addition tofreeing the stack, we need to ensure that no object has the alive bitset on it.  This also adding missing error handling in the case thatpropagate_alive_bits() fails.

Add comment about gc.freeze() disabling marking.

6100691