
GH-126491: Lower heap size limit with faster marking#127519


Merged

markshannon merged 24 commits into python:main from faster-cpython:faster-marking

Dec 6, 2024

Conversation

Member

markshannon commented Dec 2, 2024 (edited)

With marking added to the cyclic GC (#127110) we spend a lot of the time in the GC forming transitive closures, both for marking and for the increments of the incremental GC.

Unfortunately, the current algorithm has a couple of mistakes in it: one harmful, one beneficial.

The beneficial one is counting the initial mark twice. This helps because it reduces the cost of GC on heaps with little or no garbage.
The harmful one is allowing the amount of work done to grow in proportion to the heap size.

These more or less cancel out.
This PR deliberately counts marking as twice as effective as normal collection, but limits the amount of work done.
To do so, we need to increase the typical amount of work done a bit.
This has the advantage of limiting the amount of garbage to (roughly) 1/3 of the heap.
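
The 1/3 figure can be reconstructed from the derivation that this PR adds to `InternalDocs/garbage_collector.md` (quoted in the review comments further down). The following is a sketch of that argument, not the final wording of the document. It assumes a steady-state heap, with `L` live objects, `G0` and `G1` garbage objects at the start and end of a full scavenge, `N` new objects, and `T` objects visited in total.

```latex
\begin{aligned}
T &= L + G_0 + G_1, \qquad N \ge G_1, \qquad T = 4N \\
  &\Rightarrow\; L + G_0 + G_1 \ge 4 G_1
   \;\Rightarrow\; L + G_0 \ge 3 G_1 \\
  &\Rightarrow\; L \ge 2G \qquad (\text{steady state: } G_0 = G_1 = G) \\
  &\Rightarrow\; \frac{G}{L + G} \le \frac{G}{3G} = \frac{1}{3}
\end{aligned}
```

The quoted text writes the inequalities as strict; the bound on the garbage fraction is the same either way.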

This PR does two things:

Speeds up the marking and increment creation phases
Visits objects a bit faster to maintain a lower heap size.
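
For readers unfamiliar with the term, "forming a transitive closure" here means marking everything reachable from a starting object. The block below is a minimal sketch of that idea on a toy graph, not the code in `Python/gc.c`: the `Node` type, `mark_reachable`, and `demo_visited` are hypothetical names, and the real collector walks objects through `tp_traverse` rather than a fixed array of references.

```c
#include <stddef.h>

/* Hypothetical toy object: up to two outgoing references plus an
 * intrusive worklist link. This is not CPython's object layout. */
typedef struct Node {
    struct Node *refs[2];  /* outgoing references, NULL if unused */
    struct Node *next;     /* link used only while on the worklist */
    int marked;            /* nonzero once known to be reachable */
} Node;

/* Mark every object reachable from root and return how many objects
 * were visited, i.e. the "work done" by this pass. */
size_t mark_reachable(Node *root)
{
    size_t visited = 0;
    Node *worklist = NULL;
    if (root != NULL && !root->marked) {
        root->marked = 1;
        root->next = NULL;
        worklist = root;
    }
    while (worklist != NULL) {
        Node *n = worklist;            /* pop the head of the worklist */
        worklist = n->next;
        visited++;
        for (size_t i = 0; i < 2; i++) {
            Node *child = n->refs[i];
            if (child != NULL && !child->marked) {
                child->marked = 1;     /* mark on push: queued at most once */
                child->next = worklist;
                worklist = child;
            }
        }
    }
    return visited;
}

/* Demo: a <-> b is a cycle reachable from the root a; c -> c is an
 * unreachable cycle. Returns the visit count, or 0 if c got marked. */
size_t demo_visited(void)
{
    Node a = {{NULL, NULL}, NULL, 0};
    Node b = {{NULL, NULL}, NULL, 0};
    Node c = {{NULL, NULL}, NULL, 0};
    a.refs[0] = &b;
    b.refs[0] = &a;
    c.refs[0] = &c;
    size_t visited = mark_reachable(&a);
    return c.marked ? 0 : visited;
}
```

In the demo, only `a` and `b` are visited, so the pass costs two units of work regardless of how much unreachable material (`c`) exists. Counting visits this way is what lets a collector cap the work done per pass.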

Issue: Mark all objects reachable from roots as live before doing main cyclic GC pass #126491

markshannon added 7 commits

November 18, 2024 14:32

Faster marking of reachable objects

3038a78

Handle more classes in fast marking

c024484

Add support for asyn generators on fast path. Simplify counting

e8497ae

Check stackref before converting to PyObject *

4c1a6bc

Rename stuff

6efb4c0

Remove expand_region_transitively_reachable and use move_all_transitively_reachable.

b1c7ab0

Merge branch 'main' into faster-marking

07f228b

bedevere-appbot mentioned this pull request

Dec 2, 2024

Mark all objects reachable from roots as live before doing main cyclic GC pass #126491

Open

Fix compiler warnings and linkage

51ff78e

markshannon added the skip news label

Dec 2, 2024

markshannon added 10 commits

December 2, 2024 16:12

Fix another linkage issue

df907b5

Try 'extern'

9ca64f5

Go back to PyAPI_FUNC and move functions together

bda13f4

Use _Py_FALLTHROUGH

d9d63c8

Add necessary #ifndef Py_GIL_DISABLED

57b8820

Go back to using tp_traverse, but make traversal more efficient

a607059

Tidy up

1545508

A bit more tidying up

a1a38c8

Move all work to do calculations to one place

68fc90b

Assume that increments are 50% garbage for work done calculation

8893cf5

Member · Author

markshannon commented Dec 3, 2024

!buildbot iOS

bedevere-bot commented Dec 3, 2024

🤖 New build scheduled with the buildbot fleet by @markshannon for commit 8893cf5 🤖

The command will test the builders whose names match following regular expression: iOS

The builders matched are:

iOS ARM64 Simulator PR

Member · Author

markshannon commented Dec 3, 2024

!buildbot Android

bedevere-bot commented Dec 3, 2024

🤖 New build scheduled with the buildbot fleet by @markshannon for commit 8893cf5 🤖

The command will test the builders whose names match following regular expression: Android

The builders matched are:

aarch64 Android PR
AMD64 Android PR

Elaborate comment

ba20c7c

markshannon changed the title ~~GH-126491: Faster marking~~ GH-126491: Lower heap size limit with faster marking

Dec 4, 2024

mhsmith mentioned this pull request

Dec 4, 2024

Two selective !buildbot commands in quick succession causes ALL buildbots to run python/buildmaster-config#566

Open

More tweaking of thresholds

8262bf0

Member · Author

markshannon commented Dec 4, 2024

!buildbot Android|iOS

bedevere-bot commented Dec 4, 2024

🤖 New build scheduled with the buildbot fleet by @markshannon for commit 8262bf0 🤖

The command will test the builders whose names match following regular expression: Android|iOS

The builders matched are:

iOS ARM64 Simulator PR
aarch64 Android PR
AMD64 Android PR

Member · Author

markshannon commented Dec 4, 2024

Performance is a wash overall, but I think that is an artifact of our benchmarks. I would expect this to perform better on larger heaps and consume less memory, although the benchmarks show no overall change in memory consumption.

Note that the "create gc cycles" benchmark shows a 10% speedup and "gc traversal" an 8% speedup, but there is an equivalent slowdown on the "xml etree" benchmarks.

markshannon marked this pull request as ready for review

December 4, 2024 14:03

markshannon requested a review from methane as a code owner

December 4, 2024 14:03

bedevere-appbot added the awaiting core review label

Dec 4, 2024

markshannon added 2 commits

December 4, 2024 16:54

Do some algebra

3c2116e

Revert to 2M+I from 3M+I

72d0284

iritkatriel reviewed

Dec 5, 2024

InternalDocs/garbage_collector.md Outdated (resolved)

Address review comments

0f182e2

iritkatriel reviewed

Dec 5, 2024

InternalDocs/garbage_collector.md Outdated

	To work out how much work we need to do, consider a heap with `L` live objects
	and `G0` garbage objects at the start of a full scavenge and `G1` garbage objects
	at the end of the scavenge. We don't want amount of garbage to grow, `G1 ≤ G0`, and

Member

iritkatriel Dec 5, 2024

Suggested change

-	at the end of the scavenge. We don't want amount of garbage to grow, `G1 ≤ G0`, and
+	at the end of the scavenge. We don't want the amount of garbage to grow, `G1 ≤ G0`, and

InternalDocs/garbage_collector.md Outdated

	The number of new objects created `N` must be at least the new garbage created, `N ≥ G1`,
	assuming that the number of live objects remains roughly constant.
	If we set `T == 4N` we get `T > 4G1` and `T = L + G0 + G1` => `L + G0 > 3G1`
	For a steady state heap `G0 == G1` we get `L > 2G` and the desired garbage ratio.

Member

iritkatriel Dec 5, 2024

Suggested change

-	For a steady state heap `G0 == G1` we get `L > 2G` and the desired garbage ratio.
+	For a steady state heap `G0 == G1` we get `L > 2*G0` and the desired garbage ratio.

InternalDocs/garbage_collector.md Outdated

	The number of new objects created `N` must be at least the new garbage created, `N ≥ G1`,
	assuming that the number of live objects remains roughly constant.
	If we set `T == 4N` we get `T > 4G1` and `T = L + G0 + G1` => `L + G0 > 3G1`
	For a steady state heap `G0 == G1` we get `L > 2G` and the desired garbage ratio.

Member

iritkatriel Dec 5, 2024

Suggested change

-	For a steady state heap `G0 == G1` we get `L > 2G` and the desired garbage ratio.
+	For a steady state heap (`G0 == G1`) we get `L > 2G` and the desired garbage ratio.

InternalDocs/garbage_collector.md Outdated

	If we choose the amount of work done such that `2*M + I == 6N` then we can do
	less work in most cases, but are still guaranteed to keep up.
	Since `I ≥ G0 + G1` (not strictly true, but close enough)

Member

iritkatriel Dec 5, 2024

Suggested change

-	Since `I ≥ G0 + G1` (not strictly true, but close enough)
+	Since `I ≈ G0 + G1` (not strictly true, but close enough)

Member · Author

markshannon Dec 5, 2024 (edited)

The increments (I) can include some of the live heap, depending on how much is kept alive by C extensions.
So ≥ is more correct, although ≳ is even more correct.

InternalDocs/garbage_collector.md Outdated

	If we choose the amount of work done such that `2*M + I == 6N` then we can do
	less work in most cases, but are still guaranteed to keep up.
	Since `I ≥ G0 + G1` (not strictly true, but close enough)
	`T == M + I == (6N + I)/2` and `(6N + I)/2 ≥ 4G`, so we can keep up.

Member

iritkatriel Dec 5, 2024

Suggested change

-	`T == M + I == (6N + I)/2` and `(6N + I)/2 ≥ 4G`, so we can keep up.
+	`T == M + I == (6N + I)/2` and `(6N + I)/2 ≳ 4G`, so we can keep up.

InternalDocs/garbage_collector.md Outdated

	`T == M + I == (6N + I)/2` and `(6N + I)/2 ≥ 4G`, so we can keep up.

	The reason that this improves performance is that `M` is usually much larger
	than `I` Suppose `M == 10I`, then `T ≅ 3N`.

Member

iritkatriel Dec 5, 2024

Suggested change

-	than `I` Suppose `M == 10I`, then `T ≅ 3N`.
+	than `I`. If `M == 10I`, then `T ≅ 3N`.
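
The arithmetic behind `T ≅ 3N` can be checked with a few lines of C. This is a sketch of the formula in the quoted text only, not of `assess_work_to_do` in `Python/gc.c`: the function name `total_work` and its `m_per_i` parameter (the assumed ratio `M / I`) are hypothetical, and floating point is used purely for illustration.

```c
#include <assert.h>

/* Total objects visited, T = M + I, under the budget 2*M + I == 6*N.
 * n_new is N; m_per_i is the assumed ratio M / I (hypothetical parameter). */
double total_work(double n_new, double m_per_i)
{
    assert(n_new >= 0.0 && m_per_i >= 0.0);
    /* Substituting M = r*I into 2*M + I == 6*N gives I = 6*N / (2*r + 1). */
    double increment = 6.0 * n_new / (2.0 * m_per_i + 1.0);
    double marking = m_per_i * increment;
    return marking + increment;
}
```

With `N = 21` and `M == 10I`, the increment is 6 objects and marking is 60, so `T = 66`, which is about `3.14N` and agrees with the `T ≅ 3N` estimate. With no marking at all (`M == 0`) the same budget gives `T = 6N`, which is why crediting marking double reduces the total work when `M` dominates.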

Python/gc.c

	assess_work_to_do(GCState *gcstate)
	{
-	    /* The amount of work we want to do depends on three things.
+	    /* The amount of work we want to do depends on two things.

Member

iritkatriel Dec 5, 2024

Is it worth linking to the doc from here?

Python/gc.c Outdated

	}
	intptr_t new_objects = gcstate->young.count;
-	intptr_t max_heap_fraction = new_objects*3/2;
+	intptr_t max_heap_fraction = new_objects*5;

Member

iritkatriel Dec 5, 2024

Why is this called fraction?

Member · Author

markshannon Dec 5, 2024

Not for any good reason. I'll rename it.

Address review comments and clarify code a bit

d3c21bb

iritkatriel approved these changes

Dec 5, 2024

bedevere-appbot added awaiting merge and removed awaiting core review labels

Dec 5, 2024

markshannon merged commit 023b7d2 into python:main

Dec 6, 2024

43 checks passed

bedevere-appbot removed the awaiting merge label

Dec 6, 2024

Member

encukou commented Dec 9, 2024

This commit introduced reference leaks in test_import.

encukou added a commit to encukou/cpython that referenced this pull request

Dec 9, 2024

Revert "pythonGH-126491: Lower heap size limit with faster marking (pythonGH-127519)"

16327ad

This reverts commit 023b7d2, which introduced a refleak.

Member

encukou commented Dec 9, 2024

I filed a revert PR: #127770

encukou added a commit that referenced this pull request

Dec 10, 2024

gh-126491: Revert "GH-126491: Lower heap size limit with faster marking (GH-127519)" (GH-127770)

690fe07

Revert "GH-126491: Lower heap size limit with faster marking (GH-127519)"
This reverts commit 023b7d2, which introduced a refleak.

srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this pull request

Jan 8, 2025

pythonGH-126491: Lower heap size limit with faster marking (pythonGH-127519)

52244d1

* Faster marking of reachable objects
* Changes calculation of work to do and work done.
* Merges transitive closure calculations

srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this pull request

Jan 8, 2025

pythongh-126491: Revert "pythonGH-126491: Lower heap size limit with faster marking (pythonGH-127519)" (pythonGH-127770)

13d7d0b

Revert "pythonGH-126491: Lower heap size limit with faster marking (pythonGH-127519)"
This reverts commit 023b7d2, which introduced a refleak.

markshannon mentioned this pull request

Jan 8, 2025

test_import leaks references #127738

Closed

markshannon deleted the faster-marking branch

January 10, 2025 16:23

This was referenced Jul 25, 2025

Inconsistency handling of immortal objects in gc #137110

Open

gh-137110: Untrack immortal objects from expand_region_transitivity_reachable #137111

Open

This was referenced Nov 24, 2025