NotificationsYou must be signed in to change notification settings
Fork33.3k
Star69.7k

gh-112175: Add`eval_breaker` to`PyThreadState`#115194

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Jump to bottom

Merged

colesbury merged 13 commits intopython:mainfromswtaarrs:cpython-eval-breaker

Feb 20, 2024

Merged

gh-112175: Add`eval_breaker` to`PyThreadState`#115194

colesbury merged 13 commits intopython:mainfromswtaarrs:cpython-eval-breaker

Feb 20, 2024

Conversation

Copy link

Member

swtaarrs commentedFeb 9, 2024•
edited
Loading

This change adds aneval_breaker field toPyThreadState, renaming the existingPyInterpreterState.eval_breaker toinstrumentation_version (explained further down). The primary motivation is for performance in free-threaded builds: with thread-local eval breakers, we can stop a specific thread (e.g., for an async exception) without interrupting other threads.

Pending Calls

We still want to signal all threads in an interpreter for pending calls that aren't limited to the main thread (possible since757b402ea1c2). These callbacks can be added by any thread, with or without the GIL held. If the targeted interpreter's GIL is locked, we signal the holding thread. When any thread is resumed, its_PY_CALLS_TO_DO bit is derived from the source of truth for pending calls (one of two_pending_calls structs). This handles situations where no thread held the GIL when the call was first added, or if the active thread did not handle the call before releasing the GIL.

In a free-threaded build, all threads in the targeted interpreter are signaled. This is slightly less efficient, both because we have to iterate over all threads to set the bit, and because it's more likely that multiple threads could call intomake_pending_calls() before the bit is cleared. This shouldn't affect correctness, though, sincemake_pending_calls() and the_pending_calls structs have appropriate synchronization in place to ensure each callback happens once.

Instrumentation Version

The source of truth for the global instrumentation version is still inPyInterpreterState.instrumentation_version (which used to beeval_breaker), in both normal and free-threaded builds. Now that this variable contains only the instrumentation version (and no more eval breaker bits), some of the code to read and write it is simplified.

Threads usually read the version from their localeval_breaker, where it continues to be colocated with the eval breaker bits, and the method for keeping it up to date depends on build type. Both builds first update the version ininstrumentation_version, and then:

Normal builds update the version in the current thread'seval_breaker. When a new thread takes the GIL, it copies the current version frominstrumentation_version.
Free-threaded builds again iterate over all threads in the current interpreter, updating the version on each one. Instrumentation (and the specializing interpreter more generally) will need more work to be compatible with free-threaded builds, so these changes are mostly intended to maintain the status quo in normal builds for now.

Other notable changes

The_PY_*_BIT macros now expand to the actual bit being set, rather than the bit's index. I think this is simpler overall.
_Py_set_eval_breaker_bit(interp, bit, set) has been split into_Py_set_eval_breaker_bit(interp, bit) and_Py_unset_eval_breaker_bit(interp, bit). This lets us use a single atomicor/and, rather than a loop around an atomic compare/exchange like the old code.
Issue:Move theeval_breaker toPyThreadState #112175

pythongh-112175: Addeval_breakertoPyThreadState

bc613e4

This change adds an `eval_breaker` field to `PyThreadState`, renaming the existing `eval_breaker` to`interp_eval_breaker` (its uses are explained further down). The primary motivation is forperformance in free-threaded builds: with thread-local eval breakers, we can stop a specific thread(e.g., for an async exception) without interrupting other threads.There are still two situations where we want the first available thread to handle a request:- Running a garbage collection: In normal builds, we set `_PY_GC_SCHEDULED_BIT` on the current  thread. In case a thread suspends before handling the collection, the bit is copied to and from  `interp_eval_breaker` on thread suspend and resume, respectively. In a free-threaded build, we  simply iterate over all threads and set the bit. The first thread to check its eval breaker runs  the collection, unsetting the bit on all threads.  - Free-threaded builds could have multiple threads attempt a GC from one trigger if we get very    unlucky with thread scheduling. I didn't put any protections against this in place because a)    the consequences of it happening are just that one or more threads will check the GC thresholds    right after a collection finishes, which won't affect correctness and b) it's incredibly,    vanishingly unlikely.- Pending calls not limited to the main thread (possible sincepython/cpython@757b402ea1c2). This is  a little tricker, since the callback can be added from any thread, with or without the GIL  held. If the targeted interpreter's GIL is locked, we signal the holding thread. When a thread is  resumed, its `_PY_CALLS_TO_DO` bit is derived from the source of truth for pending calls (one of  two `_pending_calls` structs). This handles situations where no thread held the GIL when the call  was first added, or if the active thread did not handle the call before releasing the GIL. In a  free-threaded build, all threads all signaled, similar to scheduling a GC.The source of truth for the global instrumentation version is still in `interp_eval_breaker`, inboth normal and free-threaded builds. Threads usually read the version from their local`eval_breaker`, where it continues to be colocated with the eval breaker bits, and the method forkeeping it up to date depends on build type. All builds first update the version in`interp_eval_breaker`, and then:- Normal builds update the version in the current thread's `eval_breaker`. When a thread takes the  GIL, it copies the current version from `interp_eval_breaker` as part of the same operation that  copies `_PY_GC_SCHEDULED_BIT`.- Free-threaded builds again iterate over all threads in the current interpreter, updating the  version on each one.Instrumentation (and the specializing interpreter more generally) will need more work to becompatible with free-threaded builds, so these changes are just intended to maintain the status quoin normal builds for now.Other notable changes are:- The `_PY_*_BIT` macros now expand to the actual bit being set, rather than the bit's index. I  think this is simpler overall. I also moved their definitions from `pycore_ceval.h` to  `pycore_pystate.h`, since their main usage is on `PyThreadState`s now.- Most manipulations of `eval_breaker` are done with a new pair of functions:  `_PyThreadState_Signal()` and `_PyThreadState_Unsignal()`. Having two separate functions to  set/unset a bit, rather than one function that takes the bit value to use, lets us use a single  atomic `or`/`and`, rather than a loop around an atomic compare/exchange like the old  `_Py_set_eval_breaker_bit` function.Existing tests provide pretty good coverage for most of this functionality. The one new test I addedis to make sure a GC still happens if a thread schedules it then drops the GIL before the GC runs. Idon't love how complicated this test ended up so I'm open to other ideas for how to test this (orother things to test in general).

swtaarrs requested review fromericsnowcurrently,gvanrossum andmarkshannon ascode owners

February 9, 2024 02:00

bedevere-appbot added the awaiting review label

Feb 9, 2024

bedevere-appbot mentioned this pull request

Feb 9, 2024

Move theeval_breaker toPyThreadState#112175

Closed

colesbury self-requested a review

February 9, 2024 16:59

colesbury reviewed

Feb 9, 2024

View reviewed changes

Copy link

Contributor

colesbury left a comment•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Thanks Brett! This looks good overall, but the free-threaded GitHub CI is failing and there are a few warnings that need to be addressed. I've left comments inline as well.

_PY_GC_SCHEDULED_BIT

For the free-threaded build, I think we only need to set the GC scheduled bit on the current thread. It doesn't matter much if we get de-scheduled -- other threads will see that a GC needs to be scheduled soon enough. Regarding "multiple threads attempt a GC": that can be a performance issue once the GIL is disabled, but will be addressed ingh-114880.

I think the same logic applies to the default build, but I'd like Mark's opinion on it. In other words, I don't think we need to copy the_PY_GC_SCHEDULED_BIT back to the interpreter eval breaker. In the rare case where we get swapped out between setting_PY_GC_SCHEDULED_BIT and running the GC, the newly scheduled thread will figure out that it needs to run the GC after the next GC allocation.

Instrumentation

In the free-threaded build, I think we'll also need to copy the instrumentation bits when we create a new thread (while holdingHEAD_LOCK(runtime)).

Python/instrumentation.c OutdatedShow resolvedHide resolved

Python/ceval_gil.c OutdatedShow resolvedHide resolved

Include/internal/pycore_ceval_state.h OutdatedShow resolvedHide resolved

Python/ceval_gil.c OutdatedShow resolvedHide resolved

Python/ceval_gil.cShow resolvedHide resolved

Python/gc_free_threading.c OutdatedShow resolvedHide resolved

Python/instrumentation.c OutdatedShow resolvedHide resolved

Python/gc.c OutdatedShow resolvedHide resolved

Python/ceval_gil.c OutdatedShow resolvedHide resolved

📜🤖 Added by blurb_it.

76fed4e

Copy link

MemberAuthor

swtaarrs commentedFeb 9, 2024

It doesn't matter much if we get de-scheduled -- other threads will see that a GC needs to be scheduled soon enough.

Thanks, that makes sense. I forgot that every subsequent allocation will keep checking the thresholds even once the GC is scheduled.

I don't think we need to copy the _PY_GC_SCHEDULED_BIT back to the interpreter eval breaker

If Mark agrees, then the only thing left ininterp_eval_breaker will be the instrumentation version, and it might be worth renaming it and/or considering larger changes to how it works.

In the free-threaded build, I think we'll also need to copy the instrumentation bits when we create a new thread (while holding HEAD_LOCK(runtime)).

This line should do that already in all build types, although I realized that I should change it to readinterp_eval_breaker atomically.init_threadstate() is only called bynew_threadstate(), and it's called with the runtime lock held.

swtaarrs added5 commits

February 9, 2024 13:32

Remove unnecessary test

321c153

Review comments from Sam

f78c505

Add !IsSignalled check to free-threaded _Py_ScheduleGC to match norma…

58f679c

…l build

Move breaker bits/functions back to pycore_ceval.h, restore old namin…

cf2a8a4

…g convention

Merge remote-tracking branch 'upstream/main' into cpython-eval-breaker.

470727e

Minor conflicts from the new _PY_EVAL_EXPLICIT_MERGE_BIT flag.

Copy link

MemberAuthor

swtaarrs commentedFeb 9, 2024

I think I've addressed all of Sam's comments except for removing theeval_breaker <->interp_eval_breaker GC bit transfer for normal builds. I'll wait to hear from@markshannon before doing that.

One of the free-threading test failures was the new gc scheduling test, which I deleted in anticipation of removing that functionality. I couldn't repro any of the other failures on either my Linux or MacOS development machines; if the same tests fail again in CI I'll dig further.

colesbury reviewed

Feb 12, 2024

View reviewed changes

Include/internal/pycore_ceval.h OutdatedShow resolvedHide resolved

Copy link

Member

markshannon commentedFeb 12, 2024

I'll wait to hear from@markshannon before doing that.

What do you want to know?

Copy link

Member

markshannon commentedFeb 12, 2024

If Mark agrees, then the only thing left in interp_eval_breaker will be the instrumentation version, and it might be worth renaming it and/or considering larger changes to how it works.

Feel free to rename it. How did you want to change in how it works?

Copy link

MemberAuthor

swtaarrs commentedFeb 12, 2024•
edited
Loading

I'll wait to hear from@markshannon before doing that.
What do you want to know?

If you have thoughts on Sam's suggestions around_PY_GC_SCHEDULED_BIT inthis comment. Basically: do we need to worry about a thread dropping the GIL after scheduling a GC (but before running the GC), or can we just assume that the next thread that takes the GIL will schedule a GC for itself soon anyway?

If Mark agrees, then the only thing left in interp_eval_breaker will be the instrumentation version, and it might be worth renaming it and/or considering larger changes to how it works.
Feel free to rename it. How did you want to change in how it works?

I didn't have any specific changes in mind, other than making sure we take any opportunities to simplify things once it contains only the version number. We should still keep the version number left-shifted out of the way of the eval breaker bits, since the version is combined with the bits in thread-local eval breakers. Other than that I can't think of anything.

Back to signed 1 in _PY_EVAL_EVENTS_MASK

f129fd6

Copy link

Member

markshannon commentedFeb 13, 2024

I think the same logic applies to the default build, but I'd like Mark's opinion on it. In other words, I don't think we need to copy the _PY_GC_SCHEDULED_BIT back to the interpreter eval breaker. In the rare case where we get swapped out between setting _PY_GC_SCHEDULED_BIT and running the GC, the newly scheduled thread will figure out that it needs to run the GC after the next GC allocation.

Correct me if I'm wrong, but we copy the bits from thread state to interpreter state when we are switching threads, and that only happens in the eval breaker handler, which also runs the GC.
So wouldn't_PY_GC_SCHEDULED_BIT always be zero when swapping threads as we would have just run the GC and nothing could have triggered another one?

Copy link

Contributor

colesbury commentedFeb 13, 2024

We also can switch threads any time the GIL is released, such as by aPyEval_SaveThread() call before a blocking operation.

swtaarrs added2 commits

February 13, 2024 14:48

Don't copy _PY_GC_SCHEDULED_BIT to/from interp_eval_breaker

4260410

This means that the instrumentation version was the only thing left in`interp_eval_breaker`, so I renamed it and performed some relatedsimplification.

Merge remote-tracking branch 'upstream/main' into cpython-eval-breaker

0db460e

colesbury reviewed

Feb 14, 2024

View reviewed changes

Copy link

Contributor

colesbury left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Looks pretty good, a few comments below

Python/ceval.c

		#include"pycore_abstract.h"// _PyIndex_Check()
		#include"pycore_call.h"// _PyObject_CallNoArgs()
		#include"pycore_ceval.h"// _PyEval_SignalAsyncExc()
		#include"pycore_ceval.h"

Copy link

Contributor

colesburyFeb 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Is this change intentional? Otherwise, should probably revert it.

Copy link

MemberAuthor

swtaarrsFeb 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

It was intentional: I removed the_PyEval_SignalAsyncExc() function and replaced the one call to it (which wasn't even in this file) with_Py_set_eval_breaker_bit(...). I tried removing the#include and saw that a number of other functions frompycore_ceval.h are still used.

Python/ceval_gil.c OutdatedShow resolvedHide resolved

Python/instrumentation.c OutdatedShow resolvedHide resolved

swtaarrs added2 commits

February 14, 2024 18:17

Remove unnecessary NULL check, add uint32_t cast

0e1055f

Merge remote-tracking branch 'upstream/main' into cpython-eval-breaker

2139ed4

colesbury reviewed

Feb 16, 2024

View reviewed changes

Copy link

Contributor

colesbury left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

This looks good to me other than the merge conflicts. Those were probably from conflicts with my PRs -- sorry about that.

Did you want to makeinstrumentation_version auint32_t now that it doesn't include the eval breaker? Either way seems fine to me.

Include/internal/pycore_ceval_state.hShow resolvedHide resolved

Merge remote-tracking branch 'upstream/main' into cpython-eval-breaker

90450ce

Copy link

MemberAuthor

swtaarrs commentedFeb 20, 2024

I resolved the merge conflicts - they were pretty straightforward. This should be good to go now (I replied inline about not making theuint32_t change in this PR).

colesbury approved these changes

Feb 20, 2024

View reviewed changes

Copy link

Contributor

colesbury left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Thanks Brett!

bedevere-appbot added awaiting merge and removed awaiting review labels

Feb 20, 2024

colesbury merged commit0749244 intopython:main

Feb 20, 2024

bedevere-appbot removed the awaiting merge label

Feb 20, 2024

woodruffw pushed a commit to woodruffw-forks/cpython that referenced this pull request

Mar 4, 2024

pythongh-112175: Addeval_breakertoPyThreadState(python#115194)

00654d5

This change adds an `eval_breaker` field to `PyThreadState`. The primarymotivation is for performance in free-threaded builds: with thread-local evalbreakers, we can stop a specific thread (e.g., for an async exception) withoutinterrupting other threads.The source of truth for the global instrumentation version is stored in the`instrumentation_version` field in PyInterpreterState. Threads usually read theversion from their local `eval_breaker`, where it continues to be colocatedwith the eval breaker bits.

diegorusso pushed a commit to diegorusso/cpython that referenced this pull request

Apr 17, 2024

pythongh-112175: Addeval_breakertoPyThreadState(python#115194)

6f23410

This change adds an `eval_breaker` field to `PyThreadState`. The primarymotivation is for performance in free-threaded builds: with thread-local evalbreakers, we can stop a specific thread (e.g., for an async exception) withoutinterrupting other threads.The source of truth for the global instrumentation version is stored in the`instrumentation_version` field in PyInterpreterState. Threads usually read theversion from their local `eval_breaker`, where it continues to be colocatedwith the eval breaker bits.

LukasWoodtli pushed a commit to LukasWoodtli/cpython that referenced this pull request

Jan 22, 2025

pythongh-112175: Addeval_breakertoPyThreadState(python#115194)

b16690d

This change adds an `eval_breaker` field to `PyThreadState`. The primarymotivation is for performance in free-threaded builds: with thread-local evalbreakers, we can stop a specific thread (e.g., for an async exception) withoutinterrupting other threads.The source of truth for the global instrumentation version is stored in the`instrumentation_version` field in PyInterpreterState. Threads usually read theversion from their local `eval_breaker`, where it continues to be colocatedwith the eval breaker bits.

yilei mentioned this pull request

Oct 5, 2025

Python 3.13: PyThreadState_SetAsyncExc might deliver the exception outside of the thread#139622

Open

Labels

None yet

Movatterモバイル変換

Uh oh!

gh-112175: Addeval_breaker toPyThreadState#115194

gh-112175: Addeval_breaker toPyThreadState#115194

Uh oh!

Conversation

swtaarrs commentedFeb 9, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Pending Calls

Instrumentation Version

Other notable changes

Uh oh!

colesbury left a comment• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

_PY_GC_SCHEDULED_BIT

Instrumentation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

swtaarrs commentedFeb 9, 2024

Uh oh!

swtaarrs commentedFeb 9, 2024

Uh oh!

Uh oh!

markshannon commentedFeb 12, 2024

Uh oh!

markshannon commentedFeb 12, 2024

Uh oh!

swtaarrs commentedFeb 12, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

markshannon commentedFeb 13, 2024

Uh oh!

colesbury commentedFeb 13, 2024

Uh oh!

colesbury left a comment

Choose a reason for hiding this comment

Uh oh!

colesburyFeb 14, 2024

Choose a reason for hiding this comment

Uh oh!

swtaarrsFeb 15, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

colesbury left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

swtaarrs commentedFeb 20, 2024

Uh oh!

colesbury left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gh-112175: Add`eval_breaker` to`PyThreadState`#115194

gh-112175: Add`eval_breaker` to`PyThreadState`#115194

swtaarrs commentedFeb 9, 2024•
edited
Loading

colesbury left a comment•
edited
Loading

swtaarrs commentedFeb 12, 2024•
edited
Loading