This is the same as the previous PR with the following additional change. Theupdate_all_slots() andtype_setattro() functions are now more careful when the world is stopped. Instead of doing the MRO lookups while the world is stopped, we do them all first and collect the slot pointers to be updated. Then, we stop the world and do those updates. This makes it much easier to confirm the code running during the stop-the-world is safe and that should avoid the deadlocks.

Thetest_opcache test has become quite a bit slower. It seems to be due to mutex contention in the__getitem__ and__getattribute__ method assignment tests. I reduced the items count from 1000 to 100 to keep the test from becoming much slower.

Issue:Type slots are not thread-safe in free-threaded builds #127266

nascheme added2 commits

April 29, 2025 16:44

pythongh-127266: avoid data races when updating type slots

3094372

In the free-threaded build, avoid data races caused by updating typeslots or type flags after the type was initially created.  For those(typically rare) cases, use the stop-the-world mechanism.  Remove theuse of atomics when reading or writing type flags.  The use of atomicsis not sufficient to avoid races (since flags are sometimes read withouta lock and without atomics) and are no longer required.

For update_all_slots(), do updates more safely.

f447ce4

To avoid deadlocks while the world is stopped, we need to avoid calling APIslike _PyObject_HashFast() and _PyDict_GetItemRef_KnownHash().  Collect theslot updates to be done and then apply them all at once.  This reduces theamount of code running in the stop-the-world condition.

nascheme added type-bug

An unexpected behavior, bug, or error

topic-free-threading labels

Apr 29, 2025

bedevere-appbot mentioned this pull request

Apr 29, 2025

Type slots are not thread-safe in free-threaded builds#127266

Closed

Avoid "empty structure" compile error.

d511ca6

nascheme added the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

Apr 30, 2025

Copy link

bedevere-bot commentedApr 30, 2025

🤖 New build scheduled with the buildbot fleet by@nascheme for commitd511ca6 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F133177%2Fmerge

If you want to schedule another build, you need to add the🔨 test-with-buildbots label again.

bedevere-bot removed the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

Apr 30, 2025

nascheme added5 commits

April 29, 2025 22:40

Use apply_slot_updates() for type_setattro().

5e38497

Merge 'origin/main' into type-slot-ts-v2

e9516c7

Reduce number of items in test for slot updates.

8c74a0c

Now that stop-the-world is used to do the slot update, these testsare a lot slower in the free-threaded build.  Test with fewer items,which will still hopefully be enough to find bugs in the specializer.

Add TSAN suppression for _Py_slot_tp_getattr_hook.

6cd7644

Queue update of tp_flags as well.

3cb2256

The clearing of Py_TPFLAGS_HAVE_VECTORCALL must be done when the worldis stopped too.

nascheme added the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

Apr 30, 2025

Copy link

bedevere-bot commentedApr 30, 2025

🤖 New build scheduled with the buildbot fleet by@nascheme for commit3cb2256 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F133177%2Fmerge

If you want to schedule another build, you need to add the🔨 test-with-buildbots label again.

bedevere-bot removed the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

Apr 30, 2025

nascheme marked this pull request as ready for review

April 30, 2025 13:29

nascheme requested a review frommarkshannon as acode owner

April 30, 2025 13:29

bedevere-appbot added the awaiting core review label

Apr 30, 2025

nascheme requested a review fromcolesbury

April 30, 2025 13:33

nascheme added5 commits

April 30, 2025 06:45

Performance, skip stop-the-world when possible.

47e41c9

Since we stack allocate one chunk, we need to check 'n' to see if thereare actually any updates to make.  It's pretty common that no updatesare actually needed.

Merge 'origin/main' into type-slot-ts-v2

cb848f1

Always clear version after __bases__ update.

9859ebf

Merge 'origin/main' into type-slot-ts-v2

6c74cac

Add test for assigning __bases__.

583c435

colesbury reviewed

May 1, 2025

View reviewed changes

Lib/test/test_opcache.py OutdatedShow resolvedHide resolved

Tools/tsan/suppressions_free_threading.txt OutdatedShow resolvedHide resolved

Objects/typeobject.cShow resolvedHide resolved

Copy link

MemberAuthor

nascheme commentedMay 2, 2025

@colesbury Not sure if you would like this approach but it is a variation on your idea to modify the critical section to prevent release of the mutex. I changed it to work with a PyCriticalSection2 as well.

https://github.com/nascheme/cpython/tree/type-slot-ts-release-hack

Not using a critical section for the type dict looks kind of difficult to do. For example,_Py_dict_lookup asserts that the dict is locked. I think we would have to duplicate some dictobject functions to make non-lock-asserting versions.

Copy link

Contributor

colesbury commentedMay 5, 2025

Thetype-slot-ts-release-hack approach makes sense to me

Avoid releasing TYPE_LOCK when stopping the world.

c01707e

nascheme added2 commits

May 5, 2025 12:31

Merge 'origin/main' into type-slot-ts-v2

1b9cad5

Add issue number for TSAN suppression.

a1c6b05

Copy link

MemberAuthor

nascheme commentedMay 5, 2025

SeeGH-133467 for some remaining data race issues.

colesbury approved these changes

May 5, 2025

View reviewed changes

bedevere-appbot added awaiting merge and removed awaiting core review labels

May 5, 2025

Bug fix for type_lock_prevent_release().

3f6222b

If the two mutex form of the critical section is used, need to put theother mutex into '_cs_mutex'.

nascheme added the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

May 5, 2025

Copy link

bedevere-bot commentedMay 5, 2025

🤖 New build scheduled with the buildbot fleet by@nascheme for commit3f6222b 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F133177%2Fmerge

If you want to schedule another build, you need to add the🔨 test-with-buildbots label again.

bedevere-bot removed the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

May 5, 2025

Merge 'origin/main' into type-slot-ts-v2

6f218fb

kumaraditya303 reviewed

May 13, 2025

View reviewed changes

Objects/typeobject.cShow resolvedHide resolved

nascheme added5 commits

May 27, 2025 10:54

Add additional assert.

2bcf7ba

Merge 'origin/main' into type-slot-ts-v2

ddfdbd5

Revert test_opcache item size change.

63b7ae4

This test gets a bit slower, due to stop-the-world but it is not sodramatic.

Add comment for new unit test.

1a2fee1

Fix assert for default build.

c1f3ed5

nascheme added the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

May 27, 2025

Copy link

bedevere-bot commentedMay 27, 2025

🤖 New build scheduled with the buildbot fleet by@nascheme for commitc1f3ed5 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F133177%2Fmerge

If you want to schedule another build, you need to add the🔨 test-with-buildbots label again.

bedevere-bot removed the 🔨 test-with-buildbotsTest PR w/ buildbots; report in status section label

May 27, 2025

colesbury reviewed

May 27, 2025

View reviewed changes

Objects/typeobject.c OutdatedShow resolvedHide resolved

Improve function name.

41e54e1

The apply_slot_updates_world_stopped() name implies that the worldis already stopped, based on convention.  Rename for clarity.

nascheme merged commitfbbbc10 intopython:main

May 28, 2025

45 checks passed

bedevere-appbot removed the awaiting merge label

May 28, 2025

kumaraditya303 mentioned this pull request

May 28, 2025

Deadlock in test_opcache with gh-131174 applied#133130

Closed

kumaraditya303 mentioned this pull request

Jul 1, 2025

More gil-disabled type thread safety#114214

Closed

Labels

topic-free-threading type-bug

An unexpected behavior, bug, or error

4 participants

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-127266: avoid data races when updating type slots v2#133177

gh-127266: avoid data races when updating type slots v2#133177

Uh oh!

Conversation

nascheme commentedApr 29, 2025•
edited
Loading

Uh oh!

Uh oh!

bedevere-bot commentedApr 30, 2025

Uh oh!

bedevere-bot commentedApr 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nascheme commentedMay 2, 2025

Uh oh!

colesbury commentedMay 5, 2025

Uh oh!

nascheme commentedMay 5, 2025

Uh oh!

bedevere-bot commentedMay 5, 2025

Uh oh!

Uh oh!

bedevere-bot commentedMay 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Movatterモバイル変換

Uh oh!

gh-127266: avoid data races when updating type slots v2#133177

gh-127266: avoid data races when updating type slots v2#133177

Uh oh!

Conversation

nascheme commentedApr 29, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

bedevere-bot commentedApr 30, 2025

Uh oh!

bedevere-bot commentedApr 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nascheme commentedMay 2, 2025

Uh oh!

colesbury commentedMay 5, 2025

Uh oh!

nascheme commentedMay 5, 2025

Uh oh!

bedevere-bot commentedMay 5, 2025

Uh oh!

Uh oh!

bedevere-bot commentedMay 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nascheme commentedApr 29, 2025•
edited
Loading