Member

markshannon left a comment

Is the _PyBinopSpecializationDescr supposed to be extensible?
If so we'll need a means for extensions to free the structs they have allocated. Or is the intention that they are statically allocated?
If they aren't extensible, maybe the VM should allocate them and clean them up?

Python/specialize.c (outdated, resolved)

Modules/arraymodule.c (resolved)

Include/cpython/object.h (resolved)

set oparg, check *descr

89b6c37

picnixz reviewed

Include/cpython/object.h (outdated, resolved)

Include/object.h (outdated, resolved)

Modules/arraymodule.c (resolved)

Modules/arraymodule.c (outdated, resolved)

Python/specialize.c (outdated, resolved)

Python/specialize.c (resolved)

add free

6630a95

Member, Author

iritkatriel commented May 5, 2025

> Is the _PyBinopSpecializationDescr supposed to be extensible?

I assumed it's extensible and the extension manages it. The alternative is that it has a void* field that the extension sets and manages. The advantage of letting extensions allocate is that if there is no dynamic information, you don't need to allocate a new one for each instruction that uses the specialisation.
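To make the trade-off concrete, here is a minimal Python sketch of the two allocation strategies described above. It is a toy model, not the CPython C API: `ToyDescr`, `SHARED_DESCR`, `specialize` and the `need_dynamic_info` flag are all hypothetical names invented for illustration. A descriptor with no per-site state can be shared by every instruction, while one that carries dynamic information has to be allocated per site and comes with a `free` hook that its owner must call.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass
class ToyDescr:
    """Hypothetical stand-in for a binary-op specialization descriptor."""
    guard: Callable[[Any, Any], bool]
    action: Callable[[Any, Any], Any]
    # None means "statically allocated, nothing to free".
    free: Optional[Callable[["ToyDescr"], None]] = None
    # Plays the role of the void* field mentioned as the alternative design.
    data: Any = None


def _is_list_int(lhs, rhs):
    return type(lhs) is list and type(rhs) is int


def _list_getitem(lhs, rhs):
    return lhs[rhs]


# No dynamic information: one shared descriptor serves every call site.
SHARED_DESCR = ToyDescr(guard=_is_list_int, action=_list_getitem)

freed = []  # records descriptors that were released through their free hook


def specialize(lhs, rhs, need_dynamic_info=False):
    """Return a descriptor for this call site, or None if not applicable."""
    if not _is_list_int(lhs, rhs):
        return None
    if not need_dynamic_info:
        return SHARED_DESCR
    # Dynamic information: allocate per site; the owner must free it later.
    return ToyDescr(_is_list_int, _list_getitem,
                    free=freed.append, data=len(lhs))


site_a = specialize([1, 2, 3], 0)
site_b = specialize([4, 5], 1)
site_c = specialize([6, 7], 1, need_dynamic_info=True)
```

In this sketch `site_a` and `site_b` are the same object, so nothing needs freeing, whereas `site_c` is a fresh allocation whose `free` hook has to be invoked by whoever owns the instruction's cache.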

review comments

f42d42f

Contributor

eendebakpt commented May 5, 2025

Are there any microbenchmarks that show this improves performance?

whitespace

509b27f

Co-authored-by: Bénédikt Tran <10796600+picnixz@users.noreply.github.com>

picnixz reviewed

Python/specialize.c (resolved)

eendebakpt reviewed

Modules/arraymodule.c (resolved)

iritkatriel added 3 commits

May 5, 2025 12:43

fix error

5c1ed68

remove unused

fd24d7b

use Py_ARRAY_LENGTH

b15ad61

iritkatriel and others added 3 commits

May 5, 2025 13:16

fix deopt

81a3008

'experimental' comment

0190ecf

Merge branch 'main' into array_subscr

d8b5848

mdboom approved these changes

Include/cpython/object.h (outdated, resolved)

Misc/NEWS.d/next/Core_and_Builtins/2025-05-04-19-32-47.gh-issue-133395.VhWWEP.rst (outdated, resolved)

Python/specialize.c (resolved)

bedevere-app bot added the awaiting merge label and removed the awaiting core review label

iritkatriel and others added 3 commits

May 5, 2025 15:13

credits

78f0576

comment on commutativity

90fb993

Apply suggestions from code review

454bfc5

Co-authored-by: Michael Droettboom <mdboom@gmail.com>

iritkatriel merged commit 082dbf7 into python:main

60 checks passed

bedevere-app bot removed the awaiting merge label

brandtbucher reviewed

Python/executor_cases.c.h

		}
		break;
		}
		/* _GUARD_BINARY_OP_EXTEND is not a viable micro-op for tier 2 because it uses the 'this_instr' variable */

Member

brandtbucher May 6, 2025

FYI, this change made it so that any traces containing BINARY_OP_EXTEND can't be JIT-compiled. Is there a way to rewrite this to be tier-2 friendly? Mutating inline caches doesn't work there.

Member, Author

iritkatriel May 6, 2025

We probably don't have to NULL the cache, just change the opcode to BINARY_OP and hope nobody tries to use the pointer in the cache anymore. Will make it harder to debug if they do.
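The idea of deoptimizing by rewriting only the opcode can be pictured with a small Python toy. This is an illustrative sketch under assumed names (`ToyInstr`, `specialize`, `run`), not how the interpreter is actually implemented: when the guard fails, the instruction falls back to the generic opcode and the cache entry is deliberately left in place, which is exactly the stale pointer the comment above warns about.

```python
import operator

BINARY_OP = "BINARY_OP"
BINARY_OP_EXTEND = "BINARY_OP_EXTEND"


class ToyInstr:
    """Toy instruction: an opcode plus one inline cache slot."""

    def __init__(self):
        self.opcode = BINARY_OP
        self.cache = None  # would hold the descriptor pointer


def specialize(instr, guard, action):
    """Install a (guard, action) pair and switch to the specialized opcode."""
    instr.cache = (guard, action)
    instr.opcode = BINARY_OP_EXTEND


def run(instr, lhs, rhs):
    """Execute a subscript through the instruction, deoptimizing on guard failure."""
    if instr.opcode == BINARY_OP_EXTEND:
        guard, action = instr.cache
        if guard(lhs, rhs):
            return action(lhs, rhs)
        # Guard failed: rewrite only the opcode. The cache is NOT cleared,
        # so it now holds a stale entry that nothing should read again.
        instr.opcode = BINARY_OP
    return operator.getitem(lhs, rhs)


instr = ToyInstr()
specialize(instr, lambda lhs, rhs: type(lhs) is list, operator.getitem)

fast = run(instr, [10, 20], 1)      # guard passes, specialized path
opcode_after_fast = instr.opcode
slow = run(instr, (30, 40), 0)      # guard fails, falls back and deopts
opcode_after_slow = instr.opcode
stale_cache = instr.cache           # still populated after the deopt
```

The toy also shows the debugging hazard: after the fallback, `instr.cache` still looks valid even though the opcode no longer gives anyone a reason to trust it.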

Member

brandtbucher commented May 6, 2025

Also, based on the benchmarking results that were posted, it looks like the coverage, sqlalchemy_declarative, and sqlalchemy_imperative benchmarks crashed on this branch but succeeded on the base commit. Were these fixed before merging?

Member

nascheme commented May 6, 2025

This seems to cause the refleaks unit test to fail for the test_datetime module. The leak seems to be related to the ZoneInfoTest class, which uses the array.array type.

Member, Author

iritkatriel commented May 6, 2025

The crash should have been fixed. I'll check the leak.

Re tier 2 - maybe there's a way but we can just deopt them when jitting if not.

Member

vstinner commented May 6, 2025

> This seems to cause the refleaks unit test to fail for the test_datetime module. The leak seems to be related to the ZoneInfoTest class, which uses the array.array type.

The following code is enough to reproduce the issue:

    import array
    import gc

    def func():
        a = array.array('B', b'hello world')

        # call array_binop_specialize() 3 times, once per loop
        print("specialize")
        for _ in range(10):
            a[1]

        print("specialize")
        for _ in range(10):
            a[1]

        print("specialize")
        for _ in range(10):
            a[1]

    func()

    # array_subscr_free() is not called
    del func
    gc.collect()

The specialization calls PyMem_Malloc() in array_binop_specialize(), but nothing calls PyMem_Free(): array_subscr_free() is not called.
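One hedged way to observe this kind of growth from Python is to compare traced memory before and after repeated calls. The helper below is a hypothetical sketch (`traced_growth`, `leaky` and `clean` are names made up here) and is not the mechanism CPython's refleak tests use. It relies on the standard `tracemalloc` module, which, as far as I know, also records blocks obtained through `PyMem_Malloc()`. The two demo functions are plain Python stand-ins for a leaking and a non-leaking workload.

```python
import gc
import tracemalloc


def traced_growth(fn, rounds=5):
    """Return the change in traced bytes across `rounds` calls of fn."""
    started_here = not tracemalloc.is_tracing()
    if started_here:
        tracemalloc.start()
    try:
        fn()  # warm-up, so one-time allocations are not counted
        gc.collect()
        before, _peak = tracemalloc.get_traced_memory()
        for _ in range(rounds):
            fn()
        gc.collect()
        after, _peak = tracemalloc.get_traced_memory()
        return after - before
    finally:
        if started_here:
            tracemalloc.stop()


_kept = []  # simulates memory that nothing ever releases


def leaky():
    # Each call keeps a 100 kB buffer alive forever.
    _kept.append(bytearray(100_000))


def clean():
    # The buffer is released as soon as the call returns.
    bytearray(100_000)


leaky_growth = traced_growth(leaky)
clean_growth = traced_growth(clean)
```

On an affected build, wrapping the reproducer's `func` definition and call in a function and passing it to such a helper would be one way to look for steadily growing memory. The exact numbers depend on the interpreter build, so none are claimed here.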

hugovk mentioned this pull request

test_datetime leaks references #133496 (closed)

Member

Eclips4 commented May 6, 2025

It seems that array_subscr_guard behaves incorrectly. It never returns 0 (at least I can't make it return 0), which is needed to trigger array_subscr_free.

Member, Author

iritkatriel commented May 6, 2025

> It seems that array_subscr_guard behaves incorrectly. It never returns 0 (at least I can't make it return 0), which is needed to trigger array_subscr_free.

I don't think that's the issue. Free needs to happen when the code object is freed, and currently it's not.
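The lifetime rule stated here can be sketched with a toy model in Python. Everything in it (`ToyCode`, `specialize`, `despecialize`, `dealloc`) is hypothetical and only illustrates the ownership idea: a descriptor stored in an instruction's cache must be released either when that instruction is despecialized or, at the latest, when the owning code object goes away.

```python
class ToyCode:
    """Toy code object: one cache slot per specialized instruction."""

    def __init__(self):
        self.caches = {}  # instruction offset -> (descr, free_fn)

    def specialize(self, offset, descr, free_fn):
        # Release anything previously cached at this offset first.
        self.despecialize(offset)
        self.caches[offset] = (descr, free_fn)

    def despecialize(self, offset):
        entry = self.caches.pop(offset, None)
        if entry is not None:
            descr, free_fn = entry
            free_fn(descr)

    def dealloc(self):
        # Free every descriptor that is still cached when the code dies.
        for offset in list(self.caches):
            self.despecialize(offset)


released = []
code = ToyCode()
code.specialize(0, "descr-0", released.append)
code.specialize(4, "descr-4", released.append)

code.despecialize(0)  # guard-failure path: frees one descriptor
code.dealloc()        # code object freed: releases whatever is left
```

Without the `dealloc` step, "descr-4" would never be released in this model, which mirrors the leak in the reproducer: the guard keeps succeeding, so the guard-failure path never runs.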

iritkatriel added a commit to iritkatriel/cpython that referenced this pull request

Revert "pythongh-133395: add option for extension modules to specialize BINARY_OP/SUBSCR, apply to arrays (python#133396)"

35c1b5e

This reverts commit 082dbf7.

Member, Author

iritkatriel commented May 6, 2025

I'm going to revert this so it doesn't delay the release today.
#133498

It's not terribly important for this to be in 3.14.

hugovk pushed a commit that referenced this pull request