gh-142145: Avoid timing measurements in quadratic behavior test #143105
Conversation
Count the number of Element attribute accesses as a proxy for work done. With double the amount of work, a ratio of 2.0 indicates linear scaling and 4.0 quadratic scaling. Use 3.2 as an intermediate threshold.
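For illustration, a minimal sketch of the counting approach (not the exact test from the patch; the helper name, node counts, and document shape here are illustrative). support.swap_attr from test.support temporarily replaces Element.__getattribute__ with a counting wrapper:

```python
from test import support
from xml.dom.minidom import Document, Element

def count_attribute_accesses(num_children):
    total_calls = 0

    def getattribute_counter(self, attr):
        nonlocal total_calls
        total_calls += 1
        return object.__getattribute__(self, attr)

    # Temporarily count every attribute access on Element instances.
    with support.swap_attr(Element, "__getattribute__", getattribute_counter):
        dom = Document()
        root = dom.appendChild(dom.createElement("root"))
        for _ in range(num_children):
            root.appendChild(dom.createTextNode("Hello"))
    dom.unlink()
    return total_calls

# Doubling the work should roughly double the count if appendChild() is
# linear; quadratic behavior roughly quadruples it. 3.2 splits the two.
w1 = count_attribute_accesses(1_000)
w2 = count_attribute_accesses(2_000)
assert w1 > 0              # guard against the counter silently breaking
assert w2 / w1 < 3.2
```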
colesbury commented Dec 23, 2025
The context for this is that I'm trying to get the full test suite in a state where we can run it under TSan. TSan tends to be much slower, so timing tests are even more flaky. Let me know what you think about this approach. For context, I used ChatGPT to generate the test and then edited it. (It was pretty good, but not perfect.)
sethmlarson commented Dec 23, 2025
Thanks @colesbury, did you happen to run this test with the commit in question reverted to see that this test would indeed catch quadratic behavior in the number of accesses?
colesbury commented Dec 23, 2025
Yes, if you revert the fix then it fails with:
sethmlarson left a comment
LGTM, thanks @colesbury!
Lib/test/test_minidom.py Outdated
@support.requires_resource('cpu')
def testAppendChildNoQuadraticComplexity(self):
    # Don't use wall-clock timing (too flaky). Instead count a proxy for the
Elide the model's explanatory comment about the old, no-longer-present code.
self.assertEqual(dom.documentElement.childNodes[-1].data, "Hello")
dom.unlink()
@support.requires_resource('cpu')
why remove this? i realize the test shouldn't be "slow" now, but the intent of adding it was to mark it as a test that may not be suitable for slow builds and loaded systems. some of our buildbots run with the cpu resource disabled for that reason. it's more of a performance test no matter how it is implemented.
for tests where it isn't platform specific and we just need something in our support tiers to catch an unlikely regression of the issue, resource tags save effort.
I'll add it back. I removed it because my understanding was that "cpu" was for tests that were CPU-heavy, and now this test is very fast (~30 ms in a debug build).
total_calls += 1
return object.__getattribute__(self, attr)
with support.swap_attr(Element, "__getattribute__", getattribute_counter):
doing this is... gross, but likely works. your assertGreater(w1, 0) below covers the case when it stops doing so. LGTM
the other thought i was having as i did the simple "raise the timeout to 4s" to unblock the earlier ones was that we don't need absolute times at all. measuring relative time taken as we increase the amount of work to ensure that it scales - similar to this scaling measurement of attribute accesses - would be sufficient and should produce similar results on super slow builds or heavily loaded systems. it would still be time based (so maybe use resource.getrusage(resource.RUSAGE_SELF).ru_utime instead of time APIs... though i suspect rusage is low-res) but far less "works on my system"
we have a smattering of other timing-based cpu performance-y tests in the repo. i don't think we've codified a reusable best practice.
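As a rough sketch of that relative-timing idea (assuming the Unix-only resource module and that ru_utime's resolution is adequate; the workload and input sizes below are placeholders, not from this PR):

```python
import resource  # Unix-only

def cpu_time():
    # User CPU time consumed by this process; unlike wall-clock time it
    # is largely unaffected by other processes competing for the machine.
    return resource.getrusage(resource.RUSAGE_SELF).ru_utime

def measure(workload, n):
    start = cpu_time()
    workload(n)
    return cpu_time() - start

def workload(n):
    # Placeholder for the code under test (e.g. an appendChild() loop);
    # this append loop is linear by construction.
    items = []
    for i in range(n):
        items.append(i)

# Doubling the input should roughly double the CPU time for linear code;
# a ratio near 4 indicates quadratic scaling. Inputs must be large enough
# that ru_utime's resolution does not dominate the measurement.
t1 = measure(workload, 2_000_000)
t2 = measure(workload, 4_000_000)
assert t1 > 0, "workload too small for ru_utime resolution"
assert t2 / t1 < 3.2
```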
> measuring relative time taken as we increase the amount of work ... would be sufficient
It'd be great to have some sort of standard best practice for this. If resource.getrusage(resource.RUSAGE_SELF).ru_utime isn't too affected by other processes, that would be good.
Some other things I was thinking about:
- Count cycles (via perf_event_open?)
- Count bytecodes executed (by instrumenting ceval.c in debug builds)
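For the bytecode-counting idea, a rough pure-Python approximation (not the ceval.c instrumentation suggested above) is opcode-level tracing via sys.settrace. It only sees Python-level bytecode, not work done in C, and slows execution considerably, but the counts are deterministic across machines:

```python
import sys

def count_opcodes(func, *args):
    # Count bytecode instructions executed while running func(*args).
    count = 0

    def tracer(frame, event, arg):
        nonlocal count
        if event == 'call':
            frame.f_trace_opcodes = True  # request 'opcode' events
        elif event == 'opcode':
            count += 1
        return tracer

    sys.settrace(tracer)
    try:
        func(*args)
    finally:
        sys.settrace(None)
    return count

def workload(n):
    total = 0
    for i in range(n):
        total += i
    return total

# For a linear function, doubling n roughly doubles the opcode count.
c1 = count_opcodes(workload, 1_000)
c2 = count_opcodes(workload, 2_000)
print(c1, c2, c2 / c1)
```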