gh-91555: disable logger while handling log record #131812
Conversation
Prevent the possibility of re-entrancy and deadlock or infinite recursion caused by logging triggered by logging by disabling logging while the logger is handling log messages.
Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool. If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.
```python
if self._is_disabled():
    return
maybe_record = self.filter(record)
if not maybe_record:
```
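For orientation, the per-thread guard being discussed could be implemented roughly as in the sketch below. This is an illustration only: the subclass and the `_tls`/`in_progress` attribute names are assumptions, not the actual patch.

```python
import logging
import threading


class GuardedLogger(logging.Logger):
    """Sketch only: drop records logged by a thread that is already inside
    this logger's handle(), so a handler that itself logs cannot recurse
    or deadlock. Names here are assumptions, not the merged code."""

    def __init__(self, name, level=logging.NOTSET):
        super().__init__(name, level)
        self._tls = threading.local()

    def _is_disabled(self):
        # True only for the thread that is currently handling a record.
        return self.disabled or getattr(self._tls, 'in_progress', False)

    def handle(self, record):
        if self._is_disabled():
            return
        self._tls.in_progress = True
        try:
            maybe_record = self.filter(record)
            if not maybe_record:
                return
            if isinstance(maybe_record, logging.LogRecord):
                record = maybe_record
            self.callHandlers(record)
        finally:
            self._tls.in_progress = False
```

With a guard like this, a handler that logs from inside emit() simply has its nested record dropped on that thread, while other threads logging to the same logger are unaffected.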
graingert commented Mar 29, 2025 (edited)
rather than disabling the logging, can we instead append the record to a `self._reentrant_records = collections.deque()`, and then process all of the pending records:

```python
maybe_record = self.filter(record)
if not maybe_record:
    return
if isinstance(maybe_record, LogRecord):
    record = maybe_record
was_calling_handlers = set_calling_handlers()
try:
    if not was_calling_handlers:
        self.callHandlers(record)
        while True:
            try:
                record = self._reentrant_records.popleft()
            except IndexError:
                return
            self.callHandlers(record)
    else:
        self._reentrant_records.append(record)
finally:
    set_not_calling_handlers()
```
This will still produce a stack overflow if handling the deferred log message itself logs another message
graingert commented Mar 30, 2025 (edited)
I'm not following how this would cause a stack overflow: if handling the log message logs another message it would go onto the `_reentrant_records` queue, and then be processed later once the stack returns all the way back to where `set_calling_handlers()` is first called.
> I'm not following how this would cause a stack overflow: if handling the log message logs another message it would go onto the `_reentrant_records` queue, and then be processed later once the stack returns all the way back to where `set_calling_handlers()` is first called.
Sorry, I should have said deadlock with the current example. The stack overflow is from a different way of triggering this (see the second unit test added).
The trouble is that when the first recursive logging call exits the `finally` block it clears the "calling handlers" flag, which means a subsequent (still recursive) one takes the wrong path and deadlocks/overflows.
That can be avoided for the original triggering example by only clearing the "handling" flag if it was initially unset (the deferred records collection also needs to be TLS not a member variable). It ends up looking something like this:
```python
if not hasattr(self._tls, 'reentrant_records'):
    self._tls.reentrant_records = deque()
deferred = self._tls.reentrant_records
was_calling_handlers = self.set_calling_handlers()
try:
    if not was_calling_handlers:
        self.callHandlers(record)
        while deferred:
            self.callHandlers(deferred.popleft())
    else:
        deferred.append(record)
finally:
    if not was_calling_handlers:
        self.set_not_calling_handlers()
```
This fixes the two bugs, which only log the *first* time they try to process a log record (and means those recursive log messages are logged and not silently ignored, which is nice). However a different example which logs *every* time (such as the second unit test) will still live-lock and never exit that `while` loop.
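For concreteness, a handler that logs *every* time it handles a record — roughly the shape of that second unit test, with hypothetical names — would look something like this:

```python
import logging


class ChattyHandler(logging.Handler):
    """Hypothetical handler that logs another message from inside emit().

    With the deferred-queue approach each emit() enqueues a fresh record,
    so the drain loop never empties; with the 'disable while handling'
    approach the nested call is simply dropped."""

    def emit(self, record):
        logging.getLogger(record.name).warning(
            "while handling: %s", record.getMessage())


logger = logging.getLogger("example")
logger.addHandler(ChattyHandler())
# Without any protection this recurses until RecursionError; with the
# deferred queue it loops forever; with the per-thread disable the nested
# warning is silently ignored.
logger.error("boom")
```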
graingert commented Mar 30, 2025 (edited)
> will still live-lock and never exit that `while` loop.
Does the system keep logging forever instead?
This seems acceptable as you'd easily track down this bug just by looking at the logs
> Does the system keep logging forever instead?
Yep, it will just sit in a tight loop spamming the log forever, or at least until/unless it exhausts disk space or fills up wherever the logs are actually going.
> This seems acceptable as you'd easily track down this bug just by looking at the logs
IMO it is not a *great* failure mode, but it will certainly be obvious!
FWIW I think I prefer ignoring them: the code is much simpler and it prevents the issue in non-trivial handler implementations like Sentry's (that would otherwise trigger the live-lock failure). I was hoping this fix would mean they would be able to remove that nasty monkey-patching on supported versions.
OTOH it is definitely nice to actually handle instead of drop the recursive log messages, in cases where they don't always continue to recurse.
graingert commented Mar 30, 2025 (edited)
we could have a `logging.(_)N_RECURSIVE_CALLS` constant to limit this so it's not forever
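Sketching what such a cap could look like (the constant and helper below are hypothetical, not an existing logging API):

```python
from collections import deque

# Hypothetical limit on how many recursively-logged records get drained.
N_RECURSIVE_CALLS = 100


def drain_deferred(logger, deferred: deque, record):
    """Handle `record` plus at most N_RECURSIVE_CALLS deferred records,
    then drop whatever is left rather than looping forever."""
    logger.callHandlers(record)
    for _ in range(N_RECURSIVE_CALLS):
        if not deferred:
            return
        logger.callHandlers(deferred.popleft())
    deferred.clear()  # give up: handlers keep logging recursively
```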
We could. Another alternative would be Victor Stinner's suggestion in the discourse discussion to raise an exception. That would bring it to the user's attention and force them to deal with it.
Ultimately, though, the way they will have to deal with it is by preventing, disabling, or otherwise intercepting and ignoring all such logging. That will be difficult to do reliably outside the core, will likely be overlooked unless/until it bites, and will have to be done in every susceptible log handler or application that uses one.
IMO it would be better for us to do this once, centrally, with a small, simple, and robust fix.
TODO list:
> Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool. If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.
Would a news entry be appropriate for this?
yes this needs a news entry
bedevere-bot commented Apr 14, 2025
🤖 New build scheduled with the buildbot fleet by @vsajip for commit 7b68d12 🤖
Results will be shown at: https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F131812%2Fmerge
If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.
@duaneg, from the issue:
> It disables logging on a per-thread basis and only while the thread runs the message handlers (+filters). Which is to say, it is disabled only while running code that would trigger this bug if it logged a message. Or at least, that is my intent: if I've overlooked anything or there is a bug, please let me know!
> It doesn't stop other threads from logging to the same logger and/or handlers.
I think a test with logging from multiple threads and other handlers should be added to confirm this.
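Something along these lines, perhaps — hypothetical names and structure, not necessarily the test that ends up in the PR: one thread blocks inside a handler while another thread logs to the same logger, and the second thread must neither deadlock nor have its record dropped.

```python
import logging
import threading
import unittest


class BlockingHandler(logging.Handler):
    """Hypothetical handler that blocks inside emit() until released."""

    def __init__(self, entered, release):
        super().__init__()
        self.entered = entered
        self.release = release

    def emit(self, record):
        self.entered.set()
        self.release.wait(timeout=5)


class RecordingHandler(logging.Handler):
    """Hypothetical handler that just remembers the messages it saw."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


class OtherThreadsNotBlockedTest(unittest.TestCase):
    def test_handling_does_not_block_other_threads(self):
        entered, release = threading.Event(), threading.Event()
        blocking = BlockingHandler(entered, release)
        # Route only the worker's record to the blocking handler, so the
        # main thread never waits on that handler's internal lock.
        blocking.addFilter(lambda record: record.getMessage() == "slow")
        recorder = RecordingHandler()

        logger = logging.getLogger("example.threads")
        logger.propagate = False
        logger.setLevel(logging.INFO)
        logger.addHandler(blocking)
        logger.addHandler(recorder)

        worker = threading.Thread(target=logger.info, args=("slow",))
        worker.start()
        try:
            self.assertTrue(entered.wait(timeout=5))
            # The worker is still inside Logger.handle(); the per-thread
            # guard must not suppress or deadlock logging from this thread.
            logger.info("fast")
            self.assertIn("fast", recorder.messages)
        finally:
            release.set()
            worker.join()
            logger.removeHandler(blocking)
            logger.removeHandler(recorder)
```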
A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated. Once you have made the requested changes, please leave a comment on this pull request containing the phrase: I have made the requested changes; please review again.
…andles a message does not block a different thread from handling a message on the same logger.
Good idea, will add, thanks!
bedevere-bot commented Apr 18, 2025
🤖 New build scheduled with the buildbot fleet by @vsajip for commit 99788f7 🤖
Results will be shown at: https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F131812%2Fmerge
If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again.
Merged commit 2561e14 into python:main
GH-133898 is a backport of this pull request to the 3.13 branch.
GH-133899 is a backport of this pull request to the 3.14 branch.
…GH-133898)
Co-authored-by: Duane Griffin <duaneg@dghda.com>
…GH-133899)
Prevent the possibility of re-entrancy leading to deadlock or infinite recursion (caused by logging triggered by logging), by disabling logging while the logger is handling log messages.
(cherry picked from commit 2561e14)
Co-authored-by: Duane Griffin <duaneg@dghda.com>
Prevent the possibility of re-entrancy and deadlock or infinite recursion caused by logging triggered by logging by disabling logging while the logger is handling log messages.