python/cpythonPublic

NotificationsYou must be signed in to change notification settings
Fork34.1k
Star71.6k

typing.py: builtin LRU caches worsen leaks that exist in other code #98253

New issue

Closed

typing.py: builtin LRU caches worsen leaks that exist in other code#98253

Labels

performancePerformance or resource usagetopic-typingtype-featureA feature request or enhancement

Description

wjakob

opened

on Oct 13, 2022

Bug report

I would like to report a refleak issue involvingtyping.py. The issue is that it internally uses LRU caches to cache certain type-related lookups, and these caches are not cleaned up when the Python interpreter shuts down. This causes leaks that impede software development and debugging of refleaks in general.

This specific part oftyping.py has already once been identified as a source of refleaks by@gvanrossum (context:https://bugs.python.org/issue28649).

The following provides a small reproducer via a trivial package (https://github.com/wjakob/typing_repro) that exposes a class namedA usingnanobind. Whynanobind? It is extremely paranoid about any leaks involving bound types, functions, and instances, and prints warning messages to tell the user about this after the interpreter has shut down (it performs checks following finalization usingPy_AtExit()).

preparation:

$ pip install git+https://github.com/wjakob/typing_repro

Reproducer:

fromtyping_reproimportAimportpandasimporttypingdeftest(t:typing.Optional[A]=None):print(t)

Running this yields

nanobind: leaked 1 types!                                                                                          - leaked type "A"                                                                                                nanobind: leaked 2 functions! - leaked function "add" - leaked function "__init__"nanobind: this is likely caused by a reference counting issue in the binding code.

Note the import ofpandas, which serves the role of a bigger package that uses thetyping module and thereby populates the LRU caches.torch (PyTorch) ortensorflow also cause the issue, as doesmarkupsafe, others likely affected as well.

EDIT: The problem that is common to all of these packages is that they leak some of their own types. For example, byPy_INCREFing references to heap types within extension modules. Because these types usetyping.py and thereby reference the LRU caches (which are never cleaned up), it causes a flurry of refleaks that cascade into other packages.

Removing thetest() function or removing the type annotation fixes the issue. The problem is that declaration causes cache entries to be created that are never cleaned up, even when the interpreter finalizes.

There is another way to avoid the issue: at the bottom of the script, insert

for f in typing._cleanups:    f()

which clears the LRU caches intyping.py. Poof, errors gone. This leads me to suggest the following simple fix, to be added at the end oftyping.py:

def _cleanup_handler():    for f in _cleanups:        f()import atexit as _atexit_atexit.register(_cleanup_handler)

This will clear the caches and ensure that interpreter finalization can avoid those type annotation-related leaks.

Your environment

CPython versions tested on: 3.8.10 and 3.10.;7
Operating system and architecture: Linux and macOS

PR:gh-98253: Break potential reference cycles in external code worsened by typing.py lru_cache #98591

Metadata

Assignees

No one assigned

Labels

performancePerformance or resource usagetopic-typingtype-featureA feature request or enhancement

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

typing.py: builtin LRU caches worsen leaks that exist in other code #98253

Description

Bug report

Your environment

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions