Uh oh!
There was an error while loading.Please reload this page.
- Notifications
You must be signed in to change notification settings - Fork32k
GH-102613: Improve performance ofpathlib.Path.rglob()
#104244
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
Merged
barneygale merged 4 commits intopython:mainfrombarneygale:gh-102613-merge-neighbouring-star-star-in-globMay 7, 2023
Merged
GH-102613: Improve performance ofpathlib.Path.rglob()
#104244
barneygale merged 4 commits intopython:mainfrombarneygale:gh-102613-merge-neighbouring-star-star-in-globMay 7, 2023
Uh oh!
There was an error while loading.Please reload this page.
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
Stop de-duplicating results in `_RecursiveWildcardSelector`. A new`_DoubleRecursiveWildcardSelector` class is introduced which performsde-duplication, but this is used _only_ for patterns with multiplenon-adjacent `**` segments, such as `path.glob('**/foo/**')`. By avoidingthe use of a set, `PurePath.__hash__()` is not called, and so paths do notneed to be parsed and (case-) normalised.Also merge adjacent '**' segments in patterns.
Uh oh!
There was an error while loading.Please reload this page.
JelleZijlstra approved these changesMay 7, 2023
jbower-fb pushed a commit to jbower-fb/cpython that referenced this pull requestMay 8, 2023
…nGH-104244)Stop de-duplicating results in `_RecursiveWildcardSelector`. A new`_DoubleRecursiveWildcardSelector` class is introduced which performsde-duplication, but this is used _only_ for patterns with multiplenon-adjacent `**` segments, such as `path.glob('**/foo/**')`. By avoidingthe use of a set, `PurePath.__hash__()` is not called, and so paths do notneed to be stringified and case-normalised.Also merge adjacent '**' segments in patterns.
carljm added a commit to carljm/cpython that referenced this pull requestMay 9, 2023
* main: (47 commits)pythongh-97696 Remove unnecessary check for eager_start kwarg (python#104188)pythonGH-104308: socket.getnameinfo should release the GIL (python#104307)pythongh-104310: Add importlib.util.allowing_all_extensions() (pythongh-104311)pythongh-99113: A Per-Interpreter GIL! (pythongh-104210)pythonGH-104284: Fix documentation gettext build (python#104296)pythongh-89550: Buffer GzipFile.write to reduce execution time by ~15% (python#101251)pythongh-104223: Fix issues with inheriting from buffer classes (python#104227)pythongh-99108: fix typo in Modules/Setup (python#104293)pythonGH-104145: Use fully-qualified cross reference types for the bisect module (python#104172)pythongh-103193: Improve `getattr_static` test coverage (python#104286) Trim trailing whitespace and test on CI (python#104275)pythongh-102500: Remove mention of bytes shorthand (python#104281)pythongh-97696: Improve and fix documentation for asyncio eager tasks (python#104256)pythongh-99108: Replace SHA3 implementation HACL* version (python#103597)pythongh-104273: Remove redundant len() calls in argparse function (python#104274)pythongh-64660: Don't hardcode Argument Clinic return converter result variable name (python#104200)pythongh-104265 Disallow instantiation of `_csv.Reader` and `_csv.Writer` (python#104266)pythonGH-102613: Improve performance of `pathlib.Path.rglob()` (pythonGH-104244)pythongh-103650: Fix perf maps address format (python#103651)pythonGH-89812: Churn `pathlib.Path` methods (pythonGH-104243) ...
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading.Please reload this page.
Stop de-duplicating results in
_RecursiveWildcardSelector
. A new_DoubleRecursiveWildcardSelector
class is introduced which performs de-duplication, but this is usedonly for patterns with multiple non-adjacent**
segments, such aspath.glob('**/foo/**')
. By avoiding the use of a set in most cases,PurePath.__hash__()
is not called, and so paths do not need to be parsed and (case-) normalised.Also merge adjacent
**
segments in patterns.Timings: