Movatterモバイル変換


[0]ホーム

URL:


homepage

Issue26574

This issue trackerhas been migrated toGitHub, and is currentlyread-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

classification
Title:replace_interleave can be optimized for single character byte strings
Type:performanceStage:
Components:Library (Lib)Versions:Python 3.6
process
Status:closedResolution:fixed
Dependencies:Superseder:
Assigned To:Nosy List: Josh Snider, python-dev, serhiy.storchaka, vstinner
Priority:normalKeywords:patch

Created on2016-03-16 23:28 byJosh Snider, last changed2022-04-11 14:58 byadmin. This issue is nowclosed.

Files
File nameUploadedDescriptionEdit
bytes.patchJosh Snider,2016-03-16 23:28review
bytes-2.patchJosh Snider,2016-03-17 00:43review
bench.pyvstinner,2016-03-21 09:39
Messages (8)
msg261870 -(view)Author: Josh Snider (Josh Snider)*Date: 2016-03-16 23:28
replace_interleave inObjects/bytesobject.c andObjects/bytearrayobject.c can be optimized for the special case where the interleaving byte string is a single character.Here's some quick results from timeit showing that it's about three times faster for the special case.* Before (cold start):>>> timeit.timeit('(b"x" * 2000000).replace(b"", b".")', number=1000)7.619218342995737* After (cold start):>>> timeit.timeit('(b"x" * 2000000).replace(b"", b".")', number=1000)2.7605581780080684For the non-special case, running timeit.timeit('(b"x" * 2000000).replace(b"", b".0")', number=10000) takes ~173 seconds on both versions.
msg261871 -(view)Author: STINNER Victor (vstinner)*(Python committer)Date: 2016-03-16 23:45
I reviewed your patch on Rietveld (you should get an email notification).
msg261873 -(view)Author: Josh Snider (Josh Snider)*Date: 2016-03-17 00:43
Addresses review comments.
msg262111 -(view)Author: STINNER Victor (vstinner)*(Python committer)Date: 2016-03-21 09:39
I wrote a microbenchmark with my benchmark.py tool.The patch always make bytes.replace(b'', char) and bytearray.replace(b'', char) faster even for strings of 10 bytes, the speedup on string of 1000 bytes or more is very interesting, even I never used this Python instruction :-)-------------+-------------+---------------type bytes   |        orig |          patch-------------+-------------+---------------length=10    |  250 ns (*) |  211 ns (-15%)length=10**3 | 4.67 us (*) | 1.07 us (-77%)length=10**5 |  441 us (*) | 78.2 us (-82%)-------------+-------------+---------------Total        |  446 us (*) | 79.5 us (-82%)-------------+-------------+------------------------------+-------------+---------------type bytearray |        orig |          patch---------------+-------------+---------------length=10      |  266 ns (*) |  224 ns (-16%)length=10**3   | 4.67 us (*) | 1.08 us (-77%)length=10**5   |  441 us (*) | 78.3 us (-82%)---------------+-------------+---------------Total          |  446 us (*) | 79.6 us (-82%)---------------+-------------+------------------------------+------------+---------------Summary        |       orig |          patch---------------+------------+---------------type bytes     | 446 us (*) | 79.5 us (-82%)type bytearray | 446 us (*) | 79.6 us (-82%)---------------+------------+---------------Total          | 892 us (*) |  159 us (-82%)---------------+------------+---------------
msg262112 -(view)Author: Roundup Robot (python-dev)(Python triager)Date: 2016-03-21 09:39
New changeset62e3b7af0697 by Victor Stinner in branch 'default':Optimize bytes.replace(b'', b'.')https://hg.python.org/cpython/rev/62e3b7af0697
msg262113 -(view)Author: STINNER Victor (vstinner)*(Python committer)Date: 2016-03-21 09:40
I pushed my latest patches, thanks for your contribution Josh.
msg262114 -(view)Author: Serhiy Storchaka (serhiy.storchaka)*(Python committer)Date: 2016-03-21 10:32
Is it worth to optimize this pretty rare special case?
msg262115 -(view)Author: STINNER Victor (vstinner)*(Python committer)Date: 2016-03-21 10:39
> Is it worth to optimize this pretty rare special case?There was a TODO in the code, so I guess that the author wanted to write specialized code for 1-char replacement. Since the patch is short (adds 8 lines of C code), I consider that it's ok to optimize it.
History
DateUserActionArgs
2022-04-11 14:58:28adminsetgithub: 70761
2016-03-21 10:39:44vstinnersetmessages: +msg262115
2016-03-21 10:32:03serhiy.storchakasetnosy: +serhiy.storchaka
messages: +msg262114
2016-03-21 09:40:19vstinnersetstatus: open -> closed
resolution: fixed
messages: +msg262113
2016-03-21 09:39:52python-devsetnosy: +python-dev
messages: +msg262112
2016-03-21 09:39:25vstinnersetfiles: +bench.py

messages: +msg262111
2016-03-17 00:43:02Josh Snidersetfiles: +bytes-2.patch

messages: +msg261873
2016-03-16 23:45:26vstinnersetnosy: +vstinner
messages: +msg261871
2016-03-16 23:28:10Josh Snidercreate
Supported byThe Python Software Foundation,
Powered byRoundup
Copyright © 1990-2022,Python Software Foundation
Legal Statements

[8]ページ先頭

©2009-2026 Movatter.jp