
gh-60462: Fix locale.strxfrm() on Solaris #138242


Merged

serhiy-storchaka merged 4 commits into python:main from serhiy-storchaka:locale-strxfrm

Sep 3, 2025


Conversation

Member

serhiy-storchaka commented Aug 29, 2025 • edited by bedevere-app bot

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not as a sequence of Unicode code points or through any other encoding scheme that does not preserve ordering.
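For context, the contract at stake is that comparing two locale.strxfrm() results must agree with locale.strcoll() on the original strings. A minimal sketch of that contract, assuming the portable "C" locale and a made-up word list (neither comes from this PR):

```python
import functools
import locale

# Assumption: the "C" locale, chosen only because it is available everywhere.
# A real application would usually call locale.setlocale(locale.LC_ALL, "").
locale.setlocale(locale.LC_COLLATE, "C")

# Hypothetical sample data.
words = ["pear", "Apple", "banana", "apple"]

# Sorting by the transformed key ...
by_key = sorted(words, key=locale.strxfrm)
# ... should give the same order as sorting with the pairwise comparison.
by_cmp = sorted(words, key=functools.cmp_to_key(locale.strcoll))

print(by_key == by_cmp)
```

If the transformed strings are converted in a way that does not preserve integer order, the two sorts can disagree, which is the kind of failure the collation tests detect.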

Issue: locale.strxfrm() may improperly use PyUnicode_FromWideChar() #138247

Issue: test_local.TestEnUSCollection failures on Solaris 10 #60462

This was referenced Aug 29, 2025

Skip tests failing on Solaris #91214

Open

locale.strxfrm() may improperly use PyUnicode_FromWideChar() #138247

Closed

pythongh-138247: Fix locale.strxfrm()

60a5481

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not a sequence of Unicode code points or using other encoding scheme that does not preserve ordering.

serhiy-storchaka force-pushed the locale-strxfrm branch from ea8283a to 60a5481

August 29, 2025 15:49

serhiy-storchaka changed the title ~~Fix locale.strxfrm()~~ gh-138247: Fix locale.strxfrm()

Aug 29, 2025

Member, Author

serhiy-storchaka commented Aug 29, 2025

!buildbot Solaris

bedevere-bot commented Aug 29, 2025

🤖 New build scheduled with the buildbot fleet by @serhiy-storchaka for commit 60a5481 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F138242%2Fmerge

The command will test the builders whose names match the following regular expression: Solaris

The builders matched are:

SPARCv9 Oracle Solaris 11.4 PR

Member

StanFromIreland commented Aug 29, 2025

test_locale now passes!

0:10:12 load avg: 17.86 [438/492/7] test_locale passed

Add a NEWS entry.

58713db

serhiy-storchaka changed the title ~~gh-138247: Fix locale.strxfrm()~~ gh-138247: Fix locale.strxfrm() on Solaris

Aug 30, 2025

serhiy-storchaka marked this pull request as ready for review

August 30, 2025 07:09

bedevere-app bot added the awaiting core review label

Aug 30, 2025

Contributor

kulikjak commented Sep 1, 2025

Thanks! I tested the patch on Solaris on both SPARC and Intel, and the tests are happy with it.

That said, I am unsure whether it's correct to split the codes only when they are longer than 16 bits. Couldn't that break the ordering?

For example, with the values 0x100FF and 0xF:

0x100FF gets split into 0x1 and 0xFF
0xF remains unchanged

-> comparing element by element, 0x1 < 0xF, but that would not be the case without the split
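The concern above can be reproduced with a small sketch. The helper naive_split is hypothetical: it models splitting wide codes into two 16-bit halves without any marker bit, which is the scheme being questioned here, not necessarily what the patch does.

```python
def naive_split(codes):
    """Split codes wider than 16 bits into (high, low) halves, with no marker.

    Hypothetical helper used only to illustrate the ordering concern.
    """
    out = []
    for c in codes:
        if c > 0xFFFF:
            out.extend((c >> 16, c & 0xFFFF))
        else:
            out.append(c)
    return out


a, b = [0x100FF], [0xF]

original_order = a < b                             # False: 0x100FF > 0xF
split_order = naive_split(a) < naive_split(b)      # True: [0x1, 0xFF] < [0xF]

print(original_order, split_order)
```

The two comparisons disagree, so a split without extra information would not be order-preserving.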

Contributor

kulikjak commented Sep 1, 2025

BTW, we are using a similar patch on Solaris:
https://github.com/oracle/solaris-userland/blob/master/components/python/python313/patches/24-strxfrm-fix.patch

Member, Author

serhiy-storchaka commented Sep 1, 2025

Note | 0x10000u. 0x100FF gets split into 0x10001 and 0xFF. It is larger than any unchanged value.
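A sketch of the scheme described in this reply, written in Python rather than the C of Modules/_localemodule.c. The names MARK and encode, the exact threshold, and the sample values are assumptions for illustration, not the patch's code:

```python
from itertools import product

MARK = 0x10000  # flag OR-ed into the high half, as in "| 0x10000u"


def encode(codes):
    """Leave 16-bit codes unchanged; split wider codes into a marked pair."""
    out = []
    for c in codes:
        if c > 0xFFFF:
            out.append((c >> 16) | MARK)  # always above every unchanged code
            out.append(c & 0xFFFF)
        else:
            out.append(c)
    return out


# Hypothetical sample of 32-bit codes around the interesting boundaries.
samples = [0x0, 0xF, 0xFF, 0xFFFF, 0x10000, 0x100FF,
           0x1FFFF, 0xFFFF0000, 0xFFFFFFFF]
sequences = [list(p) for n in (1, 2) for p in product(samples, repeat=n)]

# Check that lexicographic order is the same before and after encoding.
order_preserved = all(
    (a < b) == (encode(a) < encode(b))
    for a in sequences
    for b in sequences
)
print(encode([0x100FF]), order_preserved)
```

Because every marked high half is at least 0x10001, it compares greater than any code that was left unchanged, and every emitted value stays below 0x20000, inside the Unicode code point range.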

Member, Author

serhiy-storchaka commented Sep 2, 2025

> BTW, we are using similar patch on Solaris:

Yes, it is surprisingly similar. You don't need to add 0x10000 if you split every character. My implementation needs this because it leaves 16-bit codes unchanged (this saves memory and time).

More important, PyUnicode_FromWideChar() should not be used here, because it changes order on Solaris.
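To illustrate why an encoding-aware conversion can reorder results, here is a generic example using UTF-16 surrogate pairs. It is an assumption chosen for illustration and is not the Solaris-specific mechanism discussed in this thread; the helper decode_utf16 and the two sample sequences are hypothetical.

```python
import struct

# Two hypothetical transformed results, viewed as plain 16-bit integers.
a = [0xD83D, 0xDE0D]
b = [0xFF41]

as_integers = a < b  # 0xD83D < 0xFF41, so a sorts first


def decode_utf16(units):
    """Interpret the integers as UTF-16 code units (an encoding-aware view)."""
    return struct.pack(f"<{len(units)}H", *units).decode("utf-16-le")


# The surrogate pair collapses into the single code point U+1F60D,
# which compares greater than U+FF41, so the order flips.
as_text = decode_utf16(a) < decode_utf16(b)

print(as_integers, as_text)
```

The same pair of results compares differently depending on whether it is treated as abstract integers or decoded as text, which is why the conversion must not reinterpret the values.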

serhiy-storchaka changed the title ~~gh-138247: Fix locale.strxfrm() on Solaris~~ gh-60462: Fix locale.strxfrm() on Solaris

Sep 2, 2025

bedevere-app bot mentioned this pull request

Sep 2, 2025

test_local.TestEnUSCollection failures on Solaris 10 #60462

Closed

Move to pythongh-60462.

9aa3c9d

serhiy-storchaka requested a review from vstinner

September 2, 2025 07:24

Contributor

kulikjak commented Sep 2, 2025

> Note | 0x10000u. 0x100FF gets split into 0x10001 and 0xFF. It is larger than any unchanged value.

Oh, I completely overlooked that | 0x10000u; part. Thanks for pointing that out.

> More important, PyUnicode_FromWideChar() should not be used here, because it changes order on Solaris.

That's true. I don't know if it can change order in our case, but it certainly shouldn't go through that HAVE_NON_UNICODE_WCHAR_T_REPRESENTATION-specific conversion we have there.

kulikjak approved these changes

Sep 2, 2025


vstinner reviewed

Sep 2, 2025

Modules/_localemodule.c

Ensure that it works for signed wchar_t.

349c3b2

vstinner approved these changes

Sep 3, 2025

Member

vstinner left a comment

LGTM

bedevere-app bot added the awaiting merge label and removed the awaiting core review label

Sep 3, 2025

serhiy-storchaka merged commit 482fd0c into python:main

Sep 3, 2025

49 checks passed

bedevere-app bot removed the awaiting merge label

Sep 3, 2025

serhiy-storchaka deleted the locale-strxfrm branch

September 3, 2025 12:49

serhiy-storchaka added the needs backport to 3.13 (bugs and security fixes) and needs backport to 3.14 (bugs and security fixes) labels

Sep 3, 2025

miss-islington-app bot commented Sep 3, 2025

Thanks @serhiy-storchaka for the PR 🌮🎉.. I'm working now to backport this PR to: 3.14.
🐍🍒⛏🤖 I'm not a witch! I'm not a witch!

miss-islington-app bot commented Sep 3, 2025

Thanks @serhiy-storchaka for the PR 🌮🎉.. I'm working now to backport this PR to: 3.13.
🐍🍒⛏🤖

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request

Sep 3, 2025

pythongh-60462: Fix locale.strxfrm() on Solaris (pythonGH-138242)

d274343

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not a sequence of Unicode code points or using other encoding scheme that does not preserve ordering.

(cherry picked from commit 482fd0c)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request

Sep 3, 2025

pythongh-60462: Fix locale.strxfrm() on Solaris (pythonGH-138242)

733994b

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not a sequence of Unicode code points or using other encoding scheme that does not preserve ordering.

(cherry picked from commit 482fd0c)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>

bedevere-app bot commented Sep 3, 2025

GH-138448 is a backport of this pull request to the 3.14 branch.

bedevere-app bot removed the needs backport to 3.14 (bugs and security fixes) label

Sep 3, 2025

bedevere-app bot commented Sep 3, 2025

GH-138449 is a backport of this pull request to the 3.13 branch.

bedevere-app bot removed the needs backport to 3.13 (bugs and security fixes) label

Sep 3, 2025

serhiy-storchaka added a commit that referenced this pull request

Sep 3, 2025

[3.13] gh-60462: Fix locale.strxfrm() on Solaris (GH-138242) (GH-138449)

a7fd73e

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not a sequence of Unicode code points or using other encoding scheme that does not preserve ordering.

(cherry picked from commit 482fd0c)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>

lkollar pushed a commit to lkollar/cpython that referenced this pull request

Sep 9, 2025

pythongh-60462: Fix locale.strxfrm() on Solaris (pythonGH-138242)

c007c30

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not a sequence of Unicode code points or using other encoding scheme that does not preserve ordering.

encukou pushed a commit that referenced this pull request

Oct 8, 2025

[3.14] gh-60462: Fix locale.strxfrm() on Solaris (GH-138242) (GH-138448)

5579708

It should interpret the result of wcsxfrm() as a sequence of abstract integers, not a sequence of Unicode code points or using other encoding scheme that does not preserve ordering.

(cherry picked from commit 482fd0c)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
