Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Fix #61072: inconsistent fullmatch results with regex alternation#61343

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Open
Pedro-Santos04 wants to merge7 commits intopandas-dev:main
base:main
Choose a base branch
Loading
fromPedro-Santos04:bug#61072

Conversation

Pedro-Santos04
Copy link

in PyArrow strings
Fixes an issue where regex patterns with alternation (|) produce different results between str dtype and string[pyarrow] dtype. When using patterns like "(as)|(as)", PyArrow implementation would incorrectly match "asdf" while Python's implementation correctly rejects it. The fix adds special handling to ensure alternation patterns are properly parenthesized when using PyArrow-backed strings

…nationin PyArrow stringsFixes an issue where regex patterns with alternation (|) producedifferent results between str dtype and string[pyarrow] dtype. Whenusing patterns like "(as)|(as)", PyArrow implementation wouldincorrectly match "asdf" while Python's implementation correctlyrejects it. The fix adds special handling to ensure alternationpatterns are properly parenthesized when using PyArrow-backedstrings
)
if is_pyarrow and "|" in pat:

def _is_fully_wrapped(pattern):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

This appears to be an improper solution to me. Can you explain why the current code path fails?

@rhshadrachrhshadrach added Bug StringsString extension data type and string data labelsMay 19, 2025
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Reviewers

@rhshadrachrhshadrachrhshadrach left review comments

Assignees
No one assigned
Labels
BugStringsString extension data type and string data
Projects
None yet
Milestone
No milestone
Development

Successfully merging this pull request may close these issues.

BUG: str.fullmatch behavior is not the same for object dtype and string[pyarrow] dtype
2 participants
@Pedro-Santos04@rhshadrach

[8]ページ先頭

©2009-2025 Movatter.jp