Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Add more canonical equivalence tests#42

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Conversation

@Jules-Bertholet
Copy link
Contributor

Test that all canonically equivalent sequences in Unicode'sNormalizationTest.txt have the same width.

Currently no changes need to be made to the width logic to ensure these tests pass. However, Unicode 16 is adding a few new characters that will be problematic (the Kirat Rai vowel signs:https://www.unicode.org/charts/PDF/Unicode-16.0/U160-16D40.pdf). Adding this test in advance ensures that we won't forget to account for these changes when the time comes.

Test that all canonically equivalent sequencesin Unicode's `NormalizationTest.txt` have the same width.Currently no changes need to be made to the width logicto ensure these tests pass. However, Unicode 16is adding a few new characters that will be problematic(the Kirat Rai vowel signs:<https://www.unicode.org/charts/PDF/Unicode-16.0/U160-16D40.pdf>).Adding this test in advance ensures that we won't forgetto account for these changes when the time comes.
The new test subsumes it
Ensures CI will keep passing until we choose to update
letmut nfd =String::new();
unicode_normalization::char::decompose_canonical(c, |d| nfd.push(d));
let norm_file =BufReader::new(
File::open("scripts/NormalizationTest.txt")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

issue: I'd prefer to check in the file if it's used in tests.

@Manishearth
Copy link
Member

CI fails

@ManishearthManishearth merged commit74c8394 intounicode-rs:masterApr 30, 2024
@Jules-BertholetJules-Bertholet deleted the more-normalization-tests branchApril 30, 2024 16:00
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment

Reviewers

@ManishearthManishearthManishearth approved these changes

Assignees

No one assigned

Labels

None yet

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

2 participants

@Jules-Bertholet@Manishearth

[8]ページ先頭

©2009-2025 Movatter.jp