forked frompostgres/postgres
- Notifications
You must be signed in to change notification settings - Fork6
Commit1850184
committed
Add simple codepoint redirections to unaccent.rules.
Previously we searched for code points where the Unicode data filelisted an equivalent combining character sequence that added accents.Some codepoints redirect to a single other codepoint, instead of doingany combining. We can follow those references recursively to get theanswer.Per bug report #18362, which reported missing Ancient Greek characters.Specifically, precomposed characters with oxia (from the polytonicaccent system used for old Greek) just point to precomposed characterswith tonos (from the monotonic accent system for modern Greek), and wehave to follow the extra hop to find out that they are composed withan acute accent.Besides those, the new rule also:* pulls in a lot of 'Mathematical Alphanumeric Symbols', which are copies of the Latin and Greek alphabets and numbers rendered in different typefaces, and* corrects a single mathematical letter that previously came from the CLDR transliteration file, but the new rule extracts from the main Unicode database file, where clearly the latter is right and the former is a wrong (reported to CLDR).Reported-by: Cees van Zeeland <cees.van.zeeland@freedom.nl>Reviewed-by: Robert Haas <robertmhaas@gmail.com>Reviewed-by: Peter Eisentraut <peter@eisentraut.org>Reviewed-by: Michael Paquier <michael@paquier.xyz>Discussion:https://postgr.es/m/18362-be6d0cfe122b6354%40postgresql.org1 parent1eff827 commit1850184
File tree
3 files changed
+1025
-9
lines changed- contrib/unaccent
- expected
3 files changed
+1025
-9
lines changedLines changed: 1 addition & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
176 | 176 |
| |
177 | 177 |
| |
178 | 178 |
| |
179 |
| - | |
| 179 | + | |
180 | 180 |
| |
181 | 181 |
|
Lines changed: 12 additions & 7 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
104 | 104 |
| |
105 | 105 |
| |
106 | 106 |
| |
107 |
| - | |
108 |
| - | |
109 |
| - | |
110 |
| - | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
111 | 112 |
| |
112 | 113 |
| |
113 | 114 |
| |
| |||
148 | 149 |
| |
149 | 150 |
| |
150 | 151 |
| |
151 |
| - | |
152 |
| - | |
| 152 | + | |
153 | 153 |
| |
154 | 154 |
| |
155 | 155 |
| |
| |||
200 | 200 |
| |
201 | 201 |
| |
202 | 202 |
| |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
203 | 208 |
| |
204 | 209 |
| |
205 | 210 |
| |
| |||
251 | 256 |
| |
252 | 257 |
| |
253 | 258 |
| |
254 |
| - | |
| 259 | + | |
255 | 260 |
| |
256 | 261 |
| |
257 | 262 |
| |
|
0 commit comments
Comments
(0)