forked frompostgres/postgres
- Notifications
You must be signed in to change notification settings - Fork6
Commit08c0d6a
committed
Invent "rainbow" arcs within the regex engine.
Some regular expression constructs, most notably the "." match-anythingmetacharacter, produce a sheaf of parallel NFA arcs covering allpossible colors (that is, character equivalence classes). We can makea noticeable improvement in the space and time needed to process largeregexes by replacing such cases with a single arc bearing the specialcolor code "RAINBOW". This requires only minor additional complicationin places such as pull() and push().Callers of pg_reg_getoutarcs() must now be prepared for the possibilityof seeing a RAINBOW arc. For the one known user, contrib/pg_trgm,that's a net benefit since it cuts the number of arcs to be dealt with,and the handling isn't any different than for other colors that containtoo many characters to be dealt with individually.This is part of a patch series that in total reduces the regex engine'sruntime by about a factor of four on a large corpus of real-world regexes.Patch by me, reviewed by Joel JacobsonDiscussion:https://postgr.es/m/1340281.1613018383@sss.pgh.pa.us1 parent1766118 commit08c0d6a
File tree
10 files changed
+177
-37
lines changed- contrib/pg_trgm
- src
- backend/regex
- include/regex
10 files changed
+177
-37
lines changedLines changed: 18 additions & 9 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
282 | 282 |
| |
283 | 283 |
| |
284 | 284 |
| |
285 |
| - | |
286 |
| - | |
| 285 | + | |
| 286 | + | |
287 | 287 |
| |
288 | 288 |
| |
289 | 289 |
| |
| |||
780 | 780 |
| |
781 | 781 |
| |
782 | 782 |
| |
783 |
| - | |
| 783 | + | |
| 784 | + | |
784 | 785 |
| |
785 | 786 |
| |
786 | 787 |
| |
| |||
1098 | 1099 |
| |
1099 | 1100 |
| |
1100 | 1101 |
| |
1101 |
| - | |
| 1102 | + | |
1102 | 1103 |
| |
1103 |
| - | |
| 1104 | + | |
1104 | 1105 |
| |
1105 | 1106 |
| |
1106 | 1107 |
| |
| |||
1156 | 1157 |
| |
1157 | 1158 |
| |
1158 | 1159 |
| |
| 1160 | + | |
| 1161 | + | |
| 1162 | + | |
| 1163 | + | |
| 1164 | + | |
| 1165 | + | |
| 1166 | + | |
| 1167 | + | |
1159 | 1168 |
| |
1160 | 1169 |
| |
1161 | 1170 |
| |
| |||
1216 | 1225 |
| |
1217 | 1226 |
| |
1218 | 1227 |
| |
1219 |
| - | |
1220 |
| - | |
1221 |
| - | |
1222 |
| - | |
| 1228 | + | |
| 1229 | + | |
| 1230 | + | |
| 1231 | + | |
1223 | 1232 |
| |
1224 | 1233 |
| |
1225 | 1234 |
| |
|
Lines changed: 28 additions & 8 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
261 | 261 |
| |
262 | 262 |
| |
263 | 263 |
| |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
264 | 276 |
| |
265 | 277 |
| |
266 | 278 |
| |
| |||
349 | 361 |
| |
350 | 362 |
| |
351 | 363 |
| |
| 364 | + | |
| 365 | + | |
352 | 366 |
| |
353 | 367 |
| |
354 | 368 |
| |
355 | 369 |
| |
356 | 370 |
| |
357 | 371 |
| |
358 | 372 |
| |
359 |
| - | |
| 373 | + | |
360 | 374 |
| |
361 | 375 |
| |
362 | 376 |
| |
363 |
| - | |
| 377 | + | |
364 | 378 |
| |
365 | 379 |
| |
366 | 380 |
| |
| |||
396 | 410 |
| |
397 | 411 |
| |
398 | 412 |
| |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
399 | 420 |
| |
400 | 421 |
| |
401 | 422 |
| |
402 | 423 |
| |
403 | 424 |
| |
404 |
| - | |
405 |
| - | |
406 |
| - | |
407 |
| - | |
408 |
| - | |
409 |
| - | |
| 425 | + | |
| 426 | + | |
| 427 | + | |
| 428 | + | |
| 429 | + |
Lines changed: 21 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
977 | 977 |
| |
978 | 978 |
| |
979 | 979 |
| |
| 980 | + | |
980 | 981 |
| |
981 | 982 |
| |
982 | 983 |
| |
| |||
994 | 995 |
| |
995 | 996 |
| |
996 | 997 |
| |
| 998 | + | |
997 | 999 |
| |
998 | 1000 |
| |
999 | 1001 |
| |
| |||
1012 | 1014 |
| |
1013 | 1015 |
| |
1014 | 1016 |
| |
| 1017 | + | |
| 1018 | + | |
| 1019 | + | |
1015 | 1020 |
| |
1016 | 1021 |
| |
1017 | 1022 |
| |
| |||
1025 | 1030 |
| |
1026 | 1031 |
| |
1027 | 1032 |
| |
| 1033 | + | |
| 1034 | + | |
| 1035 | + | |
| 1036 | + | |
| 1037 | + | |
| 1038 | + | |
| 1039 | + | |
1028 | 1040 |
| |
1029 | 1041 |
| |
1030 | 1042 |
| |
| |||
1034 | 1046 |
| |
1035 | 1047 |
| |
1036 | 1048 |
| |
| 1049 | + | |
| 1050 | + | |
| 1051 | + | |
1037 | 1052 |
| |
1038 | 1053 |
| |
1039 | 1054 |
| |
1040 | 1055 |
| |
1041 | 1056 |
| |
1042 | 1057 |
| |
1043 |
| - | |
| 1058 | + | |
1044 | 1059 |
| |
1045 | 1060 |
| |
1046 | 1061 |
| |
| |||
1049 | 1064 |
| |
1050 | 1065 |
| |
1051 | 1066 |
| |
| 1067 | + | |
| 1068 | + | |
| 1069 | + | |
| 1070 | + | |
| 1071 | + | |
1052 | 1072 |
| |
1053 | 1073 |
| |
1054 | 1074 |
| |
|
Lines changed: 74 additions & 8 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
271 | 271 |
| |
272 | 272 |
| |
273 | 273 |
| |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
274 | 279 |
| |
275 | 280 |
| |
276 | 281 |
| |
| |||
1170 | 1175 |
| |
1171 | 1176 |
| |
1172 | 1177 |
| |
| 1178 | + | |
| 1179 | + | |
| 1180 | + | |
1173 | 1181 |
| |
1174 | 1182 |
| |
1175 | 1183 |
| |
| |||
1181 | 1189 |
| |
1182 | 1190 |
| |
1183 | 1191 |
| |
| 1192 | + | |
1184 | 1193 |
| |
1185 | 1194 |
| |
| 1195 | + | |
| 1196 | + | |
1186 | 1197 |
| |
| 1198 | + | |
1187 | 1199 |
| |
1188 | 1200 |
| |
1189 | 1201 |
| |
| |||
1597 | 1609 |
| |
1598 | 1610 |
| |
1599 | 1611 |
| |
1600 |
| - | |
| 1612 | + | |
1601 | 1613 |
| |
1602 | 1614 |
| |
1603 | 1615 |
| |
| |||
1624 | 1636 |
| |
1625 | 1637 |
| |
1626 | 1638 |
| |
| 1639 | + | |
| 1640 | + | |
| 1641 | + | |
| 1642 | + | |
1627 | 1643 |
| |
1628 | 1644 |
| |
1629 | 1645 |
| |
| |||
1764 | 1780 |
| |
1765 | 1781 |
| |
1766 | 1782 |
| |
1767 |
| - | |
| 1783 | + | |
1768 | 1784 |
| |
1769 | 1785 |
| |
1770 | 1786 |
| |
| |||
1791 | 1807 |
| |
1792 | 1808 |
| |
1793 | 1809 |
| |
| 1810 | + | |
| 1811 | + | |
| 1812 | + | |
| 1813 | + | |
1794 | 1814 |
| |
1795 | 1815 |
| |
1796 | 1816 |
| |
| |||
1810 | 1830 |
| |
1811 | 1831 |
| |
1812 | 1832 |
| |
| 1833 | + | |
1813 | 1834 |
| |
1814 | 1835 |
| |
1815 |
| - | |
| 1836 | + | |
| 1837 | + | |
1816 | 1838 |
| |
1817 | 1839 |
| |
1818 | 1840 |
| |
| |||
1827 | 1849 |
| |
1828 | 1850 |
| |
1829 | 1851 |
| |
| 1852 | + | |
| 1853 | + | |
| 1854 | + | |
| 1855 | + | |
| 1856 | + | |
| 1857 | + | |
| 1858 | + | |
| 1859 | + | |
| 1860 | + | |
| 1861 | + | |
| 1862 | + | |
| 1863 | + | |
| 1864 | + | |
| 1865 | + | |
1830 | 1866 |
| |
1831 | 1867 |
| |
1832 | 1868 |
| |
1833 | 1869 |
| |
1834 |
| - | |
| 1870 | + | |
| 1871 | + | |
| 1872 | + | |
| 1873 | + | |
| 1874 | + | |
1835 | 1875 |
| |
1836 | 1876 |
| |
1837 | 1877 |
| |
| 1878 | + | |
| 1879 | + | |
| 1880 | + | |
| 1881 | + | |
| 1882 | + | |
| 1883 | + | |
| 1884 | + | |
| 1885 | + | |
| 1886 | + | |
| 1887 | + | |
| 1888 | + | |
| 1889 | + | |
| 1890 | + | |
| 1891 | + | |
1838 | 1892 |
| |
1839 | 1893 |
| |
1840 | 1894 |
| |
| |||
2895 | 2949 |
| |
2896 | 2950 |
| |
2897 | 2951 |
| |
| 2952 | + | |
2898 | 2953 |
| |
2899 | 2954 |
| |
2900 | 2955 |
| |
| |||
3068 | 3123 |
| |
3069 | 3124 |
| |
3070 | 3125 |
| |
3071 |
| - | |
| 3126 | + | |
| 3127 | + | |
| 3128 | + | |
| 3129 | + | |
3072 | 3130 |
| |
3073 | 3131 |
| |
3074 |
| - | |
| 3132 | + | |
| 3133 | + | |
| 3134 | + | |
| 3135 | + | |
3075 | 3136 |
| |
3076 | 3137 |
| |
3077 |
| - | |
| 3138 | + | |
| 3139 | + | |
| 3140 | + | |
| 3141 | + | |
3078 | 3142 |
| |
3079 | 3143 |
| |
3080 | 3144 |
| |
| |||
3161 | 3225 |
| |
3162 | 3226 |
| |
3163 | 3227 |
| |
3164 |
| - | |
| 3228 | + | |
| 3229 | + | |
| 3230 | + | |
3165 | 3231 |
| |
3166 | 3232 |
| |
3167 | 3233 |
| |
|
0 commit comments
Comments
(0)