forked frompostgres/postgres
- Notifications
You must be signed in to change notification settings - Fork6
Commitfd58231
committed
Sync our Snowball stemmer dictionaries with current upstream.
We haven't touched these since text search functionality landed in corein 2007 :-(. While the upstream project isn't a beehive of activity,they do make additions and bug fixes from time to time. Update ourcopies of these files.Also update our documentation about how to keep things in sync, sincethey're not making distribution tarballs these days. Fortunately,their source code turns out to be a breeze to build.Notable changes:* The non-UTF8 version of the hungarian stemmer now works in LATIN2not LATIN1.* New stemmers have appeared for arabic, indonesian, irish, lithuanian,nepali, and tamil. These all work in UTF8, and the indonesian andirish ones also work in LATIN1.(There are some new stemmers that I did not incorporate, mainly becausetheir names don't match the underlying languages, suggesting that they'renot to be considered mainstream.)Worth noting: the upstream Nepali dictionary was contributed byArthur Zakirov.initdb forced because the contents of snowball_create.sql havechanged.Still TODO: see about updating the stopword lists.Arthur Zakirov, minor mods and doc work by meDiscussion:https://postgr.es/m/20180626122025.GA12647@zakirov.localdomainDiscussion:https://postgr.es/m/20180219140849.GA9050@zakirov.localdomain1 parentb076eb7 commitfd58231
File tree
88 files changed
+13093
-7303
lines changed- doc/src/sgml
- src
- backend/snowball
- libstemmer
- stopwords
- bin/initdb
- include
- catalog
- snowball/libstemmer
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
88 files changed
+13093
-7303
lines changedLines changed: 8 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
3792 | 3792 |
| |
3793 | 3793 |
| |
3794 | 3794 |
| |
3795 |
| - | |
3796 |
| - | |
| 3795 | + | |
| 3796 | + | |
3797 | 3797 |
| |
| 3798 | + | |
3798 | 3799 |
| |
3799 | 3800 |
| |
3800 | 3801 |
| |
3801 | 3802 |
| |
3802 | 3803 |
| |
3803 | 3804 |
| |
3804 | 3805 |
| |
| 3806 | + | |
| 3807 | + | |
3805 | 3808 |
| |
| 3809 | + | |
| 3810 | + | |
3806 | 3811 |
| |
3807 | 3812 |
| |
3808 | 3813 |
| |
3809 | 3814 |
| |
3810 | 3815 |
| |
3811 | 3816 |
| |
3812 | 3817 |
| |
| 3818 | + | |
3813 | 3819 |
| |
3814 | 3820 |
| |
3815 | 3821 |
| |
|
Lines changed: 15 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
23 | 23 |
| |
24 | 24 |
| |
25 | 25 |
| |
26 |
| - | |
| 26 | + | |
| 27 | + | |
27 | 28 |
| |
28 | 29 |
| |
29 | 30 |
| |
30 | 31 |
| |
31 | 32 |
| |
32 | 33 |
| |
| 34 | + | |
33 | 35 |
| |
34 | 36 |
| |
| 37 | + | |
35 | 38 |
| |
36 | 39 |
| |
37 | 40 |
| |
38 | 41 |
| |
39 | 42 |
| |
40 | 43 |
| |
41 | 44 |
| |
| 45 | + | |
| 46 | + | |
42 | 47 |
| |
| 48 | + | |
| 49 | + | |
43 | 50 |
| |
44 | 51 |
| |
45 | 52 |
| |
46 | 53 |
| |
47 | 54 |
| |
48 | 55 |
| |
49 | 56 |
| |
| 57 | + | |
50 | 58 |
| |
51 | 59 |
| |
52 | 60 |
| |
53 | 61 |
| |
54 | 62 |
| |
55 | 63 |
| |
56 | 64 |
| |
| 65 | + | |
57 | 66 |
| |
58 | 67 |
| |
59 | 68 |
| |
60 | 69 |
| |
61 | 70 |
| |
62 | 71 |
| |
63 | 72 |
| |
| 73 | + | |
| 74 | + | |
64 | 75 |
| |
| 76 | + | |
| 77 | + | |
65 | 78 |
| |
66 | 79 |
| |
67 | 80 |
| |
68 | 81 |
| |
69 | 82 |
| |
70 | 83 |
| |
| 84 | + | |
71 | 85 |
| |
72 | 86 |
| |
73 | 87 |
| |
|
Lines changed: 34 additions & 19 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
4 | 4 |
| |
5 | 5 |
| |
6 | 6 |
| |
7 |
| - | |
| 7 | + | |
8 | 8 |
| |
9 | 9 |
| |
10 |
| - | |
11 |
| - | |
12 |
| - | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
13 | 25 |
| |
14 |
| - | |
15 |
| - | |
16 |
| - | |
17 |
| - | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
18 | 34 |
| |
19 |
| - | |
20 |
| - | |
| 35 | + | |
21 | 36 |
| |
22 |
| - | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
23 | 40 |
| |
24 | 41 |
| |
25 |
| - | |
| 42 | + | |
26 | 43 |
| |
27 | 44 |
| |
28 | 45 |
| |
29 | 46 |
| |
30 |
| - | |
31 |
| - | |
32 |
| - | |
33 |
| - | |
| 47 | + | |
34 | 48 |
| |
35 | 49 |
| |
36 | 50 |
| |
37 | 51 |
| |
38 | 52 |
| |
39 |
| - | |
| 53 | + | |
40 | 54 |
| |
41 | 55 |
| |
42 | 56 |
| |
43 | 57 |
| |
44 | 58 |
| |
45 |
| - | |
| 59 | + | |
| 60 | + | |
46 | 61 |
| |
47 | 62 |
| |
48 |
| - | |
| 63 | + | |
49 | 64 |
|
Lines changed: 53 additions & 33 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
32 | 32 |
| |
33 | 33 |
| |
34 | 34 |
| |
35 |
| - | |
| 35 | + | |
| 36 | + | |
36 | 37 |
| |
37 | 38 |
| |
38 | 39 |
| |
39 | 40 |
| |
40 | 41 |
| |
41 | 42 |
| |
| 43 | + | |
42 | 44 |
| |
43 | 45 |
| |
| 46 | + | |
44 | 47 |
| |
45 | 48 |
| |
46 | 49 |
| |
47 | 50 |
| |
48 | 51 |
| |
49 | 52 |
| |
50 | 53 |
| |
| 54 | + | |
| 55 | + | |
51 | 56 |
| |
| 57 | + | |
| 58 | + | |
52 | 59 |
| |
53 | 60 |
| |
54 | 61 |
| |
55 | 62 |
| |
56 | 63 |
| |
57 | 64 |
| |
58 | 65 |
| |
| 66 | + | |
59 | 67 |
| |
60 | 68 |
| |
61 | 69 |
| |
| |||
74 | 82 |
| |
75 | 83 |
| |
76 | 84 |
| |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
77 | 89 |
| |
78 | 90 |
| |
79 | 91 |
| |
80 | 92 |
| |
81 | 93 |
| |
82 |
| - | |
83 |
| - | |
84 |
| - | |
85 |
| - | |
86 |
| - | |
87 |
| - | |
88 |
| - | |
89 |
| - | |
90 |
| - | |
91 |
| - | |
92 |
| - | |
93 |
| - | |
94 |
| - | |
95 |
| - | |
96 |
| - | |
97 |
| - | |
98 |
| - | |
99 |
| - | |
100 |
| - | |
101 |
| - | |
102 |
| - | |
103 |
| - | |
104 |
| - | |
105 |
| - | |
106 |
| - | |
107 |
| - | |
108 |
| - | |
109 |
| - | |
110 |
| - | |
111 |
| - | |
112 |
| - | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
113 | 133 |
| |
114 | 134 |
| |
115 | 135 |
| |
116 | 136 |
| |
117 | 137 |
| |
118 |
| - | |
| 138 | + | |
119 | 139 |
| |
120 | 140 |
| |
121 | 141 |
| |
|
0 commit comments
Comments
(0)