Uh oh!
There was an error while loading.Please reload this page.
- Notifications
You must be signed in to change notification settings - Fork32k
Commitb579dba
committed
#1486713: Add a tolerant mode to HTMLParser.
The motivation for adding this option is that the the functionality itprovides used to be provided by sgmllib in Python2, and was used by,for example, BeautifulSoup. Without this option, the Python3 versionof BeautifulSoup and the many programs that use it are crippled.The original patch was by 'kxroberto'. I modified it heavily but kept hisheuristics and test. I also added additional heuristics to fix #975556,#1046092, and part of#6191. This patch should be completely backwardcompatible: the behavior with the default strict=True is unchanged.1 parent79cdb66 commitb579dba
File tree
4 files changed
+139
-24
lines changed- Doc/library
- Lib
- html
- test
- Misc
4 files changed
+139
-24
lines changedLines changed: 11 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
12 | 12 |
| |
13 | 13 |
| |
14 | 14 |
| |
15 |
| - | |
| 15 | + | |
16 | 16 |
| |
17 |
| - | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
18 | 22 |
| |
19 | 23 |
| |
20 | 24 |
| |
| |||
191 | 195 |
| |
192 | 196 |
| |
193 | 197 |
| |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + |
Lines changed: 83 additions & 16 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
24 | 24 |
| |
25 | 25 |
| |
26 | 26 |
| |
| 27 | + | |
| 28 | + | |
27 | 29 |
| |
28 | 30 |
| |
29 | 31 |
| |
30 |
| - | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
31 | 35 |
| |
32 | 36 |
| |
33 | 37 |
| |
| |||
42 | 46 |
| |
43 | 47 |
| |
44 | 48 |
| |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
45 | 64 |
| |
46 | 65 |
| |
47 | 66 |
| |
| |||
86 | 105 |
| |
87 | 106 |
| |
88 | 107 |
| |
| 108 | + | |
| 109 | + | |
89 | 110 |
| |
90 |
| - | |
91 |
| - | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
92 | 117 |
| |
93 | 118 |
| |
94 | 119 |
| |
| |||
160 | 185 |
| |
161 | 186 |
| |
162 | 187 |
| |
163 |
| - | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
164 | 191 |
| |
165 |
| - | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
166 | 200 |
| |
167 | 201 |
| |
168 | 202 |
| |
| |||
193 | 227 |
| |
194 | 228 |
| |
195 | 229 |
| |
196 |
| - | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
197 | 236 |
| |
198 | 237 |
| |
199 | 238 |
| |
| |||
240 | 279 |
| |
241 | 280 |
| |
242 | 281 |
| |
243 |
| - | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
244 | 286 |
| |
245 | 287 |
| |
246 | 288 |
| |
| |||
262 | 304 |
| |
263 | 305 |
| |
264 | 306 |
| |
265 |
| - | |
266 |
| - | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
| 310 | + | |
| 311 | + | |
267 | 312 |
| |
268 | 313 |
| |
269 | 314 |
| |
| |||
277 | 322 |
| |
278 | 323 |
| |
279 | 324 |
| |
280 |
| - | |
| 325 | + | |
| 326 | + | |
| 327 | + | |
| 328 | + | |
281 | 329 |
| |
282 | 330 |
| |
283 | 331 |
| |
| |||
290 | 338 |
| |
291 | 339 |
| |
292 | 340 |
| |
293 |
| - | |
294 |
| - | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
295 | 348 |
| |
296 | 349 |
| |
297 | 350 |
| |
| |||
300 | 353 |
| |
301 | 354 |
| |
302 | 355 |
| |
303 |
| - | |
304 |
| - | |
| 356 | + | |
| 357 | + | |
| 358 | + | |
| 359 | + | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
305 | 363 |
| |
306 | 364 |
| |
307 | 365 |
| |
| |||
314 | 372 |
| |
315 | 373 |
| |
316 | 374 |
| |
317 |
| - | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
318 | 384 |
| |
319 | 385 |
| |
320 | 386 |
| |
| |||
358 | 424 |
| |
359 | 425 |
| |
360 | 426 |
| |
361 |
| - | |
| 427 | + | |
| 428 | + | |
362 | 429 |
| |
363 | 430 |
| |
364 | 431 |
| |
|
Lines changed: 42 additions & 6 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
11 |
| - | |
| 11 | + | |
12 | 12 |
| |
13 | 13 |
| |
14 |
| - | |
| 14 | + | |
15 | 15 |
| |
16 | 16 |
| |
17 | 17 |
| |
| |||
72 | 72 |
| |
73 | 73 |
| |
74 | 74 |
| |
75 |
| - | |
76 |
| - | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
77 | 79 |
| |
78 | 80 |
| |
79 | 81 |
| |
| |||
84 | 86 |
| |
85 | 87 |
| |
86 | 88 |
| |
87 |
| - | |
| 89 | + | |
88 | 90 |
| |
89 | 91 |
| |
90 | 92 |
| |
| |||
321 | 323 |
| |
322 | 324 |
| |
323 | 325 |
| |
| 326 | + | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
| 355 | + | |
| 356 | + | |
| 357 | + | |
| 358 | + | |
| 359 | + | |
324 | 360 |
| |
325 |
| - | |
| 361 | + | |
326 | 362 |
| |
327 | 363 |
| |
328 | 364 |
| |
|
Lines changed: 3 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
58 | 58 |
| |
59 | 59 |
| |
60 | 60 |
| |
| 61 | + | |
| 62 | + | |
| 63 | + | |
61 | 64 |
| |
62 | 65 |
| |
63 | 66 |
| |
|
0 commit comments
Comments
(0)