Commit e0dc25f

Fix attribute order to the treebuilder to be document order

Somehow I managed to screw this up so it became reverse document order!

1 parent a3b8252, commit e0dc25f

File tree

3 files changed, +39 -5 lines changed

CHANGES.rst
html5lib
- html5parser.py
- tests
  - test_parser2.py

`CHANGES.rst`

Lines changed: 3 additions & 2 deletions

Original file line number	Diff line number	Diff line change
`@@ -4,9 +4,10 @@ Change Log`
`4`	`4`	`0.999999999/1.0b10`
`5`	`5`	`~~~~~~~~~~~~~~~~~~`
`6`	`6`
`7`		`-Released on XXX`
	`7`	`+Released on July 15, 2016`
`8`	`8`
`9`		`-* XXX`
	`9`	`+* Fix attribute order going to the tree builder to be document order`
	`10`	`+ instead of reverse document order(!).`
`10`	`11`
`11`	`12`
`12`	`13`	`0.99999999/1.0b9`

`html5lib/html5parser.py`

Lines changed: 5 additions & 1 deletion

Original file line number	Diff line number	Diff line change
`@@ -265,7 +265,11 @@ def normalizeToken(self, token):`
`265`	`265`	`        """ HTML5 specific normalizations to the token stream """`
`266`	`266`
`267`	`267`	`        if token["type"] == tokenTypes["StartTag"]:`
`268`		`-            token["data"] = OrderedDict(token['data'][::-1])`
	`268`	`+            raw = token["data"]`
	`269`	`+            token["data"] = OrderedDict(raw)`
	`270`	`+            if len(raw) > len(token["data"]):`
	`271`	`+                # we had some duplicated attribute, fix so first wins`
	`272`	`+                token["data"].update(raw[::-1])`
`269`	`273`
`270`	`274`	`        return token`
`271`	`275`
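
The new branch can be tried in isolation. The following is a minimal sketch, assuming only the standard library; `normalize_attrs` is a hypothetical helper that copies the three added lines, not a function in html5lib. It suggests why the second pass is needed: `OrderedDict(raw)` keeps document order but lets the last duplicate win, and replaying the pairs in reverse restores the first value without moving any key.

```python
from collections import OrderedDict


def normalize_attrs(raw):
    """Hypothetical helper mirroring the StartTag branch of normalizeToken."""
    # Keys are inserted in document order; for a repeated name the last value wins.
    data = OrderedDict(raw)
    # Fewer keys than pairs means at least one attribute name was repeated.
    if len(raw) > len(data):
        # Updating an existing key keeps its position, so replaying the pairs
        # in reverse leaves the first occurrence's value in place.
        data.update(raw[::-1])
    return data


raw = [("class", "a"), ("id", "x"), ("class", "b")]
result = normalize_attrs(raw)
print(list(result.items()))  # [('class', 'a'), ('id', 'x')]
```

The length check means the reverse pass only runs for tokens that actually contain a duplicate, which is presumably the uncommon case.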

`html5lib/tests/test_parser2.py`

Lines changed: 31 additions & 2 deletions

Original file line number	Diff line number	Diff line change
`@@ -1,12 +1,12 @@`
`1`	`1`	`from __future__ import absolute_import, division, unicode_literals`
`2`	`2`
`3`		`-from six import PY2, text_type`
	`3`	`+from six import PY2, text_type, unichr`
`4`	`4`
`5`	`5`	`import io`
`6`	`6`
`7`	`7`	`from . import support  # noqa`
`8`	`8`
`9`		`-from html5lib.constants import namespaces`
	`9`	`+from html5lib.constants import namespaces, tokenTypes`
`10`	`10`	`from html5lib import parse, parseFragment, HTMLParser`
`11`	`11`
`12`	`12`
`@@ -53,13 +53,42 @@ def test_unicode_file():`
`53`	`53`	`    assert parse(io.StringIO("a")) is not None`
`54`	`54`
`55`	`55`
	`56`	`+def test_maintain_attribute_order():`
	`57`	`+    # This is here because we impl it in parser and not tokenizer`
	`58`	`+    p = HTMLParser()`
	`59`	`+    # generate loads to maximize the chance a hash-based mutation will occur`
	`60`	`+    attrs = [(unichr(x), i) for i, x in enumerate(range(ord('a'), ord('z')))]`
	`61`	`+    token = {'name': 'html',`
	`62`	`+             'selfClosing': False,`
	`63`	`+             'selfClosingAcknowledged': False,`
	`64`	`+             'type': tokenTypes["StartTag"],`
	`65`	`+             'data': attrs}`
	`66`	`+    out = p.normalizeToken(token)`
	`67`	`+    attr_order = list(out["data"].keys())`
	`68`	`+    assert attr_order == [x for x, i in attrs]`
	`69`	`+`
	`70`	`+`
`56`	`71`	`def test_duplicate_attribute():`
`57`	`72`	`    # This is here because we impl it in parser and not tokenizer`
`58`	`73`	`    doc = parse('<p class=a class=b>')`
`59`	`74`	`    el = doc[1][0]`
`60`	`75`	`    assert el.get("class") == "a"`
`61`	`76`
`62`	`77`
	`78`	`+def test_maintain_duplicate_attribute_order():`
	`79`	`+    # This is here because we impl it in parser and not tokenizer`
	`80`	`+    p = HTMLParser()`
	`81`	`+    attrs = [(unichr(x), i) for i, x in enumerate(range(ord('a'), ord('z')))]`
	`82`	`+    token = {'name': 'html',`
	`83`	`+             'selfClosing': False,`
	`84`	`+             'selfClosingAcknowledged': False,`
	`85`	`+             'type': tokenTypes["StartTag"],`
	`86`	`+             'data': attrs + [('a', len(attrs))]}`
	`87`	`+    out = p.normalizeToken(token)`
	`88`	`+    attr_order = list(out["data"].keys())`
	`89`	`+    assert attr_order == [x for x, i in attrs]`
	`90`	`+`
	`91`	`+`
`63`	`92`	`def test_debug_log():`
`64`	`93`	`    parser = HTMLParser(debug=True)`
`65`	`94`	`    parser.parse("<!doctype html><title>a</title><p>b<script>c</script>d</p>e")`
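
The existing `test_duplicate_attribute` only checks which value survives, which may be why the reversed ordering went unnoticed. As a rough illustration, assuming only the standard library, the sketch below feeds data shaped like the new tests into the removed expression; `old_normalize` is a hypothetical name for that one-liner, and `chr` stands in for `six.unichr` on Python 3.

```python
from collections import OrderedDict


def old_normalize(raw):
    """The expression this commit removes: build the dict from the reversed list."""
    return OrderedDict(raw[::-1])


# Same shape as the test data: 25 single-letter names, 'a' through 'y',
# because range() stops before ord('z').
attrs = [(chr(x), i) for i, x in enumerate(range(ord("a"), ord("z")))]

no_dupes = old_normalize(attrs)
with_dupe = old_normalize(attrs + [("a", len(attrs))])

# The first value still wins for the duplicated name...
print(with_dupe["a"])        # 0
# ...but the keys come out in reverse document order.
print(list(no_dupes)[:3])    # ['y', 'x', 'w']
```

Both new tests assert on key order, so either would fail against the old expression even though the value check passes.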
