forked fromhtml5lib/html5lib-python
- Notifications
You must be signed in to change notification settings - Fork0
Standards-compliant library for parsing and serializing HTML documents and fragments in Python
License
NotificationsYou must be signed in to change notification settings
lilbludevil/html5lib-python
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
html5lib is a pure-python library for parsing HTML. It is designed toconform to the Web Applications 1.0 specification, which hasformalized the error handling algorithms of popular web browsers. = Installation =html5lib is packaged with distutils. To install it use: $ python setup.py install = Tests =You may wish to check that your installation has been a success byrunning the testsuite. All the tests can be run by invokingruntests.py in the tests/ directory = Usage =Simple usage follows this pattern:import html5libf = open("mydocument.html")parser = html5lib.HTMLParser()document = parser.parse(f)By default, the returned document is a simple DOM-like structure whichcan be navigated using the .parent and .childNode attributes on eachelement. It is also possible to generate an ElementTree tree, this requires the use of the "tree" argument to the parser:from html5lib.treebuilders import etreeparser = html5lib.HTMLParser(tree=etree.TreeBuilder)Intrepid users may write their own treebuilder implementations - seehelp(html5lib.treebuilders) for more informationMore documentation is avaliable in the docstrings. = Bugs =Please report any bugs on the issue tracker:http://code.google.com/p/html5lib/issues/list = Get Involved =Contributions to code or documenation are actively encouraged. Submitpatches to the issue tracker or discuss changes on irc in the #whatwgchannel on freenode.netAbout
Standards-compliant library for parsing and serializing HTML documents and fragments in Python
Resources
License
Contributing
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Packages0
No packages published
Languages
- Python98.6%
- Shell1.4%