scrapy/parselPublic

NotificationsYou must be signed in to change notification settings
Fork155
Star1.3k

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

License

BSD-3-Clause license

1.3k stars 155 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 824 Commits
.github/workflows		.github/workflows
docs		docs
parsel		parsel
tests		tests
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
LICENSE		LICENSE
NEWS		NEWS
README.rst		README.rst
pyproject.toml		pyproject.toml
release.rst		release.rst
tox.ini		tox.ini

Repository files navigation

Parsel

Parsel is a BSD-licensedPython library to extract data fromHTML,JSON, andXML documents.

It supports:

CSS andXPath expressions for HTML and XML documents
JMESPath expressions for JSON documents
Regular expressions

Find the Parsel online documentation athttps://parsel.readthedocs.org.

Example (open online demo):

>>>from parselimport Selector>>> text="""...<html>...<body>...<h1>Hello, Parsel!</h1>...<ul>...<li><a href="http://example.com">Link1</a></li>...<li><a href="http://scrapy.org">Link2</a></li>...</ul>...<scripttype="application/json">{"a": ["b","c"]}</script>...</body>...</html>""">>> selector= Selector(text=text)>>> selector.css("h1::text").get()'Hello, Parsel!'>>> selector.xpath("//h1/text()").re(r"\w+")['Hello', 'Parsel']>>>for liin selector.css("ul > li"):...print(li.xpath(".//@href").get())...http://example.comhttp://scrapy.org>>> selector.css("script::text").jmespath("a").get()'b'>>> selector.css("script::text").jmespath("a").getall()['b', 'c']

About

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Releases17

v1.9.1 Latest

Dec 16, 2024

+ 16 releases

Packages

No packages published

Contributors50

+ 36 contributors

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Parsel

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases17

Packages

Uh oh!

Contributors50

Uh oh!

Languages

Movatterモバイル変換

License

scrapy/parsel

Folders and files

Latest commit

History

Repository files navigation

Parsel

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases17

Packages0

Uh oh!

Contributors50

Uh oh!

Languages

Packages