Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

NER toolkit for HTML data

NotificationsYou must be signed in to change notification settings

scrapinghub/webstruct

Repository files navigation

PyPI VersionBuild StatusCode CoverageDocumentation

Webstruct is a library for creating statisticalNER systems that workon HTML data, i.e. a library for building tools that extract namedentities (addresses, organization names, open hours, etc) from webpages.

Unlike most NER systems, webstruct works on HTML data, not onlyon text data. This allows to define features that use HTML structure,and also to embed annotation results back into HTML.

Read thedocs for more info.

License is MIT.

Contributing

To run tests, make suretox is installed, then runtox from the source root.


[8]ページ先頭

©2009-2025 Movatter.jp