Highlights
- Pro
Popular repositoriesLoading
- dataset-popular
dataset-popular PublicA dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.
- dataset-random
dataset-random PublicA dataset of random pages with manually marked up semantic blocks.
CSS 7
- python-boilerpipe
python-boilerpipe PublicForked frommisja/python-boilerpipe
Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages
Python 1
- sandcastle
sandcastle PublicForked frombcoe/sandcastle
A simple and powerful sandbox for running untrusted JavaScript.
JavaScript
- vips_java
vips_java PublicForked fromtpopela/vips_java
Implementation of Vision Based Page Segmentation algorithm in Java
Java 1
Something went wrong, please refresh the page to try again.
If the problem persists, check theGitHub status page orcontact support.
If the problem persists, check theGitHub status page orcontact support.
Uh oh!
There was an error while loading.Please reload this page.