- Notifications
You must be signed in to change notification settings - Fork96
sangaline/advanced-web-scraping-tutorial
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This repository is a companion to the articleAdvanced Web Scraping: Bypassing captcha, "403 Forbidden," and more.Please refer to the article for further details.
This is ascrapy web scraper for the fictional Zipru torrent site.It is designed to bypass four distinct anti-scraping mechanisms:
- User agent filtering.
- Obfuscated javascript redirects.
- Captchas.
- Header consistency checks.
The scraper is not actually functional because Zipru is not a real site.The code, however, is otherwise complete and can easily be adapted to work on other sites.
About
The Zipru scraper developed in the Advanced Web Scraping Tutorial.
Topics
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
No releases published
Packages0
No packages published
Uh oh!
There was an error while loading.Please reload this page.