- Notifications
You must be signed in to change notification settings - Fork28
testdrivenio/concurrent-web-scraping
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Check out theblog post.
Fork/Clone
Create and activate a virtual environment
Install the requirements
Run the scrapers:
# sync(env)$ python script.py headless# parallel with multiprocessing(env)$ python script_parallel_1.py headless# parallel with concurrent.futures(env)$ python script_parallel_2.py headless# concurrent with concurrent.futures (should be the fastest!)(env)$ python script_concurrent.py headless# parallel with concurrent.futures and concurrent with asyncio(env)$ python script_asyncio.py headless
Run the tests:
(env)$ python -m pytest test/test_scraper.py(env)$ python -m pytest test/test_scraper_mock.py
About
Building a Concurrent Web Scraper with Python and Selenium
Topics
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
No releases published
Packages0
No packages published
Uh oh!
There was an error while loading.Please reload this page.
Contributors2
Uh oh!
There was an error while loading.Please reload this page.