- Notifications
You must be signed in to change notification settings - Fork294
Fix spider#294
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
Fix spider#294
Uh oh!
There was an error while loading.Please reload this page.
Conversation
spider.py used both Python 2-only (md5) and Python 3-only (urllib) imports.Also, it didnt use a namespace when searching for links to spider,and did not read the robots.txt, preventing any spidering occurring.Fix exception occuring when robots processing removed items fromlist toVisit while iterating over the list.Add more output on stderr, and a main() which spiders yahoo.com
We could probably replace most of this with an existing library. |
Can we use |
jayvdb commentedAug 5, 2016 • edited
Loading Uh oh!
There was an error while loading.Please reload this page.
edited
Uh oh!
There was an error while loading.Please reload this page.
Using |
As far as I'm aware, nobody's actually run it for years hence it being badly broken. We should probably just kill it at this point, as it likely has no real use. |
Given the previous comment, I'm going to close this out. I opened#349 to cover removing |
Uh oh!
There was an error while loading.Please reload this page.
No description provided.