python/cpythonPublic

NotificationsYou must be signed in to change notification settings
Fork34k
Star71.3k

Dead example URL in urlib.robotparser documentation #141444

New issue

Open

#144443

Open

Dead example URL in urlib.robotparser documentation#141444

#144443

Labels

docsDocumentation in the Doc direasy

Description

lexzlei

opened

on Nov 12, 2025

Documentation

In the Python documentation forurllib.robotparser, the example currently references a page that is no longer available (musi-cal.com). The example code now points to an inactive website:

>>>importurllib.robotparser>>>rp=urllib.robotparser.RobotFileParser()>>>rp.set_url("http://www.musi-cal.com/robots.txt")>>>rp.read()>>>rrate=rp.request_rate("*")>>>rrate.requests3>>>rrate.seconds20>>>rp.crawl_delay("*")6>>>rp.can_fetch("*","http://www.musi-cal.com/cgi-bin/search?city=San+Francisco")False>>>rp.can_fetch("*","http://www.musi-cal.com/")True

Additionally, the currentrobots.txt file athttp://www.musi-cal.com/robots.txt contains:

User-agent:*Disallow: /wp-admin/Allow: /wp-admin/admin-ajax.php

Because of this, bothcan_fetch() calls now return True, which doesn't align with the expected output from the example.

Proposed fix:
Update the example inurlib.robotparser.rst to replace the outdated musi-cal.com URL with a valid URL (e.g.https://www.python.org).

I would be happy to work on this issue and put together a PR for the update.

Linked PRs

Metadata

Assignees

No one assigned

Labels

docsDocumentation in the Doc direasy

Projects

docs issues

Status

Todo

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dead example URL in urlib.robotparser documentation #141444

Description

Documentation

Linked PRs

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions