Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

robots-txt

Here are 226 public repositories matching this topic...

Polite, slim and concurrent web crawler.

  • UpdatedMay 19, 2021
  • Go

A simple and flexible web crawler that follows the robots.txt policies and crawl delays.

  • UpdatedMay 19, 2021
  • Go

Tame the robots crawling and indexing your Nuxt site.

  • UpdatedApr 29, 2025
  • TypeScript

The robots.txt exclusion protocol implementation for Go language

  • UpdatedNov 9, 2022
  • Go
InfinityCrawler

A simple but powerful web crawler library for .NET

  • UpdatedDec 15, 2023
  • C#

A set of reusable Java components that implement functionality common to any web crawler

  • UpdatedApr 25, 2025
  • Java

Determine if a page may be crawled from robots.txt, robots meta tags and robot headers

  • UpdatedFeb 3, 2025
  • PHP
llms-txt-hub

🤖 The largest directory for AI-ready documentation and tools implementing the proposed llms.txt standard

  • UpdatedApr 28, 2025
  • TypeScript

Ultimate Website Sitemap Parser

  • UpdatedApr 28, 2025
  • Python
weboptout

Opt-Out tool to check Copyright reservations in a way that even machines can understand.

  • UpdatedJan 8, 2024
  • Python

Open-Source Python Based SEO Web Crawler

  • UpdatedJul 7, 2023
  • Python

NodeJS robots.txt parser with support for wildcard (*) matching.

  • UpdatedOct 28, 2024
  • JavaScript

Known tags and settings suggested to opt out of having your content used for AI training.

  • UpdatedJun 21, 2024
  • HTML

Makes it easy to add robots.txt, sitemap and web app manifest during build to your Astro app.

  • UpdatedDec 15, 2023
  • TypeScript

grobotstxt is a native Go port of Google's robots.txt parser and matcher library.

  • UpdatedMar 16, 2022
  • Go

Gatsby plugin that automatically creates robots.txt for your site

  • UpdatedJan 29, 2024
  • JavaScript

Collection of SEO utilities like sitemap, robots.txt, etc. for a Remix application. Forked fromhttps://github.com/balavishnuvj/remix-seo

  • UpdatedApr 14, 2025
  • TypeScript

🤖 A curated list of websites that restrict access to AI Agents, AI crawlers and GPTs

  • UpdatedApr 2, 2025
  • Python

Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.

  • UpdatedFeb 16, 2025

Improve this page

Add a description, image, and links to therobots-txt topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with therobots-txt topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp