website-crawler
Here are 31 public repositories matching this topic...
Language:All
Sort:Most stars
It allows you to download a website from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer.
- Updated
Jun 1, 2023 - Visual Basic .NET
Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation to outsmart website bots and prevent blocking.
- Updated
Jun 10, 2025 - Python
Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis
- Updated
Aug 19, 2023 - Python
Scraper forhttps://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards.
- Updated
Jul 1, 2024 - Python
A universal and local phishing toolkit for audit purposes
- Updated
Nov 21, 2024 - Python
An almost generic web crawler built using Scrapy and Python 3.7 to recursively crawl entire websites.
- Updated
Mar 1, 2022 - Python
WebKnoGraph is an open research project that uses data processing, vector embeddings, and graph algorithms to optimize internal linking at scale. Built for both academic and industry use, it offers THE FIRST FULLY transparent, AI-driven framework for improving SEO and site navigation through reproducible methods.
- Updated
Jul 10, 2025 - Jupyter Notebook
A tutorial and code samples of web scraping with PHP
- Updated
Jun 26, 2025 - PHP
A Simple Script To Scrape DuckDuckGo Search Results Using Python And Selenium WebDriver.
- Updated
Nov 1, 2022 - Python
🕷️ | ReconX is a Live-Website Crawler made to gather critical information with an option to take a picture of each site crawled!
- Updated
Feb 20, 2025 - Python
💫 Crawl urls from a webpage and provide a DomCrawler with Scraper Library
- Updated
Nov 12, 2024 - PHP
This a project to demonstrate the use of standard python libraries like os, urllib, HTMLParser to create a minimalist webpage crawler that crawls webpages on a website to gather hyperlinks (URLs)
- Updated
May 26, 2020 - Python
Web Link Crawler: A Python script to crawl websites and collect links based on a regex pattern. Efficient and customizable.
- Updated
May 30, 2024 - Python
Crawls website and collect SEO relevant data
- Updated
Sep 27, 2022 - Go
The most advanced Lightshot (or prnt.sc) scraper ever!
- Updated
Dec 17, 2023 - Java
sponge is a website crawler and links downloader command-line tool
- Updated
Jul 12, 2025 - Kotlin
Recursive website crawler
- Updated
Mar 23, 2022 - Python
Java website crawler - library for analyze and testing websites
- Updated
Dec 30, 2021 - Java
The most advanced Imgur scraper ever!
- Updated
Apr 29, 2023 - Java
Improve this page
Add a description, image, and links to thewebsite-crawler topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thewebsite-crawler topic, visit your repo's landing page and select "manage topics."