webscraper
Here are 1,518 public repositories matching this topic...
Language:All
Sort:Most stars
Self-hosted webscraper.
- Updated
Oct 12, 2025 - TypeScript
AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.
- Updated
Nov 24, 2025 - TypeScript
Web Scraper in Go, similar to BeautifulSoup
- Updated
Nov 2, 2023 - Go
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
- Updated
Feb 22, 2025 - Pascal
Scalable Python web scraping scripts for +40 popular domains
- Updated
Nov 26, 2025 - Python
a class that uses scraped proxies to make http GET/POST requests (Python requests)
- Updated
Dec 3, 2020 - Python
An R web crawler and scraper
- Updated
Mar 27, 2022 - R
An AI assistant tool that integrates coding, writing, and reading functions. For better alternatives seehttps://monica.im/desktop
- Updated
May 12, 2023 - TypeScript
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
- Updated
Dec 27, 2023 - Python
📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefixhttp://127.0.0.1:3000/https://website-to-scrape.com/
- Updated
Jul 18, 2025 - TypeScript
Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.
- Updated
Jun 10, 2024 - Python
RSS feed builder created with Bun🥖 and Hono🔥- builds from webpages, email folders, and REST API calls.
- Updated
Nov 19, 2025 - TypeScript
Spotify Scraper to extract all the information from spotify, download mp3 with cover of the song
- Updated
Nov 24, 2025 - Makefile
A Python command-line tool for scraping and downloading subtitles from AppleTV and iTunes movie pages.
- Updated
Oct 14, 2025 - Python
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link :https://medium.com/@mehmetozkaya/creating-custom-web-crawler-w…
- Updated
Dec 20, 2022 - C#
Financial Web Scraper & Sentiment Classifier
- Updated
Oct 2, 2020 - Python
Web scrapper for Shutterstock
- Updated
Nov 24, 2020 - Python
Scrapes g4g and creates PDF
- Updated
May 15, 2020 - Python
Cryptocurrency Historical Market Data R Package
- Updated
Oct 8, 2025 - R
Improve this page
Add a description, image, and links to thewebscraper topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thewebscraper topic, visit your repo's landing page and select "manage topics."