crawler-python

Spiderbuf 是一个专注于 Python 爬虫练习的网站。提供丰富的爬虫教程、爬虫案例解析和爬虫练习题。Python爬虫开发强化练习，在矛与盾的攻防中不断提高技术水平，通过大量的爬虫实战掌握常见的爬虫与反爬套路。引导式爬虫案例 + 免费爬虫视频教程，以闯关的形式挑战各个爬虫任务，培养爬虫开发的直觉及经验，验证自身爬虫开发与反爬虫实力的时候到了。

python crawler spider captcha cookie scraping cookies selenium requests xpath crawlers scraping-websites scraping-python crawler-python scraping-web scraping-data js-reverse spiderbuf

UpdatedNov 17, 2025
Python

WwwwwyDev /crawlist

Star110

A universal solution for web crawling lists. 抓取网页列表的通用解决方案

python crawler crawl reptile crawling-python crawler-python crawlist

UpdatedJun 5, 2024
Python

guilatrova /GMaps-Crawler

Sponsor

Star83

Google Maps crawler using Selenium. All extracted data is forwarded to a SQS queue.

python crawler google-maps selenium-python crawler-python antifragiledev

UpdatedNov 25, 2021
Python

DEENUU1 /meta-spy

Star68

👾 CLI MetaSpy (Facebook, Instagram) scraper and crawler - instagram account, facebook accounts, pages and search

css python html cli crawler scraper facebook web graph jinja2 sqlite selenium python3 sqlite3 facebook-login rich typer fastapi crawler-python typer-cli

UpdatedNov 25, 2023
Python

mcxiaoxiao /xiaohongshuCrawler

Star42

🍠小红书 rednote 简易爬虫获取文章title、文章id、文章内容、话题标签 👌🏻 三步实现

crawling-sites crawler-python

UpdatedNov 11, 2025
JavaScript

Sneakpeek is a framework that helps to quickly and conviniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant basis

python crawler scraper vue scraping crawling python3 scrapers scraper-engine crawlers crawling-framework website-crawler scraping-framework crawler-python scraper-api crawling-engine

UpdatedAug 19, 2023
Python

vlmaier /marvel-snap-scrapr

Star26

Scraper forhttps://marvelsnapzone.com to retrieve metadata of Marvel SNAP cards.

game crawler scraper marvel website-scraper website-crawler marvel-characters crawler-python marvel-snap

UpdatedJul 1, 2024
Python

pyladies-brazil /crawler-tutorial

Star25

Tutorial de raspagem de dados realizado em parceria com a JusBrasil

python crawler brazil crawl beautifulsoup python-tutorial pyladies raspagem-de-dados pyladies-workshop crawling-python crawler-python pyladies-brasil requests-python

UpdatedOct 6, 2023
HTML

JimouChen /bing-chat-fxxk

Star25

newbing api by PlayWright

crawler gpt bing-api crawler-python

UpdatedSep 3, 2023
Python

andripwn /crawler-python

Star24

email scraper/crawls using python (Google/Bing)

email-scraper email-crawler crawler-python

UpdatedMay 11, 2020
Python

KSMubasshir /bd-newspaper-crawlers

Star18

A collection of Bangla newspaper and blog crawlers. Can be used to mine bangla text data for Natural Language Processing tasks.

nlp crawler newspaper data-collection dataset-generation bangla-dataset bangla-dataset-for-opinion-mining bangla-nlp crawler-python bangla-language-model bangla-dataset-machine-translation bangla-language-processing bangla-natural-language-processing

UpdatedJan 30, 2023
Python

RaccoonTamer /Reddit-Crawler

Star16

Reddit Media Downloader is a Python application designed to simplify the process of downloading images and GIFs from Reddit. It allows users to specify a subreddit and number of posts to fetch, then automatically retrieves and downloads all available media files. The app features built-in cache logic, which remembers previously downloaded posts to

python crawler downloader reddit reddit-api reddit-theme image-analysis reddit-application reddit-client reddit-crawler reddit-downloader crawler-python api-free

UpdatedMay 15, 2025
Python

Viper373 /JD-comments

Star15

爬取京东商品评论数据

python spider data-analysis crawler-python

UpdatedJul 2, 2025
JavaScript

Viper373 /LOL-DeepWinPredictor

Star14

基于双向双层、引入注意力机制的LSTM对英雄联盟比赛胜率进行预测。

python flask spider mongodb deep-learning lol prediction lstm attention-mechanism rocketmq crawler-python

UpdatedDec 14, 2025
JavaScript

MarkPhamm /skytrax_reviews

Star12

A comprehensive ELT pipeline for analyzing passenger satisfaction data. Features a modern data architecture with Apache Airflow for extraction, dbt/Snowflake for transformation, Python/Pandas for cleaning, and interactive dashboards for visualization with NextJS.

nextjs aws-s3 chartjs snowflake dbt cicd dockers tailwindcss github-actions astronomer airflow-dags crawler-python langchain chromadb deepseek-r1

UpdatedOct 5, 2025

changhyeonnam /Google-Full-size-image-crawler

Star10

crawling google full size image

crawler google-images-crawler google-images-downloader crawler-python chrome-crawler full-size-image

UpdatedNov 26, 2022
Python

Improve this page

Add a description, image, and links to thecrawler-python topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thecrawler-python topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly