archivebox
Here are 35 public repositories matching this topic...
Language:All
Sort:Most stars
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
- Updated
Nov 15, 2025 - Python
A lightweight, open-source, privacy-first bookmark manager that unifies your bookmarks across multiple browsers, syncs them in real time (locally or P2P), requires no extensions, and stores everything locally.
- Updated
Nov 25, 2025 - Go
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
- Updated
May 3, 2025 - JavaScript
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
- Updated
May 19, 2025
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
- Updated
Feb 28, 2023 - JavaScript
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...
- Updated
Aug 20, 2025 - JavaScript
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
- Updated
Aug 15, 2024 - JavaScript
Home of the official docker image for ArchiveBox
- Updated
Dec 18, 2024
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
- Updated
Sep 16, 2024 - JavaScript
MCP server tailored to connecting web crawler data and archives
- Updated
Sep 24, 2025 - HTML
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
- Updated
Jul 12, 2024 - Python
[FREE] A service to help export your pocket bookmarks, tags, saved article text, and more...
- Updated
Nov 20, 2025 - TypeScript
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
- Updated
Oct 5, 2024 - Ruby
Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.
- Updated
Oct 31, 2024 - Python
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
- Updated
Jul 11, 2025 - JavaScript
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
- Updated
Feb 2, 2024 - HTML
Self-hosted internet archiving solution to collect, save, and view sites you want to preserve offline, for YunoHost.
- Updated
Oct 28, 2025 - Shell
Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from ArchiveBox.
- Updated
Nov 25, 2025 - Shell
Home of the official apt/deb package for Ubuntu/Debian-based systems.
- Updated
Oct 5, 2024 - Python
Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.
- Updated
Aug 1, 2025 - CSS
Improve this page
Add a description, image, and links to thearchivebox topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thearchivebox topic, visit your repo's landing page and select "manage topics."