- Notifications
You must be signed in to change notification settings - Fork367
⬛️ CLI tool and library for saving complete web pages as a single HTML file
License
Y2Z/monolith
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
_____ ______________ __________ ___________________ ___| \ / \ | | | | | || \_/ __ \_| __ | | ___ ___ |__| || | | | | | | | | | | || |\ /| |__| _ |__| |____| | | | | __ || | \___/ | | \ | | | | | | ||___| |__________| \_____________________| |___| |___| |___|
A data hoarder’s dream come true: bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive.
Unlike the conventional “Save page as”,monolith
not only saves the target document, it embeds CSS, image, and JavaScript assetsall at once, producing a single HTML5 document that is a joy to store and share.
If compared to saving websites withwget -mpk
, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.
UsingCargo (cross-platform)
cargo install monolith
ViaHomebrew (macOS and GNU/Linux)
brew install monolith
ViaChocolatey (Windows)
choco install monolith
ViaScoop (Windows)
scoop install main/monolith
ViaWinget (Windows)
winget install --id=Y2Z.Monolith -e
ViaMacPorts (macOS)
sudo port install monolith
UsingSnapcraft (GNU/Linux)
snap install monolith
UsingGuix (GNU/Linux)
guix install monolith
UsingNixPkgs
nix-env -iA nixpkgs.monolith
UsingFlox
flox install monolith
UsingPacman (Arch Linux)
pacman -S monolith
Usingaports (Alpine Linux)
apk add monolith
UsingXBPS Package Manager (Void Linux)
xbps-install -S monolith
UsingFreeBSD packages (FreeBSD)
pkg install monolith
UsingFreeBSD ports (FreeBSD)
cd /usr/ports/www/monolith/make install clean
Usingpkgsrc (NetBSD, OpenBSD, Haiku, etc)
cd /usr/pkgsrc/www/monolithmake install clean
Usingcontainers
docker build -t y2z/monolith .sudo install -b dist/run-in-container.sh /usr/local/bin/monolith
Fromsource
Dependencies:libssl
,cargo
Install cargo (GNU/Linux)
Check if cargo is installedcargo -v
If cargo is not already installed, install and add it to your existing$PATH
(paraphrasing theofficial installation instructions):
curl https://sh.rustup.rs -sSf | sh. "$HOME/.cargo/env"
Proceed with installing from source:
git clone https://github.com/Y2Z/monolith.gitcd monolithmake install
Usingpre-built binaries (Windows, ARM-based devices, etc)
Every release contains pre-built binaries for Windows, GNU/Linux, as well as platforms with non-standard CPU architecture.
monolith https://lyrics.github.io/db/P/Portishead/Dummy/Roads/ -o portishead-roads-lyrics.html
cat some-site-page.html | monolith -aIiFfcMv -b https://some.site/ - > some-site-page-with-assets.html
-a
: Exclude audio sources-b
: Usecustom base URL
-B
: Forbid retrieving assets from specified domain(s)-c
: Exclude CSS-C
: Read cookies fromfile
-d
: Allow retrieving assets only from specifieddomain(s)
-e
: Ignore network errors-E
: Save document usingcustom encoding
-f
: Omit frames-F
: Exclude web fonts-h
: Print help information-i
: Remove images-I
: Isolate the document-j
: Exclude JavaScript-k
: Accept invalid X.509 (TLS) certificates-M
: Don't add timestamp and URL information-n
: Extract contents of NOSCRIPT elements-o
: Write output tofile
(use “-” for STDOUT)-q
: Be quiet-t
: Adjustnetwork request timeout
-u
: Providecustom User-Agent
-v
: Exclude videos-V
: Print version number
Options-d
and-B
provide control over what domains can be used to retrieve assets from, e.g.:
monolith -I -d example.com -d www.example.com https://example.com -o example-only.html
monolith -I -B -d .googleusercontent.com -d googleanalytics.com -d .google.com https://example.com -o example-no-ads.html
Monolith doesn't feature a JavaScript engine, hence websites that retrieve and display data after initial load may require usage of additional tools.
For example, Chromium (Chrome) can be used to act as a pre-processor for such pages:
chromium --headless --window-size=1920,1080 --run-all-compositor-stages-before-draw --virtual-time-budget=9000 --incognito --dump-dom https://github.com | monolith - -I -b https://github.com -o github.html
Please sethttps_proxy
,http_proxy
, andno_proxy
environment variables.
Please open an issue if something is wrong, that helps make this project better.
To the extent possible under law, the author(s) have dedicated all copyright related and neighboring rights to this software to the public domain worldwide.This software is distributed without any warranty.
About
⬛️ CLI tool and library for saving complete web pages as a single HTML file