Uh oh!
There was an error while loading.Please reload this page.
- Notifications
You must be signed in to change notification settings - Fork1.4k
Description
What type of suggestion are you making?
Proposing a new feature
What is the problem that your feature request solves?
Looking atavailable sources, archiving browser history requires runningexport_browser_history.sh.
However I see a few issues:
- It looks like this was written for macOS only. Linux users have to figure out how to use the script manually.
- There's a sqlite syntax error for the Firefox export.
- The script fails silently. Depending on the error it will just generate an empty file, do nothing, etc. and may generate no helpful output.
What is your proposed solution?
I'm a bit of a Bash nerd and would love to make this work with Linux and Firefox at least. I've already startedhere. Is this kind of contribution something you would take?
Side notes:
- This branch seeks to fix all the issues I've found so far. I have split the commits up in a logical way as well, so they're easy to review one-by-one.
- I donot have a mac to test with, so you will definitely want to test these changes on a mac before merging.
- I am onlyconsidering installing Chromium to get that working on Linux. Not sure if I will yet.
What hacks or alternative solutions have you tried to solve the problem?
Pass the full file name to the script after the--firefox argument. But that still fails with a sqlite syntax error.
Share the entire output of thearchivebox version command for the current verison you are using.
0.7.3ArchiveBox v0.7.3 COMMIT_HASH=069aabc BUILD_TIME=2024-12-15 09:54:03 1734256443IN_DOCKER=True IN_QEMU=False ARCH=x86_64 OS=Linux PLATFORM=Linux-6.13.2-arch1-1-x86_64-with-glibc2.36 PYTHON=CpythonFS_ATOMIC=True FS_REMOTE=True FS_USER=911:911 FS_PERMS=644DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND=ripgrep LDAP=False[i] Dependency versions: √ PYTHON_BINARY v3.11.11 valid /usr/local/bin/python3.11 √ SQLITE_BINARY v2.6.0 valid /usr/local/lib/python3.11/sqlite3/dbapi2.py √ DJANGO_BINARY v3.1.14 valid /usr/local/lib/python3.11/site-packages/django/__init__.py √ ARCHIVEBOX_BINARY v0.7.3 valid /usr/local/bin/archivebox √ CURL_BINARY v8.10.1 valid /usr/bin/curl √ WGET_BINARY v1.21.3 valid /usr/bin/wget √ NODE_BINARY v20.18.1 valid /usr/bin/node √ SINGLEFILE_BINARY v1.1.54 valid /app/node_modules/single-file-cli/single-file √ READABILITY_BINARY v0.0.11 valid /app/node_modules/readability-extractor/readability-extractor √ MERCURY_BINARY v1.0.0 valid /app/node_modules/@postlight/parser/cli.js √ GIT_BINARY v2.39.5 valid /usr/bin/git √ YOUTUBEDL_BINARY v2024.12.13 valid /usr/local/bin/yt-dlp √ CHROME_BINARY v131.0.6778.33 valid /usr/bin/chromium-browser √ RIPGREP_BINARY v13.0.0 valid /usr/bin/rg [i] Source-code locations: √ PACKAGE_DIR 23 files valid /app/archivebox √ TEMPLATES_DIR 3 files valid /app/archivebox/templates - CUSTOM_TEMPLATES_DIR - disabled None [i] Secrets locations: - CHROME_USER_DATA_DIR - disabled None - COOKIES_FILE - disabled None [i] Data locations: √ OUTPUT_DIR 5 files @ valid /data √ SOURCES_DIR 5 files valid ./sources √ LOGS_DIR 2 files valid ./logs √ ARCHIVE_DIR 4 files valid ./archive √ CONFIG_FILE 81.0 Bytes valid ./ArchiveBox.conf √ SQL_INDEX 244.0 KB valid ./index.sqlite3
This is on the latestdev branch. The last time this script was touched was inaa5533b
How badly do you want this new feature?
- It's an urgent deal-breaker, I can't live without it
- It's important to add it in the near-mid term future
- It would be nice to have eventually
- I'm willing tostart a PR to develop this myself
- I havedonated money to go towards fixing this issue
Mini Survey
- I like ArchiveBox so far / would recommend it to a friend
- I've had a lot of difficulty getting ArchiveBox set up
- I would pay $10/mo for a hosted version of ArchiveBox if it had this feature
Metadata
Metadata
Assignees
Labels
No labels