v0.8 Release Candidate #1311

Open
pirate wants to merge 1,246 commits into stable from dev
Commits
922fd42  bump version 0.8.5rc51 -> 0.8.5rc52  (pirate, Oct 24, 2024)
6770394  use pep440_version when bumping version  (pirate, Oct 24, 2024)
c83abd7  bump version v0.8.5rc52 -> v0.8.5rc53  (pirate, Oct 24, 2024)
6c2f1d2  move DEBUG=True packages into pip-available pkgs  (pirate, Oct 24, 2024)
5295320  add django-autotyping to debug pip group  (pirate, Oct 24, 2024)
60f0458  rename configfile to collection  (pirate, Oct 24, 2024)
b61f6ff  rename system_tasks queue to commands queue  (pirate, Oct 24, 2024)
4b6f08b  swap more direct settings.CONFIG access to abx getters  (pirate, Oct 24, 2024)
5d9a32c  wip  (pirate, Oct 25, 2024)
4213d7d  Fix API crash  (benmuth, Oct 26, 2024)
7ff2c7f  Fix API crash (#1569)  (pirate, Oct 27, 2024)
b3c1cb7  move abx plugins inside vendor dir  (pirate, Oct 28, 2024)
d47d429  add placeholder pyproj  (pirate, Oct 28, 2024)
d93aa46  fix django.forms.JSONField does not exist 500 error  (pirate, Oct 29, 2024)
a5d99b8  add more plugins  (pirate, Oct 29, 2024)
70926f1  replace os.access with os.path.isdir  (pirate, Oct 29, 2024)
6530d1f  remove vendored copy of pocket and add [debug] group of pkgs for runn…  (pirate, Oct 29, 2024)
001056f  remove vendored copy of pydantic-pkgr  (pirate, Oct 29, 2024)
7d75867  bump rc version since there have been tons of changes  (pirate, Oct 29, 2024)
dee4eb7  rename vendor dir to pkgs  (pirate, Oct 29, 2024)
30cd48c  update lockfiles  (pirate, Oct 29, 2024)
eb721bd  tweak parser imports  (pirate, Oct 29, 2024)
5efeb9d  add get_SCOPE_CONFIG  (pirate, Oct 29, 2024)
f56cdd2  add chrome flag to fix long screenshots getting cut off  (pirate, Oct 29, 2024)
5ea035c  Update README.md  (pirate, Oct 30, 2024)
9c2eac4  add new actors and orchestrators  (pirate, Oct 31, 2024)
17faa5a  improvements to new actor and orchestrators  (pirate, Oct 31, 2024)
721427a  hide progress bar on startup  (pirate, Oct 31, 2024)
ecfdab1  Update and rename bug_report.md to bug_report.yml  (pirate, Nov 2, 2024)
6adca82  Update bug_report.yml  (pirate, Nov 2, 2024)
8ce010a  Update bug_report.yml  (pirate, Nov 2, 2024)
ea6156f  Update bug_report.yml  (pirate, Nov 2, 2024)
8e0e9f2  Update bug_report.yml  (pirate, Nov 2, 2024)
a0bbe55  Update bug_report.yml  (pirate, Nov 2, 2024)
b47b453  Update bug_report.yml  (pirate, Nov 2, 2024)
65bb71e  Update bug_report.yml  (pirate, Nov 2, 2024)
2bff4f4  Update bug_report.yml  (pirate, Nov 2, 2024)
983119d  Delete .github/ISSUE_TEMPLATE/question_or_discussion.md  (pirate, Nov 2, 2024)
80dd3c6  Update and rename feature_request.md to feature_request.yml  (pirate, Nov 2, 2024)
2948637  Update feature_request.yml  (pirate, Nov 2, 2024)
61f1501  Update feature_request.yml  (pirate, Nov 2, 2024)
e68806b  Update and rename documentation_change.md to documentation_change.yml  (pirate, Nov 2, 2024)
2e0dc1f  Update documentation_change.yml  (pirate, Nov 2, 2024)
c017491  Update documentation_change.yml  (pirate, Nov 2, 2024)
ce6aa20  Update documentation_change.yml  (pirate, Nov 2, 2024)
eeac839  Update documentation_change.yml  (pirate, Nov 2, 2024)
a675949  Update documentation_change.yml  (pirate, Nov 2, 2024)
12a95b5  Create config.yml  (pirate, Nov 3, 2024)
ce6ae34  Update config.yml  (pirate, Nov 3, 2024)
85747f9  Rename bug_report.yml to 1-bug_report.yml  (pirate, Nov 3, 2024)
7862d58  Rename feature_request.yml to 2-feature_request.yml  (pirate, Nov 3, 2024)
abad13f  Rename documentation_change.yml to 3-documentation_change.yml  (pirate, Nov 3, 2024)
f5cf805  Update 3-documentation_change.yml  (pirate, Nov 3, 2024)
27f26fd  Update config.yml  (pirate, Nov 3, 2024)
dbe5c0b  more orchestrator and actor improvements  (pirate, Nov 3, 2024)
9b24fe7  merge dev  (pirate, Nov 3, 2024)
2337f87  better actor atomic claim  (pirate, Nov 3, 2024)
41efd01  add wip crawl actor spec  (pirate, Nov 3, 2024)
48f8416  add new core and crawsl statemachine manager  (pirate, Nov 3, 2024)
49c5209  playwright: support PLAYWRIGHT_BROWSERS_PATH environment variable  (andrew-d, Nov 3, 2024)
50a85ec  Update archivebox/plugins_pkg/playwright/binproviders.py  (pirate, Nov 3, 2024)
cc49ecb  playwright: support PLAYWRIGHT_BROWSERS_PATH environment variable (#1…  (pirate, Nov 3, 2024)
758c0c6  add user providable PLAYWRIGHT cache dir  (pirate, Nov 3, 2024)
b6ab4e2  merge dev  (pirate, Nov 3, 2024)
b7b3add  v0.8.6-rc: Moving plugins to independent python packages with finite …  (pirate, Nov 3, 2024)
5872375  Update Dockerfile.simple  (pirate, Nov 3, 2024)
1148cad  Update __init__.py  (pirate, Nov 3, 2024)
fd89de5  Update setup.sh  (pirate, Nov 4, 2024)
cad1be9  Require bash for setup.sh script instead of sh  (pirate, Nov 4, 2024)
99ed978  Prevent accidentally mounting home folder as DATA_DIR  (pirate, Nov 4, 2024)
5d3c2a8  Update docker_entrypoint.sh  (pirate, Nov 4, 2024)
a9a3b15  more StateMachine, Actor, and Orchestrator improvements  (pirate, Nov 4, 2024)
a0f9d3f  Update README.md  (pirate, Nov 12, 2024)
ad7eec2  bump docs changes  (pirate, Nov 13, 2024)
5ce25d7  Delete click_test.py  (pirate, Nov 13, 2024)
c6710a8  Delete CNAME  (pirate, Nov 13, 2024)
840f831  move readthedocs config into subdir  (pirate, Nov 13, 2024)
57852fd  fix sphinx docs build  (pirate, Nov 13, 2024)
f0a7198  bump docs changes  (pirate, Nov 13, 2024)
ec100bf  fix docs build for vendored pkgs  (pirate, Nov 13, 2024)
5cb1fd7  bump docs changes  (pirate, Nov 13, 2024)
6448968  Use archivebox/sonic multi-arch container with bundled config file  (pirate, Nov 13, 2024)
ed43f1d  better docstrings and comments  (pirate, Nov 16, 2024)
7c0e3dc  load crawls,seeds,actors apps as pluggy plugins  (pirate, Nov 16, 2024)
c3d692b  fix minor actor erros around CLAIM_ATOMIC  (pirate, Nov 16, 2024)
48bb634  fix orchestrator startup and add exit_on_idle option  (pirate, Nov 16, 2024)
43514da  add crawl and seed endpoints to REST API  (pirate, Nov 16, 2024)
b4a5da3  update archivebox add CLI command to use new actor system  (pirate, Nov 16, 2024)
684a394  add HOSTNAME to config.permissions  (pirate, Nov 16, 2024)
227fd4e  fix statemachine progression for Snapshot, Crawl, and ArchiveResult  (pirate, Nov 16, 2024)
ba26d75  add notes and label fields, fix model getters  (pirate, Nov 16, 2024)
c2add71  make supervisord start orchestrator on startup  (pirate, Nov 16, 2024)
8cd285e  add Seed admin  (pirate, Nov 16, 2024)
2291f02  setup seed model  (pirate, Nov 16, 2024)
b7df1ca  add start orchestrator management command  (pirate, Nov 16, 2024)
a4635fe  bump rc version  (pirate, Nov 16, 2024)
210fd93  make orchestrator run as long as any tasks are pending  (pirate, Nov 16, 2024)
c8e186f  fix plugin loading order, admin, abx-pkg  (pirate, Nov 16, 2024)
8f8fbbb  API fixes and add actors endpoints  (pirate, Nov 18, 2024)
fb82fda  make actor pending include all obj with retry_at in the past  (pirate, Nov 18, 2024)
36d24cd  add jobs dashboard  (pirate, Nov 18, 2024)
1b8bafd  add abx-spec-abx-pkg pkg  (pirate, Nov 18, 2024)
2f30a35  add extractors files to favicon and title plugins  (pirate, Nov 18, 2024)
2c59524  bump docs build  (pirate, Nov 18, 2024)
c206056  add better docstrings to abx package  (pirate, Nov 18, 2024)
dbd6272  Update config.yml  (pirate, Nov 18, 2024)
2ae70de  Update config.yml  (pirate, Nov 18, 2024)
3e5ae16  Update config.yml  (pirate, Nov 18, 2024)
18403b7  Update config.yml (#1598)  (pirate, Nov 18, 2024)
148ea90  fix serious bug with Actor.get_next updating all rows instead of only…  (pirate, Nov 18, 2024)
2a66bb9  flip queue processing order to do most recent first  (pirate, Nov 18, 2024)
67c22b2  fix config set not working with constants  (pirate, Nov 18, 2024)
1ec2753  fix statemachine create_root_snapshot and retry timing  (pirate, Nov 18, 2024)
b852442  add crawls app back to django admin  (pirate, Nov 18, 2024)
c8b830b  add ABIDModel.update_for_workers to update-in-place + bump retry_at time  (pirate, Nov 18, 2024)
af21c34  add ModelWithOutputDir base model to manage output directories and in…  (pirate, Nov 18, 2024)
9b8cf7b  simplify actor and orchestrator by removing threading code, fixing bugs  (pirate, Nov 18, 2024)
f5727c7  rename actors to workers  (pirate, Nov 18, 2024)
f65c2b4  tweak dashboard UI css  (pirate, Nov 18, 2024)
1e3ce67  fix API and CLU calls  (pirate, Nov 18, 2024)
385ccaa  extend core models with ModelWithOutputDir  (pirate, Nov 18, 2024)
9adfe0e  add code to log all SQL queries for DEBUG  (pirate, Nov 18, 2024)
eb53145  working state machine flow yay  (pirate, Nov 18, 2024)
c7bd944  better jobs dashboard with faster refresh  (pirate, Nov 18, 2024)
eeb2671  API improvements  (pirate, Nov 18, 2024)
6b83b4c  leave archivebox running when in archivebox update  (pirate, Nov 18, 2024)
0acd388  fix imports and deps  (pirate, Nov 19, 2024)
e50f8cb  fix abx handling of obj, module, and class based plugins, fix archive…  (pirate, Nov 19, 2024)
e469c5a  merge queues and actors apps into new workers app  (pirate, Nov 19, 2024)
4a5d607  move logging_util into archivebox.misc subfolder  (pirate, Nov 19, 2024)
4c25e90  move monkey_patches.py into archivebox.misc subfolder  (pirate, Nov 19, 2024)
65afd40  merge seeds and crawls apps  (pirate, Nov 19, 2024)
0db6437  fix plural name for output_dir  (pirate, Nov 19, 2024)
569081a  rename abid_utils to base_models  (pirate, Nov 19, 2024)
328eb98  move main funcs into cli files and switch to using click for CLI  (pirate, Nov 19, 2024)
5f01fc8  fix archivebox shell and manage CLI commands  (pirate, Nov 19, 2024)
a0edf21  fix archivebox init and archivebox install CLI commands  (pirate, Nov 19, 2024)
c9a05c9  working archivebox update CLI cmd  (pirate, Nov 19, 2024)
2595139  improve statemachine logging and archivebox update CLI cmd  (pirate, Nov 19, 2024)
0347b91  archivebox add and remove CLI cmds  (pirate, Nov 19, 2024)
3a64ced  fix archivebox delete errors  (pirate, Nov 19, 2024)
292730e  working archivebox_schedule cmd  (pirate, Nov 19, 2024)
0f860d4  working archivebox_status CLI cmd  (pirate, Nov 19, 2024)
f21b86a  better cli colors  (pirate, Nov 19, 2024)
6740202  fix cli loading edge case where setup_django wasnt running when it sh…  (pirate, Nov 19, 2024)
ee548eb  fix archivebox install not using LIB_DIR  (pirate, Nov 19, 2024)
230bf34  restore missing archivebox_config work  (pirate, Nov 19, 2024)
fe3320e  restore missing archivebox_remove work  (pirate, Nov 19, 2024)
0f536ff  restore missing archivebox_schedule work  (pirate, Nov 19, 2024)
52446b8  restore missing archivebox_status work  (pirate, Nov 19, 2024)
f8e2f7c  restore missing archivebox_update work  (pirate, Nov 19, 2024)
6b47510  always pre-setup binproviders  (pirate, Nov 19, 2024)
b852951  fix cli loading edge case where setup_django wasnt running when it sh…  (pirate, Nov 19, 2024)
4dd53dc  Merge branch 'newchanges' into dev  (pirate, Nov 19, 2024)
28386ff  add jobs_dashboard.html back  (pirate, Nov 19, 2024)
b948e49  add urls log to Crawl model  (pirate, Nov 19, 2024)
44d337a  convert index.schema.ArchiveResult and Link to pydantic  (pirate, Nov 19, 2024)
2290140  Update 2-feature_request.yml  (pirate, Nov 22, 2024)
eae7ed8  add hashing misc library for merkle tree generation  (pirate, Dec 3, 2024)
c374d76  allow getting crawl from API as rss feed  (pirate, Dec 3, 2024)
1ceaa1a  add ABID model check and fix model inheritance  (pirate, Dec 3, 2024)
337acda  add base extractor class  (pirate, Dec 3, 2024)
dcd7e25  add new archivebox_extract cli command  (pirate, Dec 3, 2024)
8c8ec6a  add extractors README  (pirate, Dec 3, 2024)
73a75bb  Update FUNDING.yml  (pirate, Dec 4, 2024)
a3fe78a  add basename to hashing get_dir_info  (pirate, Dec 3, 2024)
dc0f1b0  add new File model in filestore  (pirate, Dec 3, 2024)
d192eb5  add filestore content addressible store draft  (pirate, Dec 4, 2024)
f1b9aec  fix syntax errors  (pyrox0, Dec 5, 2024)
a572db3  fix syntax errors (#1609)  (pirate, Dec 6, 2024)
ac53fdf  make chrome binary and configs directly runnable and make extractor u…  (pirate, Dec 6, 2024)
81bf81a  add extract.js prototype extractor  (pirate, Dec 6, 2024)
1444cf7  add new KVTags system  (pirate, Dec 13, 2024)
a859278  tags apps.py  (pirate, Dec 13, 2024)
5cf7725  add new archivebox worker implementation based on better distributed …  (pirate, Dec 13, 2024)
6b3e297  fix lock_pkgs.sh version parsing and python version  (pirate, Dec 13, 2024)
51447b9  bump django version to 5.1.4  (pirate, Dec 13, 2024)
bab26d6  better base_models separation of concerns  (pirate, Dec 13, 2024)
930b9bf  add archivebox worker cli cmd to list of all cmds  (pirate, Dec 13, 2024)
bd5dd2f  clearer core models separation of concerns using new basemodels  (pirate, Dec 13, 2024)
2a1afcf  move crawl models back into dedicated app  (pirate, Dec 13, 2024)
651ba0b  add new Process model to Machine models  (pirate, Dec 13, 2024)
5c06b8f  add new Event model to workers/models  (pirate, Dec 13, 2024)
c11a1b5  add new worker test  (pirate, Dec 13, 2024)
74e08a1  add filestore migrations  (pirate, Dec 13, 2024)
34e4b48  add example js extractor  (pirate, Dec 13, 2024)
f6d22a3  tweak worker updated logic and add output_dir_template and symlinks l…  (pirate, Dec 13, 2024)
f31adff  Update README.md  (pirate, Dec 15, 2024)
2b77422  remove requirements.txt entirely because people keep trying to run it…  (pirate, Dec 18, 2024)
b4c5004  Update README.md  (pirate, Dec 18, 2024)
c54b944  change docker build to use uv exclusively instead of requirements.txt  (pirate, Dec 18, 2024)
90f511c  Bump Dockerfile.simple to rc51  (pirate, Dec 18, 2024)
0ad1bda  remove old deprecated bin/archive entrypoint  (pirate, Dec 18, 2024)
1e7b1df  move Dockerfile.simple to ArchiveBox/docker-archivebox/README.md  (pirate, Dec 18, 2024)
0985737  clean up Dockerfile  (pirate, Dec 18, 2024)
47a7cab  re-order dockerfile blocks  (pirate, Dec 18, 2024)
54d4d7f  bring image back down to 700mb  (pirate, Dec 18, 2024)
839016b  get docker image down to 630mb  (pirate, Dec 18, 2024)
9ca66c6  fix syntax error in archivebox/core/models.py  (pyrox0, Dec 18, 2024)
db9771c  fix syntax error in archivebox/core/models.py (#1621)  (pirate, Dec 18, 2024)
eee9f67  Update pyproject.toml dependency groups  (pirate, Dec 19, 2024)
7975b47  remove dependencies on unneeded libraries  (pirate, Dec 19, 2024)
8e9ef31  remove dependencies on unneeded libraries in lockfiles  (pirate, Dec 19, 2024)
c5fc406  fix unneeded import  (pirate, Dec 19, 2024)
baa3be7  ignore requirements.txt now that its not needed  (pirate, Dec 19, 2024)
b78e892  update github actions to build docker image  (pirate, Dec 19, 2024)
e862031  use uv to build pip package in github actions instead of pdm  (pirate, Dec 19, 2024)
46f4a90  install needed packages to run archivebox during pip build  (pirate, Dec 19, 2024)
1fb5ecf  change pip flow to use PAT  (pirate, Dec 19, 2024)
3312a34  Fix typo in timestamp scale factor  (1over137, Dec 25, 2024)
b74b0d2  Fix typo in timestamp scale factor (#1627)  (pirate, Dec 26, 2024)
96c5d2f  Update statemachines.py  (pirate, Jan 3, 2025)
a851ad4  Update models.py  (pirate, Jan 3, 2025)
55a347c  Update file_migrations.py  (pirate, Jan 3, 2025)
83bb8a2  Remove outdated architecture diagram  (pirate, Jan 8, 2025)
765abc9  Update pip.yml  (pirate, Jan 8, 2025)
62a99c8  clarify filesystems selections in bug report github template  (pirate, Jan 9, 2025)
b28f2e7  Update 1-bug_report.yml fix markdown formatting  (pirate, Jan 9, 2025)
91eb347  Update 1-bug_report.yml  (pirate, Jan 9, 2025)
7ba7ad6  Update 1-bug_report.yml  (pirate, Jan 9, 2025)
ba5380f  Update 1-bug_report.yml  (pirate, Jan 9, 2025)
b93918f  Update 1-bug_report.yml  (pirate, Jan 9, 2025)
fd21728  Update 1-bug_report.yml  (pirate, Jan 9, 2025)
d1c8acd  Update 1-bug_report.yml  (pirate, Jan 9, 2025)
e1c443a  Update 2-feature_request.yml  (pirate, Jan 9, 2025)
aa55e0d  Update 2-feature_request.yml  (pirate, Jan 9, 2025)
58fc6d9  readwise: fix SOURCES_DIR syntax  (ckiee, Jan 17, 2025)
952bde6  spec-config: fix CONSTANTS import  (ckiee, Jan 17, 2025)
6edcac6  Fix two small errors in abx-{readwise,spec-config} (#1635)  (pirate, Jan 17, 2025)
12f109b  Update docker-compose.yml minor tweaks  (pirate, Jan 18, 2025)
9f4cf0a  Kill the timer process if it doesn't properly terminate.  (benmuth, Feb 3, 2025)
71c02ca  Update archivebox/misc/logging_util.py  (benmuth, Feb 5, 2025)
37c0ea7  Kill the timer process if it doesn't properly terminate. (#1649)  (pirate, Feb 6, 2025)
3ae30c4  Update README.md  (pirate, Feb 13, 2025)
a27a91b  Update README.md  (pirate, Feb 13, 2025)
0043b59  fix(export_browser_history): tilde doesn't expand in quotes  (pcrockett, Feb 16, 2025)
2ff3fc4  feat(export_browser_history): basic arg parsing error message  (pcrockett, Feb 16, 2025)
2e1ac04  feat(export_browser_history): fail script when errors occur  (pcrockett, Feb 16, 2025)
feded9e  fix(export_browser_history): fix sqlite quote syntax error  (pcrockett, Feb 16, 2025)
58bf8d0  feat(export_browser_history): add linux support for firefox  (pcrockett, Feb 16, 2025)
9fbc2d3  fix chrome browser history export on Linux  (pcrockett, Feb 18, 2025)
639aa72  fix typo  (pcrockett, Feb 18, 2025)
ba6a8c2  support XDG standard, search for chrome and chromium DBs  (pcrockett, Feb 18, 2025)
1ab4e06  remove dead competitor links  (pirate, Mar 20, 2025)
d9d67e9  add swag link to funding links  (pirate, Mar 20, 2025)
26eb75e  archivebox swag is now available!  (pirate, Mar 20, 2025)
8b67186  make sure uv is using the right python binary  (pirate, Mar 20, 2025)
d93f32a  fix(export_browser_history): tilde doesn't expand in quotes (#1661)  (pirate, Mar 20, 2025)
f72f047  Add link to Proxmox installer  (NelsonMinar, May 11, 2025)
c302481  Add link to Proxmox installer (#1682)  (pirate, May 19, 2025)
Update README.md
pirate authored Feb 13, 2025
commit 3ae30c43a90c03bd1c417a0f0f306cfa4448921c
README.md: 6 changes (4 additions & 2 deletions)
@@ -20,9 +20,11 @@ curl -fsSL 'https://get.archivebox.io' | bash # (or see pip/brew/Docker instr
 <hr/>
 <br/>

-**ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline.**
+**ArchiveBox is a self-hosted app that lets you preserve content from websites in a variety of formats.**

-Without active preservation effort, everything on the internet eventually disappears or degrades. Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content.
+We aim to make your data immediately useful, and stored in common formats that been around for decades. As output we save standard HTML, PNG, PDF, TXT, JSON, WARC, SQLite, all guaranteed to be readable for decades to come. ArchiveBox also has a CLI, REST API, and webhooks so you can set up integrations with other services.
+
+Without active preservation effort, everything on the internet eventually disappears or degrades.

 *ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data. It can be used to save copies of bookmarks, preserve evidence for legal cases, backup photos from FB/Insta/Flickr or media from YT/Soundcloud/etc., save research papers, and more...*
 <br/>