- Notifications
You must be signed in to change notification settings - Fork0
📎 Using scanners and OCR to grep paper documents the easy way
License
STudio26/paperwork
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Paperwork is a personal document manager for scanned documents (and PDFs).
It's designed to be easy and fast to use. The idea behind Paperworkis "scan & forget": You should be able to just scan a new document andforget about it until the day you need it again.
In other words, let the machine do most of the work for you.
- Scan
- Automatic detection of page orientation
- OCR
- Document labels
- Automatic guessing of the labels to apply on new documents
- Search
- Keyword suggestions
- Quick edit of scans
- PDF support
Note regarding updates:If you're upgrading from a previous version installed with pip, it is strongly recommended you uninstallit first before installing the new version.
- GNU/Linux Archlinux
- GNU/Linux Debian
- GNU/Linux Fedora
- GNU/Linux Gentoo
- GNU/Linux Ubuntu
- Docker
- GNU/Linux Development
- Windows:Installation /Development
If you used the installer fromOpenPaper, Paperwork can be uninstalled like anyother Windows application (something like Control Panel --> Applications --> Uninstall).
If you installed it manually (for development), you can follow the same process than forGNU/Linux
- Extra documentation / FAQ / Tips / Wiki
- Mailing-list (be careful:By default, Google groups set your subscription to "no email")
- Bug trackers
Papers are organized into documents. Each document contains pages.
It mainly uses:
- Sane/Pyinsane: To scan the pages
- Tesseract/Pyocr: To extract the words from the pages (OCR)
- GTK: For the user interface
- Whoosh: To index and search documents, and provide keyword suggestions
- Simplebayes: To guess the labels
- Pillow/Libpillowfight: Image manipulation
Paperwork has automated tests. These tests are designed in a unusual manner:They use screenshots (aka "May work or may not work. Do you feel lucky today ?").Because these screenshots change often and are heavy (compared to source files),they are stored in a separate Git repository:paperwork-tests
GPLv3 or later. See COPYING.
All the information can be found onthe wiki
About
📎 Using scanners and OCR to grep paper documents the easy way
Resources
License
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Packages0
Languages
- Python99.1%
- Other0.9%