- Notifications
You must be signed in to change notification settings - Fork22
Optical character recognition in manga images. Manga OCR desktop application
License
blueaxis/Poricom
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Optical character recognition in manga images. Manga OCR desktop application
Poricom is a desktop program for optical character recognition in manga images. Although it is a manga OCR application, it can recognize text on other type of images as well. The project is a GUI implementation of theManga OCR library (supports Japanese only) and the Tesseract-API python wrappertesserocr (supports other languages). See demo below to see how it works.
Detect text on locally stored manga images:
sample_usage.mp4
Perform OCR on the current screen by pressingAlt+Q
:
external_capture.mp4
- Cloe - The app is based on Poricom's global snipping functionality. If you downloaded Poricom to useonly the global shortcut, it might be better if you use Cloe instead.
- mokuro - Converts manga images to web pages with selectable text. This saves you time and manual effort since textboxes are automatically detected with an almost 100% accuracy.
Follow the installation instructionshere. Load a directory with manga images and select text boxes with Japanese text. If you are not getting good results using the default settings,use the MangaOcr model to improve text detection.
Listed below are some of the features of Poricom. Smaller features that are not covered in this section are mentioned in thechangelog. Click the arrow to see how each implemented feature works.
Open a directory with manga images or a supported manga file (cbz, cbr, pdf) and start scanning text bubbles.
manga-text-detection.mp4
Capture images outside the application using the shortcut `Alt+Q`.
external_capture.mp4
Load MangaOcr model to improve Japanese text recognition.
load-mangaocr-model.mp4
Change language and/or orientation (limited to the Tesseract API).
language-orientation-option.mp4
Detect text on non-manga images.
non-manga-detection.mp4
Download the latest zip filehere. Decompress the file in the desired directory. Make sure that theapp
folder is in the same folder as the shortcutPoricom
.
Recommended:
- Hard drive: at least 800 MB HD space
- RAM: at least 2 GB
Approximately 250 MB of free space and 200 MB of memory is needed to run the application using the Tesseract API. If using the Manga OCR model, an additional 450 MB of free space and 800 MB of memory is required.
- Clone this repo and installconda.
- Install dependencies by running
conda env create -f environment/base.yaml
. - Activate the environment with
conda activate poricom-py39
and run the app usingpython main.py
. - If you want to build the app locally, install build dependencies by running
conda env update -f environment/build.yaml
. Then runpyinstaller main.spec
in thebuild
directory.
This project will not be possible without the MangaOcr model byMaciej Budyś and the Tesseract python wrapper bysirfz andthe tesserocr contributors.
The software is licensed under GPLv3 (seeLICENSE) and uses third party libraries that are distributed under their own terms (seeLICENSE-3RD-PARTY).
The icons used in this project are fromIcons8.
About
Optical character recognition in manga images. Manga OCR desktop application
Topics
Resources
License
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Uh oh!
There was an error while loading.Please reload this page.
Contributors3
Uh oh!
There was an error while loading.Please reload this page.