- Notifications
You must be signed in to change notification settings - Fork0
OCR: A Swift CLI tool using Apple's Vision framework for versatile text recognition on macOS.
License
az5app/swift-ocr-cli
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
swift-ocr-cli is a Swift command-line tool that leverages Apple'sVision framework (macOS only) to perform Optical Character Recognition (OCR) on images. This tool supports multiple input methods and output formats, making it flexible for various use cases.
$ swift-ocr-cli-arm-mac ./docs/images/test.png ko,en --json
{"text" :"[EPA 및 DHA 함유 유지] 혈중 중성지질 개선\n• 혈행 개선에 도움을 줄 수 있음, 기억력 개선\n에 동장에 도수 있 수 건조한 눈을 개선하여\n[비타민트 항산화 작용을 하여 유해산소로부터\n세포를 보호하는데 필요\n1일 섭취량 : 1캡슐(1,200 mg)\n1일 섭취량 당\n함량\n%영양성분기준치\n열량\n10 kcal\n탄수화물\n0 g\n단백질\n지방\n0 g\n1.2 g\n나트륨\n0 mg\nEPA 와 DHA의 합 900 mg\n비타민E\n3.4 mg a-TE\n0 %\n0 %\n2%\n0 %\n31 %\n※ %영양성분기준치 : 1일 영양성분기준치에 대한 비율"}$ swift-ocr-cli-arm-mac ./docs/images/test.png ko,en --json --coordinateYou can easily install swift-ocr-cli usingHomebrew:
brew tap az5app/tapbrew install swift-ocr
Multiple Input Forms:
Accepts either an image file path or a base64-encoded image string.Output Formats:
- Plain Text: Simply outputs the recognized text.
- JSON Format: Use the
--jsonflag to receive coordinates in JSON.
Coordinates Mode:
Use the--coordinateflag to get recognized text along with bounding box coordinates.
The output includes:- Recognized text.
- Bounding box dimensions (
widthandheight). - A default confidence score.
- Pixel coordinates (
x,y) for the top-left corner of the bounding box.
- macOS: Version 10.15 or later
- Xcode: Version 13 or later (to build Swift tools using Vision)
- Swift: Version 5.5 or later
Clone the repository and change to the project directory:
git clone<REPO_URL> swift-ocr-clicd swift-ocr-cli
Build the project using Swift Package Manager:
swift build
Run the tool with the following syntax:
swift run swift-ocr-cli<imageFilePath or base64Str> [recognitionLanguages] [--coordinate] [--json]
Basic Text Recognition (defaults to English):
swift run swift-ocr-cli /path/to/image.jpg
Specifying Custom Recognition Languages:
swift run swift-ocr-cli /path/to/image.jpg ko-KR,en-US
Coordinate Mode with JSON Output:
swift run swift-ocr-cli /path/to/image.jpg --coordinate --json
Using Base64 Input:
swift run swift-ocr-cli<base64EncodedImageString> --coordinate
To build release versions for different architectures, use the provided scripts.
./scripts/release.sh
For arm64:
swift build -c release --arch arm64 --build-path .build/arm64
For x86_64:
swift build -c release --arch x86_64 --build-path .build/x86_64
A sample test is included which uses a base64-encoded image. Run the tests with:
swifttestAbout
OCR: A Swift CLI tool using Apple's Vision framework for versatile text recognition on macOS.
Topics
Resources
License
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Packages0
Uh oh!
There was an error while loading.Please reload this page.
Contributors2
Uh oh!
There was an error while loading.Please reload this page.
