pdf-ocr

An out-of-the-box local Web UI for DeepSeek-OCR. Built with FastAPI + Vue.js, it supports PDF/Image uploads, progress tracking, and result visualization with bounding boxes. Easily experience the power of a top-tier OCR model.

ocr computer-vision text-recognition research-tool image-to-text optical-character-recognition document-analysis multimodal math-ocr latex-ocr llm large-language-model pdf-ocr deepseek deepseek-ocr ocr-webservice

Updated Dec 6, 2025
Python

jiangnanboy/JiaJiaOCR

Star 69

Extends the existing general text recognition with handwriting OCR, layout detection, and table detection and recognition, covering printed text, handwritten text, and document structure analysis.

ocr layout handwriting-recognition java-ocr table-ocr pdf-ocr

Updated Jan 7, 2026
Java

vorojar/Folio-OCR

Star 50

Open-source batch OCR workbench: a free, local alternative to ABBYY FineReader. Powered by Ollama + GLM-OCR + PP-DocLayoutV3, ~0.5s/page on RTX 4090. Three-panel editor, layout-aware, PDF/image batch processing, Markdown/Word export. Suited to digitizing books and documents.

privacy ocr offline book-digitization document-processing document-ocr layout-detection markdown-export pdf-ocr local-ai ollama batch-ocr glm-ocr abbyy-alternative

Updated Feb 7, 2026
JavaScript

ahnafnafee/local-llm-pdf-ocr

Star 16

Convert scanned PDFs into searchable text locally using Vision LLMs (olmOCR). 100% private, offline, and free. Features a modern Web UI & CLI.

python ocr web-ui document-processing fastapi privacy-focused searchable-pdf no-api-key pdf-ocr local-llm offline-ai surya-ocr olmocr vision-llm

Updated Dec 23, 2025
Python

Achiwilms/OCR-Wizard

Star 15

A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.

python pdf ocrmypdf ocr-recognition pdf-ocr-extraction ocr-python searchable-pdf ocr-pdf pdf-ocr

Updated Oct 28, 2023
Python

AzozzALFiras/Pdf-OCR

Star 6

A simple, free tool for extracting text from scanned PDFs and images using OCR, and converting images to PDFs. It processes files locally in the browser, ensuring privacy and security while enabling users to effortlessly convert documents and images into editable text or PDF format.

ocr pdf2txt azozzalfiras pdf2text image-ocr pdf-ocr

Updated Jan 15, 2025
HTML

am009/LLM-online-tool

Star 5

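LLM-based PDF OCR tool and Markdown/LaTeX article translation tool. Supports paragraph-by-paragraph translation and direct proofreading. Supports math formulas. Built on large language model (LLM) APIs.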

ocr translation llm pdf-ocr

Updated Feb 13, 2026
JavaScript

R0mb0/PDF_accessibility_fixer

Star 4

Client-side tool to check and fix PDF accessibility. Analyze PDFs for text layer accessibility, detect image-only pages, and rebuild selectable text layers with browser-based OCR—no server or backend required. Perfect for privacy-first and legacy environments.

javascript css html pdf ocr tesseract-ocr pdf-js pdf-lib tesseract-js italian-developers r0mb0 pdf-lib-js pdf-ocr accessibility-pdf pdf-worker-js pdf-fix pdf-accessibility-fixer

Updated Dec 10, 2025
JavaScript

Marcello2020-dev/vision-ocr-pdf-toolkit

Star 3

Open-source macOS PDF toolkit: merge PDFs and create searchable OCR PDFs via Apple VisionKit (optional OCRmyPDF deskew).

macos swift pdf ocr tesseract vision pdfkit ocrmypdf pdf-merger deskew pdf-tools swiftui pdf-ocr

Updated Feb 11, 2026
Swift

antonioanerao/ocr_pdf_api

Star 3

OCR on PDF files

python ocr tesseract pdf-ocr

Updated Sep 30, 2025
Python

thiagoaramizo/file-to-md

Star 2

A document processing service designed to extract structured text (Markdown) from various file formats using OCR (Tesseract) and native parsers.

api markdown pdf ocr microservice image-ocr llm file-extraction pdf-ocr

Updated Jan 27, 2026
Python

mcagriaksoy/diff_merge_pdf

Star 2

A tool to compare and merge PDFs, display the differences between them, and run OCR on them.

pdf-viewer pdf-generator pdf-merger ocr-recognition pdf-comparison x-ray-images ocr-text-reader diff-tool pdf-document-processor pdf-ocr-extraction pyqt6-desktop-application pymupdf-fitz pdf-ocr pdf-visual-testing diff-tool-pdf

Updated Jan 21, 2024
Python

KuchikiRenji/pypdftotext

Star 1

OCR-enabled PDF text extraction in Python with pypdf and Azure Document Intelligence.

python pdf ocr aws-s3 text-extraction pdf-parsing pypdf document-intelligence pdf-ocr azure-document-intelligence

Updated Jan 31, 2026
Python

GPicy/gpicy

Star 1

GPicy: AI-driven image processing for your occasional needs.

ocr image-processing text-recognition image-recognition image-to-text object-removal multilanguage-support background-removal pdf-ocr image-enlargement photo-colorization portrait-restoration ai-artificial-intelligence

Updated Apr 1, 2024

hz01/pdf-ocr-llm

Star 1

PDF to Markdown OCR using vision-language models with multi-GPU support

ocr pdf-to-text pdf-ocr pdf-to-markdown

Updated Oct 17, 2025
Python

ridpath/pdfscapel

Star 1

PDFScalpel is a forensic PDF analysis and CTF toolkit for security researchers, digital forensics analysts, and penetration testers, providing deep insight into PDF structure, encryption, malware, steganography, metadata, revisions, and document authenticity.

pdf adobe pentesting watermark ctf-tools pdf-ocr-extraction pdf-malware pdf-password watermark-remover pdf-processing redteam-tools watermark-removal pdf-ocr pdf-password-remover watermark-tool pdf-forensic-tool

Updated Feb 3, 2026
Python

outman/ocr-ocr-support

Star 1

ocr ocr-recognition ocr-pdf image-ocr pdf-ocr local-ocr ocr-local ocr-image

Updated Feb 16, 2026
Svelte

Here are 26 public repositories matching this topic...

Stirling-Tools/Stirling-PDF

alam00000/bentopdf

Haste171/langchain-chatbot

Cross2pro/DeepSeek-OCR-Dashboard
