ocr
Here are 5,425 public repositories matching this topic...
Tesseract Open Source OCR Engine (main repository)
Updated Feb 12, 2025 · C++
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment across server, mobile, embedded, and IoT devices)
Updated Mar 18, 2025 · Python
Pure JavaScript OCR for more than 100 languages 📖🎉🖥
Updated Mar 3, 2025 · JavaScript
Privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Go.
Updated Mar 18, 2025 · TypeScript
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
Updated Feb 20, 2025 · C#
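Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual language packs.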
Updated Mar 14, 2025 · Python
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
Updated Mar 13, 2025 · Python
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Updated Sep 24, 2024 · Python
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Updated Mar 18, 2025 · Python
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Updated Feb 27, 2025 · Python
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Updated Jan 18, 2025 · Python
🌈 A cross-platform application for selected-text translation and OCR.
Updated Jan 27, 2025 · JavaScript
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Updated Mar 18, 2025 · HTML
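A concise and elegant dictionary and translator macOS app for looking up words and translating text. Works out of the box, with offline OCR and support for Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, NiuTrans, Caiyun, and Volcano translation.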
Updated Mar 14, 2025 · Objective-C
BISHENG is an open LLM DevOps platform for next-generation enterprise AI applications. Its features include GenAI workflow, RAG, Agent, unified model management, evaluation, SFT, dataset management, enterprise-level system management, observability, and more.
Updated Mar 18, 2025 · Python