pymupdf
Here are 200 public repositories matching this topic...
Language:All
Sort:Most stars
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
- Updated
Dec 18, 2025 - Python
Open source Python library for converting PDF to DOCX.
- Updated
May 28, 2025 - Python
(eBook,PDFs Translation) A multilingual eBook processing tool supporting all eBook formats. Features online and offline translation while preserving original layouts. Compatible with both scanned and digital PDFs. Elegant user interface. The world's highest-performing open-source layout-preserving eBook translator.
- Updated
Sep 28, 2025 - Python
High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic for scalable extraction.
- Updated
Nov 29, 2025 - Python
A CLI toolset to generate table of contents for PDF files automatically.
- Updated
Nov 26, 2023 - Python
Demos, examples and utilities using PyMuPDF
- Updated
Jul 1, 2024 - Jupyter Notebook
Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG
- Updated
May 26, 2024 - Python
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
- Updated
Aug 29, 2025 - Python
A Pure Python PDFViewer, which provides functionalities same as other famous PDFViewers.
- Updated
Jul 14, 2023 - Python
Smart PDF to Markdown converter with intelligent heading detection, automatic header/footer removal, orphan fragment merging, and image export. Features a user-friendly GUI with preview mode, persistent settings, and per-page error recovery. Optimized for Obsidian and other Markdown-based note-taking workflows.
- Updated
Dec 1, 2025 - Python
In this code, a simple implementation of PDF to audio converter is shown
- Updated
Mar 30, 2021 - Python
A powerful PDF processing engine that deconstructs documents into their core elements—text, images, and tables—and seamlessly reconstructs them into pristine, structured Markdown. Built with a React frontend and a robust Python (PyMuPDF) backend on Appwrite.
- Updated
Sep 10, 2025 - Python
pdfgui_tools is a user interface tool developed in Qt and Python that integrates with poppler-utils and PyPDF2 for PDF document management. It's a simple and user-friendly tool that includes various utilities.
- Updated
Feb 5, 2024 - Python
AI-Powered Thesis Review Tool
- Updated
Aug 8, 2025 - Python
Fills the lack of an open-source PDF Editor with the capability to draw and add notes
- Updated
Jun 17, 2024 - Python
Improve this page
Add a description, image, and links to thepymupdf topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thepymupdf topic, visit your repo's landing page and select "manage topics."