pymupdf

(eBook，PDFs Translation) A multilingual eBook processing tool supporting all eBook formats. Features online and offline translation while preserving original layouts. Compatible with both scanned and digital PDFs. Elegant user interface. The world's highest-performing open-source layout-preserving eBook translator.

pdf latex translation math ebook formulas pymupdf openai-api deepseek

UpdatedMar 16, 2025
Python

lucasrla /remarks

Star366

Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG

markdown pdf ocr highlighting annotations pdf-converter epub zotero obsidian ocrmypdf svg-images pymupdf remarkable-tablet roamresearch

UpdatedMay 26, 2024
Python

Zain-Bin-Arshad /pdf-viewer

Star82

A Pure Python PDFViewer, which provides functionalities same as other famous PDFViewers.

python pdf pdf-viewer pure-python fitz pymupdf python-pdf pysimplegui pdf-viewer-python

UpdatedJul 14, 2023
Python

vb64 /markdown-pdf

Star72

Markdown to pdf renderer

markdown pdf markdown-it pymupdf

UpdatedMar 15, 2025
Python

devxzh /PDFTools

Star59

基于pyqt5, pymupdf实现的批量添加目录书签，增强pdf，拆分合并pdf的小工具

pdf bookmark pyqt5 pdf-merge pdf-split pymupdf add-catalog

UpdatedAug 5, 2021
Python

shayanalibhatti /Designing-a-PDF-Audiobook-using-Python

Star48

In this code, a simple implementation of PDF to audio converter is shown

python python3 pdf-reader audio-converter gtts pytesseract pymupdf pdf-to-audio pdf-text pytesseract-ocr

UpdatedMar 30, 2021
Python

benitomartin /multimodal-llm-pymupdf4llm

Sponsor

Star35

Multimodal LLM Application with PyMuPDF4LLM

python openai pymupdf qdrant llama-index

UpdatedOct 4, 2024
Jupyter Notebook

genieincodebottle /parsemypdf

Star35

Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

openai claude camelot pymupdf pypdf markitdown llama-parse unstructured-io docling llama-vision

UpdatedMar 3, 2025
Python

xxao /pero

Star34

Unified Python drawing API

visualization python svg drawing pyqt5 pyside2 wxpython pymupdf pycairo pyqt6 pyside6

UpdatedFeb 1, 2025
Python

TheWatcherMultiversal /pdfgui_tools

Star34

pdfgui_tools is a user interface tool developed in Qt and Python that integrates with poppler-utils and PyPDF2 for PDF document management. It's a simple and user-friendly tool that includes various utilities.

linux pdf gnu-linux python3 pdf-document pypdf2 pymupdf qt6 pyside6 poppler-utils

UpdatedFeb 5, 2024
Python

pymupdf /PyMuPDF-Optional-Material

Star16

Help file downloads, early ZIP binaries, wheels for retired Python 2.7, 3.5.

python windows pdf mupdf fitz pymupdf

UpdatedApr 3, 2022

stroblme /UNote

Star14

Fills the lack of an open-source PDF Editor with the capability to draw and add notes

editor lightweight pdf dark-theme viewer draw note create freehand annotate productive pymupdf handwrite

UpdatedJun 17, 2024
Python

lheredias /Luftmensch

Star11

Useful PDF-related productivity tool.

python pdf automation pyqt5 gui-application web-scraping windows-desktop pdfa pdf-merger pymupdf pdf-compression pysimplegui image-to-pdf-converter pdf-combiner luftmensch pdf-to-pdfa

UpdatedOct 12, 2021
Python

gautam132002 /invoice-pdf-data-extraction

Star9

Automated extraction of specific information from invoices, achieving over 95% accuracy.

python automation data-extraction pdf-data-extraction pymupdf

UpdatedJul 14, 2023
Python

politikundbildung /kindle_to_pdf

Star8

Creates PDF annotations from Kindle clippings

python-script kindle pymupdf

UpdatedDec 20, 2022
Python

yte9pc /Internet-Archive-PDF-Capstone

Star6

UVA Data Science Capstone project for Internet Archive. This project aimed to classify PDFs as research or non-research documents using an image and text-based approach. For the image-based models, we leveraged CNN transfer learning and used XGBoost for text-based approach.

pdf deep-learning pipeline tensorflow image-processing internet-archive transfer-learning pymupdf cnn-classification

UpdatedMay 7, 2021
Jupyter Notebook

Improve this page

Add a description, image, and links to thepymupdf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thepymupdf topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly