
pdf-processing

Here are 129 public repositories matching this topic...

Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.

  • Updated Jul 21, 2023
  • TypeScript

library supporting NLP and CV research on scientific papers

  • Updated Nov 8, 2024
  • Python
PDFs-TextExtract

document-processing-pipeline-for-regulated-industries

Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion

  • Updated Jul 3, 2025
  • Python

Python scripts that convert PDF files to text, split them into chunks, and store their vector representations in a Chroma DB using GPT4All embeddings. A separate script queries the Chroma DB for similarity search based on user input.

  • Updated Oct 23, 2023
  • Python

An npm package built on top of pdf-lib that provides functionality such as merging, rotating, and splitting PDFs, downloading them to disk, and more.

  • Updated Oct 31, 2023
  • JavaScript

LangGraphRAG: A terminal-based Retrieval-Augmented Generation system using LangGraph. Features include message history caching, query transformation, and vector database retrieval. Ideal for NLP researchers and developers working on advanced conversational AI and information retrieval systems.

  • Updated Jul 13, 2024
  • Python

An all-in-one GUI management toolkit built with PyQt6, offering a suite of tools for file synchronization, media organization, PDF merging, code formatting, and more.

  • Updated Mar 15, 2025
  • Python

AI-powered RAG-based tool for summarizing, extracting insights, and answering questions about research papers with high accuracy

  • Updated Mar 20, 2025
  • HTML

📚 AI-Powered Book PDF Knowledge Extractor & Summarizer. Transform your PDF books into structured knowledge effortlessly! This tool uses AI to analyze books page by page, extracting key insights, definitions, and concepts, and organizes them into Markdown summaries for easier study.

  • Updated Jan 2, 2025
  • Python

The Document Summarizer leverages Hugging Face’s facebook/bart-large-cnn model to transform lengthy documents into concise summaries. Built with ReactJS (Vite) for the frontend and Flask for the backend, it supports PDF and text files, offering real-time summarization for researchers, students, and professionals.

  • Updated Dec 7, 2024
  • JavaScript

Polymind is a powerful multi-modal Telegram bot built with Gemini, DeepSeek, OpenRouter, and over 50 cutting-edge AI models. It offers seamless conversational intelligence, Mermaid diagram rendering, PDF/DOCX analysis, image generation, and collaborative tools—all in a single bot interface.

  • Updated Jul 15, 2025
  • Python

PdfSnipper is a lightweight and efficient Python package designed to simplify the management of PDF files, pages, and their conversions during various NLP, Computer Vision (CV), or other data processing tasks. The package eliminates the need for repetitive code by providing intuitive, ready-to-use functions for common PDF-related operations.

  • Updated Feb 3, 2025
  • Python

A side project for easily collecting and annotating questions and answers for the PsychometryBot project DB using computer vision and PDF parsing.

  • Updated Sep 18, 2022
  • Python

Some useful mini projects that I worked on while teaching myself Python programming.

  • Updated May 20, 2024
  • Python

A Streamlit-based app for asking questions directly from uploaded documents using Gemini embeddings and a language model. Supports PDF, TXT, and DOCX files. Fast, simple, and powerful document-based QA.

  • Updated Jul 18, 2025
  • Jupyter Notebook

Azure Document Intelligence Result Processor: A toolset for annotating PDFs based on Azure Document Intelligence analysis results, featuring a React web application and a standalone Python script for processing and visualizing extracted data with confidence indicators.

  • Updated Mar 12, 2025
  • JavaScript
