Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

document-understanding

Here are 51 public repositories matching this topic...

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

  • UpdatedDec 17, 2025
  • Python

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

  • UpdatedMay 30, 2025
  • Python

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

  • UpdatedJul 25, 2024
  • Python

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

  • UpdatedOct 31, 2022
  • Python
document-ai-samples

Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

  • UpdatedNov 17, 2025
  • Jupyter Notebook

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

  • UpdatedSep 9, 2024

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

  • UpdatedMar 1, 2025

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

  • UpdatedApr 3, 2024
  • Python

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

  • UpdatedJan 13, 2025
  • Jupyter Notebook

Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.

  • UpdatedOct 18, 2025
  • Jupyter Notebook

ReadingBank: A Benchmark Dataset for Reading Order Detection

  • UpdatedAug 26, 2024

Object Detection Model for Scanned Documents

  • UpdatedMar 6, 2025
  • Jupyter Notebook

Checkbox Detection Model for Scanned Documents

  • UpdatedMar 6, 2025
  • Jupyter Notebook
Snappy

🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐

  • UpdatedDec 15, 2025
  • Python

Datasets and Evaluation Scripts for CompHRDoc

  • UpdatedFeb 25, 2025
  • Python

3DCF / doc2dataset: token-efficient document layer with NumGuard numeric integrity and multi-framework exports for RAG & fine-tuning.

  • UpdatedDec 7, 2025
  • Rust

Improve this page

Add a description, image, and links to thedocument-understanding topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thedocument-understanding topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp