document-layout-analysis
Here are 36 public repositories matching this topic...
A Unified Toolkit for Deep Learning Based Document Image Analysis
- Updated
Aug 15, 2024 - Python
A Repo For Document AI
- Updated
Jul 17, 2025 - Python
A curated list of resources for Document Understanding (DU) topic
- Updated
Jun 2, 2023
📚 Process PDFs, Word documents and more with spaCy
- Updated
Mar 8, 2025 - Python
Document Layout Analysis resources and repos for development with PdfPig.
- Updated
Oct 1, 2023 - C#
Document Layout Analysis
- Updated
Jun 12, 2025 - Python
Page to PAGE Layout Analysis Tool
- Updated
Jan 17, 2022 - Python
Detectron2 for Document Layout Analysis
- Updated
Aug 2, 2024 - Python
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
- Updated
May 11, 2021 - Python
Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pipelines (GenAI, LLM, VLLM) into your applications, supporting various tasks such as document cleanup, optical character recognition (OCR), classification, splitting, named entity recognition, and form processing
- Updated
Jul 18, 2025 - Python
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
- Updated
Sep 9, 2024 - Python
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
- Updated
Apr 16, 2023 - Python
Tools for extracting figures, tables, text, .. from a PDF document.
- Updated
Nov 25, 2020
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a PDF document page as one of title, text, list, table, or image.
- Updated
Mar 16, 2020 - C#
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
- Updated
Apr 16, 2023 - Python
BoundaryNet - A Semi-Automatic Layout Annotation Tool
- Updated
Dec 11, 2021 - Python
Simple docker deployment of document layout analysis using detectron2
- Updated
Nov 7, 2021 - JavaScript
Using a MaskRCNN model trained on the PubLayNet dataset with ML.NET in C# / .NET for document layout analysis and page segmentation tasks.
- Updated
May 13, 2023 - C#
GloSAT Historical Measurement Table Dataset
- Updated
Feb 14, 2025 - Python