#
layout-parsing
Here are 3 public repositories matching this topic...
Language:All
Filter by language
Improved file parsing for LLM’s
- Updated
Nov 13, 2024 - Python
A lightweight Python library for metadata-rich document chunking in Retrieval-Augmented Generation (RAG) workflows. It leverages Azure AI Document Intelligence to enhance chunking by retaining hierarchical structure, page numbers, and bounding boxes for seamless integration with PDF viewers.
reactpythonagentazurechunkingagentsunstructured-dataragproduction-gradereact-pdf-viewerlayout-parserllmlangchainretrieval-augmented-generationazure-ai-searchazure-ai-document-intelligencelayout-parsingdocument-chunking
- Updated
Jan 11, 2025 - Python
--UNDER CONSTRUCTION-- (Undergrad Research) Exploring layout parsing capabilities in Python
- Updated
Nov 11, 2024 - Jupyter Notebook
Improve this page
Add a description, image, and links to thelayout-parsing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thelayout-parsing topic, visit your repo's landing page and select "manage topics."