structured-data-extraction
Here are 7 public repositories matching this topic...
Language:All
Simplifies the retrieval, extraction, and training of structured data from various unstructured sources.
- Updated
Oct 22, 2025 - Python
A ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing, classification, and information extraction from English HTML content.
- Updated
Jul 14, 2023 - Ruby
Structured data extraction from research literature
- Updated
Aug 2, 2025 - Python
find a template of many similar html files
- Updated
Nov 26, 2022 - JavaScript
Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.
- Updated
Aug 14, 2025 - Python
A Python-based tool for extracting structured data from PDFs using OCR and regex, and exporting it to CSV. Ideal for processing invoices, logs, or scanned documents into organized, usable datasets.
- Updated
Oct 30, 2024 - Jupyter Notebook
AI-powered assistant for analyzing Engineering Change Orders (ECOs) using Google Gemini and RAG
- Updated
May 25, 2025 - Jupyter Notebook
Improve this page
Add a description, image, and links to thestructured-data-extraction topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thestructured-data-extraction topic, visit your repo's landing page and select "manage topics."