omniparser
Here are 11 public repositories matching this topic...
Language:All
Sort:Most stars
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
- Updated
Jun 11, 2025 - Python
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
- Updated
Mar 16, 2025 - Python
OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.
- Updated
Apr 8, 2025 - Python
AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
- Updated
Jun 26, 2025 - TypeScript
Cappuccino is an GUI Agent based on desktop screen. It is a Manus-like AI Agent that can be deployed locally.
- Updated
Mar 26, 2025 - Python
Docker implementation of the OmniParser screen parsing tool
- Updated
Jul 2, 2025
Placeholder for Omniparser Schemas used by universal-etl-parser
- Updated
Jun 16, 2025
Effortless Deployment and Integration for SOTA Screenshot Parsing and Action Models
- Updated
Feb 18, 2025
Improve this page
Add a description, image, and links to theomniparser topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theomniparser topic, visit your repo's landing page and select "manage topics."