omniparser
Here are 13 public repositories matching this topic...
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Updated Dec 12, 2025 - Python
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Updated Feb 18, 2026 - Python
OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.
Updated Apr 8, 2025 - Python
AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
Updated Oct 1, 2025 - TypeScript
Cappuccino is a GUI agent that works from the desktop screen. It is a Manus-like AI agent that can be deployed locally.
Updated Feb 2, 2026 - Python
Docker implementation of the OmniParser screen parsing tool
Updated Jul 2, 2025
Placeholder for Omniparser Schemas used by universal-etl-parser
Updated Feb 16, 2026
Effortless Deployment and Integration for SOTA Screenshot Parsing and Action Models
Updated Feb 18, 2025
🤖 Control your Gemini computer effortlessly with a unified console for browser automation, desktop management, and intelligent task execution.
Updated Feb 20, 2026 - Python
Computer Use Agent Using ADK
Updated Feb 17, 2026 - Python