grounding
Here are 47 public repositories matching this topic...
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle enables agents to ace any computer task through strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
Updated Nov 7, 2024 - Python
awesome grounding: A curated list of research papers in visual grounding
Updated Apr 9, 2023
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Updated Jun 7, 2024 - Python
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Updated Feb 14, 2025 - Python
CLIPort: What and Where Pathways for Robotic Manipulation
Updated Nov 2, 2023 - Jupyter Notebook
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
Updated Mar 19, 2024 - Python
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Updated Jan 2, 2024 - Python
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Updated Aug 23, 2024 - Python
[TPAMI reviewing] Towards Visual Grounding: A Survey
Updated Mar 21, 2025 - Shell
Official implementation of the ICCV19 oral paper "Zero-Shot Grounding of Objects from Natural Language Queries" (https://arxiv.org/abs/1908.07129)
Updated Apr 22, 2020 - Python
Hierarchical Universal Language Conditioned Policies
Updated Mar 19, 2024 - Python
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
Updated Jun 10, 2020 - Python
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Updated Aug 17, 2021 - Python
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Updated Feb 9, 2024 - Python
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
Updated Oct 29, 2023 - Python
Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"
Updated Jul 4, 2018 - Python
This is the official implementation for our paper, "LAR: Look Around and Refer".
Updated Dec 1, 2022 - C++