controllable-image-captioning
Here are 4 public repositories matching this topic...
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.https://huggingface.co/spaces/TencentARC/Caption-Anythinghttps://huggingface.co/spaces/VIPLab/Caption-Anything
- Updated
Aug 29, 2023 - Python
A length-controllable and non-autoregressive image captioning model.
- Updated
Jun 10, 2021 - Python
PyTorch implementation of a Controllable Image Captioning model with a language-driven mechanism for advancing the region pointer state that keeps it in sync with the state of the language model. Code for the paper Language-Driven Region Pointer Advancement for Controllable Image Captioning (Lindh et al., 2020).
- Updated
Oct 1, 2021 - Python
Pipeline model for controllable image captioning with user preference settings. Code and model output for the paper Show, Prefer and Tell: Incorporating User Preferences into Image Captioning (Lindh et al., 2023).
- Updated
May 25, 2025 - Python
Improve this page
Add a description, image, and links to thecontrollable-image-captioning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thecontrollable-image-captioning topic, visit your repo's landing page and select "manage topics."