mscoco-dataset
Here are 38 public repositories matching this topic...
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
- Updated Feb 3, 2023 - Jupyter Notebook
An easy implementation of Faster R-CNN (https://arxiv.org/pdf/1506.01497.pdf) in PyTorch.
- Updated Jul 3, 2020 - Python
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
- Updated Mar 15, 2024 - Python
An easy implementation of FPN (https://arxiv.org/pdf/1612.03144.pdf) in PyTorch.
- Updated Jan 24, 2019 - Python
Real-time semantic image segmentation on mobile devices
- Updated Mar 24, 2023 - Python
Image captioning with an LSTM or Transformer in PyTorch
- Updated Jul 20, 2021 - Python
A clone of the original SegCaps source code with enhancements for the MS COCO dataset.
- Updated Aug 28, 2018 - Jupyter Notebook
PyTorch implementation of image captioning using a transformer-based model.
- Updated Apr 13, 2023 - Jupyter Notebook
Adds the SPICE metric to the coco-caption evaluation server code
- Updated Feb 2, 2023 - Jupyter Notebook
Convert segmentation binary mask images to COCO JSON format.
- Updated Sep 13, 2022 - Python
PyTorch implementation of the paper "Self-critical Sequence Training for Image Captioning"
- Updated Apr 8, 2023 - Python
We aim to generate realistic images from text descriptions using a GAN architecture. The network we designed generates images for two datasets: MSCOCO and CUBS.
- Updated May 7, 2018 - HTML
PyTorch implementation of “Fine-Grained Image Captioning with Global-Local Discriminative Objective”
- Updated Oct 17, 2019 - Python
Clone of the COCO API (dataset at http://cocodataset.org/) with changes to support Windows builds and Python 3
- Updated Jan 14, 2023 - Jupyter Notebook
A demo for mapping class labels from ImageNet to COCO.
- Updated May 7, 2019 - Jupyter Notebook
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]
- Updated Aug 22, 2020 - Python
MS COCO captions in Arabic
- Updated Oct 7, 2020
Karpathy split JSON files for image captioning
- Updated Apr 4, 2024
Image caption generation using GRU-based attention mechanism
- Updated Aug 15, 2020 - Python
An end-to-end vision and language model incorporating explicit knowledge graphs and OOD-detection.
- Updated May 3, 2024 - Python