mscoco-dataset

We aim to generate realistic images from text descriptions using GAN architecture. The network that we have designed is used for image generation for two datasets: MSCOCO and CUBS.

python machine-learning deep-neural-networks deep-learning discriminator generative-adversarial-network gan batch-normalization dcgan sentence cubs loss-functions skip-thought-vectors text-encodings mscoco mscoco-dataset dcgan-tensorflow tensrorflow stack-gan

UpdatedMay 7, 2018
HTML

WuJie1010 /Fine-Grained-Image-Captioning

Star21

The pytorch implementation on “Fine-Grained Image Captioning with Global-Local Discriminative Objective”

multimedia image-captioning mscoco-dataset

UpdatedOct 17, 2019
Python

gautamchitnis /cocoapi

Star18

Clone of COCO API - Dataset @http://cocodataset.org/ - with changes to support Windows build and python3

mscoco mscoco-dataset cocodataset pycocotools

UpdatedJan 14, 2023
Jupyter Notebook

howardyclo /ImageNet2COCO

Star11

A demo for mapping class labels from ImageNet to COCO.

deep-learning detection imagenet one-shot-learning zero-shot-learning mscoco imagenet-dataset mscoco-dataset few-shot-learning

UpdatedMay 7, 2019
Jupyter Notebook

CLT29 /semantic_neighborhoods

Star9

Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]

computer-vision politics retrieval code coco cross-modal doc2vec mscoco mscoco-dataset goodnews cross-modal-retrieval eccv2020 conceptual-captions visual-semantic-embedding

UpdatedAug 22, 2020
Python

canesee-project /Arabic-COCO

Star6

MS COCO captions in Arabic

captions image-captioning arabic mscoco-image-dataset scene-recognition arabic-language mscoco mscoco-dataset

UpdatedOct 7, 2020

Delphboy /karpathy-splits

Star6

Karpathy Splits json files for image captioning

image-caption mscoco-dataset flickr8k-dataset flickr30k karpathy-split

UpdatedApr 4, 2024

Mehrdad93 /Image-captioning-with-RNN-based-attention

Star5

Image caption generation using GRU-based attention mechanism

gru image-captioning attention-mechanism rnn-tensorflow mscoco-dataset

UpdatedAug 15, 2020
Python

ellenzhuwang /implicitOOD

Star5

An end-to-end vision and language model incorporating explicit knowledge graphs and OOD-detection.

deep-learning transformer knowledge-graph multimodal-learning mscoco-dataset visual-question-answering vision-and-language ood-detection

UpdatedMay 3, 2024
Python

Improve this page

Add a description, image, and links to themscoco-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with themscoco-dataset topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mscoco-dataset

Here are 38 public repositories matching this topic...

peteanderson80 /bottom-up-attention

potterhsu /easy-faster-rcnn.pytorch

yalesong /pvse

potterhsu /easy-fpn.pytorch

sercant /mobile-segmentation

RoyalSkye /Image-Caption

Cheng-Lin-Li /SegCaps

zarzouram /image_captioning_with_transformers

peteanderson80 /coco-caption

brunobelloni /binary-to-coco-json-converter

Wentong-DST /self-critical

ayansengupta17 /GAN