[ICLR 2021, Spotlight] Large Scale Image Completion via Co-Modulated Generative Adversarial Networks

[NEW!] Another unofficial demo is available!

[NOTICE] Our web demo will be closing soon. Enjoy the last days!

[NEW!] Time to play with our interactive web demo!

Numerous task-specific variants of conditional generative adversarial networks have been developed for image completion. Yet, a serious limitation remains that all existing algorithms tend to fail when handling large-scale missing regions. To overcome this challenge, we propose a generic new approach that bridges the gap between image-conditional and recent modulated unconditional generative architectures via co-modulation of both conditional and stochastic style representations. Also, due to the lack of good quantitative metrics for image completion, we propose the new Paired/Unpaired Inception Discriminative Score (P-IDS/U-IDS), which robustly measures the perceptual fidelity of inpainted images compared to real images via linear separability in a feature space. Experiments demonstrate superior performance in terms of both quality and diversity over state-of-the-art methods in free-form image completion and easy generalization to image-to-image translation.
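As a rough illustration of the linear-separability idea (this is a sketch, not the repo's evaluation code), P-IDS and U-IDS could be computed as follows, assuming paired Inception feature matrices feat_real and feat_fake of shape (N, D) extracted from real and inpainted images:

import numpy as np
from sklearn.svm import LinearSVC

def p_ids_u_ids(feat_real, feat_fake):
    # Fit a linear classifier to separate real from inpainted features.
    X = np.concatenate([feat_real, feat_fake])
    y = np.concatenate([np.ones(len(feat_real)),    # 1 = real
                        np.zeros(len(feat_fake))])  # 0 = fake
    svm = LinearSVC().fit(X, y)
    # Signed distance to the decision hyperplane acts as a "realness" score.
    s_real = svm.decision_function(feat_real)
    s_fake = svm.decision_function(feat_fake)
    # P-IDS: fraction of pairs where the inpainted image is scored as
    # more real than its paired ground truth.
    p_ids = np.mean(s_fake > s_real)
    # U-IDS: misclassification rate of the linear classifier.
    u_ids = 0.5 * (np.mean(s_real < 0) + np.mean(s_fake > 0))
    return p_ids, u_ids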

Large Scale Image Completion via Co-Modulated Generative Adversarial Networks
Shengyu Zhao, Jonathan Cui, Yilun Sheng, Yue Dong, Xiao Liang, Eric I Chang, Yan Xu
Tsinghua University and Microsoft Research
arXiv | OpenReview

Overview

This repo is built upon, and has the same dependencies as, the official StyleGAN2 repo. We also provide a Dockerfile for Docker users (a usage sketch follows the list below). This repo currently supports:

  • Large scale image completion experiments on FFHQ and Places2
  • Image-to-image translation experiments on Edges2Shoes and Edges2Handbags
  • Image-to-image translation experiments on COCO-Stuff
  • Evaluation code of Paired/Unpaired Inception Discriminative Score (P-IDS/U-IDS)
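If you prefer Docker, a typical workflow with the provided Dockerfile might look like the following sketch; the image tag co-mod-gan and the mounted path are arbitrary choices, and GPU passthrough assumes the NVIDIA Container Toolkit is installed:

docker build -t co-mod-gan .
docker run --gpus all -it -v /path/to/data:/data co-mod-gan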

Datasets

  • FFHQ dataset (in TFRecords format) can be downloaded following the StyleGAN2 repo.
  • Places2 dataset can be downloaded from this website (Places365-Challenge 2016 high-resolution images, training set and validation set). The raw images should be converted into TFRecords using dataset_tools/create_from_images.py with --shuffle --compressed.
  • Edges2Shoes and Edges2Handbags datasets can be downloaded following the pix2pix repo. The raw images should be converted into TFRecords using dataset_tools/create_from_images.py with --shuffle --pix2pix.
  • To prepare a custom dataset, please use dataset_tools/create_from_images.py, which will automatically center-crop and resize your images to the specified resolution; an example invocation follows this list. You only need to specify --val-image-dir for testing purposes.
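For example, a conversion run might look like the following. The --train-image-dir and --tfrecord-dir flag names here are assumptions for illustration (only --shuffle, --compressed/--pix2pix, and --val-image-dir are documented above); check the script's --help for the exact interface:

python dataset_tools/create_from_images.py --train-image-dir=TRAIN_DIR --val-image-dir=VAL_DIR --tfrecord-dir=DATA_DIR/DATASET --shuffle --compressed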

Training

The following script is for training on FFHQ; it holds out 10k images for validation. We recommend using 8 NVIDIA Tesla V100 GPUs. Training at 512x512 resolution takes about one week.

python run_training.py --data-dir=DATA_DIR --dataset=DATASET --metrics=ids10k --mirror-augment --num-gpus=8

The following script is for training on Places2 at resolution 512x512 (the resolution must be specified when training on a compressed dataset), which has a validation set of 36,500 images:

python run_training.py --data-dir=DATA_DIR --dataset=DATASET --resolution=512 --metrics=ids36k5 --total-kimg 50000 --num-gpus=8

The following script is for training on Edges2Handbags (and similarly for Edges2Shoes):

python run_training.py --data-dir=DATA_DIR --dataset=DATASET --metrics=fid200-rt-handbags --mirror-augment --num-gpus=8

Pre-Trained Models

Our pre-trained models are available on Google Drive:

Model name & URL                        Description
co-mod-gan-ffhq-9-025000.pkl            Large scale image completion on FFHQ (512x512)
co-mod-gan-ffhq-10-025000.pkl           Large scale image completion on FFHQ (1024x1024)
co-mod-gan-places2-050000.pkl           Large scale image completion on Places2 (512x512)
co-mod-gan-coco-stuff-025000.pkl        Image-to-image translation on COCO-Stuff (labels to photos) (512x512)
co-mod-gan-edges2shoes-025000.pkl       Image-to-image translation on edges2shoes (256x256)
co-mod-gan-edges2handbags-025000.pkl    Image-to-image translation on edges2handbags (256x256)

Use the following script to run the interactive demo locally:

python run_demo.py -d DATA_DIR/DATASET -c CHECKPOINT_FILE(S)

or use the following command as a minimal example:

python run_generator.py -c CHECKPOINT_FILE -i imgs/example_image.jpg -m imgs/example_mask.jpg -o imgs/example_output.jpg
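Since the codebase follows StyleGAN2 conventions, loading a checkpoint programmatically might look roughly like the sketch below. The (G, D, Gs) pickle layout and the Gs.run() argument order (latent, labels, image, mask) are assumptions inferred from that lineage, not a documented API; run_generator.py is the authoritative reference:

import pickle
import numpy as np
import dnnlib.tflib as tflib

tflib.init_tf()
with open('co-mod-gan-ffhq-9-025000.pkl', 'rb') as f:
    _G, _D, Gs = pickle.load(f)  # StyleGAN2-style (G, D, Gs) tuple (assumed)

# real: float32 image batch in NCHW; mask: binary batch marking known pixels
# (both conventions assumed; see run_generator.py for the actual preprocessing).
latent = np.random.randn(1, *Gs.input_shape[1:])
fake = Gs.run(latent, None, real, mask, truncation_psi=None)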

Evaluation

The following script is for evaluation:

python run_metrics.py --data-dir=DATA_DIR --dataset=DATASET --network=CHECKPOINT_FILE(S) --metrics=METRIC(S) --num-gpus=1

Commonly used metrics are ids10k and ids36k5 (for FFHQ and Places2, respectively), which compute P-IDS and U-IDS together with FID. By default, masks are generated randomly for evaluation; alternatively, append -h0 ([0.0, 0.2]) through -h4 ([0.8, 1.0]) to the metric name to restrict the range of the masked ratio, as in the example below.
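For instance, to evaluate P-IDS/U-IDS and FID on FFHQ restricted to mask ratios in [0.8, 1.0]:

python run_metrics.py --data-dir=DATA_DIR --dataset=DATASET --network=CHECKPOINT_FILE(S) --metrics=ids10k-h4 --num-gpus=1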

Citation

If you find this code helpful, please cite our paper:

@inproceedings{zhao2021comodgan,
  title={Large Scale Image Completion via Co-Modulated Generative Adversarial Networks},
  author={Zhao, Shengyu and Cui, Jonathan and Sheng, Yilun and Dong, Yue and Liang, Xiao and Chang, Eric I and Xu, Yan},
  booktitle={International Conference on Learning Representations (ICLR)},
  year={2021}
}
