PITI: Pretraining is All You Need for Image-to-Image Translation


Official PyTorch implementation

Pretraining is All You Need for Image-to-Image Translation
Tengfei Wang, Ting Zhang, Bo Zhang, Hao Ouyang, Dong Chen, Qifeng Chen, Fang Wen
2022

paper | project website | video | online demo

Introduction

We present a simple and universal framework that brings the power of pretraining to various image-to-image translation tasks. You may try our online demo if interested.

Diverse samples synthesized by our approach.

Set up

Installation

git clone https://github.com/PITI-Synthesis/PITI.git
cd PITI

Environment

conda env create -f environment.yml

Quick Start

Pretrained Models

Please download our pre-trained models for both the Base model and the Upsample model, and put them in ./ckpt.

| Model | Task | Dataset |
| --- | --- | --- |
| Base-64x64 | Mask-to-Image | Trained on COCO. |
| Upsample-64-256 | Mask-to-Image | Trained on COCO. |
| Base-64x64 | Sketch-to-Image | Trained on COCO. |
| Upsample-64-256 | Sketch-to-Image | Trained on COCO. |

If you cannot access these links, you may alternatively find our pretrained models here.

Prepare Images

We put some example images in ./test_imgs, and you can quickly try them.

COCO

For the COCO dataset, download the images and annotations from the COCO webpage.

For mask-to-image synthesis, we use semantic maps in RGB format as inputs. To obtain such semantic maps, run ./preprocess/preprocess_mask.py (an example of the raw mask and the processed mask is given in preprocess/example). Note that, unlike previous works, we do not need instance masks.
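The actual conversion is implemented in ./preprocess/preprocess_mask.py. Purely as an illustration of the idea — mapping each integer category id in a label mask to a fixed RGB color via a palette — here is a minimal sketch; the function name and palette are hypothetical and may not match what the repo's script does:

```python
import numpy as np

def label_mask_to_rgb(mask: np.ndarray, num_classes: int = 183) -> np.ndarray:
    """Map an (H, W) integer label mask to an (H, W, 3) RGB semantic map.

    A deterministic palette is generated here for illustration; the actual
    colors used by preprocess_mask.py may differ.
    """
    rng = np.random.RandomState(0)  # fixed seed -> stable palette across runs
    palette = rng.randint(0, 256, size=(num_classes, 3), dtype=np.uint8)
    return palette[mask]            # fancy indexing assigns a color per pixel

# Example: a tiny 2x2 mask with two categories.
mask = np.array([[0, 1], [1, 0]], dtype=np.int64)
rgb = label_mask_to_rgb(mask)
```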

For sketch-to-image synthesis, we use sketch maps extracted by HED as inputs. To obtain such sketch maps, run ./preprocess/preprocess_sketch.py.
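preprocess_sketch.py relies on a pretrained HED network. Only to illustrate what an edge map is, here is a lightweight gradient-magnitude stand-in in NumPy — this is not the repo's method, and real HED sketches are far cleaner:

```python
import numpy as np

def simple_edge_map(gray: np.ndarray, threshold: float = 0.2) -> np.ndarray:
    """Binary edge map from finite-difference gradient magnitude.

    `gray` is a float image in [0, 1]. This toy detector only marks sharp
    intensity steps; HED learns perceptual contours instead.
    """
    gx = np.zeros_like(gray)
    gy = np.zeros_like(gray)
    gx[:, 1:] = np.diff(gray, axis=1)   # horizontal gradient
    gy[1:, :] = np.diff(gray, axis=0)   # vertical gradient
    mag = np.sqrt(gx ** 2 + gy ** 2)
    return (mag > threshold).astype(np.uint8) * 255

# A vertical step edge should be detected along the boundary column.
img = np.zeros((4, 4))
img[:, 2:] = 1.0
edges = simple_edge_map(img)
```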

Inference

Interactive Inference

Run the following script; it will launch an interactive GUI built with Gradio, where you can upload input masks or sketches and generate images.

pip install gradio
python inference.py

Batch Inference

Modify sample.sh according to the following instructions, and run:

bash sample.sh
| Args | Description |
| --- | --- |
| --model_path | the path of the checkpoint for the base model. |
| --sr_model_path | the path of the checkpoint for the upsample model. |
| --val_data_dir | the path of a txt file that contains the paths of the input images. |
| --num_samples | the number of images that you want to sample. |
| --sample_c | the strength of classifier-free guidance. |
| --mode | the input type. |
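Based on the description above, --val_data_dir should point at a txt file listing image paths, one per line (our reading of the format). A small helper to generate such a file — the function name is ours, not part of the repo:

```python
from pathlib import Path

def write_file_list(image_dir: str, out_txt: str,
                    exts=(".png", ".jpg", ".jpeg")) -> int:
    """Write the path of every image under `image_dir` to `out_txt`,
    one path per line, and return how many were found."""
    paths = sorted(p for p in Path(image_dir).rglob("*")
                   if p.suffix.lower() in exts)
    Path(out_txt).write_text("\n".join(str(p) for p in paths) + "\n")
    return len(paths)

# Usage: write_file_list("./test_imgs", "val_list.txt")
```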

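--sample_c controls the strength of classifier-free guidance. In the common formulation (a sketch of the standard technique; the repo's exact implementation may differ), the model's unconditional and conditional noise predictions are blended and the condition is amplified for c > 1:

```python
import numpy as np

def classifier_free_guidance(eps_uncond: np.ndarray,
                             eps_cond: np.ndarray,
                             c: float) -> np.ndarray:
    """Blend unconditional and conditional noise predictions.

    c = 0 ignores the condition, c = 1 is plain conditional sampling,
    and c > 1 extrapolates past the conditional prediction, trading
    sample diversity for fidelity to the input mask/sketch.
    """
    return eps_uncond + c * (eps_cond - eps_uncond)

eps_u = np.array([0.0, 0.0])
eps_c = np.array([1.0, 2.0])
guided = classifier_free_guidance(eps_u, eps_c, c=2.0)
```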
Training

Preparation

  1. Download and preprocess datasets. For the COCO dataset, download the images and annotations from the COCO webpage. Run ./preprocess/preprocess_mask.py or ./preprocess/preprocess_sketch.py.
  2. Download pretrained models by python preprocess/download.py.

Start Training

Taking mask-to-image synthesis as an example (sketch-to-image is the same):

Finetune the Base Model

Modify mask_finetune_base.sh and run:

bash mask_finetune_base.sh

Finetune the Upsample Model

Modify mask_finetune_upsample.sh and run:

bash mask_finetune_upsample.sh

Citation

If you find this work useful for your research, please cite:

@article{wang2022pretraining,
  title   = {Pretraining is All You Need for Image-to-Image Translation},
  author  = {Wang, Tengfei and Zhang, Ting and Zhang, Bo and Ouyang, Hao and Chen, Dong and Chen, Qifeng and Wen, Fang},
  journal = {arXiv:2205.12952},
  year    = {2022},
}

Acknowledgement

Thanks to GLIDE for sharing their code.
