Unofficial implementation of Palette: Image-to-Image Diffusion Models in PyTorch
This is an unofficial implementation of Palette: Image-to-Image Diffusion Models in PyTorch. It is mainly inherited from its super-resolution version, Image-Super-Resolution-via-Iterative-Refinement. The code template is from my other seed project: distributed-pytorch-template.
Some implementation details, with the corresponding paper descriptions:
- We adapted the U-Net architecture used in Guided-Diffusion, which gives a substantial boost to sample quality.
- We used the attention mechanism on low-resolution features (16×16), as in the vanilla DDPM.
- We encode $\gamma$ rather than $t$, as in Palette, and embed it with an affine transformation.
- We fix the variance $\Sigma_\theta(x_t, t)$ to a constant during inference, as described in Palette.
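As a sketch of the $\gamma$-encoding point above: instead of embedding the integer timestep $t$, one can embed the continuous noise level $\gamma$ with a sinusoidal encoding and then pass it through learned affine layers. The function name and dimension below are illustrative, not the repo's actual code:

```python
import math

def gamma_embedding(gamma, dim=64):
    """Sinusoidal embedding of the continuous noise level gamma.

    A hypothetical sketch: in the network, this vector would then be
    transformed by learned affine (linear) layers, as described above.
    """
    half = dim // 2
    freqs = [math.exp(-math.log(10000.0) * i / (half - 1)) for i in range(half)]
    return [math.sin(gamma * f) for f in freqs] + [math.cos(gamma * f) for f in freqs]
```

Because $\gamma$ is continuous, the same embedding works for any noise schedule without retraining a timestep lookup table.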
- Diffusion Model Pipeline
- Train/Test Process
- Save/Load Training State
- Logger/Tensorboard
- Multiple GPU Training (DDP)
- EMA
- Metrics (now for FID, IS)
- Dataset (now for inpainting, uncropping, colorization)
- Google Colab script 🌟 (now for inpainting)
I will try to finish the following tasks in order:
- Inpainting on CelebaHQ 🚀 (Google Colab)
- Inpainting on Places2 with 128×128 centering mask 🚀
The follow-up experiments are uncertain due to lack of time and GPU resources:
- Uncropping on Places2
- Colorization on ImageNet val set
The DDPM model requires significant computational resources, and we have only built a few example models to validate the ideas in this paper.
Results with 200 epochs and 930K iterations, on the first 100 samples with centering mask and irregular mask.
Results with 16 epochs and 660K iterations, on several picked samples with centering mask.
Results with 8 epochs and 330K iterations, on several picked samples for uncropping.
| Tasks | Dataset | EMA | FID (↓) | IS (↑) |
|---|---|---|---|---|
| Inpainting with centering mask | Celeba-HQ | False | 5.7873 | 3.0705 |
| Inpainting with irregular mask | Celeba-HQ | False | 5.4026 | 3.1221 |
```
pip install -r requirements.txt
```
| Dataset | Task | Iterations | GPUs×Days×Bs | URL |
|---|---|---|---|---|
| Celeba-HQ | Inpainting | 930K | 2×5×3 | Google Drive |
| Places2 | Inpainting | 660K | 4×8×10 | Google Drive |
Bs indicates the batch size per GPU.
We get most of the datasets from Kaggle, which may differ slightly from the official versions; you can also download them from the official websites.
We use the default division of these datasets for training and evaluation. The file lists we use can be found in Celeba-HQ, Places2.
After you have prepared your own data, you need to modify the corresponding configuration file to point to it. Take the following as an example:
```json
"which_dataset": { // import designated dataset using arguments
    "name": ["data.dataset", "InpaintDataset"], // import Dataset() class
    "args": { // arguments to initialize dataset
        "data_root": "your data path",
        "data_len": -1,
        "mask_mode": "hybrid"
    }
},
```
More choices about dataloader and validation split can also be found in the datasets part of the configuration file.
- Download the checkpoints from given links.
- Set resume_state in the configuration file to the directory of the previous checkpoint. Take the following as an example; this directory contains the training states and the saved model:
```json
"path": { // set every part file path
    "resume_state": "experiments/inpainting_celebahq_220426_150122/checkpoint/100"
},
```
- Set your network label in the load_everything function of model.py; the default is Network. Following the tutorial settings, the optimizers and models will be loaded from 100.state and 100_Network.pth respectively.
```python
netG_label = self.netG.__class__.__name__
self.load_network(network=self.netG, network_label=netG_label, strict=False)
```
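Following the naming convention above, the two checkpoint file paths can be derived from resume_state and the network label. This small helper is purely illustrative (not part of the repo):

```python
def checkpoint_paths(resume_state, network_label="Network"):
    # A resume_state like "experiments/.../checkpoint/100" expands to
    # "100.state" (optimizers/training state) and "100_Network.pth" (weights),
    # matching the loading behavior described above.
    return f"{resume_state}.state", f"{resume_state}_{network_label}.pth"
```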
- Run the script:
```
python run.py -p train -c config/inpainting_celebahq.json
```
We test the U-Net backbones used in SR3 and Guided Diffusion, and the Guided Diffusion one has more robust performance in our current experiments. More choices about backbone, loss, and metric can be found in the which_networks part of the configuration file.
- Modify the configuration file to point to your data following the steps in the Data Prepare part.
- Set your model path following the steps in the Resume Training part.
- Run the script:
```
python run.py -p test -c config/inpainting_celebahq.json
```
Create two folders to save the ground-truth images and the sample images; their file names need to correspond to each other.
Run the script:
```
python eval.py -s [ground image path] -d [sample image path]
```
Our work is based on the following theoretical works:
- Denoising Diffusion Probabilistic Models
- Palette: Image-to-Image Diffusion Models
- Diffusion Models Beat GANs on Image Synthesis
and we benefit a lot from the following projects: