HasnainRaz/Fast-SRGAN

A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps


The goal of this repository is to enable real time super resolution for upsampling low resolution videos. Currently, the design follows the SR-GAN architecture. For speed, the upsampling is done through pixel shuffle.
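For reference, and assuming a PyTorch implementation, a pixel-shuffle upsampling step can be sketched as follows. This is an illustrative block, not the exact layer definition used in this repository.

import torch
import torch.nn as nn

class PixelShuffleUpsample(nn.Module):
    """2x upsampling: a conv expands the channels 4x, then PixelShuffle
    rearranges those channels into a spatial grid twice as large."""

    def __init__(self, channels: int = 64):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels * 4, kernel_size=3, padding=1)
        self.shuffle = nn.PixelShuffle(upscale_factor=2)
        self.act = nn.PReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.shuffle(self.conv(x)))

# Two such blocks give 4x upscaling: a 90x160 feature map becomes 360x640.
x = torch.randn(1, 64, 90, 160)
up = nn.Sequential(PixelShuffleUpsample(), PixelShuffleUpsample())
print(up(x).shape)  # torch.Size([1, 64, 360, 640])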

The training setup is illustrated by the diagram in the repository.

Speed Benchmarks

The following runtimes/FPS were obtained by averaging over 800 frames, measured on MPS (MacBook M1 Pro GPU).

| Input Image Size | Output Size     | Time (s) | FPS |
|------------------|-----------------|----------|-----|
| 90x160           | 360x640 (360p)  | 0.01     | 82  |
| 180x320          | 720x1280 (720p) | 0.04     | 27  |

We see it's possible to upsample to 720p at around 30fps.
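The table above is the repository's own measurement. A minimal sketch of how such a per-frame timing could be reproduced on MPS is shown below; the Upsample module is only a stand-in for the real generator, which you would load from the 'models' directory.

import time
import torch
import torch.nn as nn

device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

# Stand-in for the pretrained 4x generator; substitute the real model here.
model = nn.Upsample(scale_factor=4, mode="nearest").to(device).eval()

frame = torch.randn(1, 3, 180, 320, device=device)  # one 180x320 input frame
n_frames = 800

with torch.no_grad():
    start = time.perf_counter()
    for _ in range(n_frames):
        _ = model(frame)
    if device.type == "mps":
        torch.mps.synchronize()  # wait for queued GPU work before stopping the clock
    per_frame = (time.perf_counter() - start) / n_frames

print(f"{per_frame:.3f} s/frame, {1.0 / per_frame:.1f} fps")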

Requirements

This was tested on Python 3.10. To install the required packages, use the provided Pipfile:

pip install pipenv --upgrade
pipenv install --system --deploy

Pre-trained Model

A generator pretrained on the DIV2K dataset is provided in the 'models' directory. It uses 8 residual blocks, with 64 filters in every layer of the generator.

To try out the provided pretrained model on your own images, run the following:

python inference.py --image_dir 'path/to/your/image/directory' --output_dir 'path/to/save/super/resolution/images'
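Conceptually, directory-based inference is a loop of the shape sketched below. This is not the repository's inference.py: the Upsample module stands in for the pretrained generator, and the file pattern and value range are assumptions.

from pathlib import Path

import torch
import torch.nn as nn
from PIL import Image
from torchvision.transforms.functional import to_pil_image, to_tensor

# Stand-in for the pretrained 4x generator from the 'models' directory.
generator = nn.Upsample(scale_factor=4, mode="bicubic").eval()

image_dir = Path("path/to/your/image/directory")
output_dir = Path("path/to/save/super/resolution/images")
output_dir.mkdir(parents=True, exist_ok=True)

with torch.no_grad():
    for path in sorted(image_dir.glob("*.png")):
        lr = to_tensor(Image.open(path).convert("RGB")).unsqueeze(0)  # 1xCxHxW in [0, 1]
        sr = generator(lr).clamp(0, 1).squeeze(0)                     # 4x larger output
        to_pil_image(sr).save(output_dir / path.name)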

Training

To train, simply edit the config file configs/config.yaml with your settings, and then launch training with:

python train.py

You can also change the config parameters from the command line. The following will run training with a batch_size of 32, a generator with 12 residual blocks, and /path/to/image/dataset as the image directory.

python train.py data.image_dir="/path/to/image/dataset" training.batch_size=32 generator.n_layers=12

This is powered by Hydra, which means all the parameters in the config are editable via the CLI.
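For orientation, a Hydra entry point generally has the shape below; the config keys mirror the override example above, but the actual train.py may be organised differently.

import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="configs", config_name="config", version_base=None)
def main(cfg: DictConfig) -> None:
    # Every key in configs/config.yaml can be overridden from the CLI, e.g.
    # python train.py training.batch_size=32 generator.n_layers=12
    print(OmegaConf.to_yaml(cfg))
    print(cfg.data.image_dir, cfg.training.batch_size, cfg.generator.n_layers)

if __name__ == "__main__":
    main()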

Model checkpoints and TensorBoard training summaries are saved to the outputs directory that is created when you start training. To monitor training progress, point TensorBoard at that directory.
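For example, assuming the default output location:

tensorboard --logdir outputs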

Samples

The following are some results from the provided pretrained model. Left: the low resolution image after 4x bicubic upsampling. Middle: the output of the model. Right: the actual high resolution image.

The images below compare 4x bicubic interpolation, the output of the pretrained model from this repository, and the original high resolution image.

Contributing

If you have ideas for improving model performance, adding metrics, or making any other changes, please open a pull request or an issue. I'd be happy to accept any contributions.

