- Notifications
You must be signed in to change notification settings - Fork5
A High Quality (HD / 2K / 4K) Image Generation Using Stable Diffusion and Real-ESR / SwinIR /GFPGAN
License
HemantKArya/HqStableDiffusionColab
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Generate 4K and FULL HD Images and Artworks for Free Using Stable Diffusion.
For Upscale Only goto RealESR Notebook (4K Upscale)
Run All the cell until you Reach your Prompt cell.In case if you have any human face in your images then it will restore Distorted figures(like eyes,nose,etc) in images, here is example.In Stable Diffusin her Eyes and Lips are bit distorted.
To upscale images to 2K or 4k using Real-ESR GAN. Note that after running Reasl-ESRGAN leave SwinIR until unless you are not satisfied with RealESR results.after running this cell you will get a comparison matrix like this.
Input Images --> Upscaled Images(Real-ESR)
After Upscaling you images using Real-ESRGAN rest of the cell are optional to run and not recommended (Cause limited GPU RAM in Colab, After running these cell may be it will show you error likecuda out of memory
) to run until you are not satisfied with result of Upscaled images of Real-ESR.right Now I am going to show you difference b/w both Upscalers.Using both Optional cell at the last of notebook. (It may full your current colab RAM)
Input Images ------ Upscaled Images(SwinIR) ----- Upscaled Images(RealESRGAN)
Visit Logical Spot for Video Help:-
...using🧨diffusers
Stable Diffusion is a text-to-image latent diffusion model created by the researchers and engineers fromCompVis,Stability AI andLAION. It's trained on 512x512 images from a subset of theLAION-5B database. This model uses a frozen CLIP ViT-L/14 text encoder to condition the model on text prompts. With its 860M UNet and 123M text encoder, the model is relatively lightweight and runs on a GPU with at least 10GB VRAM.See themodel card for more information.This Colab notebook shows how to use Stable Diffusion with the 🤗 Hugging Face🧨 Diffusers library .https://github.com/CompVis/stable-diffusion
orignal-link to colabhttps://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/stable_diffusion.ipynb
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.We extend the powerful ESRGAN to a practical restoration application (namely, Real-ESRGAN), which is trained with pure synthetic data.
SwinIR achieves state-of-the-art performance on six tasks: image super-resolution (including classical, lightweight and real-world image super-resolution), image denoising (including grayscale and color image denoising) and JPEG compression artifact reduction. See ourpaper andproject page for detailed results.
GFPGAN is a blind face restoration algorithm towards real-world face images.
It leverages the generative face prior in a pre-trained GAN (e.g., StyleGAN2) to restore realistic faces while precerving fidelity.
About
A High Quality (HD / 2K / 4K) Image Generation Using Stable Diffusion and Real-ESR / SwinIR /GFPGAN
Topics
Resources
License
Uh oh!
There was an error while loading.Please reload this page.