This repository was archived by the owner on Jul 16, 2025. It is now read-only.

google/prompt-to-promptPublic archive

NotificationsYou must be signed in to change notification settings
Fork321
Star3.4k

License

Apache-2.0 license

3.4k stars 321 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
docs		docs
example_images		example_images
DDS_zeroshot.ipynb		DDS_zeroshot.ipynb
LICENSE		LICENSE
README.md		README.md
contributing.md		contributing.md
null_text_w_ptp.ipynb		null_text_w_ptp.ipynb
prompt-to-prompt_ldm.ipynb		prompt-to-prompt_ldm.ipynb
prompt-to-prompt_stable.ipynb		prompt-to-prompt_stable.ipynb
ptp_utils.py		ptp_utils.py
requirements.txt		requirements.txt
seq_aligner.py		seq_aligner.py

Repository files navigation

Prompt-to-Prompt

Latent Diffusion andStable Diffusion Implementation

Project Page Paper

Setup

This code was tested with Python 3.8,Pytorch 1.11 using pre-trained models throughhuggingface / diffusers.Specifically, we implemented our method overLatent Diffusion andStable Diffusion.Additional required packages are listed in the requirements file.The code was tested on a Tesla V100 16GB but should work on other cards with at least12GB VRAM.

Quickstart

In order to get started, we recommend taking a look at our notebooks:prompt-to-prompt_ldm andprompt-to-prompt_stable. The notebooks contain end-to-end examples of usage of prompt-to-prompt on top ofLatent Diffusion andStable Diffusion respectively. Take a look at these notebooks to learn how to use the different types of prompt edits and understand the API.

Prompt Edits

In our notebooks, we perform our main logic by implementing the abstract classAttentionControl object, of the following form:

classAttentionControl(abc.ABC):@abc.abstractmethoddefforward (self,attn,is_cross:bool,place_in_unet:str):raiseNotImplementedError

Theforward method is called in each attention layer of the diffusion model during the image generation, and we use it to modify the weights of the attention. Our method (See Section 3 of ourpaper) edits images with the procedure above, and each different prompt edit type modifies the weights of the attention in a different manner.

The general flow of our code is as follows, with variations based on the attention control type:

prompts= ["A painting of a squirrel eating a burger", ...]controller=AttentionControl(prompts, ...)run_and_display(prompts,controller, ...)

Replacement

In this case, the user swaps tokens of the original prompt with others, e.g., the editing the prompt"A painting of a squirrel eating a burger" to"A painting of a squirrel eating a lasagna" or"A painting of a lion eating a burger". For this we define the classAttentionReplace.

Refinement

In this case, the user adds new tokens to the prompt, e.g., editing the prompt"A painting of a squirrel eating a burger" to"A watercolor painting of a squirrel eating a burger". For this we define the classAttentionEditRefine.

Re-weight

In this case, the user changes the weight of certain tokens in the prompt, e.g., for the prompt"A photo of a poppy field at night", strengthen or weaken the extent to which the wordnight affects the resulting image. For this we define the classAttentionReweight.

Attention Control Options

cross_replace_steps: specifies the fraction of steps to edit the cross attention maps. Can also be set to a dictionary[str:float] which specifies fractions for different words in the prompt.
self_replace_steps: specifies the fraction of steps to replace the self attention maps.
local_blend (optional):LocalBlend object which is used to make local edits.LocalBlend is initialized with the words from each prompt that correspond with the region in the image we want to edit.
equalizer: used for attention Re-weighting only. A vector of coefficients to multiply each cross-attention weight

Citation

@article{hertz2022prompt,title ={Prompt-to-Prompt Image Editing with Cross Attention Control},author ={Hertz, Amir and Mokady, Ron and Tenenbaum, Jay and Aberman, Kfir and Pritch, Yael and Cohen-Or, Daniel},journal ={arXiv preprint arXiv:2208.01626},year ={2022},}

Null-Text Inversion for Editing Real Images

Project Page Paper

Null-text inversion enables intuitive text-based editing ofreal images with the Stable Diffusion model. We use an initial DDIM inversion as an anchor for our optimization which only tunes the null-text embedding used in classifier-free guidance.

Editing Real Images

Prompt-to-Prompt editing of real images by first using Null-text inversion is provided in thisNotebooke.

@article{mokady2022null,title={Null-text Inversion for Editing Real Images using Guided Diffusion Models},author={Mokady, Ron and Hertz, Amir and Aberman, Kfir and Pritch, Yael and Cohen-Or, Daniel},journal={arXiv preprint arXiv:2211.09794},year={2022}}

Disclaimer

This is not an officially supported Google product.

About

No description, website, or topics provided.

Resources

Readme

License

Apache-2.0 license

Code of conduct

Contributing

Languages

Jupyter Notebook100.0%

Movatterモバイル変換

License

google/prompt-to-prompt

Folders and files

Latest commit

History

Repository files navigation

Prompt-to-Prompt

Project Page Paper

Setup

Quickstart

Prompt Edits

Replacement

Refinement

Re-weight

Attention Control Options

Citation

Null-Text Inversion for Editing Real Images

Project Page Paper

Editing Real Images

Disclaimer

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors5

Uh oh!

Languages

Packages