Perturbed-Attention Guidance, Smoothed Energy Guidance and Sliding Window Guidance for ComfyUI / SD WebUI (Forge/reForge)

Implementation of

Perturbed-Attention Guidance from Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance (D. Ahn et al.)
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention (Susung Hong)
Sliding Window Guidance from The Unreasonable Effectiveness of Guidance for Diffusion Models (Kaiser et al.)

as an extension for ComfyUI and SD WebUI (Forge) / SD WebUI (reForge).

Works with SD1.5 and SDXL.

Installation

ComfyUI

You can either:

git clone https://github.com/pamparamm/sd-perturbed-attention.git into the ComfyUI/custom_nodes/ folder.
Install it via ComfyUI Manager (search for the custom node named "Perturbed-Attention Guidance").
Install it via comfy-cli with comfy node registry-install sd-perturbed-attention.

SD WebUI (Forge/reForge)

git clone https://github.com/pamparamm/sd-perturbed-attention.git into the stable-diffusion-webui-forge/extensions/ folder.

SD WebUI (Auto1111)

For A1111 WebUI, you can instead use the PAG implementation from the sd-webui-incantations extension.

Guidance Nodes/Scripts

ComfyUI

SD WebUI (Forge/reForge)

Note

You can override CFG Scale and PAG Scale/SEG Scale for Hires. fix by opening and enabling the Override for Hires. fix tab. To disable PAG during Hires. fix, set PAG Scale under Override to 0.

Inputs

scale: Guidance scale. Higher values can increase the structural coherence of an image, but can also oversaturate/fry it entirely.
adaptive_scale (PAG only): PAG dampening factor. It penalizes PAG during late denoising stages, resulting in an overall speedup: 0.0 means no penalty and 1.0 completely removes PAG.
blur_sigma (SEG only): Standard deviation of the Gaussian blur. Higher values increase the "clarity" of an image. Negative values set blur_sigma to infinity.
unet_block: Part of the U-Net to which Guidance is applied. The original paper suggests using middle.
unet_block_id: Id of U-Net layer in a selected block to which Guidance is applied. Guidance can be applied only to layers containing Self-attention blocks.
sigma_start / sigma_end: Guidance will be active only between sigma_start and sigma_end. Set both to negative values to disable this feature.
rescale: Acts similarly to the RescaleCFG node: it prevents over-exposure at high scale values. Based on Algorithm 2 from Common Diffusion Noise Schedules and Sample Steps are Flawed (Lin et al.). Set to 0 to disable this feature.
rescale_mode:
- full - takes into account both CFG and Guidance.
- partial - depends only on Guidance.
- snf - Saliency-adaptive Noise Fusion from High-fidelity Person-centric Subject-to-Image Synthesis (Wang et al.). Should increase image quality on high guidance scales. Ignores rescale value.
unet_block_list: Optional input that replaces both unet_block and unet_block_id and lets you select multiple U-Net layers, separated by commas. SDXL U-Net layers have multiple indices, which you can specify with a dot (if no index is specified, Guidance is applied to the whole layer). Example value: m0,u0.4 (applies Guidance to middle block 0 and to output block 0 with index 4).
- In U-Net terms, d means input, m means middle and u means output.
- SD1.5 U-Net has layers d0-d5, m0, u0-u8.
- SDXL U-Net has layers d0-d3, m0, u0-u5. In addition, each block except d0 and d1 has 0-9 index values (like m0.7 or u0.4). d0 and d1 have 0-1 index values.
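
The unet_block_list syntax described above can be illustrated with a small parser. This is only a sketch of the format as documented here, not the extension's actual code: the function name parse_unet_block_list, the tuple layout, and the error handling are all assumptions made for illustration, and the sketch does not check layer or index ranges for a particular model.

```python
import re
from typing import List, Optional, Tuple

# Letter prefixes as described in the list above.
BLOCK_NAMES = {"d": "input", "m": "middle", "u": "output"}

# One entry looks like "m0" or "u0.4": letter, layer number, optional ".index".
_ENTRY = re.compile(r"^([dmu])(\d+)(?:\.(\d+))?$")


def parse_unet_block_list(value: str) -> List[Tuple[str, int, Optional[int]]]:
    """Hypothetical helper: turn "m0,u0.4" into (block, layer, index) tuples.

    An index of None means no dot index was given, i.e. the whole layer.
    """
    parsed = []
    for raw in value.split(","):
        entry = raw.strip()
        if not entry:
            continue  # tolerate trailing commas and blank entries
        match = _ENTRY.match(entry)
        if match is None:
            raise ValueError(f"Unrecognized U-Net block entry: {entry!r}")
        letter, layer, index = match.groups()
        parsed.append(
            (BLOCK_NAMES[letter], int(layer), int(index) if index is not None else None)
        )
    return parsed


print(parse_unet_block_list("m0,u0.4"))
# [('middle', 0, None), ('output', 0, 4)]
```

Here m0 carries no dot index, so it maps to the whole middle layer 0, while u0.4 selects index 4 within output layer 0.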

ComfyUI TensorRT PAG (Experimental)

To use PAG together with ComfyUI_TensorRT, you'll need to:

Have 24GB of VRAM.
Build a static/dynamic TRT engine of the desired model.
Build a static/dynamic TRT engine of the same model with the same TRT parameters, but with fixed PAG injection in the selected U-Net blocks (TensorRT Attach PAG node).
Use the TensorRT Perturbed-Attention Guidance node with two model inputs: one for the base engine and one for the PAG engine.
