What do all the settings do?

This document hopes to explain what the various settings are. Some of them we're still figuring out. :) Note that a few of the settings can be randomly chosen -- see the section below for details.

Setting name	Default in settings.json	Explanation
batch_name	"Default"	The directory within images_out to store your results
text_prompts	"The Big Sur Coast, by Asher Brown Durand, featured on ArtStation."	The phrase(s) to use for generating an image
n_batches	1	How many images to generate
steps	250	Generally, the more steps you run, the more detailed the results, however the returns start to diminish after 350
display_rate	50	How often (in steps) to update the progress.png image
width	832	Image output width in pixels - must be a multiple of 64
height	512	Image output height in pixels - must be a multiple of 64
set_seed	"random_seed"	If set to random_seed it will generate a new seed. Replace this with a specific number to elimate randomness in the start. Additional images in a batch are always the seed from the previous image - 1
image_prompts	{}	For using images instead of words for prompts. Specifiy the file name of the init image.
clip_guidance_scale	"auto"	Controls how much the image should look like the prompt. Affected by resolution, so "auto" will try to calculate a good value.
tv_scale	0	Controls the smoothness of the final output. tests have shown minimal impact when changing this.
range_scale	150	Controls how far out of range RGB values are allowed to be.
sat_scale	0	Controls how much saturation is allowed.
cutn_batches	4	Lowering this number can reduce how much memory is needed, however note that cutn itself is hard set at 16
max_frames	10000	No idea
interp_spline	"Linear"	Do not change, currently will not look good.
init_image	null	The starting image to use. Usuallly leave this blank and it will start with randomness
init_scale	1000	This enhances the effect of the init image, a good value is 1000
skip_steps	0	How many steps in the overall process to skip. Generally leave this at 0, though if using an init_image it is recommended to be 50% of overall steps
frames_scale	1500	Tries to guide the new frame to looking like the old one. A good default is 1500.
frames_skip_steps	"60%"	Will blur the previous frame - higher values will flicker less
perlin_init	false	Option to start with random perlin noise
perlin_mode	"mixed"	Other options are "grey" or "color", what they do I'm not sure
skip_augs	false	Controls whether to skip torchvision augmentations
randomize_class	true	Controls whether the imagenet class is randomly changed each iteration
clip_denoised	false	Determines whether CLIP discriminates a noisy or denoised image
clamp_grad	true	Experimental: Using adaptive clip grad in the cond_fn
clamp_max	"auto"	Lower values (0.01) can help keep colors muted. Higher values (0.25) allow for more vibrancy. However it is affected by steps, so "auto" will try to calculate a good value.
fuzzy_prompt	false	Controls whether to add multiple noisy prompts to the prompt losses
rand_mag	0.05	Controls how far it can stray from your prompt - not used unless either fuzzy_prompt is true, or an init image is used
eta	"auto"	Has to do with how much the generator can stray from your prompt. Affected by steps, so "auto" will calculate a good value.
diffusion_model	"512x512_diffusion_uncond_finetune_008100",
use_secondary_model	true	Reduces memory and improves speed, potentially at a loss of quality
diffusion_steps	1000	Note: The code seems to calculate this no matter what you put in, so might as well leave it
sampling_mode	"plms"	Options are "plms" or "ddim" - plms can reach a nice image in fewer steps, but may not look as good as ddim.
ViTB32	true	Enable or disable the VitB32 CLIP model. Low memory, low accuracy
ViTB16	true	Enable or disable the VitB16 CLIP model. Med memory, high accuracy
ViTL14	false	Enable or disable the VitB32 CLIP model. Very high memory, very high accuracy
RN101	false	Enable or disable the VitB32 CLIP model. Low memory, low accuracy
RN50	true	Enable or disable the VitB32 CLIP model. Med memory, med accuracy
RN50x4	false	Enable or disable the VitB32 CLIP model. High memory, high accuracy
RN50x16	false	Enable or disable the VitB32 CLIP model. Very high memory, high accuracy
RN50x64	false	Enable or disable the VitB32 CLIP model. Extremely high memory, unknown accuracy
cut_overview	"[12]400+[4]600"	How many "big picture" passes to do. More towards the start, less later, is the general idea
cut_innercut	"[4]400+[12]600"	Conversely, how many detail passes to do. Fewer at the start, then get more detailed
cut_ic_pow	1	Anyone? Beuller?
cut_icgray_p	"[0.2]400+[0]600"	Anyone? Beuller?
key_frames	true	Animation stuff...
angle	"0:(0)"	Animation stuff...
zoom	"0: (1), 10: (1.05)"	Animation stuff...
translation_x	"0: (0)"	Animation stuff...
translation_y	"0: (0)"	Animation stuff...
video_init_path	"/content/training.mp4"	Animation stuff...
extract_nth_frame	2	Animation stuff...
intermediate_saves	0	Save in progress. A value of `2` will save a copy at 33% and 66%. 0 will save none. A value of `[5, 9, 34, 45]` will save at steps 5, 9, 34, and 45. (Make sure to include the brackets)

Randomizable settings

The following settings can be set to "random" (with the quotes), which will tell the code to pick a random value within their expected boundaries:

clip_guidance_scale tv_scale range_scale sat_scale clamp_max rand_mag eta cut_ic_pow

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SETTINGS.md

SETTINGS.md

What do all the settings do?

Randomizable settings

Files

SETTINGS.md

Latest commit

History

SETTINGS.md

File metadata and controls

What do all the settings do?

Randomizable settings