You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It mentions that using rescale_betas_zero_snr with v-prediction might be more scientifically sound. However, rescale_betas_zero_snr with e-prediction should also work. Yet, when rescale_betas_zero_snr is True, the calculated snr is 0 at timestep 999.
I think e-prediction, snr_gamma, and rescale_betas_zero_snr should be compatible options that can work together. I don’t have much knowledge about how these work, but maybe adding a small epsilon when calculating snr could solve the problem, though I’m not sure if it’s reasonable. Could someone offer some advice?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I'm using SDXL training script.
I read the document at
https://huggingface.co/docs/diffusers/api/schedulers/ddim.
It mentions that using
rescale_betas_zero_snr
withv-prediction
might be more scientifically sound. However,rescale_betas_zero_snr
withe-prediction
should also work. Yet, whenrescale_betas_zero_snr
isTrue
, the calculated snr is 0 at timestep 999.And when
snr_gamma
is enabled, calculating the loss requires dividing by snr, which triggers a division by zero error, causing the loss to becomeNaN
. huggingface/diffusers@6dfaec3/examples/text_to_image/train_text_to_image_sdxl.py#L1147I think
e-prediction
,snr_gamma
, and rescale_betas_zero_snr
should be compatible options that can work together. I don’t have much knowledge about how these work, but maybe adding a small epsilon when calculating snr could solve the problem, though I’m not sure if it’s reasonable. Could someone offer some advice?Beta Was this translation helpful? Give feedback.
All reactions