BatchNorm doesn't work as expected #2650
Asked by wangjiawen2013 in Q&A
Answered by laggui on Jan 2, 2025
The result is not wrong 🙂

The difference lies in the training vs inference computation for a BatchNorm module. If you use `m.eval()` for the PyTorch module instead, you should get equivalent results.

With Burn you have to be explicit when using autodiff. In PyTorch it's kind of the opposite: it tracks the gradients and keeps the autodiff graph by default unless you use the `with torch.no_grad()` context.

This is also explained in the autodiff section.
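As a rough sketch of both points on the PyTorch side (the module name `m`, the feature count, and the input shape are just placeholders, not from the original question), the snippet below contrasts the batch statistics used in training mode with the running statistics used after `m.eval()`, and shows `torch.no_grad()` turning off graph tracking:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
m = nn.BatchNorm1d(4)   # hypothetical module with 4 features
x = torch.randn(8, 4)   # hypothetical batch of 8 samples

# Training mode (the default): normalizes with the current batch's
# mean/variance and updates the running statistics.
y_train = m(x)

# Inference mode: normalizes with the stored running statistics instead,
# which is what an inference-only forward pass in Burn corresponds to.
m.eval()
y_eval = m(x)
print(torch.allclose(y_train, y_eval))  # False in general: different statistics

# PyTorch tracks gradients by default; no_grad() skips building the graph,
# similar to running Burn without an autodiff backend.
with torch.no_grad():
    y_no_grad = m(x)
print(y_eval.requires_grad, y_no_grad.requires_grad)  # True False
```

On the Burn side the same distinction is explicit: a plain backend never builds an autodiff graph, and gradients are only tracked when you opt into an autodiff backend.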