Fix: conditional use of GradScaler based on device_type and dtype in train.py #481

BRAINIAC2677 · 2024-05-09T12:54:10Z

Problem:

Use of GradScaler gives AssertionError in train.py while using device = cpu.

Traceback (most recent call last): File "/home/brainiac77/github/neural-network-playground/gpt/train.py", line 305, in <module> scaler.scale(loss).backward() ^^^^^^^^^^^^^^^^^^ File "/home/brainiac77/miniconda3/envs/vision-1/lib/python3.12/site-packages/torch/cuda/amp/grad_scaler.py", line 203, in scale assert outputs.is_cuda or outputs.device.type == "xla" AssertionError

Fix:

Use a conditional to check device_type and dtype and based on that take decision whether to use GradScaler or not.

conditional use of grad scaler based on device in train.py

a0076fb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: conditional use of GradScaler based on device_type and dtype in train.py #481

Fix: conditional use of GradScaler based on device_type and dtype in train.py #481

BRAINIAC2677 commented May 9, 2024

Fix: conditional use of GradScaler based on device_type and dtype in train.py #481

Are you sure you want to change the base?

Fix: conditional use of GradScaler based on device_type and dtype in train.py #481

Conversation

BRAINIAC2677 commented May 9, 2024

Problem:

Fix: