I am training a WAN2.2 model with only 512x512 images, which should be relatively fast. I noticed that the GPU usage is consistently around 60%, GPU memory usage is only 10.3 GB System is a Windows 11 ...