1. Change dtype of scale to dtype of grad in loss_scale.py; 2. Change dtype of weight_decay to dtype of weight in optimizer.py.
pull/12/head
parent 930a1fb0a8
commit 6c03542eec
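
The two dtype changes named in the commit message can be illustrated with a minimal sketch. This is not the code from loss_scale.py or optimizer.py: NumPy arrays stand in for the framework's tensors, and the helpers `unscale_grads` and `apply_weight_decay` are hypothetical names chosen for illustration. The assumption is that grads and weights are float16 while scale and weight_decay are float32, so the mixed operation would otherwise promote the result to float32.

```python
import numpy as np


def unscale_grads(grads, scale):
    """Divide each gradient by the loss scale, keeping the gradient's dtype.

    Hypothetical helper: the scale is cast to each grad's dtype first,
    so a float16 grad is not promoted to float32 by a float32 scale.
    """
    return [g / scale.astype(g.dtype) for g in grads]


def apply_weight_decay(weights, grads, weight_decay):
    """Add weight_decay * weight to each gradient, in the weight's dtype.

    Hypothetical helper: weight_decay is cast to each weight's dtype first.
    """
    return [g + weight_decay.astype(w.dtype) * w for w, g in zip(weights, grads)]


# Assumed setup: half-precision parameters, single-precision hyperparameters.
grads = [np.array([2048.0, 512.0], dtype=np.float16)]
weights = [np.array([1.0, -2.0], dtype=np.float16)]
scale = np.array([1024.0], dtype=np.float32)
weight_decay = np.array([0.01], dtype=np.float32)

# Without the cast, the float32 scale promotes the result.
mixed = grads[0] / scale

# With the cast, the result stays in the gradient's dtype.
unscaled = unscale_grads(grads, scale)
decayed = apply_weight_decay(weights, unscaled, weight_decay)

print(mixed.dtype, unscaled[0].dtype, decayed[0].dtype)
```

Casting the hyperparameter to the tensor's dtype, rather than the other way around, keeps the gradient and weight in their original precision through the update.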