yoonlee666
|
dfd85caa1b
|
delete enable_fused_layernorm
|
5 years ago |
yoonlee666
|
954f53da9e
|
enhancement
|
5 years ago |
wanghua
|
f347d1c9e7
|
add schema for BERT and TinyBERT
|
5 years ago |
shibeiji
|
40fc11e9a4
|
Mimic higher batch size by accumulating gradients N times before weight update
|
5 years ago |
wanghua
|
a40cc12fae
|
modify BERT and TinyBERT README.md
|
5 years ago |
chenhaozhe
|
fa10a4e483
|
optimize print of bert scripts'
|
5 years ago |
mindspore-ci-bot
|
7113f1f22c
|
!3857 modify bert and tinybert scripts and README
Merge pull request !3857 from wanghua/master
|
5 years ago |
wanghua
|
89fa2d3708
|
modify bert and tinybert readme
|
5 years ago |
panbingao
|
98b76b9020
|
remove old MINDSPORE_HCCL_CONFIG_PATH in model zoo 2
|
5 years ago |
GuoMengHao
|
2309e7369a
|
add_python_distribute_pretrain_script
Signed-off-by: GuoMengHao <guomenghao@huawei.com>
|
5 years ago |
wangnan39@huawei.com
|
082433183d
|
uniform learning_rate behavior of optimizers
|
5 years ago |
chenhaozhe
|
541bf81c1f
|
move bert to new model zoo directory
|
5 years ago |