mwang
|
8f8eee4b5e
|
bert thor supports lr configuration in config.py
|
4 years ago |
mindspore-ci-bot
|
1c12b84757
|
!11992 [Bert][Gpu]Sync modify of bert script from r1.1 to master
From: @hanhuifeng2020
Reviewed-by: @gaoxiong1,@anyrenwei
Signed-off-by: @anyrenwei
|
4 years ago |
MingHan-Y
|
67a4c62b4b
|
Add new optimizer THOR option to BERT pretrain script.
|
4 years ago |
hanhuifeng2020
|
53d4510ea6
|
[Bert][Gpu]Sync modify of bert script from r1.1 to master
|
4 years ago |
looop5
|
0161209e40
|
update submoudle akg, close graph kernel ascend ci testcases
|
4 years ago |
VectorSL
|
c13cd24e38
|
add restrict for gpu only
|
4 years ago |
shibeiji
|
f0b08e8bff
|
all reduce after each step in gradients accumulation mode for bert
|
4 years ago |
VectorSL
|
0c97835662
|
update control flow int adamweightdecay for bert
|
4 years ago |
hanhuifeng2020
|
3988376b67
|
Performance optimization of Bert on GPU by the graph_kernel
|
4 years ago |
tronzhang
|
17d6f1c2f9
|
add option for graph kernel and mixed precision
|
4 years ago |
mindspore-ci-bot
|
c95ed54fe1
|
!5239 reduce cyclomatic complexity in model zoo
Merge pull request !5239 from zhaoting/master
|
4 years ago |
mindspore-ci-bot
|
3671244ff8
|
!6233 move batch_size from bert_cfg_cfg to cfg
Merge pull request !6233 from yoonlee666/master
|
4 years ago |
yoonlee666
|
528072f45f
|
move batch_size from bert_cfg_cfg to cfg
|
4 years ago |
zhaoting
|
a4a65ffe06
|
reduce cyclomatic complexity
|
4 years ago |
root
|
ec947ebf3d
|
modify the ckpt path
|
4 years ago |
chenhaozhe
|
91c65a734a
|
fix some doc error
|
4 years ago |
lichenever
|
f2d3fd34ce
|
rectification_allreduce_fusion_api
|
5 years ago |
yao_yf
|
d4cfe55c04
|
rename mirror_mean to gradients_mean
|
5 years ago |
linqingke
|
4d9d8c3e74
|
Modelzoo interface change.
|
5 years ago |
mindspore-ci-bot
|
44a9c25251
|
!5632 Add clip_by_global_nrom in bert
Merge pull request !5632 from chenhaozhe/add-global-norm-to-bert
|
5 years ago |
chenhaozhe
|
ac95836257
|
add global norm in bert
|
5 years ago |
shibeiji
|
d57960ed4c
|
delete the redundant argument while initializing class of GradOperation
|
5 years ago |
mindspore-ci-bot
|
3f1e05881d
|
!5474 bert script bugfix
Merge pull request !5474 from yoonlee666/bugfix
|
5 years ago |
yoonlee666
|
954f53da9e
|
enhancement
|
5 years ago |
mindspore-ci-bot
|
e6a4d932b4
|
!5350 [AutoParallel]Rectification distributed init
Merge pull request !5350 from lichen/rectification_init
|
5 years ago |
lichenever
|
d3e55b543e
|
rectification init
|
5 years ago |
yao_yf
|
07117e4dd4
|
mv ParallelMode to context
|
5 years ago |
shibeiji
|
40fc11e9a4
|
Mimic higher batch size by accumulating gradients N times before weight update
|
5 years ago |
chenhaozhe
|
fa10a4e483
|
optimize print of bert scripts'
|
5 years ago |
yoonlee666
|
32846d791b
|
bugfix bert script
|
5 years ago |
yoonlee666
|
a5ac2427a7
|
bugfix bert script
|
5 years ago |
Wei Luning
|
776d094c5b
|
quant export fix up for atc tools
|
5 years ago |
shibeiji
|
29e35a31c0
|
add order params for bert to improve performance
|
5 years ago |
shibeiji
|
af4923123c
|
script update for bert
|
5 years ago |
chenhaozhe
|
6fdf380923
|
fix bert scripts to adapt the new concept of repeatcount in minddata
|
5 years ago |
wangnan39@huawei.com
|
082433183d
|
uniform learning_rate behavior of optimizers
|
5 years ago |
mindspore-ci-bot
|
8e4c0a9d93
|
!3212 GetDatasize feature
Merge pull request !3212 from anzhengqi/epochs-ready
|
5 years ago |
chenhaozhe
|
541bf81c1f
|
move bert to new model zoo directory
|
5 years ago |