6332 Commits (172a4687041206f3254caed12b706b13117dc902)

Author SHA1 Message Date
ling 6e230e807a op pad
5 years ago
mindspore-ci-bot e6c596a9d9 !3653 add epoch_num description master
5 years ago
zhouyaqiang 34864fbc56 fix weight init and add data aug
5 years ago
fangzehua 17d3982d46 fix scatter error msg
5 years ago
mindspore-ci-bot 6ea2aa4e73 !3672 fix serving input numbers
5 years ago
kswang 76733ce816 fix cpu multi graph mem error
5 years ago
mindspore-ci-bot 389cb35740 !3661 Alarm modification
5 years ago
mindspore-ci-bot 8f35d2ed29 !3664 Modify the order of init and open of TDT
5 years ago
mindspore-ci-bot b73ea6a7aa !3668 Modify collecting graph and dataset graph to step end stage
5 years ago
mindspore-ci-bot 567509affc !3522 add tinybert scripts
5 years ago
ms_yan abbd7b50db optimize the vgg script
5 years ago
dessyang 4307c1fa61 change the column order and add drop_reminder option to make this script compatible with BertCLS model
5 years ago
danish a2ffc9530e stuff added
5 years ago
Xun Deng e94d91ba95 remove import probability from nn/__init__.py
5 years ago
tony_liu2 269b477684 use np.testing.assert instead of asserting
5 years ago
mindspore-ci-bot 6f70146153 !3660 modify readme for maskrcnn
5 years ago
mindspore-ci-bot d66e6b33bf !3665 support multy node training in deeplabv3
5 years ago
mindspore-ci-bot afce1c3a40 !3341 GPU maxpool with argmax op
5 years ago
Ziyan 98e2ee90de fix optimizer parallel problems
5 years ago
ougongchang 1dafb2c6f5 Modify collecting graph and dataset graph to step end stage
5 years ago
shenwei41 051c290d8b Modify patches and alerts
5 years ago
gengdongjie 00f7a936bf add resnet50 support multi node training
5 years ago
zhouyaqiang b0004a1791 support multy node training and remove code
5 years ago
mindspore-ci-bot 387dac5832 !3651 change num_samples definition
5 years ago
hanjun996 20ccf83826 modify tdt
5 years ago
mindspore-ci-bot a3e7c4c754 !3625 Optimize tensor data
5 years ago
mindspore-ci-bot fe514bd1cc !3644 [MD] fix minddataset core dump when file list size ia greater than 1000.
5 years ago
hexia 3100824703 fix input
5 years ago
mindspore-ci-bot a337a02732 !3638 fix codex and support akg op profiling
5 years ago
yangzhenzhang 9aa84b3d14 add strided slice op
5 years ago
wandongdong b39d524d44 set out format to nhwc4
5 years ago
meixiaowei 8950952fe3 modify readme
5 years ago
limingqi107 af39ca8252 modify the wrong word
5 years ago
wilfChen 9cad0fec1d gpu broadcast to
5 years ago
mindspore-ci-bot 1b69923472 !3643 Throw exception if different communication ops which are divided to the same segement share the same input
5 years ago
mindspore-ci-bot d4b52ac59f !3489 use kernelruntime::mem_manager to reduce rtMalloc and rtFree time in trans data format
5 years ago
panfengfeng 4644085e92 add epoch_num
5 years ago
wanghua 7dd5e78fde add tinybert scripts
5 years ago
jiangzhiwen 1eda0ef071 change num_samples definition
5 years ago
mindspore-ci-bot fcdad59ce6 !3594 fix batchnorm issue under mix precision in pynative mode
5 years ago
GuoMengHao 2309e7369a add_python_distribute_pretrain_script
5 years ago
mindspore-ci-bot 12a150bb5d !3630 not reuse refnode input's memory
5 years ago
mindspore-ci-bot c57ad1528f !3635 fix dataset & train gil lock of gpu process master
5 years ago
mindspore-ci-bot 44e739ae31 !3627 fix: device occupied tdt hung
5 years ago
geekun 17d71280b8 fix codex and support akg op profiling
5 years ago
mindspore-ci-bot 9ccc6889eb !3624 fix GeneratorDataset time out
5 years ago
mindspore-ci-bot 4834a6b347 !3574 Rename AnfNode::user_data related functions to follow naming rule
5 years ago
liyong ed70de8070 fix coredump when number of file list more than 1000.
5 years ago
huanghui 311d8ea1f9 add exception when different communication op in one segment shared the same input
5 years ago
mindspore-ci-bot e4a7ca7f08 !3637 Lowering value checking threshold to support training with very small steps or
5 years ago