2488 Commits (ab57d3893ea2cfe8b002ed4a82e88a0d40b2f1e8)

Author SHA1 Message Date
Leo Zhao ff77dea969 not use transferscope cache in cpu case (#18578)
6 years ago
123malin b414645a65
fix #17430: unexpected training behavior with int64-type attr (#18264)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
Zeng Jinle be24e5b391
Clean unused code of dim and place (#18565)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Leo Zhao ce38bb5341 use static variable to do cache instead of thread local in thread frequent switching case (#18428)
6 years ago
gongweibao 160ddc980c
Regroup fusion by date type. (#18496)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
Tao Luo d234aa02cd
add transfer_scope_cache unit-test (#18467)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
Michał Gallus 7023a86c3a Fix Pooling output scale (#18186)
6 years ago
jiaqi 93a2b317f7
fix data feed ptr error (#18419)
6 years ago
chengduo 8ed33bf91f
Fix Bug-prone code of PE (#18354)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
chengduo 135a59ed45
update reduce config (#18334)
6 years ago
chengduo 5489216eba
Clean build strategy (#18148)
6 years ago
chengduo 14e1e165df
update alloc_continuous_space_for_grad_pass (#18287)
6 years ago
jiaqi 3f8031e256
dataset (#17973)
6 years ago
chengduo 25f3cd6486
Update execution_strategy option default value (#18183)
6 years ago
chengduo 4978db2c10
Remove nccl dep when the number of GPU is 1 (#18158)
6 years ago
chengduo 24e988a471
Fix bug of scope_buffered_ssa_graph_executor (#18100)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
chengduo b5a1c1463d
Update CPU_NUM config (#18059)
6 years ago
hutuxian f1d458daf0
add trainer_desc proto DEPS (#18019)
6 years ago
gongweibao da9143c1cc
Polish codes of old prs. (#17938)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
hutuxian 969e6378b9
Pipeline Concurrency (#17402)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
gongweibao 972c54cd70
Fix FLAGS_fuse_parameter_memory_size unit from Bytes to MBytes. (#17924)
6 years ago
gongweibao dd4cd352c7
Fix sync_batch_norm_op ncclallreduce error! (#17918)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
wopeizl 453a49b1bc
Make ParallelExecutor support Windows GPU (#17787)
6 years ago
baojun a4c528a31c [NGraph] some ngraph updates to enable bert (#17739)
6 years ago
chengduo 437520474c
fix DropLocalExeScopes (#17829)
6 years ago
Leo Zhao 50326563d5 enable mkldnn primitive reuse for platform reorder (#17826)
6 years ago
chengduo 863c75168c
polish error doc (#17772)
6 years ago
guru4elephant d52391094d
fix prepare context redundant code problem, optimize executor by cach… (#17743)
6 years ago
chengduo 67c8dade58
Add Event in ScopeBuffer Executor (#17667)
6 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
6 years ago
gongweibao 0d561ef442
fix 2dconn test=develop (#17681)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Zeng Jinle 4aa931dd85
Code clean of Allocator (#17602)
6 years ago