Commit Graph

2629 Commits (6e6eab07e80d287fb10f6033a01f15650b36fcdb)

Author SHA1 Message Date
zhongpu 72d1d72c09 fix ExecutionContext::HasInput and ExecutionContext::HasOutput depend on the scope structure, test=develop (#20721)
5 years ago
石晓伟 48a774c713
fix ts_sort's bug, test=develop (#20720)
5 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
5 years ago
xujiaqi01 5223b0dd9d
add check nan / inf in downpour worker (#20694)
5 years ago
WangXi 507afa8a8a Fix dgc nan by stripping nccl from sparseReduce. (#20630)
5 years ago
Zeng Jinle 4eeda9d676
fix tensor_util, test=develop (#20699)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 10505faf4e
polish codes, test=develop (#20672)
5 years ago
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
WangXi cadc6a9704 fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
5 years ago
Thunderbrook f76a32df4a
dump fix dov vec file num (#20539)
5 years ago
633WHU 12e4be0382 Dlpack support (#20039)
5 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
5 years ago
Zeng Jinle a9c8bdad7b
refine pe codes, test=develop (#20479)
5 years ago
Zeng Jinle 76b321872a
fix cuda dev_ctx by event, test=develop (#20553)
5 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
5 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
5 years ago
xujiaqi01 22b80e1246
fix parse content in CreatePreLoadReaders (#20258)
5 years ago
hong fa43e80e19 New save load interface (#20148)
5 years ago
Zeng Jinle c20b11ba11
simplify op_info.h, test=develop (#20195)
5 years ago
hong 0ec2c081d9
update op compatible list; test=develop (#20175)
5 years ago
tangwei12 c9139c3db3
trainer from dataset fetch targets (#19760)
5 years ago
chengduo bfa55c9ddb Add place deps for fused_all_reduce_op_handle (#20077)
5 years ago
Zeng Jinle 5fef859c65
remove map type from var_type_traits.h, test=develop (#20090)
5 years ago
Zeng Jinle 4ad66c779c
fix op_compatiable_compile_error, test=develop (#20076)
5 years ago
qingqing01 1a3eef026c
Enable users to create custom cpp op outside framework. (#19256)
5 years ago
bingyanghuang 9de6772510 Follow comment of Merged QAT PR 18970 (#19979)
5 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
5 years ago
joanna.wozna.intel f5221ac19f Disable conv requant squash (#20041)
5 years ago
wangchaochaohu c9ea317b36
codegen code for reconstruction (#19728)
5 years ago
tangwei12 8f0b3c0516
the integrated communicator (#19849)
5 years ago
Chen Weihang b916335025 Paddle error message stack shaping and optimization (#19895)
5 years ago
chengduo 2450d15b78
disable fuse_all_optimizer_ops (#19966)
5 years ago
chengduo 101a2b610a Add dtype for coalesce_tensor_op (#20016)
5 years ago
Huihuang Zheng 88af4ab650
Add new data layer (#19916)
5 years ago
xujiaqi01 f50e701b3b
fix memory leak in HogwildWorker (#19956)
5 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
5 years ago
Zeng Jinle cc157d5990
add inplace to assign op, test=develop (#19927)
5 years ago
chengduo 55ce696986
clean tensor array (#19930)
5 years ago
chengduo d7251a8e1e
Delete local execution scopes (#19749)
5 years ago
wopeizl 5452b6a152
remove the useless warning for user to avoid confuse test=develop (#19871)
5 years ago
hong 85b398f171
Add op compatible information (#19910)
5 years ago
Huihuang Zheng e117114289
Set states of recurrent op as dependent vars in prune (#19865)
5 years ago
Zeng Jinle b754700fb5
fix reduce and broadcast to avoid multi-stream, test=develop (#19889)
5 years ago
joanna.wozna.intel 3f1d0234ae Fix conv2d+dequantize squash for residual fusion (#19545)
5 years ago
Huihuang Zheng a35557d8f4
Fix deps of prune (#19876)
5 years ago
Leo Chen 578a2f5da3 fix SplitLodTensor when batch_size = 0, test=develop (#19866)
5 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
5 years ago