4940 Commits (f5db451d3b87f6138598240db2d5c9894b6d6cf8)

Author SHA1 Message Date
Adam ab610a34ff
transpose_mkldnn code change to meet Paddle standards (#22591)
6 years ago
Jiawei Wang 8f035fb637
Add TopK Op Grad CPU&GPU Kernel test=develop (#22628)
6 years ago
Steffy-zxf 90ee366653
update ops's unittest data type from float32 to float64 and shape over 100 (#22544)
6 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
6 years ago
Guo Sheng 31b5464632
Add support for dynamic_decode(while) training. (#22231)
6 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
6 years ago
Double_V 58d99247f4
support slice double grad, test=develop (#22166)
6 years ago
hutuxian 1a7962be97
Paddlebox about box_wrapper (#22497)
6 years ago
huzhiqiang 9e29d3ebed
【OpPorting Example】DEMO OF FIX COMPILE&RUNTIME LOD_EQUALITY (#22460)
6 years ago
zhaoyuchen2018 54970444ce
Improve transpose performance with tile sm copy, test=develop (#22311)
6 years ago
Wilber a90fa54092
Compile without nccl deps. [1/2] (#22509)
6 years ago
Wilber de009152a7
Compile without nccl deps. [2/2] (#22484)
6 years ago
Yiqun Liu 4b2227e958
Fix dismatch of std::max's arguments type on windows. (#22507)
6 years ago
Wilber 870f465887
fix test_fusion_seqpool_concat lod level between compile and runtime (#22488)
6 years ago
Zhong Hui a61d09527b
Fix the integer overflow problem of sequence2batch (#22479)
6 years ago
cc 197913ebe1
Add weight quantization in post_training_quanzitaion (#22445)
6 years ago
Tao Luo 7c9ce097f1
refine reshape_op shape error message (#22480)
6 years ago
LielinJiang 2b1386b2b2
optimize performance of interpolate op (#22436)
6 years ago
Yiqun Liu 44b45b9f07
Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
6 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
6 years ago
Tao Luo 943cb8c664
fix sigmoid cudnn bug (#22439)
6 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
6 years ago
liu zhengxi 0404e7a985
Update the precision of pad, pad2d, pad_constant_like's unit tests from fp32 to fp64 (#22394)
6 years ago
Michał Gallus 269db0d1d1
[DNNL] Fix accuracy in INT8 FC (#22404)
6 years ago
joanna.wozna.intel fb3086fd57
[UT coverage]Remove unnecessary transpose op registration (#22402)
6 years ago
lidanqing ade5022681
[UT Coverage]Improve sum_mkldnn_op line coverage (#22275)
6 years ago
Wojciech Uss 92462e948d
improve elementwise_add_mkldnn_op test code coverage (#22359)
6 years ago
ceci3 20f30dd604
add benchmark flag for conv_transpose (#22389)
6 years ago
Chengmo 8f36c39537
Fix GEO-SGD init & send Bug (#22375)
6 years ago
zhupengyang c6f888e5a5
update unittest accuracy to float64 for relu, prelu, maxout (#22273)
6 years ago
wangchaochaohu 0d8b222b79
Optimize the depthwise op test=develop (#22265)
6 years ago
qingqing01 2d20869c94
Fix infer_shape in compling for elementwise_op (#22291)
6 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
6 years ago
wangchaochaohu 1e932eccfa
remove unused code test=develop (#22327)
6 years ago
Leo Chen 3e5744aa65
Remove unused inputs for some operators (#22284)
6 years ago
zhangchunle 805328e13b
fix typo in error message (#22312)
6 years ago
lidanqing 895f8da7d6
change std::cout to log(INFO), vlog (#22316)
6 years ago
Bai Yifan faba4b116a
Remove disable flag in test_fsp_op.py (#22171)
6 years ago
Adam 9942d9ed5c
Add caching mechanizm to requantize_mkldnn_op (#22223)
6 years ago
123malin 985bceac53
Bug fix for sparse recorder (#21969)
6 years ago
FlyingQianMM 443a713c9e
add backward gradient computation for op argsort (#22203)
6 years ago
Zhen Wang 46189b166d
Add bn and relu fuse pass (#22048)
6 years ago
baojun 298ee7d28a
Improve ngraph file line coverage (#22155)
6 years ago
zhongpu d0f0a2520c
test Optimizer in dygraph (#21949)
6 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
6 years ago
zhaoyuchen2018 3d4f2aa689
Refine stack op to improve xlnet performance, test=develop (#22142)
6 years ago
liu zhengxi 64a4044292
add double register op_data_type of pad2d and fix compile error, test=develop (#22075)
6 years ago
Double_V 6ea3809143
Support prroi_pool_op with Tensor and LoDTensor rois (#20649)
6 years ago
zhaoyuchen2018 3dbd4087fe
Fix windows build not kernel issue, test=develop (#22105)
6 years ago
Chengmo 418abc92f4
Update pyramid related OP (#21372)
6 years ago