Commit Graph

9721 Commits (0bddb951c2017fd9cc9d370e718eb01902bf00f2)

Author SHA1 Message Date
ruri 94bef03539
Revert "Add masked select api (#21172)" (#21456)
7 years ago
ruri 3706ea67f8
fix sample code in density prior box
7 years ago
Zeng Jinle 87ab93af01
fix adam fp64, test=develop (#21423)
7 years ago
liym27 beec87b911
fix bug in example codes of API case and switch_case. test=develop,test=document_fix (#21477)
7 years ago
hutuxian 7e68bc896b
refactor AUC OP and add its CUDA Kernel (#21336)
7 years ago
juncaipeng 1f57ac1241
delete concat in AddQuantDequantPass, test=develop (#21454)
7 years ago
Zeng Jinle 2a54c359f0
add fraction of cpu memory to use, test=develop (#21453)
7 years ago
Zhang Ting 101240d2c1
fix PythonAPI test in Op unittest, test=develop (#21462)
7 years ago
wawltor dbbe6e9cb6
fix the device supported of the op unique and unique_with_counts. (#21395)
7 years ago
Huihuang Zheng 32959e031e
Add English Document for cond API (#21452)
7 years ago
Zhang Ting 3df13ab40c
fix PythonAPI test in Op unittest, test=develop (#21455)
7 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
7 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
7 years ago
Chen Weihang 664f958a02
Fix optimizer op infershape failed in dygraph multi-cards mode (#21374)
7 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
7 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
7 years ago
xujiaqi01 ca879e5a77
fix skip_op bug (#21418)
7 years ago
zhaoyuchen2018 b16274556a
Add dscending for argsort (#21400)
7 years ago
hong ac8546701d
Add dygraph execution context (#20157)
7 years ago
hutuxian a6b089c614
add macro to ban windows (#21422)
7 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
7 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
7 years ago
ruri 007c997572
Add masked select api (#21172)
7 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
7 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
7 years ago
xujiaqi01 f1178e9d79
fix fleet save bug (#21362)
7 years ago
Liufang Sang 1840c1652c add config file to avoid load checkpoint test=develop (#21373)
7 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
7 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
7 years ago
Youwei Song d5ff79e55e Support numpy bridge (enabled by default in dygraph mode) (#20983)
7 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
7 years ago
itminner 07e6a94268 paddleslim quantization skip pattern support list of string (#21141)
7 years ago
Zhen Wang be2e3e67d9
Fix some typos in AMP. (#21354)
7 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
7 years ago
hong a214a3081b
change download log format (#21290)
7 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
7 years ago
ruri 6cfcbe0510
reduce interp op input size to pass CI, test=develop (#21341)
7 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
7 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
7 years ago
Dong Daxiang 0a93635b5f
fix logger problem (#21342)
7 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
7 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
7 years ago
lijianshe02 7fcaa39b36
fix Print_op input dtype list error test=develop (#21326)
7 years ago
juncaipeng 84865b806b add resnet50 test for post trainint quantization, test=develop (#21272)
7 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
7 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
7 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
7 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
7 years ago
Yi Liu f1b09ba30e
adapt test_collective_base.py for only two GPU cards available. (#21307)
7 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
7 years ago