Commit Graph

25763 Commits (007c9975727ca6ab28b253bc8bdee0dfae832073)
 

Author SHA1 Message Date
ruri 007c997572
Add masked select api (#21172)
5 years ago
tianshuo78520a d624b417d8 change make nproc on Cloud Integration (#21350)
5 years ago
wangchaochaohu 8293f21a52
Profile refine (#21258)
5 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
5 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
5 years ago
xujiaqi01 f1178e9d79
fix fleet save bug (#21362)
5 years ago
ShenLiang e2c6f434ec Add Lod information for gather_nd & scatter_nd (#21404)
5 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
5 years ago
Liufang Sang 1840c1652c add config file to avoid load checkpoint test=develop (#21373)
5 years ago
wangchaochaohu e0e205ea2d
fix the profiling bug test=develop (#21396)
5 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
zhouwei25 b39f947698 Eliminate the impact on incremental compilation (#21410)
5 years ago
tianshuo78520a e0da2bcd54 optimization check_api_approvals (#21371)
5 years ago
Zhaolong Xing d1a6e112e6 fix C++ multicard inference bug. (#20955)
5 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
5 years ago
Youwei Song d5ff79e55e Support numpy bridge (enabled by default in dygraph mode) (#20983)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
5 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
5 years ago
itminner 07e6a94268 paddleslim quantization skip pattern support list of string (#21141)
5 years ago
Tao Luo d8e7d25274
make CUDA_ARCH_NAME default Auto (#21352)
5 years ago
Zhen Wang be2e3e67d9
Fix some typos in AMP. (#21354)
5 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
5 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
5 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
5 years ago
Zeng Jinle dbba9c7e4b
polish global_value_getter_setter, test=develop (#21332)
5 years ago
hong a214a3081b
change download log format (#21290)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
ruri 6cfcbe0510
reduce interp op input size to pass CI, test=develop (#21341)
5 years ago
silingtong123 45c1e7bb7b add prediction demo and script on windows (#21248)
5 years ago
silingtong123 4b429c190d package the CAPI inference library and third_party (#21299)
5 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
5 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
Dong Daxiang 0a93635b5f
fix logger problem (#21342)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
lijianshe02 7fcaa39b36
fix Print_op input dtype list error test=develop (#21326)
5 years ago
juncaipeng 84865b806b add resnet50 test for post trainint quantization, test=develop (#21272)
5 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
5 years ago
zhouwei25 341dee0657 Cache 3rd source code, improve stability, reduce the compilation time (#21190)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
5 years ago
Yi Liu f1b09ba30e
adapt test_collective_base.py for only two GPU cards available. (#21307)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
5 years ago