Commit Graph

9010 Commits (234060f88f631654ab4dd953d6abdc4eed7df171)

Author SHA1 Message Date
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
silingtong123 45c1e7bb7b add prediction demo and script on windows (#21248)
6 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
6 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
6 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
wangchaochaohu 6514f52e46 fix the fill_constant op precious problem (#21322)
6 years ago
zhaoyuchen2018 08c19c585d Improve argsort performance. (#21267)
6 years ago
Thunderbrook 9a7832f8be print table stat info for pslib (#21296)
6 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
6 years ago
Zeng Jinle b9f8ae8494 Add global value getter setter (#21285)
6 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
6 years ago
Dong Daxiang 691ced87c0 Refactor fetch handler (#21264)
6 years ago
gongweibao ed2a185248 optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
6 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
6 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
6 years ago
Zhang Ting 698b8b73ad optimize lod_reset op to avoid data transform
6 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
6 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
6 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
6 years ago
Chen Weihang 952508527a Polish some PE code details (#21274)
6 years ago
Thunderbrook 0d17c1b816 solve pslib core in stop worker (#21263)
6 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
6 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
6 years ago
Thunderbrook 349e82d669 support general embedding params (#21217)
6 years ago
liu zhengxi 3cb6c0a059 Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
6 years ago
Yiqun Liu 6b1e1f0dda Enable generating code for a given subgraph. (#21126)
6 years ago
zhaoyuchen2018 3ff5cc2d5e Fix topk compile failed on windows (#21243)
6 years ago
Pei Yang 2e2f92a5b1 fix trt weight bug (#21231)
6 years ago
Zhang Ting 01a9646323 optimize assign op to avoid copy data from GPU to GPU (#21181)
6 years ago
zhouwei25 c0dcb090a3 Determine whether to copy and link inference lib by ON_INFER (#20931)
6 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
6 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
6 years ago
Zeng Jinle a152315be7 refine Tensor method, test=develop (#21031)
6 years ago
yaoxuefeng b5d8ba8394 fix data_norm op to avoid impractical normalization result test=develop (#21152)
6 years ago
Zeng Jinle 67e88424e5 Polish jit trace codes (#21218)
6 years ago
Zeng Jinle cdb3d27985 Fix warn of gcc8 (#21205)
6 years ago
liuwei1031 d8b6cf2bcd fix sporadically hang issue on windows(#21201)
6 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
6 years ago
Zhaolong Xing 65f7052554 TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
6 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
6 years ago
xujiaqi01 23876de55b fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)
6 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
6 years ago
GaoWei8 a9d4eed3a8 fix cmake fails on inference_download_and_uncompress (#21185)
6 years ago
xujiaqi01 9e045170c0 add copy table (#21086)
6 years ago
ruri aeb887911f Refine edit distance cn (#21121)
6 years ago
Kaipeng Deng 98b59cb82c fix elementwise_mod float point kernel. test=develop (#21183)
6 years ago
Zeng Jinle 5fdfbe3413 Add friendly dygraph trace API (#21091)
6 years ago
Chen Weihang 4bd9463630 fix detail error message error, test=develop (#21170)
6 years ago
whs cfdd1fc2cd Fix warpctc in padding mode. (#21033)
6 years ago