Commit Graph

16677 Commits (0536b5263d188c54069765c4168bdba91ad250c7)

Author SHA1 Message Date
songyouwei 99d30bfc36
speedup slice impl (#23340)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
Yi Liu 0471476a18
fix nccl comm double free bug (#23344)
5 years ago
wangchaochaohu 1ee2a9a424
Profiler refine (#23294)
5 years ago
Leo Chen 488b2387e2
Feature/expand params in auto-generated pybind functions for dygraph operators (#23181)
5 years ago
GaoWei8 20eed5401a
Change fluid.layers.where's C++ operator name (#23250)
5 years ago
Yi Liu 2169e6fb58
Initialize global nccl_comm in PE (#23275)
5 years ago
Jacek Czaja 012886df79
[DNNL] Softmax mkldnn op inplace support (#23197)
5 years ago
石晓伟 75ebb48a91
supports thread-binding stream, test=develop (#23177)
5 years ago
石晓伟 708ded584e
pause the io_utils_test of int64 and resume after repair, test=develop (#23234)
5 years ago
Zeng Jinle babda94c8a
Distinguish public/private global vars (#23269)
5 years ago
zhaoyuchen2018 58615a6272
Improve elementwise performance. (#23001)
5 years ago
Wojciech Uss f836c8aa8f
add check for scales and a message (#23119)
5 years ago
Zeng Jinle 8bfd62ffb7
Expose dygraph.grad api (#23124)
5 years ago
Wilber 0129f4b568
Add some inference API comments for AnalysisPredictor (#23242)
5 years ago
Tao Luo c00d427d52
simplify the cmake log of ir/CMakeLists.txt (#23262)
5 years ago
Zeng Jinle 77b4dc80c9
code polish for adding const qualifier, test=develop, test=document_fix (#23248)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
xujiaqi01 68ea1ad55b
add clear one table (#23089)
5 years ago
danleifeng ae3bb16d06
add MaskAucCalculator in paddlebox (#23157)
5 years ago
liym27 6af480ca33
Support int64 for op assign_value. test=develop (#23179)
5 years ago
Zeng Jinle 53e6f8e1da
rename macro, test=develop (#23161)
5 years ago
Zeng Jinle bba740710d
add cuda resource pool for BufferedReader, test=develop (#23152)
5 years ago
Zeng Jinle 7d8d50b6cc
rename no_need_buffer_vars macro, test=develop (#23160)
5 years ago
Liufang Sang a486a739e1
fix compile error in win gpu (#23196)
5 years ago
Zeng Jinle 7ca77a90ac
add Tensor::IsSharedBufferWith method, test=develop (#23175)
5 years ago
Zeng Jinle b8886bf122
rename no_need_buffer_vars_macro, test=develop (#23159)
5 years ago
Zeng Jinle bae5930ba1
fix graph attr copy issues, test=develop (#23191)
5 years ago
wangchaochaohu b721e23b25
transpose cudnn using cudnn v7 api (#19738)
5 years ago
Pei Yang 46b8d282dc
Add some inference API comments for AnalysisConfig (#23117)
5 years ago
Adam 4f5e4540f8
Improve SGD jit code to work with large data (#23120)
5 years ago
Liufang Sang 4db031902d
add dequantize_log_op and make pyramid hash support int8 weight (#22548)
5 years ago
Zeng Jinle e5fef8f38a
[Dygraph double grad]Code polish (#23121)
5 years ago
Zeng Jinle 9258e96094
fix read op comments, test=develop, test=document_fix (#23122)
5 years ago
Zeng Jinle acfc9b8a70
Reader sequential and inference partial feed (#22699)
5 years ago
Wilber 95b356a069
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114)
5 years ago
Zeng Jinle a31d7328b7
Add dygraph double grad implementation (#22939)
5 years ago
Yiqun Liu 3af4771122
Add the detection and code-generation of sqrt and square in fusion_group (#23095)
5 years ago
hutuxian 0c30098f8b
Add need_save_delta parameter to solve OOM (#23097)
5 years ago
songyouwei 2e2da7124b
high-performance dygraph slice (#22879)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Zhaolong Xing 8c6fde9e69
fix align error (#23090)
5 years ago
Liufang Sang 915b892a15
Fix div zero in fake quantize op (#22966)
5 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
5 years ago
Zhang Ting 880eb04d93
skip PrepareData when it is unnecessary (#22839)
5 years ago
Feiyu Chan 01ab8a0619
add approximation for gelu, test=develop (#22961)
5 years ago
Adam 5842ae6785
Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)" (#22985)
5 years ago
Pei Yang 24db750386
fix trt int8 calib precision bug. test=develop (#23036)
5 years ago
GaoWei8 1dc1f9270e
Fix lod error of concat op for axis = 0 (#22538)
5 years ago