Commit Graph

858 Commits (46b8d282dcdc4c595fda865dc94bc459da9fc282)

Author SHA1 Message Date
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
5 years ago
wangchaochaohu 99db0cf762
remove debug log test=develop (#22994)
5 years ago
wangchaochaohu c979c9f2b0
refine the profiler print test=develop (#22968)
5 years ago
Zhang Ting ca9c8b417d
fix compute ratio of profile, test=develop (#22872)
5 years ago
wangchaochaohu dbb0b9b3b6
refine the profiler print (#22823)
5 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
5 years ago
Zhang Ting 72ff5a09c3
fix print bug of profile, test=develop (#22804)
5 years ago
wangchaochaohu 8456c3f4dd
polish the profiler_help code (#22811)
5 years ago
wangchaochaohu 7578fcbac4
Profile code refine (#22800)
5 years ago
Adam 2b80e9a719
Add cpu_info without XBYAK (#22716)
5 years ago
Zhang Ting f97f3f9301
add framework overhead ratio in profile report (#22590)
5 years ago
wangchaochaohu 611411b90e
Fusion group profile support (#22718)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
wangchaochaohu a089072c8b
fix the profile print error (#22665)
5 years ago
wangchaochaohu c65c6ae534
add flag to control profile level in python API (#22319)
5 years ago
Chen Weihang fe685cc185
fix enforce test error, test=develop (#22610)
5 years ago
Chen Weihang 266106da75
Fix mismatch with plus sign in the line (#22588)
5 years ago
Wilber de009152a7
Compile without nccl deps. [2/2] (#22484)
5 years ago
LielinJiang 2b1386b2b2
optimize performance of interpolate op (#22436)
5 years ago
wangchaochaohu 77dd0d97bb
use enum class to replace the usage of enum in some condition test=develop (#22464)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
Michał Gallus 269db0d1d1
[DNNL] Fix accuracy in INT8 FC (#22404)
5 years ago
wangchaochaohu 621d3e0b66
fix the bug of profile update (#22207)
6 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
6 years ago
Yiqun Liu 96980c2244
Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144)
6 years ago
wangchaochaohu c3876cf82d
add support for nested profiling event and printing in different level (#22061)
6 years ago
zhaoyuchen2018 3d4f2aa689
Refine stack op to improve xlnet performance, test=develop (#22142)
6 years ago
Zeng Jinle 4c2df8e4d4
fix allocator strategy comment, test=develop, test=document_fix (#22121)
6 years ago
bingyanghuang 7872d06ff4
Add explanation on conv grad for dims<3 (#22125)
6 years ago
Chen Weihang ba8414d3a5
replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109)
6 years ago
Jacek Czaja b0b27ff699
[MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088)
6 years ago
Zeng Jinle 9587249442
polish allocator strategy doc, test=develop, test=document_fix (#22095)
6 years ago
Zeng Jinle d9f5d1eb29
ag allocator by default, test=develop (#21837)
6 years ago
Jacek Czaja ad8a9cb82c
[MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747)
6 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
6 years ago
Chen Weihang 2e9082250d
polish default error msg & cublas error hint, test=develop (#22032)
6 years ago
Chen Weihang 35ff1568e9
Add error message for cublas inItizalize failed (#21995)
6 years ago
Chen Weihang fbb42173a9
fix no hint problem when use ENFORCE for cuda, test=develop (#21994)
6 years ago
Chen Weihang 1fd1f06f11
Rename paddle throw error macro (#21657)
6 years ago
Adam e81f0228df
MKL-DNN 1.0 Update (#20162)
6 years ago
Zeng Jinle 97e76cb96d
refine dev_ctx.Wait() exception throw, test=develop (#21600)
6 years ago
Huihuang Zheng b241c7329c
Refine a Warning Which Can Occur Not Only During Init (#21546)
6 years ago
wangchaochaohu 932aca162d
Add Branch to avoid CPU profiler warning print (#21556)
6 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
6 years ago
Huihuang Zheng a71f53d7ac
Add warning message when initialize GLOG failed. (#21487)
6 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
6 years ago
Jie Fang 5e813b53c5
nhwc optimization for batchnorm (#21090)
6 years ago