Commit Graph

9160 Commits (7f4abaf2f555ed9df515b2e521e72df5f4dbd52a)

Author SHA1 Message Date
SunAhong1993 7f4abaf2f5
register int/int64_t/float16 in pow/square kernel,test=develop (#22023)
6 years ago
Leo Chen 3f653c8323
register NoNeedBufferVarsInference for max_pool_grad_op, test=develop (#22055)
6 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
6 years ago
Michał Gallus 6192108408 [DNNL] 3D Fully-Connected (#21746)
6 years ago
FDInSky aa2ed0dcc6 fix generate_proposal_labesl op (#21793)
6 years ago
ceci3 95d79b6d00
update error log for batch_norm_grad (#22017)
6 years ago
Aurelius84 c53b62eb8e
fix integer overflow in match_matrix (#22036)
6 years ago
Chen Weihang 2e9082250d
polish default error msg & cublas error hint, test=develop (#22032)
6 years ago
wangchaochaohu 64baee4144
polish code test=develop (#22014)
6 years ago
Chen Weihang 35ff1568e9 Add error message for cublas inItizalize failed (#21995)
6 years ago
Chen Weihang fbb42173a9
fix no hint problem when use ENFORCE for cuda, test=develop (#21994)
6 years ago
zhouwei25 e66f92d1ae Modify demo_ci to support Windows, prepare for PR_Windows_Inference (#21873)
6 years ago
danleifeng b7697f6218 fix broadcast bug;test=develop (#21898)
6 years ago
liu zhengxi 196e20dfbb
Fix multi-threads memory out of bounds error for passes (#21920)
6 years ago
zhaoyuchen2018 8859ddd6cf
Refine multihead kernel, align block to 32 (#21961)
6 years ago
silingtong123 fd9b00df4b test=develop, remove unused variable (#21974)
6 years ago
zhoushiyu cee2ccb078
add shuffle batch op (#21674)
6 years ago
mapingshuo c3e1954918
make reverse op support negative axis (#21925)
6 years ago
石晓伟 03479469a7
fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
6 years ago
Aurelius84 10d6846900
Remove double registered dataType in Pad2d (#21942)
6 years ago
zhouwei25 2df4be5d35 Fix openblas bug to support compile on windows when WITH_MKL=OFF (#21902)
6 years ago
hutuxian 27decacb8a
fix aucop stat shape (#21846)
6 years ago
Pei Yang 3e5008ad01
fix trt calib not working bug, test=develop (#21934)
6 years ago
Aurelius84 5cb2c74127
add register op_data_type of pad/expand_as et.al (#21718)
6 years ago
qingqing01 2066745847
Pack imperative/layer into paddle_framework.so (#21921)
6 years ago
hong 30d000f8c2
fix matmul error message; test=develop (#21885)
6 years ago
zhouwei25 a01663ca1f remove patch command and file of cares to Improved quality of Paddle Repo (#21776)
6 years ago
flame 2bbc0d7d60
python zero copy inference, delete pass (#21897)
6 years ago
Aurelius84 51a86d2b6b Optimize adam speed (#21777)
6 years ago
Leo Chen 310edc0d0c
Update layers used in ptb model to use auto-generated op functions in dygraph mode (#21724)
6 years ago
lidanqing 9dff56e8e2 change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
6 years ago
FDInSky 6b9fbcf3ad Update iou_similarity op to support non-normalized bbox (#21671)
6 years ago
guofei 46f9184aff Modify the while_loop API (#21844)
6 years ago
Guo Sheng 7689b6aaa4
Fix default label dim of label_smooth_op. test=develop (#21862)
6 years ago
GaoWei8 d4dda8628e optimize fc jit (#21878)
6 years ago
Chen Weihang 2b941736f3 fix softmax_with_cross_entropy_fix bug, test=develop (#21810)
6 years ago
Thunderbrook c3cf42d0f7
add table id in cache shuffle (#21585)
6 years ago
Michał Gallus 253e664275 Disable memory opt pass when DNNL is on (#21826)
6 years ago
Chengmo a86f11b5f5
Speed GEO dense calc & communication (#21579)
6 years ago
Wojciech Uss 666c3bb9b0 handle multi-inputs with empty inputs for mkldnn_concat_op (#21827)
6 years ago
Zeng Jinle aa4d6a5d6c
Add some debug flags to auto growth allocator (#21766)
6 years ago
guofei 8b7c50f49a Make While Op could run on GPU place and add while_loop unittest (#21672)
6 years ago
WangXi 17299b8d21 fix batch_norm_grad infer shape=0 & add allreduce enforce shape, test=develop (#21801)
6 years ago
Huihuang Zheng 557bce77da
Fix Backward Bugs in Conditional Block (#21809)
6 years ago
xujiaqi01 0eb4d990c4
fix compiled error when with_pslib=on (#21769)
6 years ago
Huihuang Zheng 0677a1c1c1
Fix That conditional_block_op Doesn't Have InferShape (#21733)
6 years ago
zhaoyuchen2018 a5a8d14414
Fix softmax cuda bug (#21720)
6 years ago
Kaipeng Deng 943a44492b
yolo_box OP add Attr(clip_bbox). (#21620)
6 years ago
Michał Gallus a5159d8480 Re-anble vgg and resnet101 models download (#21713)
6 years ago
Leo Chen 7181afd75c
Fix elementwise_pow bug on CUDA place with integer (#21675)
6 years ago