Commit Graph

3865 Commits (c47f3cc7fe3221e074d2e3e301d1e0e6bfa78921)

Author SHA1 Message Date
liangan1 4acc522087 Enable function coverage for U8/S8 ConvMKLDNNOpKernel
6 years ago
sneaxiy a8c4324d3c fix hang bug
6 years ago
Xin Pan 44e7fcddc5
Merge pull request #15844 from panyx0718/infer
6 years ago
Jacek Czaja dec9cf53c8 [MKL-DNN] MKL-DNN specific Tensor modification (#15429)
6 years ago
heqiaozhi 08c96d1b48 remove mkldnn & fix commit
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
heqiaozhi fab09ac0b8 Merge branch 'new_develop' of https://github.com/colourful-tree/Paddle into new_develop
6 years ago
heqiaozhi da4f5a2f18 remove mkl & fix commit
6 years ago
colourful-tree f2d6473ef8
Merge branch 'develop' into new_develop
6 years ago
heqiaozhi 04f876f5bc remove mkl & fix commit
6 years ago
dengkaipeng 373cfb0ccf use kernel size in global_pooling. test=develop
6 years ago
dengkaipeng 60305196b8 fix spell mistakes. test=develop
6 years ago
Tao Luo 8a7efc78f1
Merge pull request #15882 from sfraczek/unique_ptr_dereference
6 years ago
tensor-tang a0c37662b9 enable sgd jitkernel refer code and test
6 years ago
xuezhong 1dad36f6aa
Merge pull request #15609 from xuezhong/add_sample_logits_op
6 years ago
Kaipeng Deng 9e524a7b51
Merge pull request #15870 from heavengate/fix_adaptive_pool_doc
6 years ago
sneaxiy 1e4c0a6f72 merge develop
6 years ago
dengkaipeng 14df92fe8f fix spell error. test=develop
6 years ago
dengkaipeng 144016fcfc fix adaptive_pool and yolov3_loss. test=develop
6 years ago
Sylwester Fraczek 74672d1aff Change *(smart_ptr.get()) -> *smart_ptr
6 years ago
tensor-tang ee2321debd
Revert 15770 develop a6910f900 gelu mkl opt (#15872)
6 years ago
xuezhong 81870723c6
Merge pull request #15605 from xuezhong/fix_bug_for_lstmp
6 years ago
dengkaipeng eb65b4e47d \frac -> \frac. test=develop
6 years ago
nhzlx 1d5ef7c9ee 5. add static trt load model
6 years ago
dengkaipeng 8167588f14 add blank after math::. test=develop
6 years ago
dengkaipeng d9ec605873 use math:: instead of 29. test=develop
6 years ago
dengkaipeng 19292ac6a1 fix adaptive pool doc.test=develop
6 years ago
Yiqun Liu 7d96c74ab2
Initialize the benchmark tester for operator. (#15772)
6 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
6 years ago
mozga-intel 5d132ecf83 Auto-cmake generator, auto-fill map (#15402)
6 years ago
Krzysztof Binias 1578c60bdd Add new ut and remove unnecessary code
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
sneaxiy 7160cb0f32 decoupled reader
6 years ago
mozga-intel 13ec2d331b Enable momentum operator for a ngraph engine (#15673)
6 years ago
xuezhong eb7bc3e7ea remove non-ascii charactor
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
xuezhong d328660304 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
xuezhong f2262d7336 update comment
6 years ago
Tao Luo 6402424f7a
Merge pull request #15773 from chengduoZH/fix_shape_api_doc
6 years ago
xuezhong d12252e6a6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
xuezhong c5360a3f6b refine code
6 years ago
tensor-tang 5aea2cd2e0
Merge pull request #15652 from tensor-tang/refine/pyramiddnn
6 years ago
mozga-intel df23a6f894 Enable cross_entropy operator for a ngraph engine (#15674)
6 years ago
Yiqun Liu 56a5039e24
Correct the doc in Python API (#15725)
6 years ago
baojun 72061b0ac0 Add ngraph op coverage (#15721)
6 years ago
chengduozh d79d2f686c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_api_doc
6 years ago
xuezhong 4424021623 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
nhzlx 2070fb246d 4. do the trt_engine optim during init.
6 years ago
Yihua Xu 685a20ef56 Add JIT CRF_decoding and Layer_norm unit-test (#15699)
6 years ago
tensor-tang 75fc792d40 fix when table width larger than 64
6 years ago
tensor-tang 40402d5e68 add emb seqpool jitcode
6 years ago
tensor-tang 2ccbcb157d Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
6 years ago
chengduozh 3ce12b1b8e fix shape api doc
6 years ago
Dun 5e6834d891
inplace group_norm (#15754)
6 years ago
Hongyu Liu 8c0292dead
Merge pull request #15717 from phlrain/fix_leak
6 years ago
Tao Luo 4da291c6a3
Merge pull request #15726 from qingqing01/fix_api_doc
6 years ago
nhzlx ecc12fb430 3. when runing in trt mode, do not allocate memory for parameters in fluid.
6 years ago
Dun e4b9fcdbd2
More restrict check load_combine_op. (#15479)
6 years ago
qingqing01 48a5cccbcd
Fix debug mode in prior_box_op (#15702)
6 years ago
Dang Qingqing 2868232556 Fix row_conv doc
6 years ago
tensor-tang a3a3d3d861 add embseqpool jitkernel mkl impl and use it
6 years ago
tensor-tang 15da2f9a0d add embseqpool jitkernel refer code, test and benchmark
6 years ago
tensor-tang c2ccf14590 Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
6 years ago
qingqing01 abcefe7211
Fix debug mode in fake_quantize_op (#15693)
6 years ago
liuhongyu 029be5fda9 fix lstmp bug; test=develop
6 years ago
nhzlx 9cc6249cd6 2. TRTEngine using stream only when execute.
6 years ago
liuhongyu 393fa6021e set lstm lstmp unsed pointer to nullptr; test=develop
6 years ago
liuhongyu 869f00ffc6 set lstm lstmp unsed pointer to null
6 years ago
nhzlx 034ba1c291 add static model load for trt
6 years ago
jerrywgz 6f11f35abe
Merge pull request #15703 from jerrywgz/enhance_expand_op
6 years ago
Tao Luo 3086502522
Merge pull request #15704 from Sand3r-/mgallus/old-fc-mkldnn-branch-fix-develop
6 years ago
baojun c47e258ea4 Add ngraph sum, sigmoid, relu_grad and tanh_grad op (#15642)
6 years ago
tensor-tang 33d0cebbff
Merge pull request #15695 from tensor-tang/fix/name
6 years ago
Michal Gallus 7a8eff36a6 Fix old FC backward weights descriptor creation
6 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
6 years ago
dzhwinter f9ac88e1a0
Merge pull request #15694 from liuwei1031/fix_security_issue
6 years ago
jerrywgz 8fc0fc314a support multiple var types for expand op, test=develop
6 years ago
tensor-tang fb2a7b2300
fix aligned-new error in jitkernel (#15626)
6 years ago
乔龙飞 Qiao Longfei 08ad72d0b9
Merge pull request #15679 from jacquesqiao/update-lookup_table_grad-padding-index
6 years ago
Tao Luo d9270e34d1
Merge pull request #15691 from luotao1/activation_doc
6 years ago
tensor-tang 15d7220f94 fix jitcode name
6 years ago
tensor-tang 31fd8ce1e1
Merge pull request #15375 from mozga-intel/mozga-intel/batch_norm_ngraph_operator
6 years ago
liuwei1031 b1f97a6fa9 fix security issue 27, 38 test=develop
6 years ago
Tao Luo 882e7ec480 fix generate doc error in activation ops
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
Qiao Longfei 76c1378a70 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-lookup_table_grad-padding-index
6 years ago
Qiao Longfei 29a4b21bc8 fix problem test=develop
6 years ago
Qiao Longfei 7b673bce6a lookup_table_grad kernel should consider padding_idx test=develop
6 years ago
jerrywgz 5ce48220f1 change default option related to softmax, test=develop
6 years ago
xuezhong 9b24ac34dd remove debug print
6 years ago
xuezhong 50b48400bb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
dzhwinter b80bcbb4fd
Merge pull request #15660 from dzhwinter/enhance/memory
6 years ago
mozga-intel 1198ccae6b Enable batch_norm operator for a ngraph engine
6 years ago
xuezhong 58101e6d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
xuezhong 4921c2cd02 add api spec change
6 years ago
baojun f4a0e68481 Fix ngraph compile WITH_DISTRIBUTE=ON (#15636)
6 years ago
xuezhong fb261793b9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
xuezhong fb9a6a2bc6 pass test for lstm op
6 years ago
xuezhong 1abb0d835e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
xuezhong 2ba256df40 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
dzhwinter 104d3b4e68 add details. test=develop
6 years ago
dzhwinter 3932cd6714 Merge remote-tracking branch 'origin/develop' into enhance/memory
6 years ago
Dun Liang ceec13562c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_checkpoint
6 years ago
Dun Liang bc92192747 Fix Pr #15296
6 years ago
dzhwinter 94dd50c33f add details. test=develop
6 years ago
Tao Luo bec68fa0b3
Merge pull request #15637 from jerrywgz/refine_box_coder
6 years ago
乔龙飞 Qiao Longfei 7ddf4e2c55
Merge pull request #15649 from jacquesqiao/fix-ctr-reader-svm
6 years ago
Qiao Longfei fa77186fdc fix ctr_reader_test test=develop
6 years ago
Qiao Longfei 2afe82fe83 fix ctr reader read svm data
6 years ago
Kaipeng Deng 9df7bc2c5a fix exlusive pool doc. test=develop (#15632)
6 years ago
wopeizl 3614dadf23
Merge pull request #15631 from wopeizl/windows/fixci
6 years ago
tensor-tang 18bff5298d extract fused_emb_seq_pool forward function
6 years ago
peizhilin 55510744b5 test=develop
6 years ago
peizhilin 061299be87 fix dependency
6 years ago
baojun ac4cde009d Enable accuracy op for ngraph engine (#15592)
6 years ago
kolinwei 6f0f8045f6
Revert "Async double buffered py reader"
6 years ago
jerrywgz ceb412b0ae speed up box coder in CPU, test=develop
6 years ago
Dun db8fcf6b95
Merge pull request #15296 from cjld/async_double_buffered_py_reader
6 years ago
xuezhong dff7461ea8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
Zeng Jinle 1ccbc51218
Merge pull request #15619 from sneaxiy/debug
6 years ago
Xin Pan 74bc55c2a6
Merge pull request #14975 from dzhwinter/ir_inplace_pass
6 years ago
xuezhong 2857dac260 add assert for clip and remove print
6 years ago
xuezhong 4028943125 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
Tao Luo 546eefae6f
Merge pull request #15591 from lidanqing-intel/lidanqing/density_prior_box
6 years ago
sneaxiy f26a1c9077 test=develop
6 years ago
dzhwinter 4f01de6378 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 46a6cac91f
fix batch norm. test=develop (#15597)
6 years ago
dzhwinter 9c9ad7d40b Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
jerrywgz 1743d1a58f
Merge pull request #15356 from jerrywgz/add_clip_op
6 years ago
mozga-intel 43a67a2662 Enable conv2d operator for a ngraph engine (#15269)
6 years ago
tensor-tang a6a1a92ef7
Merge pull request #15586 from tensor-tang/jit/cache
6 years ago
lidanqing-intel 4b3c6612a1 optimize density_prior_box_op.h for cpu
6 years ago
xuezhong 4c98c2ccc3 remove debug print
6 years ago
xuezhong 58ad40cc15 add sample_logits op
6 years ago
tensor-tang ba02ac4692
use mat attr and refine test (#15448)
6 years ago
xuezhong b0c75f1763 remove debug print
6 years ago
xuezhong 880836329d add cell clip and proj clip, fix bug for h0
6 years ago
jerrywgz 4eb44380a6
Merge branch 'develop' into add_clip_op
6 years ago
Xin Pan 30cc8b7a92
Merge pull request #15554 from heavengate/yolo_loss_darknet
6 years ago
mozga-intel 312500dcb5 Enable pool2d operator for a ngraph engine (#15395)
6 years ago
Tao Luo ea92905be4
Merge pull request #15478 from kbinias/kbinias/seperate-folders-for-mkldnn
6 years ago
Yibing Liu 170842cbb4
Some improvements to support bert mixed precision training (#15585)
6 years ago
Yiqun Liu 16d54f7f23
Return parent_idx in beam_search op (#15520)
6 years ago
jerrywgz 72ee3c6232
Merge pull request #15398 from jerrywgz/add_axis_for_boxcoder
6 years ago
jerrywgz e402c0ec7d test=develop
6 years ago
Kaipeng Deng d3eeb92bba
Merge pull request #15491 from tink2123/new_align_corners
6 years ago
jerrywgz 3046799ecd
Merge branch 'develop' into add_clip_op
6 years ago
dzhwinter 1a44b2fbe8 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
Jiabin Yang 2d0ffdc485 test=develop, fix debug mode unitest, hsigmoid (#15574)
6 years ago
tensor-tang 2b0811c3fb refine vadd jitkernel choice
6 years ago
tensor-tang a18c0d4242 cache fc kernel
6 years ago
tensor-tang 6e1ee7fb57 cache softmax kernel func
6 years ago
Krzysztof Binias 69b7c595d6 Small fix
6 years ago
Krzysztof Binias b1bdcd4de8 Make separate folders for mkldnn codes
6 years ago
dzhwinter 06f2448848 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dengkaipeng 23d34d1f7e move yolov3_loss to detection. test=develop
6 years ago
tensor-tang c7449227e8
Merge pull request #15563 from tensor-tang/jit/softmax
6 years ago
dengkaipeng 733bb82ec0 downsample -> downsample_ratio. test=develop
6 years ago
dengkaipeng ae0b0d5f93 fix doc. test=develop
6 years ago
dengkaipeng 56e21c558e add comments and docs. test=develop
6 years ago
dengkaipeng 577424e5ec use darknet loss and trick
6 years ago
dengkaipeng 042fecefab use L2Loss. test=develop
6 years ago
dengkaipeng af124dcdf6 fix API error
6 years ago
dengkaipeng c945ffa7f8 fix label_smooth and mixup score
6 years ago
tink2123 2b89f59055 add attr use_label_smooth test=develop
6 years ago
dengkaipeng 8218e30176 add gtscore. test=develop
6 years ago
dengkaipeng 3c08f620c2 add label smooth. test=develop
6 years ago
dengkaipeng cc01db6029 calc valid gt before loss calc. test=develop
6 years ago
dengkaipeng 32d533c2cd cache obj_mask and gt_match_mask. test=develop
6 years ago
dengkaipeng 6c5a5d0789 format code. test=develop
6 years ago
dengkaipeng e7e4f084e5 ignore pred overlap gt > 0.7. test=develop
6 years ago
dengkaipeng db8ff57a61 remove useless code and update doc. test=develop
6 years ago
dengkaipeng 577a92d992 use typename DeviceContext. test=develop
6 years ago
dengkaipeng 0c4acc8305 imporve yolo loss implement. test=develop
6 years ago
dengkaipeng 2fbfef2ec9 fix no box expression. test=develop
6 years ago
dengkaipeng c0fa8d2eec use L1Loss for w, h. test=develop
6 years ago
dengkaipeng 3841983aa0 fix division error in mean process. test=develop
6 years ago
dengkaipeng 192d293854 use stable Sigmoid Cross Entropy implement. test=develop
6 years ago
tink2123 909f864a9b remove unnecessary flags
6 years ago
tink2123 6961a94e94 avoid out_size less than 1
6 years ago
jerrywgz 7bc8481c62
Merge pull request #15418 from jerrywgz/refine_nms
6 years ago
tensor-tang d59f733551 refine softmax and use with cache
6 years ago
tensor-tang 7383eefd2d add softmax mix and mkl code
6 years ago
tensor-tang 50945685f2 add hmax, hsum jitcode
6 years ago
tensor-tang 8117725852 add jit kernel hsum, hmax and softmax refer code
6 years ago
Zeng Jinle bf7dedcbc7
Merge pull request #15545 from sneaxiy/fix_debug_nccl_error
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
jerrywgz cee2e1b089 refine code, test=develop
6 years ago
sneaxiy ba4f43fd62 fix compile error in distributed mode
6 years ago
tink2123 a0c63f1106 add align_flag
6 years ago
Tao Luo b919190232
Merge pull request #15531 from jczaja/prv-googlenet-fix
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Kaipeng Deng aeca5c50b2 fix grid_sampler PADDLE_ENFORCE error. test=develop (#15542)
6 years ago
乔龙飞 Qiao Longfei 5f89ce7fcd
Merge pull request #15536 from jacquesqiao/fix-prefetch-one-parameter
6 years ago
Jacek Czaja 5885c5cdf6 - Added explanation to LRN MKL-DNN op on alpha modification
6 years ago
Jacek Czaja 4aa7ef3c13 - Compensation fix to LRN MKL-DNN op
6 years ago
Qiao Longfei 806658d72b add space after colon in commnet test=develop
6 years ago
nhzlx b43ea40c51 delete the usage of the const_cast
6 years ago
baojun-nervana 8e9308a51a mv ngraph_bridge to ngraph directory test=develop
6 years ago
Qiao Longfei 4d13434443 fix a little problem test=develop
6 years ago
Qiao Longfei 9c3910f390 IncreaseBatchBarrier should be in the right condition test=develop
6 years ago
ruri 88bd7e1a61
Merge pull request #15027 from shippingwang/shufflechannel
6 years ago
Jacek Czaja fa286b1052 LRN reengineering
6 years ago
nhzlx 92cf4a4c6b fix comments
6 years ago
tensor-tang e043ea9653
Merge pull request #15515 from tensor-tang/jit/benchmark
6 years ago
Qiao Longfei 5a0c6593d5 revert RequestGetHandler
6 years ago
jerrywgz 466a10dcdd refine code, test=develop
6 years ago
乔龙飞 Qiao Longfei c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
6 years ago
jerrywgz a39240c3b6 add attr variance for box coder, test=develop
6 years ago
gongweibao d54494ba87
cleanup test=develop (#15347)
6 years ago
Qiao Longfei 84220765a7 refine code, add more log
6 years ago
Qiao Longfei c750be6d9d add some log
6 years ago
gongweibao fe8f28c957
Add GetVariableNoBarrier on brpc. (#15488)
6 years ago
tangwei12 981fc2bdba
fix bug in merge_ids (#15503)
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
chengduo f8f91fb4b3
Revert conv transpose cudnn (#15514)
6 years ago
tensor-tang b67584a6e9 jit benchmark use tensor
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
tink2123 78145c7dff modified some comments
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
chengduo bf91d11ed5
Clean elementwise_op_function (#15502)
6 years ago
tangwei12 5cfc40dea8
nce add check sample lables, test=develop (#15463)
6 years ago
tink2123 e448bdb298 modified some comments
6 years ago
tink2123 88744e4ab8 fixed some errors
6 years ago
jerrywgz 9eb2d7b3e1 refine code, test=develop
6 years ago
jerrywgz 6dfd789bfc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_nms
6 years ago
jerrywgz 6928f8318f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_axis_for_boxcoder
6 years ago
jerrywgz e60c8438fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_clip_op
6 years ago
tink2123 48cc484643 add align_corners and align_mode for image_resize
6 years ago
jerrywgz 11f1baa406 refine code, test=develop
6 years ago
Zhaolong Xing b7b68f2a8c
Merge pull request #15461 from NHZlX/fix_trt_stream_bug
6 years ago
tangwei12 8b50ad80ff
checkpoint at distributed training (#14854)
6 years ago
jerrywgz 57e5f61ec8 add gpu kernel, test=develop
6 years ago
jerrywgz cc53453057 add comment and refine code, test=develop
6 years ago
qingqing01 07dc5a1506
Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371)
6 years ago
Yiqun Liu eaad3e4c3d
Add check of input in sequence_expand op. (#15466)
6 years ago
gongweibao f4dec5cdee
Check collective server's data. (#15449)
6 years ago
jerrywgz c12a969bd4 refine comment and unittest, test=develop
6 years ago
chengduo 5a8bd82c0c
Remove workspace_handle (#15376)
6 years ago
jerrywgz 1c558ad388 add gpu kernel for box clip, test=develop
6 years ago
nhzlx 5b92ddabe2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug
6 years ago
nhzlx 2f4aee361a fix comments
6 years ago
nhzlx ec213730bc fix trt stream bug.
6 years ago
wopeizl a8aa79130b
Merge pull request #15453 from wopeizl/fix15313
6 years ago
gongweibao 7f8b40f68d
Fix brpc complation error. (#15451)
6 years ago
jerrywgz 0d4b60ab8b add lod for slice op, test=develop
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
peizhilin e6a3a3a31a fix pr 15313
6 years ago
jerrywgz 66bb5dd760 refine infer shape, test=develop
6 years ago
tensor-tang 266e625d2e
Merge pull request #15399 from tensor-tang/refine/seqpool/fc
6 years ago
Qiao Longfei 45578c1b48 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
jerrywgz 0d91507859 fix share lod, test=develop
6 years ago
Tao Luo 6597ccb01f
Merge pull request #15413 from luotao1/legacy_code
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
jerrywgz 5246285e34 test=develop
6 years ago
jerrywgz b10d84bc5a fix bug when run on GPU, test=develop
6 years ago
whs 530869f829
Share LoD from Input(Rois). (#15420)
6 years ago
gongweibao 7ab4af2716
Fix brpc compilation. (#15417)
6 years ago
Dun Liang e5004f3c1c fix ci && test=develop
6 years ago
tensor-tang 316e44b1b7 fix unused warnings
6 years ago
Wu Yi 7e651a38dd
fix mac cmake version 3.13 build (#15386)
6 years ago
jerrywgz b62a17bbae add nms api
6 years ago
tensor-tang 579d758254 fix jitkernel tests and refine benchmark
6 years ago
jerrywgz f660553d77 enhance nms for mask rcnn, test=develop
6 years ago
shippingwang 14f2a1060d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
jerrywgz 88ee56d0b2 enhance nms for mask rcnn
6 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
6 years ago
Tao Luo 3ede8b67e6 update CMakeLists.txt
6 years ago
Yiqun Liu f413b6892b
Revert the modification of while_op in #14764. (#15372)
6 years ago
jerrywgz ab9d6a4f39 add comments, test=develop
6 years ago
jerrywgz 10dd3b37ad add axis for box coder op
6 years ago
乔龙飞 Qiao Longfei adba4384ec
Merge pull request #15161 from jacquesqiao/gru-add-mode
6 years ago
nhzlx 8817841c73 fix unit test bug
6 years ago
jerrywgz 5fb2856584 test_develop
6 years ago
Xin Pan 3ecf6bb338
Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
jerrywgz af448373c7 test=develop
6 years ago
nhzlx b938324381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
6 years ago
wopeizl 994e73f685
Merge pull request #15351 from wopeizl/fixbuildissue
6 years ago
jerrywgz 481d8bce2f add box clip op
6 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
6 years ago
tensor-tang 3759c1db8c
Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph
6 years ago
tensor-tang 904a39239d
Merge pull request #15254 from mozga-intel/mozga-intel/softmax_operator_ngraph
6 years ago
peizhilin cd562f8fb7 disable the parallel mode for adam op on windows test=develop
6 years ago
Xin Pan 16cb3ebd68
Merge pull request #15268 from xiaolil1/pool-int8
6 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
6 years ago
mozga-intel cba729404d Enable softmax operator for a ngraph engine
6 years ago
Qiao Longfei cd31b90a46 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Qiao Longfei 8c516a24e5 remote min_row_size_to_use_multithread in adam interface test=develop
6 years ago
Qiao Longfei 9b4fe283e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Qiyang Min 3f687765e6
Merge pull request #15281 from velconia/fix_expand_op_compile_time
6 years ago
minqiyang c4cf5967db Change backward op infershape
6 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
6 years ago
chengduo 46d01d798e
Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
6 years ago
Qiao Longfei 4d15515c40 fix gru_gpu_kernel test=develop
6 years ago
tensor-tang 93e75c5ae5 refine jitcode of vsub and vsquare
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
6 years ago
Qiao Longfei 4feae25378 fix build problem test=develop
6 years ago