Commit Graph

1789 Commits (3d06ccfb23f45994253cd229ddeab0e7b36e0a15)

Author SHA1 Message Date
Tao Luo d04ef276a5
Merge pull request #12745 from tensor-tang/refine/op/elewise_mul
7 years ago
tangwei12 cbc6e6eb97
Merge pull request #12247 from seiriosPlus/dis_ckpt_fix
7 years ago
Qingsheng Li 3d11d018e0
Fix scatter_op python API (#12742)
7 years ago
Tao Luo 8f9f414a14
Merge pull request #12805 from tensor-tang/fix/op/elewise_add
7 years ago
tensor-tang e955361267
Merge pull request #12737 from tensor-tang/feature/op/fusion_lstm
7 years ago
tensor-tang 82bb9170fb Merge remote-tracking branch 'ups/develop' into fix/op/elewise_add
7 years ago
Chen Weihang 57b34d9196
Merge pull request #12808 from chenwhql/remove_inplace_param_in_squeeze_and_unsqueeze
7 years ago
Yihua Xu 084d4a9e9e Optimize CRF Decoding with AVX/AVX2/AVX512F instruction (#12767)
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
qingqing01 c62f68cb94
Fix bug in conditional_block_op. (#12246)
7 years ago
chenweihang bc471b6ac4 refactor: remove inplace parameter from squeeze and unsqueeze op
7 years ago
tensor-tang 0507f7bc3c fix SEGV elementwise add at debug mode
7 years ago
tangwei12 ca1e18c04a
Merge pull request #12469 from seiriosPlus/sum_op_dim_fix
7 years ago
Zhaolong Xing e5674f6dde
Merge pull request #12753 from NHZlX/add_benchmark
7 years ago
tensor-tang b090479409 Merge remote-tracking branch 'ups/develop' into feature/op/fusion_lstm
7 years ago
tangwei12 b4f52b01d0 bug fix when all inputs are empty
7 years ago
tangwei12 3efac174ea Merge branch 'develop' of github.com:PaddlePaddle/Paddle into sum_op_dim_fix
7 years ago
tangwei12 dbb4f0d35d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dis_ckpt_fix
7 years ago
Qiao Longfei fd10669ecb
Add dependency to send recv (#12760)
7 years ago
tangwei12 7c12c0f865 add sync in load selectedrows
7 years ago
Michal Gallus 4a7f0698e0 Add consts to new MKLDNN integration
7 years ago
Michal Gallus 6588d0e039 Update MKLDNN to 0.15, fix conv integration
7 years ago
tangwei12 9f11db4080 add todo in impl
7 years ago
tangwei12 c24a9263ba Merge branch 'develop' of github.com:PaddlePaddle/Paddle into sum_op_dim_fix
7 years ago
tangwei12 ac9ae97001 code fix
7 years ago
nhzlx f55e8901c8 merge develop
7 years ago
nhzlx 1600ba86f6 1. change tensorrt op from cpu to gpu
7 years ago
tangwei12 bb9f494740 merge develop
7 years ago
dzhwinter 4069262f0e
Revert ""cherry picked operators changes" (#12184)" (#12747)
7 years ago
Qiao Longfei 653fad08f8
Optimize selected rows for dist lookup table with pthread rwlock (#12635)
7 years ago
tensor-tang 92890ac258 Merge remote-tracking branch 'ups/develop' into feature/op/fusion_lstm
7 years ago
tangwei12 0749c8822d
Merge pull request #12556 from seiriosPlus/samplingIdOp
7 years ago
tensor-tang a56142c155 optimize elementwise_mul cpu forward
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang ff92b6ba81
Merge pull request #12531 from tensor-tang/refine/op/gru
7 years ago
tangwei12 26b228e405 remove assignment and add vlog
7 years ago
tangwei12 125e9166e1 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into sum_op_dim_fix
7 years ago
tensor-tang a72f68f223 Merge remote-tracking branch 'ups/develop' into feature/op/fusion_lstm
7 years ago
tensor-tang df28a3b452 fix lod and op test
7 years ago
Qingsheng Li 317e18abd2
Remove Data Sharing between input and output in scatter_op (#12672)
7 years ago
tensor-tang f3cd2612ae refine fc and use the fc compute in fusion_lstm
7 years ago
tangwei12 822496f626 merge cpu and gpu
7 years ago
dzhwinter bf3c34960f
"cherry picked operators changes" (#12184)
7 years ago
tensor-tang 40138c4cd6 add unit test of fusion lstm op
7 years ago
jerrywgz c108376506 Add three modes for prelu_op (#12630)
7 years ago
tangwei12 9f09d68678 add enforce
7 years ago
gongweibao d06849305a
parameter dispather. (#12666)
7 years ago
tensor-tang 852bc6f4aa refine fusion lstm op doc
7 years ago
tensor-tang 8f9132959e fuse fc in lstm
7 years ago
tensor-tang ddb05dffb6 init fusion lstm op
7 years ago
tensor-tang efc5392d97
Merge pull request #12676 from tensor-tang/refine/op/fc
7 years ago
tangwei12 470fb7c5c3 bug fix
7 years ago
tangwei12 60dda7bf9f add gpu Implementation
7 years ago
tangwei12 4661f5589d random optimize
7 years ago
Bai Yifan 9333a62792
Add flatten op interface and enhance APIs about detection to support variable-length image. (#12422)
7 years ago
tensor-tang eee38464dc refine fc op use cpu only
7 years ago
tangwei12 ed937bc6f8 merge
7 years ago
tensor-tang d84a1a0010 fc op use cpu only
7 years ago
tangwei12 478f73c188 merge header in cc
7 years ago
fengjiayi d6b5302bd6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
tensor-tang c588c64a76 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
tensor-tang 0098a494a2 Merge remote-tracking branch 'ups/develop' into refine/op/fc
7 years ago
fengjiayi 5e7aa8c7e5 code clean
7 years ago
tensor-tang 742300baa8 fix unkown omp pragmas
7 years ago
tensor-tang b9dbb7c5cb fix bias attri in mkldnn fc
7 years ago
tangwei12 59580a7f69 bug fix
7 years ago
tensor-tang 4b5986bb77 enable fc op in normal case
7 years ago
tensor-tang e133df6037 enable native fc forward
7 years ago
tensor-tang 6a2a9a8350
Revert "Refine elementwise_add op"
7 years ago
Yu Yang 8dda526a45
Merge pull request #12659 from sneaxiy/refine_softmax_with_cross_entropy
7 years ago
sneaxiy f6f5cdaa05
Merge pull request #12555 from sneaxiy/refine_layer_norm
7 years ago
sneaxiy c50c537732 fix arithmetic error in backward kernel
7 years ago
tensor-tang 038cbf799d add bias for fc op
7 years ago
whs 9d6243b6fb Fix crop op. (#12603)
7 years ago
Bai Yifan 649f5d74f0
fix mine_hard_example bug (#12664)
7 years ago
sneaxiy 2d9508f8f3
Merge pull request #12554 from sneaxiy/refine_elementwise_add
7 years ago
tensor-tang 171a0e2b42 add some comment
7 years ago
sneaxiy 2c560623d1 fix dependency error
7 years ago
tensor-tang 5377edd282 refine packed condition
7 years ago
tensor-tang 3bf3e77ac8 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
qiaolongfei c0890988da add RPCServerProfiler, replace listen and serv optimizer
7 years ago
tangwei12 64a4925cb4 Merge branch 'Pdv' into samplingIdOp
7 years ago
tangwei12 0bfd62be3d remove gpu supported, will add it later
7 years ago
Tao Luo 5a9ae411e0
Merge pull request #12618 from sfraczek/sfraczek/fix-new-mkldnn-conv-tests
7 years ago
sneaxiy cf799a6a04
Merge pull request #12553 from sneaxiy/refine_softmax_with_cross_entropy
7 years ago
dzhwinter 8499559c42
"fix style" (#12600)
7 years ago
sneaxiy 010883689c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_layer_norm
7 years ago
sneaxiy 5d698589ce Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_elementwise_add
7 years ago
sneaxiy 19ff254d05 Merge branch 'refine_elementwise_add' of https://github.com/sneaxiy/Paddle into refine_elementwise_add
7 years ago
Sylwester Fraczek d74bb6ab9c fix ut for mkldnn 0.15 - added forcing layout NCHW in mkldnn conv tests
7 years ago
fengjiayi 855c9e3311 clean softmax_op code
7 years ago
fengjiayi 24d51de022 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
fengjiayi 27df3a9f2b make cross_entropy_op supporting tensors
7 years ago
fengjiayi 66be53264e
Merge pull request #12592 from JiayiFeng/fix_mac_compile_error
7 years ago
fengjiayi 8e604a10aa fix mac compile error
7 years ago
nhzlx 551c802cdc merge develop
7 years ago
sneaxiy ad45d39222 refine layer_norm
7 years ago
chengduo 7c8b69c700
Feature/op fusion (#12240)
7 years ago
sneaxiy 1b4515f6db refine softmax_with_cross_entropy
7 years ago
nhzlx 3a0caf801f modify trt engine op test
7 years ago