Commit Graph

19906 Commits (ce31deb7e938270249b719bce93ef6d8baf5c0c4)
Author SHA1 Message Date
tensor-tang ce31deb7e9 refine refer code and add lstm refer code
7 years ago
tensor-tang c2cfb03a72 add lstm jitcode
7 years ago
Yiqun Liu 8bc1c5d2ab Implement the Tensorrt plugin for elementwise op (#14487)
7 years ago
tensor-tang 7aa3aff338 Merge pull request #14465 from tensor-tang/fea/jit/exp
7 years ago
Tao Luo 1b894e495f Merge pull request #14437 from jczaja/prv-softmax-mkl
7 years ago
chengduo a94a7355f0 Refine the GraphNum check (#14144)
7 years ago
tensor-tang 48be9dc3e1 Merge pull request #14489 from tensor-tang/api/example
7 years ago
qingqing01 9eefd2c766 Modify some infer-shape about detection operators in compile-time. (#14483)
7 years ago
Tao Luo cf685f361b Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections
7 years ago
tensor-tang e3645c2708 add api example of brelu, leaky_relu and soft_relu
7 years ago
Yihua Xu f4c869d872 Optimize the layer_norm operator with AVX intrinsic function (#14417)
7 years ago
Houjiang Chen 816b464037 Merge pull request #14486 from hjchen2/develop
7 years ago
Yu Yang f1a392a5fe Merge pull request #13804 from sneaxiy/rewrite_allocation
7 years ago
hjchen2 2825685f2a Fix tensorrt plugin cmake dependency, test=develop
7 years ago
qingqing01 fd7e643153 Convolution fusion operator. (#14449)
7 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Yu Yang 38143e5aca Clean unused changes
7 years ago
Yu Yang d424115f9e Clean code
7 years ago
Wu Yi d7bd0361cb fix dist deps (#14471)
7 years ago
Yu Yang b12c77dae2 Fix unittests
7 years ago
Jacek Czaja 9b0eae3023 - Removing partial specialization of sotmax for inference for GPU
7 years ago
Qiao Longfei 05c15a0867 Merge pull request #14467 from jacquesqiao/update-trainer-retry
7 years ago
tensor-tang a19b3225a1 fix jitcode small size
7 years ago
Qiao Longfei fbc529db91 update test=develop
7 years ago
Qiao Longfei 98a0437d70 optimize distribute checkport
7 years ago
Jacek Czaja be80bb4f28 - Fix to GPU
7 years ago
tensor-tang 4dbdfa60ef sigmoid and tanh support all size
7 years ago
tensor-tang ccb8963705 refine exp jitcode with all size
7 years ago
tensor-tang d3eae8f61b refine relu and fix addrelu test
7 years ago
tensor-tang 4e67fe6a12 refine act and vxx with all size
7 years ago
tensor-tang ba3eaed7a7 exp support all size
7 years ago
tensor-tang d239801b90 Merge pull request #14463 from tensor-tang/fix/noavx
7 years ago
tensor-tang 1ffce8c0ae fix build error on noavx
7 years ago
Wu Yi a2d9b34417 Refine operator cmake (#14413)
7 years ago
Tomasz Patejko 53da846d1e MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass
7 years ago
tensor-tang 7f17e561d7 Merge pull request #14423 from tensor-tang/fea/jit/act
7 years ago
Jiabin Yang 28bd5b7bad fix space_to_depth_op unicode problem (#14430)
7 years ago
Qiao Longfei 142b5ed474 Merge pull request #14450 from jacquesqiao/fix-test-label-sentiment
7 years ago
Jacek Czaja 513bb6c151 Squashing MKL based softmax for inference
7 years ago
Tomasz Patejko dbc4fcd722 MKLDNN residual connections fuse pass: unit tests enabled and added
7 years ago
Tomasz Patejko 4224089354 MKLDNN residual connections fuse pass: Maybe removed and boost::optional used where it makes sense
7 years ago
Tomasz Patejko 86fd3b32be MKLDNN residual connections fuse pass: counting statistics added to the pass
7 years ago
Tomasz Patejko ee6f778beb MKLDNN residual connections fuse pass: further refactoring
7 years ago
Tomasz Patejko 7423748e37 MKLDNN residual connections fuse pass:
7 years ago
whs 1722678258 Make nce support more distribution. (#13549)
7 years ago
Qiao Longfei 09bca67395 add check if the model does not save model
7 years ago
tensor-tang 1f00723fa3 exp, sigmoid, tanh jitcode support more size
7 years ago
Qiao Longfei e4d8f47fcb change the target cost of test_label_semantic_roles to speed up test
7 years ago
Yu Yang 19e669a992 Add legacy_allocator
7 years ago
Zhaolong Xing 2f27c048cc Merge pull request #14440 from hjchen2/develop
7 years ago