JiabinYang
510601b279
test=develop
6 years ago
JiabinYang
982e489220
test=develop
6 years ago
Tao Luo
1d9b2a453c
Merge pull request #14508 from luotao1/warm_up_multi_thread
...
add warm up in TestMultiThreadPrediction
6 years ago
Yu Yang
a685f305f8
Merge pull request #14479 from reyoung/feature/fix_macos_ut
...
fix(Mac): fix unittest of macos
6 years ago
tensor-tang
10fb4ceefc
Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul
...
[MKLDNN][JIT][AVX512] Elementwise Mul
6 years ago
Zhaolong Xing
05280674e6
Merge pull request #14500 from NHZlX/refine_trt
...
Fix gpu load model and demo_ci on trt
6 years ago
nhzlx
a4dc1d4292
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
...
test=develop
6 years ago
nhzlx
faeb9b8aa9
fix compile rely problem
6 years ago
chengduo
a8d3aaae2a
print output log warning ( #14497 )
...
test=develop
6 years ago
Tao Luo
eb9b9becdc
add warm up in TestMultiThreadPrediction
...
test=develop
6 years ago
Tao Luo
7e8c4597e2
Merge pull request #14490 from wojtuss/wojtuss/fix-allocator-header-copy
...
add allocator.h copy
6 years ago
Tao Luo
5cc7946313
Merge pull request #14499 from luotao1/disable_openblas_test
...
disable two openblas test temporary
6 years ago
Houjiang Chen
10ae3ba486
Merge pull request #14493 from hjchen2/develop
...
Implement leaky relu converter from fluid to tensorRT
6 years ago
nhzlx
2a84054372
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
...
test=develop
6 years ago
nhzlx
b742d46520
fix demo ci bug on trt
6 years ago
Houjiang Chen
33c65517fd
Update CMakeLists.txt test=develop
6 years ago
Kaipeng Deng
f17b05d4a1
Merge pull request #14492 from heavengate/resize_nearest_example
...
Add python example for resize_nearest Api
6 years ago
Tao Luo
1d3e9bde1e
Merge pull request #14488 from yihuaxu/develop_7a64d48f5_stack_opt
...
Optimize the stack operator
6 years ago
Houjiang Chen
01bda73116
Update CMakeLists.txt
6 years ago
Tao Luo
09ee266f8e
disable two openblas test temporary
...
test=develop
6 years ago
hjchen2
2c2a192eb1
Resolve merge conflicts
...
test=develop
6 years ago
Yiqun Liu
8bc1c5d2ab
Implement the Tensorrt plugin for elementwise op ( #14487 )
...
* Initialize the elementwise plugin.
* Implement the basic CUDA kernel of elementwise plugin.
test=develop
6 years ago
tensor-tang
7aa3aff338
Merge pull request #14465 from tensor-tang/fea/jit/exp
...
jitcode act support all size
6 years ago
Tao Luo
1b894e495f
Merge pull request #14437 from jczaja/prv-softmax-mkl
...
Introducing MKL to softmax for inference
6 years ago
dengkaipeng
bb2b35c85e
Add python example for resize_nearest. test=develop
6 years ago
chengduo
a94a7355f0
Refine the GraphNum check ( #14144 )
...
* refine GraphCheck
test=develop
* fix ci fail
test=develop
6 years ago
Yihua Xu
a906a361be
Add the macro for NVCC (test=develop)
6 years ago
Yihua Xu
d91740acb1
Revert "Remove the remnant code (test=develop)"
...
This reverts commit be50670348
.
6 years ago
Yihua Xu
be50670348
Remove the remnant code (test=develop)
6 years ago
hjchen2
1622cb9937
Fix alpha tensor key
6 years ago
tensor-tang
48be9dc3e1
Merge pull request #14489 from tensor-tang/api/example
...
add api example of brelu, leaky_relu and soft_relu
6 years ago
hjchen2
a8c077df7c
Implement leaky relu tensorRT converter
6 years ago
qingqing01
9eefd2c766
Modify some infer-shape about detection operators in compile-time. ( #14483 )
...
* Modify some infer-shape in compile-time.
6 years ago
Tao Luo
cf685f361b
Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections
...
[WIP] Correcting and extending MKLDNN residual connection fuse pass
6 years ago
tensor-tang
e3645c2708
add api example of brelu, leaky_relu and soft_relu
...
test=develop
6 years ago
Yihua Xu
f4c869d872
Optimize the layer_norm operator with AVX intrinsic function ( #14417 )
...
* Optimize layer_norm operator with AVX intrinsic functions
* Revert the wrong modifications
* Implement the jit kernel for layer_norm operator
* Add math headfile to fix the compile issue (test=develop)
* Add math headfile to fix the compile issue (test=develop)
* Fixed the intrinsic headfile issue (test=develop)
* Fix the conflicts (test=develop)
* Revert for CUDA compiler (test=develop)
* Fixed the cuda depency (test=develop)
* Fix the marco issues (test=develop)
6 years ago
Houjiang Chen
816b464037
Merge pull request #14486 from hjchen2/develop
...
Fix tensorrt plugin cmake dependency, test=develop
6 years ago
Yu Yang
f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
...
Rewrite allocation
6 years ago
Yihua Xu
f418f552df
Merge branch 'develop' into develop_7a64d48f5_stack_opt (test=develop)
6 years ago
hjchen2
2825685f2a
Fix tensorrt plugin cmake dependency, test=develop
6 years ago
Superjomn
e878a8e885
update
...
test=develop
6 years ago
qingqing01
fd7e643153
Convolution fusion operator. ( #14449 )
...
* Convolution fusion operator.
* Clean code
test=develop
6 years ago
Yu Yang
98bbfc17be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
Yu Yang
38143e5aca
Clean unused changes
...
test=develop
6 years ago
Wojciech Uss
d36491c28a
add allocator.h copy
...
The allocator.h header file is required for C-API inference applications
test=develop
6 years ago
Yu Yang
7486b0ddec
fix(Mac): fix unittest of macos
...
test=develop
6 years ago
Yu Yang
d424115f9e
Clean code
...
test=develop
6 years ago
Wu Yi
d7bd0361cb
fix dist deps ( #14471 )
...
* fix dist deps test=develop
* update test=develop
* update test=develop
* update test=develop
* update test=develop
6 years ago
Yu Yang
b12c77dae2
Fix unittests
...
test=develop
6 years ago
Jacek Czaja
9b0eae3023
- Removing partial specialization of sotmax for inference for GPU
...
test=develop
6 years ago