Tao Luo
546eefae6f
Merge pull request #15591 from lidanqing-intel/lidanqing/density_prior_box
optimize density_prior_box_op.h in detect model for cpu
6 years ago
Yan Chunwei
dc5e25fc7f
remove dot marked node (#15606)
6 years ago
Jiabin Yang
48a054f0fd
Merge pull request #15424 from JiabinYang/fast_install_1.3
test=develop, fast_install shell for linux and mac
6 years ago
JiabinYang
addf58c6b5
test=develop, fix exit issue
6 years ago
guoshengCS
b6c3b69af8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-beam-search-size
test=develop
6 years ago
dzhwinter
46a6cac91f
fix batch norm. test=develop (#15597)
6 years ago
qingqing01
943d972878
Fix analysis predictor when loading the persistable RAW type variable. (#15613)
6 years ago
liuwei1031
6e84eb131f
expose peak gpu memory API to python test=develop (#15529)
* expose peak gpu memory API to python test=develop
* add unittest for peak gpu memory monitoring test=develop
* add pybind change test=develop
* add mutex to gpu mem usage monitor test=develop
* update benchmark flag definition file test=develop
* tweak unittest for memory monitoring test=develop
6 years ago
jerrywgz
1743d1a58f
Merge pull request #15356 from jerrywgz/add_clip_op
Add box clip op
6 years ago
mozga-intel
43a67a2662
Enable conv2d operator for a ngraph engine (#15269)
test=develop
6 years ago
tensor-tang
a6a1a92ef7
Merge pull request #15586 from tensor-tang/jit/cache
refine bert
6 years ago
jerrywgz
4f18a9b87b
test=develop
6 years ago
JiabinYang
2c133430f4
test=develop, fix no_avx exit
6 years ago
guoshengCS
5dfce93101
To make CUDA_LAUNCH_KERNEL_HELPER support large size.
test=develop
6 years ago
Yan Chunwei
e887d71958
fix ir debug config (#15571)
6 years ago
Yan Chunwei
897789b16e
fix save_inferece_model bug (#15365)
6 years ago
lidanqing-intel
4b3c6612a1
optimize density_prior_box_op.h for cpu
test=develop
6 years ago
tensor-tang
ba02ac4692
use mat attr and refine test (#15448)
* use mat attr and refine test
test=develop
* add matmul jitcode
test=develop
* fix mac compile
test=develop
6 years ago
JiabinYang
be36e186f6
test=develop, refine wget issue
6 years ago
JiabinYang
9640736ad7
test=develop, refine wget issue
6 years ago
jerrywgz
4eb44380a6
Merge branch 'develop' into add_clip_op
6 years ago
Xin Pan
30cc8b7a92
Merge pull request #15554 from heavengate/yolo_loss_darknet
Yolo loss darknet
6 years ago
shanyi15
f96f166c8c
test=develop, refine doc
6 years ago
Tao Luo
1a252f4be6
Merge pull request #15587 from luotao1/bert
use embedding=128 bert model for test
6 years ago
shanyi15
b612709fec
test=develop, refine doc for fast_install
6 years ago
JiabinYang
ed7ae471d3
test=develop, fix mac python check error
6 years ago
JiabinYang
a6af361278
Merge branch 'fast_install_1.3' of https://github.com/JiabinYang/Paddle into fast_install_1.3
6 years ago
Haihao Shen
1b8047b712
Add INT8 calibration support in Paddle package (#15569)
* Add INT8 calibration support in Paddle package; test=develop
6 years ago
JiabinYang
ff85734eed
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fast_install_1.3
6 years ago
mozga-intel
312500dcb5
Enable pool2d operator for a ngraph engine (#15395)
* Enable pool2d operator for a ngraph engine
test=develop
* Update
test=develop
6 years ago
Jiabin Yang
b4c24f3f7c
Merge pull request #15575 from JiabinYang/feature/imperative
test=develop, polish code and fix some wrong change
6 years ago
Tao Luo
ea92905be4
Merge pull request #15478 from kbinias/kbinias/seperate-folders-for-mkldnn
Make separate folders for mkldnn codes
6 years ago
Yibing Liu
170842cbb4
Some improvements to support bert mixed precision training (#15585)
* Some improvements to support bert mixed precision training
test=develop
* Revert the cast in layer_norm
test=develop
6 years ago
Yiqun Liu
16d54f7f23
Return parent_idx in beam_search op (#15520)
* Refine beam_search_op to output an extra parent_idx tensor.
test=develop
* Fix the unittest test_beam_search_op.
test=develop
* Fix the merging mistake.
test=develop
6 years ago
jerrywgz
72ee3c6232
Merge pull request #15398 from jerrywgz/add_axis_for_boxcoder
Add axis for boxcoder
6 years ago
jerrywgz
e402c0ec7d
test=develop
6 years ago
Kaipeng Deng
d3eeb92bba
Merge pull request #15491 from tink2123/new_align_corners
add align_corners and align_mode for image_resize
6 years ago
jerrywgz
3046799ecd
Merge branch 'develop' into add_clip_op
6 years ago
Jiabin Yang
2d0ffdc485
test=develop, fix debug mode unitest, hsigmoid (#15574)
6 years ago
Zhaolong Xing
90ffe74954
Merge pull request #15546 from NHZlX/fix_trt_utest_random_failed
fix trt models utest failed.
6 years ago
luotao1
8f0c2b07f2
use embedding=128 bert model for test
test=develop
6 years ago
tensor-tang
2b0811c3fb
refine vadd jitkernel choice
test=develop
6 years ago
tensor-tang
a18c0d4242
cache fc kernel
test=develop
6 years ago
tensor-tang
6e1ee7fb57
cache softmax kernel func
test=develop
6 years ago
Krzysztof Binias
69b7c595d6
Small fix
test=develop
6 years ago
Krzysztof Binias
b1bdcd4de8
Make separate folders for mkldnn codes
test=develop
6 years ago
dengkaipeng
23d34d1f7e
move yolov3_loss to detection. test=develop
6 years ago
JiabinYang
16f64b43d4
test=develop, Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative
6 years ago
tensor-tang
c7449227e8
Merge pull request #15563 from tensor-tang/jit/softmax
refine softmax kernel
6 years ago
dengkaipeng
733bb82ec0
downsample -> downsample_ratio. test=develop
6 years ago