Wu Yi
29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% ( #14661 )
...
* wip multi process multi gpu dist training
* workable for p2p
* update test=develop
* change back env name test=develop
* fix alloc init
* fix cpu build test=devlop
* fix mac tests test=develop
* refine code
* refine test=develop
6 years ago
sneaxiy
a9093c2775
fix conflict
...
test=develop
6 years ago
liuhongyu
e80402fd0e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
6 years ago
liuhongyu
968dd3c078
add cudnn 5 support; test=develop
6 years ago
sneaxiy
e694d0c2e4
fix while_op eager deletion bug
...
add unittest
test=develop
6 years ago
Xin Pan
461ca35be1
Merge pull request #14590 from panyx0718/fix4
...
enable API check for readers
6 years ago
sneaxiy
2d0d037d8e
fix while_op eager deletion bug
...
add unittest
test=develop
6 years ago
gongweibao
50a698525d
Fix log level ( #14692 )
6 years ago
Wu Yi
6e67d0fb78
layer fixes ( #14591 )
...
* layer fixes test=develop
* follow update test=develop
6 years ago
JiabinYang
7cbcb2b8d2
Merge branch 'optimize_hs_op' of https://github.com/JiabinYang/Paddle into optimize_hs_op
...
, test=develop
6 years ago
JiabinYang
8c75705984
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
...
, test=develop
6 years ago
Xin Pan
dc458b1482
Merge pull request #14713 from panyx0718/api
...
add more files to protected file list
6 years ago
JiabinYang
d144f630ff
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
6 years ago
JiabinYang
b387a19410
optimize op with blas
6 years ago
Tink_Y
6d04a9cf47
fix api format and example ( #14686 )
...
* fix api format and examples
test=develop
* Update executor.py
test=develop
* Update nn.py
* Update nn.py
test=develop
* Update nn.py
test=develop
6 years ago
Zeng Jinle
ff4237309a
Merge pull request #14720 from sneaxiy/fix_seq_mask_op_infershape
...
Fix sequence_mask_op InferShape
6 years ago
ZongwuYang
1560eb4a6d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my-cool-stuff
6 years ago
ZongwuYang
deb04809bd
test=develop
...
Fix the bug that profiler cannot trace the nccl allreduce operator
6 years ago
Xin Pan
da4e0bf1a1
add 2 more files
...
test=develop
6 years ago
Xin Pan
7c5289f68e
Merge pull request #14719 from PaddlePaddle/revert-14666-feature/estiminate_flops
...
Revert "Add EstiminateFlops"
6 years ago
lujun
9da5954a21
fix mac ci test step, test=develop
6 years ago
Kaipeng Deng
934f13a70a
Merge pull request #14371 from heavengate/yolo_loss
...
Add YOLOv3 loss operator for YOLOv3 model
6 years ago
sneaxiy
35a2578426
fix bug
...
test=develop
6 years ago
sneaxiy
65867d8989
test=develop
6 years ago
Jiabin Yang
6dcc6378b7
Merge pull request #14665 from JiabinYang/ci/add_import_check
...
add mac ci check on import
6 years ago
zhang wenhui
abbe382e1e
Revert "Add EstiminateFlops"
6 years ago
Xin Pan
0591ba96ec
fix hack
...
test=develop
6 years ago
Yihua Xu
ac803fed18
Fix the compile issue for cuda device (test=develop)
6 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
6 years ago
sneaxiy
c47c451a00
fix bug
6 years ago
Tao Luo
3437e17713
Merge branch 'has_attr' of https://github.com/luotao1/Paddle into luotao1-has_attr
6 years ago
nhzlx
e7abe6b654
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
...
test=develop
6 years ago
nhzlx
f75815b78c
add prelu gpu inference
6 years ago
Xin Pan
bcf36d8401
add more files to protected file list
...
test=develop
6 years ago
Xin Pan
7464bd29e8
polish
...
test=develop
6 years ago
Tao Luo
0e3048db43
Merge pull request #14659 from luotao1/update_pass
...
update is_test_pass and mkldnn_placement_pass
6 years ago
Xin Pan
7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
...
Implementing ngraph engine
6 years ago
Yihua Xu
82eefceabe
Add the profile_mkldnn flag for profile function(test=develop)
6 years ago
Xin Pan
35e6b5e16a
polish
...
test=develop
6 years ago
Yihua Xu
ea00270fe8
Remove the dims checking when the dim is 3 (test=develop)
6 years ago
Xin Pan
b80fe8264a
polish
...
test=develop
6 years ago
Yihua Xu
64e261c6cd
Implement the fusion of convolution and bias for mkldnn
...
(test=develop)
6 years ago
Tao Luo
8d6984eb9b
change OpHasAttr to RuntimeHasAttr, add some comments
...
test=develop
6 years ago
jerrywgz
96dc3d8326
Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
...
add ignore index for sigmoid cross entropy with logits op, test=develop
6 years ago
Tao Luo
a6ac42669c
Merge branch 'develop' into update_pass
6 years ago
Yihua Xu
669191c9cc
Implement conv3d with mkldnn library (test=develop)
6 years ago
Hongyu Liu
4f71a6ee2c
Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
...
Add cudnn lstm
6 years ago
Yibing Liu
c7382df80f
Print assert failure id in lookup_table_op ( #14698 )
6 years ago
Yu Yang
566a325939
Merge pull request #14676 from reyoung/feature/stable_decorator_test
...
Stablize decorator test
6 years ago
Yu Yang
0f0e197914
Merge pull request #14666 from reyoung/feature/estiminate_flops
...
Add EstiminateFlops
6 years ago