Leo Zhao
bf4b21fa3d
fix assertion failure issue when test_analyzer_bert uses ngraph ( #17148 )
...
resolve #17147
test=develop
6 years ago
tangwei12
deb510d451
cvm op feature ( #17081 )
...
cvm without LoD.
6 years ago
Zeng Jinle
28d69d710a
Refine dropout gpu memory ( #17095 )
...
* refine_dropout_mem,test=develop
* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066 )
# This is the 2nd commit message:
Fleet unify distributed training (#16791 )
* implement distributed transpiler with fleet
# This is the 3rd commit message:
ParallelDyGraph with GPU collective mode (#16827 )
implement dygraph.parallel.DataParallel to hook reduce op.
# This is the 4th commit message:
Init mixed precision training interface (#16856 )
* Init mixed precision training interface
* Add fp16 test script
test=develop
* All initializers support float16
test=develop
* Code cleanup & add more code annotations
test=develop
* Update API spec
test=develop
* Add usage example in doc
test=develop
# This is the 5th commit message:
fix reference_count_pass,test=develop (#17060 )
test=develop
# This is the 6th commit message:
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090 )
* Cache the information of linear interpolation in forward and use it in backward.
test=develop
* Fix cuda kernel.
test=develop
# This is the 7th commit message:
remove unnecessary prepare_data (#17080 )
test=develop
# This is the 8th commit message:
fix interpolate cu. test=develop (#17101 )
# This is the 9th commit message:
test=develop, double backward leaky_relu (#17067 )
backward of backward: leaky_relu
# This is the 10th commit message:
fix fuse optimizer ops (#17102 )
test=develop
# This is the 11th commit message:
truncated_gaussian_random supported in distributed training, test=develop (#17091 )
# This is the 12th commit message:
Detailed coordinate description for yolov3 loss (#17007 )
* Detailed coordinate description for yolov3 loss
test=develop
* modified api.spec
test=develop
* modified loss name
* fix api.spec
test=develop
* polish description
test=develop
* modified api.spec
test=develop
# This is the 13th commit message:
fix test_weight_decay (#17109 )
test=develop
# This is the 14th commit message:
Path flag (#17105 )
* fix python/paddle/fluid/__init__.py detecting problems
6 years ago
Huihuang Zheng
b9494058b3
Use CudnnWorkspaceHandle in exhaustive search ( #17082 )
...
1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn.
2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search.
test=develop
6 years ago
xiaoting
7da7881c0e
Detailed coordinate description for yolov3 loss ( #17007 )
...
* Detailed coordinate description for yolov3 loss
test=develop
* modified api.spec
test=develop
* modified loss name
* fix api.spec
test=develop
* polish description
test=develop
* modified api.spec
test=develop
6 years ago
ceci3
258e000be6
test=develop, double backward leaky_relu ( #17067 )
...
backward of backward: leaky_relu
6 years ago
Kaipeng Deng
10c487eb21
fix interpolate cu. test=develop ( #17101 )
6 years ago
whs
55ce36e981
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward ( #17090 )
...
* Cache the information of linear interpolation in forward and use it in backward.
test=develop
* Fix cuda kernel.
test=develop
6 years ago
Yan Xu
0b07eef118
ParallelDyGraph with GPU collective mode ( #16827 )
...
implement dygraph.parallel.DataParallel to hook reduce op.
6 years ago
Zeng Jinle
0c335dcd2c
Make conv cudnn workspace size configurable ( #17036 )
...
* make_conv_cudnn_ws_size_configurable, test=develop
* change std::max to std::min
test=develop
6 years ago
jerrywgz
ea3504c7ec
Merge pull request #17017 from jerrywgz/fix_potential_hung
...
fix potential hung in generate proposals, test=develop
6 years ago
qingqing01
c1c2633a63
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. ( #16862 )
...
* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package.
1. Add ReluDoubleGradMaker when register relu_grad.
2. Add a new gradient checker by comparing theoretical and numerical Jacobian. Check double gradients by double_grad_check.
6 years ago
tangwei12
45136b1b41
fix bug in save, test=develop
6 years ago
jerrywgz
47013af0a6
Merge pull request #17011 from jerrywgz/enhance_generate_proposal_labels
...
enhance generate proposal labels, test=develop
6 years ago
jerrywgz
b2df6de860
fix potential hung in generate proposals, test=develop
6 years ago
qingqing01
ea42e431f8
Speed unit testing. ( #16978 )
...
* Speed affine_channel_op unit testing
* Add check in tensor_py
* Fix ONLY_CPU Compiling
6 years ago
jerrywgz
d3a66fc616
enhance generate proposal labels, test=develop
6 years ago
Zeng Jinle
1202d3fc74
Refine model gpu memory ( #16993 )
...
* speedup gc and inplace softmax_with_cross_entropy_grad
test=develop
* refine models gpu mem
Merge skip vars and warning messages of mem opt
remove relu mem opt
test=develop
* follow comments
test=develop
6 years ago
Yibing Liu
3c375751f8
Support seq len equal to 0 in sequence ops ( #16935 )
...
* Support seq len equal to 0 in sequence ops
test=develop
* Add more test cases
* Fix some comments
test=develop
* Fix py3 error
test=develop
6 years ago
Yibing Liu
36c05d36ab
Check some shapes only in runtime ( #16919 )
...
* Check some shapes only in runtime
test=develop
* Follow review comments
test=develop
* Update API spec
6 years ago
gongweibao
cbdb8a17b1
Polish DGC code ( #16818 )
6 years ago
Guo Sheng
9f1d4a152b
Merge pull request #16902 from guoshengCS/refine-infer-shape
...
Refine ENFORCE in infer_shape of gru_op and lstm_unit_op.
6 years ago
Guo Sheng
caf2848356
Merge pull request #16898 from Superjomn/fix/logical_op_infershape
...
fix logical op infershape
6 years ago
tangwei12
2b61db07d1
fix sampling id op bug ( #16909 )
...
* fix sampling id op bug, test=develop
6 years ago
Kevin
c474e7ddf5
fix overflow by int32 mul test=develop ( #16794 )
...
* fix overflow by int32 mul test=develop
* fix reference nullptr
* fix codestyle test=develop
* modify to point in ContextProjectFunctor test=develop
* modify to point in ContextProjectFunctor test=develop
* modify . to -> test=develop
6 years ago
Hongyu Liu
baf60e3a27
Merge pull request #16907 from xuezhong/fix_infershape_bug2
...
fix infershape bug
6 years ago
Yan Chunwei
8cff2b4231
Update logical_op.cc
...
test=develop
6 years ago
Hongyu Liu
40be9590d4
Merge pull request #16897 from velconia/fix_split_lod_tensor_op_infer_shape
...
Fix infer shape of split lod tensor op
6 years ago
Hongyu Liu
d68fb792f8
Merge pull request #16890 from colourful-tree/dev
...
fix teacher_student op infer
6 years ago
Hongyu Liu
ad2a2bb063
Merge pull request #16913 from phlrain/fix_bpr_loss
...
Fix bpr loss
6 years ago
Hongyu Liu
8bd549bb68
Merge pull request #16861 from tensor-tang/refine/infershape
...
separate runtime infershape
6 years ago
Hongyu Liu
9d5d44f939
Merge pull request #16840 from phlrain/fix_shape_check_many
...
fix shape check many by hongyu
6 years ago
SunGaofeng
0508c9869c
Merge pull request #16853 from SunGaofeng/affine_modify
...
modify infer shape pad_op, pad_constant_like_op, no need in psroi_pool_op, crop_op
6 years ago
tangwei12
008fd785fd
fix/positive negative pair op ( #16895 )
...
* fix infershape in runtime
* fix infershape in runtime
test=develop
* fix infershape in runtime
6 years ago
Hongyu Liu
d5a7c09856
Merge pull request #16798 from phlrain/softmax_cross_support_high_rank
...
softmax cross entropy support high rank
6 years ago
xiaoting
431eab648e
Merge branch 'develop' into yolov3_loss
6 years ago
xuezhong
9c6ee7cf4c
add <memory>
...
test=develop
6 years ago
xuezhong
742d758747
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_infershape_bug2
6 years ago
phlrain
ddd9e1cb66
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bpr_loss
6 years ago
phlrain
01eda557cd
fix bpr loss; test=developp
6 years ago
xuezhong
41740519df
add <memory>
...
test=develop
6 years ago
xuezhong
4791029f19
remove <memory>
...
test=develop
6 years ago
xuezhong
fb75bd3e9c
remove <memory>
...
test=develop
6 years ago
xuezhong
afbc435adf
fix infershape check bug
...
test=develop
6 years ago
Yan Chunwei
916930a8ae
Update logical_op.cc
...
test=develop
6 years ago
jerrywgz
f4626ee425
Merge pull request #16873 from jerrywgz/roi_align_infer_shape
...
refine infer shape in roi align
6 years ago
xuezhong
5663fbfb0a
fix infershape bug
...
test=develop
6 years ago
tensor-tang
be18636e59
Merge remote-tracking branch 'ups/develop' into refine/infershape
...
test=develop
6 years ago
Hongyu Liu
0701c2db47
Merge pull request #16518 from zhoukunsheng/rsqrt
...
Rsqrt
6 years ago
guosheng
f641a47bb1
Refine ENFORCE in infer_shape of gru_op and lstm_unit_op.
...
test=develop
6 years ago
tensor-tang
ed892ebaf9
update
...
test=develop
6 years ago
tensor-tang
411b9ba520
update
...
test=develop
6 years ago
superjomn
0c233e8870
up
...
test=develop
6 years ago
superjomn
f0985cecb9
fix logical op infershape
...
test=develop
6 years ago
minqiyang
592011bbcf
Fix infer shape of split lod tensor op
...
test=develop
6 years ago
phlrain
d722841622
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
phlrain
5309b081f6
simple code; test=develop
6 years ago
liuwei1031
6864370a9e
scatter_op bug fix, test=develop ( #16866 )
...
* scatter_op bug fix, test=develop
* scatter_op bug fix, test=develop
* scatter_op bug fix, test=develop
6 years ago
jerrywgz
46bd853c10
Merge pull request #16843 from ceci3/infershape
...
fix batch_norm and cos_sim shape check
6 years ago
Hongyu Liu
779ffb844b
Merge pull request #16876 from tink2123/infer_shape
...
modified infer shape for op
6 years ago
phlrain
766c868199
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
Tao Luo
d966faae95
Merge pull request #16852 from sneaxiy/fix_merge_lod_tensor_op_infer_shape
...
Fix merge_lod_tensor_op infer shape
6 years ago
phlrain
f7a5a98fdb
remove unused code; test=develop
6 years ago
heqiaozhi
1cca7114c6
fix infer
...
test=develop
6 years ago
jerrywgz
c139f1e049
refine roi align infer shape, test=develop
6 years ago
Hongyu Liu
208abe9763
Merge pull request #16787 from phlrain/fix_concat_shape_2
...
Fix concat shape
6 years ago
zhaoyuchen2018
44bd3a630e
Merge pull request #16857 from zhaoyuchen2018/sumreshape
...
Fix sum infershape issue
6 years ago
whs
6429877816
Fix infer_shape in pad2d_op ( #16831 )
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
8a7daeea4c
Merge pull request #16871 from jacquesqiao/fix-shape
...
fix split_byref_op infer shape
6 years ago
Jacek Czaja
87a44b1149
[MKL-DNN] Added reusing of primitive descriptors (fp32) ( #16667 )
...
* - Reuse of conv PD
- conv transpose pd reused
- Added PD reusing of softmax and Batch Norm
- Refactoring and removal of not needed routines of mkl-dnn ops
test=develop
- Fix to reusing conv
test=develop
- Lint fixes
test=develop
- Further lint fixes
test=develop
- Lint fixes
test=develop
- lint fixes
test=develop
- Lint workaround
test=develop
* - Fix after review on including boost as third party header
test=develop
* - Fix after review. Name change to something more descriptive
test=develop
6 years ago
liuwei1031
072db0938b
optimize lstmp and sample_logits op, test=develop ( #16845 )
...
* optimize lstmp and sample_logits op, test=develop
* update op_use_default_grad_op_maker.spec, test=develop
* delete useless file,test=develop
* append 0 to dim variable to avoid memory reusage, test=develop
6 years ago
phlrain
a5d1f9cf66
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_check_many
6 years ago
phlrain
87916f8d84
simple code;test=develop
6 years ago
tink2123
e0f7bf4f2f
polish the code
...
test=develop
6 years ago
Jiabin Yang
84b7a7291e
test=develop, fix hsigmoid dereference nullptr ( #16769 )
...
* test=develop, fix hsigmoid dereference nullptr
* test=develop, refine condition
* test=develop, refine comments
6 years ago
Kaipeng Deng
19bb53fa61
Merge pull request #16850 from heavengate/fix_infer_shape
...
infer shape compatable -1. test=develop
6 years ago
Hongyu Liu
2de7f3cfc3
Merge pull request #16799 from phlrain/sigmoid_corss_entropy_support_high_rank
...
supprt high rank
6 years ago
tink2123
ffe81af073
modified infer shape
...
test=develop
6 years ago
Tao Luo
a67fbffdca
Merge pull request #16854 from luotao1/conv_shift_infershape
...
Fix conv_shift_op infershape
6 years ago
Qiao Longfei
0e663d7f51
fix split_byref_op infer shape
...
test=develop
6 years ago
phlrain
7e933056ae
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
phlrain
64bf752dcc
fix concat; test=develop
6 years ago
Hongyu Liu
c96ee47d01
Merge pull request #16797 from phlrain/fix_split
...
Fix split
6 years ago
ceci3
74fc786097
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape
6 years ago
ceci3
dd4c54cd37
test=develop
6 years ago
colourful-tree
434caab21b
Merge pull request #16741 from colourful-tree/dev
...
add continuous value model op
6 years ago
zhaoyuchen
aeddb14148
Fix sum infershape issue
...
if dim is -1, compile time check fails.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
tensor-tang
10879a3cae
separate runtime infershape
...
test=develop
6 years ago
Hongyu Liu
e9cdd0e0cd
Merge pull request #16826 from zhoukunsheng/all_any
...
All any
6 years ago
SunGaofeng
76888b0ba1
modify in pad_op and pad_constant
6 years ago
Zhen Wang
cabea96789
Merge pull request #16838 from wzzju/fix_quan_transform
...
fix the hang bugs of memory copying. test=develop
6 years ago
Tao Luo
6f0a40fa29
Fix conv_shift_op infershape
...
test=develop
6 years ago
dengkaipeng
e590588a02
fix for itnerpolate. test=develop
6 years ago
乔龙飞 Qiao Longfei
bcc0d41646
Merge pull request #16822 from jacquesqiao/optimize-merge-add
...
Optimize merge add
6 years ago
SunGaofeng
2120f075a3
modify infer shape in pad_op.cc, pad_constant_like_op.cc. No need in psroi_pool_op.cc, crop_op.cc
6 years ago
sneaxiy
4a83522c38
fix merge_lod_tensor_op infer shape, test=develop
6 years ago
wanghaoshuang
89c2bc09ea
Fix infer_shape in pad2d_op
...
test=develop
6 years ago
dengkaipeng
b2dcdb5100
infer shape compatable -1. test=develop
6 years ago
ceci3
55f572b2da
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape
6 years ago
ceci3
87d89dfe14
fix batch_norm and cos_sim infer shape, test=develop
6 years ago
乔龙飞 Qiao Longfei
82cff5ec42
Merge pull request #16762 from jacquesqiao/add-async_sparse_param_update_recorder
...
Add async sparse param update recorder
6 years ago
phlrain
165a7bd5a1
fix shape check many; test=develop
6 years ago
Zhen Wang
d988a24a14
fix the hang bugs of memory copying. test=develop
6 years ago
Yibing Liu
4267a81afc
Correct the lod level of compiled time in lod_reset ( #16790 )
...
test=develop
6 years ago
chengduo
c62674f475
Refine StaticRnn ( #16707 )
...
* enable recurrent op test=develop
6 years ago
SunGaofeng
1f2afccf30
test=develop ( #16783 )
6 years ago
tink2123
06156b6cb7
polish yolov3 loss annotation
...
test=develop
6 years ago
zhoukunsheng
bb8ea1637d
fix 16823: delete default_grad register for reduce_all, reduce_any
6 years ago
Qiao Longfei
faae1b4170
fix cpplint test=develop
6 years ago
zhoukunsheng
4aa594e3e7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
2d6b4f23f0
test=develop
...
bug fix: reduce_all, reduce_any register GRAD_OP, but have not defined GradKernel
6 years ago
Qiao Longfei
0a8ff2ecd4
add cpu_merge_add_multi_noduplicated_test test=develop
6 years ago
Qiao Longfei
920a960974
optimize merge add if input rows of all selected rows is not duplicated
6 years ago
zhoukunsheng
b1c5820b3f
fix merge conflict
6 years ago
heqiaozhi
759940786e
Merge remote-tracking branch 'upstream/develop' into dev
...
test=develop
6 years ago
zhoukunsheng
9643f906ed
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
phlrain
6bc3932823
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
phlrain
a3e5238112
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sigmoid_corss_entropy_support_high_rank
6 years ago
phlrain
715a31b35e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain
db0518bb4d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
Qiao Longfei
afc56949c1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Qiao Longfei
d831f1b0ba
fix brpc code
6 years ago
liuwei1031
85363848a1
Security issue ( #16774 )
...
* disable memory_optimize and inpalce strategy by default, test=develop
* fix security issue
http://newicafe.baidu.com:80/issue/PaddleSec-3/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-8/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-12/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-32/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-35/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-37/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-40/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-43/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-44/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-45/show?from=page
test=develop
* revert piece.cc, test=develop
* adjust api.cc,test=develop
6 years ago
phlrain
468f8ccff9
supprt high rank; test=develop
6 years ago
phlrain
bbfc82cc42
softmax corss entropy support high rank
...
test=develop
6 years ago
zhoukunsheng
2b2b4ca21e
Merge branch 'develop' into rsqrt
6 years ago
heqiaozhi
5fb9bdc892
add X to grad
...
test=develop
6 years ago
Hongyu Liu
e2897ba13a
Merge pull request #16432 from zhoukunsheng/linspace
...
add linspace op
6 years ago
Hongyu Liu
afe0d64c9d
Merge pull request #16320 from zhoukunsheng/all_any
...
add reduce_all, reduce_any op
6 years ago
phlrain
026836ffe0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain
488e889f3d
fix split infer shape; test=develop
6 years ago
Qiao Longfei
8b8a0487c7
fix compile test=develop
6 years ago
phlrain
3f0d047d1b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
phlrain
dc6e814686
fix concat shape; test=develop
6 years ago
Qiao Longfei
a541c25ab6
fix cpplint test=develop
6 years ago
Qiao Longfei
0608f8ca56
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
heqiaozhi
e9d79dd5d7
ctx.device_context() to CPUPlace
...
test=develop
6 years ago
heqiaozhi
44b226eda6
ctx.device_context() to CPUPlace
...
test=develop
6 years ago
heqiaozhi
fa6ea1e0e6
remove grad X
...
test=develop
6 years ago
heqiaozhi
72c9aecfc3
fix doc
...
test=develop
6 years ago
heqiaozhi
8de5dc31db
add doc
...
test=develop
6 years ago
heqiaozhi
5204fb4402
add doc
...
test=develop
6 years ago
heqiaozhi
6e5c44d3fe
add doc
6 years ago
gongweibao
bf606bce8a
Fix grpc log message. ( #16735 )
6 years ago
Zeng Jinle
9f7b027dce
fix activation grad op desc maker ( #16715 )
...
test=develop
6 years ago
heqiaozhi
ba78446cca
add continuous value model op
...
test=develop
6 years ago
Kaipeng Deng
ed97156461
Merge pull request #16439 from heavengate/resize_scale
...
add attr scale. test=develop
6 years ago
heqiaozhi
0c3c5e19d3
add continuous value model op
...
test=develop
6 years ago
heqiaozhi
54dddee37e
add continuous value model op
...
test=develop
6 years ago
Huihuang Zheng
2146293d26
Fix op registry ( #16677 )
...
list of fixed ops:
lookup_table_op
space_to_depth_op
squared_l2_distance_op
squared_l2_norm_op
teacher_student_sigmoid_loss_op
tree_conv_op
warpctc_op
test=develop
6 years ago
baojun
1c8b34ddd2
fix training validation test=develop ( #16698 )
6 years ago
tensor-tang
ad45a08351
fix avx option ( #16683 )
...
test=develop
6 years ago
Zeng Jinle
bb143052cb
fix gc bug in conditional block ( #16673 )
...
test=develop
6 years ago
ruri
229dc93277
Add Pixel shuffle OP ( #15782 )
...
* add pixel_shuffle op
* add pixel_shuffle op, test=develop
* rewrite code, test=develop
* delete useless comment, test=develop
* Refine pixel_shuffle_op and unit testing
* refine code,test=develop
* refine .cu,test=develop
* fix unittest,test=develop
* Fix unit testing
test=develop
* resolve conflict, test=develop
* fix test, test=develop
* fix API, test=develop
* fix test datatype bug,test=develop
* polish comments,test=develop
* add API,test=develop
* test=develop
* Add Pixel_Shuffle OP,test=develop
* support python3,test=develop
* add include memory to travis CI bug,test=develop
6 years ago
lujun
6173585051
Merge pull request #16634 from junjun315/my-cool-stuff
...
fix load bug about context.place
6 years ago
Zeng Jinle
1c526e1d1a
Fix some grad op desc makers ( #16633 )
...
* fix some grad op desc maker
test=develop
* fix grad op desc makers
test=develop
6 years ago
lujun
3773260edd
fix load type, test=develop
6 years ago
Zeng Jinle
d658244997
fix some grad op desc maker ( #16581 )
...
test=develop
6 years ago
gongweibao
0342f01249
Fix dgc bug. ( #16602 )
6 years ago
乔龙飞 Qiao Longfei
21622ca30b
Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
...
Add async ssa graph executor communicator
6 years ago
Qiao Longfei
766666a957
add log for FLAGS_communicator_send_wait_times
6 years ago
sneaxiy
10249c0b78
Merge develop
...
test=develop
6 years ago
zhoukunsheng
5edf4fb4fb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
Qiao Longfei
4031c1a7b1
fix ci build test=develop
6 years ago
Qiao Longfei
9861a92f6f
change the return type of NewTempScope to unique ptr test=develop
6 years ago
chengduo
feb1b54f9d
fix min and max bug ( #16570 )
...
test=develop
6 years ago
石晓伟
5dea0bdd1b
Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
...
merge feature/anakin-engine to develop
6 years ago
Qiao Longfei
fb6cc3a1bd
follow commnet, optimize code and add comment test=develop
6 years ago
Qiao Longfei
baf02328b2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
Wu Yi
22b02bfa62
Batch norm cudnn accurate ( #16545 )
...
* fix cudnn batch norm accuracy test=develop
* fix cudnn batch norm accuracy test=develop
* disable failed test for later fix test=develop
6 years ago
Qiao Longfei
df45c8c538
update nce and hierarchical_sigmoid remote_prefetch
...
test=develop
6 years ago
Qiao Longfei
a1821a0449
remote remote_prefetch in embedding layer test=develop
6 years ago
Kaipeng Deng
3d939d32ee
Merge pull request #16023 from heavengate/kl_div_loss
...
KL div loss: add kldiv_loss op
6 years ago
Kaipeng Deng
54474637ae
Merge pull request #16057 from heavengate/softmax_axis
...
Add attr 'axis' for softmax
6 years ago
Kaipeng Deng
63ac947e2f
Merge pull request #16135 from heavengate/shift
...
Add temporal_shift op for TSM model
6 years ago
Zhaolong Xing
3e6aa498d6
Merge pull request #16526 from NHZlX/refine_trt_anakin
...
refine subgraph trt and anakin
6 years ago
dengkaipeng
0f7411a1ae
round down for scale. test=develop
6 years ago
zhoukunsheng
5284213942
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
zhoukunsheng
91ba75000c
fix type conversion problem in rsqrt functor
6 years ago
sneaxiy
33473890f3
Merge develop
...
test=develop
6 years ago
nhzlx
7cde2d9e84
fix trt engine test error.
...
test=develop
6 years ago
zhoukunsheng
3c4f5f0368
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
Shixiaowei02
bddb2cd315
resolve conflicts with the develop branch test=develop
6 years ago
chengduo
1096746cbf
Fuse Adam And SGD ops ( #15933 )
...
* fuse optimizer
6 years ago
Jacek Czaja
2632327429
[MKL-DNN] Tensor modifications revert ( #16462 )
...
* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233 )"
This reverts commit 13816dd4ac
.
Apart from enabling transformer for MKL-DNN
* Revert "- MKL-DNN pooling updated to set_prim_desc"
This reverts commit c63f6b2039
.
Conflicts:
paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc
* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429 )"
test=develop
This reverts commit dec9cf53c8
.
* - concat compilation fix
- lint
test=develop
- Lint fixes
test=develop
- Lint fixes
test=develop
- Fix Transpose MKLDNN op
test=develop
6 years ago
Zeng Jinle
4143a1c216
Merge pull request #16491 from sneaxiy/feature/advance_gc
...
Fix grad op makers
6 years ago
sneaxiy
2c836ff914
check default grad maker
...
test=develop
6 years ago
nhzlx
d065b5bf2b
Anakin ssd support
...
refine trt first run
add quant dequant fuse pass
omit simplify_anakin_priorbox_detection template
omit transpose_flatten_concat_fuse template
test=develop
6 years ago
zhoukunsheng
2f9e562100
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
zhoukunsheng
082822d417
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
zhoukunsheng
c47f3cc7fe
test=develop
...
add rsqrt op
6 years ago
dengkaipeng
8160a66193
fix doc priority. test=develop
6 years ago
dengkaipeng
193185b840
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shift
6 years ago
Zeng Jinle
5f1c92a81c
Merge pull request #16450 from zhhsplendid/del-redundant-op-var-reg
...
Add SpectralNormGradOpDescMaker
6 years ago
gongweibao
eb83abeac3
Add DGC(Deep Gradient Compression) interface. ( #15841 )
6 years ago
Qiao Longfei
34890fd3b1
fix gpu build for lookup_table_op test=develop
6 years ago
sneaxiy
4c8254e3bf
revert some loop op revision
...
test=develop
6 years ago
zhoukunsheng
874b5d8362
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
zhoukunsheng
83c7bca13f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
sneaxiy
16f0994728
Merge develop
...
test=develop
6 years ago
sneaxiy
63651c1968
fix grad desc maker
...
test=develop
6 years ago
Zeng Jinle
c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
...
Enhance gc to support deleting tensor buffer in advance
6 years ago
Qiao Longfei
b542639dc0
code clean test=develop
6 years ago
Jiabin Yang
0d9d25d40f
Feature/refactor layers to Layers ( #16337 )
...
* test=develop, add some Layers and tests
* test=develop, add more layers
* test=develop, add more layers
* test=develop, add force cpu option
* Update test_layers.py
remove pdb
* test=develop, refine code
6 years ago
dengkaipeng
3e352388eb
fix format. test=develop
6 years ago
dengkaipeng
eb2123e12d
fix doc and jit. test=develop
6 years ago
liuwei1031
8d22bc17a4
Memory optimize ( #16410 )
...
* fix cdn issue, test=develop
* fix memory optimize bugs, test=develop
* fix memory optimize bugs, test=develop
* remove add/sub_2 op, test=develop
* disable memory_optimize by default, test=develop
* disable inplace activation in python, test=develop
* fix unittests, test=develop
* fix unittests, test=develop
* bug-fix, test=develop
6 years ago
Xin Pan
f8c279b11c
Merge pull request #16454 from panyx0718/imperative2
...
polish deepCF model to support real dataset
6 years ago
Zhaolong Xing
fa1796a30a
Merge pull request #16330 from NHZlX/merge_anakin_branch_to_dev
...
Cherry-pick from PaddlePaddle:feature/anakin-engine: Anakin subgraph support.
6 years ago
Qiao Longfei
392e97aae5
fix cpplint test=develop
6 years ago
tensor-tang
1eff834e97
update jitkernel doc ( #16327 )
...
* update jitkernel doc
test=develop
* follow comments
* follow comments
test=develop
6 years ago
Qiao Longfei
30618409db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Yiqun Liu
98802e1f75
Optimize the implementation of while_op again, for cases when is_test is true. ( #16359 )
...
test=develop
6 years ago
lujun
c34b24ede7
Merge pull request #16425 from junjun315/checkpoint-hotfix
...
Checkpoint hotfix
6 years ago
nhzlx
953bdde058
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
...
test=develop
6 years ago
sneaxiy
78fb3a62e0
fix env variable settting bug
...
test=develop
6 years ago
Qiao Longfei
b65adf7f65
add communicator_send_wait_times
6 years ago
Xin Pan
1f89249a95
update DeepCF model
...
test=develop
6 years ago
sneaxiy
a7d0ac50b8
Merge develop
6 years ago
sneaxiy
7000ec85d9
fix some op grad maker
...
fix ctest eager deletion disable bug
test=develop
6 years ago
nhzlx
45b3766fdf
fix comments
...
test=develop
6 years ago
zhhsplendid
3909108cae
Add SpectralNormGradOpDescMaker
...
Use SpectralNormGradOpDescMaker instead of DefaultGradOpDescMaker
to avoid registering useless variables to improve GPU usage.
test=develop
6 years ago
dengkaipeng
ceb31d30f0
fix formax. test=develop
6 years ago
dengkaipeng
cfef382a85
fix format. test=develop
6 years ago
Zeng Jinle
4cc9809cae
Merge pull request #15799 from sneaxiy/feature/decoupled_reader
...
Try to decouple reader with program_desc
6 years ago
whs
e9bec9369b
[slim] Add quantization strategy and distillation strategy. ( #16408 )
...
* Add fsp operator.
1 Add unitest.
2. Add python API.
3. Add layer test.
* Add quantization strategy.
1. Add API.
2. Add unitest.
* Add distillatoin strategy.
* Add unitest config file for quantization
* Fix Copyright
test=develop
* Fix setup.py
* Fix document of layers.py.
test=develop
* Fix unitest in python3.
test=develop
* Fix documents.
test=develop
* 1. refine fsp op by batched gemm
2. remove unused import
test=develop
* Fix test_dist_se_resnext.
1. disable test distillation.
2. reset framework.py
test=develop
* Enable unitest of distillation after fixing Block._clone_variable
test=develop
* Fix cdn issue.
test=develop
6 years ago
dengkaipeng
d54005a7f4
fix unittest. test=develop
6 years ago
dengkaipeng
90bd038d35
fix format. test=develop
6 years ago
Qiao Longfei
63acbe7a65
fix bug
6 years ago
zhoukunsheng
d3d31a5894
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
664c342ca0
test=develop
...
split reduce_all_any_op.h into two files
add unit test for reduce_all, reduce_any
6 years ago
lujun
bc4d1c7246
fix mix input type error, test=develop
6 years ago
Qiao Longfei
0ff1e64fab
fix a bug
6 years ago
zhoukunsheng
43060084a4
test=develop
...
add linspace, modify interface comments in tensor.py, merge with develop branch
6 years ago
Qiao Longfei
0997cf8f65
add more check
6 years ago
sneaxiy
f8ed2c229e
try to fix ci error
...
test=develop
6 years ago
zhoukunsheng
8e9ebebcef
test=develop
...
add linspace op
6 years ago
lujun
18aa59493e
fix mix input type error, test=develop
6 years ago
dengkaipeng
cfda1fdea7
add attr scale. test=develop
6 years ago
lujun
1b6a2a09e8
fix mix input type error, test=develop
6 years ago
Qiao Longfei
93464b25ac
update async_sparse_param_update_recorder
6 years ago
Qiao Longfei
542b52fac3
fix trainer_id
6 years ago
Qiao Longfei
be0c482304
update trainer_id
6 years ago
sneaxiy
2f54d9f995
Merge develop
...
test=develop
6 years ago
Qiao Longfei
c60f312d1b
add trick
6 years ago
Qiao Longfei
103c9bb376
update rpc_client
6 years ago
sneaxiy
072d95d8f6
Merge develop
...
test=develop
6 years ago
sneaxiy
a93a9eef8f
add op registry type
...
refine gc code
test=develop
6 years ago
dengkaipeng
f45aced59b
add jit test. develop=test
6 years ago