ceci3
74fc786097
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape
6 years ago
ceci3
dd4c54cd37
test=develop
6 years ago
colourful-tree
434caab21b
Merge pull request #16741 from colourful-tree/dev
...
add continuous value model op
6 years ago
zhaoyuchen
aeddb14148
Fix sum infershape issue
...
if dim is -1, compile time check fails.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
tensor-tang
10879a3cae
separate runtime infershape
...
test=develop
6 years ago
Tao Luo
ca8b8fa0bd
Merge pull request #16830 from Superjomn/fix/tmp-memory-optim
...
fix memory optim temporarily
6 years ago
Hongyu Liu
e9cdd0e0cd
Merge pull request #16826 from zhoukunsheng/all_any
...
All any
6 years ago
dengkaipeng
7b1702d9a1
fix unittest and API.spec. test=develop
6 years ago
SunGaofeng
76888b0ba1
modify in pad_op and pad_constant
6 years ago
lijianshe02
de26df440b
add SaveOptimModel interface in analysis_predictor.h and test it in a… ( #16441 )
...
* add SaveOptimModel interface in analysis_predictor.h and test it in analyzer_dam_tester and analyzer_resnet50_tester test=develop
6 years ago
Zhen Wang
cabea96789
Merge pull request #16838 from wzzju/fix_quan_transform
...
fix the hang bugs of memory copying. test=develop
6 years ago
Tao Luo
6f0a40fa29
Fix conv_shift_op infershape
...
test=develop
6 years ago
dengkaipeng
e590588a02
fix for itnerpolate. test=develop
6 years ago
lidanqing
de02d40e98
improve preprocess script and read from tar
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
bcc0d41646
Merge pull request #16822 from jacquesqiao/optimize-merge-add
...
Optimize merge add
6 years ago
SunGaofeng
2120f075a3
modify infer shape in pad_op.cc, pad_constant_like_op.cc. No need in psroi_pool_op.cc, crop_op.cc
6 years ago
sneaxiy
4a83522c38
fix merge_lod_tensor_op infer shape, test=develop
6 years ago
wanghaoshuang
89c2bc09ea
Fix infer_shape in pad2d_op
...
test=develop
6 years ago
dengkaipeng
b2dcdb5100
infer shape compatable -1. test=develop
6 years ago
ceci3
55f572b2da
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape
6 years ago
ceci3
87d89dfe14
fix batch_norm and cos_sim infer shape, test=develop
6 years ago
乔龙飞 Qiao Longfei
82cff5ec42
Merge pull request #16762 from jacquesqiao/add-async_sparse_param_update_recorder
...
Add async sparse param update recorder
6 years ago
phlrain
165a7bd5a1
fix shape check many; test=develop
6 years ago
heqiaozhi
aab9ea6ccb
out && commit id
...
test=develop
6 years ago
Zhen Wang
d988a24a14
fix the hang bugs of memory copying. test=develop
6 years ago
Yibing Liu
4267a81afc
Correct the lod level of compiled time in lod_reset ( #16790 )
...
test=develop
6 years ago
guru4elephant
1b75049407
Merge pull request #16788 from guru4elephant/fix_python_codestyle
...
Fix python codestyle in incubator
6 years ago
chengduo
c62674f475
Refine StaticRnn ( #16707 )
...
* enable recurrent op test=develop
6 years ago
chengduo
e9409665f7
Refine Fuse Optimize Ops ( #16810 )
...
* fix bug of fuse optimize ops
6 years ago
SunGaofeng
1f2afccf30
test=develop ( #16783 )
6 years ago
superjomn
f58c3ec189
fix memory optim temporarily
...
test=develop
6 years ago
chengduo
d105c06b50
Replace ThreadedExecutor with FastThreadedExecutor ( #16650 )
...
* replace ThreadedExecutor with FastThreadedExecutor
test=develop
* Fix Travise CI
test=develop
* Test FastThreadedSSAGraphExecutor
test=develop
* refine parallel_ssa_graph_executor.cc
test=develop
6 years ago
tink2123
9b9e5e606c
modified api.spec
...
test=develop
6 years ago
tink2123
06156b6cb7
polish yolov3 loss annotation
...
test=develop
6 years ago
zhoukunsheng
bb8ea1637d
fix 16823: delete default_grad register for reduce_all, reduce_any
6 years ago
Qiao Longfei
faae1b4170
fix cpplint test=develop
6 years ago
zhoukunsheng
4aa594e3e7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
2d6b4f23f0
test=develop
...
bug fix: reduce_all, reduce_any register GRAD_OP, but have not defined GradKernel
6 years ago
Qiao Longfei
0a8ff2ecd4
add cpu_merge_add_multi_noduplicated_test test=develop
6 years ago
Qiao Longfei
920a960974
optimize merge add if input rows of all selected rows is not duplicated
6 years ago
zhoukunsheng
b1c5820b3f
fix merge conflict
6 years ago
Qiao Longfei
1526a3e4da
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
...
test=develop
6 years ago
heqiaozhi
759940786e
Merge remote-tracking branch 'upstream/develop' into dev
...
test=develop
6 years ago
zhoukunsheng
9643f906ed
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
phlrain
6bc3932823
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
phlrain
a3e5238112
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sigmoid_corss_entropy_support_high_rank
6 years ago
phlrain
715a31b35e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain
db0518bb4d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
XiaoguangHu
06809ebbb1
Merge pull request #16815 from sneaxiy/fix_new_added_reduce_ops_spec
...
Fix default_grad_op_desc_maker
6 years ago
zhoukunsheng
ebf6cf9f18
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng
380df8281f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
Yihua Xu
93cedfdb9c
Fix the order while sorting the operators ( #16756 )
...
* Fix the order when sorting operators.
test=develop
* Enable transfomer compare test item.
test=develop
* Use set to replace vector.
test=develop
6 years ago
sneaxiy
00b4580f46
fix default_grad_op_desc_maker
...
test=develop
6 years ago
Qiao Longfei
afc56949c1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Qiao Longfei
d831f1b0ba
fix brpc code
6 years ago
heqiaozhi
96d5ec16f6
change API
...
test=develop
6 years ago
liuwei1031
85363848a1
Security issue ( #16774 )
...
* disable memory_optimize and inpalce strategy by default, test=develop
* fix security issue
http://newicafe.baidu.com:80/issue/PaddleSec-3/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-8/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-12/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-32/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-35/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-37/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-40/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-43/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-44/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-45/show?from=page
test=develop
* revert piece.cc, test=develop
* adjust api.cc,test=develop
6 years ago
phlrain
468f8ccff9
supprt high rank; test=develop
6 years ago
phlrain
bbfc82cc42
softmax corss entropy support high rank
...
test=develop
6 years ago
zhoukunsheng
2b2b4ca21e
Merge branch 'develop' into rsqrt
6 years ago
heqiaozhi
5fb9bdc892
add X to grad
...
test=develop
6 years ago
Hongyu Liu
e2897ba13a
Merge pull request #16432 from zhoukunsheng/linspace
...
add linspace op
6 years ago
Hongyu Liu
283ae0faaa
Merge pull request #16525 from zhoukunsheng/rank
...
Rank
6 years ago
Hongyu Liu
afe0d64c9d
Merge pull request #16320 from zhoukunsheng/all_any
...
add reduce_all, reduce_any op
6 years ago
phlrain
026836ffe0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain
488e889f3d
fix split infer shape; test=develop
6 years ago
ruri
39d6a985bc
fix some comments, include cosine_decay,l2_normalize,pixel_shuffle ( #16763 )
...
* fix some comments, include cosine_decay,l2_normalize,pixel_shuffle
* Add api.spec, test=develop
* update api.spec, test=develop
* add api.spec,test=develop
* test=develop
* test=develop
* fix conflict,test=develop
6 years ago
Qiao Longfei
8b8a0487c7
fix compile test=develop
6 years ago
dongdaxiang
a659b37ace
make lodtensor_printer usable in gpu setting
...
test=develop
6 years ago
guru4elephant
aa46caf3d9
Merge pull request #16765 from guru4elephant/gpu_dataset_train
...
add gpu training for Executor.train_from_dataset
6 years ago
phlrain
3f0d047d1b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
phlrain
dc6e814686
fix concat shape; test=develop
6 years ago
Wu Yi
8b58732013
remove append_LARS not used api test=develop ( #16703 )
6 years ago
dongdaxiang
3c2d236815
remove all warnings
...
test=develop
6 years ago
Yiqun Liu
112f16143b
Add an option to enable the cache of expected kernel in train phase. ( #16724 )
...
* Add an option to enable the cache of expected kernel in train phase.
test=develop
* Change the default value of cache_expected_kernel to true.
6 years ago
liuwei1031
2e07c19a9c
disable memory_optimize and inpalce strategy by default, test=develop ( #16760 )
6 years ago
dongdaxiang
ea07eb8cd2
remove comment in data_feed.cc
...
develop=test
6 years ago
guru4elephant
e349a7443f
Update nccl_context.h
6 years ago
Qiao Longfei
a541c25ab6
fix cpplint test=develop
6 years ago
dongdaxiang
05464e7c5c
add gpu training for Executor.train_from_dataset
...
test=develop
6 years ago
Qiao Longfei
0608f8ca56
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
heqiaozhi
e9d79dd5d7
ctx.device_context() to CPUPlace
...
test=develop
6 years ago
heqiaozhi
44b226eda6
ctx.device_context() to CPUPlace
...
test=develop
6 years ago
heqiaozhi
fa6ea1e0e6
remove grad X
...
test=develop
6 years ago
heqiaozhi
72c9aecfc3
fix doc
...
test=develop
6 years ago
heqiaozhi
8de5dc31db
add doc
...
test=develop
6 years ago
heqiaozhi
5204fb4402
add doc
...
test=develop
6 years ago
heqiaozhi
6e5c44d3fe
add doc
6 years ago
gongweibao
bf606bce8a
Fix grpc log message. ( #16735 )
6 years ago
Zeng Jinle
9f7b027dce
fix activation grad op desc maker ( #16715 )
...
test=develop
6 years ago
lujun
9bd44b94da
Merge pull request #16561 from junjun315/move-api-to-root
...
Move dygraph api to root
6 years ago
heqiaozhi
ba78446cca
add continuous value model op
...
test=develop
6 years ago
liuwei1031
fdb719a1bf
avoid optimize variable used in subblock, test=develop ( #16739 )
6 years ago
Kaipeng Deng
ed97156461
Merge pull request #16439 from heavengate/resize_scale
...
add attr scale. test=develop
6 years ago
heqiaozhi
0c3c5e19d3
add continuous value model op
...
test=develop
6 years ago
Tao Luo
1a21d08f12
Merge pull request #16725 from tensor-tang/pass/disable_seqpool
...
disable seqpool concat pass by default saving CI time
6 years ago
heqiaozhi
54dddee37e
add continuous value model op
...
test=develop
6 years ago
liuwei1031
a18ef10c87
only use the latest version variable for inplace strategy ( #16736 )
...
* bug-fix, test=develop
* tweak code, test=develop
6 years ago
Huihuang Zheng
2146293d26
Fix op registry ( #16677 )
...
list of fixed ops:
lookup_table_op
space_to_depth_op
squared_l2_distance_op
squared_l2_norm_op
teacher_student_sigmoid_loss_op
tree_conv_op
warpctc_op
test=develop
6 years ago
Tao Luo
5c364cda3c
Merge pull request #16711 from luotao1/has_attr
...
reduce hasAttr elapsed time in RunImpl
6 years ago
tensor-tang
d6c1b5a73b
disable seqpool concat pass by default saving CI time
...
test=develop
6 years ago
baojun
1c8b34ddd2
fix training validation test=develop ( #16698 )
6 years ago
lujun
92c8ac8a74
merge conflict, test=develop
6 years ago
chengduo
55b15db5af
Add unit test for fuse all_reduce ops ( #16699 )
...
* test fuse all_reduce
6 years ago
luotao1
4098ba29ed
reduce hasAttr elapsed time in RunImpl
...
test=develop
6 years ago
luotao1
f89a9c5d95
Merge branch 'develop' into has_attr
6 years ago
Tao Luo
ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
...
Cache the chosen kernel of operators
6 years ago
luotao1
6afc97ca6b
reduce hasAttr elapsed time in RunImpl
...
test=develop
6 years ago
Yan Xu
55e3c6949b
disable reuse port test=develop ( #16704 )
6 years ago
gongweibao
8b793d0efd
Fix DGC bug. ( #16697 )
6 years ago
Yiqun Liu
3fe8cb0dd7
Enable the runtime_context_cache pass in train phase ( #16640 )
...
* Try to enable the runtime_context_cache pass in train phase.
* Put the append of runtime_context_cache pass ahead of multi_dev passes.
test=develop
6 years ago
Tao Luo
4048a2681f
Merge pull request #16687 from luotao1/reduce_inference_ci_time
...
reduce all analyzer_test ci elasped time
6 years ago
Yan Xu
169829c83a
fix win gpu test=develop ( #16694 )
6 years ago
guru4elephant
7d653f0aed
Merge pull request #16652 from xjqbest/dataset_merge_develop
...
fix dataset bug
6 years ago
xjqbest
6a57e8075a
remove trainer_id in datafeed and dataset
...
test=develop
6 years ago
tensor-tang
ad45a08351
fix avx option ( #16683 )
...
test=develop
6 years ago
Tao Luo
d5c8d4acfe
reduce all analyzer_test ci elasped time
...
test=develop
6 years ago
luotao1
695f2db6a0
update expected_kernel_cache_pass
...
test=develop
6 years ago
luotao1
226596a296
Merge branch 'develop' into core_opt_choose_kernel
6 years ago
Tao Luo
0d6581c574
Merge pull request #16656 from PaddlePaddle/revert-16555-model_data_cryption_link_all_lib
...
Revert "Model data cryption link all lib"
6 years ago
bingyanghuang
88ceda5134
MKLDNN INT8 v2 readme.md ( #16515 )
6 years ago
Tao Luo
503b5eb4dd
Merge pull request #16678 from luotao1/int8_cache
...
test_analyzer_int8 tests use default pass order
6 years ago
Yan Xu
b4c3a6aa0b
[Imperative] implement imperative NCCLParallelContext ( #16477 )
...
add NCCLParallelContext for parallel dygraph
6 years ago
Zeng Jinle
bb143052cb
fix gc bug in conditional block ( #16673 )
...
test=develop
6 years ago
xjqbest
5e5139283b
fix runtime error
...
test=develop
6 years ago
luotao1
bd636a9ea6
test_analyzer_int8 tests use default pass order
...
test=develop
6 years ago
ruri
229dc93277
Add Pixel shuffle OP ( #15782 )
...
* add pixel_shuffle op
* add pixel_shuffle op, test=develop
* rewrite code, test=develop
* delete useless comment, test=develop
* Refine pixel_shuffle_op and unit testing
* refine code,test=develop
* refine .cu,test=develop
* fix unittest,test=develop
* Fix unit testing
test=develop
* resolve conflict, test=develop
* fix test, test=develop
* fix API, test=develop
* fix test datatype bug,test=develop
* polish comments,test=develop
* add API,test=develop
* test=develop
* Add Pixel_Shuffle OP,test=develop
* support python3,test=develop
* add include memory to travis CI bug,test=develop
6 years ago
lujun
b236091e89
merge confict, test=develop
6 years ago
lujun
facd95e6e3
merge confict, test=develop
6 years ago
lujun
01f4f2d7e4
merge confict, test=develop
6 years ago
lujun
6173585051
Merge pull request #16634 from junjun315/my-cool-stuff
...
fix load bug about context.place
6 years ago
Chen Weihang
0b2aec14b6
Revert "Model data cryption link all lib ( #16555 )"
...
test=develop
This reverts commit c38c7c5619
.
6 years ago
Yan Chunwei
044ae2497d
fix identity temporarily ( #15942 )
6 years ago
xjqbest
271b7147cc
fix dataset bug
...
test=develop
6 years ago
Zeng Jinle
1c526e1d1a
Fix some grad op desc makers ( #16633 )
...
* fix some grad op desc maker
test=develop
* fix grad op desc makers
test=develop
6 years ago
chengduo
ea2a2f778a
Fix the bug of AllReduceDepPass ( #16393 )
6 years ago
Wojciech Uss
ec2750b3c2
fix repeating passes ( #16606 )
6 years ago
Wojciech Uss
9b6a029666
fix dataset reading and add support for full dataset ( #16559 )
6 years ago
lujun
3773260edd
fix load type, test=develop
6 years ago
chengduo
b75a69bad6
Add Stream for fetch op handle ( #16600 )
...
* expose fuse broadcast ops
6 years ago
chengduo
1342e2ea04
Fix the bug of the fast threaded executor ( #16514 )
...
* Fix the bug of the fast threaded executor. I
6 years ago
Zeng Jinle
d658244997
fix some grad op desc maker ( #16581 )
...
test=develop
6 years ago
gongweibao
423bc515da
fix batch merge bug ( #16601 )
6 years ago
gongweibao
0342f01249
Fix dgc bug. ( #16602 )
6 years ago
lidanqing
2ca0de3cd4
fix preprocess script with processbar, integrity check and logs ( #16608 )
...
* fix preprocess script with processbar, integrity check and logs
test=develop
* delete unnecessary empty lines, change function name
test=develop
6 years ago
Tao Luo
c797aed893
Merge pull request #16584 from luotao1/enhance_wget
...
enhance analyzer_tests download
6 years ago
Qiyang Min
12e36d38a5
Imperative deep-first backward process ( #16605 )
...
* Fix bug of gradient interface
* shrink transformer
* Right transformer
* Change from width-first backward to deep-first backward process
test=develop
* Reverse iterator op's input
test=develop
* Polish code
* Change the iteration direction in ingrads' map slots
test=develop
* Polish code
test=develop
6 years ago
liuwei1031
bd193781df
fix the bug of reusing different types of variables in memory_optimiz… ( #16547 )
...
* fix the bug of reusing different types of variables in memory_optimize_pass, test=develop
* disable SELECTED_ROWS AND LOD_TENSOR_ARRAY reusage, test=develop
6 years ago
Chen Weihang
c38c7c5619
Model data cryption link all lib ( #16555 )
...
* link the libwbaes.so into paddle
* polish detail, test=develop
* try fix mac_pr_ci error, test=develop
* add compile option, test=develop
* fix ci error, test=develop
* ignore failed to find mac lib, test=develop
* change cdn to bj, cdn can't get the latest version
* trigger ci, test=develop
* temporary delete win32 lib linking, test=develop
* change https to http, test=develop
* turn compile option on to off
* turn compile option off to on, test=develop
* try lib compiled by gcc4.8, test=develop
* update lib version, test=develop
* link other lib, test=develop
* add setup config
* delete false, test=develop
* delete no_soname, test=develop
* recover so name set
* fix, test=develop
* adjust make config, test=develop
* remove link to wbaes, test=develop
* remove useless define, test=develop
6 years ago
乔龙飞 Qiao Longfei
21622ca30b
Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
...
Add async ssa graph executor communicator
6 years ago
Tao Luo
ce18710421
enhance analyzer_tests download
...
test=develop
6 years ago
Qiao Longfei
766666a957
add log for FLAGS_communicator_send_wait_times
6 years ago
sneaxiy
10249c0b78
Merge develop
...
test=develop
6 years ago
zhoukunsheng
5edf4fb4fb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
47a5eec87d
improve comment
6 years ago
Qiao Longfei
4031c1a7b1
fix ci build test=develop
6 years ago
zhoukunsheng
68f887aab3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng
b2cdadd700
improve comment
6 years ago
Qiao Longfei
9861a92f6f
change the return type of NewTempScope to unique ptr test=develop
6 years ago
dengkaipeng
d02f4c2fd6
fix API.spec. test=develop
6 years ago
zhoukunsheng
315176c65b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
6 years ago
zhoukunsheng
8b6b626718
fix problem of API.spec mismatch
6 years ago
chengduo
feb1b54f9d
fix min and max bug ( #16570 )
...
test=develop
6 years ago
石晓伟
5dea0bdd1b
Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
...
merge feature/anakin-engine to develop
6 years ago
Qiao Longfei
fb6cc3a1bd
follow commnet, optimize code and add comment test=develop
6 years ago
Qiao Longfei
adf272bcec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
guru4elephant
76b49f02ee
Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop
...
Train with pipe reader merge develop
6 years ago
Qiao Longfei
baf02328b2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
qingqing01
1ebd7434d5
Add linear learning warmup method in learning rate scheduler. ( #16563 )
...
* Add linear learning warmup method
This warmup lr can be combinated with other learning rate strategies.
For example:
decayed_lr = fluid.layers.linear_lr_warmup(
fluid.layers.piecewise_decay(boundaries, lr_steps),
warmup_steps, start_lr, end_lr)
6 years ago
Qiao Longfei
9db1a9e128
change log level test=develop
6 years ago
gongweibao
a61ed9782e
fix log level test=develop ( #16554 )
6 years ago
Wu Yi
22b02bfa62
Batch norm cudnn accurate ( #16545 )
...
* fix cudnn batch norm accuracy test=develop
* fix cudnn batch norm accuracy test=develop
* disable failed test for later fix test=develop
6 years ago
Qiao Longfei
8342f12e31
fix set remote_prefetch test=develop
6 years ago
Qiao Longfei
df45c8c538
update nce and hierarchical_sigmoid remote_prefetch
...
test=develop
6 years ago
Qiao Longfei
a1821a0449
remote remote_prefetch in embedding layer test=develop
6 years ago
zhaoyuchen2018
0b0abdbcd4
Merge pull request #16041 from zhaoyuchen2018/docrefine
...
Fix document array_read code error
6 years ago
dongdaxiang
d7963e1065
infer_from_dataset API.spec
...
test=develop
6 years ago
dongdaxiang
718ea6dbd5
fix fleet code style
...
test=develop
6 years ago
xjqbest
782ab2e2bd
add some doc
...
test=develop
6 years ago
xjqbest
9b84e8e66b
fix code style
...
test=develop
6 years ago
xjqbest
a99c8d0c29
fix client to client communication bug
...
test=develop
6 years ago
gongweibao
fea91164b7
Fix windows compilation error! ( #16546 )
...
* fix compiled
test=develop
* follow comments test=develop
6 years ago
dongdaxiang
b35d27fa94
fix API spec
...
test=develop
6 years ago
Kaipeng Deng
3d939d32ee
Merge pull request #16023 from heavengate/kl_div_loss
...
KL div loss: add kldiv_loss op
6 years ago
Kaipeng Deng
54474637ae
Merge pull request #16057 from heavengate/softmax_axis
...
Add attr 'axis' for softmax
6 years ago
Kaipeng Deng
63ac947e2f
Merge pull request #16135 from heavengate/shift
...
Add temporal_shift op for TSM model
6 years ago
chengduo
bb80dae7d0
Add DecoupledWeightDecay ( #16427 )
...
* Add DecoupledWeightDecay
6 years ago
dengkaipeng
737ea825ac
fix doc. test=develop
6 years ago
dongdaxiang
3829eac27b
fix API spec about infer_from_dataset
...
test=develop
6 years ago
Shixiaowei02
7b9fc71076
update tensorrt subgraph_util test=develop
6 years ago
dengkaipeng
0f7411a1ae
round down for scale. test=develop
6 years ago
dongdaxiang
87027a2eef
fix API.spec problem and executor's docstring
...
test=develop
6 years ago
zhoukunsheng
5284213942
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
zhoukunsheng
91ba75000c
fix type conversion problem in rsqrt functor
6 years ago
Tao Luo
8f7b5883b8
Merge pull request #16529 from lidanqing-intel/lidanqing/preprocess-data
...
preprocess with PIL the full val dataset and save binary
6 years ago
Tao Luo
5b24002389
Merge pull request #16399 from sfraczek/sfraczek/analyzer_int8_resnet50_test
...
create test for quantized resnet50
6 years ago
sneaxiy
8c869a865d
update develop ops
...
test=develop
6 years ago
sneaxiy
33473890f3
Merge develop
...
test=develop
6 years ago
dongdaxiang
ade9337486
fix API.spec
...
test=develop
6 years ago
liuwei1031
278debab71
fix comments of 16410, test=develop ( #16499 )
...
* fix comments of 16410, test=develop
* modify inplace_op_inference_test according to pass interface change, test=develop
6 years ago
Wojciech Uss
2498395132
remove profiling from int8 test
...
test=develop
6 years ago
Zhaolong Xing
3e6aa498d6
Merge pull request #16526 from NHZlX/refine_trt_anakin
...
refine subgraph trt and anakin
6 years ago
nhzlx
7cde2d9e84
fix trt engine test error.
...
test=develop
6 years ago
zhoukunsheng
3c4f5f0368
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
zhoukunsheng
ead3c0a8fc
update api.spec
6 years ago
dongdaxiang
720647e17f
rebase current develop and fix conflict
...
test=develop
6 years ago
zhoukunsheng
2336d5ca5d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
6 years ago
dongdaxiang
3a79be6eb3
refine API spec
...
test=develop
6 years ago
dongdaxiang
98dda08a85
fix pull sparse slow problem
...
test=develop
6 years ago
dongdaxiang
93c3c7f9b3
fix dataset testcase problem
...
test=develop
6 years ago
dongdaxiang
d739bab844
fix async_executor problem and remove some unnecessary testcase, fix trainer_desc import problem
...
test=develop
6 years ago
dongdaxiang
241d8808be
add timer to distributed executor
...
test=develop
6 years ago
dongdaxiang
3c73859eec
add trainer_desc.proto to distributed executor
...
test=develop
6 years ago
dongdaxiang
60b7bf6fa6
add infer_from_dataset for inference
6 years ago
xjqbest
030c7e7e9d
fix FillSparseValue error
...
test=develop
6 years ago
dongdaxiang
88880d9b69
fix import trainer_desc_pb2 error
...
test=develop
6 years ago
dongdaxiang
0030eb2a61
fix distributed building
...
test=develop
6 years ago
dongdaxiang
ed31874397
undefine rand_r()
...
test=develop
6 years ago
dongdaxiang
f7e4813804
add WIN32 for rand_r and usleep
...
test=develop
6 years ago
dongdaxiang
cedbc161da
add more _LINUX maroc on data_feed.cc for mac and window compile
...
test=develop
6 years ago
dongdaxiang
c5980c3566
add _LINUX macro
...
test=develop
6 years ago
dongdaxiang
433301fbc2
remove glog in shell.h
...
test=develop
6 years ago
dongdaxiang
9e51ad4a65
fix io and fs compile on mac
...
test=develop
6 years ago
dongdaxiang
6eca88ac76
fix io and fs compile on mac
...
test=develop
6 years ago
dongdaxiang
2708108a08
fix fleet_wrapper compile on windows
...
test=develop
6 years ago
dongdaxiang
4ce35815fb
fix windows GLOG problem
...
test=develop
6 years ago
dongdaxiang
e3107a6ae0
fix windows compile problem
...
test=develop
6 years ago
dongdaxiang
398004ece0
disable sys/wait.h to fix windows compile problem, include scope in lodtensor_printer
...
test=develop
6 years ago
dongdaxiang
d4514949bf
remove local random engine in fleet with rand_r()
...
test=develop
6 years ago
dongdaxiang
e82969eeb0
remove getdelim in windows
...
test=develop
6 years ago
dongdaxiang
45eb6f0765
run pre-commit check files and fix code style problem
...
test=develop
6 years ago
dongdaxiang
d87ba58c14
refine document of python API, make device_worker and trainer's API private
...
test=develop
6 years ago
dongdaxiang
5687f234bf
fix trainer_desc.proto error
6 years ago
dongdaxiang
b95b80bc76
add doc string for executor and update API.spec
...
test=develop
6 years ago
dongdaxiang
6be9f719e2
make string_helper dependency work
...
test=develop
6 years ago
xjqbest
e95cafd9a7
fix code style & add dataset testcase
...
test=develop
6 years ago
dongdaxiang
39362a8415
move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
...
test=develop
6 years ago
dongdaxiang
ba15d6b164
move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
...
test=develop
6 years ago
xjqbest
be74de2c61
fix code style & fix register bug & add release_memory
...
test=develop
6 years ago
dongdaxiang
a0b59773af
fix code style
6 years ago
dongdaxiang
f39b323ed7
remove trainer_library in CMakeLists
...
test=develop
6 years ago
dongdaxiang
365be5d559
support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
...
test=develop
6 years ago
dongdaxiang
dc8cf36e4b
add more example on datagenerator
...
test=develop
6 years ago
dongdaxiang
6bf796df14
refine print fetch list
6 years ago
xjqbest
589467f24c
fix bug
6 years ago
xjqbest
b7940c2918
fix bug of gen_worker_desc and set_filelist, add some doc
6 years ago
dongdaxiang
68d7bf3de5
add fetch var function
...
test=develop
6 years ago
xjqbest
a34fe6248f
add some doc
6 years ago
xujiaqi01
f5c6a14b54
fix runtime error
6 years ago
xujiaqi01
a5b1a0e12b
support multi dataset && add init model && fix bug
6 years ago
dongdaxiang
3c65cc1bbd
add document for role_maker and fleet parameter, data_generator
6 years ago
dongdaxiang
f6c9232a3d
fix dataset float32 type problem
6 years ago
dongdaxiang
73b1f396d7
add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
...
test=develop
6 years ago
dongdaxiang
73544e8b8d
add training speed log
6 years ago
dongdaxiang
9419de521f
add IO percent for multi_trainer
6 years ago
dongdaxiang
6af697adb0
add trainfileswithprofiler for downpour worker
6 years ago
dongdaxiang
2644b88685
add comment for MPI Symetric role maker
...
test=develop
6 years ago
dongdaxiang
cf45c54340
add distributed optimizer factory
6 years ago
dongdaxiang
b7a202aa38
add distributed optimizer factory
6 years ago
xujiaqi01
70a5d4f797
fix error
6 years ago
xujiaqi01
d25389fefd
add some log && fix error
6 years ago
dongdaxiang
f612877797
add incubate for unified API
6 years ago
dongdaxiang
317eb0aad3
add incubate for unified API
6 years ago
xujiaqi01
39449ba0b9
fix bug && add DestroyReaders in trainer
6 years ago
dongdaxiang
e657c127a8
hide opt_info in distirbuted optimizer
6 years ago
xujiaqi01
ecfc7df913
add dataset factory && fix style
6 years ago
dongdaxiang
328f11b8b6
refactor downpour optimization
...
test=develop
6 years ago
xujiaqi01
3cea00bd52
store memory data in Dataset && fix bug
6 years ago
dongdaxiang
ff87698a44
refactor downpour optimization
6 years ago
dongdaxiang
b66f0074b6
fix data reading bugs in api, add VLOG(3) log for setup
6 years ago
dongdaxiang
b415ec27e8
make Dataset* as an argument
6 years ago
xjqbest
dd67ad08a2
modify c++ and python dataset related code & fix bug
6 years ago
dongdaxiang
cc4def6ba5
fix some conflict for compilation
6 years ago
heqiaozhi
9bca1926c1
refactor & fix bug
6 years ago
xjqbest
2e9a836c6f
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
2486389793
add RunFromDataset in executor
6 years ago
dongdaxiang
e36bbcc871
fix some typo and CMakefile.txt
6 years ago
xjqbest
824b84d185
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
08c25995a2
add run from dataset in executor.
6 years ago
dongdaxiang
c28bbdf8ba
add dataset_generator.py
...
dataset_generator.py is a framework for generating data with python
the generated data with a fixed format will be feeded into c++ reader
test=develop
6 years ago
dongdaxiang
be757096da
add pybind for fleet
6 years ago
dongdaxiang
687cb79dbb
add pipe command io interface
6 years ago
dongdaxiang
1fe54416c9
move fs.cc and shell.cc into paddle/fluid/framework/io
...
test=develop
6 years ago
dongdaxiang
53fbab5d33
add fs_local_open example
6 years ago
dongdaxiang
afaf937010
add fs_local_open example
6 years ago
dongdaxiang
cf1360643f
add printer for fetch variable
6 years ago
dongdaxiang
d65cb13ad5
add pslib flag on fleet_wrapper CMakefile
6 years ago
dongdaxiang
6de9ebc65c
refine VLOG in fleet_wrapper.h
...
test=develop
6 years ago
dongdaxiang
97d5cd30f0
make pull dense worker work
6 years ago
dongdaxiang
39014b9f9f
fix class register problem
6 years ago
dongdaxiang
f0dd1201cc
fix destructor problem
...
test=develop
6 years ago
dongdaxiang
f2bde9c241
fix destructor problem
6 years ago
dongdaxiang
54f047a126
fix ngraph compile option
6 years ago
dongdaxiang
dd1dc9bcf0
add common.h.in back
6 years ago
dongdaxiang
378037c535
make s_instance_ private to ensure singleton
6 years ago
dongdaxiang
a446d26e8a
add todo for asynce executor
6 years ago
dongdaxiang
c165012031
refine device_worker and trainer code
...
test=develop
6 years ago
dongdaxiang
8a335b50be
add downpour device_worker pb configuration
6 years ago
dongdaxiang
24a8001142
make -DWITH_PSLIB=ON compilable
6 years ago
dongdaxiang
67b1d6d721
add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
dongdaxiang
855bf579d2
add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
lujun
d4f63d82ac
Merge pull request #16475 from junjun315/fix-doc-multiplex
...
refine multiplex-doc
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
Shixiaowei02
bddb2cd315
resolve conflicts with the develop branch test=develop
6 years ago
lidanqing
0d656996bf
fix some bugs of unzip and reading val list
...
test=develop
6 years ago
chengduo
1096746cbf
Fuse Adam And SGD ops ( #15933 )
...
* fuse optimizer
6 years ago
Jacek Czaja
2632327429
[MKL-DNN] Tensor modifications revert ( #16462 )
...
* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233 )"
This reverts commit 13816dd4ac
.
Apart from enabling transformer for MKL-DNN
* Revert "- MKL-DNN pooling updated to set_prim_desc"
This reverts commit c63f6b2039
.
Conflicts:
paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc
* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429 )"
test=develop
This reverts commit dec9cf53c8
.
* - concat compilation fix
- lint
test=develop
- Lint fixes
test=develop
- Lint fixes
test=develop
- Fix Transpose MKLDNN op
test=develop
6 years ago
Zeng Jinle
4143a1c216
Merge pull request #16491 from sneaxiy/feature/advance_gc
...
Fix grad op makers
6 years ago
chengduo
2265d091e6
Fix threaded executor bug ( #16508 )
...
* fix threaded executor bug
test=develop
* change the order of class member
test=develop
* Fix Travis CI
test=develop
6 years ago
sneaxiy
2c836ff914
check default grad maker
...
test=develop
6 years ago
nhzlx
d065b5bf2b
Anakin ssd support
...
refine trt first run
add quant dequant fuse pass
omit simplify_anakin_priorbox_detection template
omit transpose_flatten_concat_fuse template
test=develop
6 years ago
zhoukunsheng
beb4a86d13
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
6 years ago
zhoukunsheng
b06e9b773d
test=develop
...
add rank op
6 years ago
Zeng Jinle
69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
...
Revert "Fix allocator bug"
6 years ago
lidanqing
b46e467abc
add wget and unzip part and change data_dir
...
test=develop
6 years ago
zhoukunsheng
2f9e562100
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
dengkaipeng
2078f4207f
fix API.spec. test=develop
6 years ago
zhoukunsheng
082822d417
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
zhoukunsheng
c47f3cc7fe
test=develop
...
add rsqrt op
6 years ago
lidanqing
894aa9b235
change script file name and data_dir location
...
test=develop
6 years ago
lidanqing
57f51e5b08
preprocess with PIL the full val dataset and save binary
...
test=develop
6 years ago
dengkaipeng
8160a66193
fix doc priority. test=develop
6 years ago
chengduo
ed61d67c73
Fix the interface of Pass::Apply ( #16484 )
...
* modify the interface of Pass::Allay
test=develop
* Polish code
test=develop
* Fix Travis CI
test=develop
* fix Pass::Apply interface
test=develop
* Fix Travis CI
test=develop
6 years ago
dengkaipeng
193185b840
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shift
6 years ago
Zeng Jinle
5f1c92a81c
Merge pull request #16450 from zhhsplendid/del-redundant-op-var-reg
...
Add SpectralNormGradOpDescMaker
6 years ago
Zeng Jinle
2aa18e2bda
Merge pull request #16496 from sneaxiy/fix_gc_bug
...
Fix gc bug
6 years ago
Sylwester Fraczek
8ece7a9708
fixed url to dataset
...
test=develop
6 years ago
sneaxiy
5656fa9f7c
fix travis ci
...
test=develop
6 years ago
Zeng Jinle
174d0d0b90
Revert "Fix allocator bug"
...
add include headers to fix travis-ci
test=develop
6 years ago
gongweibao
eb83abeac3
Add DGC(Deep Gradient Compression) interface. ( #15841 )
6 years ago
Qiao Longfei
34890fd3b1
fix gpu build for lookup_table_op test=develop
6 years ago
Sylwester Fraczek
fe21578a44
create test for quantized resnet50
...
test=develop
6 years ago
Michał Gallus
2d8b7b3a76
Refine default MKL-DNN Pass order ( #16490 )
...
* Refine default MKL-DNN Pass order
test=develop
* Add comment to default MKL-DNN Pass list
test=develop
6 years ago
Wojciech Uss
09dfc7a2aa
C-API quantization core 2 ( #16396 )
...
* C-API quantization core
test=develop
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
* Decouple Quantizer from AnalysisPredictor
test=develop
* fixes after review
test=develop
* renamed mkldnn quantize stuff
test=develop
* remove ifdef from header file
test=develop
6 years ago
Jiabin Yang
e41d581304
test=develop, fix space_to_depth_doc ( #16293 )
...
* test=develop, fix space_to_depth_doc
* test=develop, refine indent
* test=develop, refine code
* test=develop, add api spec
6 years ago
sneaxiy
4c8254e3bf
revert some loop op revision
...
test=develop
6 years ago
Zeng Jinle
644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
...
Fix allocator bug
6 years ago
sneaxiy
c4c6205268
fix gc bug
...
test=develop
6 years ago
zhoukunsheng
874b5d8362
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
zhoukunsheng
83c7bca13f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
a55111b869
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng
848ec97ab3
test=develop
...
add zeros_like op
6 years ago
sneaxiy
16f0994728
Merge develop
...
test=develop
6 years ago
sneaxiy
63651c1968
fix grad desc maker
...
test=develop
6 years ago
Yihua Xu
57dc3c1943
Disable compare for Issue#16316 ( #16466 )
...
* Disable compare for accuracy issue.
test=develop
* Add todo comments to show more information.
test=develop
6 years ago
Zeng Jinle
c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
...
Enhance gc to support deleting tensor buffer in advance
6 years ago
Qiao Longfei
33be014535
fix distribute compile problem test=develop
6 years ago
Jiabin Yang
54a73578a8
Feature/install check ( #16044 )
...
* test=develop, add install check
* test=develop, add install check scripts
* test=develop, refine language
* test=develop, add api spec
* test=develop, change cdn to bj to pass ci
6 years ago
Qiao Longfei
b542639dc0
code clean test=develop
6 years ago
wopeizl
c300b1ba69
Tensor index ( #16223 )
...
* extend the slice function for python
test=develop
6 years ago