Commit Graph

7767 Commits (bc037c13c728f6d17d48408f986c78dea11d4a89)

Author SHA1 Message Date
Qiao Longfei 0a8ff2ecd4 add cpu_merge_add_multi_noduplicated_test test=develop
6 years ago
Qiao Longfei 920a960974 optimize merge add if input rows of all selected rows is not duplicated
6 years ago
zhoukunsheng b1c5820b3f fix merge conflict
6 years ago
Qiao Longfei 1526a3e4da Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
heqiaozhi 759940786e Merge remote-tracking branch 'upstream/develop' into dev
6 years ago
zhoukunsheng 9643f906ed Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
phlrain 6bc3932823 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
phlrain a3e5238112 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sigmoid_corss_entropy_support_high_rank
6 years ago
phlrain 715a31b35e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain db0518bb4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
XiaoguangHu 06809ebbb1 Merge pull request #16815 from sneaxiy/fix_new_added_reduce_ops_spec
6 years ago
zhoukunsheng ebf6cf9f18 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng 380df8281f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
6 years ago
sneaxiy 00b4580f46 fix default_grad_op_desc_maker
6 years ago
Qiao Longfei afc56949c1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Qiao Longfei d831f1b0ba fix brpc code
6 years ago
heqiaozhi 96d5ec16f6 change API
6 years ago
liuwei1031 85363848a1 Security issue (#16774)
6 years ago
phlrain 468f8ccff9 supprt high rank; test=develop
6 years ago
phlrain bbfc82cc42 softmax corss entropy support high rank
6 years ago
zhoukunsheng 2b2b4ca21e Merge branch 'develop' into rsqrt
6 years ago
heqiaozhi 5fb9bdc892 add X to grad
6 years ago
Hongyu Liu e2897ba13a Merge pull request #16432 from zhoukunsheng/linspace
6 years ago
Hongyu Liu 283ae0faaa Merge pull request #16525 from zhoukunsheng/rank
6 years ago
Hongyu Liu afe0d64c9d Merge pull request #16320 from zhoukunsheng/all_any
6 years ago
phlrain 026836ffe0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain 488e889f3d fix split infer shape; test=develop
6 years ago
ruri 39d6a985bc fix some comments, include cosine_decay,l2_normalize,pixel_shuffle (#16763)
6 years ago
Qiao Longfei 8b8a0487c7 fix compile test=develop
6 years ago
dongdaxiang a659b37ace make lodtensor_printer usable in gpu setting
6 years ago
guru4elephant aa46caf3d9 Merge pull request #16765 from guru4elephant/gpu_dataset_train
6 years ago
phlrain 3f0d047d1b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
phlrain dc6e814686 fix concat shape; test=develop
6 years ago
Wu Yi 8b58732013 remove append_LARS not used api test=develop (#16703)
6 years ago
dongdaxiang 3c2d236815 remove all warnings
6 years ago
Yiqun Liu 112f16143b Add an option to enable the cache of expected kernel in train phase. (#16724)
6 years ago
liuwei1031 2e07c19a9c disable memory_optimize and inpalce strategy by default, test=develop (#16760)
6 years ago
dongdaxiang ea07eb8cd2 remove comment in data_feed.cc
6 years ago
guru4elephant e349a7443f Update nccl_context.h
6 years ago
Qiao Longfei a541c25ab6 fix cpplint test=develop
6 years ago
dongdaxiang 05464e7c5c add gpu training for Executor.train_from_dataset
6 years ago
Qiao Longfei 0608f8ca56 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
heqiaozhi e9d79dd5d7 ctx.device_context() to CPUPlace
6 years ago
heqiaozhi 44b226eda6 ctx.device_context() to CPUPlace
6 years ago
heqiaozhi fa6ea1e0e6 remove grad X
6 years ago
heqiaozhi 72c9aecfc3 fix doc
6 years ago
heqiaozhi 8de5dc31db add doc
6 years ago
heqiaozhi 5204fb4402 add doc
6 years ago
heqiaozhi 6e5c44d3fe add doc
6 years ago
gongweibao bf606bce8a Fix grpc log message. (#16735)
6 years ago
Zeng Jinle 9f7b027dce fix activation grad op desc maker (#16715)
6 years ago
lujun 9bd44b94da Merge pull request #16561 from junjun315/move-api-to-root
6 years ago
heqiaozhi ba78446cca add continuous value model op
6 years ago
liuwei1031 fdb719a1bf avoid optimize variable used in subblock, test=develop (#16739)
6 years ago
Kaipeng Deng ed97156461 Merge pull request #16439 from heavengate/resize_scale
6 years ago
heqiaozhi 0c3c5e19d3 add continuous value model op
6 years ago
Tao Luo 1a21d08f12 Merge pull request #16725 from tensor-tang/pass/disable_seqpool
6 years ago
heqiaozhi 54dddee37e add continuous value model op
6 years ago
liuwei1031 a18ef10c87 only use the latest version variable for inplace strategy (#16736)
6 years ago
Huihuang Zheng 2146293d26 Fix op registry (#16677)
6 years ago
Tao Luo 5c364cda3c Merge pull request #16711 from luotao1/has_attr
6 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
6 years ago
baojun 1c8b34ddd2 fix training validation test=develop (#16698)
6 years ago
lujun 92c8ac8a74 merge conflict, test=develop
6 years ago
chengduo 55b15db5af Add unit test for fuse all_reduce ops (#16699)
6 years ago
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
Tao Luo ad4a1bd13c Merge pull request #16339 from luotao1/core_opt_choose_kernel
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
Yan Xu 55e3c6949b disable reuse port test=develop (#16704)
6 years ago
gongweibao 8b793d0efd Fix DGC bug. (#16697)
6 years ago
Yiqun Liu 3fe8cb0dd7 Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
Tao Luo 4048a2681f Merge pull request #16687 from luotao1/reduce_inference_ci_time
6 years ago
Yan Xu 169829c83a fix win gpu test=develop (#16694)
6 years ago
guru4elephant 7d653f0aed Merge pull request #16652 from xjqbest/dataset_merge_develop
6 years ago
xjqbest 6a57e8075a remove trainer_id in datafeed and dataset
6 years ago
tensor-tang ad45a08351 fix avx option (#16683)
6 years ago
Tao Luo d5c8d4acfe reduce all analyzer_test ci elasped time
6 years ago
luotao1 695f2db6a0 update expected_kernel_cache_pass
6 years ago
luotao1 226596a296 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
Tao Luo 0d6581c574 Merge pull request #16656 from PaddlePaddle/revert-16555-model_data_cryption_link_all_lib
6 years ago
bingyanghuang 88ceda5134 MKLDNN INT8 v2 readme.md (#16515)
6 years ago
Tao Luo 503b5eb4dd Merge pull request #16678 from luotao1/int8_cache
6 years ago
Yan Xu b4c3a6aa0b [Imperative] implement imperative NCCLParallelContext (#16477)
6 years ago
Zeng Jinle bb143052cb fix gc bug in conditional block (#16673)
6 years ago
xjqbest 5e5139283b fix runtime error
6 years ago
luotao1 bd636a9ea6 test_analyzer_int8 tests use default pass order
6 years ago
ruri 229dc93277 Add Pixel shuffle OP (#15782)
6 years ago
lujun b236091e89 merge confict, test=develop
6 years ago
lujun facd95e6e3 merge confict, test=develop
6 years ago
lujun 01f4f2d7e4 merge confict, test=develop
6 years ago
lujun 6173585051 Merge pull request #16634 from junjun315/my-cool-stuff
6 years ago
Chen Weihang 0b2aec14b6 Revert "Model data cryption link all lib (#16555)"
6 years ago
Yan Chunwei 044ae2497d fix identity temporarily (#15942)
6 years ago
xjqbest 271b7147cc fix dataset bug
6 years ago
Zeng Jinle 1c526e1d1a Fix some grad op desc makers (#16633)
6 years ago
chengduo ea2a2f778a Fix the bug of AllReduceDepPass (#16393)
6 years ago
Wojciech Uss ec2750b3c2 fix repeating passes (#16606)
6 years ago
Wojciech Uss 9b6a029666 fix dataset reading and add support for full dataset (#16559)
6 years ago
lujun 3773260edd fix load type, test=develop
6 years ago
chengduo b75a69bad6 Add Stream for fetch op handle (#16600)
6 years ago
chengduo 1342e2ea04 Fix the bug of the fast threaded executor (#16514)
6 years ago
Zeng Jinle d658244997 fix some grad op desc maker (#16581)
6 years ago
gongweibao 423bc515da fix batch merge bug (#16601)
6 years ago
gongweibao 0342f01249 Fix dgc bug. (#16602)
6 years ago
lidanqing 2ca0de3cd4 fix preprocess script with processbar, integrity check and logs (#16608)
6 years ago
Tao Luo c797aed893 Merge pull request #16584 from luotao1/enhance_wget
6 years ago
Qiyang Min 12e36d38a5 Imperative deep-first backward process (#16605)
6 years ago
liuwei1031 bd193781df fix the bug of reusing different types of variables in memory_optimiz… (#16547)
6 years ago
Chen Weihang c38c7c5619 Model data cryption link all lib (#16555)
6 years ago
乔龙飞 Qiao Longfei 21622ca30b Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator
6 years ago
Tao Luo ce18710421 enhance analyzer_tests download
6 years ago
Qiao Longfei 766666a957 add log for FLAGS_communicator_send_wait_times
6 years ago
sneaxiy 10249c0b78 Merge develop
6 years ago
zhoukunsheng 5edf4fb4fb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng 47a5eec87d improve comment
6 years ago
Qiao Longfei 4031c1a7b1 fix ci build test=develop
6 years ago
zhoukunsheng 68f887aab3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng b2cdadd700 improve comment
6 years ago
Qiao Longfei 9861a92f6f change the return type of NewTempScope to unique ptr test=develop
6 years ago
dengkaipeng d02f4c2fd6 fix API.spec. test=develop
6 years ago
zhoukunsheng 315176c65b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
6 years ago
zhoukunsheng 8b6b626718 fix problem of API.spec mismatch
6 years ago
chengduo feb1b54f9d fix min and max bug (#16570)
6 years ago
石晓伟 5dea0bdd1b Merge pull request #16498 from Shixiaowei02/feature/anakin-engine
6 years ago
Qiao Longfei fb6cc3a1bd follow commnet, optimize code and add comment test=develop
6 years ago
Qiao Longfei adf272bcec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
guru4elephant 76b49f02ee Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop
6 years ago
Qiao Longfei baf02328b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
qingqing01 1ebd7434d5 Add linear learning warmup method in learning rate scheduler. (#16563)
6 years ago
Qiao Longfei 9db1a9e128 change log level test=develop
6 years ago
gongweibao a61ed9782e fix log level test=develop (#16554)
6 years ago
Wu Yi 22b02bfa62 Batch norm cudnn accurate (#16545)
6 years ago
Qiao Longfei 8342f12e31 fix set remote_prefetch test=develop
6 years ago
Qiao Longfei df45c8c538 update nce and hierarchical_sigmoid remote_prefetch
6 years ago
Qiao Longfei a1821a0449 remote remote_prefetch in embedding layer test=develop
6 years ago
zhaoyuchen2018 0b0abdbcd4 Merge pull request #16041 from zhaoyuchen2018/docrefine
6 years ago
dongdaxiang d7963e1065 infer_from_dataset API.spec
6 years ago
dongdaxiang 718ea6dbd5 fix fleet code style
6 years ago
xjqbest 782ab2e2bd add some doc
6 years ago
xjqbest 9b84e8e66b fix code style
6 years ago
xjqbest a99c8d0c29 fix client to client communication bug
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
dongdaxiang b35d27fa94 fix API spec
6 years ago
Kaipeng Deng 3d939d32ee Merge pull request #16023 from heavengate/kl_div_loss
6 years ago
Kaipeng Deng 54474637ae Merge pull request #16057 from heavengate/softmax_axis
6 years ago
Kaipeng Deng 63ac947e2f Merge pull request #16135 from heavengate/shift
6 years ago
chengduo bb80dae7d0 Add DecoupledWeightDecay (#16427)
6 years ago
dengkaipeng 737ea825ac fix doc. test=develop
6 years ago