Commit Graph

15334 Commits (c26130f3a9cb0de2266d5288c837e7f15b2ff7fe)

Author SHA1 Message Date
guru4elephant 5d6a1fcf16
fix infer_from_dataset and train_from_dataset (#17243)
6 years ago
chengduo 516317cf91
use sync copy (#17291)
6 years ago
Huihuang Zheng 2c4462711f
Fix API example code of save_inference_model (#17274)
6 years ago
xiaoting 9ed4aaada4 modified formula for Lrn (#17281)
6 years ago
zhaoyuchen2018 792443ef23
Refine elementwise kernel. (#16952)
6 years ago
lujun e388a1fb66
Repair api example (#17221)
6 years ago
Yiqun Liu 6b84688ba2
Optimize the cuda implementation of sum_op (#17283)
6 years ago
chengduo db5e74ab95
update assert (#17282)
6 years ago
Hongyu Liu c3195de522
Fix concat shape check (#17247)
6 years ago
lvmengsi dab71e8d97
Fix api example (#17231)
6 years ago
whs 7d7e29957f Fix bp of roi perspective transform op. (#17216)
6 years ago
baojun 7bd1d03ee5 Adding lrn op for ngraph engine (#17189)
6 years ago
Wojciech Uss 984aa90583 improved unit test output (#17266)
6 years ago
chengduo 8f534696b7
Polish Executor and Compiler doc (#17262)
6 years ago
tianshuo78520a dd86b40058 document_preview (#17166)
6 years ago
gongweibao 91784f8ec3
Fix code in document. (#17237)
6 years ago
chengduo 04bd413acb
Code Clean: Move all pass to paddle::framework::ir (#17228)
6 years ago
Huihuang Zheng 648320bb6c
Fix some data and reader related API code (#17202)
6 years ago
Zeng Jinle f2fa3f7300
fix api doc,test=develop (#17241)
6 years ago
Zeng Jinle 4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
6 years ago
baojun e782b54b9c update sofmax with axis arg test=develop (#17190)
6 years ago
tensor-tang 71f0c6d5bd
fix api doc of hash, relu, concat, argmin, argmax, argsoft and all activations (#17235)
6 years ago
Zeng Jinle 6fafd37e12
fix retry_allocator (#17245)
6 years ago
Tao Luo ff1661f12a
remove unused FLAGS_warpctc_dir (#17162)
6 years ago
Kaipeng Deng a71d8fdb87
Softmax_cross_entropy op add axis (#16806)
6 years ago
songhao c2e20e2a29 fix build warning like 'comparison between signed and unsigned (#17240)
6 years ago
Zhen Wang a914d9b116
Quant output scale (#17215)
6 years ago
zhaoyuchen2018 32b62c25af
optimize sum op (#16820)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
Tao Luo 16922e0093
fix api_example of tree_conv (#17239)
6 years ago
jerrywgz ef66baedc0
Refine api doc (#17230)
6 years ago
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
Yibing Liu 6b0f27e802
Fix some APIs' example (#17214)
6 years ago
ruri 5817077c99
Fix unexecutable API examples (#17218)
6 years ago
jerrywgz cc95a7516c
fix distribute fpn proposals, test=develop (#16152)
6 years ago
Tao Luo 9ec4615deb
fix profiler and name_scope API examples (#17212)
6 years ago
Zeng Jinle c5eeecca7c
Fix tensor_py.h (#17195)
6 years ago
Zeng Jinle ee2028a110
Add use_cuda to inplace pass (#17205)
6 years ago
chengduo 950aec55fd
It doesn't need sync when fetch_list nit not empty (#17201)
6 years ago
jerrywgz a72907bbf4
Enhance concat op to support empty input. (#17015)
6 years ago
wopeizl 83c4f7721f
use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
chengduo 3c6ab799cd
Remove unnecessary set_devices (#17158)
6 years ago
guru4elephant f938ccec62
remove async executor python api to fix document (#17174)
6 years ago
Zeng Jinle 5dfe2ab9e8
Fix mem leak when converting Tensor to numpy array (#17182)
6 years ago
Huihuang Zheng e4a5332416
Fix a typo in gpu_info.cc (#17175)
6 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
6 years ago
Zeng Jinle 4e1bc6e805
Rewrite inplace pass and fix gc bug (#17126)
6 years ago
Zeng Jinle 08773b6069
fix reader default stream,test=develop (#17106)
6 years ago
xiaoting bc48453b73 polish the label_smooth (#17138)
6 years ago
Leo Zhao bf4b21fa3d fix assertion failure issue when test_analyzer_bert uses ngraph (#17148)
6 years ago
tangwei12 deb510d451
cvm op feature (#17081)
6 years ago
wopeizl 3acb3635c2
1. move the API check into CPU process (#17110)
6 years ago
tianshuo78520a 92ce445227 Supplementary monitoring file reason explanation (#17131)
6 years ago
Zeng Jinle 28d69d710a
Refine dropout gpu memory (#17095)
6 years ago
Huihuang Zheng b9494058b3
Use CudnnWorkspaceHandle in exhaustive search (#17082)
6 years ago
tianshuo78520a 2192e7bb61 Path flag (#17105)
6 years ago
xiaoting 7da7881c0e Detailed coordinate description for yolov3 loss (#17007)
6 years ago
chengduo 794a195881
fix fuse optimizer ops (#17102)
6 years ago
ceci3 258e000be6
test=develop, double backward leaky_relu (#17067)
6 years ago
Kaipeng Deng 10c487eb21
fix interpolate cu. test=develop (#17101)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago
whs 55ce36e981
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
6 years ago
Zeng Jinle 842ded14b0
fix reference_count_pass,test=develop (#17060)
6 years ago
Yibing Liu beda78258f
Init mixed precision training interface (#16856)
6 years ago
Yan Xu 0b07eef118
ParallelDyGraph with GPU collective mode (#16827)
6 years ago
Tao Luo d9cd989825
Merge pull request #17048 from luotao1/fix_runtime_cache_bug
6 years ago
wopeizl f5d6937fe1
specify the cuda arch name and bin to decrease the compile time for i… (#17020)
6 years ago
chengduo cc31681687
use fast executor as default (#17044)
6 years ago
chengduo a2be4b4d91
Add fuse momenutum ops (#16745)
6 years ago
guru4elephant 03d469ad98
Merge pull request #17005 from wopeizl/fix_ncclwrapper_win1
6 years ago
tangwei12 13295d90d9
load persistables with selected rows, test=develop (#17047)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
Zeng Jinle 0c335dcd2c
Make conv cudnn workspace size configurable (#17036)
6 years ago
jerrywgz ea3504c7ec
Merge pull request #17017 from jerrywgz/fix_potential_hung
6 years ago
qingqing01 c1c2633a63
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)
6 years ago
tangwei12 45136b1b41 fix bug in save, test=develop
6 years ago
jerrywgz 47013af0a6
Merge pull request #17011 from jerrywgz/enhance_generate_proposal_labels
6 years ago
tianshuo78520a 73a360b504 Cmakelists fix (#17018)
6 years ago
liuwei1031 a770ce0615
add doc for memory_optimize, test=develop (#17010)
6 years ago
wopeizl d9991dccdd
add parallel build script to ci … (#16901)
6 years ago
jerrywgz b2df6de860 fix potential hung in generate proposals, test=develop
6 years ago
Zeng Jinle 24923f7604
fix py_reader demo (#16997)
6 years ago
qingqing01 ea42e431f8
Speed unit testing. (#16978)
6 years ago
jerrywgz d3a66fc616 enhance generate proposal labels, test=develop
6 years ago
wopeizl 51a0243a56 fix nccl wrapper on windows
6 years ago
Zeng Jinle 1202d3fc74
Refine model gpu memory (#16993)
6 years ago
Yibing Liu 3c375751f8
Support seq len equal to 0 in sequence ops (#16935)
6 years ago
Tao Luo c017025531
Merge pull request #16981 from luotao1/disable_runtime_context_default
6 years ago
Yibing Liu 36c05d36ab
Check some shapes only in runtime (#16919)
6 years ago
Tao Luo aa7b975bf6 disable runtime_context_cache pass by default
6 years ago
Zhaolong Xing 27cd3efdd1
Merge pull request #16969 from NHZlX/fix_trt_anakin_compile_rely
6 years ago
tianshuo78520a 3242e88b70 fix cmakelist detecting problems (#16944)
6 years ago
jiaqi 8bcba3db84
Merge pull request #16896 from xjqbest/develop
6 years ago
nhzlx bc6b0ca1f4 fix trt anakin subgraph compile rely
6 years ago
guru4elephant bbc6c5714f
Merge pull request #16887 from guru4elephant/add_nccl_context_pybind
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
lujun dbf66dd034
Merge pull request #16954 from junjun315/fix-dygraph-checkpoint
6 years ago
Tao Luo aa9caa1691
Merge pull request #16951 from luotao1/reduce_ci_time
6 years ago
Guo Sheng 9f1d4a152b
Merge pull request #16902 from guoshengCS/refine-infer-shape
6 years ago
Guo Sheng caf2848356
Merge pull request #16898 from Superjomn/fix/logical_op_infershape
6 years ago
lujun a7c11979ba fix dygraph save/load checkpoint error, test=develop
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
tangwei12 2b61db07d1
fix sampling id op bug (#16909)
6 years ago
Tao Luo 5b1565a7be
Merge pull request #16875 from lidanqing-intel/lidanqing/improve_preprocess_script
6 years ago
Kevin c474e7ddf5 fix overflow by int32 mul test=develop (#16794)
6 years ago
Hongyu Liu baf60e3a27
Merge pull request #16907 from xuezhong/fix_infershape_bug2
6 years ago
Yan Chunwei 8cff2b4231
Update logical_op.cc
6 years ago
Hongyu Liu 40be9590d4
Merge pull request #16897 from velconia/fix_split_lod_tensor_op_infer_shape
6 years ago
Hongyu Liu d68fb792f8
Merge pull request #16890 from colourful-tree/dev
6 years ago
Hongyu Liu ad2a2bb063
Merge pull request #16913 from phlrain/fix_bpr_loss
6 years ago
Hongyu Liu 8bd549bb68
Merge pull request #16861 from tensor-tang/refine/infershape
6 years ago
Hongyu Liu 9d5d44f939
Merge pull request #16840 from phlrain/fix_shape_check_many
6 years ago
dongdaxiang 2ab2869c2d fix GPU compile error problem
6 years ago
dongdaxiang 466d177d09 add pybind dependency
6 years ago
SunGaofeng 0508c9869c
Merge pull request #16853 from SunGaofeng/affine_modify
6 years ago
tangwei12 008fd785fd
fix/positive negative pair op (#16895)
6 years ago
Hongyu Liu d5a7c09856
Merge pull request #16798 from phlrain/softmax_cross_support_high_rank
6 years ago
xiaoting 431eab648e
Merge branch 'develop' into yolov3_loss
6 years ago
xuezhong 9c6ee7cf4c add <memory>
6 years ago
xuezhong 742d758747 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_infershape_bug2
6 years ago
Kaipeng Deng 5d45eb06f9
Merge pull request #16858 from heavengate/fix_yolo_param
6 years ago
phlrain ddd9e1cb66 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bpr_loss
6 years ago
phlrain 01eda557cd fix bpr loss; test=developp
6 years ago
xuezhong 41740519df add <memory>
6 years ago
xuezhong 4791029f19 remove <memory>
6 years ago
xuezhong fb75bd3e9c remove <memory>
6 years ago
xuezhong afbc435adf fix infershape check bug
6 years ago
Yan Chunwei 916930a8ae
Update logical_op.cc
6 years ago
xjqbest 10991e00a9 fix bug of num > INT_MAX
6 years ago
jerrywgz f4626ee425
Merge pull request #16873 from jerrywgz/roi_align_infer_shape
6 years ago
xiaoting ccc3bd70c1 polish doc for uniform_random and multi_box_head (#16864)
6 years ago
xuezhong 5663fbfb0a fix infershape bug
6 years ago
tensor-tang be18636e59 Merge remote-tracking branch 'ups/develop' into refine/infershape
6 years ago
dongdaxiang 4aa6f679b5 add pybind dependency
6 years ago
xjqbest 241120d94d fix bug of num > INT_MAX
6 years ago
Hongyu Liu 0701c2db47
Merge pull request #16518 from zhoukunsheng/rsqrt
6 years ago
Hongyu Liu bbcfa8ffb2
Merge pull request #16493 from zhoukunsheng/zeros_like
6 years ago
xjqbest dac70ad4c5 fix bug of num > INT_MAX
6 years ago
guosheng f641a47bb1 Refine ENFORCE in infer_shape of gru_op and lstm_unit_op.
6 years ago
tensor-tang ed892ebaf9 update
6 years ago
tensor-tang 411b9ba520 update
6 years ago
superjomn 0c233e8870 up
6 years ago
superjomn f0985cecb9 fix logical op infershape
6 years ago
minqiyang 592011bbcf Fix infer shape of split lod tensor op
6 years ago
xjqbest 74471397cf fix bug of num > INT_MAX
6 years ago
Tao Luo 34aecb09a9
Merge pull request #16881 from NHZlX/fix_trt_ci_times_too_long
6 years ago
phlrain d722841622 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
phlrain 5309b081f6 simple code; test=develop
6 years ago
liuwei1031 6864370a9e
scatter_op bug fix, test=develop (#16866)
6 years ago
jerrywgz 46bd853c10
Merge pull request #16843 from ceci3/infershape
6 years ago
Hongyu Liu 779ffb844b
Merge pull request #16876 from tink2123/infer_shape
6 years ago
tianshuo78520a 69bdcfa65d test=develop (#16839)
6 years ago
zhoukunsheng f9223c5fa9 Logical compare (#16513)
6 years ago
phlrain 766c868199 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
Tao Luo d966faae95
Merge pull request #16852 from sneaxiy/fix_merge_lod_tensor_op_infer_shape
6 years ago
phlrain f7a5a98fdb remove unused code; test=develop
6 years ago
heqiaozhi 1cca7114c6 fix infer
6 years ago
jerrywgz c139f1e049 refine roi align infer shape, test=develop
6 years ago
Hongyu Liu 208abe9763
Merge pull request #16787 from phlrain/fix_concat_shape_2
6 years ago
zhaoyuchen2018 44bd3a630e
Merge pull request #16857 from zhaoyuchen2018/sumreshape
6 years ago
whs 6429877816
Fix infer_shape in pad2d_op (#16831)
6 years ago
乔龙飞 Qiao Longfei 8a7daeea4c
Merge pull request #16871 from jacquesqiao/fix-shape
6 years ago
dongdaxiang b091139049 add nccl wrapper for python API
6 years ago
Jacek Czaja 87a44b1149 [MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667)
6 years ago
liuwei1031 072db0938b
optimize lstmp and sample_logits op, test=develop (#16845)
6 years ago
phlrain a5d1f9cf66 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_check_many
6 years ago
phlrain 87916f8d84 simple code;test=develop
6 years ago
tink2123 e0f7bf4f2f polish the code
6 years ago
Jiabin Yang 84b7a7291e test=develop, fix hsigmoid dereference nullptr (#16769)
6 years ago
dongdaxiang fff795e5c8 add nccl_wrapper
6 years ago
root 1965a22488 minus trt ci times.
6 years ago
Kaipeng Deng 19bb53fa61
Merge pull request #16850 from heavengate/fix_infer_shape
6 years ago
Hongyu Liu 2de7f3cfc3
Merge pull request #16799 from phlrain/sigmoid_corss_entropy_support_high_rank
6 years ago
tink2123 ffe81af073 modified infer shape
6 years ago
Tao Luo a67fbffdca
Merge pull request #16854 from luotao1/conv_shift_infershape
6 years ago
Qiao Longfei 0e663d7f51 fix split_byref_op infer shape
6 years ago
phlrain 7e933056ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
phlrain 64bf752dcc fix concat; test=develop
6 years ago
Hongyu Liu c96ee47d01
Merge pull request #16797 from phlrain/fix_split
6 years ago
ceci3 74fc786097 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape
6 years ago
ceci3 dd4c54cd37 test=develop
6 years ago
colourful-tree 434caab21b
Merge pull request #16741 from colourful-tree/dev
6 years ago
zhaoyuchen aeddb14148 Fix sum infershape issue
6 years ago
tensor-tang 10879a3cae separate runtime infershape
6 years ago
Tao Luo ca8b8fa0bd
Merge pull request #16830 from Superjomn/fix/tmp-memory-optim
6 years ago
Hongyu Liu e9cdd0e0cd
Merge pull request #16826 from zhoukunsheng/all_any
6 years ago
dengkaipeng 7b1702d9a1 fix unittest and API.spec. test=develop
6 years ago
SunGaofeng 76888b0ba1 modify in pad_op and pad_constant
6 years ago
lijianshe02 de26df440b add SaveOptimModel interface in analysis_predictor.h and test it in a… (#16441)
6 years ago
Zhen Wang cabea96789
Merge pull request #16838 from wzzju/fix_quan_transform
6 years ago
Tao Luo 6f0a40fa29 Fix conv_shift_op infershape
6 years ago
dengkaipeng e590588a02 fix for itnerpolate. test=develop
6 years ago
lidanqing de02d40e98 improve preprocess script and read from tar
6 years ago
乔龙飞 Qiao Longfei bcc0d41646
Merge pull request #16822 from jacquesqiao/optimize-merge-add
6 years ago
SunGaofeng 2120f075a3 modify infer shape in pad_op.cc, pad_constant_like_op.cc. No need in psroi_pool_op.cc, crop_op.cc
6 years ago
sneaxiy 4a83522c38 fix merge_lod_tensor_op infer shape, test=develop
6 years ago
wanghaoshuang 89c2bc09ea Fix infer_shape in pad2d_op
6 years ago
dengkaipeng b2dcdb5100 infer shape compatable -1. test=develop
6 years ago
ceci3 55f572b2da Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape
6 years ago
ceci3 87d89dfe14 fix batch_norm and cos_sim infer shape, test=develop
6 years ago
乔龙飞 Qiao Longfei 82cff5ec42
Merge pull request #16762 from jacquesqiao/add-async_sparse_param_update_recorder
6 years ago
phlrain 165a7bd5a1 fix shape check many; test=develop
6 years ago
heqiaozhi aab9ea6ccb out && commit id
6 years ago
Zhen Wang d988a24a14 fix the hang bugs of memory copying. test=develop
6 years ago
Yibing Liu 4267a81afc
Correct the lod level of compiled time in lod_reset (#16790)
6 years ago
guru4elephant 1b75049407
Merge pull request #16788 from guru4elephant/fix_python_codestyle
6 years ago
chengduo c62674f475
Refine StaticRnn (#16707)
6 years ago
chengduo e9409665f7
Refine Fuse Optimize Ops (#16810)
6 years ago
SunGaofeng 1f2afccf30 test=develop (#16783)
6 years ago
superjomn f58c3ec189 fix memory optim temporarily
6 years ago
chengduo d105c06b50
Replace ThreadedExecutor with FastThreadedExecutor (#16650)
6 years ago
tink2123 9b9e5e606c modified api.spec
6 years ago
tink2123 06156b6cb7 polish yolov3 loss annotation
6 years ago
zhoukunsheng bb8ea1637d fix 16823: delete default_grad register for reduce_all, reduce_any
6 years ago
Qiao Longfei faae1b4170 fix cpplint test=develop
6 years ago
zhoukunsheng 4aa594e3e7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng 2d6b4f23f0 test=develop
6 years ago
Qiao Longfei 0a8ff2ecd4 add cpu_merge_add_multi_noduplicated_test test=develop
6 years ago
Qiao Longfei 920a960974 optimize merge add if input rows of all selected rows is not duplicated
6 years ago
zhoukunsheng b1c5820b3f fix merge conflict
6 years ago
Qiao Longfei 1526a3e4da Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
heqiaozhi 759940786e Merge remote-tracking branch 'upstream/develop' into dev
6 years ago
zhoukunsheng 9643f906ed Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rsqrt
6 years ago
phlrain 6bc3932823 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
phlrain a3e5238112 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sigmoid_corss_entropy_support_high_rank
6 years ago
phlrain 715a31b35e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain db0518bb4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
XiaoguangHu 06809ebbb1
Merge pull request #16815 from sneaxiy/fix_new_added_reduce_ops_spec
6 years ago
zhoukunsheng ebf6cf9f18 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng 380df8281f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
Yihua Xu 93cedfdb9c Fix the order while sorting the operators (#16756)
6 years ago
sneaxiy 00b4580f46 fix default_grad_op_desc_maker
6 years ago
Qiao Longfei afc56949c1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
Qiao Longfei d831f1b0ba fix brpc code
6 years ago
heqiaozhi 96d5ec16f6 change API
6 years ago
liuwei1031 85363848a1
Security issue (#16774)
6 years ago
phlrain 468f8ccff9 supprt high rank; test=develop
6 years ago
phlrain bbfc82cc42 softmax corss entropy support high rank
6 years ago
zhoukunsheng 2b2b4ca21e
Merge branch 'develop' into rsqrt
6 years ago
heqiaozhi 5fb9bdc892 add X to grad
6 years ago
Hongyu Liu e2897ba13a
Merge pull request #16432 from zhoukunsheng/linspace
6 years ago
Hongyu Liu 283ae0faaa
Merge pull request #16525 from zhoukunsheng/rank
6 years ago
Hongyu Liu afe0d64c9d
Merge pull request #16320 from zhoukunsheng/all_any
6 years ago
phlrain 026836ffe0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split
6 years ago
phlrain 488e889f3d fix split infer shape; test=develop
6 years ago
ruri 39d6a985bc
fix some comments, include cosine_decay,l2_normalize,pixel_shuffle (#16763)
6 years ago
Qiao Longfei 8b8a0487c7 fix compile test=develop
6 years ago
dongdaxiang a659b37ace make lodtensor_printer usable in gpu setting
6 years ago
guru4elephant aa46caf3d9
Merge pull request #16765 from guru4elephant/gpu_dataset_train
6 years ago
phlrain 3f0d047d1b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_shape_2
6 years ago
phlrain dc6e814686 fix concat shape; test=develop
6 years ago
Wu Yi 8b58732013
remove append_LARS not used api test=develop (#16703)
6 years ago
Tao Luo f96446cade
Merge pull request #16738 from luotao1/high_level_api_test
6 years ago
dongdaxiang 3c2d236815 remove all warnings
6 years ago
Yiqun Liu 112f16143b
Add an option to enable the cache of expected kernel in train phase. (#16724)
6 years ago
liuwei1031 2e07c19a9c
disable memory_optimize and inpalce strategy by default, test=develop (#16760)
6 years ago
dongdaxiang ea07eb8cd2 remove comment in data_feed.cc
6 years ago
Tao Luo 544f91deba add WITH_HIGH_LEVEL_API option, default OFF
6 years ago
guru4elephant e349a7443f
Update nccl_context.h
6 years ago
Qiao Longfei a541c25ab6 fix cpplint test=develop
6 years ago
dongdaxiang 05464e7c5c add gpu training for Executor.train_from_dataset
6 years ago
Qiao Longfei 0608f8ca56 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder
6 years ago
heqiaozhi e9d79dd5d7 ctx.device_context() to CPUPlace
6 years ago
heqiaozhi 44b226eda6 ctx.device_context() to CPUPlace
6 years ago
heqiaozhi fa6ea1e0e6 remove grad X
6 years ago
heqiaozhi 72c9aecfc3 fix doc
6 years ago
heqiaozhi 8de5dc31db add doc
6 years ago
heqiaozhi 5204fb4402 add doc
6 years ago
heqiaozhi 6e5c44d3fe add doc
6 years ago
gongweibao bf606bce8a
Fix grpc log message. (#16735)
6 years ago
Zeng Jinle 9f7b027dce
fix activation grad op desc maker (#16715)
6 years ago
lujun 9bd44b94da
Merge pull request #16561 from junjun315/move-api-to-root
6 years ago
heqiaozhi ba78446cca add continuous value model op
6 years ago
wopeizl 00279fdcc2
modify the build script for new ci test=develop (#16732)
6 years ago
liuwei1031 fdb719a1bf
avoid optimize variable used in subblock, test=develop (#16739)
6 years ago
Kaipeng Deng ed97156461
Merge pull request #16439 from heavengate/resize_scale
6 years ago
heqiaozhi 0c3c5e19d3 add continuous value model op
6 years ago
Tao Luo 1a21d08f12
Merge pull request #16725 from tensor-tang/pass/disable_seqpool
6 years ago
heqiaozhi 54dddee37e add continuous value model op
6 years ago
liuwei1031 a18ef10c87
only use the latest version variable for inplace strategy (#16736)
6 years ago
Huihuang Zheng 2146293d26 Fix op registry (#16677)
6 years ago
Tao Luo 5c364cda3c
Merge pull request #16711 from luotao1/has_attr
6 years ago
tensor-tang d6c1b5a73b disable seqpool concat pass by default saving CI time
6 years ago
baojun 1c8b34ddd2 fix training validation test=develop (#16698)
6 years ago
lujun 92c8ac8a74 merge conflict, test=develop
6 years ago
chengduo 55b15db5af
Add unit test for fuse all_reduce ops (#16699)
6 years ago
luotao1 4098ba29ed reduce hasAttr elapsed time in RunImpl
6 years ago
luotao1 f89a9c5d95 Merge branch 'develop' into has_attr
6 years ago
Tao Luo ad4a1bd13c
Merge pull request #16339 from luotao1/core_opt_choose_kernel
6 years ago
luotao1 6afc97ca6b reduce hasAttr elapsed time in RunImpl
6 years ago
Yan Xu 55e3c6949b
disable reuse port test=develop (#16704)
6 years ago
gongweibao 8b793d0efd
Fix DGC bug. (#16697)
6 years ago
Yiqun Liu 3fe8cb0dd7
Enable the runtime_context_cache pass in train phase (#16640)
6 years ago
Tao Luo 4048a2681f
Merge pull request #16687 from luotao1/reduce_inference_ci_time
6 years ago
Yan Xu 169829c83a fix win gpu test=develop (#16694)
6 years ago
guru4elephant 7d653f0aed
Merge pull request #16652 from xjqbest/dataset_merge_develop
6 years ago
xjqbest 6a57e8075a remove trainer_id in datafeed and dataset
6 years ago
tensor-tang ad45a08351
fix avx option (#16683)
6 years ago
Tao Luo d5c8d4acfe reduce all analyzer_test ci elasped time
6 years ago
luotao1 695f2db6a0 update expected_kernel_cache_pass
6 years ago