Commit Graph

2058 Commits (717bbc087b639d1182b2b4b0401b0382990084a6)

Author SHA1 Message Date
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Qiao Longfei 62f1248ff5 fix use gpu test=develop
6 years ago
Xin Pan 6019054cdd
Merge pull request #15716 from Yancey1989/refine_pg
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
minqiyang f53e1d5c4b implement ClearBlock
6 years ago
dzhwinter f2e8409f5a
Merge pull request #15795 from dzhwinter/fix/block_desc
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
Yancey1989 4b193db14c polish code test=develop
6 years ago
dzhwinter 6deb17ed8c fix default value. test=develop
6 years ago
dzhwinter 089d262c41 fix default value. test=develop
6 years ago
dzhwinter 9c92d0304f fix default value. test=develop
6 years ago
Yancey1989 d5090c892d polish code test=develop
6 years ago
dzhwinter 28609b3435
Merge pull request #15696 from dzhwinter/cherry-pick/memory
6 years ago
Yancey1989 0f8bd73cc9 cleanup code test=develop
6 years ago
Yancey1989 5677c9d4ee update comment test=develop
6 years ago
Yancey1989 642fd68ce0 update by comment test=develop
6 years ago
dzhwinter d94a314db5 add reference. test=develop
6 years ago
dzhwinter 591ad33e32 polish code for reading. test=develop
6 years ago
dzhwinter 18afb77e78 polish code for reading. test=develop
6 years ago
Yan Chunwei 077d12b939
fix scale cleaner (#15742)
6 years ago
dzhwinter 684b572307 polish code for reading. test=develop
6 years ago
dzhwinter 3787e61fca polish code for reading. test=develop
6 years ago
dzhwinter c1455e606d Merge remote-tracking branch 'origin/develop' into cherry-pick/memory
6 years ago
dzhwinter d376cf71b7 polish code for reading. test=develop
6 years ago
Yancey1989 7cd6de37f5 fix cpu test=develop
6 years ago
Yancey1989 bd0d44af24 fix build failed test=develop
6 years ago
Yancey1989 ecdd1166b8 cleanup code test=develop
6 years ago
Yancey1989 73005ee00d cleanup code test=develop
6 years ago
Yancey1989 88d3dc949e Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
6 years ago
Yancey1989 f3463ecb6e refine pg execution
6 years ago
dzhwinter 283573c6aa add details. test=develop
6 years ago
chengduo 5a03b515ae
fix potential bug in async_executor (#15707)
6 years ago
乔龙飞 Qiao Longfei 45b19cbc9a
Revert "Revert "cpu reduce mode did not need to broadcast params test=develop""
6 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
6 years ago
dzhwinter 6d6ddcfe15 add details. test=develop
6 years ago
dzhwinter f9ac88e1a0
Merge pull request #15694 from liuwei1031/fix_security_issue
6 years ago
dzhwinter 11afbe0f53 add details. test=develop
6 years ago
tensor-tang e49706c80e
Merge pull request #15659 from GBuella/add_to_string
6 years ago
liuwei1031 b1f97a6fa9 fix security issue 27, 38 test=develop
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
kolinwei acfe28d5eb
Merge pull request #15684 from PaddlePaddle/revert-15661-fix-cpu-broadcast
6 years ago
Xin Pan d670d8ef1d
Merge pull request #15671 from cjld/fix_graph
6 years ago
乔龙飞 Qiao Longfei 6e0e706198
Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Qiao Longfei 97b143fb49 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast
6 years ago
dzhwinter 381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
6 years ago
Qiao Longfei ffd0d1d216 clean need_broadcast_var_ test=develop
6 years ago
Qiao Longfei fbadd4b60c follow comment test=develop
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
baojun f4a0e68481 Fix ngraph compile WITH_DISTRIBUTE=ON (#15636)
6 years ago
Dun Liang 1905f1a108 bug fix && test=develop
6 years ago
Qiao Longfei 76072261f8 fix compiler
6 years ago
Qiao Longfei b99db0e2c2 cpu reduce mode did not need to broadcast test=develop
6 years ago
Gabor Buella 4975a9050a Tests - add some missing to_string calls
6 years ago
wopeizl c1e18b13aa
Merge pull request #15635 from wopeizl/fixbuildissue
6 years ago
dzhwinter 5d30b55de1 rerun ci. test=develop
6 years ago
dzhwinter 4ef34916a4 enhanced print message. test=develop
6 years ago
peizhilin 238ef94702 fix the build issue on gpu mode for win
6 years ago
dzhwinter ce0394bcd0 merge develop branch. test=develop
6 years ago
Xin Pan 74bc55c2a6
Merge pull request #14975 from dzhwinter/ir_inplace_pass
6 years ago
dzhwinter cca71532eb add skip send.recv test=develop
6 years ago
dzhwinter 9f001c6525 skip dist. test=develop
6 years ago
Yan Chunwei dc5e25fc7f
remove dot marked node (#15606)
6 years ago
dzhwinter 2561a6fc59 follow comment. test=develop
6 years ago
dzhwinter 2a5ecb68b0 follow comment. test=develop
6 years ago
dzhwinter 9f693fcac4 rerun ci. test=develop
6 years ago
dzhwinter e537634d16 delete graph print pass. test=develop
6 years ago
dzhwinter 4f01de6378 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031 6e84eb131f expose peak gpu memory API to python test=develop (#15529)
6 years ago
dzhwinter 5cab99a686 fuck windows. rerun windows ci. test=develop
6 years ago
dzhwinter 9c9ad7d40b Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 0a63234c85 follow comments. test=develop
6 years ago
Yan Chunwei 897789b16e
fix save_inferece_model bug (#15365)
6 years ago
dzhwinter 9e87fbebb7 rerun windows ci. test=develop
6 years ago
dzhwinter 6f9904e99a rerun windows ci. test=develop
6 years ago
dzhwinter a52be7c081 refine build strategy. test=develop
6 years ago
dzhwinter 32a2014939 refine build strategy. test=develop
6 years ago
Krzysztof Binias b1bdcd4de8 Make separate folders for mkldnn codes
6 years ago
dzhwinter 06f2448848 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 8156fedf56 merge develop branch. test=develop
6 years ago
Jiabin Yang fd286f3596
Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
dzhwinter d6d3e6afe2 add more skip strategy
6 years ago
Yan Chunwei b62b756b28
add version support (#15469)
6 years ago
tensor-tang 3c224e7e79
Merge pull request #15537 from baojun-nervana/rm_ngraph_operator
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Jiabin Yang 10bc9ffc2d
Merge pull request #15518 from JiabinYang/fix/refine_error_message
6 years ago
dzhwinter 2739096eec compatibable with python side mem_opt
6 years ago
gongweibao d303270a0e
revert test=develop (#15535)
6 years ago
baojun-nervana 8e9308a51a mv ngraph_bridge to ngraph directory test=develop
6 years ago
baojun-nervana da3f9cc512 rm ngraph_operator.cc test=develop
6 years ago
JiabinYang 5639f49b16 test=develop, fix/multi_output_support_imperative
6 years ago
gongweibao d54494ba87
cleanup test=develop (#15347)
6 years ago
JiabinYang c52f57de5b test=develop, refine_error_message for data type
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
nhzlx 0779e35544 fix two bug:
6 years ago
Zeng Jinle dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
6 years ago
Xin Pan 58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
6 years ago
sneaxiy ef788603d4 merge develop
6 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
6 years ago