Commit Graph

2421 Commits (4a1b7fec967837ca283210aa669da3513c9f1aa1)

Author SHA1 Message Date
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
minqiyang 362253732c Polish code
6 years ago
minqiyang c0ddb93ccc Polish code
6 years ago
minqiyang b5078c211a Make infer var type virtual
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
6 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
liuwei1031 1c6caf8466
1. disable reuse SELECTED_ROWS type variable (#16150)
6 years ago
Wojciech Uss b9252f3df8 Add cpu_quantize_squash_pass for C-API quantization (#16128)
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a7a4f053da Merge develop
6 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
sneaxiy 682f2dbf29 merge develop
6 years ago
sneaxiy 2c4fcaa683 merge develop
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
Zhen Wang 94b7c1ea7b
Merge pull request #16107 from wzzju/add_graph_clone
6 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
6 years ago
luotao1 8f6597aa0e Merge branch 'develop' into infershape_example
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
Zeng Jinle 472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
chengduo ad80bde824
Revert "Revert "Add Event for TensorCopy"" (#16035)
6 years ago
sneaxiy 732fa00eaf disable gc in recurrent_op currently
6 years ago
Qiao Longfei ff8054c5a7 can run
6 years ago
Yihua Xu 40f1dd818b Fix the node's order issue when the content of graph is changed (#16088)
6 years ago
Zhaolong Xing 3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
6 years ago
Qiao Longfei 3225e19591 fix remove recv op
6 years ago
Qiao Longfei fe6a840924 fix delete recv ops
6 years ago
Wu Yi d206582337
add parallel graph dist test (#16076)
6 years ago
Qiao Longfei 446fdf9563 fix compile problem
6 years ago
Qiao Longfei a23f1ee85a optimize code
6 years ago
Qiao Longfei a0bb18beec Merge branch 'add-async-ssa-graph-executor' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
sneaxiy 2a639d5c2a add allocator chain to fix bug
6 years ago
liuwei1031 045e5911bf
fix a code bug which cause crash when empty variable is used, test=develop (#16080)
6 years ago
sneaxiy 7b608396fe fix travis-ci format check
6 years ago
Qiao Longfei 255b36dad2 can run
6 years ago
Qiao Longfei 5e8de51409 code format test=develop
6 years ago
Qiao Longfei 4e218dabc5 code format test=develop
6 years ago
Tao Luo 6375fe45d7
Merge pull request #16039 from luotao1/execution_context
6 years ago
sneaxiy 814a759061 merge develop
6 years ago
sneaxiy 597dc65e76 enhance gc
6 years ago
liuwei1031 caadd0581d
add IfElse test case for ir memory optimize (#15998)
6 years ago
Qiao Longfei f28c258453 code clean test=develop
6 years ago
Qiao Longfei 8c38aca954 tmp commit
6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
6 years ago
Liu Yiqun 1041e18c47 Refine codes.
6 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
6 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
6 years ago
Qiao Longfei fab1b54d99 Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Qiao Longfei 8744f9a083 fix parallel executor async mode
6 years ago
Qiao Longfei e70b1727ef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Liu Yiqun d8a939d8a8 Merge branch 'develop' into core_opt_choose_kernel
6 years ago
chengduo e2da3a5b22
Revert "Add Event for TensorCopy" (#16022)
6 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a9ea99d700 merge develop
6 years ago
Qiao Longfei 3691a46fa3 improve communicator
6 years ago
chengduo ae37f82964
Unified ParallelExecutor and Compiler (#15970)
6 years ago
chengduo 7235fd662b
Add Event for TensorCopy (#15953)
6 years ago
luotao1 82b0bb9d72 fix cpplint error
6 years ago
Liu Yiqun d4674dab13 Cache the chosen kernel of operators'.
6 years ago
luotao1 9773f38f99 cache runtime_context
6 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf (#15986)
6 years ago
Qiao Longfei 847e4f4e85 pure async mode train
6 years ago
Qiyang Min 187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
6 years ago
Yiqun Liu 798925453e
Revert "Optimize while_op when is_test is true. (#15811)" (#15968)
6 years ago
minqiyang e5f3435dd5 Add missing headers
6 years ago
minqiyang 50639fafdb Polish code
6 years ago
Yiqun Liu 613d9d0756
Optimize while_op when is_test is true. (#15811)
6 years ago
nhzlx 2eff3e26b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
6 years ago
nhzlx 06a088a199 fix comments and fix cpplint
6 years ago
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
6 years ago
Tao Luo d5a888e15c
Merge pull request #15943 from kbinias/kbinias/add-placement-pass-tester
6 years ago
Krzysztof Binias 72253391b6 Add MKL-DNN placement pass tester
6 years ago
minqiyang cb85ee987b Remove var op deps in imperative mode
6 years ago
Tao Luo effec86600
Merge pull request #15913 from liangan1/func_coverage
6 years ago
Qiao Longfei 49f2f4f91d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
Qiao Longfei f768fbf715 support multi graph
6 years ago
Tao Luo 60546b78cc
Merge pull request #15923 from Sand3r-/mgallus/conv-residual-ut
6 years ago
Qiao Longfei ff01d70583 fix style
6 years ago
Qiao Longfei dab7f36909 optimize code test=develop
6 years ago
Qiao Longfei cf0511f21e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Michal Gallus 6a2bc9a275 Add Conv Residual Connection UT for Projection
6 years ago
Qiao Longfei 43c82376cb use one graph
6 years ago
dzhwinter 660e410655
Merge pull request #15855 from dzhwinter/fix/nightly_test
6 years ago
minqiyang b420ec3a92 invoke backward_hooks after reduce op's depcounts map
6 years ago
Qiyang Min 4bd28b304b
Merge pull request #15831 from velconia/imperative_engine
6 years ago
Xin Pan a6e3cd5eb7
Merge pull request #15425 from panyx0718/api
6 years ago
Qiao Longfei b8491bfd4e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
liangan1 4acc522087 Enable function coverage for U8/S8 ConvMKLDNNOpKernel
6 years ago
Xin Pan 44e7fcddc5
Merge pull request #15844 from panyx0718/infer
6 years ago
dzhwinter a71f2fbe4f fix default value. test=develop
6 years ago
Jacek Czaja dec9cf53c8 [MKL-DNN] MKL-DNN specific Tensor modification (#15429)
6 years ago
minqiyang 84bf4d7b06 Move ClearBlock into OpBase and VarBase's destructor
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
Qiao Longfei 10393dd0d1 add some check test=develop
6 years ago
乔龙飞 Qiao Longfei ec8e878200
Merge pull request #15840 from jacquesqiao/revert-15684-revert-15661-fix-cpu-broadcast
6 years ago
minqiyang a15a3fc314 Polish code
6 years ago
Qiao Longfei 2b7931d5c9 refine code test=develop
6 years ago
Qiao Longfei b5b8e6cc9c revert the change of scope test=develop
6 years ago
Xin Pan 8d83e38a6b remove mutex
6 years ago
Xin Pan 0362ef75f4 fix
6 years ago
minqiyang 9dc64edfd9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_engine
6 years ago
Xin Pan 12a0e2ed9d polish codes
6 years ago
Xin Pan 19d78f6797 polish
6 years ago
Qiao Longfei ecedd531c1 fix code bug test=develop
6 years ago
Qiao Longfei f4f4816b0c fix gpu error test=develop
6 years ago
Xin Pan 32d5a16036 resolve conflicts
6 years ago
Qiao Longfei 3f9263f67e optimize style test=develop
6 years ago
Qiao Longfei 4233d0a820 add more comment test=develop
6 years ago
Michał Gallus c4faf36e7a MKL-DNN: Add test for conv bias fuse pass (#15824)
6 years ago
Qiao Longfei 3bccc1e6e2 optimize broadcast logic test=develop
6 years ago
Tao Luo 3831a4695d
Merge pull request #15862 from sfraczek/add-override-to-apply_impl
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
Sylwester Fraczek 0b926114c0 add override to ApplyImpl
6 years ago
Sylwester Fraczek 543e53db05 fix typo releated->related
6 years ago
Qiao Longfei 12f6b8c3d6 change the include of ThreadPool.h test=develop
6 years ago
Qiao Longfei 7f3be09045 fix multi graph test=develop
6 years ago
Qiao Longfei 9465c3d0c3 fix compile problem
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Qiao Longfei 31a05d3efd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 62f1248ff5 fix use gpu test=develop
6 years ago
Xin Pan 6019054cdd
Merge pull request #15716 from Yancey1989/refine_pg
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Qiao Longfei cc71e89499 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
minqiyang f53e1d5c4b implement ClearBlock
6 years ago
dzhwinter f2e8409f5a
Merge pull request #15795 from dzhwinter/fix/block_desc
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
Yancey1989 4b193db14c polish code test=develop
6 years ago
dzhwinter 6deb17ed8c fix default value. test=develop
6 years ago
dzhwinter 089d262c41 fix default value. test=develop
6 years ago
dzhwinter 9c92d0304f fix default value. test=develop
6 years ago
Yancey1989 d5090c892d polish code test=develop
6 years ago
dzhwinter 28609b3435
Merge pull request #15696 from dzhwinter/cherry-pick/memory
6 years ago
Yancey1989 0f8bd73cc9 cleanup code test=develop
6 years ago
Yancey1989 5677c9d4ee update comment test=develop
6 years ago
Yancey1989 642fd68ce0 update by comment test=develop
6 years ago
dzhwinter d94a314db5 add reference. test=develop
6 years ago
dzhwinter 591ad33e32 polish code for reading. test=develop
6 years ago
dzhwinter 18afb77e78 polish code for reading. test=develop
6 years ago
Yan Chunwei 077d12b939
fix scale cleaner (#15742)
6 years ago
dzhwinter 684b572307 polish code for reading. test=develop
6 years ago
dzhwinter 3787e61fca polish code for reading. test=develop
6 years ago
dzhwinter c1455e606d Merge remote-tracking branch 'origin/develop' into cherry-pick/memory
6 years ago
dzhwinter d376cf71b7 polish code for reading. test=develop
6 years ago
nhzlx ecc12fb430 3. when runing in trt mode, do not allocate memory for parameters in fluid.
6 years ago
Yancey1989 7cd6de37f5 fix cpu test=develop
6 years ago
Yancey1989 bd0d44af24 fix build failed test=develop
6 years ago
Yancey1989 ecdd1166b8 cleanup code test=develop
6 years ago
Yancey1989 73005ee00d cleanup code test=develop
6 years ago
Yancey1989 88d3dc949e Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
6 years ago
Yancey1989 f3463ecb6e refine pg execution
6 years ago
dzhwinter 283573c6aa add details. test=develop
6 years ago
chengduo 5a03b515ae
fix potential bug in async_executor (#15707)
6 years ago
乔龙飞 Qiao Longfei 45b19cbc9a
Revert "Revert "cpu reduce mode did not need to broadcast params test=develop""
6 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
6 years ago
dzhwinter 6d6ddcfe15 add details. test=develop
6 years ago
dzhwinter f9ac88e1a0
Merge pull request #15694 from liuwei1031/fix_security_issue
6 years ago
dzhwinter 11afbe0f53 add details. test=develop
6 years ago
tensor-tang e49706c80e
Merge pull request #15659 from GBuella/add_to_string
6 years ago
liuwei1031 b1f97a6fa9 fix security issue 27, 38 test=develop
6 years ago
Gabor Buella da9c94da33 Clang build fixes (#15628)
6 years ago
kolinwei acfe28d5eb
Merge pull request #15684 from PaddlePaddle/revert-15661-fix-cpu-broadcast
6 years ago
Xin Pan d670d8ef1d
Merge pull request #15671 from cjld/fix_graph
6 years ago
乔龙飞 Qiao Longfei 6e0e706198
Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Qiao Longfei 97b143fb49 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast
6 years ago
dzhwinter 381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
6 years ago
Qiao Longfei ffd0d1d216 clean need_broadcast_var_ test=develop
6 years ago
Qiao Longfei fbadd4b60c follow comment test=develop
6 years ago
dzhwinter 04e9776aef add details. test=develop
6 years ago
baojun f4a0e68481 Fix ngraph compile WITH_DISTRIBUTE=ON (#15636)
6 years ago
Dun Liang 1905f1a108 bug fix && test=develop
6 years ago
Qiao Longfei 2171aa77f1 async ssa exe only support local mode
6 years ago
Qiao Longfei c4ded17e8c async mode support dist train
6 years ago
Qiao Longfei 84367cf8bc support async mode in dist mode parallel executor
6 years ago
Qiao Longfei e72637ddd2 ThreadedSSAGraphExecutor support num_iteration_per_run test=develop
6 years ago
Qiao Longfei a7152613f7 Merge branch 'fix-cpu-broadcast' of ssh://github.com/jacquesqiao/Paddle into add-communicator
6 years ago
Qiao Longfei 76072261f8 fix compiler
6 years ago
Qiao Longfei b99db0e2c2 cpu reduce mode did not need to broadcast test=develop
6 years ago
Qiao Longfei 5cf0092825 add more log and fix test_dist_base in multi_batch_merge_pass
6 years ago
Gabor Buella 4975a9050a Tests - add some missing to_string calls
6 years ago
Qiao Longfei b1fe8d4570 add a check for async_ssa_graph_exe test=develop
6 years ago
Qiao Longfei 16af1dbc7b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei 381f383989 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
wopeizl c1e18b13aa
Merge pull request #15635 from wopeizl/fixbuildissue
6 years ago
dzhwinter 5d30b55de1 rerun ci. test=develop
6 years ago
dzhwinter 4ef34916a4 enhanced print message. test=develop
6 years ago
peizhilin 238ef94702 fix the build issue on gpu mode for win
6 years ago
dzhwinter ce0394bcd0 merge develop branch. test=develop
6 years ago
Xin Pan 74bc55c2a6
Merge pull request #14975 from dzhwinter/ir_inplace_pass
6 years ago
dzhwinter cca71532eb add skip send.recv test=develop
6 years ago
dzhwinter 9f001c6525 skip dist. test=develop
6 years ago
Yan Chunwei dc5e25fc7f
remove dot marked node (#15606)
6 years ago
dzhwinter 2561a6fc59 follow comment. test=develop
6 years ago
dzhwinter 2a5ecb68b0 follow comment. test=develop
6 years ago
dzhwinter 9f693fcac4 rerun ci. test=develop
6 years ago
dzhwinter e537634d16 delete graph print pass. test=develop
6 years ago
dzhwinter 4f01de6378 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
liuwei1031 6e84eb131f expose peak gpu memory API to python test=develop (#15529)
6 years ago
dzhwinter 5cab99a686 fuck windows. rerun windows ci. test=develop
6 years ago
dzhwinter 9c9ad7d40b Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 0a63234c85 follow comments. test=develop
6 years ago
Yan Chunwei 897789b16e
fix save_inferece_model bug (#15365)
6 years ago
dzhwinter 9e87fbebb7 rerun windows ci. test=develop
6 years ago
dzhwinter 6f9904e99a rerun windows ci. test=develop
6 years ago
dzhwinter a52be7c081 refine build strategy. test=develop
6 years ago
dzhwinter 32a2014939 refine build strategy. test=develop
6 years ago
Krzysztof Binias b1bdcd4de8 Make separate folders for mkldnn codes
6 years ago
dzhwinter 06f2448848 Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter 8156fedf56 merge develop branch. test=develop
6 years ago
Qiao Longfei d6c0dcaa16 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Jiabin Yang fd286f3596
Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
6 years ago
Qiao Longfei c7e3868007 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
Qiao Longfei 02dab46ab8 add some debug info
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
dzhwinter d6d3e6afe2 add more skip strategy
6 years ago
Yan Chunwei b62b756b28
add version support (#15469)
6 years ago
tensor-tang 3c224e7e79
Merge pull request #15537 from baojun-nervana/rm_ngraph_operator
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Jiabin Yang 10bc9ffc2d
Merge pull request #15518 from JiabinYang/fix/refine_error_message
6 years ago
Qiao Longfei be738a646e add some debug infor
6 years ago
Qiao Longfei 62549e0714 add GenParentScopeTreeDebugInfo
6 years ago
dzhwinter 2739096eec compatibable with python side mem_opt
6 years ago
Qiao Longfei a66115bed5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
Qiao Longfei fab8457e6b code optimize
6 years ago
gongweibao d303270a0e
revert test=develop (#15535)
6 years ago
baojun-nervana 8e9308a51a mv ngraph_bridge to ngraph directory test=develop
6 years ago
baojun-nervana da3f9cc512 rm ngraph_operator.cc test=develop
6 years ago
Qiao Longfei ada43e89c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
6 years ago
JiabinYang 5639f49b16 test=develop, fix/multi_output_support_imperative
6 years ago
gongweibao d54494ba87
cleanup test=develop (#15347)
6 years ago
JiabinYang c52f57de5b test=develop, refine_error_message for data type
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
nhzlx 0779e35544 fix two bug:
6 years ago
Qiao Longfei ca5d96bb3d complete send lod tensor
6 years ago
Zeng Jinle dec89bd7ed
Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock
6 years ago
Xin Pan 58cb18d9d9
Merge pull request #15322 from velconia/imperative_resnet
6 years ago
Qiao Longfei be72940b76 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
6 years ago
sneaxiy ef788603d4 merge develop
6 years ago
sneaxiy d8568acd19 turn on remove_unnecessary_lock
6 years ago
sneaxiy eac5a0aa0c Merge develop
6 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
minqiyang 8ce198b2e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
WangZhen e2ff300b02 add UT for quantization.
6 years ago
WangZhen 451896fce4 init quantization.
6 years ago
Qiao Longfei f3210b60ba fix copy_memory and share_memory
6 years ago
Qiao Longfei 9958775b31 add NewTmpScope to scope
6 years ago
Qiao Longfei 7021979bc2 init communicator
6 years ago
Qiao Longfei 69484f71e0 remote communicator
6 years ago
Qiao Longfei 88d71fa2f9 support num_iteration_per_run
6 years ago
gongweibao 7cd4dd7ce4
Hide varhandle members. (#15382)
6 years ago
Qiao Longfei ea66979684 can run
6 years ago
Qiao Longfei afda840126 init communicator
6 years ago
Qiao Longfei 92a6c7a049 init async ssa executor
6 years ago
tensor-tang 3759c1db8c
Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph
6 years ago
mozga-intel cba729404d Enable softmax operator for a ngraph engine
6 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
6 years ago
乔龙飞 Qiao Longfei b14d4cdd75
Merge pull request #14890 from jacquesqiao/multithread-sparse-adam
6 years ago
Qiao Longfei 9b4fe283e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
peizhilin 5e450833bd test=develop
6 years ago
peizhilin eea75a1d93 fix issue when type is invalid
6 years ago
peizhilin 9adb158e5b Merge remote-tracking branch 'upstream/develop' into debug/support
6 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
6 years ago
chengduo 46d01d798e
Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
6 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
6 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
6 years ago
tensor-tang ca6fdc6e33 refine and fix test
6 years ago
tensor-tang a89296ac1f add repeated fc relu pass
6 years ago
Xin Pan 50b4ac08b0 fix
6 years ago
Xin Pan a1bfb35dd6 try fix py2
6 years ago
Xin Pan 6a18c0f9ff
Merge pull request #15278 from chengduoZH/revert_remove_workspace_handle_in_conv2d_cudnn
6 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
6 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
6 years ago
tensor-tang fc9fbab6a0
Merge pull request #15271 from tensor-tang/fix/typo
6 years ago
chengduo 064512aa47
Remove workspace_handle in conv_cudnn (#15186)
6 years ago
tensor-tang c3a9f3c4b2 fix typo and refine
6 years ago
tensor-tang ab9c4b2a9f refine seqpool concat pass and remove unused nodes
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
flame fb63cd89d4
Add python ir graph API (#14917)
6 years ago
tensor-tang a0a27bd240 add seqpool concat fuse pass tester
6 years ago
sneaxiy 594dc4d8f0 partial gc 1st version
6 years ago
tensor-tang 8e086a8521 follow comment and fix typo
6 years ago
tensor-tang 48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
6 years ago
peizhilin c1235c935f add the enable_debug flag
6 years ago