Commit Graph

2192 Commits (cc4def6ba5a640845ee5b2cf84d1366837aab118)

Author SHA1 Message Date
sneaxiy f8ed2c229e try to fix ci error
6 years ago
Wojciech Uss 46677fb080 Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
sneaxiy c20db6357b split PR
6 years ago
sneaxiy 072d95d8f6 Merge develop
6 years ago
sneaxiy a93a9eef8f add op registry type
6 years ago
chengduo a6a3b2fbbc
[Speed]Refine ParallelExecutor (#16190)
6 years ago
nhzlx 3df7b98a0f Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
chengduo 33965527fd
Add unit test for fuse all reduce (#16354)
6 years ago
sneaxiy 953214ad97 add more unittest
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
Tao Luo 9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
6 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
Tao Luo a5124ee0bb
Merge pull request #16301 from luotao1/runtime_context_pass
6 years ago
chengduo f26ba5bddd
Fuse AllReduce (#15921)
6 years ago
Zeng Jinle d0ef682552
Merge pull request #16274 from sneaxiy/fix_grad_maker
6 years ago
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData
6 years ago
Wu Yi 6382b62f6b
Collective ops (#15572)
6 years ago
sneaxiy 023a3a3d62 fix op grad maker
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy fd23262e0c merge develop, fix conflict
6 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
6 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
6 years ago
Wojciech Uss af03008890 Add cpu_quantize_placement_pass for C-API quantization (#16265)
6 years ago
Tao Luo dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
6 years ago
minqiyang b40e41fbd1 Polish code style
6 years ago
Qiyang Min 8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
minqiyang 362253732c Polish code
6 years ago
minqiyang c0ddb93ccc Polish code
6 years ago
minqiyang b5078c211a Make infer var type virtual
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
6 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
liuwei1031 1c6caf8466
1. disable reuse SELECTED_ROWS type variable (#16150)
6 years ago
Wojciech Uss b9252f3df8 Add cpu_quantize_squash_pass for C-API quantization (#16128)
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a7a4f053da Merge develop
6 years ago
Tao Luo 4ef6f738c3
Merge pull request #16154 from luotao1/infershape_example
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
sneaxiy 682f2dbf29 merge develop
6 years ago
sneaxiy 2c4fcaa683 merge develop
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Yan Xu 30568473ec
fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
Zhen Wang 94b7c1ea7b
Merge pull request #16107 from wzzju/add_graph_clone
6 years ago
wopeizl 85709f4378
restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
6 years ago
luotao1 8f6597aa0e Merge branch 'develop' into infershape_example
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
Zeng Jinle 472f16b5aa
Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
chengduo ad80bde824
Revert "Revert "Add Event for TensorCopy"" (#16035)
6 years ago
sneaxiy 732fa00eaf disable gc in recurrent_op currently
6 years ago
Yihua Xu 40f1dd818b Fix the node's order issue when the content of graph is changed (#16088)
6 years ago
Zhaolong Xing 3d63aa0a11
Merge pull request #15729 from NHZlX/add_static_model_load_for_trt
6 years ago
Wu Yi d206582337
add parallel graph dist test (#16076)
6 years ago
sneaxiy 2a639d5c2a add allocator chain to fix bug
6 years ago
liuwei1031 045e5911bf
fix a code bug which cause crash when empty variable is used, test=develop (#16080)
6 years ago
sneaxiy 7b608396fe fix travis-ci format check
6 years ago
Tao Luo 6375fe45d7
Merge pull request #16039 from luotao1/execution_context
6 years ago
sneaxiy 814a759061 merge develop
6 years ago
sneaxiy 597dc65e76 enhance gc
6 years ago
liuwei1031 caadd0581d
add IfElse test case for ir memory optimize (#15998)
6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext
6 years ago
luotao1 c0b240aa43 try to fix distributed unit-test
6 years ago
luotao1 784826a4f5 enhance cache runtime_context for different scope
6 years ago
chengduo e2da3a5b22
Revert "Add Event for TensorCopy" (#16022)
6 years ago
luotao1 2fb38c108c Merge branch 'develop' into runtime_context
6 years ago
sneaxiy a9ea99d700 merge develop
6 years ago
chengduo ae37f82964
Unified ParallelExecutor and Compiler (#15970)
6 years ago
chengduo 7235fd662b
Add Event for TensorCopy (#15953)
6 years ago
luotao1 82b0bb9d72 fix cpplint error
6 years ago
luotao1 9773f38f99 cache runtime_context
6 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf (#15986)
6 years ago
Qiyang Min 187cffd019
Merge pull request #15928 from velconia/imperative_backward_hooks
6 years ago
Yiqun Liu 798925453e
Revert "Optimize while_op when is_test is true. (#15811)" (#15968)
6 years ago
minqiyang e5f3435dd5 Add missing headers
6 years ago
minqiyang 50639fafdb Polish code
6 years ago
Yiqun Liu 613d9d0756
Optimize while_op when is_test is true. (#15811)
6 years ago
nhzlx 2eff3e26b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
6 years ago
nhzlx 06a088a199 fix comments and fix cpplint
6 years ago
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
6 years ago
Tao Luo d5a888e15c
Merge pull request #15943 from kbinias/kbinias/add-placement-pass-tester
6 years ago
Krzysztof Binias 72253391b6 Add MKL-DNN placement pass tester
6 years ago