Commit Graph

7248 Commits (2e9a836c6f5a451a8bf4e53cb6837299daa069c5)

Author SHA1 Message Date
nhzlx f3a2e4b3d8 1. Add ANAKIN_ROOT compile option
6 years ago
phlrain 77a08750e9 add var name in optimizer; test=develop
6 years ago
chengduo 33965527fd
Add unit test for fuse all reduce (#16354)
6 years ago
Hongyu Liu 18a0f6d97a
Merge pull request #16351 from phlrain/fix_topk_shape_check
6 years ago
Hongyu Liu 15444430b0
Merge pull request #16348 from phlrain/fix_squeeze_check
6 years ago
phlrain 5dc9b51994 fix time; test=develop
6 years ago
phlrain 686b8935fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_floordiv_and_mod
6 years ago
phlrain 18d107c27a add floordiv and mod op; test=develop
6 years ago
phlrain ff112813de Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
phlrain 8274d9d733 fix concat shape check; test=develop
6 years ago
Hongyu Liu 0d779f15f6
Merge pull request #16261 from phlrain/fix_sequence_pad_2
6 years ago
Hongyu Liu 8c81d9949e
Merge pull request #16347 from phlrain/fix_matmul_check
6 years ago
phlrain 4e3b5e727d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
sneaxiy 953214ad97 add more unittest
6 years ago
Kaipeng Deng 5670530ca7 Revert "not use PERSISTENT in batch_norm. test=develop" (#16333)
6 years ago
qingqing01 d2b938ef5a
Refine gradient proto maker and python API for affine_channel_op (#16340)
6 years ago
phlrain b3d0336603 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
phlrain 249546bf45 fix concat shape check; test=develop
6 years ago
Tao Luo 294cdf6f48
Merge pull request #16177 from fc500110/remove_visualizer
6 years ago
Hongyu Liu b3b7e352ee
Merge pull request #16259 from phlrain/fix_lod_reset
6 years ago
phlrain 6a5545aa22 fix squeeze shape check; test=develop
6 years ago
phlrain e7fb344c99 fix squeeze op shape check; test=develop
6 years ago
phlrain 190cfd692c fix squeeze shape check; test=develop
6 years ago
phlrain 0e40298949 fix matmul shape check; test=develop
6 years ago
phlrain 56c2d384c7 add elementwise floordiv, mod; test=develop
6 years ago
Wu Yi b7baeed7bb fix win gpu build test=develop (#16334)
6 years ago
liuwei1031 df5d19aa9d
temoprarily disable the code of use kCUDNN, test=develop (#16205)
6 years ago
ruri 09e05a110b
Merge pull request #16217 from ceci3/doc
6 years ago
zhhsplendid 124f1df481 Add flags for init and re-alloc gpu
6 years ago
Zhen Wang 8965819fbb rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
6 years ago
Wojciech Uss cbe2dbf0db Add enabling quantization (#16326)
6 years ago
lujun 09442fb27e checkpoint pr be moved here, test=develop
6 years ago
Tao Luo 9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
6 years ago
qingqing01 8caa785e83
Enhance affine_channel_op infer-shape check (#16317)
6 years ago
flame 08838f3909
Fix save inference model bug (#16242)
6 years ago
Kaipeng Deng 957ea995fc
Merge pull request #16243 from heavengate/batch_norm_not_persistent
6 years ago
nhzlx 4f4daa4b66 cherry-pick from feature/anakin-engine: add data type for zero copy #16313
6 years ago
nhzlx 07dcf2856c git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
6 years ago
nhzlx c407dfa3cb cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx a25331bc26 cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx c79f06d3d8 cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx 69d37f81d7 cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
6 years ago
nhzlx a1d200a5de cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
flame a32d420043 cherry-pick from feature/anakin-engine: batch norm (#16110)
6 years ago
flame 0945b97f07 cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op (#16020)
6 years ago
nhzlx b21770a2aa cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018
6 years ago
nhzlx 084310f536 paddle-anakin: concat, split, pool2d converter#16003
6 years ago
flame be523baad2 Add anakin conv2d/relu/sigmoid/tanh converter (#15997)
6 years ago
Yan Chunwei d0ce6a9044 fix anakin converter registry (#15993)
6 years ago
Tao Luo a5124ee0bb
Merge pull request #16301 from luotao1/runtime_context_pass
6 years ago
lujun 622fe6a56b checkpoint pr be moved here, test=develop
6 years ago
baojun 2de263a5d9 Add softmax_with_cross_entropy_op to ngraph engine (#16304)
6 years ago
sneaxiy bb166a1e10 fix API.spec
6 years ago
ruri a3b8028d46
Merge pull request #16202 from shippingwang/add_sqrt_doc
6 years ago
phlrain dd080b17c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
6 years ago
phlrain 1580be5d6c fix sequence pad; test=develop
6 years ago
dengkaipeng aba2713ffc fix comment. test=develop
6 years ago
chengduo f26ba5bddd
Fuse AllReduce (#15921)
6 years ago
Zeng Jinle d0ef682552
Merge pull request #16274 from sneaxiy/fix_grad_maker
6 years ago
baojun 804afc51db Minor ngraph fix (#16270)
6 years ago
Tao Luo 9195c3bb03
Merge pull request #16280 from luotao1/cos_sim_infershape
6 years ago
Wojciech Uss 104a9f1e27 fix pattern maching conv2d with(out) ResidualData
6 years ago
Wu Yi 6382b62f6b
Collective ops (#15572)
6 years ago
lujun bed0ecf3d2 checkpoint pr be moved here, test=develop
6 years ago
sneaxiy 023a3a3d62 fix op grad maker
6 years ago
luotao1 82af8031d9 add runtime_context_cache_pass
6 years ago
Zhen Wang ec88b6cc5a add channel wise quantization in ir pass.
6 years ago
Tao Luo b9fc80a133
Merge pull request #16287 from PaddlePaddle/revert-16002-runtime_context
6 years ago
whs 18911b6eea
[enhence] Make step_input of dynamic_rnn support custom lod level. (#15972)
6 years ago
zhhsplendid 22715487dc add allocator flags
6 years ago
luotao1 c05af910bc refine cos_sim infershape
6 years ago
Hongyu Liu d3acf68044
Merge pull request #16258 from phlrain/fix_concat_1
6 years ago
Tao Luo 7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy fd23262e0c merge develop, fix conflict
6 years ago
ceci3 c79cdf258e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
6 years ago
ceci3 9d2ccec0f2 test=develop, fix doc
6 years ago
tensor-tang ead558b7f6
Merge pull request #16256 from tensor-tang/refine/seqenum
6 years ago
ceci3 27f7a72641 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
6 years ago
Qiyang Min c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
6 years ago
Zeng Jinle f8df9eb32e fix api doc (#16201)
6 years ago
sneaxiy 3a09693f5c change API name
6 years ago
Jacek Czaja 13816dd4ac [MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)
6 years ago
Yibing Liu 7e20e7691e
Fix the bug in fp16 backward kernel (#16269)
6 years ago
shippingwang 97c6051822 add api.spec, test=develop
6 years ago
Wojciech Uss af03008890 Add cpu_quantize_placement_pass for C-API quantization (#16265)
6 years ago
Tao Luo dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
6 years ago
shippingwang 4f42504eef Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sqrt_doc
6 years ago
sneaxiy e893cbd286 add auto increment best fit allocator
6 years ago
minqiyang 565b19b7a5 fix set data type bug
6 years ago
minqiyang 8364688c30 Fix py_func_op's problem
6 years ago
sneaxiy 161b8ddcaa Merge develop
6 years ago
Zeng Jinle 6429d2a887
Merge pull request #16188 from sneaxiy/fix_const_cast
6 years ago
minqiyang b40e41fbd1 Polish code style
6 years ago
Hongyu Liu 9874530ee8
Update lod_reset_op.cc
6 years ago
xiaolil1 e818fa1004 Enable INT8 transpose kernel for MobileNet-SSD improvement. (#16159)
6 years ago
Hongyu Liu 58c69a9eb9
Update lod_reset_op.cc
6 years ago
Xin Pan 374abcf361
Merge pull request #16247 from panyx0718/imperative
6 years ago
Hongyu Liu 96c54cd9b1
Update lod_reset_op.cc
6 years ago
Tao Luo c072998ac1
Merge pull request #16219 from luotao1/fc_infershape
6 years ago
phlrain e46a94a4a2 fix lod_rest; test=develop
6 years ago
phlrain 10ececb213 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
6 years ago
phlrain 802b33489a remove resize then seq num == 1; test=develop
6 years ago
phlrain 152beec540 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_lod_reset
6 years ago
tangwei12 8ea4218ce1
update load persistables for increment, test=develop (#15576)
6 years ago
phlrain a21fdde218 fix lod reset op;test=develop
6 years ago
phlrain dcba2e7236 fix conncat; test=develop
6 years ago
phlrain 955fad7a90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
6 years ago
phlrain a7fe3b508e fix concat; test=develop
6 years ago
tensor-tang 50931dee1d refine seq enum op
6 years ago
Qiyang Min 8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
6 years ago
Xin Pan 3e9319f3ab add more imperative layer tests.
6 years ago
luotao1 d9f0e7252a refine with comments
6 years ago
luotao1 6fa52f83ba Merge branch 'develop' into fc_infershape
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
dengkaipeng 9d435973d2 not use PERSISTENT in batch_norm. test=develop
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
Xin Pan 7458114b5b
Merge pull request #16228 from panyx0718/imperative
6 years ago
chengduo efca4de78e Fix cross_entropy bug (#16236)
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
sneaxiy 4b073c95dc fix compiler
6 years ago
Tao Luo 38898c2808
Merge pull request #16212 from Aurelius84/develop
6 years ago
minqiyang db0c970823 Polish code
6 years ago
Kaipeng Deng b77ebb2af2
Merge pull request #15919 from heavengate/yolo_box
6 years ago
minqiyang 362253732c Polish code
6 years ago
minqiyang c0ddb93ccc Polish code
6 years ago
minqiyang b5078c211a Make infer var type virtual
6 years ago
minqiyang 9041b238e3 Polish code
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
Xin Pan 50ff898378 graph neural network for imperative mode
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
ceci3 cd906fc603 change api.spec
6 years ago
Zhaolong Xing c49e604906
Merge pull request #16213 from qingqing01/compile_infer_shape
6 years ago
achao2013 81b4fad8b9 add moving average absmax op and fix bug (#15155)
6 years ago
luotao1 721c2c00ef refine fc_infershape
6 years ago
Kaipeng Deng 74037cc1c8
Merge branch 'develop' into yolo_box
6 years ago
Xin Pan 92b9ce3479
Merge pull request #16073 from heavengate/yolov3_loss_imporve
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
6 years ago
Aurelius84 2d1e76fb0c fix API.spec test=develop
6 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
6 years ago
Aurelius84 6cfd20dea8 fix words spell error test=develop
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
shippingwang 98d9552f0f update sqrt explaination, test=develop
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
Yibing Liu 4ae23cc3c5
Impl fp16 compute kernel for slice_op (#16206)
6 years ago
sneaxiy f0d108f589 fix const_cast
6 years ago
Dang Qingqing e5e7628a62 Skip compile infer shape in box_coder_op
6 years ago
Aurelius84 a59b7d47a8 improve layers.fc api doc test=develop
6 years ago