Commit Graph

7150 Commits (c34b24ede782612464bc4c7cad47c40661616e9d)

Author SHA1 Message Date
Hongyu Liu 9874530ee8 Update lod_reset_op.cc
6 years ago
xiaolil1 e818fa1004 Enable INT8 transpose kernel for MobileNet-SSD improvement. (#16159)
6 years ago
Hongyu Liu 58c69a9eb9 Update lod_reset_op.cc
6 years ago
Xin Pan 374abcf361 Merge pull request #16247 from panyx0718/imperative
6 years ago
Hongyu Liu 96c54cd9b1 Update lod_reset_op.cc
6 years ago
Tao Luo c072998ac1 Merge pull request #16219 from luotao1/fc_infershape
6 years ago
phlrain e46a94a4a2 fix lod_rest; test=develop
6 years ago
phlrain 10ececb213 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
6 years ago
phlrain 802b33489a remove resize then seq num == 1; test=develop
6 years ago
phlrain 152beec540 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_lod_reset
6 years ago
tangwei12 8ea4218ce1 update load persistables for increment, test=develop (#15576)
6 years ago
phlrain a21fdde218 fix lod reset op;test=develop
6 years ago
phlrain dcba2e7236 fix conncat; test=develop
6 years ago
phlrain 955fad7a90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
6 years ago
phlrain a7fe3b508e fix concat; test=develop
6 years ago
tensor-tang 50931dee1d refine seq enum op
6 years ago
Qiyang Min 8e4ad008fb Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
minqiyang 36dce65bb3 Take DataType and VarType apart
6 years ago
Xin Pan 3e9319f3ab add more imperative layer tests.
6 years ago
luotao1 d9f0e7252a refine with comments
6 years ago
luotao1 6fa52f83ba Merge branch 'develop' into fc_infershape
6 years ago
luotao1 cc0ae1f1a1 refine with comments
6 years ago
dengkaipeng 9d435973d2 not use PERSISTENT in batch_norm. test=develop
6 years ago
luotao1 a275fd6e0c Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss 2579ade45f Add cpu_quantize_pass for C-API quantization (#16127)
6 years ago
Xin Pan 7458114b5b Merge pull request #16228 from panyx0718/imperative
6 years ago
chengduo efca4de78e Fix cross_entropy bug (#16236)
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
sneaxiy 4b073c95dc fix compiler
6 years ago
Tao Luo 38898c2808 Merge pull request #16212 from Aurelius84/develop
6 years ago
minqiyang db0c970823 Polish code
6 years ago
Kaipeng Deng b77ebb2af2 Merge pull request #15919 from heavengate/yolo_box
6 years ago
minqiyang 362253732c Polish code
6 years ago
minqiyang c0ddb93ccc Polish code
6 years ago
minqiyang b5078c211a Make infer var type virtual
6 years ago
minqiyang 9041b238e3 Polish code
6 years ago
minqiyang 438bca9c3d Implement Runtime Var Type Inference
6 years ago
Xin Pan 50ff898378 graph neural network for imperative mode
6 years ago
luotao1 5ecdc49c6b set enable_runtime_context_cache_ default false
6 years ago
ceci3 cd906fc603 change api.spec
6 years ago
Zhaolong Xing c49e604906 Merge pull request #16213 from qingqing01/compile_infer_shape
6 years ago
achao2013 81b4fad8b9 add moving average absmax op and fix bug (#15155)
6 years ago
luotao1 721c2c00ef refine fc_infershape
6 years ago
Kaipeng Deng 74037cc1c8 Merge branch 'develop' into yolo_box
6 years ago
Xin Pan 92b9ce3479 Merge pull request #16073 from heavengate/yolov3_loss_imporve
6 years ago
luotao1 46ee6bb1aa fix distributed unit-tests
6 years ago
luotao1 1b59bed989 Merge branch 'develop' into runtime_context
6 years ago
Aurelius84 2d1e76fb0c fix API.spec test=develop
6 years ago
luotao1 6ce25c99a0 Merge branch 'develop' into runtime_context
6 years ago
Aurelius84 6cfd20dea8 fix words spell error test=develop
6 years ago
qingqing01 8ad672a287 Support sync batch norm. (#16121)
6 years ago
shippingwang 98d9552f0f update sqrt explaination, test=develop
6 years ago
minqiyang ca392c7e97 Implement infer var type context
6 years ago
Yibing Liu 4ae23cc3c5 Impl fp16 compute kernel for slice_op (#16206)
6 years ago
sneaxiy f0d108f589 fix const_cast
6 years ago
Dang Qingqing e5e7628a62 Skip compile infer shape in box_coder_op
6 years ago
Aurelius84 a59b7d47a8 improve layers.fc api doc test=develop
6 years ago
sneaxiy 3e03695629 fix numeric error
6 years ago
sneaxiy 5a92e4c097 revert revert 16144
6 years ago
sneaxiy e993effb29 include unordered_map to cross_entropy_op.cc
6 years ago
Zeng Jinle a91964c8fe Revert "PaddingRNN model memory optimize"
6 years ago
liuwei1031 1c6caf8466 1. disable reuse SELECTED_ROWS type variable (#16150)
6 years ago
Wojciech Uss b9252f3df8 Add cpu_quantize_squash_pass for C-API quantization (#16128)
6 years ago
minqiyang f83739499c Polish code
6 years ago
minqiyang 7355d41834 1. Add imperative gperf profiler
6 years ago
Zeng Jinle 0b49e43d3a Merge pull request #16144 from sneaxiy/rnn_mem_opt
6 years ago
luotao1 b2898c0f57 Merge branch 'develop' into runtime_context
6 years ago
minqiyang 98dfb492bb Release GIL lock
6 years ago
sneaxiy ac0e0f5181 merge develop
6 years ago
Tao Luo 4ef6f738c3 Merge pull request #16154 from luotao1/infershape_example
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
sneaxiy 487624e15d fix travis-ci
6 years ago
luotao1 1510b866b6 turn off runtime_context_cache for tensorrt
6 years ago
guomingz decdbed054 resolve #15618 (#16114)
6 years ago
sneaxiy 1e9fd40777 combine op files
6 years ago
luotao1 d94fd97230 add runtime_context_cache_pass
6 years ago
Kaipeng Deng 1a4a90a81d Merge pull request #16140 from tink2123/arc_function
6 years ago
Yan Xu 30568473ec fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
fc500110 1c6e72b905 remove visualizer, which can be replaced by python IrGraph draw API
6 years ago
chengduo 0979956619 Add memory profiler (#16137)
6 years ago
luotao1 b561ad1e55 Merge branch 'develop' into runtime_context
6 years ago
tink2123 61a6165c2c modified api.spec
6 years ago
Zhen Wang 41b8cf0bae Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
Zhen Wang 94b7c1ea7b Merge pull request #16107 from wzzju/add_graph_clone
6 years ago
dengkaipeng 0ff9a403d0 fix format. test=develop
6 years ago
wopeizl 85709f4378 restore the exception caught since it is necessary for python call stack (#16160)
6 years ago
Zhen Wang 5420cf95f5 Merge pull request #16070 from wzzju/channel_wise_quant_op
6 years ago
Zhen Wang 5685a48c23 Add some fixme. test=develop
6 years ago
dengkaipeng b33e6bf5ef remove comment code. test=develop
6 years ago
tink2123 eb09bd456a modified api.spec
6 years ago
dengkaipeng 746740c41b fix API.spec. test=develop
6 years ago
dengkaipeng e4e3764060 use memory Copy. test=develop
6 years ago
dengkaipeng d31693afec no use _gt_score. test=develop
6 years ago
luotao1 5d20954ac4 add runtime shape for fuse_emb_seq_pool_grad
6 years ago
dengkaipeng aad62eeca0 add doc for param default. test=develop
6 years ago
tink2123 a8e375d463 refine doc
6 years ago
dengkaipeng 585766acc0 fix spell mistake in doc. test=develop
6 years ago
dengkaipeng b307533b7d fix format. test=develop
6 years ago
dengkaipeng afdf3c3f84 fix doc.test=develop
6 years ago
dengkaipeng 5b37cf0add fix API.spec for yolov3_loss. test=develop
6 years ago
dengkaipeng af4ef80e5b fix API.spec not add defaults. test=develop
6 years ago
dengkaipeng 0d1a9996ac fix unittest for yolov3_loss. test=develop
6 years ago
dengkaipeng f0804433b0 add mixup score and label_smooth for yolov3_loss. test=develop
6 years ago
dengkaipeng 626fb859d9 add param default doc. test=develop
6 years ago
dengkaipeng 33c8607ef3 fix doc. test=develop
6 years ago
dengkaipeng abb5a9c726 fix doc statement. test=develop
6 years ago
dengkaipeng b399ee2a23 fix doc. test=develop
6 years ago
dengkaipeng ad897304f9 fix pre-commit. test=develop
6 years ago
dengkaipeng 72a18bb160 add bbox range limit. test=develop
6 years ago
dengkaipeng fb863b4820 add API.spec for yolo_box. test=develop
6 years ago
dengkaipeng c9d4676bee fix multi batch idx error. test=develop
6 years ago
dengkaipeng 7808f4c097 fix unittest for yolo_box_op. test=develop
6 years ago
dengkaipeng cb2dca53c1 fix cuda kernel error
6 years ago
dengkaipeng 04b8b9e96c add yolo_box_op CUDA kernel
6 years ago
dengkaipeng 452373decb resize box in input image scale. test=develop
6 years ago
dengkaipeng 3896d955c7 add yolo_box_op CPU kernel
6 years ago
luotao1 8f6597aa0e Merge branch 'develop' into infershape_example
6 years ago
sneaxiy b26e9bd232 refine code
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
Tao Luo ccc7c358b3 Merge pull request #16104 from tensor-tang/refine/jit
6 years ago
Tao Luo c49b7855fa Merge pull request #16120 from Xreki/fix_cmake_compress
6 years ago
Qiyang Min 1f4aa7a202 Imperative remove all descs (#16045)
6 years ago
Zeng Jinle 472f16b5aa Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago
Tao Luo e31f6e9831 Merge pull request #16146 from luotao1/zero_copy
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
tensor-tang 14d871121b enhance jitkernel unit test
6 years ago
Liu Yiqun 4e052e0ac9 Disable inference download for WIN32 temporary.
6 years ago
chengduo ad80bde824 Revert "Revert "Add Event for TensorCopy"" (#16035)
6 years ago
luotao1 1283833395 zero_copy tensor support INT32
6 years ago
tensor-tang cfc83c1445 refine jitcodekey and enhance unit tests
6 years ago
tensor-tang 6ff230a624 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
luotao1 31c4e1d9fc Merge branch 'develop' into zero_copy
6 years ago
wopeizl a38db3cb99 Fixrecordio (#16124)
6 years ago
sneaxiy fc12f38394 add API.spec
6 years ago
sneaxiy b80d76f784 merge develop
6 years ago
sneaxiy cfd012e2cb add unittest
6 years ago
sneaxiy d7407c90aa refine cross_entropy mem
6 years ago
luotao1 9e2c7e69fb simplify the zero_copy tests
6 years ago
tink2123 cfc59b13e9 modified api.spec
6 years ago
sneaxiy 732fa00eaf disable gc in recurrent_op currently
6 years ago
tink2123 e4e0d03459 fix format
6 years ago
Tink_Y 5579fae1d2 Update activation_op.cc
6 years ago
tensor-tang 45bdd84dac enhance the jitkernel helper and add unit tests
6 years ago
tink2123 837ad7f86f Add the inverse trigonometric function
6 years ago
ceci3 415d74a08e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into npair_loss0
6 years ago
luotao1 aeee4cbe71 add compare between zerocopy and analysis
6 years ago