Commit Graph

8233 Commits (174d0d0b90a610807d6f82927aad4def227ee643)

Author SHA1 Message Date
chengduo 33965527fd Add unit test for fuse all reduce (#16354)
6 years ago
phlrain 5dc9b51994 fix time; test=develop
6 years ago
phlrain 686b8935fe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_floordiv_and_mod
6 years ago
Hongyu Liu 8c81d9949e Merge pull request #16347 from phlrain/fix_matmul_check
6 years ago
lujun ac32bf6f77 update input params type, test=develop
6 years ago
qingqing01 d2b938ef5a Refine gradient proto maker and python API for affine_channel_op (#16340)
6 years ago
phlrain 0e40298949 fix matmul shape check; test=develop
6 years ago
phlrain 56c2d384c7 add elementwise floordiv, mod; test=develop
6 years ago
ruri 09e05a110b Merge pull request #16217 from ceci3/doc
6 years ago
zhhsplendid 124f1df481 Add flags for init and re-alloc gpu
6 years ago
Zhen Wang 8965819fbb rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
6 years ago
Wu Yi 8bebfe5640 add resnet nccl2 dist training, mp training unit test (#16167)
6 years ago
flame 08838f3909 Fix save inference model bug (#16242)
6 years ago
baojun 2de263a5d9 Add softmax_with_cross_entropy_op to ngraph engine (#16304)
6 years ago
sneaxiy bb166a1e10 fix API.spec
6 years ago
chengduo f26ba5bddd Fuse AllReduce (#15921)
6 years ago
Wu Yi 6382b62f6b Collective ops (#15572)
6 years ago
lujun bed0ecf3d2 checkpoint pr be moved here, test=develop
6 years ago
Zhen Wang ec88b6cc5a add channel wise quantization in ir pass.
6 years ago
whs 18911b6eea [enhence] Make step_input of dynamic_rnn support custom lod level. (#15972)
6 years ago
zhhsplendid 22715487dc add allocator flags
6 years ago
ceci3 27f7a72641 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
6 years ago
ceci3 3f5f5ed361 fix dropout doc
6 years ago
Zeng Jinle f8df9eb32e fix api doc (#16201)
6 years ago
sneaxiy 3a09693f5c change API name
6 years ago
Yibing Liu 7e20e7691e Fix the bug in fp16 backward kernel (#16269)
6 years ago
sneaxiy 161b8ddcaa Merge develop
6 years ago
xiaolil1 e818fa1004 Enable INT8 transpose kernel for MobileNet-SSD improvement. (#16159)
6 years ago
Xin Pan 374abcf361 Merge pull request #16247 from panyx0718/imperative
6 years ago
tangwei12 8ea4218ce1 update load persistables for increment, test=develop (#15576)
6 years ago
Qiyang Min 8e4ad008fb Merge pull request #16198 from velconia/imperative_train_speed
6 years ago
Xin Pan 3e9319f3ab add more imperative layer tests.
6 years ago
Xin Pan 7458114b5b Merge pull request #16228 from panyx0718/imperative
6 years ago
sneaxiy 4b073c95dc fix compiler
6 years ago
Tao Luo 38898c2808 Merge pull request #16212 from Aurelius84/develop
6 years ago
Kaipeng Deng b77ebb2af2 Merge pull request #15919 from heavengate/yolo_box
6 years ago
Xin Pan 3be7e971ab polish
6 years ago
Xin Pan 50ff898378 graph neural network for imperative mode
6 years ago
achao2013 81b4fad8b9 add moving average absmax op and fix bug (#15155)
6 years ago
Kaipeng Deng 74037cc1c8 Merge branch 'develop' into yolo_box
6 years ago
Xin Pan 92b9ce3479 Merge pull request #16073 from heavengate/yolov3_loss_imporve
6 years ago
Aurelius84 6cfd20dea8 fix words spell error test=develop
6 years ago
ceci3 cd82e2b03b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
6 years ago
ceci3 ede33c6260 fix formula in dropout
6 years ago
qingqing01 8ad672a287 Support sync batch norm. (#16121)
6 years ago
Yibing Liu 4ae23cc3c5 Impl fp16 compute kernel for slice_op (#16206)
6 years ago
Aurelius84 a59b7d47a8 improve layers.fc api doc test=develop
6 years ago
sneaxiy 3e03695629 fix numeric error
6 years ago
sneaxiy 5a92e4c097 revert revert 16144
6 years ago
sneaxiy ad5f0e6018 merge develop
6 years ago
sneaxiy 55ba7f610b fix numeric error
6 years ago
Zeng Jinle a91964c8fe Revert "PaddingRNN model memory optimize"
6 years ago
minqiyang f83739499c Polish code
6 years ago
minqiyang 7355d41834 1. Add imperative gperf profiler
6 years ago
Zeng Jinle 0b49e43d3a Merge pull request #16144 from sneaxiy/rnn_mem_opt
6 years ago
sneaxiy ac0e0f5181 merge develop
6 years ago
minqiyang 42e96a029f Accelerate CPU part
6 years ago
Kaipeng Deng 1a4a90a81d Merge pull request #16140 from tink2123/arc_function
6 years ago
Yan Xu 30568473ec fix broadcast on mp mode (#15951)
6 years ago
baojun e3c37bd564 remove const_cast and refactor ngraph engine code (#15925)
6 years ago
Zhen Wang 41b8cf0bae Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
Qiyang Min d4e8c99f4b Merge pull request #16095 from velconia/transfer_gru_unit
6 years ago
dengkaipeng 2c0abba0c3 fix test_detection. test=develop
6 years ago
Zhen Wang 94b7c1ea7b Merge pull request #16107 from wzzju/add_graph_clone
6 years ago
Zhen Wang 5420cf95f5 Merge pull request #16070 from wzzju/channel_wise_quant_op
6 years ago
dengkaipeng e4e3764060 use memory Copy. test=develop
6 years ago
dengkaipeng d31693afec no use _gt_score. test=develop
6 years ago
dengkaipeng aad62eeca0 add doc for param default. test=develop
6 years ago
Zhen Wang b8d1f5038e Add the executor test for the graph clone API. test=develop
6 years ago
dengkaipeng afdf3c3f84 fix doc.test=develop
6 years ago
dengkaipeng af4ef80e5b fix API.spec not add defaults. test=develop
6 years ago
dengkaipeng 0d1a9996ac fix unittest for yolov3_loss. test=develop
6 years ago
dengkaipeng 626fb859d9 add param default doc. test=develop
6 years ago
dengkaipeng 00e822d2a0 fix test_detection. test=develop
6 years ago
dengkaipeng abb5a9c726 fix doc statement. test=develop
6 years ago
dengkaipeng ad897304f9 fix pre-commit. test=develop
6 years ago
dengkaipeng deec3ac1e9 fix infer error.
6 years ago
dengkaipeng c9d4676bee fix multi batch idx error. test=develop
6 years ago
dengkaipeng cb2dca53c1 fix cuda kernel error
6 years ago
dengkaipeng 452373decb resize box in input image scale. test=develop
6 years ago
dengkaipeng 3896d955c7 add yolo_box_op CPU kernel
6 years ago
Zhen Wang 4ab0607634 fix NaN bugs.
6 years ago
Qiyang Min 4e8c03bdfe Imperative fix conv2d's attrs bugs (#16158)
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 01eddf125c Not add graph copy construction method. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
jerrywgz 08e757318c Merge pull request #16145 from ceci3/npair_loss0
6 years ago
minqiyang de212ae274 Polish code
6 years ago
Qiyang Min 1f4aa7a202 Imperative remove all descs (#16045)
6 years ago
Zeng Jinle 472f16b5aa Merge pull request #16063 from sneaxiy/enhance_gc
6 years ago
minqiyang d17bb4e600 Add unit test for gru unit
6 years ago
minqiyang 0d27d204d8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into transfer_gru_unit
6 years ago
Qiyang Min 5f33164f3d Merge pull request #16125 from panyx0718/imperative
6 years ago
minqiyang 2050f31c26 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into transfer_gru_unit
6 years ago
wopeizl a38db3cb99 Fixrecordio (#16124)
6 years ago
sneaxiy b80d76f784 merge develop
6 years ago
sneaxiy aa2335c218 add py_reader doc
6 years ago
ceci3 8468c39318 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into npair_loss0
6 years ago
ceci3 24fbe6d610 test=develop, replace sce
6 years ago
sneaxiy cfd012e2cb add unittest
6 years ago