dengkaipeng
eb2123e12d
fix doc and jit. test=develop
6 years ago
dengkaipeng
7920e3be02
revert test_softmax_cudnn. test=develop
6 years ago
dengkaipeng
93701dba50
add jit kernel for softmax axis. test=develop
6 years ago
dengkaipeng
8b88960dce
fix doc. test=develop
6 years ago
dengkaipeng
2ddd23dac8
fix format. test=develop
6 years ago
dengkaipeng
365e6cfd15
add mkldnn support. test=develop
6 years ago
dengkaipeng
217db27337
add mkldnn support. test=develop
6 years ago
dengkaipeng
6cb66721d2
add cudnn support. test=develop
6 years ago
xiaolil1
e818fa1004
Enable INT8 transpose kernel for MobileNet-SSD improvement. ( #16159 )
...
* Enable INT8 transpose kernel for MobileNet-SSD improvement.
test=develop
* Refine the license year.
test=develop
* Delete redundant code.
test=develop
* Add axis check.
test=develop
6 years ago
Xin Pan
374abcf361
Merge pull request #16247 from panyx0718/imperative
...
add more imperative layer tests.
6 years ago
tangwei12
8ea4218ce1
update load persistables for increment, test=develop ( #15576 )
...
* update load persistables for increment, test=develop
* update load persistables for increment, test=develop
* update API Spec, test=develop
* update API Spec, test=develop
* add doc, test=develop
* add doc, test=develop
* Update lookup_table_utils.py
* Update API.spec
* Update lookup_table_utils.py
test=develop
* Update API.spec
test=develop
* fix api spec
* Update lookup_table_utils.py
test=develop
6 years ago
Qiyang Min
8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
...
Improve imperative mode training speed
6 years ago
Xin Pan
3e9319f3ab
add more imperative layer tests.
...
test=develop
6 years ago
Xin Pan
7458114b5b
Merge pull request #16228 from panyx0718/imperative
...
graph neural network for imperative mode
6 years ago
Tao Luo
38898c2808
Merge pull request #16212 from Aurelius84/develop
...
improve layers.fc api doc
6 years ago
Kaipeng Deng
b77ebb2af2
Merge pull request #15919 from heavengate/yolo_box
...
add yolo_box for detection box calc in YOLOv3
6 years ago
Xin Pan
3be7e971ab
polish
...
test=develop
6 years ago
Xin Pan
50ff898378
graph neural network for imperative mode
...
test=develop
6 years ago
achao2013
81b4fad8b9
add moving average absmax op and fix bug ( #15155 )
...
* Add moving average absmax op in quantilize-aware training.
6 years ago
Kaipeng Deng
74037cc1c8
Merge branch 'develop' into yolo_box
6 years ago
Xin Pan
92b9ce3479
Merge pull request #16073 from heavengate/yolov3_loss_imporve
...
Yolov3 loss: add mixup score and label smooth
6 years ago
Aurelius84
6cfd20dea8
fix words spell error test=develop
6 years ago
qingqing01
8ad672a287
Support sync batch norm. ( #16121 )
...
* Support Sync Batch Norm.
* Note, do not enable it in one device.
Usage:
build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
loss_name=loss_mean.name,
build_strategy=build_strategy)
6 years ago
Yibing Liu
4ae23cc3c5
Impl fp16 compute kernel for slice_op ( #16206 )
...
* Impl fp16 compute kernel for slice_op
test=develop
* Use data() to replace mutable_data()
6 years ago
Aurelius84
a59b7d47a8
improve layers.fc api doc test=develop
6 years ago
sneaxiy
3e03695629
fix numeric error
...
test=develop
6 years ago
sneaxiy
5a92e4c097
revert revert 16144
...
test=develop
6 years ago
sneaxiy
ad5f0e6018
merge develop
6 years ago
sneaxiy
55ba7f610b
fix numeric error
...
test=develop
6 years ago
Zeng Jinle
a91964c8fe
Revert "PaddingRNN model memory optimize"
...
test=develop
6 years ago
minqiyang
f83739499c
Polish code
...
test=develop
6 years ago
minqiyang
7355d41834
1. Add imperative gperf profiler
...
2. Add binutils 2.27 in manylinux support
test=develop
6 years ago
Zeng Jinle
0b49e43d3a
Merge pull request #16144 from sneaxiy/rnn_mem_opt
...
PaddingRNN model memory optimize
6 years ago
minqiyang
42e96a029f
Accelerate CPU part
6 years ago
Kaipeng Deng
1a4a90a81d
Merge pull request #16140 from tink2123/arc_function
...
Add the inverse trigonometric function
6 years ago
Yan Xu
30568473ec
fix broadcast on mp mode ( #15951 )
...
* fix broadcast with mp mode
* polish code test=develop
* fix bcast strategy test=develop
* fic cpplint test=develop
* fix py3 failed test=develop
* fix comment test=develop
* update comment test=develop
6 years ago
baojun
e3c37bd564
remove const_cast and refactor ngraph engine code ( #15925 )
...
* remove concast_cast and refactor code test=develop
* reduce flag use test=develop
6 years ago
Zhen Wang
41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
...
Fix NaN bugs for static quantization strategy (mutil-cards train).
6 years ago
Qiyang Min
d4e8c99f4b
Merge pull request #16095 from velconia/transfer_gru_unit
...
Imperative transfer gru unit
6 years ago
dengkaipeng
2c0abba0c3
fix test_detection. test=develop
6 years ago
Zhen Wang
94b7c1ea7b
Merge pull request #16107 from wzzju/add_graph_clone
...
Add clone function for IrGraph.
6 years ago
Zhen Wang
5420cf95f5
Merge pull request #16070 from wzzju/channel_wise_quant_op
...
Add channel wise quant op and channel wise dequant op.
6 years ago
dengkaipeng
e4e3764060
use memory Copy. test=develop
6 years ago
dengkaipeng
d31693afec
no use _gt_score. test=develop
6 years ago
dengkaipeng
aad62eeca0
add doc for param default. test=develop
6 years ago
Zhen Wang
b8d1f5038e
Add the executor test for the graph clone API. test=develop
6 years ago
dengkaipeng
afdf3c3f84
fix doc.test=develop
6 years ago
dengkaipeng
af4ef80e5b
fix API.spec not add defaults. test=develop
6 years ago
dengkaipeng
0d1a9996ac
fix unittest for yolov3_loss. test=develop
6 years ago
dengkaipeng
626fb859d9
add param default doc. test=develop
6 years ago