sneaxiy
4c8254e3bf
revert some loop op revision
...
test=develop
6 years ago
Zeng Jinle
644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
...
Fix allocator bug
6 years ago
sneaxiy
c4c6205268
fix gc bug
...
test=develop
6 years ago
sneaxiy
16f0994728
Merge develop
...
test=develop
6 years ago
sneaxiy
63651c1968
fix grad desc maker
...
test=develop
6 years ago
Yihua Xu
57dc3c1943
Disable compare for Issue#16316 ( #16466 )
...
* Disable compare for accuracy issue.
test=develop
* Add todo comments to show more information.
test=develop
6 years ago
Zeng Jinle
c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
...
Enhance gc to support deleting tensor buffer in advance
6 years ago
Jiabin Yang
54a73578a8
Feature/install check ( #16044 )
...
* test=develop, add install check
* test=develop, add install check scripts
* test=develop, refine language
* test=develop, add api spec
* test=develop, change cdn to bj to pass ci
6 years ago
wopeizl
c300b1ba69
Tensor index ( #16223 )
...
* extend the slice function for python
test=develop
6 years ago
Jiabin Yang
0d9d25d40f
Feature/refactor layers to Layers ( #16337 )
...
* test=develop, add some Layers and tests
* test=develop, add more layers
* test=develop, add more layers
* test=develop, add force cpu option
* Update test_layers.py
remove pdb
* test=develop, refine code
6 years ago
dengkaipeng
3e352388eb
fix format. test=develop
6 years ago
dengkaipeng
eb2123e12d
fix doc and jit. test=develop
6 years ago
liuwei1031
8d22bc17a4
Memory optimize ( #16410 )
...
* fix cdn issue, test=develop
* fix memory optimize bugs, test=develop
* fix memory optimize bugs, test=develop
* remove add/sub_2 op, test=develop
* disable memory_optimize by default, test=develop
* disable inplace activation in python, test=develop
* fix unittests, test=develop
* fix unittests, test=develop
* bug-fix, test=develop
6 years ago
Xin Pan
f8c279b11c
Merge pull request #16454 from panyx0718/imperative2
...
polish deepCF model to support real dataset
6 years ago
Zhaolong Xing
fa1796a30a
Merge pull request #16330 from NHZlX/merge_anakin_branch_to_dev
...
Cherry-pick from PaddlePaddle:feature/anakin-engine: Anakin subgraph support.
6 years ago
lujun
3f8b2f5ff5
fix multiplex doc, test=develop
6 years ago
sneaxiy
a0f4fefb60
delete source file no_need_buffer_vars_inference.cc
...
test=develop
6 years ago
tensor-tang
1eff834e97
update jitkernel doc ( #16327 )
...
* update jitkernel doc
test=develop
* follow comments
* follow comments
test=develop
6 years ago
Yiqun Liu
98802e1f75
Optimize the implementation of while_op again, for cases when is_test is true. ( #16359 )
...
test=develop
6 years ago
lujun
c34b24ede7
Merge pull request #16425 from junjun315/checkpoint-hotfix
...
Checkpoint hotfix
6 years ago
Wu Yi
9ffd5eecef
test fix fetch bar place for ce ( #16406 )
...
* test fix fetch bar place for ce
* fix ps mode dist train in develop test=develop
* fix style check test=develop
* update test=develop
6 years ago
sneaxiy
318072c26b
add comments of allocator design
...
test=develop
6 years ago
chengduo
4f2278f032
Add doc for CPUPlace CUDAPlace CUDAPinPlace ( #16442 )
...
test=develop
6 years ago
nhzlx
953bdde058
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
...
test=develop
6 years ago
Tao Luo
e0a3a49096
Merge pull request #16438 from wojtuss/wojtuss/move-cpu-quantize-passes
...
Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
gongweibao
ec6519e806
Fix allreducedep bug ( #16443 )
6 years ago
sneaxiy
78fb3a62e0
fix env variable settting bug
...
test=develop
6 years ago
sneaxiy
2d92b6be98
merge develop
...
test=develop
6 years ago
Jiabin Yang
f735102eab
add layer norm to Layers, add transformer test in imperative mode ( #16092 )
...
* add layer norm to Layers, add transformer prepare encoding
* little change
* finish encoder part
* add decoder part
* finish model part
* add test case and part of data feed
* add transformer test
* add to_parameter, add remove in set_attr
* test=develop, fix pos encoding bug, create_parameter with stantard name
* test=develop, rm dropout test in imperative
* test=develop, fix cpu error
* test=develop, fix minize bug
* test=develop, fix one hot not stop gradient
* test=develop, fix one hot not stop gradient
* test=develop, refine parameter name
* test=develop, fix transformer test in imperative mode
* test=develop, fix transformer test in imperative mode
* test=develop, fix boost and mkl download error
* test=develop, fix boost and mkl download error
* test=develop, fix ci and refine code
* test=develop, fix ci and refine code
6 years ago
Xin Pan
fd24ab47ab
polish
...
test=develop
6 years ago
Xin Pan
1f89249a95
update DeepCF model
...
test=develop
6 years ago
sneaxiy
a7d0ac50b8
Merge develop
6 years ago
sneaxiy
7000ec85d9
fix some op grad maker
...
fix ctest eager deletion disable bug
test=develop
6 years ago
nhzlx
45b3766fdf
fix comments
...
test=develop
6 years ago
zhaoyuchen2018
cdb315e9d8
Merge branch 'develop' into docrefine
6 years ago
zhhsplendid
3909108cae
Add SpectralNormGradOpDescMaker
...
Use SpectralNormGradOpDescMaker instead of DefaultGradOpDescMaker
to avoid registering useless variables to improve GPU usage.
test=develop
6 years ago
dengkaipeng
ceb31d30f0
fix formax. test=develop
6 years ago
Zeng Jinle
4cc9809cae
Merge pull request #15799 from sneaxiy/feature/decoupled_reader
...
Try to decouple reader with program_desc
6 years ago
whs
e9bec9369b
[slim] Add quantization strategy and distillation strategy. ( #16408 )
...
* Add fsp operator.
1 Add unitest.
2. Add python API.
3. Add layer test.
* Add quantization strategy.
1. Add API.
2. Add unitest.
* Add distillatoin strategy.
* Add unitest config file for quantization
* Fix Copyright
test=develop
* Fix setup.py
* Fix document of layers.py.
test=develop
* Fix unitest in python3.
test=develop
* Fix documents.
test=develop
* 1. refine fsp op by batched gemm
2. remove unused import
test=develop
* Fix test_dist_se_resnext.
1. disable test distillation.
2. reset framework.py
test=develop
* Enable unitest of distillation after fixing Block._clone_variable
test=develop
* Fix cdn issue.
test=develop
6 years ago
dengkaipeng
d54005a7f4
fix unittest. test=develop
6 years ago
liuwei1031
de3b70a101
fix cdn issue, test=develop ( #16423 )
...
* fix cdn issue, test=develop
* fix cdn issue, test=develop
6 years ago
dengkaipeng
90bd038d35
fix format. test=develop
6 years ago
lujun
bc4d1c7246
fix mix input type error, test=develop
6 years ago
sneaxiy
f8ed2c229e
try to fix ci error
...
test=develop
6 years ago
lujun
18aa59493e
fix mix input type error, test=develop
6 years ago
Wojciech Uss
46677fb080
Move cpu_quantize_* passes into mkldnn subfolder
...
test=develop
6 years ago
sneaxiy
c20db6357b
split PR
...
test=develop
6 years ago
Zeng Jinle
c64d959343
Merge pull request #16295 from zhhsplendid/zhenghuihuang-dev-2
...
Add support for init_memory and re-allocate_memory
6 years ago
lujun
1b6a2a09e8
fix mix input type error, test=develop
6 years ago
nhzlx
a1d11bb175
fix ci bug: cudnn handler in multi card
...
test=develop
6 years ago
sneaxiy
2f54d9f995
Merge develop
...
test=develop
6 years ago
sneaxiy
c75a880386
fix windows bug
...
test=develop
6 years ago
sneaxiy
072d95d8f6
Merge develop
...
test=develop
6 years ago
sneaxiy
a93a9eef8f
add op registry type
...
refine gc code
test=develop
6 years ago
dengkaipeng
f45aced59b
add jit test. develop=test
6 years ago
whs
2e5831f0dc
[slim] Refine framework of slim and add filter pruning strategy ( #16226 )
...
* First pr of paddle slim.
1. Add framework of paddle slim
2. Add filter pruning strategy
test=develop
* Rename unitest to tests.
test=develop
* Add prettytable into requirements.
test=develop
* Change in_nodes and out_nodes to odered dict.
test=develop
* Remove distillation.
test=develop
* Fix API.spec
test=develop
* Fix unitest.
test=develop
* Fix unitest in windows.
test=develop
* Fix unitest in windows.
test=develop
* Fix unitest.
test=develop
* Hide some functions.
test=develop
* Fix python import in python3.5
test=develop
* Fix compress pass.
test=develop
* Fix unitest of test_dist_ctr.
test=develop
* Enhence flops.
* use os.path.join
* Fix pickle for python3
Fix log and comments.
test=develop
* 1. Remove feed_reader in compress pass
2. Fix cache reader
3. Rename CompressPass to Compressor
4. Add comments for distiller optimizer
5. Remove unused pruner currently
6. Add some comments.
7. Change API.spec
test=develop
* Fix pruning in python3.
test=develop
* Fix unitest in python3.
test=develop
* Fix format in python3.
test=develop
6 years ago
whs
18779b5b8f
[Operator] Add range op. ( #15431 )
...
* Add range op.
test=develop
* Add more unitests.
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
6 years ago
phlrain
7dc4a7f4f8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_var_name_in_opt_2
6 years ago
Zhen Wang
ec11135d54
Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass
...
Add channel wise in quant pass.
6 years ago
xiaolil1
e235882c18
Enable MKL-DNN INT8 Concat Kernel. ( #16156 )
...
* Enable INT8 Concat Kernel to improve the performance of MobileNet-SSD.
test=develop
* Optimize UT format.
test=develop
* Fix UT file address issue.
test=develop
* Refine the license year.
test=develop
* Optimize code for new API.
test=develop
* Restructure INT8 Concat kernel.
test=develop
6 years ago
Qiyang Min
171df5b56b
Merge pull request #16303 from junjun315/checkpoint
...
for Checkpoint save and load
6 years ago
Hongyu Liu
e3bca9011c
Merge pull request #16357 from phlrain/fix_concat_check
...
Fix concat check
6 years ago
Hongyu Liu
e5478ab5c8
Merge pull request #16346 from phlrain/add_floordiv_and_mod
...
add elementwise floordiv, mod
6 years ago
chengduo
a6a3b2fbbc
[Speed]Refine ParallelExecutor ( #16190 )
...
* refine parallelExecutor
test=develop
* Polish op_handle
test=develop
* Remove unnecessary op_handle
test=develop
* Fix Travis CI
test=develop
* Fix fetch bug
test=develop
* Remove WaitInputVarGenerated
* Fix OpHandleBase::Run
test=develop
* debug
test=develop
* use origin fetch_op_handle
test=develop
* Revert op_handle_base.cc
test=develop
* Polish code
test=develop
* Fix OpHandleBase::Run
test=develop
* code refine
* test CI and CE
test=develop
* fix OpHandle::Run
test=develop
* refine AllReduceOpHandle
test=develop
* Polish code
test=develop
6 years ago
nhzlx
3df7b98a0f
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
nhzlx
f3a2e4b3d8
1. Add ANAKIN_ROOT compile option
...
2. refine trt code
test=develop
6 years ago
phlrain
77a08750e9
add var name in optimizer; test=develop
6 years ago
chengduo
33965527fd
Add unit test for fuse all reduce ( #16354 )
...
* refine fused_all_reduce_op
* add unit test in test_parallel_executor_seresnext
test=develop
6 years ago
Hongyu Liu
18a0f6d97a
Merge pull request #16351 from phlrain/fix_topk_shape_check
...
Fix topk shape check
6 years ago
Hongyu Liu
15444430b0
Merge pull request #16348 from phlrain/fix_squeeze_check
...
fix squeeze shape check
6 years ago
phlrain
5dc9b51994
fix time; test=develop
6 years ago
phlrain
686b8935fe
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_floordiv_and_mod
6 years ago
phlrain
18d107c27a
add floordiv and mod op; test=develop
6 years ago
phlrain
ff112813de
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
phlrain
8274d9d733
fix concat shape check; test=develop
6 years ago
Hongyu Liu
0d779f15f6
Merge pull request #16261 from phlrain/fix_sequence_pad_2
...
Fix sequence pad 2
6 years ago
Hongyu Liu
8c81d9949e
Merge pull request #16347 from phlrain/fix_matmul_check
...
fix matmul shape check
6 years ago
phlrain
4e3b5e727d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
sneaxiy
953214ad97
add more unittest
...
modify allocator strategy
remove changes of legacy buddy_allocator
test=develop
6 years ago
Kaipeng Deng
5670530ca7
Revert "not use PERSISTENT in batch_norm. test=develop" ( #16333 )
6 years ago
qingqing01
d2b938ef5a
Refine gradient proto maker and python API for affine_channel_op ( #16340 )
...
* Rewrite gradient ProtoMaker for affine_channel_op to remove the Output as the input.
* Add act in Python API to make the act can be in-place by layer_help.py
6 years ago
phlrain
b3d0336603
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
phlrain
249546bf45
fix concat shape check; test=develop
6 years ago
Tao Luo
294cdf6f48
Merge pull request #16177 from fc500110/remove_visualizer
...
remove graph visualizer tool, which can be replaced by python IrGraph draw API
6 years ago
Hongyu Liu
b3b7e352ee
Merge pull request #16259 from phlrain/fix_lod_reset
...
Fix lod reset
6 years ago
phlrain
3b46680913
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_squeeze_check
6 years ago
phlrain
6a5545aa22
fix squeeze shape check; test=develop
6 years ago
phlrain
d96f498bcf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_topk_shape_check
6 years ago
chenjiawen
a859bfc31d
test=develop
6 years ago
phlrain
e7fb344c99
fix squeeze op shape check; test=develop
6 years ago
phlrain
190cfd692c
fix squeeze shape check; test=develop
6 years ago
chenjiawen
be6e78b7c4
test=develop;add approval RD check api
6 years ago
phlrain
0e40298949
fix matmul shape check; test=develop
6 years ago
phlrain
56c2d384c7
add elementwise floordiv, mod; test=develop
6 years ago
Wu Yi
b7baeed7bb
fix win gpu build test=develop ( #16334 )
6 years ago
dengkaipeng
51536f7f52
StrideASum. test=develop
6 years ago
liuwei1031
df5d19aa9d
temoprarily disable the code of use kCUDNN, test=develop ( #16205 )
...
* temoprarily disable the code of use kCUDNN, test=develop
* add TODO comment, test=develop
6 years ago
ruri
09e05a110b
Merge pull request #16217 from ceci3/doc
...
fix formula in dropout
6 years ago
zhhsplendid
124f1df481
Add flags for init and re-alloc gpu
...
test=develop
6 years ago
Zhen Wang
8965819fbb
rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
6 years ago
Wojciech Uss
cbe2dbf0db
Add enabling quantization ( #16326 )
...
* Add enabling quantization
test=develop
* remove unused (here) function
6 years ago
lujun
09442fb27e
checkpoint pr be moved here, test=develop
6 years ago
Tao Luo
9a05859179
Merge pull request #16322 from wojtuss/wojtuss/fix_cpu_quantize_pass
...
fix pattern maching conv2d with(out) ResidualData
6 years ago
qingqing01
8caa785e83
Enhance affine_channel_op infer-shape check ( #16317 )
...
* Enhance affine_channel_op infer-shape check
6 years ago
flame
08838f3909
Fix save inference model bug ( #16242 )
...
* save infer model bug fix, return target vars' name list
6 years ago
Kaipeng Deng
957ea995fc
Merge pull request #16243 from heavengate/batch_norm_not_persistent
...
not use PERSISTENT in batch_norm. test=develop
6 years ago
nhzlx
4f4daa4b66
cherry-pick from feature/anakin-engine: add data type for zero copy #16313
...
1. refine anakin engine
2. add data type for zero copy
align dev branch and PaddlePaddle:feature/anakin-engine brach
the cudnn workspace modify was not included for now, because we use a hard code way
in feature/anakin-engine branch. There should be a better way to implement it,
and subsequent submissions will be made.
test=develop
6 years ago
nhzlx
07dcf2856c
git cherry-pick from feature/anakin-engine: update anakin subgraph #16278
6 years ago
nhzlx
c407dfa3cb
cherry-pick from feature/anakin-engine: refine paddle-anakin to new interface. #16276
6 years ago
nhzlx
a25331bc26
cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189
6 years ago
nhzlx
c79f06d3d8
cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178
6 years ago
nhzlx
69d37f81d7
cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157
...
support change input size
6 years ago
nhzlx
a1d200a5de
cherry-pick from feature/anakin-engine: Anakin support facebox #16111
6 years ago
flame
a32d420043
cherry-pick from feature/anakin-engine: batch norm ( #16110 )
...
* use anakin batch norm and scale implement fluid batch norm
6 years ago
flame
0945b97f07
cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op ( #16020 )
...
* add anakin softmax/ flatten/reshape/transpose/batch_norm op converter
6 years ago
nhzlx
b21770a2aa
cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018
6 years ago
nhzlx
084310f536
paddle-anakin: concat, split, pool2d converter#16003
6 years ago
flame
be523baad2
Add anakin conv2d/relu/sigmoid/tanh converter ( #15997 )
...
* add activation op
* test conv2d relu sigmoid tanh
6 years ago
Yan Chunwei
d0ce6a9044
fix anakin converter registry ( #15993 )
6 years ago
Tao Luo
a5124ee0bb
Merge pull request #16301 from luotao1/runtime_context_pass
...
add runtime_context_cache_pass
6 years ago
lujun
622fe6a56b
checkpoint pr be moved here, test=develop
6 years ago
baojun
2de263a5d9
Add softmax_with_cross_entropy_op to ngraph engine ( #16304 )
...
* Add softmax_with_cross_entropy_op test=develop
* simplify implementation test=develop
6 years ago
sneaxiy
bb166a1e10
fix API.spec
...
test=develop
6 years ago
ruri
a3b8028d46
Merge pull request #16202 from shippingwang/add_sqrt_doc
...
update sqrt explaination
6 years ago
phlrain
dd080b17c3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
6 years ago
phlrain
1580be5d6c
fix sequence pad; test=develop
6 years ago
dengkaipeng
aba2713ffc
fix comment. test=develop
6 years ago
chengduo
f26ba5bddd
Fuse AllReduce ( #15921 )
...
* fuse all_reduce
test=develop
* add fuse_parameter_groups_size
test=develop
* Polish code
test=develop
* Fix travis-ci
test=develop
* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
test=develop
* Add SetGroupAccordingToMemorySize
test=develop
* fix multi_devices_graph
test=develop
* reset params_grads
test=develop
* Polish code
test=develop
6 years ago
Zeng Jinle
d0ef682552
Merge pull request #16274 from sneaxiy/fix_grad_maker
...
Remove unused variables in op grad maker
6 years ago
dengkaipeng
93701dba50
add jit kernel for softmax axis. test=develop
6 years ago
baojun
804afc51db
Minor ngraph fix ( #16270 )
...
* take care edge cases test=develop
* use pragma test=develop
6 years ago
Tao Luo
9195c3bb03
Merge pull request #16280 from luotao1/cos_sim_infershape
...
refine cos_sim infershape
6 years ago
Wojciech Uss
104a9f1e27
fix pattern maching conv2d with(out) ResidualData
...
test=develop
6 years ago
Wu Yi
6382b62f6b
Collective ops ( #15572 )
...
* wip allreduce in op
* wip
* wip
* wip
* wip adding test
* wip for conflict with mp mode
* fix tests test=develop
* fix cpu build test=develop
* fix travis clang format test=develop
* fix cpu build test=develop
* update api.spec test=develop
* delete comment test=develop
* fix cpplint test=develop
* fix test=develop
* follow comment test=develop
* add file test=develop
* fix build test=develop
* update test=develop
* to be compatible with sync_bn, and fix mp mode in develop test=develop
6 years ago
lujun
bed0ecf3d2
checkpoint pr be moved here, test=develop
6 years ago
sneaxiy
023a3a3d62
fix op grad maker
...
test=develop
6 years ago
luotao1
82af8031d9
add runtime_context_cache_pass
...
test=develop
6 years ago
Zhen Wang
ec88b6cc5a
add channel wise quantization in ir pass.
6 years ago
Tao Luo
b9fc80a133
Merge pull request #16287 from PaddlePaddle/revert-16002-runtime_context
...
Revert "cache runtime_context"
6 years ago
whs
18911b6eea
[enhence] Make step_input of dynamic_rnn support custom lod level. ( #15972 )
...
* Make step_input support custom lod level.
test=develop
* Fix API.spec
test=develop
* Fix API.spec.
test=develop
* Fix API.spec
test=develop
* Add default value in document of step_input.
test=develop
* Fix document.
test=develop
* Fix API.spec
test=develop
6 years ago
zhhsplendid
22715487dc
add allocator flags
...
test=develop
6 years ago
luotao1
c05af910bc
refine cos_sim infershape
...
test=develop
6 years ago
Hongyu Liu
d3acf68044
Merge pull request #16258 from phlrain/fix_concat_1
...
Fix concat
6 years ago
Tao Luo
7d2740db83
Revert "cache runtime_context"
6 years ago
sneaxiy
fd23262e0c
merge develop, fix conflict
...
test=develop
6 years ago
ceci3
c79cdf258e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
6 years ago
ceci3
9d2ccec0f2
test=develop, fix doc
6 years ago
tensor-tang
ead558b7f6
Merge pull request #16256 from tensor-tang/refine/seqenum
...
refine sequence enumerate op
6 years ago
ceci3
27f7a72641
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc
6 years ago
Qiyang Min
c7f1f3ed0c
Merge pull request #16214 from velconia/imperative_infer_var_type
...
Implement imperative infer var type
6 years ago
Zeng Jinle
f8df9eb32e
fix api doc ( #16201 )
...
test=develop
6 years ago
sneaxiy
3a09693f5c
change API name
...
test=develop
6 years ago
Jacek Czaja
13816dd4ac
[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used ( #16233 )
...
* - Fix to crash of Transformer when mkldnn is to be used
Desc: TensorCopy was not setting MKLDNN primitive descriptor when layout was to be kMKLDNN
test=develop
* - Enable transformer for mkl-dnn
test=develo
* - Compilation fix
test=develop
* - Removed manual selection of MKL-DNN ops to be used in Transformer test
test=develop
6 years ago
Yibing Liu
7e20e7691e
Fix the bug in fp16 backward kernel ( #16269 )
...
test=develop
6 years ago
shippingwang
97c6051822
add api.spec, test=develop
6 years ago
Wojciech Uss
af03008890
Add cpu_quantize_placement_pass for C-API quantization ( #16265 )
...
* Add cpu_quantize_placement_pass for C-API quantization
test=develop
* added a comment on required pass attributes
test=develop
6 years ago
Tao Luo
dbb92ee4b1
Merge pull request #16002 from luotao1/runtime_context
...
cache runtime_context
6 years ago
shippingwang
4f42504eef
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sqrt_doc
6 years ago
dengkaipeng
6c64182709
refine softmax kernel. test=develop
6 years ago
dengkaipeng
412b7cbdf1
fix format. test=develop
6 years ago
dengkaipeng
8b88960dce
fix doc. test=develop
6 years ago
dengkaipeng
2ddd23dac8
fix format. test=develop
6 years ago
dengkaipeng
3e4f3434e6
fix API.spec. test=develop
6 years ago
dengkaipeng
365e6cfd15
add mkldnn support. test=develop
6 years ago
dengkaipeng
217db27337
add mkldnn support. test=develop
6 years ago
dengkaipeng
6cb66721d2
add cudnn support. test=develop
6 years ago
dengkaipeng
518325f1e7
add softmax_axis CPU kernel. test=develop
6 years ago
sneaxiy
e893cbd286
add auto increment best fit allocator
...
test=develop
6 years ago
minqiyang
565b19b7a5
fix set data type bug
...
test=develop
6 years ago
minqiyang
8364688c30
Fix py_func_op's problem
6 years ago
sneaxiy
161b8ddcaa
Merge develop
6 years ago
Zeng Jinle
6429d2a887
Merge pull request #16188 from sneaxiy/fix_const_cast
...
Remove const_cast in optimizers
6 years ago
minqiyang
b40e41fbd1
Polish code style
...
test=develop
6 years ago
Hongyu Liu
9874530ee8
Update lod_reset_op.cc
6 years ago
xiaolil1
e818fa1004
Enable INT8 transpose kernel for MobileNet-SSD improvement. ( #16159 )
...
* Enable INT8 transpose kernel for MobileNet-SSD improvement.
test=develop
* Refine the license year.
test=develop
* Delete redundant code.
test=develop
* Add axis check.
test=develop
6 years ago
Hongyu Liu
58c69a9eb9
Update lod_reset_op.cc
6 years ago
Xin Pan
374abcf361
Merge pull request #16247 from panyx0718/imperative
...
add more imperative layer tests.
6 years ago
Hongyu Liu
96c54cd9b1
Update lod_reset_op.cc
6 years ago
Tao Luo
c072998ac1
Merge pull request #16219 from luotao1/fc_infershape
...
refine fc_infershape
6 years ago
phlrain
e46a94a4a2
fix lod_rest; test=develop
6 years ago
phlrain
10ececb213
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_sequence_pad_2
6 years ago
phlrain
802b33489a
remove resize then seq num == 1; test=develop
6 years ago
phlrain
152beec540
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_lod_reset
6 years ago
tangwei12
8ea4218ce1
update load persistables for increment, test=develop ( #15576 )
...
* update load persistables for increment, test=develop
* update load persistables for increment, test=develop
* update API Spec, test=develop
* update API Spec, test=develop
* add doc, test=develop
* add doc, test=develop
* Update lookup_table_utils.py
* Update API.spec
* Update lookup_table_utils.py
test=develop
* Update API.spec
test=develop
* fix api spec
* Update lookup_table_utils.py
test=develop
6 years ago
phlrain
a21fdde218
fix lod reset op;test=develop
6 years ago
phlrain
dcba2e7236
fix conncat; test=develop
6 years ago
phlrain
955fad7a90
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
6 years ago
phlrain
a7fe3b508e
fix concat; test=develop
6 years ago
tensor-tang
50931dee1d
refine seq enum op
...
test=develop
6 years ago
Qiyang Min
8e4ad008fb
Merge pull request #16198 from velconia/imperative_train_speed
...
Improve imperative mode training speed
6 years ago
minqiyang
36dce65bb3
Take DataType and VarType apart
...
test=develop
6 years ago
Xin Pan
3e9319f3ab
add more imperative layer tests.
...
test=develop
6 years ago
luotao1
d9f0e7252a
refine with comments
...
test=develop
6 years ago
luotao1
6fa52f83ba
Merge branch 'develop' into fc_infershape
6 years ago
luotao1
cc0ae1f1a1
refine with comments
...
test=develop
6 years ago
dengkaipeng
9d435973d2
not use PERSISTENT in batch_norm. test=develop
6 years ago
luotao1
a275fd6e0c
Merge branch 'develop' into runtime_context
6 years ago
Wojciech Uss
2579ade45f
Add cpu_quantize_pass for C-API quantization ( #16127 )
...
* Add cpu_quantize_pass for C-API quantization
test=develop
* add cpu_quantize_pass test
* fix lint: add include memory unorderd_map and unordered_set
test=develop
* fuse_relu 1
test=develop
* tuned 2 without squash
* fixes
test=develop
* remove unused vars
test=develop
* refactored
test=develop
* fix lint c-style cast -> C++ style cast
test=develop
* remove QuantMax and c style casts
test=develop
* last usage of QuantMax removed
test=develop
* Fix Analysis Predictor UT
Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop
* change map to unordered_map
fix the forgotten part of cpu_quantize_pass_tester.cc
test=develop
* removed quantized attribute
* fixed cpu_quantize_pass_tester and op attr comments
test=develop
* removed redundant line
test=debug
* removed gmock
test=develop
* fix after merge
6 years ago
Xin Pan
7458114b5b
Merge pull request #16228 from panyx0718/imperative
...
graph neural network for imperative mode
6 years ago
chengduo
efca4de78e
Fix cross_entropy bug ( #16236 )
...
test=develop
6 years ago