chengduo
1d91a49d2f
Some trivial optimization ( #13530 )
...
* some trivial opt
* remove the fix of lod_tensor and shrink_rnn_memory_op
* refine ShrinkRNNMemoryOp
test=develop
7 years ago
ktlichkid
5093afced1
Fix bug in sequence_slice_op
7 years ago
Xin Pan
ebf9171df3
Merge pull request #13532 from panyx0718/infer
...
Add a test to load a full model and allow post-training from C++
7 years ago
nhzlx
6c81230683
update code for config change
...
test=develop
7 years ago
tensor-tang
92031968d7
init vmul kernel
7 years ago
tensor-tang
b9acbcc8c5
init lstm kernel
7 years ago
tensor-tang
c260bf942d
init jit kernel
7 years ago
nhzlx
5c57e15044
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
Tao Luo
f67483bf3b
add seq_conv UT ( #13517 )
...
* add multi_label UT
* rename, fix typo, add fuse_statis check
7 years ago
Tao Luo
c07b2a97a9
Merge pull request #13521 from Sand3r-/mgallus/fix-pooling-ceiled-size
...
Enable MKL-DNN in Analysis Predictor
7 years ago
Tao Luo
d0000082c4
Merge pull request #13552 from sfraczek/sfraczek/conv-relu-update
...
little update to conv relu fuse pass (fix)
7 years ago
dzhwinter
cc20867d49
flags ( #13542 )
7 years ago
dzhwinter
7806c5625f
fix enforce ( #13544 )
7 years ago
Michal Gallus
0e6b303f54
MKLDNN Pooling: inline functions handling ceiled mode
7 years ago
minqiyang
b1448ded40
Port clip and clip_by_norm op to nn and change API.spec
7 years ago
Yu Yang
21bb9e91fc
Merge pull request #13201 from reyoung/revert_callstack
...
Revert "Revert "Add Python Callstacks when Op::Run error (#12759 )""
7 years ago
gongweibao
be97c47efc
merge
7 years ago
gongweibao
3dc54af2d3
merge
7 years ago
Michal Gallus
f465b03ef9
Enable MKLDNN in Analysis Predictor
...
Also fix MKL-DNN pooling integration for ceil mode
7 years ago
Xin Pan
cbdf9833e3
hide create_passes_from_strategy for now
7 years ago
Sylwester Fraczek
e5d1bd1e93
remove unused variable nodes2delete
7 years ago
Sylwester Fraczek
a49aa4dac9
make bias unnecessary for ConvRelu fuse
7 years ago
Sylwester Fraczek
493ef0c8df
do not remove conv node just rewire the output
7 years ago
Sylwester Fraczek
667b661786
updated the test
7 years ago
Zeng Jinle
2cd558fb36
Merge pull request #13561 from sneaxiy/fix_api_kwargs
...
[Urgent] Add out params to some apis
7 years ago
Yan Chunwei
e426cdae32
fix inference output with lod ( #13557 )
7 years ago
Xin Pan
bc1fa4fd6f
Merge pull request #13556 from panyx0718/doc
...
clean a few more kwargs
7 years ago
sneaxiy
48d82bd008
add out params
7 years ago
Dang Qingqing
b7146d60e4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantize_transpiler_update
7 years ago
Yu Yang
46498bf1a0
Merge pull request #13508 from reyoung/hide_parameter
...
Hide parameter
7 years ago
gongweibao
6ba86617ba
Merge branch 'generator2' of https://github.com/gongweibao/Paddle into generator2
7 years ago
gongweibao
1113337764
merge
7 years ago
Xin Pan
7ba55aa294
fix CMAKE
7 years ago
Xin Pan
6974265292
support offline train
7 years ago
Yu Yang
606dfb13d5
Merge pull request #13442 from reyoung/feature/remove_trainer_api
...
Move trainer to contrib
7 years ago
Xin Pan
f117feab0c
modify comments
7 years ago
Yu Yang
7119d6c3cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
7 years ago
Xin Pan
c2b3838cf5
add some comments
7 years ago
Yan Chunwei
5de14c6b96
refine inference api ( #13518 )
7 years ago
Wu Yi
aeb2dc2b05
Nccl2 dist API ( #13506 )
...
* add nccl2 dist api
* update apispec
* update
* update api spec
7 years ago
dzhwinter
c66a8d2cd8
add guide ( #13332 )
...
* add guide
* "fix doc"
* Update windows_inference.md
Looks like there is a little problem in markdown format writing of head lines
7 years ago
dzhwinter
24447ec517
flags ( #13541 )
7 years ago
dzhwinter
4fd5eb2255
"refine cmake" ( #13546 )
7 years ago
Xin Pan
0d9ee0dced
fix resolve conflicts
7 years ago
Xin Pan
b43e49fa31
fix
7 years ago
Xin Pan
afc603c108
update API.spec
7 years ago
Xin Pan
36c2a9af27
pass builder allows customizing passes in Python.
7 years ago
Xin Pan
355a2265a0
update API.spec
7 years ago
Xin Pan
eb1aeb175b
Merge pull request #13538 from baiyfbupt/softshrink
...
Remove kwargs in softshrink
7 years ago
dzhwinter
97636a9fcf
"fix link error" ( #13545 )
7 years ago
Jiabin Yang
efc2ac950c
Merge pull request #13527 from JiabinYang/mac/fix_mac_compile
...
fix roi_perspective_transform_op.cc unused variable caused error on Mac
7 years ago
Qiao Longfei
bcc7bff12f
Merge pull request #13488 from jacquesqiao/fix-img_conv_group-doc
...
fix img_conv_group doc
7 years ago
nhzlx
baae7e4f63
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
baiyf
3986242e5e
remove kwargs
7 years ago
Zeng Jinle
a8f66365c9
Merge pull request #13524 from sneaxiy/fix_api_kwargs
...
Remove kwargs in elementwise layers and scale layer
7 years ago
sneaxiy
70e70d7d38
fix api.spec
7 years ago
nhzlx
2763321684
fix comments
7 years ago
Yan Chunwei
90bc14da24
simple fix on inference tester helper ( #13507 )
7 years ago
sneaxiy
6724801884
fix conflict
7 years ago
Xin Pan
a54c423e61
Merge pull request #13512 from tensor-tang/remove/kwargs
...
remove kwargs of some activation api
7 years ago
chengduo
cdf3a4c244
Fix concat_op InferShape ( #13513 )
...
* add ShareLoDs
* refine
* add Is EmptyVarName
* refine Sharedlod
7 years ago
gongweibao
08b22cf1f5
Merge branch 'generator2' of https://github.com/gongweibao/Paddle into generator2
7 years ago
gongweibao
a30194743b
fix api
7 years ago
gongweibao
bee213e51f
fix conflict
7 years ago
gongweibao
54f685dbb1
add api
7 years ago
tensor-tang
b9d7bd487f
Merge branch 'develop' into remove/kwargs
7 years ago
sneaxiy
3ad3635de0
fix conflict
7 years ago
sneaxiy
3ee0a6489d
remove kwargs in python api
7 years ago
Xin Pan
6537b175c4
Merge pull request #13522 from wzzju/remove-kwargs
...
remove the kwargs of elu,relu6,pow,stanh,hard_sigmoid and swish
7 years ago
Zeng Jinle
f855c05fff
Merge pull request #13520 from sneaxiy/enhance_eager_delete
...
Enhance eager delete and sparse Adam
7 years ago
gongweibao
b619c5e655
merge
7 years ago
gongweibao
01396d91db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into generator2
7 years ago
gongweibao
b07c563acf
clean
7 years ago
nhzlx
0514882bc5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx
1f6c9dbad4
fix typo
7 years ago
gongweibao
dda9c3556e
fix
7 years ago
JiabinYang
c324cdeff8
fix roi_perspective_transform_op.cc unused variable caused error on macos
7 years ago
Dang Qingqing
ba8ba3000b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantize_transpiler_update
7 years ago
Yu Yang
3043f51b3a
Merge pull request #13511 from reyoung/fix_ce
...
Revert "Merge pull request #13431 from chengduoZH/refine_lod"
7 years ago
gongweibao
ff47841796
fix
7 years ago
tensor-tang
6d2ce745a6
Merge remote-tracking branch 'ups/develop' into remove/kwargs
7 years ago
Yu Yang
f7af695801
Merge pull request #13505 from reyoung/fix_selected_rows_functor_test
...
Fix unstable selected_rows_functor_test.cu
7 years ago
Wu Yi
29c63d180f
[Feature] dist op role and lr op role, to support memory optimize with dist training ( #13220 )
...
* wip
* clean up
* should fix running with memopt
* add ut
* mark lr schedule op role
* hide lr_schedule_guard
* use op_role_var instead of ufind
* unify dist test name
* wip for py3 support
* fix var deref
* fix python3 mem_opt order
* remove comments
7 years ago
tensor-tang
f42a12dadc
Merge remote-tracking branch 'ups/develop' into remove/kwargs
...
fix conflicts
7 years ago
Yu Yang
e1913bc5d8
Fix MixedVector
7 years ago
tensor-tang
560ad776e3
update api spec
7 years ago
Dang Qingqing
cc94b157bf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantize_transpiler_update
7 years ago
Tao Luo
dbf07982a7
Merge pull request #13498 from luotao1/for_test
...
use clone(for_test=True) replace get_inference_program
7 years ago
tensor-tang
567b711db9
remove kwargs of brelu, leaky_relu and soft_relu
7 years ago
whs
fc44087d31
Add roi perspective transform op. ( #13176 )
...
* Add roi perspective transform.
* Add roi_perspective_transform_op.
* Fix code style.
* Add python api and fix doc.
* Fix API.spec
* Fix python api.
* Fix API.spec
* Move src to detection.
7 years ago
tensor-tang
2582fecaf7
fix dtype
7 years ago
Zhen Wang
2b2337ee77
change the line 382 of API.spec
7 years ago
tensor-tang
215cdafeae
fix api spec order
7 years ago
Zhen Wang
26857692cd
remove the kwargs of elu,relu6,pow,stanh,hard_sigmoid and swish
7 years ago
sneaxiy
192c49cb07
modification
7 years ago
sneaxiy
03a99a8afa
Merge develop
7 years ago
sneaxiy
0a36ef3c18
enhance eager deletion
7 years ago
tensor-tang
3b7d9f7b83
remove kwargs of some activation api
7 years ago
Dang Qingqing
605907fe85
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantize_transpiler_update
7 years ago
Qiao Longfei
a32ad64484
Merge branch 'develop' into fix-img_conv_group-doc
7 years ago
Yu Yang
6d2c6f96f1
Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod""
...
This reverts commit a6c8d6b9a2.
7 years ago
Xin Pan
f66d08c2e0
Merge pull request #13493 from panyx0718/doc
...
convert **kwargs to explicit arguments
7 years ago
Yu Yang
a6c8d6b9a2
Revert "Merge pull request #13431 from chengduoZH/refine_lod"
...
This reverts commit bd79e04667, reversing
changes made to 6b4d290c18.
7 years ago
nhzlx
f277f53c33
out of memory... i bet it's the last time commit for this pr
7 years ago
Yu Yang
3ef32b6754
Remove Parameter APIs
7 years ago
Zeng Jinle
7f1e312677
Merge pull request #13456 from sneaxiy/refine_sparse_adam
...
Fix sparse Adam and Gradient clip of SelectedRows
7 years ago
qiaolongfei
946dabad3a
add paddle.fluid.nets.img_conv_group to API.spec
7 years ago
luotao1
abf019f61d
Merge branch 'develop' into for_test
7 years ago
Yu Yang
b5996fa124
Fix unstable selected_rows_functor_test.cu
7 years ago
Tao Luo
b75887514e
Refine infer api test ( #13472 )
...
* refine analyzer_nlp_tester
* refine analyzer_rnn/vis_tester
7 years ago
sneaxiy
d775087dc7
Merge develop
7 years ago
Dang Qingqing
9e580631aa
Fix conflicts.
7 years ago
luotao1
3ba7e74db2
use clone(for_test=True) replace get_inference_program
7 years ago
sneaxiy
a29b4227eb
fix sparse gradient clip
7 years ago
Yihua Xu
87086b1386
Refine activation for GRU operator ( #13275 )
...
* Optimize GRU with AVX instruction
* Clean code
* Add the unit test and fix the alignment issue
* Remove the remnant part of the unit test
* Code clean
* Fix the parameters length issue for fusion_gru to pass CI
* Change the default type as float32
7 years ago
chengduo
d402234ba8
Feature/op_fuse_pass ( #12440 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* add op_fuse pass
* add backward
* code refine
* use TopologySortOperations
* follow comments
* refine IsFusible
* code enhance
* fix op_fusion_pass
* refine code
* refine fuse_elemwise_act_op
* adjust the input and output
* refine logic
* add intermediate_edge
* disable inplace
* follow comments
* refine logic
* follow comments
* Remove the removable IntermediateOut
* change strategy
* code refine
* enable fuse backward
* code refine
* code refine
* rename unit test
* follow comments
7 years ago
nhzlx
a0a6789347
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx
0c51170052
fix the ut test error :)
7 years ago
nhzlx
4801beb101
add arguments for trt config
7 years ago
Xin Pan
2b007fb55d
update API.spec
7 years ago
Qingsheng Li
21ec93aa7e
[WIP]Sequence Scatter Op ( #12625 )
...
Sequence Scatter Op
7 years ago
nhzlx
202e0a1ee1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx
4c52be07dd
fix ut error
7 years ago
Tao Luo
f1cbb4b0bd
Merge pull request #13479 from luotao1/inference_transpiler
...
hidden InferenceTranspiler
7 years ago
Xin Pan
23ba766252
Merge pull request #13475 from panyx0718/ir5
...
avoid creating dangling ir::Node.
7 years ago
Jiabin Yang
60d163ab17
Merge pull request #13473 from JiabinYang/mac/fix_compile
...
fix mac compile error
7 years ago
nhzlx
dcc09dce35
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx
94a57f1d83
add trt config to arguments
7 years ago
Xin Pan
759dae0493
Merge pull request #13461 from panyx0718/ir3
...
clean unused inference_optimize c++ implementation
7 years ago
luotao1
ff37993bf3
hidden InferenceTranspiler
7 years ago
chengduo
fd8d83e68a
Fix the nested dyn_rnn ( #13417 )
...
* add unit test for nested drnn
* add nested dyn_rnn
* refine while_op
* fix bug
7 years ago
whs
cf128231c6
Add truncated gaussian initializer. ( #13000 )
...
* Add truncated gaussian initializer.
* Fix unitest.
* Update API.spec
* Fix code style and fix bug.
* Fix code style.
* Small fix.
7 years ago
Xin Pan
0bd7a67eaf
avoid creating dangling ir::Node.
...
Node should be created by Graph::CreateXXX so that
they are managed by graph.
7 years ago
Zeng Jinle
642cf6ca2f
Merge pull request #13418 from sneaxiy/dam_save_memory
...
Modify some ops to save memory
7 years ago
JiabinYang
9d2d3096c5
fix mac compile error
7 years ago
Yu Yang
aa79bccfff
Merge pull request #13460 from reyoung/fix_data_transform
...
Wait input when data transform
7 years ago
whs
5dc5175007
Add python api for expand op. ( #13453 )
...
* Add python api for expand op.
* Fix unitest.
* Remove 'out' from arguments and fix code style.
* fix API.spec
* Fix API
* Fix unitest
7 years ago
Dun
253f618ac7
loosen the restriction of output_size in conv2d_transpose ( #12292 )
...
* loosen the restriction of output_size in conv2d_transpose
* test and docs
* fix code style
* fix ci test error
* bug fix
* fix python3 issue
7 years ago
Yu Yang
155ebbb9dc
Merge pull request #13449 from chengduoZH/speed_up_lod_tensor_to_array
...
Speed up lod_tensor to array and array to lod_tensor
7 years ago
Yu Yang
93456fc676
Merge pull request #13427 from reyoung/feature/hide_recordio_reader
...
Hide RecordIO Reader
7 years ago
Wu Yi
efafc72f62
Hide program APIs ( #12315 )
...
* hide program APIs
* fix merge error
* update
7 years ago
nhzlx
68fb818aa8
add ut of trt common models
7 years ago
qingqing01
c9e5c1e41b
Hidden ParallelDo. ( #13454 )
7 years ago
chengduo
6757a31552
[Accelerate] Refine seq_softmax_op ( #13421 )
...
* refine seq_softmax_op
* fix seq_softmax
* use cub in seq_softmax
7 years ago
Yibing Liu
9cc1937d43
Merge pull request #13400 from kuke/fix_seq_pad
...
Get sequence length in sequence_pad op & fix sequence_mask op
7 years ago
Yu Yang
2c31ea9293
Merge pull request #13424 from chengduoZH/refine_seq_concat
...
Refine seq_concat
7 years ago
Yu Yang
5996e224fa
Merge pull request #13430 from chengduoZH/refine_seq_pool
...
Refine seq_pool
7 years ago
sneaxiy
fd3e32ea7d
Merge develop
7 years ago
Xin Pan
5ce77889f4
clean unused inference_optimize c++ implementation
7 years ago
Yu Yang
922dee3b43
Wait input when data transform
7 years ago