chentianyu03
b59b6d7ae6
Complex op test ( #29753 )
...
* remove unnecessary input calculation in dygraph op_test
5 years ago
liym27
096c048b45
Fix unittest test_slice ( #29740 )
...
Before this commit, test_slice used the old API `dygraph_to_static_func` for Dynamic-to-Static and used Executor explicitly, which is not recommended to users.
After the fix, it uses the recommended API `paddle.jit.to_static` in place of `dygraph_to_static_func`, which won't trigger the random exception on coverage CI.
5 years ago
Huihuang Zheng
2e788bd81e
Reduce batch size to fix CPU memory, test=develop ( #29736 )
...
The unit test reported insufficient memory on CPU machines. Reduce the batch size again.
5 years ago
LielinJiang
10edfb6f21
Update en docs of to_tensor ( #29718 )
...
* update to_tensor en docs
5 years ago
chentianyu03
71063b8137
add conj op for complex types ( #29527 )
...
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* user define grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
5 years ago
WangXi
9cbcc6cadc
fleet sync build strategy, test=develop ( #29732 )
5 years ago
Chen Weihang
6cfa59de1b
[Complex] Add real & imag op and api for complex tensor ( #29672 )
...
* add complex real op & api & unittest
* add imag op & api & unittest
* refactor op impl
* revert simplified writing due to compile failure
* polish details
* polish grad op code
5 years ago
LiuChiachi
572810eecb
Update EarlyStopping sample code ( #29723 )
...
* update EarlyStopping doc
* update EarlyStopping doc, test=document_fix
5 years ago
TTerror
af8ded773a
update activation op on kunlun ( #29577 )
...
* fix expand && concat/transpose to new api
* update xpu_header
* update activation op on kunlun
* add nearest_interp on kunlun
* update error message
5 years ago
ceci3
cc387159f3
add pad and concat double grad ( #29549 )
...
* add constant pad double grad
5 years ago
liuyuhui
f13c3a9cd7
[Kunlun] PR1:Support one Kunlun card training in parallel executor ( #29337 )
5 years ago
huangxu96
b96dada4f0
add static.amp into setup.py.in ( #29621 )
...
* add static.amp into setup.py.in
* add unittest for api
5 years ago
YUNSHEN XIE
d0b789d27f
disable ut test_cumsum_op ( #29613 )
5 years ago
Jack Zhou
84bae27779
fix wmt14 doc, remove backward, add bidirect direction in rnn api ( #29633 )
...
* fix wmt14 doc, remove backward, add bidirect direction in rnn api
* fix rnn unittest
* fix test_rnn_nets_static.py bug
5 years ago
YUNSHEN XIE
2926e74326
New UT should not exceed 15s ( #29492 )
...
* added UT should not exceed 15s
* fix error
* UT limit of 15s is the first to be executed
* fix error
* fix error with CI_SKIP_CPP_TEST
* modified timeout setting
* fix error
5 years ago
Chen Weihang
f02aece1f0
Add complex dtype op (add) test example ( #29603 )
...
* add op test case for complex
* polish code details
* add xpu set constant support
* fix argument error
* remove useless pyc file
5 years ago
AshburnLee
efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS ( #28732 )
5 years ago
lijianshe02
7779768b53
add transpose double grad test=develop ( #29600 )
...
* add transpose double grad test=develop
5 years ago
huangxu96
c05170d3d8
add alias for fluid.contrib.mixed_precision ( #29562 )
...
* add alias for fluid.contrib.mixed_precision
5 years ago
ShenLiang
fb6697b424
Fix the download bug in the case of multiple machines ( #29551 )
...
* fix the download bug
* add sort for ips
5 years ago
ShenLiang
1efef8baed
Fix bug of matmul_v2 for broadcast case ( #29599 )
...
* fix bug of matmul_v2 for broadcast
5 years ago
qingqing01
8d549fc85d
Add clip double grad ( #29590 )
5 years ago
Tao Luo
81acc3278c
disable test_parallel_executor_profiler in cuda 10.1 ( #29581 )
...
* disable test_parallel_executor_profiler in cuda 10.1
* update set_tests_properties
5 years ago
wangchaochaohu
ac4bae8ee9
elementwise_add_grad Op optimization ( #29575 )
5 years ago
huangxu96
2cb6f94888
add float16 into adaptive_avg_pool2d check list. ( #29547 )
5 years ago
yukavio
ee1a7d020c
add some feature for paddle.flops ( #29572 )
5 years ago
WangXi
467c716963
gen nccl id use socket ( #29431 )
5 years ago
Bai Yifan
d72604cd46
fix unittest unstable issue on ci machine ( #29588 )
...
* fix unittest unstable issue on ci machine
5 years ago
QingshuChen
79a41a9ed6
support roi_align & affine_channel for kunlun ( #29561 )
...
* support roi_align & affine_channel for kunlun
* minor
5 years ago
liym27
0cad1152f4
[Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step value is negative in for-range stmts ( #29519 )
...
1. Fix error in _build_cond_stmt of for-range stmts.
2. Support that step value is negative in for-range stmts
3. Fix code because of the diff between Py2 and Py3
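A minimal sketch of why the loop condition of a for-range statement has to depend on the sign of the step. This is plain Python written for illustration; `range_as_while` is a hypothetical helper and not the code in `_build_cond_stmt`.

```python
def range_as_while(start, stop, step):
    """Emulate `for i in range(start, stop, step)` with a while loop."""
    if step == 0:
        raise ValueError("step must not be zero")
    out = []
    i = start
    # A positive step counts up towards stop, a negative step counts down,
    # so a single `i < stop` test would be wrong for negative steps.
    while (step > 0 and i < stop) or (step < 0 and i > stop):
        out.append(i)
        i += step
    return out
```

With a negative step the second half of the condition keeps the loop running until `i` drops to `stop` or below, which matches the built-in `range`.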
5 years ago
Huihuang Zheng
831e9135b9
Fix Windows Unittest ( #29543 )
...
Fix 3 Windows Unittests
test_fuse_all_reduce_pass: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_feed_data_check_shape_type: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_tsm: Windows GPU memory is not enough, so decrease batch size and data size.
5 years ago
GeminiCarrie
08f24a3108
Fix precision problem ( #29567 )
...
* Fix a bug when running on an operating system without "bash."
* add execution condition
* for ci-coverage
* get cpu information to check the precision problem
* Update compilation environment for musl version
* update dependencies
* remove test code
check cpu info
remove test code
review
* update alpine and third_party dependencies
* add newline for ci Code format
5 years ago
JZ-LIANG
d33d468f02
[Sharding] add hybrid-dp feature ( #29518 )
...
* Sharding add hybrid-dp feature
* update sharding in distributed_strategy
* update sharding unittest
* revise code format for sharding
5 years ago
Chen Weihang
c1a26e2a05
fix train eval set error in static mode ( #29540 )
5 years ago
taixiurong
760d015c14
add xpu ops for training transformer in kunlun ( #29539 )
...
* 1.fix matmul bug 2. add one hot
* add xpu error msg
5 years ago
Leo Chen
0fdd365665
Add fast path for dropout when p == 0 ( #29553 )
...
* add fast path for p == 0 in dropout
* add ut
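The idea of a fast path for p == 0 can be sketched in plain Python. This is an illustration only, not the Paddle kernel: the `dropout` function below, its list-of-floats input, and the inverted-dropout scaling are all assumptions made for the example.

```python
import random


def dropout(values, p, training=True, rng=None):
    """Inverted dropout over a list of floats (illustrative only)."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be in [0, 1]")
    # Fast path: with p == 0 (or outside training) nothing is dropped,
    # so skip mask generation and scaling and return the input as is.
    if p == 0.0 or not training:
        return list(values)
    if p == 1.0:
        return [0.0 for _ in values]
    rng = rng or random.Random()
    scale = 1.0 / (1.0 - p)
    # Keep each element with probability 1 - p and rescale the survivors.
    return [v * scale if rng.random() >= p else 0.0 for v in values]
```

The fast path avoids drawing random numbers at all, which is where the saving comes from when dropout is configured but effectively disabled.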
5 years ago
Wojciech Uss
917a11495f
fix infinite scale values ( #29386 )
5 years ago
lijianshe02
bd29052e33
fix random seed in nll_loss unittest test=develop ( #29538 )
...
* fix random seed in nll_loss unittest test=develop
5 years ago
joanna.wozna.intel
0ce6d7fa77
Fix bf16 activations test for softmax and gelu ( #29502 )
...
* Fix bf16 activations test for softmax and gelu
* Resolve conflict
5 years ago
huangxu96
4001979309
Add ReserveSpace in dygraph batch_norm. ( #29221 )
...
* Add ReserveSpace in dygraph batch_norm.
* Add unittest for reservespace
5 years ago
arlesniak
b781953ef5
[oneDNN] Fix flags use test for #29080 , assert condition more general ( #29493 )
...
* Flags assert condition more general, print output if pattern not found
* removed test_flags_use_mkldnn from skip list regarding #29080 description
5 years ago
Zhen Wang
5ac71b36fb
Remove tensor copy in the update_loss_scaling op. ( #29426 )
...
* remove tensor copy in the update_loss_scaling op
* not use thrust.
* fix some cuda memory access error.
5 years ago
Zhou Wei
e74e1a226c
support deepcopy for Layer/Tensor/ParamBase ( #29387 )
...
* support deepcopy for Layer/Tensor/ParamBase
* fix some code
5 years ago
joejiong
50d3117d30
Add random_split and Subset dataset ( #29291 )
...
As the title
5 years ago
joejiong
87e75a77c2
Add tangent operator ( #29207 )
...
As the title
5 years ago
Wei Shengyu
dc8bb76c68
remove addcmul ( #28937 )
...
* remove addcmul
* remove unittest and other related code of addcmul
* fix bug
* fix merge conflict
5 years ago
Zhong Hui
f459dd9634
fix abs double grad unittest ( #29478 )
...
fix abs double grad unittest & define the data range for the abs double grad
5 years ago
huangxu96
576d0d938b
add fp16 check into max and avg pool ( #29479 )
5 years ago
ShenLiang
2ef9e0e23c
Rebuild group automatically in dynamic graph distributed ( #29255 )
...
* add tensor_indices in AssignGroupBySize
* add rebuild group in reducer
5 years ago
procr
3a0558339d
support mobilenet for kunlun ( #29458 )
5 years ago
Aurelius84
5d530c9319
fix amp support fleet ( #29491 )
5 years ago
ShenLiang
311b3b44fc
Fix the bug where embedding can't be processed correctly in reducer ( #29485 )
...
* fix the bug of reducer in embedding
* add comment
5 years ago
Pei Yang
2480bdef6c
change hard_swish from plugin to layer ( #29177 )
...
* change hard_swish from plugin to layer
* add ut when threshold != scale
5 years ago
lilong12
b122d0bb76
Fix bug in gloo that gloo initialization hangs ( #29447 )
...
* update, test=develop
5 years ago
taixiurong
ecca6585cd
1. fix elementwise ops' bug 2. fix softmax_with_cross_entropy_op 3. add bilinear_interp_op ( #29448 )
...
Co-authored-by: root <root@bjhw-sys-rpm0223.bjhw.baidu.com>
5 years ago
LoveAn
03b42d9fa7
fix unittest on windows, test=develop ( #29365 )
5 years ago
ShenLiang
22e6b9e373
Fix the ut of matmulv2 for broadcast case ( #29461 )
...
* fix the ut of matmulv2 for broadcast
5 years ago
TTerror
a5fcc4b545
update reduce_sum op on xpu ( #29367 )
...
* update reduce_sum op on xpu
* support running on xpu
5 years ago
chentianyu03
acce962133
remove complex module direction ( #29419 )
5 years ago
Zhang Ting
6296f4ed09
revert cast eigen kernel ( #29427 )
5 years ago
Leo Chen
a040c055a5
fix layer_norm accuracy ( #29434 )
5 years ago
Shang Zhizhou
225a9c4ed8
Fix unittest ( #29412 )
...
* fix tensorrt unittest precision error
* fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
5 years ago
Pei Yang
f860de4af7
support clip op trt converter ( #29411 )
5 years ago
Bai Yifan
87bb726258
Add deform_conv2d,DeformConv2D ( #29364 )
...
* add deform_conv2d,DeformConv2D
5 years ago
chentianyu03
64e4e17f0c
remove complexvariable ( #29390 )
...
* rm complexvariable
* modify test_var_base unittest
* remove duplicated codes
5 years ago
chajchaj
79e6086743
change shape of output in cross_entropy, test=develop ( #29220 )
5 years ago
liuyuhui
2ee7a6b08c
[paddle v2.0.0rc1: API fixes] assign/conv2d/conv2d_transpose/cast/ParamAttr ( #29171 )
...
* fix DLTP-15151, paddle.ParamAttr API
* fix DLTP-15083/DLTP-15274, paddle.nn.functional.assign paddle.cast API
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
* fix DLTP-15083, paddle.nn.functional.assign API
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
* support in_dygraph_mode for cast op, test=develop
* fix bug,test=develop
* fix doc
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
5 years ago
Guo Sheng
8fc7f1b66a
Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr ( #29235 )
...
* Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr.
test=develop
* Fix api doc for print in label_smooth.
test=develop
* Update api docs according to review comments.
Add name argument in RNN back.
test=develop
5 years ago
Chen Long
c940f842ca
remove rarfile from requirements ( #29319 )
5 years ago
yongqiangma
7c508d8668
update unbind norm add CUDAPlace api doc information ( #29322 )
...
* enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop
* fix format. test=develop
* format fix. test=develop
* add lod_rank_table. test=develop
* fix format. test=develop
* fix doc info. test=develop
* fix np error
* add unbind dygraph api. test=develop
* fix unbind doc.test=develop
5 years ago
chentianyu03
879e913b6d
Make transpose, trace, kron, reshape, sum op support complex type ( #29321 )
...
* add complex64 and complex128 type; add +-*/@ and slice operator for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* kron, reshape, transpose support complex types
* sum and trace op support complex types
* add test case of sum and trace op
* fix the bug of imag part of complex not initialized
* format file
* format code style
* kron support type promotion; modify test cases
5 years ago
Chen Long
66fd1c00a0
fix some docs test=develop;test=document_fix ( #29374 )
5 years ago
liym27
5f84d0b375
Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray ( #29306 )
5 years ago
Feiyu Chan
f7cdcefa65
fix multiple documentation errors, test=document_fix ( #29210 )
...
* fix multiple documentation error, test=document_fix
* fix more rst syntax errors, test=document_fix
* fix format issues in docstring, test=document_fix
5 years ago
卖鱼的哲学
074065e5de
fix expand/uniform_random && concat/transpose to new api on xpu ( #29280 )
...
* fix expand && concat/transpose to new api
* update uniform_random_op
* update xpu_header
5 years ago
ShenLiang
4064354a01
support dp run single card ( #29358 )
5 years ago
gongweibao
8989053443
Fix bug of test_fleet_launch_async.sh ( #29332 )
5 years ago
Huihuang Zheng
8f7627907c
[Dy2stat] Reduce Exception Type for Better Error Message ( #29268 )
...
Reduce the exception types so that if convert_to_static fails, it reports the right error message.
5 years ago
liym27
61a8f2874f
[Dy2Stat] Fix bug: Do not use gast.Subscript to replace gast.Name when transforming for_enumerate_loop ( #29310 )
5 years ago
liym27
b10ecd9d3a
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows ( #29267 )
5 years ago
Chen Weihang
9ad800ebb2
Support type promote for basic math ops (quantum required) ( #29265 )
...
* basic impl of type promote
* add comment & another testcase
* fix complex bugs & support python op promote type
* fix failed unittests & polish code
* add unittest for coverage
* change to only promote complex type
* polish code details
* polish several comments
5 years ago
LielinJiang
f31e5adab5
fix typo in ProgBarLogger ( #29329 )
5 years ago
tangwei12
8358791607
fix gpu outofrange ( #29238 )
...
* fix gpu emb out of range
Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf
* fix doc
Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf
5 years ago
YUNSHEN XIE
28164b266f
disable test_rnn_decode_api and test_complex_matmul on windows ( #29252 )
5 years ago
Leo Chen
b58cfff89d
use has_grad instead of train_mode ( #29309 )
...
* use has_grad instead of train_mode
* add vlog for debug
* fix ut
* fix ut
5 years ago
Aurelius84
67c700b479
[Dy2Stat] Add cache for Executor and Context in run_program_op ( #28421 )
5 years ago
ShenLiang
d6753e1e6d
fix matmulv2 for windows ( #29327 )
5 years ago
gongweibao
96de8b008f
cleanup enum test=develop ( #29294 )
5 years ago
liym27
b9a8ebd50f
[Dy2stat] Add a decorator paddle.jit.not_to_static to exclude a function from Dynamic-to-Static conversion. ( #29253 )
...
Usage scenarios: if a function can already run successfully in static mode, you can use this decorator on it in the following cases:
1. An unknown error occurs in the dynamic-to-static conversion process of the function;
2. In the internal implementation of the function, it has two branches: dynamic branch and static branch;
3. Users don't want to convert the function in the process of dynamic to static.
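The opt-out mechanism can be pictured with a small pure-Python sketch. Everything here is hypothetical and written for illustration: `not_to_static` below is a stand-in with the same name as the Paddle decorator, `convert` is an invented placeholder for the converter, and the `_skip_conversion` attribute is an assumption about how such a marker could be stored, not Paddle's implementation.

```python
def not_to_static(func):
    """Mark a function so a (hypothetical) converter leaves it untouched."""
    func._skip_conversion = True
    return func


def convert(func):
    """Stand-in for a dynamic-to-static converter."""
    if getattr(func, "_skip_conversion", False):
        # Opted out: hand back the original function unchanged.
        return func

    def converted(*args, **kwargs):
        # A real converter would rewrite the function's AST here.
        return func(*args, **kwargs)

    converted._converted = True
    return converted


@not_to_static
def already_static(x):
    return x + 1


def needs_conversion(x):
    return x * 2
```

Here `convert(already_static)` returns the very same function object, while `convert(needs_conversion)` returns a wrapped one.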
5 years ago
ShenLiang
2d6aa1a5bb
fix warning of fleet ( #29317 )
5 years ago
ShenLiang
2cd0bf5764
Fix doc of fleet api ( #29282 )
...
* fix doc, test=document_fix
5 years ago
ShenLiang
c00af94435
fix matmulv2 for windows ( #29302 )
5 years ago
Steffy-zxf
41f17aeb8b
fix DATA_HOME path in win ( #29222 )
...
* fix DATA_HOME path in win
5 years ago
Jack Zhou
cf43322139
fix nll_loss doc;test=document_fix; ( #29247 )
...
* fix nll_loss doc;test=document_fix;
* remove numpy and set_device;test=document_fix;
* remove numpy;test=document_fix;
5 years ago
LielinJiang
b9f1f4343b
Move temporal_shift to paddle.nn.functional ( #29261 )
...
* move temporal_shift to functional
5 years ago
Chen Weihang
a2e9d95a4a
change test_imperative_signal_handler_to_exclusive ( #29283 )
5 years ago
Zhen Wang
be3777a50a
Add pure fp16 training with master weights. ( #27712 )
...
* add the weight decay func for the momentum op
* Add the multi_precision function in Momentum Optimizer.
* Make sure that the initial values of the master weights are the same as the fp16 weights.
* add static loss scaling.
* add the rescale_grad function in the pure fp16 training.
* use the original momentum updating method.
* Polish some codes, such as variable names.
* add docstring for apis.
* update the var creation details of _create_master_weight.
* not modify codes about imperative momentum updating.
* Fix the error of test_dist_sparse_tensor_load_momentum UT.
* add unit test for multi precision fp16 training.
* add more unit tests for CI.
* Use lower threshold values for the allclose comparison in test_multi_precision_fp16_train UT.
* For CI Coverage Checking.
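Why master weights matter in pure fp16 training can be shown with a small stdlib-only sketch. It is not the Momentum optimizer from this PR: `to_fp16` and `train` are hypothetical helpers, the update is a plain constant increment, and a Python float (double precision) stands in for the higher-precision master weight.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def train(steps, update, use_master_weight):
    weight_fp16 = to_fp16(1.0)
    # The master weight starts out equal to the fp16 weight.
    master = weight_fp16
    for _ in range(steps):
        if use_master_weight:
            # Accumulate in high precision, then cast down for compute.
            master += update
            weight_fp16 = to_fp16(master)
        else:
            # Accumulate directly in fp16: small updates get rounded away.
            weight_fp16 = to_fp16(weight_fp16 + update)
    return weight_fp16
```

Near 1.0 the spacing between fp16 values is about 0.001, so an update of 1e-4 applied directly to the fp16 weight is lost on every step, while the master copy keeps accumulating it until the fp16 weight finally moves.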
5 years ago
chentianyu03
976961de6d
fix random failed of complex matmul ( #29285 )
5 years ago
furnace
7584bb5096
Layer norm fp16 ( #29169 )
...
* add fp16 for layer_norm op
* revert layernorm api
* fix forward
* fix forward
* fix backward for layernorm with fp16
* fix unit test for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
5 years ago
mls1999725
a37963b890
Update APIs in text/datasets and dataloader ( #29219 )
...
* Update IterableDataset API
* Update TensorDataset API
* Update APIs in paddle/text/datasets
* Update dataset.py
5 years ago
mls1999725
493568b070
Update Codes of Cifar and VOC2012 ( #29204 )
...
* Update Cifar Codes
* Update VOC2012 Codes
* Update voc2012.py
* Update voc2012.py
* Update cifar.py
* Update cifar.py
* Update voc2012.py
5 years ago
mls1999725
0aedd463ee
Update get_worker_info API ( #29190 )
...
* Update get_worker_info API
* Update dataloader_iter.py
5 years ago
mls1999725
6a9a62c3ef
Update conv3d API ( #29205 )
...
* Update conv3d API
* Update nn.py
5 years ago
Huihuang Zheng
aec05d811c
[Dy2stat] Fix PaddleGan Deoldify Model Dy2stat Problems ( #29226 )
...
This PR fixes several problems in dy2stat for Deoldify model in PaddleGan.
1. In the model, the software engineer wrote if x.shape == y.shape. The Tensor shape is a tuple in dygraph, so == returns True/False, but in static graph == becomes an element-wise comparison, which is a different behavior. In this PR we reduce the element-wise comparison result.
2. If the software engineer writes computations that use parameters in hooks, the static graph can lose the parameter variable because we put param_guard at the forward of a Layer. In this PR we made param_guard cover pre-hook and post-hook.
3. In PaddleGan, the software engineer calculated some parameter values in __init__ by running some dygraph code. That code also runs during dy2stat, so some variables may be assigned as a VarBase (Tensor) first and then as a Variable, which raised an error. This PR fixes the bug by handling that case.
TODO: We only added a test case for 1. (shape comparison). Test cases for 2. and 3. should be added, but since we are chasing 2.0RC, I will do it in a near-future PR.
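The shape-comparison issue can be sketched in plain Python, with lists standing in for static-graph tensors. `elementwise_eq` and `shapes_equal` are hypothetical helpers for illustration, not the functions added in this PR.

```python
def elementwise_eq(shape_x, shape_y):
    """Mimic static-graph `==`: one boolean per dimension, not one bool."""
    return [a == b for a, b in zip(shape_x, shape_y)]


def shapes_equal(shape_x, shape_y):
    """Reduce the element-wise result to the single bool dygraph returns."""
    if len(shape_x) != len(shape_y):
        return False
    return all(elementwise_eq(shape_x, shape_y))
```

In dygraph, comparing two shape tuples already yields one bool; the reduction with `all` restores that behavior when the comparison is element-wise.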
5 years ago
Leo Chen
116305ea4b
Improve performance of elementwise_add grad op ( #29187 )
...
* pass stop_gradient for cast op
* improve performance of elementwise_add grad
* use tensor copy async
* dygraph branch
* fix dygraph branch
* add ut
5 years ago
卖鱼的哲学
07c67d5a8b
add deformable_conv op on xpu ( #29234 )
...
* rebase develop
* update deformable_conv op on xpu
5 years ago
Chen Weihang
1de32f823d
Hot fix compile failure in gcc4.8 caused by complex impl ( #29254 )
...
* hot fix compile failure in gcc4.8
* fix failed unittest
5 years ago
yukavio
a71ea00922
add unit test ( #29228 )
5 years ago
ShenLiang
46b73e6cd9
Change the api of DataParallel and Fleet ( #29224 )
5 years ago
Leo Chen
73e51a17e7
add stop_gradient property and remove reduce redundant information ( #29185 )
...
* add stop_gradient property and remove reduce redundant information
* refine code
5 years ago
QingshuChen
64f29fbb70
update kunlun conv2d/softmax/elementwise implementation ( #29229 )
...
* update conv2d & softmax to new xpu api
* test=kunlun
* remove useless comments
* test=kunlun
* remote softmax xpu op
* test=kunlun
* update kunlun softmax
* test=kunlun
* update xpu unittest
* test=kunlun
* fix elementwise_grad bug for kunlun
* test=kunlun
5 years ago
Jiawei Wang
b11ab12787
Fix doc (adadelta, sgd, momentum) ( #29212 )
...
* fix 3 doc
* Update adadelta.py
5 years ago
lijianshe02
76312deb30
fix nll_loss test random fail bug test=develop ( #29236 )
5 years ago
LielinJiang
8a2dd34a1e
fix depthwise conv ( #29227 )
5 years ago
huangxu96
dbdeecd665
Modify doc mistakes of grad API. ( #29176 )
5 years ago
Jiawei Wang
a5d13d593c
Momentum Velocity init in Momentum.__init__() ( #29223 )
...
* add lamb optimizer and unittest
* fix momentum resume training
* fix momentum acc
5 years ago
Leo Chen
4556ad76b4
Upgrade string literals to raw string [part 2]( #29217 )
5 years ago
wanghuancoder
2b2cd1864a
revert python file coverage, delete coverage run --include, test=develop ( #29230 )
5 years ago
chentianyu03
8f45d14263
add complex64 and complex128 type; add +-*/@ and slice operator for c… ( #29199 )
...
* add complex64 and complex128 type; add +-*/@ and slice operator for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
5 years ago
123malin
cc9c619679
test=develop, fix doc ( #29200 )
...
* fix fleet api doc
5 years ago
Zhou Wei
c0a991c874
accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept ( #28429 )
...
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor
* fix coverage
* fix api doc
* fix CI unittest
* fix unittest
* empty tensor doesn't need inner_var_
* fix some error message
5 years ago
huangjun12
b6a26749dc
fix doc of alpha_dropout/dropout/dropout2d/dropout3d/npair_loss ( #29136 )
...
* fix en doc, test=document_fix
* add blank after code declare, test=document_fix
* refine doc of dropout, test=document_fix
* refine npair_loss and dropout, test=document_fix
5 years ago
LielinJiang
d8eef4e4a4
Remove dependence of scipy ( #29121 )
...
* lazy import for scipy
* rm unused check
5 years ago
yaoxuefeng
a069e1ca91
fix docs ( #29097 )
5 years ago
Chen Weihang
786e69e9c7
disable test_yolov3 in musl ( #29216 )
5 years ago
hong19860320
f23665e5d5
Refine the doc and unit test for Sigmoid and stanh ( #29198 )
5 years ago
123malin
b5c6342336
Update ps gpu ( #29209 )
...
* fix parameter prefetch & device guard
Co-authored-by: MrChengmo <cmchengmo@163.com>
Co-authored-by: chengmo <chengmo@baidu.com>
5 years ago
liym27
865a45984f
Check whether there is any inplace operation affecting gradient calculation. ( #27901 )
...
* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.
* Add a new attribute `_inplace_version` for VarBase.
* Raise exception if an inplace operation can result in incorrect gradient computation.
* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.
* For api assign, call _bump_inplace_version() when it's an inplace operation in dynamic mode.
* Use original var_wrapper if the inplace_version is not changed.
* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performance.
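The version-counter check described above can be sketched in a few lines of plain Python. The names `_inplace_version` and `_bump_inplace_version` come from the commit message; `VersionedTensor`, `Snapshot`, and `fill_` are hypothetical stand-ins, and the real implementation lives in the C++ framework.

```python
class VersionedTensor:
    """Toy tensor that counts how often it was modified in place."""

    def __init__(self, data):
        self.data = list(data)
        self._inplace_version = 0

    def _bump_inplace_version(self):
        self._inplace_version += 1

    def fill_(self, value):
        # In-place write, so the version has to be bumped.
        self.data[:] = [value] * len(self.data)
        self._bump_inplace_version()


class Snapshot:
    """Remembers the version a tensor had when it was saved for backward."""

    def __init__(self, tensor):
        self.tensor = tensor
        self.version = tensor._inplace_version

    def get(self):
        # A changed version means the saved value is stale, so a gradient
        # computed from it could be wrong.
        if self.tensor._inplace_version != self.version:
            raise RuntimeError("tensor was modified by an inplace operation")
        return self.tensor
```

If the version is unchanged, the original tensor is handed back directly, which corresponds to the "use original var_wrapper" item in the list.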
5 years ago
lilong12
08fb079dbc
Fix the doc for shard_index api ( #29183 )
...
* update, test=develop
5 years ago
qingqing01
058f1b2284
Enhance paddle.metric.Accuracy ( #29125 )
5 years ago
joejiong
dc070ecfb0
Remove cast from paddle.pow api ( #29134 )
...
As the title
5 years ago
WangXi
0c2a51d240
optimizer amp, all use fp16 communication, overlap last comm and compute ( #28957 )
5 years ago
Chen Weihang
0b032faeee
Polish unittests details and execution conditions to adapt to MUSL ( #29044 )
...
* fix failed tests in the list given by yingchun
* add unittests into static_mode_white_list
* add enable static
* fix dist unittest
* skip test_sigmoid_focal_loss_op & add gym
* revert no need skip unittests
* remove gym
5 years ago
123malin
92817f8005
test=develop, rm pathlib ( #28658 )
...
* test=develop, rm pathlib
5 years ago
Wojciech Uss
4fd4095d1b
Add quantization of multi_gru op and tests ( #28615 )
5 years ago
Thunderbrook
4adddcc89a
add set_trainer_num api in dataset ( #29133 )
5 years ago
liym27
e03440812a
fix code: if y is True -> if y ( #29184 )
5 years ago
danleifeng
7e7b4b9e5d
remove sampled_softmax_with_cross_entropy alias;test=develop ( #29180 )
5 years ago
WeiXin
1476e1f998
save model after jit.load ( #28748 )
...
* Changed a variable name error
* Add comments
* Move member functions of TranslatedLayer out of function
* edit code according to review
* Edit input argument of '_run_static_graph'
* reset due to Segmentation fault
* rename variables when stitching graph
* modify code according CI
* Add comments to '__i_m_p_l__'
* remove blanks before 'Get...'
* edit code according to review
* Add a comment to '_execution_method_creator'
* Edit a comment to '_execution_method_creator'
5 years ago
wanghuancoder
0239f79695
Generate code coverage reports only for incremental files ( #28508 )
...
* Generate code coverage reports only for incremental files, test=develop
* test for diff python file, test=develop
* fix no python diff report, test=develop
* add cc test file, test=develop
* fix bug in generic.cmake, test=develop
* for debug no cc report, test=develop
* modify compare branch from test_pr to test, test=develop
* fix bug, test=develop
* test for h file changed, test=develop
* debug for redefinition of argument optimize error, test=develop
* close -o3 for test, test=develop
* remove -o3 for test, test=develop
* remove coverage option for nvcc, test=develop
* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop
* reopen -o3, test=develop
* remove debug code, test=develop
* remove unused code, test=develop
5 years ago
zhang wenhui
8388abe66b
Fix api 1128 ( #29174 )
...
* fix 2.0 api, test=develop
* fix api, test=develop
5 years ago
LielinJiang
f92fdfb8ef
Add ReduceLROnPlateau ( #29113 )
...
* add ReduceLROnPlateau
5 years ago
Huihuang Zheng
27b4218333
[Dy2stat] Disable PaddleInference IR Optimization in test_mnist for CUDA11 ( #29105 )
...
test_mnist failed on CUDA11. We found that it is due to PaddleInference IR Optimization after debugging. We disable it in this PR and we will re-enable it after PaddleInference fixes it.
5 years ago
liym27
01bdea7c31
[Dy2Stat] Don't convert functions from the third-party library logging ( #29161 )
5 years ago
liym27
a7433cc379
[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent Paddle/Python if statement, which depends on if conditions of the return stmt. ( #29165 )
5 years ago
Huihuang Zheng
4a0a870177
[dy2stat] Set shape for linspace to Fix dy2stat for GridGenerator Model ( #29173 )
...
GridGenerator model failed because the output shape of `linspace` is (-1). The reason is that C++ InferShape fixes the shape to (-1):
5da3d514eb/paddle/fluid/operators/linspace_op.cc (L49)
We cannot set the shape in C++ InferShape because this Tensor may not be initialized at compile time, but when the input `num` of `linspace` is an integer, we know the shape at compile time. This PR simply sets the shape in Python and adds GridGenerator as a unittest.
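The Python-side shape rule can be sketched as follows. `infer_linspace_shape` is a hypothetical helper written for illustration, not the function changed in this PR.

```python
def infer_linspace_shape(num):
    """Return the output shape of linspace as far as it is known up front."""
    # A plain Python int is known when the program is built, so the
    # output length is known too. bool is excluded because it is an
    # int subclass in Python.
    if isinstance(num, int) and not isinstance(num, bool):
        return (num,)
    # Anything else (for example a tensor) is only known at run time.
    return (-1,)
```

Any non-integer `num` falls back to the unknown shape `(-1,)`, matching the behavior described for C++ InferShape.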
5 years ago
Aurelius84
cb680c8013
[Dy2Stat]Refine code of test_lac unittest ( #29087 )
5 years ago
ShenLiang
e2d01eb650
Support dynamic graph distributed ( #28997 )
...
* add reducer
* refine event for memorycopy
* add concat&split for allreduce
* apply concat & split for fuse tensor
* fix nccl dep
* fix the unittest, compile problem and ddp initialize problem
* fix unittest for mac & add some comments & solve the repeated param in sublayers
* fix unittest for windows & fix document
5 years ago
lilong12
7e5e9934fe
update expand as op to use the shape of the target tensor instead of the target tensor itself. ( #29020 )
...
* update, test=develop
5 years ago
Kaipeng Deng
f4c894a693
alias yolo_loss & yolo_box to paddle.vision. ( #28520 )
...
* alias yolo_loss & decode_yolo_box to paddle.vision. test=develop
5 years ago
Shibo Tao
4ceedec33d
enhance doc. add kwargs for backward compatibility. test=develop ( #29143 )
5 years ago
LutaoChu
28280647eb
add paddle.subtract, optimize paddle.maximum and paddle.minimum
...
add paddle.subtract, optimize paddle.maximum and paddle.minimum
5 years ago
徐铭远
3c2a46bd7b
fix doc of erf,rank,mm,cross_entropy,pixel_shuffle,kron... ( #29126 )
...
* fix doc example, test=develop, test=document_fix
5 years ago
Chen Long
d576d6ddeb
fix some docs test=develop;test=document_fix ( #29159 )
5 years ago
yukavio
5da3d514eb
solve pretty table dependent in flops api ( #29132 )
...
* solve pretty table dependent in flops api
* add unittest dependent
* temp
5 years ago
pangyoki
6df685ab64
fix nce, multinomial, Categorical, Normal, Uniform en doc ( #28541 )
...
* fix Categorical en doc
* fix doc for apis
* remove numpy in sample code
5 years ago
LielinJiang
9f53f3d09e
Enhance logger callback for benchmark ( #29106 )
...
* enhance logger callback for benchmark
5 years ago
Jack Zhou
085260f3de
Add eigen gru and fix the dropout bug in the rnn
...
Add eigen gru and fix the dropout bug in the rnn
5 years ago
yaoxuefeng
545df287fc
add user_define_dump ( #28596 )
5 years ago
lilong12
216e085605
update, test=develop ( #29139 )
5 years ago
liym27
5fe44571f0
[Dynamic-to-Static] Support **kwargs as input of the function which is decorated by `jit.save.to_static` ( #29098 )
5 years ago
Aurelius84
71815637cc
Move gym into unittest/requirements.txt ( #29149 )
5 years ago
YUNSHEN XIE
0fca8cdfdf
fix error with ut timeout and failed ( #29148 )
5 years ago
GaoWei8
a7ef724dd5
polish softmax doc ( #29153 )
5 years ago
lilong12
a1add716bc
Add a flag to control whether to initialize gloo ( #29150 )
5 years ago
Chen Weihang
0d1900d329
add debug msg for test_buffer_shared_memory_reuse_pass ( #29151 )
5 years ago
guofei
638402274a
Integrate ImperativeOutScale into ImperativeQuantAware. ( #27956 )
...
* Optimize the unittest test_imperative_out_scale
test=develop
5 years ago
Shang Zhizhou
b9e76a0103
detect tensorRT plugin fp16 in runtime ( #27933 )
...
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
* compile with cuda9
* add some unittest
* notest;test=coverage
* add unittest for trt plugin swish && split
* update ernie unittest
* fix some error message
* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
* fix compile error when CUDA_ARCH_NAME < Pascal
* fix compile error
* update unittest timeout
* compile with cuda9
* update error msg
* fix code style
* add some comments
* add define IF_CUDA_ARCH_SUPPORT_FP16
* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
5 years ago
Chen Weihang
c39da29db7
Add symlink force for unittest test_static_save_load ( #29137 )
...
* add symlink force for unittest
* open unittest
5 years ago
xiaoting
9cc0e72619
Fix interpolate doc ( #29104 )
...
* fix interpolate example, test=develop;test=document_fix
* fix format, test=develop, test=document_fix
* update upsample doc, test=develop, test=document_fix
5 years ago
whs
9b39af3f22
Fix docs in 2.0 API ( #29081 )
...
* 1. grid_sample
1.1 fix has_print
2. conv1d_transpose
2.1 fix code_example error
3. conv1d
4. affine_grid
4.1 has_print
4.2 has_disable_static
5. Conv1DTranspose
5.1 fix code_example error
5.2 has_disable_static
6. Conv1d
6.1 code_example
6.2 has_disable_static
5 years ago
Guanghua Yu
bb64efb1d0
fix softmax_with_cross_entropy api en docs ( #29116 )
5 years ago
Chen Weihang
0dfb81614b
polish static save load doc, test=document_fix ( #29115 )
5 years ago
Chen Weihang
95a0f87b44
support jit.save data parallel ( #29135 )
5 years ago
lilong12
449903dead
add paddle.broadcast_to api which is an alias of paddle.expand ( #28706 )
...
* update, test=develop
5 years ago
Noel
da71173bc9
Fix ops doc for some ops
...
Fix ops doc for some ops
5 years ago
Leo Chen
770395cb93
Split train_mode and has_grad for tracer ( #29064 )
...
* split train_mode and has_grad
* fix format
* fix ci problems
* fix sample code
5 years ago
ShenLiang
cddc70964d
fix InMemoryDataset doc ( #28688 )
...
* add Inmemorydataset
5 years ago
ShenLiang
bb5f8e3575
fix doc of data,matmul,dot,cholesky,scatter,divide,remainder,inverse,sign ( #28665 )
5 years ago
yukavio
29b5050726
Revert "add hapi api flops ( #28755 )" ( #29144 )
...
This reverts commit 63e90ee331.
5 years ago
Aurelius84
cc92b27d4e
Add prettytable in unittest/requirements.txt ( #29147 )
5 years ago
YUNSHEN XIE
27d04a3b1f
disable ut test_static_save_load ( #29119 )
5 years ago
JZ-LIANG
0dadacc4eb
[sharding] doc, api, bug fixed ( #28983 )
...
* add lars to fleet meta optimizer
* add lamb to proto
* add lamb to fleet meta optimizer
* fixed syntax bug
* fixed syntax bug
* fixed syntax error in lamb, add config setter of lamb in distributed_strategy
* trigger unittest to rerun
* add new unittest func for lamb
* revise unittest for lars and lamb
* revise dgc meta unittest
* revise lars document in distribute_strategy
* revise lars lamb document in distributed_strategy.py
* revise lars lamb document in distributed_strategy.py
* add weight decay exclude logic to lars
* restore optimizer.py
* restore optimizer.py as develop except lars
* add epsilon and exclude fn to distributed_strategy
* add lars epsilon
* revise unittest for fleet lars and lamb
* revise lars lamb unittest for CI coverage
* revise lars argument api
* revise lars argument api
* revise lars argument api
* revise api doc of lars
* fix op role
* add sharding save and add_sync_comm_for_test function
* add comm_analyse to utils
* revise sharding_utils
* add sharding saving unittest
* revise sharding utils for unittest
* revise sharding en doc
* update sharding utils api
* add doc for sharding
* fixed bug in sharding var size count
* update varsize count in sharding
* fix sharding num_nccl_comm
* Revert "fix sharding num_nccl_comm"
This reverts commit d51587c15e9323acf226ddd36154275f0d1daf76.
5 years ago
lilong12
2a864c70c4
fix the bug in gloo ( #29112 )
...
* update, test=develop
5 years ago
whs
7de2db4a81
Fix grid_sample in cudnn mode ( #29124 )
5 years ago
Yanghello
dd41775059
fix crypto ut test error for windows ci ( #29090 )
5 years ago
Chen Weihang
d0129fcd88
Add static_only decorator for static apis ( #29015 )
...
* add static_only for static api
* add static_only for class init
* remove static_only for default_main_program
* remove creater_parameter & startup_program
* remove failed apis
* revert py_func import
* remove global scope
* remove some api
* remove cuda pinned place
5 years ago
WangXi
e931c7baf9
Fix multi nccl comm & wait server ready ( #28663 )
5 years ago
ceci3
e7caf3b8d9
fix examples, test=document_fix ( #29019 )
...
* fix examples, test=document_fix
* fix, test=document_fix
5 years ago
Shibo Tao
db41258501
add API serialize_program, serialize_persistables, save_to_file, deserialize_program, deserialize_persistables, load_from_file. ( #29034 )
5 years ago
Aurelius84
14013a2eba
Remove prettytable in requirements.txt ( #29100 )
5 years ago
Kaipeng Deng
b052149dcf
remove BatchSampler type check ( #29114 )
...
* remove BatchSampler type check. test=develop
5 years ago
yukavio
63e90ee331
add hapi api flops ( #28755 )
...
* add hapi api flops
* fix bug
* fix some bug
* add unit test
* fix unit test
* solve ci coverage
* fix doc
* fix doc
* fix static flops
* delete the comment
* fix some grammar problem in doc
* fix some bug
* fix some doc
* fix some doc
5 years ago
hutuxian
db85f4cf8f
Add dygraph implementation for multiplex op ( #29049 )
5 years ago
joanna.wozna.intel
b0d1ac161e
Add bf16 pool2d and unify bf16 unit tests ( #29039 )
...
* Add bf16 pool2d and unify bf16 unit tests
* Add change default ops test
5 years ago
gongweibao
1358397e97
Clean up the redundant files and unify the launch interface. ( #28928 )
5 years ago
Guanghua Yu
47af5c3c9d
fix smooth_l1_loss en docs ( #29093 )
5 years ago
LielinJiang
6951052431
add default conv init ( #29092 )
5 years ago
Chen Weihang
fea0e294ee
Hide the C++ stack by default and add hints ( #29042 )
...
* do not show cpp stack by default & add hint
* fix failed unittest
* fix failed unittests
5 years ago
Yiqun Liu
eb9ae55849
Optimize the performance of piecewise_decay. ( #29077 )
5 years ago
qingqing01
8bbedc2371
Fix doc format for callbacks, metrics and Model ( #28638 )
...
* Fix doc format for callbacks, metrics and Model
* Fix code sample and doc
5 years ago
Chen Weihang
a1486091f1
Polish load_program_state design to load files one by one ( #29041 )
...
* change load dict file one by one to warn
* add unittests for coverage
* polish error message
* fix cond error
5 years ago
GaoWei8
a049dff78f
Modify the default setting of softmax cudnn ( #28672 )
5 years ago
WeiXin
fafadbab70
Rename variables when use 'jit.load' ( #28933 )
...
* Rename variables when use 'jit.load'
* Check whether the original graph contains the variable with the same name
* add comment
* rename output/input of op and edit unittest
* modify the code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
5 years ago
ceci3
a3faa520ec
Fix syncbn ( #29013 )
...
* fix syncbn
* add unittest
5 years ago
joejiong
582c0a0468
add uint8 for reshape op ( #28996 )
...
add uint8 for reshape operator
5 years ago
FlyingQianMM
f0e614feae
change print([.*].numpy()) to print([.*]) in example codes of sigmoid_focal_loss ( #29094 )
...
* rewrite the sigmoid_focal_loss code example. test=develop
* fix spelling mistake in comments of code example.test=develop
* change print([.*].numpy()) to print([.*]) in example codes of sigmoid_focal_loss. test=document_fix
5 years ago
liym27
682cc17f53
[Dynamic-to-Static] Fix bug: support pop from a dict and polish code of convert_pop ( #29023 )
...
* Support pop for dict in dy2stat
* Move convert_pop to convert_operators.py and polish convert_pop
5 years ago
Zhou Wei
8ca0a8a859
fix tensor detach to zero copy ( #27921 )
...
* fix tensor detach to zero copy
* fix tensor detach to zero copy
5 years ago
GaoWei8
13828db313
Add reminder of fluid.layers.lstm ( #28964 )
...
* add reminder of fluid.layers.lstm
5 years ago
Aurelius84
8af0d85ea4
fix unittest failed on windows GPU ( #29072 )
5 years ago
chajchaj
dfaf6b5eea
save one name in cross_entropy and softmax_cross_entropy, test=develop ( #29074 )
...
* save one name in cross_entropy and softmax_cross_entropy, test=develop
* change used function in CrossEntropy from softmax_cross_entropy to cross_entropy, test=develop
5 years ago
taixiurong
a5aa4dc7a9
add xpu elementwise ops ( #29031 )
5 years ago
tianshuo78520a
e9acd9c941
Update CI Python3 Docker for Cuda11 ( #28401 )
5 years ago
joejiong
b04c78ef5e
Update pow ( #29000 )
...
Simple code clean up
5 years ago
huangxu96
40f5453725
Quant nn2.0 ( #28764 )
...
* Implement 2.0 API version Conv2d and Linear layer quantization in imperative mode.
* use cudnn softmax in static Lenet
* Modified ChannelwiseQAT Unittest for 2.0 API.
* For CI python coverage.
5 years ago
Chen Weihang
bb16c2515d
Polish parallel api impl & doc details ( #28980 )
...
* polish parallel api impl & doc details
* add unittest for coverage
* remove spawn test in py2.7
* add parallel api into white list
5 years ago
joejiong
4b05a8be88
delete axis parameter in multiply api ( #28647 )
...
As the title
5 years ago
YUNSHEN XIE
c098a2e159
fixing cmake error for WITH_GPU=ON and WITH_DISTRIBUTE=OFF ( #29030 )
5 years ago
chajchaj
b52427327d
add soft_label and axis for CrossEntropyLoss and improve performance ( #29024 )
...
* add soft_label and axis for CrossEntropyLoss and improve performance,test=develop
* fix conflict in nn/functional/loss.py, test=develop
5 years ago
Chen Long
018e169923
fix some docs ( #29011 )
...
* fix some docs test=develop;test=document_fix
* add code example test=develop;test=document_fix
* fix code example test=develop;test=document_fix
* fix code example test=develop;test=document_fix
* fix code example test=develop;test=document_fix
5 years ago
ysh329
9a475582e0
fix clip by norm EN doc. test=develop ( #28994 )
...
* fix clip by norm eng doc. test=develop
5 years ago
Huihuang Zheng
5b339262bc
[Dy2stat] Add Tuple as Assign Target for Tensor Shape ( #28775 )
...
Add support for using tuple as tensor.shape (For example: a, b, c, d = x.shape)
5 years ago
YUNSHEN XIE
5cb8e17a18
restore timeout value ( #29027 )
5 years ago
Jiawei Wang
f21513307a
add lamb optimizer and unittest ( #28772 ) TODO:FIX BUGS LATER
...
* add lamb optimizer and unittest
* fix lamb
* fix lamb v2 op
* fix sampling id
* fix lamb sample code
* Update lamb.py
* fix doc
* fix doc
* Update lamb.py
5 years ago
Leo Chen
3815d7aa40
Upgrade string literals to raw string ( #28989 )
...
* upgrade comment string to raw string
* fix string in
* fix string with ' '
* revert update on comments
* upgrade only necessary
* fix sample code checker
* fix comments with '''
5 years ago
liym27
85292e0b46
[Dynamic-to-Static] Fix bug of convert_logical_and/convert_logical_or: the operands are executed sequentially ( #28993 )
...
1) The operands are executed sequentially according to the running logic of Python.
2) If the left-hand operand is True (for convert_logical_or) or False (for convert_logical_and), the right-hand operand should not be executed.
5 years ago
Huihuang Zheng
96126532cd
Fix Incorrect After Node Vars in IfElseTransformer, test=develop ( #28992 )
...
The PR description is long. See details in the PR link.
5 years ago
LielinJiang
982fd0f3c2
fix mnist fmnist ( #29018 )
5 years ago
wanghuancoder
887a35113e
fix eng doc for some api ( #28477 )
...
* fix eng doc, test=develop
* add import deprecated for layers, test=develop
* add block line for doc generate, test=develop
* remove todo for create_variable, test=develop
* add blank line for doc generate, test=develop
* add blank line for doc generate, test=develop
5 years ago
smallv0221
83cee3c9d7
Delete mq2007 dataset. ( #28995 )
...
* Fix en doc for rnn.py. test=document_fix
* Delete mq2007 dataset.
5 years ago
Chen Weihang
9cd09a8586
Polish dataloader doc detail & update example ( #28975 )
...
* polish dataloader doc detail, test=decument_fix
* fix comment error
* fix word error
5 years ago
123malin
fbf9564f6b
【paddle.distributed.fleet】Optimize ParameterServer's Async Mode ( #28442 )
...
* test=develop, optimize global_step
5 years ago
Leo Chen
98adc8f054
Dev/fix doc of some api ( #28785 )
...
* refine doc of bernoulli
* fix some problems
* fix unsqueeze
* fix squeeze
* fix doc
5 years ago
lilong12
f77a78cdee
enable pipeline to run with Executor.run() ( #28373 )
...
* update, test=develop
5 years ago
lijianshe02
9f642ed881
fix English doc for dice_loss, log_loss, unfold and NLLLoss API test=… ( #28739 )
...
* fix English doc for dice_loss, log_loss, unfold and NLLLoss API test=document_fix
5 years ago
joejiong
4e00c095fa
fix warning in english doc ( #28981 )
5 years ago
LiuChiachi
70385518a6
Add EarlyStopping ( #28691 )
...
* add early stopping
* add doc for early stopping
* fix sample code bugs
* update infer of mode, update doc, add unittests to increase coverage rate
* fix sample code for early stopping
* update sample code and unittests
* reduce time cost of test_callbacks unittest
* fix model.py code style error
5 years ago
LiuChiachi
8c8b42f28a
Update path name of saving in hapi ( #28462 )
...
* update hapi save_inference_model output pathname
* update hapi save_inference_model output pathname
* use new 2.0-api paddle.static.io.load_inference_model
* add unittests to increase coverage rate
5 years ago
Thunderbrook
0073f9bdb0
support ps-gpu ( #28752 )
...
* ps gpu transpile
* ps gpu
* remove op
* gps trainer
* local ps
* add macro
* HeterBox
* def cuda
* tab
* code style
* style
Co-authored-by: Thunderbrook <a754913769#163.com>
5 years ago
Chen Weihang
768dab441e
polish two api doc detail, test=document_fix ( #28971 )
5 years ago
Chen Weihang
7c7cdf082f
remove fluid & variable in program ( #28966 )
5 years ago
furnace
8ff3550658
refactor momentum op to combine weight ( #27414 )
...
* refactor momentum op to combine weight_decay (scale op and sum op)
5 years ago
YUNSHEN XIE
50113f9268
fix error for Can not find test to add properties to:test_parallel_dygraph_transformer,test_parallel_dygraph_sparse_embedding ( #28968 )
5 years ago
Zhang Ting
3416c35996
refine doc of default_startup_program, test=document_fix ( #28951 )
5 years ago
Zhang Ting
842fd2933c
remove fluid in default_main_program doc, test=document_fix ( #28941 )
5 years ago
Zhang Ting
b5218227d6
refine the doc of dist op, test=document_fix ( #28947 )
5 years ago
LielinJiang
8f8a02fda4
Optimize conv performance ( #28766 )
...
* optimize conv performance
5 years ago
LielinJiang
00e55ded4a
Add lr scheduler callback for high level api ( #28737 )
...
* add lr scheduler
5 years ago
yaoxuefeng
6369463a5d
fix test datagenerator ut fail ( #28767 )
5 years ago
Kaipeng Deng
ed7aa8f9c3
Fix conv transpose ( #28629 )
...
* fix conv_transpose unittest. test=develop
5 years ago
Chen Weihang
bff4179cc7
lazily init global group in collective ( #28780 )
5 years ago
gongweibao
1dad8ceaab
Fix gpu memory allocation bug. ( #28703 )
5 years ago
Kaipeng Deng
91bab752a9
fix dataloader default value and doc ( #28728 )
...
* fix dataloader. test=develop
5 years ago
Chen Weihang
0ed80e09fc
Fix param base trainable set failed ( #28756 )
...
* fix param base trainable set failed
* add unittest
* fix typo
* polish comment
5 years ago
Aurelius84
d6aee7597c
[Dy2Stat]Set buff.persistable=False when it's not initialized ( #28749 )
5 years ago
joejiong
1a532d5133
add uint8 support for squeeze operator ( #28734 )
...
Adding uint8 support for squeeze operator.
5 years ago
123malin
9066828b1b
test=develop, bug fix for embeddings padding ( #28708 )
...
* test=develop, bug fix for embeddings padding
* fix raise Value for Embedding
Change-Id: I6d343fceee369a5796ad59cca5c91fdd15429125
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
5 years ago
Bai Yifan
655d5eb1db
fix code example ( #28636 )
...
* fix code example, test=document_fix
5 years ago
joanna.wozna.intel
8c0ea4bffe
Add bf16 matmul, fc, elementwise add and mul ( #28729 )
...
* Add bf16 matmul, fc, elementwise add and mul
* Correct unit test
5 years ago
Zhou Wei
3b0dd5f620
fix bug that to_tensor not support paddle.Place ( #28717 )
5 years ago
Aurelius84
e1c8d6bce5
Fix wrong out dtype inferred from helper.input_dtype ( #28715 )
...
* Fix wrong out dtype from helper.input_dtype
* add unittest
* remove disable_static in op_test
* fix param name typo
5 years ago
taixiurong
d3d1a6b6e0
add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun ( #28542 )
...
* 1.add xpu slice op 2. add xpu top_k op 3.modify xpu cast to new api
* 1.add xpu slice op 2. add xpu top_k op 3.modify xpu cast to new api
5 years ago
Jack Zhou
9362d85e0e
Add LSTM, Simple RNN and GRU CPU kernel ( #28577 )
...
* add lstm, simple rnn op kernel
* fix the test_lstm for the rnn op
* change func name
* fix forward postprocess bug
* add gru forward, backward code
* remove unittest.skipIf; use a big rnn op instead of combination op
* fix input doesn't have gradient bug
* add eigen lstm forward, backward
Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
5 years ago
QingshuChen
30ef3815b3
adjust kunlun header file ( #28536 )
...
* adjust kunlun header file
*test=kunlun
* update kunlun unittest
*test=kunlun
* update xpu unittest
* test = kunlun
* update xpu unittest
* test=kunlun
* update xpu unittest
* test=kunlun
5 years ago
wangguanzhong
d12aa4957f
remove alias for iou_similarity and ssd_loss ( #28742 )
5 years ago
Double_V
960135285b
fix pool APIs en doc, delete disable_static ( #28679 )
...
* fix pool exclusive and delete disable_static, test=develop
* fix pool1d exclusive, test=develop
* fix pool APIs en doc, test=document_fix
5 years ago
Wilber
60a5eb68a0
update paramattr doc. test=document_fix ( #28745 )
5 years ago
liym27
269470d62e
[Dynamic-to-Static] Remove unnecessary variables of the arguments in true_func/false_func ( #28722 )
5 years ago
danleifeng
7d32e100c5
upgrade to use paddle-2.0 API for sample code of paddle.t ( #28726 )
5 years ago
liym27
e5f0e6b003
[Dynamic-to-Static] Fix bug in loop_transformer: loop vars should contain the var from ancestor-for-node ( #28735 )
5 years ago
Wojciech Uss
04bcc13fac
Add multi_gru op and tests ( #28591 )
...
* Add multi_gru op and tests
* removed redundant disable_dygraph()
5 years ago
Wilber
fe2cf39f77
[2.0] Update py_func English doc. ( #28646 )
5 years ago
YUNSHEN XIE
16a808149a
modified the timeout value for 5 ut ( #28714 )
...
* modified the timeout value for 5 ut
* modified timeout value for test_resnet
5 years ago
liym27
9881738e13
[Dynamic-to-Static ErrorMessage] Support dy2stat error message when call jit.save and polish error message ( #28713 )
...
* Support dy2stat error message when call jit.save;
* Polish dy2stat error message:
(1) the original dygraph code is marked with (* user code *) ;
(2) "In user code:" -> "In transformed code:"
5 years ago
joejiong
32b90b1c2d
add log10 ( #28576 )
...
Add new operator log10
5 years ago
Shibo Tao
f0806bdaf2
fix save_inference_model and load_inference_model alias. test=develop ( #28736 )
5 years ago
TeslaZhao
abbc507a81
Fix two english api documents, transpose and strided_slice ( #28687 )
...
* Fix two english api documents, transpose and strided_slice
* delete unused comments
5 years ago
Zhou Wei
5562d8094f
fix API optimizer get_lr to support static graph mode ( #28681 )
...
* fix doc of save/load
* fix API optimizer get_lr
* fix API optimizer get_lr
5 years ago
furnace
caffa85ffe
add alias for fluid.initializer.set_global_initializer, alias is nn.initializer.set_global_initializer ( #28690 )
5 years ago
Shibo Tao
f096af83a0
fix document sample. test=develop ( #28721 )
5 years ago
Chen Weihang
3c5f2cac68
fix save parse error for dict input ( #28712 )
5 years ago
Huihuang Zheng
9ab335bbd9
Fix convert_call May be Called Multiple Times, test=develop ( #28710 )
...
Fix the issue that convert_call may be called multiple times in Dy2stat. Also strip some strings to make sure blank spaces have no influence.
5 years ago
wawltor
20b1276598
improve the compare ops dygraph model speed
...
improve the compare ops dygraph model speed
5 years ago
JZ-LIANG
5a9f6889c1
[Sharding] add new features ( #28568 )
...
* add lars to fleet meta optimizer
* add lamb to proto
* add lamb to fleet meta optimizer
* fixed syntax bug
* fixed syntax bug
* fixed syntax error in lamb, add config setter of lamb in distributed_strategy
* trigger unittest to rerun
* add new unittest func for lamb
* revise unittest for lars and lamb
* revise dgc meta unittest
* revise lars document in distribute_strategy
* revise lars lamb document in distributed_strategy.py
* revise lars lamb document in distributed_strategy.py
* add weight decay exclude logic to lars
* restore optimizer.py
* restore optimizer.py as develop except lars
* add epsilon and exclude fn to distributed_strategy
* add lars epsilon
* revise unittest for fleet lars and lamb
* revise lars lamb unittest for CI coverage
* revise lars argument api
* revise lars argument api
* revise lars argument api
* revise api doc of lars
* fix op role
* add sharding save and add_sync_comm_for_test function
* add comm_analyse to utils
* revise sharding_utils
* add sharding saving unittest
* revise sharding utils for unittest
5 years ago
xiaoting
8c75b2554a
Support Tensor for attr_scale and attr_size ( #28677 )
...
* update interpolate, test=develop
* fix coverage, test=develop
5 years ago
YUNSHEN XIE
e880c90c5a
fix error when setting ut timeout value ( #28696 )
5 years ago
LielinJiang
01a14e1be2
Add with_pool args for vgg ( #28684 )
...
* add arg for vgg
5 years ago
LielinJiang
532e4bbf2a
fix docs ( #28683 )
5 years ago
Shibo Tao
db2e6cee62
add two paddle-2.0 apis: paddle.static.io.save_inference_model and paddle.static.io.load_inference_model ( #28606 )
...
* add two apis: paddle.static.io.save_inference_model and paddle.static.io.load_inference_model, which are compatible with paddle.fluid.io.save_inference_model and paddle.fluid.io.load_inference_model respectively.
* add unittest for new save_inference_model and load_inference_model. test=develop
* enhance doc. test=develop
* add paddle.enable_static() to test_inference_model_io.py. test=develop
5 years ago
Chen Weihang
358d6bc90f
Fix test_weight_decay_extend random failed on windows ( #28643 )
...
* add debugging code
* change seed & add debug message
5 years ago
Bai Yifan
5050e761b8
Support user-defined activation/weight quantize and preprocess. ( #28570 )
...
* support user-defined quant and preprocess
5 years ago
Leo Chen
11e32baf1e
Add matmul_v2 to amp list ( #28693 )
...
* add matmul_v2 to amp list
* support dygraph
5 years ago
Huihuang Zheng
b6f86b8491
Fix Using "isinstance" in Loop, test=develop ( #28641 )
...
Fix a bug found in a PaddleGAN model that used `isinstance` in a for loop
5 years ago
lilong12
e4f9415338
update doc, test=document_fix ( #28498 )
5 years ago
Jacek Czaja
6d8d3d4c22
[oneDNN] Layer norm bf16 kernel ( #28619 )
5 years ago
LielinJiang
cdc4e6620d
fix lenet num classes ( #28642 )
5 years ago
wangchaochaohu
912a5c30b4
fix the matmul_v2 test for cuda11 ( #28635 )
5 years ago
littletomatodonkey
82f0b5ea5c
adapt pad const ( #28585 )
...
* adapt pad const
* fix comment and rm fluid import
* rm stdout
* fix note
5 years ago
Double_V
d71c3463b0
fix pool exclusive and delete disable_static ( #28655 )
...
* fix pool exclusive and delete disable_static, test=develop
* fix pool1d exclusive, test=develop
5 years ago
Aurelius84
8040fa2bca
Fix output dtype inconsistent with input ( #28649 )
...
* fix output dtype inconsistent with input
* refine code
5 years ago
wangchaochaohu
68ee7f7312
fix overwrite for gather OP of API2.0 ( #28659 )
5 years ago
zhupengyang
2cd10fc465
fix 2.0 api docs ( #28445 )
5 years ago
Chen Weihang
a083c76ab4
adjust signal failed wait time ( #28640 )
5 years ago
Zhou Wei
361a53930f
fix doc of save/load ( #28645 )
5 years ago
cc
d1e84f3e9e
Add some ops for calculating output scale, test=develop ( #28644 )
5 years ago
wawltor
8f2656ef5c
fix the gradient bug for the topk v2
...
fix the gradient bug for the topk v2
5 years ago
wangchaochaohu
a972c33fd7
refine gather OP performance for dynamic mode ( #28587 )
5 years ago
Kaipeng Deng
ece1e4cd9d
Add weighted random sampler ( #28545 )
...
* add WeightedRandomSampler. test=develop
5 years ago
joanna.wozna.intel
2cb71c0cde
Add checkpoint to quantize ( #28612 )
...
* Add checkpoint to quantize
* Change bfloat16 option
5 years ago
lidanqing
804271cff9
Op version python mkldnn_inplace test ( #28354 )
...
* add mkldnn inplace op version test
* update mkldnn_inplace fuse pass
* update the inplace test
5 years ago
pangyoki
72e068f1ba
fix test_multinomial ( #28558 )
...
* fix test_multinomial
* fix test_multinomial add 0 prob
5 years ago
Kaipeng Deng
89d27de90f
DataLoader support not auto collate batch ( #28425 )
...
* DataLoader support not auto collate batch. test=develop
5 years ago
Huihuang Zheng
c5c273c13e
[Dy2stat] Fix Using Tuple for Transpose in Dy2stat ( #28574 )
...
PaddleSeg uses tuple as parameter of transpose in dygraph code:
https://github.com/PaddlePaddle/PaddleSeg/blob/release/v0.7.0/dygraph/paddleseg/models/danet.py#L152
However, in dy2stat, static code doesn't support the perm as a tuple. This PR fixed it.
5 years ago
GaoWei8
2b1e7e5b02
Polish where english doc ( #28595 )
5 years ago
Guo Sheng
110febdc54
Fix gradients with ignore_idx in softmax_with_cross_entropy ( #28622 )
...
* Fix gradients with ignore_idx in softmax_with_cross_entropy.
test=develop
* Fix gradients with ignore_idx in softmax_with_cross_entropy on cpu.
Remove softmax_with_cross_entropy from op_threshold_white_list.
test=develop
* Fix test_softmax_cross_entropy_op.py.
test=develop
5 years ago
Guo Sheng
a3bc3bcd48
Fix scaled_params append error in AdamW. ( #28633 )
...
Fix no_grad setting in AdamW.
test=develop
5 years ago
YUNSHEN XIE
c4d22c845b
modified timeout value for some ut ( #28616 )
5 years ago
qingqing01
f7dd889ca4
Support squeezed label as input in paddle.metric.Accuracy ( #28535 )
...
* Support squeezed label as input in paddle.metric.Accuracy
* Revert cifar and fix UT
5 years ago
Wilber
8b97bb2e1f
Update cmake for arm ft and fix a bug for Predictor dtor. ( #28586 )
5 years ago
LielinJiang
1c3eef4cee
Fix vgg error when num_classes is given ( #28557 )
...
* fix vgg num classes
5 years ago
LielinJiang
1de3cdd0ab
Fix summary api for rnn gru lstm ( #28566 )
...
* fix summary for rnn gru lstm
5 years ago
danleifeng
a24d186814
fix nccl init failed in parallel dygraph mode ( #28497 )
5 years ago
Huihuang Zheng
1adc09b56a
Make Batch Size Smaller on Mac Because of CI Machine ( #28569 )
...
I found the unittest failed due to batch size. Maybe the reason is that our CI machine has limited memory. I decreased the batch size.
5 years ago
lilong12
ed9dd7c9f0
add send and recv ops ( #28590 )
...
* update, test=develop
5 years ago
LiuChiachi
5579edfb3c
save dtype of inputs ( #28301 )
5 years ago
Yang Zhang
c47bfe98cf
Reduce input data size for syncbn unit test by 8x ( #28571 )
...
this is a preemptive measure, the unit test was failing occasionally with
cudnn internal error
5 years ago
channings
c545b9b673
Add ONNX Exporter ( #27831 )
...
* add onnx export module, test=develop
* add unit test for paddle.onnx.export
* adjust api & doc
* fix some typo
5 years ago
Zhou Wei
bf6e7cba7a
update 2.0 API english doc ( #28525 )
...
* make sure Numpy version is below 1.19.3
* fix 2.0 doc
5 years ago
Yang Zhang
614f20f6ba
Update `add` `clip` `MSELoss` and `no_grad` ( #28530 )
...
also fix `no_grad` alias
5 years ago
Shang Zhizhou
8699f38d08
TRT support for pruned transformer models; fix the bug that TensorRT does not support DeletePass ( #28517 )
...
* skip_layernorm_op done
* add unittest
* slice op convertor support trt < 6
* skip_layernorm only work in ernie
5 years ago
joejiong
08d2413142
add log2 operator ( #28319 )
...
As the title
5 years ago
lidanqing
0fc181dbd0
[Fix bug] If the pass name is not found, IsCompatible should return false ( #28475 )
5 years ago
Steffy-zxf
b258caf467
fix add_n doc (eng) ( #28464 )
...
fix the add_n english doc
5 years ago
Leo Chen
543ff333cd
Refine the format of printing tensor 3 (support scalar tensor) ( #28544 )
5 years ago
furnace
26d292b108
bugfix for api (mv, empty, empty_like op) ( #28513 )
5 years ago
YUNSHEN XIE
621b31c526
modified timeout value for test_resnet_v2 and test_resnet ( #28532 )
5 years ago
Kaipeng Deng
5305b2749a
deprecated APIs under paddle.dataset. test=develop ( #28423 )
5 years ago
liym27
0ce933a9ee
[API2.0] Fix documents of 6 APIs to fit API2.0: ( #28514 )
...
1. Remove 'fluid';
2. Variable -> Tensor
APIs:
sum, convert_call, convert_ifelse,
convert_logical_and, convert_logical_or, convert_logical_not
5 years ago
Huihuang Zheng
3b0d31ab89
Modify ProgramTranslator and TracedLayer Doc for API 2.0 ( #28509 )
...
Modify ProgramTranslator and TracedLayer Doc for API 2.0
5 years ago
Zhou Wei
c70c1c520d
make sure Numpy version is below 1.19.3 ( #28510 )
5 years ago
Huihuang Zheng
546b1c1dc4
Fix paddle.jit.dy2static.data_layer_not_check 2.0 API Doc ( #28461 )
...
Remove "fluid", "variable" in 2.0 API doc
5 years ago
yukavio
8f664a5c49
fix one_hot example code ( #28432 )
5 years ago
wangchaochaohu
241505c262
refine the doc for API2.0 ( #28386 )
5 years ago
YUNSHEN XIE
2159646796
modified timeout value for test_conv3d_mkldnn_op,test=document_fix ( #28496 )
5 years ago
Huihuang Zheng
eb0855615c
Set Exclusive for test_yolov3 to Avoid Memory Segmentation Fault in CI Random Failure ( #28485 )
...
As the title
5 years ago
YUNSHEN XIE
e1a5fc449d
fix ut exec timeout notest,test=kunlun ( #28495 )
...
* fix ut exec timeout notest,test=kunlun
* fix error for executing ut timeout,test=document_fix
5 years ago
YUNSHEN XIE
d3b2d07d6e
modified timeout value on windows ( #28499 )
...
* modified timeout value on windows
* fix some error
5 years ago
YUNSHEN XIE
1e698c600e
fix cmake error when setting ut timeout property ( #28492 )
5 years ago
YUNSHEN XIE
faa6549877
setting timeout property for test_flags_mkldnn_ops_on_off ( #28489 )
5 years ago
Tao Luo
7fd2077211
set NCCL_SHM_DISABLE=1 for test_parallel_executor_profilery.py ( #28484 )
5 years ago
smallv0221
4fa1d39205
Fix en doc for rnn.py. test=document_fix ( #28470 )
5 years ago
WangXi
18525d3635
fix Tanh remainder en doc ( #28455 )
5 years ago
wangchaochaohu
e14ed71cc2
refine the performance of gather Op ( #28458 )
5 years ago
YUNSHEN XIE
72c78e4dbb
exec ut no more than 15s 2 ( #28441 )
...
* exec ut no more than 15s 2
* fix for ut test_inplace_addto_strategy timeout
5 years ago
YUNSHEN XIE
ba0756325a
exec ut no more than 15s 1 ( #28439 )
...
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
* test for limiting ut exec time as 15S
* fix an error caused by cannot find ut
* fix some error
* can not find test_transformer
* fix error caused by ut not run in windows
* fix error caused by Compiler Options
* fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
* setting timeout value to 120s for old ut
* add the timeout value setting
* fix error caused by ut only run in coverage_ci
* add analyzer_transformer_profile_tester
* fix some error
* fix some error
* fix error with inference option
* fix error with inference option setting as ON_INFER
* add some ut to set timeout
* modified some option
* fix error
* fix some timeout error
* fix error
* fix error
* fix timeout for test_analyzer_bfloat16_resnet50
* fix error
* setting timeout property for some ut
* first pr for new ut timeout as 15S
5 years ago
Chen Weihang
155b4f9b6c
Remove selected rows all reduce over height check ( #28460 )
...
* remove selected rows all reduce over height check
* polish unittest
5 years ago
WeiXin
b5e662f837
refine jit.save/load to add support for other method, not only forward ( #28376 )
...
* refine jit.save/load to add support for other method, not only forward
* refine the code based on unit tests
* Add unit test for the code
* Add unit test for the code
* Modify the code according to the unit test
* Delete useless comments, save only one info file, etc.
* remove static_mode_white_list.pyc
* edit the code that generate 'extra_var_info'
5 years ago
Zhang Ting
7fe5f9ccad
Fix unittest random failure ( #28363 )
...
* fix random failure
* use two input(x)
* fix model
5 years ago
QingshuChen
6bba8e57b1
fix batch_norm_xpu bug & remove xpusimulator dependence ( #28430 )
...
*test=kunlun
5 years ago
joanna.wozna.intel
7821759d48
Add bfloat16 softmax and gelu ( #28394 )
...
* Add bfloat16 softmax and gelu
* Add pass attr bfloat16_enabled_op_types
* Changes from review
5 years ago
Aurelius84
bd8dfe38ca
[Dy2Stat] Refine code example for 2.0 ( #28440 )
5 years ago
littletomatodonkey
f6834034cc
fix sample code ( #28446 )
5 years ago
zhupengyang
0155f91679
enable softmax unittest ( #28362 )
5 years ago
Aurelius84
2500dca878
[Dy2Stat] Fix bug in convert_call ( #28368 )
...
* Fix bug in convert_call
* refine unittest
* refine code
* refine code
* fix unittest failed
* add assert
5 years ago
Jacek Czaja
ca41541472
[oneDNN]Sum bf16 kernel ( #28382 )
...
* - Added sum bf16 oneDNN
test=develop
* - Fix to UT of sum bf16
test=develop
5 years ago
LielinJiang
b89b4e321d
add fashion dataset ( #28411 )
5 years ago
Kaipeng Deng
463075a82b
add paddle.io.ComposeDataset & paddle.io.ChainDataset ( #28311 )
...
* add paddle.io.ComposeDataset & paddle.io.ChainDataset. test=develop
5 years ago
Qi Li
a4303496b2
[DOC] activation api doc, test=document_fix ( #28405 )
5 years ago
LielinJiang
3c95acc3eb
Remove cv2 dependence of normalize ( #28361 )
...
* rm cv2 dependence of normalize
5 years ago
Leo Chen
44a476c2ab
support cuda pinned place ( #28416 )
5 years ago
lidanqing
12b9587be5
Add conv_bias pass version python test ( #28278 )
...
* add conv_bias pass version test
* update according to reviews
5 years ago
Li Fuchen
f401907775
modified sample code of CTCLoss & ctc_loss by remove disable_static() & print([.*].numpy()) & alias, test=document_fix ( #28403 )
5 years ago
Kaipeng Deng
95b1868366
update DataLoader doc ( #28290 )
...
* update DataLoader doc. test=develop
5 years ago
Leo Chen
71d6220772
Skip reader op in mixed_precision decorator ( #28353 )
...
* skip reader op in mixed_precision decorator
* add ut
5 years ago
Leo Chen
8b2436a776
Add broadcast_shape api ( #28257 )
...
* add broadcast_shape api
* add ut
* follow comments
* add example code, test=dodument_fix
* update example code, test=document_fix
5 years ago
石晓伟
21a63f6f90
enhance the op_version_registry, test=develop ( #28347 )
...
* enhance the op_version_registry, test=develop
* add unittests, test=develop
* enhance the op_version_registry, test=develop
* fix bugs, test=develop
* revert pybind_boost_headers.h, test=develop
* fix a attribute bug, test=develop
5 years ago
Shang Zhizhou
ea851796e5
Optimize ernie model inference performance in TensorRT, support variable-length input ( #28367 )
...
* fp16 result ok
* change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
* auto detect special slice op converter for ernie with trt oss
* ernie oss only support fp16
* fix special_slice_plugin serialize bug
* matmul in tensorrt ok
* ernie unittest ok
* add matmul tensorrt unittest
* remove demo code
5 years ago
Jacek Czaja
84cc61b2cd
[oneDNN] sum op refactor ( #28318 )
5 years ago
Wilber
6f0f45f69c
copy_to_cpu support uint8 ( #28372 )
5 years ago
Wilber
09fd2b2aab
Paddle support compile on sw ( #27858 )
5 years ago
Double_V
17db031a05
fix pool bug, test=develop ( #28359 )
5 years ago
Leo Chen
6115c14fca
Pool2d cuda kernel supports fp16 ( #28316 )
...
* pool2d cuda kernel supports fp16
* fix compile issue of template
* add ut
5 years ago
Guo Sheng
9a600df373
Add rnn_op ( #28197 )
...
* Add rnn_op.
test=develop
* Fix rnn_op grad maker's drop_empty_grad.
test=develop
5 years ago
Huihuang Zheng
acc11c2a62
Retry CUDA Initialization to Fix Random Failure, test=develop ( #28323 )
...
This PR is a follow-up to #28213 . In that PR we tried to decrease GPU usage, but the CI still failed randomly. So I added retry logic for the initialization of nccl and cusolver. If the initialization fails, we can retry to avoid the random failure.
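The retry idea described in this commit message can be sketched generically as below. This is only an illustration, not the code from the PR: the helper name `retry_init`, its parameters, and the use of `RuntimeError` as the failure signal are all assumptions made for the example.

```python
import time


def retry_init(init_fn, max_attempts=3, delay_s=0.0):
    """Call init_fn until it succeeds or max_attempts is exhausted.

    Hypothetical helper: init_fn stands in for any initialization
    routine that may fail transiently by raising RuntimeError.
    """
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return init_fn()
        except RuntimeError as err:
            last_error = err
            # Wait before the next attempt, but not after the last one.
            if attempt < max_attempts:
                time.sleep(delay_s)
    raise RuntimeError(
        f"initialization failed after {max_attempts} attempts"
    ) from last_error
```

A caller would wrap the flaky initialization in a zero-argument callable and pass it to `retry_init`; if every attempt fails, the last error is re-raised as the cause of the final exception.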
5 years ago
wangguanzhong
5262b02585
add generate_proposals_v2 op ( #28214 )
...
* add generate_proposals_v2 op
5 years ago
Guo Sheng
b96869bc31
Fix lr setting of AdamW when lr is an instance of LRScheduler ( #28300 )
...
* Fix lr setting of AdamW when lr is an instance of LRScheduler.
test=develop
* Fix static graph test mode in test_adamw_op.py.
test=develop
5 years ago
Aurelius84
57e4411ab5
[Dy2stat] Support to modify value of buffer tensor ( #28328 )
...
* [Dy2stat] Support to modify value of buffer tensor
* remove "defaultTest"
* fix name conflict
5 years ago
石晓伟
d9b5f1261c
update the version of pybind, test=develop ( #28284 )
...
* update version pybind to v2.4.3, test=develop
* update unittests, test=develop
5 years ago
Leo Chen
18c86fb2fb
hide some logs of p2p ( #28307 )
5 years ago
Chen Weihang
4086f48ea1
Check and fix tensor and scalar type promotion ( #28299 )
...
* check and fix tensor and scalar type promotion
* fix else branch error
* fix scalar method error
* fix test_math_op_path unittest
* add future division for unittest
* rm useless bin file
5 years ago
LielinJiang
fb1e0c93c9
Make vision datasets return PIL.Image as default ( #28264 )
...
* return pil image as default according to backend
5 years ago
zhulei
26ede6e07e
Add median api. ( #28310 )
...
* Add median api.
* Add median api.
* Add median api.
* Add median api.
* Add median api.
5 years ago
wanghuancoder
3ccc0a2f5e
enable test_parallel_executor_fetch_isolated_var ( #28219 )
...
* enable test_parallel_executor_fetch_isolated_var, test=develop
* add enable_static, test=develop
* set test_parallel_executor_fetch_isolated_var RUN_TYPE=DIST, develop=test
5 years ago
joanna.wozna.intel
571a63e7ec
Add bf16 transpose2, reshape2, concat ops ( #28195 )
5 years ago
Guanghua Yu
e8f2614da5
Enhance multiclass_nms op to support LoD for dygraph mode ( #28276 )
...
* Enhance multiclass_nms to support LoD for dygraph mode
* fix some error in multiclass_nms
* update GetLodFromRoisNum to GetNmsLodFromRoisNum
5 years ago
LielinJiang
0351168967
Fix lr scheduler step error in hapi when use static mode ( #28297 )
...
* fix lr scheduler
* fix code style
5 years ago
Leo Chen
8953038400
Fix transpose in conv cudnn kernel when addto enabled ( #28295 )
5 years ago
chentianyu03
6cebd71454
add + - * / @ [] operator to ComplexVariable ( #28217 )
...
* add + - * / @ [] operator to ComplexVariable, also add unittest
* fix circular reference bug
* fit for py2.7
* remove reverse operators which are not supported now
5 years ago
Zhong Hui
a98c69b6c6
fix dygraph gather api
...
fix dygraph gather api
5 years ago
Leo Chen
2853f0c4f9
Set static shape for shape tensor with constant [part 1] ( #28275 )
...
* set static shape for shape tensor with constant
* remove debug code
* fix typo
* add ut
* refine code
* refine example
5 years ago
Chengmo
4dc8c44ba1
[Paddle.Fleet] Fix fleetrun heter ( #28252 )
...
* fix fleetrun heter ps on paddlecloud
5 years ago
pangyoki
8f83d5d875
fix AMP auto_cast and grad_scaler En doc ( #28177 )
...
* fix AMP auto_cast and grad_scaler En doc
* fix indentation problem
* change Conv2d to Conv2D
5 years ago
Zhou Wei
b63e0ccb4a
fix load check_point bug of LinearWarmup ( #28280 )
5 years ago
Jacek Czaja
c11d9b3035
[oneDNN] conv2d fwd&bwd optimization ( #27871 )
5 years ago
LiuChiachi
d932b5618f
set random seed to in test_export_deploy_model ( #28274 )
5 years ago
wangxinxin08
41d26a8287
update matrix nms op to api 2.0 ( #28265 )
...
* update matrix nms op to api 2.0
* modify code according to review
5 years ago
Leo Chen
7fcb32ddf3
fill_constant op supports NINF ( #28270 )
5 years ago
wangchaochaohu
495a9ceb95
fix the input error of size Op ( #28272 )
5 years ago
liym27
b1eb28d74b
[Dy2Stat-log] Call warnings.warn() to display the warning-message only once when calling StaticFunc.__call__ or ProgramTranslator().get_output ( #28260 )
5 years ago
Zhang Ting
fdc06f2158
add Fuse bn add act pass ( #28196 )
...
* add fuse_bn_add_act pass
6 years ago
Chen Weihang
813b2ade34
Enrich the python error types of paddle & polish format ( #28124 )
...
* add multiple exception type
* define all exception & polish compile pystack
* mapping paddle error to python exception
* polish static mode error format
* fix failed unittests
* fix dytostatic test_error
* fix check_nan_inf failed
* add unittest for coverage
* revert some code try to solve compile error
* refactor enforce & error change
* polish code & add unittest
6 years ago
Aurelius84
a5c18204e9
[Dy2stat]Join break cond with while cond in some pattern ( #28171 )
...
* Join break cond with while cond
* remove useless code
* refine the if code
* Split into BreakTransformOptimizer
* add BreakTransformOptimizer in ast_transformer
* add more comment
6 years ago
Aurelius84
7a3a05cccb
[Dy2Stat]Support to save model with nested output ( #28224 )
6 years ago
Kaipeng Deng
4671d85a03
fix DataLoader return same format between static & dynamic in single mode ( #28176 )
...
* fix DataLoader return same format between static & dynamic in single mode. test=develop
6 years ago
Adam Osewski
7db747d9e8
oneDNN BatchNorm + Act fusion pass. ( #27912 )
6 years ago
Zhou Wei
fb7f85291b
fix print tensor place,add cpu/cuda/pin_memory API for Tensor ( #28200 )
6 years ago
cnn
9940871888
[cherry pick ] cherry pick 28108 28198 28199 from release2.0rc ( #28215 )
...
* Release 2.0rc cherry pick api rename #28108 (#28184 )
* rename count_include_pad-->exclusive return_indices-->return_mask
* remove track_running_stats
* fix typo.
* rename xxxd-->xxxxD
* solve conflicts
* 2.0rc api add all any (#28199 )
* reduce trt warning message (#28011 )
add paddle.enable_static() on sample code
alias reduce_all-->all, reduce_any-->any
add import reduce_all and reduce_any in python/paddle/tensor/math.py
import all and any in python/paddle/tensor/__init__.py
remove all and any OP in python/paddle/tensor/logic.py, add all and any OP in python/paddle/tensor/math.py
fix import error
remove TestAllAPI temporary
* fix doc of reduce_all and reduce_any, test=document_fix
* fix typo
* fix unittest for all and any API
Co-authored-by: Pei Yang <peiyang@baidu.com>
* rename conv_transposeXd-->convXd_transpose (#28198 )
* fix sample code of reduce_all and reduce_any
Co-authored-by: Pei Yang <peiyang@baidu.com>
6 years ago
Leo Chen
f4f823c860
Refine the format of printing tensor 2 ( #28216 )
...
* refine format
* update doc
* handle uninitialized tensor
* add ut
6 years ago
tianshuo78520a
11089cacdb
Fix xpu notest ( #28204 )
...
* Fix xpu notest;test=kunlun
* fix
* test=kunlun
* test=kunlun
6 years ago
mapingshuo
81244fbfab
add sharding strategy in fleet ( #27900 )
...
* add sharding
6 years ago
Huihuang Zheng
a1e7fd4a13
Fix test_parallel_executor_test_while_train Random Failure by Decreasing GPU Usage ( #28213 )
...
Recently, test_parallel_executor_test_while_train randomly failed on CI. All CI logs showed that NCCL initialization or cusolver initialization failed. I found online that those failures are usually caused by GPU shortage. Those APIs call CUDA APIs directly, so the allocator should not be the problem. It may be that somewhere PaddlePaddle increases GPU usage.
However, I ran this test 1000 times on my machine and on the CI machine, and neither of them could reproduce the random failure. Maybe something related to the environment happens only in the test env.
To verify my assumption that somewhere PaddlePaddle increases GPU usage, and also to fix this CI, I decreased the batch_size to see whether the random failure disappears in the test env.
6 years ago
WangXi
11acbfae06
refine auto strategy, test=document_fix ( #28211 )
6 years ago
Feiyu Chan
efe6e2840c
fix strided_slice_op's GetExpectedKernelType ( #28192 )
...
* fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace
* add unittest for tensors in cuda pinned place
* skip test for cuda pinned place on cpu machines
6 years ago
Leo Chen
1f3be85914
Fix bug of fetch_async_op_handle when fetching the feed variable ( #28194 )
...
* fix bug of fetch_async_op_handle
* revert some changes of test_buffer_shared_memory_reuse_pass
* revert some changes of test_buffer_shared_memory_reuse_pass
6 years ago
Aurelius84
e730516090
[Dy2stat] Refine return mechanism in @to_static ( #28116 )
...
* remove some judgement
* fix len(outputs) == 1
6 years ago
LielinJiang
68449d19a5
Update hapi predict interface ( #28180 )
...
* update hapi predict interface
* fix code style
* fix docs
* fix docs
* fix docs
* update docs
* fix codes style
* fix unittest
* fix unittest
* fix coverage
6 years ago
Chen Weihang
5d73bfdb98
fix test_weight_decay_extend error ( #28178 )
6 years ago
lilong12
4873c20d74
modify ut cmakefile ( #28140 )
...
* modify ut cmakefile, test=develop
6 years ago
swtkiwi
e8db4412d0
delete matplotlib ( #28159 )
6 years ago
cnn
7c1aa0d69d
2.0rc api rename ( #28088 )
...
* rename manual_seed to seed
* rename xxx1d-->xxx1D, xxx2d-->xxx2D, xxx3d-->xxx3D
* rename manual_seed --> seed
* do not rename .cc, .cu and .h file
* rename manual_seed --> seed
* rename manual_seed --> seed
* rename manual_seed --> seed
* rename manual_seed --> seed
* disable_static on doc example code
* do not change manual_seed on generator
* add enable_static on sample code
* convert python/paddle/fluid/layers/nn.py to bak
* fix typo
* fix code style
* fix seed to manual_seed when call functions of Generator()
* fix bug
6 years ago
liu zhengxi
bc46069225
fix dynamic decode imperative ( #28160 )
6 years ago
pangyoki
2d45d9a04f
add static_mode_white_list ( #28112 )
...
* add static_mode_white_list
* add Mac CI static list
* add Win CI white_list
* add Coverage and Py3 CI white_list, add test_unittest
6 years ago
wangguanzhong
5cd97a1cb0
support multiclass nms for multi-batch, test=develop ( #28154 )
6 years ago
李灿
085b9619f5
fix bugs test=develop ( #28125 )
6 years ago
zhulei
446d184e11
Add new api: is_tensor ( #28111 )
...
* Add new api: is_tensor
* Add new api: is_tensor
* Add new api: is_tensor
* Add new api: is_tensor
6 years ago
lilong12
cd372447b9
disable test_dist_mnist_hallreduce, test=develop ( #28129 )
6 years ago
wangguanzhong
d1e1f17482
fix generate_proposal_labels in cascade-rcnn series model, test=develop ( #27892 )
...
* fix generate_proposal_labels in cascade-rcnn series model, test=develop
* fix example code & unittest, test=develop
* update code from review comments, test=develop
6 years ago
Leo Chen
a911c19eb0
fill_constant op supports NaN and Inf ( #28109 )
...
* fill_constant supports nan and inf
* add ut
6 years ago
LielinJiang
74c8a81127
Add pil backend for vision transforms ( #28035 )
...
* add pil backend
6 years ago
Aurelius84
135b62a4ec
[Dy2stat] Refine code of DygraphToStaticAst ( #28103 )
...
* refine code of DygraphToStaticAst
* add __init__ function
6 years ago
hong
5a589b2f86
reduce imperative ocr attention config; test=develop ( #28079 )
6 years ago
zhang wenhui
af70924061
fix test_group_norm_op_v2.py, test=develop ( #28104 )
6 years ago
Double_V
d43f75e4cc
add rois_num for roi_align xpu OP ( #28077 )
...
* add stack pool2d roi_align xpu op,test=kunlun
* error message opt, test=kunlun
* add xpu unittest,test=kunlun
* skip check grad,test=kunlun
* fix boostget , test=kunlun
* error message opt for XPU, test=kunlun
* add rois_num for roi_align xpu OP, test=develop
6 years ago
hong
651dab4af6
Catch exception in download ( #28090 )
...
* catch exception in request download; test=develop
* add re-try in download; test=develop
6 years ago
LielinJiang
8327accc58
Fix dataloader when stack input data with different type ( #27950 )
...
* fix dataloader
6 years ago
yinhaofeng
6f0c3d1f06
xpu adam op ( #28031 )
...
* lookup_table_xpu op report errors;test=kunlun
* add adam xpu op;test=kunlun
* reset lookup
* change adam wrong;test=kunlun
6 years ago
TeslaZhao
a5c95cd588
Add xpu transpose2 op.test=kunlun ( #28086 )
6 years ago
Kaipeng Deng
a5f65d516f
hapi/model step learning rate on batch end. ( #27991 )
...
* hapi/model step learning rate on batch end. test=develop
6 years ago
LutaoChu
c8d32c8c10
Fix diag OP bug on Windows Python3.8
...
Fix diag OP bug on Windows Python3.8 ,remove the std::min
6 years ago