cc
61820fd217
add the time threshold of quantization tests, test=develop ( #29786 )
5 years ago
xiaoting
55725cd2e1
fix for timeout, test=develop ( #29788 )
5 years ago
LielinJiang
a94c3cbbf3
register cudnn conv double grad for depthwise conv ( #29807 )
5 years ago
ShenLiang
01e2874a0e
Support multi-stream communication for dynamic graph distributed ( #29525 )
...
* fix fleet for multi-stream
* fix memcpy for ncclid
* use sync to solve move operation
5 years ago
huangxu96
a29006d128
Optimizer trans momentum ( #29597 )
...
* merge amp related function in Momentum from paddle.fluid.contrib.optimizer into paddle.optimizer.
* Add unittest for 2.0 Momentum API.
* fix some bugs in weight_decay.
5 years ago
liym27
0cc42e34c6
Migrate 4 APIs about array to paddle.tensor.* ( #29565 )
...
4 APIs: array_length, array_read, array_write, create_array
5 years ago
yukavio
96934b7430
fix flops ( #29758 )
...
* fix flops
* fix flops
5 years ago
liym27
41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created ( #29769 )
5 years ago
LielinJiang
e5af650b71
Add double grad for conv_transpose ( #29706 )
...
* add double grad for conv_transpose
5 years ago
huangxu96
97e29411eb
fix a bug in multi_precision_fp16 unittest. ( #29756 )
5 years ago
Wojciech Uss
6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations ( #28420 )
...
* upgrade oneDNN with GRU INT8 optimizations
* fix test
5 years ago
Huihuang Zheng
dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running ( #29579 )
...
Enable jit.save to Save Without Running.
5 years ago
liym27
a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] ( #29541 )
...
Support to transformfor ele in var stms in which var is a slice of Tensor.
5 years ago
chentianyu03
b59b6d7ae6
Complex op test ( #29753 )
...
* delete no need to calculate inputs in dygraph op_test
* delete no need to calculate inputs in dygraph op_test
5 years ago
liym27
096c048b45
Fix unitest test_slice ( #29740 )
...
Before this commit, test_slice use old api `dygraph_to_static_func` to use Dynamic-t-Static and use Executor explicitly,which is not recommended to users.
After fixed, use recommended API `paddle.jit.to_static` to replace `dygraph_to_static_func`, which won't trigger the random exception on coverage CI.
5 years ago
Huihuang Zheng
2e788bd81e
Reduce batch size ot fix CPU memory, test=develop ( #29736 )
...
Unit test reported memory not enough on CPU machines. Reduce batch size again.
5 years ago
LielinJiang
10edfb6f21
Update en docs of to_tensor ( #29718 )
...
* update to_tensor en docs
5 years ago
chentianyu03
71063b8137
add conj op for complex types ( #29527 )
...
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* user define grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
5 years ago
WangXi
9cbcc6cadc
fleet sync build strategy, test=develop ( #29732 )
5 years ago
Chen Weihang
6cfa59de1b
[Complex] Add real & imag op and api for complex tensor ( #29672 )
...
* add complex real op & api & unittest
* add imag op & api & unittest
* refactor op impl
* revert simplify writing due to complile failed
* polish details
* polish grad op code
5 years ago
LiuChiachi
572810eecb
Update EarlyStopping sample code ( #29723 )
...
* update EarlyStopping doc
* update EarlyStopping doc, test=document_fix
5 years ago
TTerror
af8ded773a
update activation op on kunlun ( #29577 )
...
* fix expand && concat/transpose to new api
* update xpu_header
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* add nearest_interp on kunlun
* update error message
5 years ago
ceci3
cc387159f3
add pad and concat double grad ( #29549 )
...
* add constant pad double grad
5 years ago
liuyuhui
f13c3a9cd7
[Kunlun] PR1:Support one Kunlun card training in parallel executor ( #29337 )
5 years ago
huangxu96
b96dada4f0
add static.amp into setup.pu.in ( #29621 )
...
* add static.amp into setup.pu.in
* add unittest for api
5 years ago
YUNSHEN XIE
d0b789d27f
disable ut test_cumsum_op ( #29613 )
5 years ago
Jack Zhou
84bae27779
fix wmt14 doc, remove backward, add bidirect direction in rnn api ( #29633 )
...
* fix wmt14 doc, remove backward, add bidirect direction in rnn api
* fix rnn unittest
* fix test_rnn_nets_static.py bug
5 years ago
YUNSHEN XIE
2926e74326
New UT should not exceed 15s ( #29492 )
...
* added UT should not exceed 15s
* fix error
* UT limit of 15s is the first to be executed
* fix error
* fix error with CI_SKIP_CPP_TEST
* modfied tiemout setting
* fix error
5 years ago
Chen Weihang
f02aece1f0
Add complex dtype op (add) test example ( #29603 )
...
* add op test case for complex
* polish code details
* add xpu set constant support
* fix argument rror
* remove useless pyc file
5 years ago
AshburnLee
efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS ( #28732 )
5 years ago
lijianshe02
7779768b53
add transpose double grad test=develop ( #29600 )
...
* add transpose double grad test=develop
5 years ago
huangxu96
c05170d3d8
add alias for fluid.contrib.mixed_precision ( #29562 )
...
* add alias for fluid.contrib.mixed_precision
5 years ago
ShenLiang
fb6697b424
Fix the dowanload bug in the case of multiple machines ( #29551 )
...
* fix the dowanload bug
* add sort for ips
5 years ago
ShenLiang
1efef8baed
Fix bug of matmul_v2 for broadcast case ( #29599 )
...
* fix bug of matmul_v2 for broadcast
5 years ago
qingqing01
8d549fc85d
Add clip double grad ( #29590 )
5 years ago
Tao Luo
81acc3278c
disable test_parallel_executor_profiler in cuda 10.1 ( #29581 )
...
* disable test_parallel_executor_profiler in cuda 10.1
* update set_tests_properties
5 years ago
wangchaochaohu
ac4bae8ee9
elementwise_add_grad Op optimization ( #29575 )
5 years ago
huangxu96
2cb6f94888
add float16 into adaptive_avg_pool2d check list. ( #29547 )
5 years ago
yukavio
ee1a7d020c
add some feature for paddle.flops ( #29572 )
5 years ago
WangXi
467c716963
gen nccl id use socket ( #29431 )
5 years ago
Bai Yifan
d72604cd46
fix unittst unstable issue on ci machine ( #29588 )
...
* fix unittst unstable issue on ci machine
* fix unittst unstable issue on ci machine
* fix unittst unstable issue on ci machine
5 years ago
QingshuChen
79a41a9ed6
support roi_align & affine_channel for kunlun ( #29561 )
...
* support roi_align & affine_channel for kunlun
* minor
5 years ago
liym27
0cad1152f4
[Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step value is negative in for-range stmts ( #29519 )
...
1. Fix error in _build_cond_stmt of for-range stmts.
2. Support that step value is negative in for-range stmts
3. Fix code because of the diff between Py2 and Py3
5 years ago
Huihuang Zheng
831e9135b9
Fix Windows Unittest ( #29543 )
...
Fix 3 Windows Unittests
test_fuse_all_reduce_pass: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_feed_data_check_shape_type: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_tsm: Winodws GPU size is not enough so decrease batch size and data size.
5 years ago
GeminiCarrie
08f24a3108
Fix precision problem ( #29567 )
...
* Fix a bug when running on an operating system without "bash."
* add execution condition
* for ci-coverage
* get cpu information to check the precision problem
* Update compilation environment for musl version
* update dependencies
* remove test code
check cpu info
remove test code
review
* update alpine and third_party denpendencies
* add newline for ci Code format
5 years ago
JZ-LIANG
d33d468f02
[Sharding] add hybrid-dp feature ( #29518 )
...
* Sharding add hybrid-dp feature
* update sharding in distributed_strategy
* update sharding unitest
* revise code format for sharding
5 years ago
Chen Weihang
c1a26e2a05
fix train eval set error in static mode ( #29540 )
5 years ago
taixiurong
760d015c14
add xpu ops for training transformer in kunlun ( #29539 )
...
* 1.fix matmul bug 2. add one hot
* add xpu error msg
5 years ago
Leo Chen
0fdd365665
Add fast path for dropout when p == 0 ( #29553 )
...
* add fast path for p == 0 in dropout
* add ut
5 years ago
Wojciech Uss
917a11495f
fix ininite scale values ( #29386 )
5 years ago
lijianshe02
bd29052e33
fix random seed in nll_loss unitest test=develop ( #29538 )
...
* fix random seed in nll_loss unitest test=develop
5 years ago
joanna.wozna.intel
0ce6d7fa77
Fix bf16 activations test for softmax and gelu ( #29502 )
...
* Fix bf16 activations test for softmax and gelu
* Resolve conflict
5 years ago
huangxu96
4001979309
Add ReserveSpace in dygraph batch_norm. ( #29221 )
...
* Add ReserveSpace in dygraph batch_norm.
* Add unittest for reservespace
5 years ago
arlesniak
b781953ef5
[oneDNN] Fix flags use test for #29080 , assert condition more general ( #29493 )
...
* Flags assert condition more general, print output if pattern not found
* removed test_flags_use_mkldnn form skip list regarding #29080 descr
5 years ago
Zhen Wang
5ac71b36fb
Remove tensor copy in the update_loss_scaling op. ( #29426 )
...
* remove tensor copy in the update_loss_scaling op
* not use thrust.
* fix some cuda memory access error.
5 years ago
Zhou Wei
e74e1a226c
support deepcopy for Layer/Tensor/Paramerbase ( #29387 )
...
* support deepcopy for Layer/Tensor/Paramerbase
* fix some code
5 years ago
joejiong
50d3117d30
Add random_split and Subset dataset ( #29291 )
...
As the title
5 years ago
joejiong
87e75a77c2
Add tangent operator ( #29207 )
...
As the title
5 years ago
Wei Shengyu
dc8bb76c68
remove addcmul ( #28937 )
...
* remove addcmul
* remove unittest and other related code of addcmul
* fix bug
* fix merge conflict
5 years ago
Zhong Hui
f459dd9634
fix abs double grad unittest ( #29478 )
...
fix abs double grad unittest & define the data range for the abs double grad
5 years ago
huangxu96
576d0d938b
add fp16 check into max and avg pool ( #29479 )
5 years ago
ShenLiang
2ef9e0e23c
Rebuild group automatically in dynamic graph distributed ( #29255 )
...
* add tensor_indices in AssignGroupBySize
* add rebuild group in reducer
5 years ago
procr
3a0558339d
support mobilenet for kunlun ( #29458 )
5 years ago
Aurelius84
5d530c9319
fix amp support fleet ( #29491 )
5 years ago
ShenLiang
311b3b44fc
Fix the bug where embedding can‘t be processed correctly in reducer ( #29485 )
...
* fix the bug of reducer in embedding
* add comment
5 years ago
Pei Yang
2480bdef6c
change hard_swish from plugin to layer ( #29177 )
...
* change hard_swish from plugin to layer
* add ut when threshold != scale
5 years ago
lilong12
b122d0bb76
Fix bug in gloo that gloo initialization hangs ( #29447 )
...
* update, test=develop
5 years ago
taixiurong
ecca6585cd
1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op ( #29448 )
...
Co-authored-by: root <root@bjhw-sys-rpm0223.bjhw.baidu.com>
5 years ago
LoveAn
03b42d9fa7
fix unittest on windows, test=develop ( #29365 )
5 years ago
ShenLiang
22e6b9e373
Fix the ut of matmulv2 for broadcast case ( #29461 )
...
* fix the ut of matmulv2 for broadcast
5 years ago
TTerror
a5fcc4b545
update reduce_sum op on xpu ( #29367 )
...
* update reduce_sum op on xpu
* update reduce_sum op on xpu
* support running on xpu
5 years ago
chentianyu03
acce962133
remove complex module direction ( #29419 )
5 years ago
Zhang Ting
6296f4ed09
revert cast eigen kernel ( #29427 )
5 years ago
Leo Chen
a040c055a5
fix layer_norm accuracy ( #29434 )
5 years ago
Shang Zhizhou
225a9c4ed8
Fix unittest ( #29412 )
...
* fix tensorrt unittest precision error
* fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
5 years ago
Pei Yang
f860de4af7
support clip op trt converter ( #29411 )
5 years ago
Bai Yifan
87bb726258
Add deform_conv2d,DeformConv2D ( #29364 )
...
* add deform_conv2d,DeformConv2D
5 years ago
chentianyu03
64e4e17f0c
remove complexvariable ( #29390 )
...
* rm complexvariable
* modify test_var_base unittest
* remove duplicated codes
5 years ago
chajchaj
79e6086743
change shape of output in cross_entropy, test=develop ( #29220 )
5 years ago
liuyuhui
2ee7a6b08c
[paddle v2.0.0rc1: API fixs] assign/conv2d/conv2d_transpose/cast/ParamAttr ( #29171 )
...
* fix DLTP-15151, paddle.ParamAttr API
* fix DLTP-15083/DLTP-15274, paddle.nn.functionl.assign paddle.cast API
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
* fix DLTP-15083, paddle.nn.functionl.assign API
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
* support in_dygraph_mode for cast op, test=develop
* fix bug,test=develop
* fix doc
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
5 years ago
Guo Sheng
8fc7f1b66a
Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr ( #29235 )
...
* Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr.
test=develop
* Fix api doc for print in label_smooth.
test=develop
* Update api docs according to review comments.
Add name argument in RNN back.
test=develop
5 years ago
yongqiangma
7c508d8668
update unbind norm add CUDAPlace api doc information ( #29322 )
...
* enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop
* fix format. test=develop
* format fix. test=develop
* add lod_rank_table. test=develop
* fix format. test=develop
* fix doc info. test=develop
* fix np error
* add unbind dygraph api. test=develop
* fix unbind doc.test=develop
5 years ago
chentianyu03
879e913b6d
Make transpose, trace, kron, reshape, sum op support complex type ( #29321 )
...
* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* kron, reshape, transpose support complex types
* sum and trace op support complex types
* add test case of sum and trace op
* fix the bug of imag part of complex not initialized
* format file
* format code style
* kron support type promotion; modify test cases
5 years ago
Chen Long
66fd1c00a0
fix some docs test=develop;test=document_fix ( #29374 )
5 years ago
liym27
5f84d0b375
Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray ( #29306 )
5 years ago
Feiyu Chan
f7cdcefa65
fix multiple documentation errors, test=document_fix ( #29210 )
...
* fix multiple documentation error, test=document_fix
* fix more rst syntax errors, test=document_fix
* fix format issues in docstring, test=document_fix
5 years ago
卖鱼的哲学
074065e5de
fix expand/uniform_random && concat/transpose to new api on xpu ( #29280 )
...
* fix expand && concat/transpose to new api
* update uniform_random_op
* update xpu_header
5 years ago
ShenLiang
4064354a01
support dp run single card ( #29358 )
5 years ago
gongweibao
8989053443
Fix bug of test_fleet_launch_async.sh ( #29332 )
5 years ago
Huihuang Zheng
8f7627907c
[Dy2stat] Reduce Exception Type for Better Error Message ( #29268 )
...
Reduce exception type so that if covert_to_static failed, it reports right error message.
5 years ago
liym27
61a8f2874f
[Dy2Stat] Fix bug: Do not use gast.Subscript to replace gast.Name in when transforming for_enumerate_loop ( #29310 )
5 years ago
liym27
b10ecd9d3a
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows ( #29267 )
5 years ago
Chen Weihang
9ad800ebb2
Support type promote for basic math ops (quantum required) ( #29265 )
...
* basic impl of type promote
* add comment & another testcase
* fix complex bugs & support python op promote type
* fix failed unittests & polish code
* add unittest for coverage
* change to only promote complex type
* polish code details
* polish several comments
5 years ago
LielinJiang
f31e5adab5
fix typo in ProgBarLogger ( #29329 )
5 years ago
tangwei12
8358791607
fix gpu outofrange ( #29238 )
...
* fix gpu emb out of range
Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf
* fix doc
Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf
5 years ago
YUNSHEN XIE
28164b266f
disable test_rnn_decode_api and test_complex_matmul on windows ( #29252 )
5 years ago
Leo Chen
b58cfff89d
use has_grad instead of train_mode ( #29309 )
...
* use has_grad instead of train_mode
* add vlog for debug
* fix ut
* fix ut
5 years ago
Aurelius84
67c700b479
[Dy2Stat] Add cache for Executor and Context in run_program_op ( #28421 )
5 years ago
ShenLiang
d6753e1e6d
fix matmulv2 for windows ( #29327 )
5 years ago
gongweibao
96de8b008f
cleanup enum test=develop ( #29294 )
5 years ago
liym27
b9a8ebd50f
[Dy2stat] Add a decorator paddle.jit.not_to_static to support that not to convert a function in Dynamic-to-Static. ( #29253 )
...
Usage scenarios:A function could have run successfully in static mode, you can use it to decorate a function in the following cases:
1. An unknown error occurs in the dynamic-to-static conversion process of the function;
2. In the internal implementation of the function, it has two branches: dynamic branch and static branch;
3. Users don't want to convert the function in the process of dynamic to static.
5 years ago
ShenLiang
2d6aa1a5bb
fix warning of fleet ( #29317 )
5 years ago
ShenLiang
2cd0bf5764
Fix doc of fleet api ( #29282 )
...
* fix doc, test=document_fix
5 years ago
ShenLiang
c00af94435
fix matmulv2 for windows ( #29302 )
5 years ago
Steffy-zxf
41f17aeb8b
fix DATA_HOME path in win ( #29222 )
...
* fix DATA_HOME path in win
5 years ago
Jack Zhou
cf43322139
fix nll_loss doc;test=document_fix; ( #29247 )
...
* fix nll_loss doc;test=document_fix;
* remove numpy and set_device;test=document_fix;
* remove numpy;test=document_fix;
5 years ago
LielinJiang
b9f1f4343b
Move temporal_shift to paddle.nn.functional ( #29261 )
...
* move temporal_shift to functional
5 years ago
Chen Weihang
a2e9d95a4a
change test_imperative_signal_handler_to_exclusive ( #29283 )
5 years ago
Zhen Wang
be3777a50a
Add pure fp16 training with master weights. ( #27712 )
...
* add the weight decay func for the momentum op
* Add the multi_precision function in Momentum Optimizer.
* Make sure that the initial value of master weights are same with the fp16 weights.
* add static loss scaling.
* add the rescale_grad function in the pure fp16 training.
* use the original momentum updating method.
* Polish some codes, such as variable names.
* add docstring for apis.
* update the var creation details of _create_master_weight.
* not modify codes about imperative momentum updating.
* Fix the error of test_dist_sparse_tensor_load_momentum UT.
* add unit test for multi precision fp16 training.
* add more unit tests for CI.
* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
* For CI Coverage Checking.
5 years ago
chentianyu03
976961de6d
fix random failed of complex matmul ( #29285 )
5 years ago
furnace
7584bb5096
Layer norm fp16 ( #29169 )
...
* add fp16 for layer_norm op
* revert layernorm api
* fix forward
* fix forward
* fix backward for layernorm with fp16
* fix unit test for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
* fix with_mkldnn compile error for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
5 years ago
mls1999725
a37963b890
Update APIs in text/datasets and dataloader ( #29219 )
...
* Update IterableDataset API
* Update TensorDataset API
* Update APIs in paddle/text/datasets
* Update dataset.py
5 years ago
mls1999725
493568b070
Update Codes of Cifar and VOC2012 ( #29204 )
...
* Update Cifar Codes
* Update VOC2012 Codes
* Update voc2012.py
* Update voc2012.py
* Update cifar.py
* Update cifar.py
* Update voc2012.py
5 years ago
mls1999725
0aedd463ee
Update get_worker_info API ( #29190 )
...
* Update get_worker_info API
* Update dataloader_iter.py
* Update dataloader_iter.py
* Update dataloader_iter.py
5 years ago
mls1999725
6a9a62c3ef
Update conv3d API ( #29205 )
...
* Update conv3d API
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
5 years ago
Huihuang Zheng
aec05d811c
[Dy2stat] Fix PaddleGan Deoldify Model Dy2stat Problems ( #29226 )
...
This PR fixes several problems in dy2stat for Deoldify model in PaddleGan.
In model, software engineer wrote if x.shape == y.shape, the Tenser shape is a tuple in dygraph so the == returns True/False, but in static graph the == becomes element-wise comparison, which is a different behavior. In this PR we reduce the element-wise comparison result.
If software engineer write computations which uses parameters in hooks, the static graph can loss the parameter variable because we put param_guard at forward of a Layer. In this PR we made param_guard cover pre-hook and post-hook.
In PaddleGan, software engineer calculated some parameter values in __init__ by running some dygraph code. Those code also run during dy2stat. So some variables may be assign as a VarBase (Tensor) first and then Variable, which raised an error. We fixed the bug in this PR by handling the case.
TODO: We just added testcase for the 1. shape comparison. Should add test case for 2. and 3. But since we are chasing 2.0RC, I will do it in the near future PR
5 years ago
Leo Chen
116305ea4b
Improve performance of elementwise_add grad op ( #29187 )
...
* pass stop_gradient for cast op
* improve performance of elementwise_add grad
* use tensor copy async
* dygraph branch
* fix dygraph branch
* add ut
5 years ago
卖鱼的哲学
07c67d5a8b
add deformable_conv op on xpu ( #29234 )
...
* rebase develop
* update deformable_conv op on xpu
* update deformable_conv op on xpu
5 years ago
Chen Weihang
1de32f823d
Hot fix complle failed in gcc4.8 caused by complex impl ( #29254 )
...
* hot fix complle failed in gcc4.8
* fix failed unittest
5 years ago
yukavio
a71ea00922
add unit test ( #29228 )
5 years ago
ShenLiang
46b73e6cd9
Change the api of DataParallel and Fleet ( #29224 )
5 years ago
Leo Chen
73e51a17e7
add stop_gradient property and remove reduce redundant information ( #29185 )
...
* add stop_gradient property and remove reduce redundant information
* refine code
5 years ago
QingshuChen
64f29fbb70
update kunlun conv2d/softmax/elementwise implemetation ( #29229 )
...
* update conv2d & softmax to new xpu api
* test=kunlun
* remove useless comments
* test=kunlun
* remote softmax xpu op
* test=kunlun
* update kunlun softmax
* test=kunlun
* update xpu unitest
* test=kunlun
* fix elementwise_grad bug for kunlun
*test=kunlun
5 years ago
Jiawei Wang
b11ab12787
Fix doc (adadelta, sgd, momentum) ( #29212 )
...
* fix 3 doc
* fix 3 doc
* Update adadelta.py
5 years ago
lijianshe02
76312deb30
fix nll_loss test random fail bug test=develop ( #29236 )
5 years ago
LielinJiang
8a2dd34a1e
fix depthwise conv ( #29227 )
5 years ago
huangxu96
dbdeecd665
Modify doc mistakes of grad API. ( #29176 )
5 years ago
Jiawei Wang
a5d13d593c
Momentum Velocity init in Momentum.__init__() ( #29223 )
...
* add lamb optimizer and unittest
* fix momentum resume training
* fix momentum acc
5 years ago
Leo Chen
4556ad76b4
Upgrade string literals to raw string [part 2]( #29217 )
5 years ago
wanghuancoder
2b2cd1864a
revert python file coverage, delete coverage run --include, test=develop ( #29230 )
5 years ago
chentianyu03
8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… ( #29199 )
...
* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
5 years ago
123malin
cc9c619679
test=develop, fix doc ( #29200 )
...
* fix fleet api doc
5 years ago
Zhou Wei
c0a991c874
accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept ( #28429 )
...
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor
* fix coverage
* fix api doc
* fix CI unittest
* fix CI unittest
* fix unitest
* empty tensor does’t need inner_var_
* fix some error message
5 years ago
huangjun12
b6a26749dc
fix doc of alpha_dropout/dropout/dropout2d/dropout3d/npair_loss ( #29136 )
...
* fix en doc, test=document_fix
* add blank after code declare, test=document_fix
* refine doc of dropout, test=document_fix
* refine npair_loss and dropout, test=document_fix
5 years ago
LielinJiang
d8eef4e4a4
Remove dependence of scipy ( #29121 )
...
* lazy import for scipy
* rm unused check
5 years ago
yaoxuefeng
a069e1ca91
fix docs ( #29097 )
5 years ago
Chen Weihang
786e69e9c7
diable test_yolov3 in musl ( #29216 )
5 years ago
hong19860320
f23665e5d5
Refine the doc and unit test for Sigmoid and stanh ( #29198 )
5 years ago
123malin
b5c6342336
Update ps gpu ( #29209 )
...
* fix paramete prefetch & device guard
Co-authored-by: MrChengmo <cmchengmo@163.com>
Co-authored-by: chengmo <chengmo@baidu.com>
5 years ago
liym27
865a45984f
Check whether there is any inplace operation affecting gradient calculation. ( #27901 )
...
* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.
* Add a new attribute `_inplace_version` for VarBase.
* Raise exception if an inplace operation can result in incorrect gradient computation.
* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.
* For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode.
* Use original var_wrapper if the inplace_version is not changed.
* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.
5 years ago
lilong12
08fb079dbc
Fix the doc for shard_index api ( #29183 )
...
* update, test=develop
5 years ago
qingqing01
058f1b2284
Enhance paddle.metric.Accuracy ( #29125 )
5 years ago
joejiong
dc070ecfb0
Remove cast from paddle.pow api ( #29134 )
...
As the title
5 years ago
WangXi
0c2a51d240
optimizer amp, all use fp16 communication, overlap last comm and compute ( #28957 )
5 years ago
Chen Weihang
0b032faeee
Polish unittests details and execution conditions to adapt to MUSL ( #29044 )
...
* fix failed tests in yingchun gived list
* add unittests into static_mode_white_list
* add enable static
* fix dist unittest
* skip test_sigmoid_focal_loss_op & add gym
* revert no need skip unittests
* remove gym
5 years ago
123malin
92817f8005
test=develop, rm pathlib ( #28658 )
...
* test=develop, rm pathlib
5 years ago
Wojciech Uss
4fd4095d1b
Add quantization of multi_gru op and tests ( #28615 )
5 years ago
Thunderbrook
4adddcc89a
add set_trainer_num api in dataset ( #29133 )
5 years ago
liym27
e03440812a
fix code: if y is True -> if y ( #29184 )
5 years ago
danleifeng
7e7b4b9e5d
remove sampled_softmax_with_cross_entropy alias;test=develop ( #29180 )
5 years ago
WeiXin
1476e1f998
save model after jit.load ( #28748 )
...
* Changed a variable name error
* Add comments
* Move member functions of TranslatedLayer out of function
* edit code according to review
* Edit input argument of '_run_static_graph'
* reset due to Segmentation fault
* rename variables when stitching graph
* modify code according CI
* Add comments to '__i_m_p_l__'
* remove blanks befor 'Get...'
* edit code according to review
* Add a comment to '_execution_method_creator'
* Edit a comment to '_execution_method_creator'
5 years ago
wanghuancoder
0239f79695
Generate code coverage reports only for incremental files ( #28508 )
...
* Generate code coverage reports only for incremental files, test=develop
* Generate code coverage reports only for incremental files, test=develop
* Generate code coverage reports only for incremental files, test=develop
* test for diff python file, test=develop
* fix no python diff report, test=develop
* add cc test file, test=develop
* fix bug in generic.cmake, test=develop
* for debug no cc report, test=develp
* modify compire branch form test_pr to test, test=develop
* fix bug, test=develop
* test for h file changed, test=develop
* debug for redefinition of argument optimize error, test=develop
* close -o3 for test, test=develop
* remove -o3 for test, test=develop
* remove coverage option for nvcc, test=develop
* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop
* reopen -o3, test=develop
* remove debug code, test=develop
* remove unused code, test=develop
5 years ago
zhang wenhui
8388abe66b
Fix api 1128 ( #29174 )
...
* fix 2.0 api, test=develop
* fix api, test=develop
5 years ago
LielinJiang
f92fdfb8ef
Add ReduceLROnPlateau ( #29113 )
...
* add ReduceLROnPlateau
5 years ago
Huihuang Zheng
27b4218333
[Dy2stat] Disable PaddleInference IR Optimization in test_mnist for CUDA11 ( #29105 )
...
test_mnist failed on CUDA11. We found that it is due to PaddleInference IR Optimization after debugging. We disable it in this PR and we will re-enable it after PaddleInference fixes it.
5 years ago
liym27
01bdea7c31
[Dy2Stat] Don't conver the function from third library logging ( #29161 )
5 years ago
liym27
a7433cc379
[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent Paddle/Python if statement, which depends on if conditions of the return stmt. ( #29165 )
5 years ago
Huihuang Zheng
4a0a870177
[dy2stat] Set shape for linspace to Fix dy2stat for GridGenerator Model ( #29173 )
...
GridGenerator model failed because the output shape of `linspace` is (-1). The reason is that C++ InferShape fixes the shape to (-1):
5da3d514eb/paddle/fluid/operators/linspace_op.cc (L49)
We cannot set the shape in C++ infer shape because this Tensor may not be initialized during compile time, but when input `num` of `linspace` is an integer, we know the shape at compiler time. This PR simply set the shape in Python and add GridGenerator as unittest.
5 years ago
Aurelius84
cb680c8013
[Dy2Stat]Refine code of test_lac unittest ( #29087 )
5 years ago
ShenLiang
e2d01eb650
Support dynamic graph distributed ( #28997 )
...
* add reducer
* refine envent for memorycopy
* add concat&split for allreduce
* apply concat & split for fuse tensor
* fix nccl dep
* fix the untest, compile problem and ddp initialize problem
* fix untest for mac & add some comments & solve the repeated param in sublayers
* fix untest for windows & fix document
5 years ago
lilong12
7e5e9934fe
update expand as op to use the shape of the target tensor instead of the target tensor itself. ( #29020 )
...
* update, test=develop
5 years ago
Kaipeng Deng
f4c894a693
alias yolo_loss & yolo_box to paddle.vision. ( #28520 )
...
* alias yolo_loss & decode_yolo_box to paddle.vision. test=develop
5 years ago
Shibo Tao
4ceedec33d
enhance doc. add kwargs for backward compatibility. test=develop ( #29143 )
5 years ago
LutaoChu
28280647eb
add paddle.subtract, optimize paddle.maximum and paddle.minimum
...
add paddle.subtract, optimize paddle.maximum and paddle.minimum
5 years ago
徐铭远
3c2a46bd7b
fix doc of erf,rank,mm,cross_entropy,pixel_shuffle,kron... ( #29126 )
...
* fix doc example, test=develop, test=document_fix
5 years ago
Chen Long
d576d6ddeb
fix some docs test=develop;test=document_fix ( #29159 )
5 years ago
yukavio
5da3d514eb
solve pretty table dependent in flops api ( #29132 )
...
* solve pretty table dependent in flops api
* add unittest dependent
* temp
5 years ago
pangyoki
6df685ab64
fix nce, multinomial, Categorical, Normal, Uniform en doc ( #28541 )
...
* fix Categorical en doc
* fix doc for apis
* remove numpy in sample code
5 years ago
LielinJiang
9f53f3d09e
Enhance logger callback for benchmark ( #29106 )
...
* enhance logger callback for benchmark
5 years ago
Jack Zhou
085260f3de
Add eigen gru and fix the dropout bug in the rnn
...
Add eigen gru and fix the dropout bug in the rnn
5 years ago
yaoxuefeng
545df287fc
add user_define_dump ( #28596 )
5 years ago
lilong12
216e085605
update, test=develop ( #29139 )
5 years ago
liym27
5fe44571f0
[Dynamic-to-Static] Support **kwargs as input of the function which is decorated by `jit.save.to_static` ( #29098 )
5 years ago
YUNSHEN XIE
0fca8cdfdf
fix error with ut timeout and failed ( #29148 )
5 years ago
GaoWei8
a7ef724dd5
polish softamx doc ( #29153 )
5 years ago
lilong12
a1add716bc
Add a flag to control whether to initialize gloo ( #29150 )
5 years ago
Chen Weihang
0d1900d329
add debug msg for test_buffer_shared_memory_reuse_pass ( #29151 )
5 years ago
guofei
638402274a
Integrate ImperativeOutScale into ImperativeQuantAware. ( #27956 )
...
* Optimiz the unittest test_imperative_out_scale
test=develop
5 years ago
Shang Zhizhou
b9e76a0103
detect tensorRT plugin fp16 in runtime ( #27933 )
...
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
* comile with cuda9
* add some unittest
* notest;test=coverage
* add unittest for trt plugin swish && split
* update ernie unittest
* fix some error message
* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
* fix comile errror when CUDA_ARCH_NAME < Pascal"
* fix comile error
* update unittest timeout
* compile with cuda9
* update error msg
* fix code style
* add some comments
* add define IF_CUDA_ARCH_SUPPORT_FP16
* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
5 years ago
Chen Weihang
c39da29db7
Add symlink force for unittest test_static_save_load ( #29137 )
...
* add symlink force for unittest
* open unittest
5 years ago
xiaoting
9cc0e72619
Fix interpolate doc ( #29104 )
...
* fix interpolate example, test=develop;test=document_fix
* fix format, test=develop, test=document_fix
* update upsample doc, test=develop, test=document_fix
5 years ago
whs
9b39af3f22
Fix docs in 2.0 API ( #29081 )
...
* 1. grid_sample
1.1 fix has_print
2. conv1d_transpose
2.1 fix code_example error
3. conv1d
4. affine_grid
4.1 has_print
4.2 has_disable_static
5. Conv1DTranspose
5.1 fix code_example error
5.2 has_disable_static
6. Conv1d
6.1 code_example
6.2 has_disable_static
5 years ago
Guanghua Yu
bb64efb1d0
fix softmax_with_cross_entropy api en docs ( #29116 )
5 years ago
Chen Weihang
0dfb81614b
polish static save load doc, test=document_fix ( #29115 )
5 years ago
Chen Weihang
95a0f87b44
support jit.save datra parallel ( #29135 )
5 years ago
lilong12
449903dead
add paddle.broadcast_to api which is a alias of paddle.expand ( #28706 )
...
* update, test=develop
5 years ago
Noel
da71173bc9
Fix ops doc for some ops
...
Fix ops doc for some ops
5 years ago
Leo Chen
770395cb93
Split train_mode and has_grad for tracer ( #29064 )
...
* split train_mode and has_grad
* fix format
* fix ci problems
* fix sample code
5 years ago
ShenLiang
cddc70964d
fix InMemoryDataset doc ( #28688 )
...
* add Inmemorydataset
5 years ago
ShenLiang
bb5f8e3575
fix doc of data,matmul,dot,cholesky,scatter,divide,remainder,inverse,sign ( #28665 )
5 years ago
yukavio
29b5050726
Revert "add hapi api flops ( #28755 )" ( #29144 )
...
This reverts commit 63e90ee331
.
5 years ago
YUNSHEN XIE
27d04a3b1f
disable ut test_static_save_load ( #29119 )
5 years ago
JZ-LIANG
0dadacc4eb
[sharding] doc, api, bug fixed ( #28983 )
...
* add lars to fleet meta optimizer
* add lamb to proto
* add lamb to fleet meta optimizer
* fixed syntax bug
* fixed syntax bug
* fixed syntax error in lamb, add config setter of lamb in distributed_strategy
* trigger unitest to rerun
* add new unitest func for lamb
* revise unitest for lars and lamb
* revise dgc meta unitest
* revise lars document in distribute_strategy
* revise lars lamb document in distributed_strategy.py
* revise lars lamb document in distributed_strategy.py
* add weight decay exclude logic to lars
* restore optimzier.py
* restore optimizer.py as develop except lars
* add epsilon and exclude fn to distributed_sttrategy
* add lars epsilon
* revise unitest for fleet lars and lamb
* revise lars lamb unitest for CI coverage
* revise lars argument api
* revise lars argument api
* revise lars argument api
* revise api doc of lars
* fix op role
* add sharding save and add_sync_comm_for_test function
* add comm_analyse to utlis
* revise sharding_utils
* add sharding saving unittest
* revise sharding utils for unittest
* revise sharding en doc
* update sharding utils api
* add doc for sharding
* fixed bug in sharding var size count
* update varsize count in sharding
* fix sharding num_nccl_comm
* Revert "fix sharding num_nccl_comm"
This reverts commit d51587c15e9323acf226ddd36154275f0d1daf76.
5 years ago
lilong12
2a864c70c4
fix the bug in gloo ( #29112 )
...
* update, test=develop
5 years ago
whs
7de2db4a81
Fix grid_sample in cudnn mode ( #29124 )
5 years ago
Yanghello
dd41775059
fix crypto ut test error for windows ci ( #29090 )
5 years ago
Chen Weihang
d0129fcd88
Add static_only decorator for static apis ( #29015 )
...
* add static_only for static api
* addd static_only for class init
* remove static_only for default_main_program
* remove creater_parameter & startup_program
* remove failed apis
* revert py_func import
* remove global scope
* remove some api
* remove cuda pinned place
5 years ago
WangXi
e931c7baf9
Fix multi nccl comm & wait server ready ( #28663 )
5 years ago
ceci3
e7caf3b8d9
fix examples, test=document_fix ( #29019 )
...
* fix examples, test=document_fix
* fix, test=document_fix
5 years ago
Shibo Tao
db41258501
add API serialize_program, serialize_persistables, save_to_file, deserialize_program, deserialize_persistables, load_from_file. ( #29034 )
5 years ago
Aurelius84
14013a2eba
Remove prettytable in requirements.txt ( #29100 )
5 years ago
Kaipeng Deng
b052149dcf
remove BatchSampler type check ( #29114 )
...
* remove BatchSampler type check. test=develop
5 years ago
yukavio
63e90ee331
add hapi api flops ( #28755 )
...
* add hapi api flops
* fix bug
* fix some bug
* add unit test
* fix unit test
* solve ci coverage
* fix doc
* fix doc
* fix static flops
* delete the comment
* fix some grammar problem in doc
* fix some bug
* fix some doc
* fix some doc
5 years ago
hutuxian
db85f4cf8f
Add dygraph implementation for multiplex op ( #29049 )
5 years ago
joanna.wozna.intel
b0d1ac161e
Add bf16 pool2d and unify bf16 unit tests ( #29039 )
...
* Add bf16 pool2d and unify bf16 unit tests
* Add change default ops test
5 years ago
gongweibao
1358397e97
Clean up the redundant files and unify the launch interface. ( #28928 )
5 years ago
Guanghua Yu
47af5c3c9d
fix smooth_l1_loss en docs ( #29093 )
5 years ago
LielinJiang
6951052431
add default conv init ( #29092 )
5 years ago
Chen Weihang
fea0e294ee
Hide the C++ stack by default and add hints ( #29042 )
...
* default not show cpp statck & add hint
* fix failed unittest
* fix failed unittests
5 years ago
Yiqun Liu
eb9ae55849
Optimize the performance of piecewise_decay. ( #29077 )
5 years ago
qingqing01
8bbedc2371
Fix doc format for callbacks, metrics and Model ( #28638 )
...
* Fix doc format for callbacks, metrics and Model
* Fix code sample and doc
5 years ago
Chen Weihang
a1486091f1
Polish load_program_state design to loading file onebyone ( #29041 )
...
* change load dict file one by one to warn
* add unittests for coverage
* polish error message
* fix cond error
5 years ago
GaoWei8
a049dff78f
Modify the default setting of softmax cudnn ( #28672 )
5 years ago
WeiXin
fafadbab70
Rename variables when use 'jit.load' ( #28933 )
...
* Rename variables when use 'jit.load'
* Check whether the original graph contains the variable with the same name
* add comment
* rename output/input of op and edit unittest
* modify the code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
* edit code according to CI
5 years ago
ceci3
a3faa520ec
Fix syncbn ( #29013 )
...
* fix syncbn
* add unittest
5 years ago
joejiong
582c0a0468
add uint8 for reshape op ( #28996 )
...
add uint8 for reshape operator
5 years ago
FlyingQianMM
f0e614feae
change print([.*].numpy()) to print([.*]) in example codes of sigmoid_focal_loss ( #29094 )
...
* rewrite the sigmoid_focal_loss code example. test=develop
* fix spelling mistake in comments of code example.test=develop
* change print([.*].numpy()) to print([.*]) in example codes of sigmoid_focal_loss. test=document_fix
5 years ago
liym27
682cc17f53
[Dynamic-to-Static] Fix bug: support pop from a dict and polish code of convert_pop ( #29023 )
...
* Support pop for dict in dy2stat
* Move convert_pop to convert_operators.py and polish convert_pop
5 years ago
Zhou Wei
8ca0a8a859
fix tensor detach to zero copy ( #27921 )
...
* fix tensor detach to zero copy
* fix tensor detach to zero copy
5 years ago
GaoWei8
13828db313
Add reminder of fluid.layers.lstm ( #28964 )
...
* add reminder of fluid.layers.lstm
5 years ago
Aurelius84
8af0d85ea4
fix unittest failed on windows GPU ( #29072 )
5 years ago
chajchaj
dfaf6b5eea
save one name in cross_entropy and softmax_cross_entropy, test=develop ( #29074 )
...
* save one name in cross_entropy and softmax_cross_entropy, test=develop
* change used function in CrossEntropy from softmax_cross_entropy to cross_entropy, test=develop
5 years ago
taixiurong
a5aa4dc7a9
add xpu elementwise ops ( #29031 )
5 years ago
tianshuo78520a
e9acd9c941
Update CI Python3 Docker for Cuda11 ( #28401 )
5 years ago
joejiong
b04c78ef5e
Update pow ( #29000 )
...
Simple code clean up
5 years ago
huangxu96
40f5453725
Quant nn2.0 ( #28764 )
...
* Impelement 2.0 API version Conv2d and Linear layer quantization in imperative mode.
* use cudnn softmax in static Lenet
* Modified ChannelwiseQAT Unittest for 2.0 API.
* For CI python coverage.
5 years ago
Chen Weihang
bb16c2515d
Polish parallel api impl & doc details ( #28980 )
...
* polish parallel api impl & doc details
* add unittest for coverage
* remove spawn test in py2.7
* add parallel api into white list
5 years ago
joejiong
4b05a8be88
delete axis parameter in multiply api ( #28647 )
...
As the title
5 years ago
YUNSHEN XIE
c098a2e159
fixing cmake error for WITH_GPU=ON and WITH_DISTRIBUTE=OFF ( #29030 )
5 years ago
chajchaj
b52427327d
add soft_label and axis for CrossEntropyLoss and improve performance ( #29024 )
...
* add soft_label and axis for CrossEntropyLoss and improve performance,test=develop
* fix conflict in nn/functional/loss.py, test=develop
5 years ago
Chen Long
018e169923
fix some docs ( #29011 )
...
* fix some docs test=develop;test=document_fix
* add code example test=develop;test=document_fix
* fix code example test=develop;test=document_fix
* fix code example test=develop;test=document_fix
* fix code example test=develop;test=document_fix
5 years ago
ysh329
9a475582e0
fix clip by norm EN doc. test=develop ( #28994 )
...
* fix clip by norm eng doc. test=develop
5 years ago
Huihuang Zheng
5b339262bc
[Dy2stat] Add Tuple as Assign Target for Tensor Shape ( #28775 )
...
Add support for using tuple as tensor.shape (For example: a, b, c, d = x.shape)
5 years ago
YUNSHEN XIE
5cb8e17a18
restore timeout value ( #29027 )
5 years ago
Jiawei Wang
f21513307a
add lamb optimizer and unittest ( #28772 ) TODO:FIX BUGS LATER
...
* add lamb optimizer and unittest
* fix lamb
* fix lamb v2 op
* fix sampling id
* fix lamb sample code
* Update lamb.py
* fix doc
* fix doc
* Update lamb.py
5 years ago
Leo Chen
3815d7aa40
Upgrade string literals to raw string ( #28989 )
...
* upgrade comment string to raw string
* fix string in
* fix string with ' '
* revert update on comments
* upgrade only necessary
* fix sample code checker
* fix comments with '''
5 years ago
liym27
85292e0b46
[Dynamic-to-Static] Fix bug of convert_logical_and/convert_logical_or: the operands are executed sequentially( #28993 )
...
1) The operands are executed sequentially according to the running logic of Python.
2) If the left hand operand is True(for convert_logical_or)/False(for convert_logical_and), the right hand operand should be executed.
5 years ago
Huihuang Zheng
96126532cd
Fix Incorrect After Node Vars in IfElseTransformer, test=develop ( #28992 )
...
The PR description is long. See details in the PR link.
5 years ago
LielinJiang
982fd0f3c2
fix mnist fmnist ( #29018 )
5 years ago
wanghuancoder
887a35113e
fix eng doc for some api ( #28477 )
...
* fix eng doc, test=develop
* add import deprecated for layers, test=develop
* add block line for doc generate, test=develop
* remove todo for create_variable, test=develop
* add blank line for doc generate, test=develop
* add blank line for doc generate, test=develop
5 years ago
smallv0221
83cee3c9d7
Delete mq2007 dataset. ( #28995 )
...
* Fix en doc for rnn.py. test=document_fix
* Delete mq2007 dataset.
5 years ago
Chen Weihang
9cd09a8586
Polish dataloader doc detail & update example ( #28975 )
...
* polish dataloader doc detail, test=decument_fix
* fix commnet error
* fix word error
5 years ago
123malin
fbf9564f6b
【paddle.distributed.fleet】Optimize ParameterServer's Async Mode ( #28442 )
...
* test=develop, optimize global_step
5 years ago
Leo Chen
98adc8f054
Dev/fix doc of some api ( #28785 )
...
* refine doc of bernoulli
* fix some problems
* fix unsqueeze
* fix squeeze
* fix doc
5 years ago
lilong12
f77a78cdee
enable pipeline to run with Executor.run() ( #28373 )
...
* update, test=develop
5 years ago
lijianshe02
9f642ed881
fix English doc for dice_loss, log_loss, unfold and NLLLoss API test=… ( #28739 )
...
* fix English doc for dice_loss, log_loss, unfold and NLLLoss API test=document_fix
5 years ago
joejiong
4e00c095fa
fix warning in english doc ( #28981 )
5 years ago
LiuChiachi
70385518a6
Add EarlyStopping ( #28691 )
...
* add early stopping
* add doc for early stopping
* fix sample code bugs
* update infer of mode, update doc, add unittests to increase coverage rate
* fix sample code for early stopping
* update sample code and unittests
* reduce time cost of test_callbacks unittest
* fix model.py code style error
5 years ago
LiuChiachi
8c8b42f28a
Update path name of saving in hapi ( #28462 )
...
* update hapi save_inference_model output pathname
* update hapi save_inference_model output pathname
* use new 2.0-api paddle.static.io.load_inference_model
* add unittests to increase coverage rate
5 years ago
Thunderbrook
0073f9bdb0
support ps-gpu ( #28752 )
...
* ps gpu transpile
* ps gpu
* remove op
* gps trainer
* local ps
* add macro
* HeterBox
* def cuda
* tab
* code style
* style
Co-authored-by: Thunderbrook <a754913769#163.com>
5 years ago
Chen Weihang
768dab441e
polish two api doc detail, test=document_fix ( #28971 )
5 years ago
Chen Weihang
7c7cdf082f
remove fluid & variable in program ( #28966 )
5 years ago
furnace
8ff3550658
refactor momentum op to combine weight ( #27414 )
...
* refactor momentum op to combine weight_decay (scale op and sum op)
5 years ago
YUNSHEN XIE
50113f9268
fix error for Can not find test to add properties to:test_parallel_dygraph_transformer,test_parallel_dygraph_sparse_embedding ( #28968 )
5 years ago
Zhang Ting
3416c35996
refine doc of default_startup_program, test=document_fix ( #28951 )
5 years ago
Zhang Ting
842fd2933c
remove fluid in default_main_program doc, test=document_fix ( #28941 )
5 years ago
Zhang Ting
b5218227d6
refine the doc of dist op, test=document_fix ( #28947 )
5 years ago
LielinJiang
8f8a02fda4
Optimize conv performance ( #28766 )
...
* optimize conv performance
5 years ago
LielinJiang
00e55ded4a
Add lr scheduler callback for high level api ( #28737 )
...
* add lr scheduler
5 years ago
yaoxuefeng
6369463a5d
fix test datagenerator ut fail ( #28767 )
5 years ago
Kaipeng Deng
ed7aa8f9c3
Fix conv transpose ( #28629 )
...
* fix conv_transpose unittest. test=develop
5 years ago
Chen Weihang
bff4179cc7
lazily init global group in collective ( #28780 )
5 years ago
gongweibao
1dad8ceaab
Fix gpu memory allocation bug. ( #28703 )
5 years ago
Kaipeng Deng
91bab752a9
fix dataloader default value and doc ( #28728 )
...
* fix dataloader. test=develop
5 years ago
Chen Weihang
0ed80e09fc
Fix param base trainable set failed ( #28756 )
...
* fix param base trainable set failed
* add unittest
* fix typo
* polish comment
5 years ago
Aurelius84
d6aee7597c
[Dy2Stat]Set buff.persistable=False when it's not initialized ( #28749 )
5 years ago
joejiong
1a532d5133
add uint8 support for squeeze operator ( #28734 )
...
Adding uint8 support for squeeze operator.
5 years ago
123malin
9066828b1b
test=develop, bug fix for embeddings padding ( #28708 )
...
* test=develop, bug fix for embeddings padding
* fix raise Value for Embedding
Change-Id: I6d343fceee369a5796ad59cca5c91fdd15429125
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
5 years ago
Bai Yifan
655d5eb1db
fix code example ( #28636 )
...
* fix code example, test=document_fix
5 years ago
joanna.wozna.intel
8c0ea4bffe
Add bf16 matmul, fc, elementwise add and mul ( #28729 )
...
* Add bf16 matmul, fc, elementwise add and mul
* Correct unit test
5 years ago
Zhou Wei
3b0dd5f620
fix bug that to_tensor not support paddle.Place ( #28717 )
5 years ago
Aurelius84
e1c8d6bce5
Fix wrong out dtype inferred from helper.input_dtype ( #28715 )
...
* Fix wrong out dtype from helper.input_dtype
* add unittest
* remove disable_static in op_test
* fix param name typo
5 years ago
taixiurong
d3d1a6b6e0
add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun ( #28542 )
...
* 1.add xpu slice op 2. add xpu top_k op 3.modify xpu cast to new api
* 1.add xpu slice op 2. add xpu top_k op 3.modify xpu cast to new api
5 years ago
Jack Zhou
9362d85e0e
Add LSTM, Simple RNN and GRU CPU kernel ( #28577 )
...
* add lstm, simple rnn op kernel
* fix the test_lstm for the rnn op
* change func name
* fix forward postprocess bug
* add gru forward, backward code
* remove unittest.skipIf; use a big rnn op instead of combination op
* fix input doesn't have gradient bug
* add eigen lstm forward, backward
Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
5 years ago
QingshuChen
30ef3815b3
adjust kunlun header file ( #28536 )
...
* adjust kunlun header file
*test=kunlun
* update kunlun unittest
*test=kunlun
* update xpu unitest
* test = kunlun
* update xpu unittest
* test=kunlun
* update xpu unitest
* test=kunlun
5 years ago
wangguanzhong
d12aa4957f
remove alias for iou_similarity and ssd_loss ( #28742 )
5 years ago
Double_V
960135285b
fix pool APIs en doc, delete disable_static ( #28679 )
...
* fix pool exclusive and delete disable_static, test=develop
* fix pool1d exclusive, test=develop
* fix pool APIs en doc, test=document_fix
5 years ago
Wilber
60a5eb68a0
update paramattr doc. test=document_fix ( #28745 )
5 years ago
liym27
269470d62e
[Dynamic-to-Static] Remove unnecessary variables of the arguments in true_func/false_func ( #28722 )
5 years ago
danleifeng
7d32e100c5
upgrade to use paddle-2.0 API for sample code of paddle.t ( #28726 )
5 years ago
liym27
e5f0e6b003
[Dynamic-to-Static] Fix bug in loop_transformer: loop vars should contain the var from ancestor-for-node ( #28735 )
5 years ago
Wojciech Uss
04bcc13fac
Add multi_gru op and tests ( #28591 )
...
* Add multi_gru op and tests
* removed redundant disable_dygraph()
5 years ago
Wilber
fe2cf39f77
[2.0] Update py_func English doc. ( #28646 )
5 years ago
YUNSHEN XIE
16a808149a
modfied the timeout value for 5 ut ( #28714 )
...
* modfied the timeout value for 5 ut
* modified timeout value for test_resnet
5 years ago
liym27
9881738e13
[Dynamic-to-Static ErrorMessage] Support dy2stat error message when call jit.save and polish error message ( #28713 )
...
* Support dy2stat error message when call jit.save;
* Polish dy2stat error message:
(1) the original dygraph code is marked with (* user code *) ;
(2) "In user code:" -> "In transformed code:"
5 years ago
joejiong
32b90b1c2d
add log10 ( #28576 )
...
Add new operator log10
5 years ago
Shibo Tao
f0806bdaf2
fix save_inference_model and load_inference_mode alias. test=develop ( #28736 )
5 years ago
TeslaZhao
abbc507a81
Fix two english api documents, transpose and strided_slice ( #28687 )
...
* Fix two english api documents, transpose and strided_slice
* delete nouse comments
5 years ago
Zhou Wei
5562d8094f
fix API optimizer get_lr to support static graph mode ( #28681 )
...
* fix doc of save/load
* fix API optimizer get_lr
* fix API optimizer get_lr
5 years ago
furnace
caffa85ffe
add alias for fluid.initializer.set_global_initializer, alias is nn.initializer.set_global_initializer ( #28690 )
5 years ago
Shibo Tao
f096af83a0
fix document sample. test=develop ( #28721 )
5 years ago
Chen Weihang
3c5f2cac68
fix save parse error for dict input ( #28712 )
5 years ago
Huihuang Zheng
9ab335bbd9
Fix convert_call May be Called Multiple Times, test=develop ( #28710 )
...
Fix convert_callmMay be called multiple times in Dy2stat. Also strip some strings to make sure no influence from blank spaces.
5 years ago
wawltor
20b1276598
faster the compare ops dygraph model speed
...
faster the compare ops dygraph model speed
5 years ago
JZ-LIANG
5a9f6889c1
[Sharding] add new features ( #28568 )
...
* add lars to fleet meta optimizer
* add lamb to proto
* add lamb to fleet meta optimizer
* fixed syntax bug
* fixed syntax bug
* fixed syntax error in lamb, add config setter of lamb in distributed_strategy
* trigger unitest to rerun
* add new unitest func for lamb
* revise unitest for lars and lamb
* revise dgc meta unitest
* revise lars document in distribute_strategy
* revise lars lamb document in distributed_strategy.py
* revise lars lamb document in distributed_strategy.py
* add weight decay exclude logic to lars
* restore optimzier.py
* restore optimizer.py as develop except lars
* add epsilon and exclude fn to distributed_sttrategy
* add lars epsilon
* revise unitest for fleet lars and lamb
* revise lars lamb unitest for CI coverage
* revise lars argument api
* revise lars argument api
* revise lars argument api
* revise api doc of lars
* fix op role
* add sharding save and add_sync_comm_for_test function
* add comm_analyse to utlis
* revise sharding_utils
* add sharding saving unittest
* revise sharding utils for unittest
5 years ago
xiaoting
8c75b2554a
Support Tensor for attr_scale and attr_size ( #28677 )
...
* update interpolate, test=develop
* fix coverage, test=develop
5 years ago
YUNSHEN XIE
e880c90c5a
fix error when setting ut timeout value ( #28696 )
5 years ago
LielinJiang
01a14e1be2
Add with_pool args for vgg ( #28684 )
...
* add arg for vgg
5 years ago
LielinJiang
532e4bbf2a
fix docs ( #28683 )
5 years ago
Shibo Tao
db2e6cee62
add two paddle-2.0 apis: paddle.static.io.save_inference_model and paddle.static.io.load_inference_model ( #28606 )
...
* add two apis: paddle.static.io.save_inference_model and paddle.static.io.load_inference_mode, which are campatible with paddle.fluid.io.save_inference_model and paddle.fluid.io.load_inference_model respectively.
* add unittest for new save_inference_model and load_inference_model. test=develop
* enhance doc. test=develop
* add paddle.enable_static() to test_inference_model_io.py. test=develop
5 years ago
Chen Weihang
358d6bc90f
Fix test_weight_decay_extend random failed on windows ( #28643 )
...
* add debuging code
* change seed & add debug message
5 years ago
Bai Yifan
5050e761b8
Support user-defined activation/weight quantize and preprocess. ( #28570 )
...
* support user-defined quant and preprocess
5 years ago
Leo Chen
11e32baf1e
Add matmtl_v2 to amp list ( #28693 )
...
* add matmtl_v2 to amp list
* support dygraph
5 years ago
Huihuang Zheng
b6f86b8491
Fix Using "isinstance" in Loop, test=develop ( #28641 )
...
Fix a bug that used in PaddleGAN model which used `isinstance` in a for loop
5 years ago
lilong12
e4f9415338
update doc, test=document_fix ( #28498 )
5 years ago
Jacek Czaja
6d8d3d4c22
[oneDNN] Layer norm bf16 kernel ( #28619 )
5 years ago
LielinJiang
cdc4e6620d
fix lenet num classes ( #28642 )
5 years ago
wangchaochaohu
912a5c30b4
fix the matmul_v2 test for cuda11 ( #28635 )
5 years ago
littletomatodonkey
82f0b5ea5c
adapt pad const ( #28585 )
...
* adapt pad const
* fix comment and rm fluid import
* rm stdout
* fix note
5 years ago
Double_V
d71c3463b0
fix pool exclusive and delete disable_static ( #28655 )
...
* fix pool exclusive and delete disable_static, test=develop
* fix pool1d exclusive, test=develop
5 years ago
Aurelius84
8040fa2bca
Fix output dtype inconsistent with input ( #28649 )
...
* fix output dtyp inconsistent with input
* refine code
5 years ago
wangchaochaohu
68ee7f7312
fix overwrite for gather OP of API2.0( #28659 )
5 years ago
zhupengyang
2cd10fc465
fix 2.0 api docs ( #28445 )
5 years ago
Chen Weihang
a083c76ab4
adjust signal failed wait time ( #28640 )
5 years ago
Zhou Wei
361a53930f
fix doc of save/load ( #28645 )
5 years ago
cc
d1e84f3e9e
Add some ops for cacluating output scale, test=develop ( #28644 )
5 years ago
wawltor
8f2656ef5c
fix the gradient bug for the topk v2
...
fix the gradient bug for the topk v2
5 years ago
wangchaochaohu
a972c33fd7
refine gather OP performance for dynamic mode ( #28587 )
5 years ago
Kaipeng Deng
ece1e4cd9d
Add weighted random sampler ( #28545 )
...
* add WeightedRandomSampler. test=develop
5 years ago
joanna.wozna.intel
2cb71c0cde
Add checkpoint to quantize ( #28612 )
...
* Add checkpoint to quantize
* Change bfloat16 option
5 years ago
lidanqing
804271cff9
Op version python mkldnn_inplace test ( #28354 )
...
* add mkldnn inplace op version test
* update mkldnn_inplace fuse pass
* update the inplace test
5 years ago
pangyoki
72e068f1ba
fix test_multinomial ( #28558 )
...
* fix test_multinomial
* fix test_multinomial add 0 prob
5 years ago
Kaipeng Deng
89d27de90f
DataLoader support not auto collate batch ( #28425 )
...
* DataLoader support not auto collate batch. test=develop
5 years ago
Huihuang Zheng
c5c273c13e
[Dy2stat] Fix Using Tuple for Transpose in Dy2stat ( #28574 )
...
PaddleSeg uses tuple as parameter of transpose in dygraph code:
https://github.com/PaddlePaddle/PaddleSeg/blob/release/v0.7.0/dygraph/paddleseg/models/danet.py#L152
However, in dy2stat, static code doesn't support the perm as a tuple. This PR fixed it.
5 years ago
GaoWei8
2b1e7e5b02
Polish where english doc ( #28595 )
5 years ago
Guo Sheng
110febdc54
Fix gradients with ignore_idx in softmax_with_cross_entropy ( #28622 )
...
* Fix gradients with ignore_idx in softmax_with_cross_entropy.
test=develop
* Fix gradients with ignore_idx in softmax_with_cross_entropy on cpu.
Remove softmax_with_cross_entropy from op_threshold_white_list.
test=develop
* Fix test_softmax_cross_entropy_op.py.
test=develop
5 years ago
Guo Sheng
a3bc3bcd48
Fix scaled_params append error in AdamW. ( #28633 )
...
Fix no_grad setting in AdamW.
test=develop
5 years ago
YUNSHEN XIE
c4d22c845b
modified timeout value for some ut ( #28616 )
5 years ago
qingqing01
f7dd889ca4
Support squeezed label as input in paddle.metric.Accuracy ( #28535 )
...
* Support squeezed label as input in paddle.metric.Accuracy
* Revert cifar and fix UT
5 years ago
LielinJiang
1c3eef4cee
Fix vgg error when num_classes is given ( #28557 )
...
* fix vgg num classes
5 years ago
LielinJiang
1de3cdd0ab
Fix summary api for rnn gru lstm ( #28566 )
...
* fix summary for rnn gru lstm
5 years ago
danleifeng
a24d186814
fix nccl init failed in parallel dygraph mode ( #28497 )
5 years ago
Huihuang Zheng
1adc09b56a
Make Batch Size Smaller on Mac Because of CI Machine ( #28569 )
...
I found the unittest failed due to batch size. Maybe the reason is that our CI machine has limited memory. I decreased the batch size.
5 years ago
lilong12
ed9dd7c9f0
add send and recv ops ( #28590 )
...
* update, test=develop
5 years ago
LiuChiachi
5579edfb3c
save dtype of inputs ( #28301 )
5 years ago
Yang Zhang
c47bfe98cf
Reduce input data size for syncbn unit test by 8x ( #28571 )
...
this is a preemptive measure, the unit test was failing occasionally with
cudnn internal error
5 years ago
channings
c545b9b673
Add ONNX Exporter ( #27831 )
...
* add onnx export module, test=develop
* add unit test for paddle.onnx.export
* adjust api & doc
* fix some typo
5 years ago
Zhou Wei
bf6e7cba7a
updata 2.0 API english doc ( #28525 )
...
* make Numpy version is below 1.19.3
* fix 2.0 doc
5 years ago
Yang Zhang
614f20f6ba
Update `add` `clip` `MSELoss` and `no_grad` ( #28530 )
...
also fix `no_grad` alias
5 years ago
Shang Zhizhou
8699f38d08
裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug ( #28517 )
...
* skip_layernorm_op done
* add unittest
* slice op convertor support trt < 6
* skip_layernorm only work in ernie
5 years ago
joejiong
08d2413142
add log2 operator ( #28319 )
...
As the title
5 years ago
lidanqing
0fc181dbd0
[Fix bug] If the pass name is not found, IsCompatible should return false ( #28475 )
5 years ago
Steffy-zxf
b258caf467
fix add_n doc (eng) ( #28464 )
...
fix the add_n english doc
5 years ago
Leo Chen
543ff333cd
Refine the format of printing tensor 3 (support scaler tensor) ( #28544 )
5 years ago
furnace
26d292b108
bugfix for api (mv, empty, empty_like op) ( #28513 )
5 years ago
YUNSHEN XIE
621b31c526
modified timeout value for test_resnet_v2 and test_resnet ( #28532 )
5 years ago
Kaipeng Deng
5305b2749a
deprecated APIs under paddle.dataset. test=develop ( #28423 )
5 years ago
liym27
0ce933a9ee
[API2.0] Fix documents of 6 APIs to fit API2.0: ( #28514 )
...
1. Remove 'fluid';
2. Variable -> Tensor
APIs:
sum, convert_call, convert_ifelse,
convert_logical_and, convert_logical_or, convert_logical_not
5 years ago
Huihuang Zheng
3b0d31ab89
Modify ProgramTranslator and TracedLayer Doc for API 2.0 ( #28509 )
...
Modify ProgramTranslator and TracedLayer Doc for API 2.0
5 years ago
Huihuang Zheng
546b1c1dc4
Fix paddle.jit.dy2static.data_layer_not_check 2.0 API Doc ( #28461 )
...
Remove "fluid", "variable" in 2.0 API doc
5 years ago
yukavio
8f664a5c49
fix one_hot example code ( #28432 )
5 years ago
wangchaochaohu
241505c262
refine the doc for API2.0 ( #28386 )
5 years ago
YUNSHEN XIE
2159646796
modified timeout value for test_conv3d_mkldnn_op,test=document_fix ( #28496 )
5 years ago
Huihuang Zheng
eb0855615c
Set Exclusive for test_yolov3 to Avoid Memory Segmentation Fault in CI Random Failure ( #28485 )
...
As the title
5 years ago
YUNSHEN XIE
e1a5fc449d
fix ut exec timeout notest,test=kunlun ( #28495 )
...
* fix ut exec timeout notest,test=kunlun
* fix error for executing ut timeout,test=document_fix
5 years ago
YUNSHEN XIE
d3b2d07d6e
modified timeout value on windows ( #28499 )
...
* modified timeout value on windows
* fix some error
5 years ago
YUNSHEN XIE
1e698c600e
fix cmake error when setting ut timeout properity ( #28492 )
5 years ago
YUNSHEN XIE
faa6549877
setting timeout properity for test_flags_mkldnn_ops_on_off ( #28489 )
5 years ago
Tao Luo
7fd2077211
set NCCL_SHM_DISABLE=1 for test_parallel_executor_profilery.py ( #28484 )
5 years ago
smallv0221
4fa1d39205
Fix en doc for rnn.py. test=document_fix ( #28470 )
5 years ago
WangXi
18525d3635
fix Tanh remainder en doc ( #28455 )
5 years ago
wangchaochaohu
e14ed71cc2
refine the performance of gather Op ( #28458 )
5 years ago
YUNSHEN XIE
72c78e4dbb
exec ut no more than 15s 2 ( #28441 )
...
* exec ut no more than 15s 2
* fix for ut test_inplace_addto_strategy timeout
5 years ago
YUNSHEN XIE
ba0756325a
exec ut no more than 15s 1 ( #28439 )
...
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
* test for limiting ut exec time as 15S
* fix an error caused by cannot find ut
* fix some error
* can not find test_transformer
* fix error caused by ut not run in windows
* fix error caused by Compiler Options
* fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
* setting timeout value to 120s for old ut
* add the timeout value setting
* fix error caused by ut only run in coverage_ci
* add analyzer_transformer_profile_tester
* fix some error
* fix some error
* fix error with inference option
* fix error with inference option setting as ON_INFER
* add some ut to set timeout
* modified some option
* fix error
* fix some timeout error
* fix error
* fix error
* fix timeout for test_analyzer_bfloat16_resnet50
* fix error
* setting timeout properity for some ut
* first pr for new ut timeout as 15S
5 years ago
Chen Weihang
155b4f9b6c
Remove selected rows all reduce over height check ( #28460 )
...
* remove slelected rows all reduce over height check
* polish unittest
5 years ago
WeiXin
b5e662f837
refine jit.save/load to add support for other method, not only forward ( #28376 )
...
* refine jit.save/load to add support for other method, not only forward
* refine the code based on unit tests
* Add unit test for the code
* Add unit test for the code
* Modify the code according to the unit test
* Delete useless comments, save only one info file, etc.
* remove static_mode_white_list.pyc
* edit the code that generate 'extra_var_info'
5 years ago
Zhang Ting
7fe5f9ccad
Fix unittest random failure ( #28363 )
...
* fix random failure
* use two input(x)
* fix model
5 years ago
joanna.wozna.intel
7821759d48
Add bfloat16 softmax and gelu ( #28394 )
...
* Add bfloat16 softmax and gelu
* Add pass attr bfloat16_enabled_op_types
* Changes from review
5 years ago
Aurelius84
bd8dfe38ca
[Dy2Stat] Refine code example for 2.0 ( #28440 )
5 years ago
littletomatodonkey
f6834034cc
fix sample code ( #28446 )
5 years ago
zhupengyang
0155f91679
enable softmax unittest ( #28362 )
5 years ago
Aurelius84
2500dca878
[Dy2Stat] Fix bug in convert_call ( #28368 )
...
* Fix bug in convert_call
* refine unittest
* refine code
* refine code
* fix unittest failed
* add assert
5 years ago
Jacek Czaja
ca41541472
[oneDNN]Sum bf16 kernel ( #28382 )
...
* - Added sum bf16 oneDNN
test=develop
* - Fix to UT of sum bf16
test=develop
5 years ago
LielinJiang
b89b4e321d
add fashion dataset ( #28411 )
5 years ago
Kaipeng Deng
463075a82b
add paddle.io.ComposeDataset & paddle.io.ChainDataset ( #28311 )
...
* add paddle.io.ComposeDataset & paddle.io.ChainDataset. test=develop
5 years ago
Qi Li
a4303496b2
[DOC] activation api doc, test=document_fix ( #28405 )
5 years ago
LielinJiang
3c95acc3eb
Remove cv2 dependence of normalize ( #28361 )
...
* rm cv2 dependence of normalize
5 years ago
Leo Chen
44a476c2ab
support cuda pinned place ( #28416 )
5 years ago
lidanqing
12b9587be5
Add conv_bias pass version python test ( #28278 )
...
* add conv_bias pass version test
* update according to reviews
5 years ago
Li Fuchen
f401907775
modified sample code of CTCLoss & ctc_loss by remove disable_static() & print([.*].numpy()) & alias, test=document_fix ( #28403 )
5 years ago
Kaipeng Deng
95b1868366
update DataLoader doc ( #28290 )
...
* update DataLoader doc. test=develop
5 years ago
Leo Chen
71d6220772
Skip reader op in mixed_precision decorator ( #28353 )
...
* skip reader op in mixed_precision decorator
* add ut
5 years ago
Leo Chen
8b2436a776
Add broadcast_shape api ( #28257 )
...
* add broadcast_shape api
* add ut
* follow comments
* add example code, test=dodument_fix
* update example code, test=document_fix
5 years ago
石晓伟
21a63f6f90
enhance the op_version_registry, test=develop ( #28347 )
...
* enhance the op_version_registry, test=develop
* add unittests, test=develop
* enhance the op_version_registry, test=develop
* fix bugs, test=develop
* revert pybind_boost_headers.h, test=develop
* fix a attribute bug, test=develop
5 years ago
Shang Zhizhou
ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 ( #28367 )
...
* fp16 result ok
* change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS
* auto detect special slice op converter for ernie with trt oss
* ernie oss only support fp16
* fix special_slice_plugin serialize bug
* matmul in tensorrt ok
* ernie unittest ok
* add matmul tensorrt unittest
* remove demo code
5 years ago
Jacek Czaja
84cc61b2cd
[oneDNN] sum op refactor ( #28318 )
5 years ago
Double_V
17db031a05
fix pool bug, test=develop ( #28359 )
5 years ago
Leo Chen
6115c14fca
Pool2d cuda kernel supports fp16 ( #28316 )
...
* pool2d cuda kernel supports fp16
* fix compile issue of template
* add ut
5 years ago
Guo Sheng
9a600df373
Add rnn_op ( #28197 )
...
* Add rnn_op.
test=develop
* Fix rnn_op grad maker's drop_empty_grad.
test=develop
5 years ago
Huihuang Zheng
acc11c2a62
Retry CUDA Initialization to Fix Random Failure, test=develop ( #28323 )
...
This PR is follow up of #28213 . On that PR we tried to decrease GPU usage, however the CI still randomly failed. So I added retry logic for the initialization of nccl and cusolver. If the initialization failed, we can retry to avoid the random failure.
5 years ago
wangguanzhong
5262b02585
add generate_proposals_v2 op ( #28214 )
...
* add generate_proposals_v2 op
5 years ago
Guo Sheng
b96869bc31
Fix lr setting of AdamW when lr is an instance of LRScheduler ( #28300 )
...
* Fix lr setting of AdamW when lr is an instance of LRScheduler.
test=develop
* Fix static graph test mode in test_adamw_op.py.
test=develop
5 years ago
Aurelius84
57e4411ab5
[Dy2stat] Support to modify value of buffer tensor ( #28328 )
...
* [Dy2stat] Support to modify value of buffer tensor
* remove "defaultTest"
* fix name confliction
5 years ago
石晓伟
d9b5f1261c
update the version of pybind, test=develop ( #28284 )
...
* update version pybind to v2.4.3, test=develop
* update unittests, test=develop
5 years ago
Leo Chen
18c86fb2fb
hide some logs of p2p ( #28307 )
5 years ago
Chen Weihang
4086f48ea1
Check and fix tensor and scalar type promotion ( #28299 )
...
* check and fix tensor and scalar type promotion
* fix else branch error
* fix scalar method error
* fix test_math_op_path unittest
* add future division for unittest
* rm useless bin file
5 years ago
LielinJiang
fb1e0c93c9
Make vision datasets return PIL.Image as default ( #28264 )
...
* return pil image as default according backend
5 years ago
zhulei
26ede6e07e
Add median api. ( #28310 )
...
* Add median api.
* Add median api.
* Add median api.
* Add median api.
* Add median api.
5 years ago
wanghuancoder
3ccc0a2f5e
enable test_parallel_executor_fetch_isolated_var ( #28219 )
...
* enable test_parallel_executor_fetch_isolated_var, test=develop
* add enable_static, test=develop
* set test_parallel_executor_fetch_isolated_var RUN_TYPE=DIST, develop=test
5 years ago
joanna.wozna.intel
571a63e7ec
Add bf16 transpose2, reshape2, concat ops ( #28195 )
5 years ago