123malin
2a98e9323a
test=develop, add distributed_infer ( #30300 )
...
* test=develop, add distributed_infer
4 years ago
Wilber
96784ed6c8
fix compile error on ARM ( #30398 )
4 years ago
Chen Weihang
ae1f32091a
fix prune input bug ( #30384 )
4 years ago
WeiXin
5ff4f1ad5e
move 'load_op_library','LayerHelper' to 'paddle/incubate' ( #30339 )
4 years ago
Huihuang Zheng
cd5f11b822
Decrease Batch Size for Windows CI, test=develop ( #30331 )
...
As the title
4 years ago
cc
8e3a294045
skip quantizing ops in cpu inference ( #30342 )
...
* skip quantizing ops in cpu inference, test=develop
4 years ago
Bai Yifan
ad6fee2fa8
fix quantize error in special naming model ( #30354 )
4 years ago
huangxu96
342d62de60
add amp example document ( #30314 )
4 years ago
Huihuang Zheng
017a534888
Decrease Mac Input Size Because of CI Short Memory ( #30330 )
...
As the title
4 years ago
Leo Chen
3d015f1cf5
Set expected place in child thread for dataloader to avoid costing cuda memory on other card ( #30338 )
...
* set expected place in child thread for dataloader
* set device id when set tensor from numpy
* revert tensor_py change
* add compile guard
* fix ci
* fix bug
4 years ago
QingshuChen
2c1bba02e4
optimize memcpy perf for kunlun ( #30291 )
...
* optimize memcpy perf for kunlun
* remove useless unittest for kunlun mean
* minor
4 years ago
cnn
10ae31579b
update error information ( #30277 )
4 years ago
huangxu96
ee623bff64
Implemented AddQuantDequantPass in imperative quantization. ( #26692 )
...
* Implemented AddQuantDequantPass in imperative quantization.
* Supported LeakyReLU Quantization
* For meeting coverage rate.
* Changed the file name of test of AddQuantDequant
* Implemented more Quantized NoWeightLayers.
* Fix the loss cannot align problem between static and dynamic model quantization, add swish as supported quantized layer in imperative quantization.
* remove noweight_list
* support 2.0 API such as Pool2D and ReLu
4 years ago
ShenLiang
a60f17b89d
Support unused parameters in dynamic graph distributed ( #30224 )
4 years ago
JZ-LIANG
75936d838f
Recompute Offload ( #30233 )
4 years ago
houj04
dc12b5eedf
resolve #30141 ( #30145 )
...
fix compile problem on FT
4 years ago
lidanqing
a238298659
Skip some conv2d_int8 tests in windows ( #30128 )
4 years ago
Wojciech Uss
fc42faffc2
Wojtuss/upgrade one dnn 2.0 ( #30295 )
...
* upgrade oneDNN version to 2.0 master branch
* - Added workarounds for new lib onednn change
* fix regex
Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>
4 years ago
tangwei12
5e839e4da5
add sparse embedding & load vars for 2.0 & gloo bug fix ( #30306 )
...
* add sparse embedding & load vars for 2.0
Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b
* fix hdfs gloo
Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6
* fix gloo hdfs
Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e
* move loadvar/sparse embedding from incubate to static
Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0
4 years ago
YUNSHEN XIE
da3ab010e0
disable test_pipeline ( #30204 )
...
* disable test_pipeline
* fix error
4 years ago
tangwei12
25f80fd304
Fix/distributed proto ( #29981 )
...
* rename sendrecv.proto to namespace paddle.distributed
* split ps with distributed
4 years ago
Chengmo
d479ae1725
【Paddle.Fleet】Support local save sparse param ( #30175 )
...
* add save tensor support
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
4 years ago
chajchaj
113810c557
fix bug of celoss when using ignore_index and reduction ( #30180 )
...
* fix bug of using ignore_index and reduction,test=develop
* fix bug of celoss when using ignore_index and reduction, test=develop
* improve performance when ignore_index=-100, test=develop
* add test in test_cross_entropy_loss.py for coverage rate, test=develop
* rm comment in test_cross_entropy_loss.py, test=develop
* del hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop
* change mask to a more simplified implementation, test=develop
* del comment in python/paddle/nn/functional/loss.py, test=develop
* del hard code and change mask to a more simplified implementation, test=develop
* change mask to a more simplified implementation, test=develop
* change mask to a more simplified implementation, test=develop
4 years ago
Double_V
231501fefc
fix elugradgrad test fail & error message opt ( #30171 )
...
* fix elugradgrad test fail and error message opt
* fix unittest, test=develop
* Update prroi_pool_op.h
fix error message
* opt message,test=develop
* fix ci fail,test=develop
4 years ago
Zhen Wang
fb49ea388e
Fix the accuracy problem of allclose op when using float64 data type in static mode. ( #29890 )
...
* Fix the accuracy problem of allclose op when using float64 data type in static mode.
* Format the code style.
4 years ago
furnace
77051cc9f0
add fp16 support for tril_triu op ( #30186 )
4 years ago
LielinJiang
86d81af5ef
reduce unittest time of test_datasets ( #30275 )
4 years ago
liym27
b4989fb744
Support vector<double> as type of op attribute, and op set_value supports vector<double> as value ( #30126 )
4 years ago
furnace
c6296b2b0e
fix empty op unit test fail sometimes ( #30225 )
4 years ago
AshburnLee
924aac2216
Add tf32 switch for cuDNN ( #29192 )
4 years ago
chentianyu03
c7371b7b20
type promotion for grad ( #30177 )
...
* type promotion for grad
* add type promotion for div op
4 years ago
YUNSHEN XIE
42a6442a08
disable ut test_tsm on windows ( #30017 )
...
* disable ut test_tsm on windows
* fix error
* add ut execute time
4 years ago
Jiaqi Liu
b7335b4db7
Alias from paddle.fluid.layers.auc to paddle.static.auc ( #30206 )
...
* add alias from fluid.layers.auc to static.auc
* Update __init__.py
4 years ago
WeiXin
edafb5465a
Fix bug for 'save multiple method' ( #30218 )
...
* Fix bug for 'save multiple method'
* To pass coverage.
* edit code to pass coverage.
* edit code to pass coverage.
* add unittest for coverage.
* change for coverage.
* edit for coverage.
4 years ago
gongweibao
8700a7bd90
Fix unittests bugs. ( #30250 )
4 years ago
Bai Yifan
dd6f591991
fix test_pool3d_op timeout issue ( #30248 )
4 years ago
Huihuang Zheng
c372a76303
Add Static Variable Clone ( #30208 )
...
Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat
4 years ago
XiaoguangHu
6bfdef727e
clean redundant API alias in 2.0 - part 2 ( #30013 )
...
* delete paddle.nn.functional.assign
* fix dynamic to static error
4 years ago
LielinJiang
e6a1e8757d
Delete incorrect warning message ( #30196 )
...
* fix warning and no grad
4 years ago
wangchaochaohu
af80859dd6
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) ( #29885 )
4 years ago
pangyoki
da16b33f2e
add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op ( #29913 )
...
* add view strategy on squeeze,unsqueeze,reshape,flatten
* add squeeze unittest
* add unittests
* use View strategy as name rather than Reuse Allocation
* fix view api doc
* fix format
* use core.ops when input of reshape2 is Tensor
* fix test_cross_entropy_loss error because of reshape2
* delete selected_rows
* change op_function
* little change
* solve HandleViewBetweenInputAndOutput
4 years ago
huangxu96
be5c2e6050
fix windows bug ( #29993 )
4 years ago
Chen Weihang
3016ba852e
remove distributed prepare context ( #30219 )
4 years ago
Zhen Wang
7f7dfccf20
Support pure fp16 training for AMP API. ( #29544 )
...
* add cast ops before and after unsupported fp16 ops.
* Keep partial net in FP32 pattern.
* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.
* Add fp16 support for adam op.
* add multi precision attr for adam.
* Fix the bug of test_multi_precision_fp16_train UT.
* Code format for CI.
* Fix the redefine error about MPTypeTrait on windows.
* fix bugs of the _create_accumulators func in Momentum.
* fix bug when inserting post cast op.
* Add the update_loss_scaling op in allow_set of UnusedVarCheck.
* Update for ci coverage.
* Add some doc for OptimizerWithMixedPrecision.
* Fix the code style.
* Improve the doc of `amp_init`.
* Change for fp16 testing if users have the infer program defined in separate way.
4 years ago
Leo Chen
8696335f86
Fix dtype of ungenerated grad var ( #28511 )
...
* fix dtype of ungenerated grad var
* update ut
* refine code
* set default dtype
* fix could_use_cudnn bug
* remove debug code
* re-implement
* fix bug
4 years ago
Aurelius84
03e072736e
Skip convert tensor shape while using Paddle.shape ( #30223 )
...
* fix tensor shape bug
* fix op_num
* clean code
4 years ago
liym27
49411a20da
In creation.assign, reuse the implementation of layers.tensor.assign to avoid maintaining two copies of the code ( #30227 )
4 years ago
littletomatodonkey
e03171b7c7
fix pad ( #30222 )
4 years ago
liym27
31ed9a5ed3
[Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* ( #30156 )
4 years ago
liym27
ad55f609d5
[Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negative ( #29965 )
...
1. When x is Variable, call nn.shape(x) only in the following cases:
1) The shape of x is used in a control flow condition.
2) The dim to be used is negative.
2. When x is Variable, but x.shape or x.shape[idx] doesn't contain a negative value, don't convert to paddle.shape()
4 years ago
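The rule stated in the #29965 message above can be sketched in plain Python. This is only an illustration, not the Dy2Stat transformer's code: the function name `needs_runtime_shape` and its parameters are hypothetical, and it assumes that an unknown dimension is encoded as a negative value (such as -1) in the static shape.

```python
from typing import Optional, Sequence


def needs_runtime_shape(static_shape: Sequence[int],
                        idx: Optional[int] = None,
                        used_in_control_flow: bool = False) -> bool:
    """Decide whether x.shape (or x.shape[idx]) must be read at run time.

    Hypothetical helper: a negative entry in static_shape is assumed to
    mean "unknown until run time".
    """
    # Case 1.1: shapes used in a control flow condition are always converted.
    if used_in_control_flow:
        return True
    # Case 1.2: a single dim is requested, convert only if it is negative.
    if idx is not None:
        return static_shape[idx] < 0
    # Case 2: the whole shape is requested, convert only if any dim is negative.
    return any(dim < 0 for dim in static_shape)


# A batch dimension that is unknown at compile time.
print(needs_runtime_shape([-1, 3, 224]))         # whole shape has an unknown dim
print(needs_runtime_shape([-1, 3, 224], idx=1))  # dim 1 is statically known
```

When the helper returns False, the static value can be used directly, which avoids inserting an extra shape op into the program.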
Leo Chen
1f97d61c68
Add callback after TensorCopy ( #30123 )
...
* change to tensor copy sync
* change to tensor copy sync
* make copy_to safe when use TensorCopy
* refine code
* add ut
* add cudapinned garbagecollector
* add testcase: cpu place -> cuda pinned place
4 years ago
liym27
b2483d78a8
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block( #30168 )
...
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
4 years ago
Chengmo
528e03fc08
【Paddle.Fleet】Fix tensor table ( #30075 )
...
* add tensor table
4 years ago
guofei
1bdf924217
Quantization supports 2.0 APIs ( #30036 )
...
* Quantization supports 2.0 APIs
* Fix the error of save_quantized_model
4 years ago
Chen Weihang
d0fb06b27f
[Complex] Simplify prepared op impl to improve performance ( #30153 )
...
* simplify prepared op impl to improve performance
* fix kunlun compile error
* continue fix kunlun compile error
* only transform diff place when dtype diff
* fix failed unittests
* remove useless file
* polish impl by review comment
4 years ago
Chen Weihang
e503470700
try multi times for sys.exit ( #30188 )
4 years ago
WangXi
619c62bb48
fix adamw apply gradient ( #30130 )
4 years ago
LutaoChu
1ff69f58b6
fix paddle.pow doc, test=document_fix ( #30159 )
4 years ago
wangchaochaohu
7dd551e08b
refine the paddle place support using str ( #28769 )
4 years ago
Chen Weihang
8020e34e7c
Simplify the options of spawn based on fleetrun ( #30144 )
...
* Simplify the options of spawn based on fleetrun
* polish details
* polish doc details
4 years ago
tangwei12
4763e6bc4e
pre padding in dygraph ( #30163 )
...
Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
4 years ago
123malin
198fbdfb60
Add Lookahead and ModelAverage Optimizer ( #30004 )
...
* test=develop, add model_average and lookahead
4 years ago
ceci3
6a19e41f1f
fix syncbn convert ( #30158 )
...
* fix syncbn convert
* add unittest
4 years ago
Leo Chen
adac38c506
add dispensable input for core.ops.reshape2/expand/slice ( #30072 )
...
* add dispensable input 'shape' for core.ops.reshape2
* add dispensable inputs for core.ops.reshape2/expand/slice
* add ut
4 years ago
Zhou Wei
30888ca343
Polish and Optimize the print/repr information of Layer ( #29998 )
...
* Polish and Optimize the print/repr message of all layer
* fix some code format
4 years ago
WeiXin
f3a2392662
Extend the timeout for the ( #30151 )
4 years ago
Zhou Wei
9c99d37906
fix unittest failed on windows ( #29837 )
4 years ago
liym27
9922bd4125
Fix bug: In dynamic mode, if start or end is negative, __getitem__ returns wrong result ( #30003 )
...
1. when slice_item is a slice:
1) the start of __getitem__ should be std::max(start, 0)
2) the end of __getitem__ should be std::min(end, dim)
2. when slice_item is an integer, it should be in [-dim_len, dim_len)
3. Fix error message to use accurate data
4 years ago
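The clamping rules listed in the #30003 message above can be sketched in plain Python. This is an illustration only, not the C++ code from the commit: the function names are hypothetical, the step is assumed to be 1, and the sketch assumes a negative start or end is first shifted by the dimension length before clamping.

```python
from typing import Tuple


def normalize_slice(start: int, end: int, dim: int) -> Tuple[int, int]:
    """Map a (start, end) pair with step 1 onto valid bounds for a dim."""
    # Assumed first step: negative bounds count from the end of the dim.
    if start < 0:
        start += dim
    if end < 0:
        end += dim
    # Rule 1.1: start = max(start, 0); rule 1.2: end = min(end, dim).
    return max(start, 0), min(end, dim)


def normalize_index(index: int, dim: int) -> int:
    """Rule 2: an integer index must lie in [-dim, dim)."""
    if not -dim <= index < dim:
        raise IndexError(f"index {index} is out of range [{-dim}, {dim})")
    return index + dim if index < 0 else index


def take(data: list, start: int, end: int) -> list:
    """Select elements using the normalized bounds; empty if start >= end."""
    lo, hi = normalize_slice(start, end, len(data))
    return [data[i] for i in range(lo, hi)]


print(normalize_slice(-2, 100, 5))    # (3, 5)
print(take([0, 1, 2, 3, 4], -2, 100)) # [3, 4]
```

With these bounds, `take` agrees with Python's built-in list slicing for step 1, which is a convenient reference when checking edge cases.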
gongweibao
4d2a4bb27a
fix logs info test=develop ( #30071 )
4 years ago
ceci3
a125d6331f
fix bn docs ( #30096 )
4 years ago
ceci3
334247791a
add attribute for batch_norm ( #29950 )
...
* add attribute for batch_norm
4 years ago
Jiaqi Liu
2e8425b693
Fix beam search bug ( #29824 )
...
* fix beam search bug
* add dygraph unittest
* update dynamic_decode argument doc
* add warning info for state which has no lengths attribute
4 years ago
WeiXin
f43e1d8c57
Support storage of large parameters ( #29988 )
...
* Support storage of large parameters
* Reduce the complexity of the unittest
* Reduce the complexity of the unittest,commented out unittest for
* add unittest for static.save/load
* Increase the timeout threshold of 'test_static_save_load'
* Increase the timeout threshold of 'test_static_save_load'
* Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'
* Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'
4 years ago
chentianyu03
666e665132
change the kron gradient when complex types ( #29995 )
4 years ago
WangXi
ab04997846
[fleet] combine amp and gradient merge, test=develop ( #30086 )
4 years ago
wanghuancoder
88e6dc4ac5
optimize momentum to speedup dygraph, a little, test=develop ( #30099 )
4 years ago
Thunderbrook
0b8e1fadc5
add topo-aware in heter-ps ( #30087 )
...
* add topo aware
* resource.h
* topo aware
* format
4 years ago
gongweibao
eea7090c26
fix selected_gpus test=develop ( #30044 )
4 years ago
cc
1fa863da40
Support dygraph quant model ( #29927 )
...
* Avoid the scale to be infinity in quant2_int8_mkldnn_pass, test=develop
* support quantized model for paddle2.0 dygraph, test=develop
4 years ago
Chen Weihang
46c4695421
Set FLAGS_selected_gpus for spawn ( #29962 )
...
* set FLAGS_selected_gpus for spawn
* add cond for unittest
* Delete test_no_single_process_using_multi_gpus_in_spawn.py
* Update spawn.py
* Update nccl_context.cc
4 years ago
WangXi
ee16006b5d
Optimization grad merge performance ( #29784 )
4 years ago
xiaoting
4d395203a2
Add alias for upsample ( #29983 )
...
* add alias for upsample, test=develop
* add alias for upsample
* fix example
4 years ago
lilong12
9e51e3833f
update, test=develop ( #30047 )
4 years ago
chentianyu03
e012930aa3
complex gradient matmul ( #29966 )
...
* dot op support complex types
* matmul support complex types
* add test case
* matmul broadcast gradient support complex
* move conjFunctor to complex_functor.h
4 years ago
lilong12
b0bd93de00
Disable gloo by default ( #29805 )
...
* update, test=develop
4 years ago
ShenLiang
b6fd262951
fix gather nd for untest ( #30037 )
4 years ago
Leo Chen
a253a78a85
fix error message ( #30020 )
4 years ago
lilong12
2bc5121da8
add the paddle.distributed.split api ( #29970 )
...
* add distributed.split, test=develop
4 years ago
cc
c3c064a8fc
Add mkldnn nearest_interp and bilinear_interp op ( #30016 )
...
* Add mkldnn nearest_interp and bilinear_interp op
* don't run mkldnn interpolate in default
* add interpolate_mkldnn_pass
4 years ago
zhupengyang
65d4ff753b
hardsigmoid add attr slope and offset ( #29999 )
4 years ago
tangwei12
ed856d254e
fix ut ( #29989 )
...
* fix ut
Change-Id: I151e152919a1863db07792bffb42d0ca68995756
4 years ago
cc
62f455e023
Support quantizing program_desc ( #29526 )
...
* Support quantizing program_desc, test=develop
4 years ago
Chen Long
af37285870
fix code bugs ( #29932 )
...
* fix code bugs
* fix code bugs test=document_fix
* fix code bugs test=document_fix
4 years ago
guofei
8212874f47
Fix test_imperative_skip_out ( #29939 )
...
* Fix unittest:test_imperative_skip_out
4 years ago
LielinJiang
ec2fad4d51
Fix rotation bug when use cv2 backend ( #29933 )
...
* fix cv2 rotation
4 years ago
Chen Weihang
a1d9a14e89
support grad accumulated across batch ( #29942 )
4 years ago
liuyuhui
bb20dcfc1a
[Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor ( #29961 )
4 years ago
wawltor
587b67ef62
fix the state_dict bug for the xpu ( #29888 )
...
fix the state_dict bug for the xpu
4 years ago
QingshuChen
f4be9d6a32
add bkcl.so in whl for kunlun ( #29947 )
4 years ago
XiaoguangHu
726c78f293
clean redundant API alias in 2.0 - part 1 ( #29928 )
...
* rm check_import_scipy, rm chunk_eval and mean_iou in paddle.metric.__init__.py
* Revert "rm check_import_scipy, rm chunk_eval and mean_iou in paddle.metric.__init__.py"
This reverts commit 179ba8c2b22bc31fe8d8a126e31820792cbd0f4e.
* delete paddle.metric.chunk_eval and paddle.metric.mean_iou
* delete paddle.nn.clip and paddle.nn.clip_by_norm
* delete paddle.nn.functional.activation.hard_sigmoid and paddle.nn.functional.activation.hard_swish
* delete paddle.nn.Pool2D, paddle.nn.BilinearTensorProduct, paddle.nn.RowConv, paddle.nn.functional.row_conv
* fix extension import error
* fix unittest for row_conv and Pool2D
4 years ago
liym27
14bd77f941
[Windows CI test] Enable unittest test_optimizer_in_control_flow and remove unnecessary code ( #29851 )
4 years ago
Wilber
332da133a1
Support mips arch ( #29903 )
...
* Support MIPS arch.
4 years ago
littletomatodonkey
5c162fe66e
fix reg api ut fail ( #29921 )
4 years ago
Leo Chen
a4b9daf97c
fix optimizer dtype ( #29917 )
4 years ago
liuyuhui
4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor ( #29574 )
4 years ago
LielinJiang
0b74428db8
Fix Conv2DTranspose bug when padding='same' ( #29915 )
...
* fix conv_transpose bug when padding=same
4 years ago
LielinJiang
11de384c6d
Split callbacks unittest ( #29914 )
...
* split callback unittest
* rm test_callback from timeout list
4 years ago
lilong12
01950ceb42
fix the bug in pipeline data parallelism ( #29731 )
...
* update, test=develop
4 years ago
YUNSHEN XIE
2a01756bf3
remove duplicate ut names ( #29809 )
4 years ago
Chen Weihang
a6072055be
[Complex] Handle complex to real after type promotion ( #29855 )
...
* try to add fwd op input dtypes
* refactor base impl
* return tmp_ins after dygraph prepare data
* fix typo found in debug
* polish comment & add complex net test
* revert detail change
* fix unittest failed
* add complex kernel condition control
* fix xpu test failed & polish comment
* polish details by review comments
4 years ago
Chen Weihang
1a304e6c06
[Complex] Add support for complex grad accumulated ( #29889 )
...
* add support for complex grad accumulated
* add unittest for coverage
* update test dtype
* remove useless blank line
4 years ago
guofei
80eb77788f
Skip Windows Multi-GPU test of test_fetch_lod_tensor_array ( #29508 )
...
* Fix Windows unittest of test_fetch_lod_tensor_array
4 years ago
Leo Chen
6b258317cb
fix TransferInplaceBack ( #29830 )
4 years ago
QingshuChen
59b47f3b32
feat: support check_nan_inf for kunlun/xpu device ( #29694 )
...
* feat: support check_nan_inf for kunlun device
* support kunlun stack
* minor
4 years ago
wawltor
7498df2587
add the cumsum unit test for the develop ( #29881 )
4 years ago
wanghuancoder
26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test ( #29739 )
...
* reopen python coverage --include for test, test=develop
* if no .py file modified, not use coverage run, test=develop
* remove test code, test=develop
* add WITH_INCREMENTAL_COVERAGE, test=develop
* refine if else, test=develop
4 years ago
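The idea in the #29739 message above (only pay for coverage collection when a PR touches Python files) can be sketched as follows. The actual CI script is not shown in this log, so the function name `build_test_command` and its arguments are hypothetical; the only external tool assumed is coverage.py's `coverage run` command.

```python
from typing import List


def build_test_command(changed_files: List[str], test_script: str) -> List[str]:
    """Return the command used to run one Python unit test.

    Hypothetical helper: wrap the test in `coverage run` only when the
    change set contains at least one .py file.
    """
    touches_python = any(path.endswith(".py") for path in changed_files)
    if touches_python:
        return ["python", "-m", "coverage", "run", test_script]
    # No Python source changed, so run the test directly and skip the
    # overhead of coverage tracing.
    return ["python", test_script]


print(build_test_command(["paddle/fluid/framework/tensor.cc"], "test_slice.py"))
print(build_test_command(["python/paddle/nn/functional/loss.py"], "test_slice.py"))
```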
Tao Luo
5d130d5670
Revert "fix conv2d int8 windows UT ( #29528 )" ( #29869 )
...
This reverts commit 067d7f1d0d.
4 years ago
tangwei12
032414ca2a
[Feature] one ps (3/4) ( #29604 )
...
* oneps (3/4)
Co-authored-by: MrChengmo <cmchengmo@163.com>
Co-authored-by: malin10 <malin10@baidu.com>
Co-authored-by: chengmo <chengmo@baidu.com>
4 years ago
jakpiase
edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) ( #29772 )
4 years ago
Chen Weihang
0e0bb1b97d
replace exit method ( #29862 )
4 years ago
lidanqing
067d7f1d0d
fix conv2d int8 windows UT ( #29528 )
4 years ago
liym27
97e75ad0f5
[setitem] Support Tensor setitem in static mode ( #29708 )
...
1. Type of index: int, slice(step must be 1).
2. Type of value:
(1) int32, int64, float32, bool;
(2) numpy.array(int32, int64, float32, bool);<Note: float64 is not supported>
(3) paddle.Tensor(int32, int64, float32, float64, bool);
4 years ago
YUNSHEN XIE
24ce051a84
remove duplicate ut reload ( #29810 )
...
* remove duplicate ut reload
* remove duplicate ut define in cmakelist
4 years ago
Thunderbrook
09b6e71928
heter box ( #29734 )
...
* add heter box
* add trainer, worker, wrapper...
* format
* for ci
* format
* remove boost get
* boost & copyright
* rename
* rename
* format
* format
* format
Co-authored-by: yaoxuefeng6 <yaoxuefeng@baidu.com>
4 years ago
LielinJiang
1092da82b2
Change the conditions of hapi printing logs ( #29792 )
...
* update condition of logger print
4 years ago
ceci3
c4eb5d0378
fix unittest timeout ( #29820 )
4 years ago
chentianyu03
ddfc3d2c2f
change grad elementwise_mul for complex types ( #29757 )
...
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* user define grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
* delete no need to calculate inputs in dygraph op_test
* delete no need to calculate inputs in dygraph op_test
* modify grad of mul for complex types
* fix the grads of inputs args order not match bug
4 years ago
chentianyu03
2a260d9b0e
change the grad of div when complex types ( #29804 )
...
* change the grad of div when complex types
* fix the grads of inputs args order not match bug
4 years ago
syyxsxx
e219b8ccef
fix api link for the any, all, isfinite
...
fix api link for the any, all, isfinite
4 years ago
Guo Sheng
356efd36fa
Remove test_rnn_decode_api from disable list. ( #29814 )
...
test=develop
4 years ago
TTerror
82aa01c373
add nearest_interp_v2 on kunlun ( #29725 )
...
* add nearest_interp_v2 on kunlun
4 years ago
yukavio
0f97ff0368
fix flops ( #29818 )
4 years ago
whs
82630408b4
Support double backward rsqrt ( #29589 )
4 years ago
cc
61820fd217
add the time threshold of quantization tests, test=develop ( #29786 )
4 years ago
xiaoting
55725cd2e1
fix for timeout, test=develop ( #29788 )
4 years ago
LielinJiang
a94c3cbbf3
register cudnn conv double grad for depthwise conv ( #29807 )
4 years ago
ShenLiang
01e2874a0e
Support multi-stream communication for dynamic graph distributed ( #29525 )
...
* fix fleet for multi-stream
* fix memcpy for ncclid
* use sync to solve move operation
4 years ago
huangxu96
a29006d128
Optimizer trans momentum ( #29597 )
...
* merge amp related function in Momentum from paddle.fluid.contrib.optimizer into paddle.optimizer.
* Add unittest for 2.0 Momentum API.
* fix some bugs in weight_decay.
4 years ago
liym27
0cc42e34c6
Migrate 4 APIs about array to paddle.tensor.* ( #29565 )
...
4 APIs: array_length, array_read, array_write, create_array
4 years ago
yukavio
96934b7430
fix flops ( #29758 )
...
* fix flops
4 years ago
liym27
41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created ( #29769 )
4 years ago
LielinJiang
e5af650b71
Add double grad for conv_transpose ( #29706 )
...
* add double grad for conv_transpose
4 years ago
huangxu96
97e29411eb
fix a bug in multi_precision_fp16 unittest. ( #29756 )
4 years ago
Wojciech Uss
6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations ( #28420 )
...
* upgrade oneDNN with GRU INT8 optimizations
* fix test
4 years ago
Huihuang Zheng
dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running ( #29579 )
...
Enable jit.save to Save Without Running.
4 years ago
liym27
a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] ( #29541 )
...
Support transforming `for ele in var` statements in which var is a slice of a Tensor.
4 years ago
chentianyu03
b59b6d7ae6
Complex op test ( #29753 )
...
* delete no need to calculate inputs in dygraph op_test
4 years ago
liym27
096c048b45
Fix unittest test_slice ( #29740 )
...
Before this commit, test_slice used the old API `dygraph_to_static_func` for Dynamic-to-Static and used Executor explicitly, which is not recommended to users.
After the fix, it uses the recommended API `paddle.jit.to_static` in place of `dygraph_to_static_func`, which won't trigger the random exception on coverage CI.
4 years ago
Huihuang Zheng
2e788bd81e
Reduce batch size to fix CPU memory, test=develop ( #29736 )
...
Unit test reported memory not enough on CPU machines. Reduce batch size again.
4 years ago
LielinJiang
10edfb6f21
Update en docs of to_tensor ( #29718 )
...
* update to_tensor en docs
4 years ago