Leo Chen
1f97d61c68
Add callback after TensorCopy ( #30123 )
...
* change to tensor copy sync
* change to tensor copy sync
* make copy_to safe when use TensorCopy
* refine code
* add ut
* add cudapinned garbagecollector
* add testcase: cpu place -> cuda pinned place
4 years ago
liym27
b2483d78a8
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block( #30168 )
...
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
4 years ago
Chengmo
528e03fc08
【Paddle.Fleet】Fix tensor table ( #30075 )
...
* add tensor table
4 years ago
guofei
1bdf924217
Quantization supports 2.0 APIs ( #30036 )
...
* Quantization supports 2.0 APIs
* Fix the error of save_quantized_model
4 years ago
Chen Weihang
d0fb06b27f
[Complex] Simplify prepared op impl to improve performance ( #30153 )
...
* simplify prepared op impl to improve performance
* fix kunlun compile error
* continue fix kunlun compile error
* only transform diff place when dtype diff
* fix failed unittests
* remove useless file
* polish impl by review comment
4 years ago
Chen Weihang
e503470700
try multi times for sys.exit ( #30188 )
4 years ago
WangXi
619c62bb48
fix adamw apply gradient ( #30130 )
4 years ago
LutaoChu
1ff69f58b6
fix paddle.pow doc, test=document_fix ( #30159 )
4 years ago
wangchaochaohu
7dd551e08b
refine the paddle place support using str ( #28769 )
4 years ago
Chen Weihang
8020e34e7c
Simplify the options of spawn based on fleetrun ( #30144 )
...
* Simplify the options of spawn based on fleetrun
* polish details
* polish doc details
4 years ago
tangwei12
4763e6bc4e
pre padding in dygraph ( #30163 )
...
Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
4 years ago
123malin
198fbdfb60
Add Lookahead and ModelAverage Optimizer ( #30004 )
...
* test=develop, add model_average and lookahead
4 years ago
ceci3
6a19e41f1f
fix syncbn convert ( #30158 )
...
* fix syncbn convert
* add unittest
4 years ago
Leo Chen
adac38c506
add dispensable input for core.ops.reshape2/expand/slice ( #30072 )
...
* add dispensable input 'shape' for core.ops.reshape2
* add dispensable inputs for core.ops.reshape2/expand/slice
* add ut
4 years ago
Zhou Wei
30888ca343
Polish and Optimize the print/repr information of Layer ( #29998 )
...
* Polish and Optimize the print/repr message of all layer
* fix some code format
4 years ago
WeiXin
f3a2392662
Extend the timeout for the ( #30151 )
4 years ago
Zhou Wei
9c99d37906
fix unittest failed on windows ( #29837 )
4 years ago
liym27
9922bd4125
Fix bug: In dynamic mode, if start or end is negative, __getitem__ returns wrong result ( #30003 )
...
1. when slice_item is a slice:
1) the start of __getitem__ should be std::max(start, 0)
2) the end of __getitem__ should be std::min(end, dim)
2. when slice_item is an integer, it should be in [-dim_len, dim_len)
3. Fix error message to use accurate data
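The clamping rules above can be sketched in plain Python. This is only an illustration, not the C++ code from the PR: the helper names `normalize_slice` and `check_index` are hypothetical, and wrapping a negative bound by adding the dimension length before clamping is an assumption borrowed from ordinary Python slicing.

```python
def normalize_slice(start, end, dim_len):
    """Clamp slice bounds into [0, dim_len] (hypothetical helper)."""
    # Assumption: negative bounds count from the end, as in Python lists.
    if start < 0:
        start += dim_len
    if end < 0:
        end += dim_len
    # Rules 1) and 2) from the commit message.
    start = max(start, 0)
    end = min(end, dim_len)
    return start, end


def check_index(index, dim_len):
    """Validate an integer index against [-dim_len, dim_len) (rule 2.)."""
    if not -dim_len <= index < dim_len:
        raise IndexError(
            f"index {index} is out of range [{-dim_len}, {dim_len})"
        )
    # Map a valid negative index onto its non-negative position.
    return index + dim_len if index < 0 else index
```

With these bounds, a slice whose start lies far before the beginning of the dimension selects from element 0, which matches what a Python list does for the same slice.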
4 years ago
gongweibao
4d2a4bb27a
fix logs info test=develop ( #30071 )
4 years ago
ceci3
a125d6331f
fix bn docs ( #30096 )
4 years ago
ceci3
334247791a
add attribute for batch_norm ( #29950 )
...
* add attribute for batch_norm
4 years ago
Jiaqi Liu
2e8425b693
Fix beam search bug ( #29824 )
...
* fix beam search bug
* add dygraph unittest
* update dynamic_decode argument doc
* add warning info for state which has no lengths attribute
4 years ago
WeiXin
f43e1d8c57
Support storage of large parameters ( #29988 )
...
* Support storage of large parameters
* Reduce the complexity of the unittest
* Reduce the complexity of the unittest,commented out unittest for
* add unittest for static.save/load
* Increase the timeout threshold of 'test_static_save_load'
* Increase the timeout threshold of 'test_static_save_load'
* Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'
* Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'
4 years ago
chentianyu03
666e665132
change the kron gradient when complex types ( #29995 )
4 years ago
WangXi
ab04997846
[fleet] combine amp and gradient merge, test=develop ( #30086 )
4 years ago
wanghuancoder
88e6dc4ac5
optimize momentum to speedup dygraph, a little, test=develop ( #30099 )
4 years ago
Thunderbrook
0b8e1fadc5
add topo-aware in heter-ps ( #30087 )
...
* add topo aware
* resource.h
* topo aware
* format
4 years ago
gongweibao
eea7090c26
fix selected_gpus test=develop ( #30044 )
4 years ago
cc
1fa863da40
Support dygraph quant model ( #29927 )
...
* Avoid the scale to be infinity in quant2_int8_mkldnn_pass, test=develop
* support quantized model for paddle2.0 dygraph, test=develop
4 years ago
Chen Weihang
46c4695421
Set FLAGS_selected_gpus for spawn ( #29962 )
...
* set FLAGS_selected_gpus for spawn
* add cond for unittest
* Delete test_no_single_process_using_multi_gpus_in_spawn.py
* Update spawn.py
* Update nccl_context.cc
4 years ago
WangXi
ee16006b5d
Optimization grad merge performance ( #29784 )
4 years ago
xiaoting
4d395203a2
Add alias for upsample ( #29983 )
...
* add alias for upsample, test=develop
* add alias for upsample
* fix example
4 years ago
lilong12
9e51e3833f
update, test=develop ( #30047 )
4 years ago
chentianyu03
e012930aa3
complex gradient matmul ( #29966 )
...
* dot op support complex types
* matmul support complex types
* add test case
* matmul broadcast gradient support complex
* move conjFunctor to complex_functor.h
4 years ago
lilong12
b0bd93de00
Disable gloo by default ( #29805 )
...
* update, test=develop
4 years ago
ShenLiang
b6fd262951
fix gather nd for unittest ( #30037 )
4 years ago
Leo Chen
a253a78a85
fix error message ( #30020 )
4 years ago
lilong12
2bc5121da8
add the paddle.distributed.split api ( #29970 )
...
* add distributed.split, test=develop
4 years ago
cc
c3c064a8fc
Add mkldnn nearest_interp and bilinear_interp op ( #30016 )
...
* Add mkldnn nearest_interp and bilinear_interp op
* don't run mkldnn interpolate in default
* add interpolate_mkldnn_pass
4 years ago
zhupengyang
65d4ff753b
hardsigmoid add attr slope and offset ( #29999 )
4 years ago
tangwei12
ed856d254e
fix ut ( #29989 )
...
* fix ut
Change-Id: I151e152919a1863db07792bffb42d0ca68995756
4 years ago
cc
62f455e023
Support quantizing program_desc ( #29526 )
...
* Support quantizing program_desc, test=develop
4 years ago
Chen Long
af37285870
fix code bugs ( #29932 )
...
* fix code bugs
* fix code bugs test=document_fix
* fix code bugs test=document_fix
4 years ago
guofei
8212874f47
Fix test_imperative_skip_out ( #29939 )
...
* Fix unittest:test_imperative_skip_out
4 years ago
LielinJiang
ec2fad4d51
Fix rotation bug when use cv2 backend ( #29933 )
...
* fix cv2 rotation
4 years ago
Chen Weihang
a1d9a14e89
support grad accumulated across batch ( #29942 )
4 years ago
liuyuhui
bb20dcfc1a
[Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor ( #29961 )
4 years ago
wawltor
587b67ef62
fix the state_dict bug for the xpu ( #29888 )
...
fix the state_dict bug for the xpu
4 years ago
QingshuChen
f4be9d6a32
add bkcl.so in whl for kunlun ( #29947 )
4 years ago
XiaoguangHu
726c78f293
clean redundant API alias in 2.0 - part 1 ( #29928 )
...
* rm check_import_scipy, rm chunk_eval and mean_iou in paddle.metric.__init__.py
* Revert "rm check_import_scipy, rm chunk_eval and mean_iou in paddle.metric.__init__.py"
This reverts commit 179ba8c2b22bc31fe8d8a126e31820792cbd0f4e.
* delete paddle.metric.chunk_eval and paddle.metric.mean_iou
* delete paddle.nn.clip and paddle.nn.clip_by_norm
* delete paddle.nn.functional.activation.hard_sigmoid and paddle.nn.functional.activation.hard_swish
* delete paddle.nn.Pool2D, paddle.nn.BilinearTensorProduct, paddle.nn.RowConv, paddle.nn.functional.row_conv
* fix extension import error
* fix unittest for row_conv and Pool2D
4 years ago
liym27
14bd77f941
[Windows CI test] Enable unittest test_optimizer_in_control_flow and remove unnecessary code ( #29851 )
4 years ago
Wilber
332da133a1
Support mips arch ( #29903 )
...
* Support MIPS arch.
4 years ago
littletomatodonkey
5c162fe66e
fix reg api ut fail ( #29921 )
4 years ago
Leo Chen
a4b9daf97c
fix optimizer dtype ( #29917 )
4 years ago
liuyuhui
4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor ( #29574 )
4 years ago
LielinJiang
0b74428db8
Fix Conv2DTranspose bug when padding='same' ( #29915 )
...
* fix conv_transpose bug when padding=same
4 years ago
LielinJiang
11de384c6d
Split callbacks unittest ( #29914 )
...
* split callback unittest
* rm test_callback from timeout list
4 years ago
lilong12
01950ceb42
fix the bug in pipeline data parallelism ( #29731 )
...
* update, test=develop
4 years ago
YUNSHEN XIE
2a01756bf3
remove duplicate ut names ( #29809 )
4 years ago
Chen Weihang
a6072055be
[Complex] Handle complex to real after type promotion ( #29855 )
...
* try to add fwd op input dtypes
* refactor base impl
* return tmp_ins after dygraph prepare data
* fix typo found in debug
* polish comment & add complex net test
* revert detail change
* fix unittest failed
* add complex kernel condition control
* fix xpu test failed & polish comment
* polish details by review comments
4 years ago
Chen Weihang
1a304e6c06
[Complex] Add support for complex grad accumulated ( #29889 )
...
* add support for complex grad accumulated
* add unittest for coverage
* update test dtype
* remove useless blank line
4 years ago
guofei
80eb77788f
Skip Windows Multi-GPU test of test_fetch_lod_tensor_array ( #29508 )
...
* Fix Windows unittest of test_fetch_lod_tensor_array
4 years ago
Leo Chen
6b258317cb
fix TransferInplaceBack ( #29830 )
4 years ago
QingshuChen
59b47f3b32
feat: support check_nan_inf for kunlun/xpu device ( #29694 )
...
* feat: support check_nan_inf for kunlun device
* support kunlun stack
* minor
4 years ago
wawltor
7498df2587
add the cumsum unit test for the develop ( #29881 )
4 years ago
wanghuancoder
26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test ( #29739 )
...
* reopen python coverage --include for test, test=develop
* if no .py file modified, not use coverage run, test=develop
* remove test code, test=develop
* add WITH_INCREMENTAL_COVERAGE, test=develop
* refine if else, test=develop
4 years ago
Tao Luo
5d130d5670
Revert "fix conv2d int8 windows UT ( #29528 )" ( #29869 )
...
This reverts commit 067d7f1d0d.
4 years ago
tangwei12
032414ca2a
[Feature] one ps (3/4) ( #29604 )
...
* oneps (3/4)
Co-authored-by: MrChengmo <cmchengmo@163.com>
Co-authored-by: malin10 <malin10@baidu.com>
Co-authored-by: chengmo <chengmo@baidu.com>
4 years ago
jakpiase
edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) ( #29772 )
4 years ago
Chen Weihang
0e0bb1b97d
replace exit method ( #29862 )
4 years ago
lidanqing
067d7f1d0d
fix conv2d int8 windows UT ( #29528 )
4 years ago
liym27
97e75ad0f5
[setitem] Support Tensor setitem in static mode ( #29708 )
...
1. Type of index: int, slice(step must be 1).
2. Type of value:
(1) int32, int64, float32, bool;
(2) numpy.array(int32, int64, float32, bool);<Note: float64 is not supported>
(3) paddle.Tensor(int32, int64, float32, float64, bool);
4 years ago
YUNSHEN XIE
24ce051a84
remove duplicate ut reload ( #29810 )
...
* remove duplicate ut reload
* remove duplicate ut define in cmakelist
4 years ago
Thunderbrook
09b6e71928
heter box ( #29734 )
...
* add heter box
* add trainer, worker, wrapper...
* format
* for ci
* format
* remove boost get
* boost & copyright
* rename
* rename
* format
* format
* format
Co-authored-by: yaoxuefeng6 <yaoxuefeng@baidu.com>
4 years ago
LielinJiang
1092da82b2
Change the conditions of hapi printing logs ( #29792 )
...
* update condition of logger print
4 years ago
ceci3
c4eb5d0378
fix unittest timeout ( #29820 )
4 years ago
chentianyu03
ddfc3d2c2f
change grad elementwise_mul for complex types ( #29757 )
...
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* user define grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
* delete no need to calculate inputs in dygraph op_test
* delete no need to calculate inputs in dygraph op_test
* modify grad of mul for complex types
* fix the grads of inputs args order not match bug
4 years ago
chentianyu03
2a260d9b0e
change the grad of div when complex types ( #29804 )
...
* change the grad of div when complex types
* fix the grads of inputs args order not match bug
4 years ago
syyxsxx
e219b8ccef
fix api link for the any, all, isfinite
...
fix api link for the any, all, isfinite
4 years ago
Guo Sheng
356efd36fa
Remove test_rnn_decode_api from disable list. ( #29814 )
...
test=develop
4 years ago
TTerror
82aa01c373
add nearest_interp_v2 on kunlun ( #29725 )
...
* add nearest_interp_v2 on kunlun
* add nearest_interp_v2 on kunlun
4 years ago
yukavio
0f97ff0368
fix flops ( #29818 )
4 years ago
whs
82630408b4
Support double backward rsqrt ( #29589 )
4 years ago
cc
61820fd217
add the time threshold of quantization tests, test=develop ( #29786 )
4 years ago
xiaoting
55725cd2e1
fix for timeout, test=develop ( #29788 )
4 years ago
LielinJiang
a94c3cbbf3
register cudnn conv double grad for depthwise conv ( #29807 )
4 years ago
ShenLiang
01e2874a0e
Support multi-stream communication for dynamic graph distributed ( #29525 )
...
* fix fleet for multi-stream
* fix memcpy for ncclid
* use sync to solve move operation
4 years ago
huangxu96
a29006d128
Optimizer trans momentum ( #29597 )
...
* merge amp related function in Momentum from paddle.fluid.contrib.optimizer into paddle.optimizer.
* Add unittest for 2.0 Momentum API.
* fix some bugs in weight_decay.
4 years ago
liym27
0cc42e34c6
Migrate 4 APIs about array to paddle.tensor.* ( #29565 )
...
4 APIs: array_length, array_read, array_write, create_array
4 years ago
yukavio
96934b7430
fix flops ( #29758 )
...
* fix flops
* fix flops
4 years ago
liym27
41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created ( #29769 )
4 years ago
LielinJiang
e5af650b71
Add double grad for conv_transpose ( #29706 )
...
* add double grad for conv_transpose
4 years ago
huangxu96
97e29411eb
fix a bug in multi_precision_fp16 unittest. ( #29756 )
4 years ago
Wojciech Uss
6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations ( #28420 )
...
* upgrade oneDNN with GRU INT8 optimizations
* fix test
4 years ago
Huihuang Zheng
dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running ( #29579 )
...
Enable jit.save to Save Without Running.
4 years ago
liym27
a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] ( #29541 )
...
Support transforming `for ele in var` statements in which `var` is a slice of a Tensor.
4 years ago
chentianyu03
b59b6d7ae6
Complex op test ( #29753 )
...
* delete no need to calculate inputs in dygraph op_test
* delete no need to calculate inputs in dygraph op_test
4 years ago
liym27
096c048b45
Fix unittest test_slice ( #29740 )
...
Before this commit, test_slice used the old API `dygraph_to_static_func` for Dynamic-to-Static and used Executor explicitly, which is not recommended to users.
After the fix, it uses the recommended API `paddle.jit.to_static` in place of `dygraph_to_static_func`, which won't trigger the random exception on coverage CI.
4 years ago
Huihuang Zheng
2e788bd81e
Reduce batch size to fix CPU memory, test=develop ( #29736 )
...
Unit test reported memory not enough on CPU machines. Reduce batch size again.
4 years ago
LielinJiang
10edfb6f21
Update en docs of to_tensor ( #29718 )
...
* update to_tensor en docs
4 years ago
chentianyu03
71063b8137
add conj op for complex types ( #29527 )
...
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* user define grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
4 years ago
WangXi
9cbcc6cadc
fleet sync build strategy, test=develop ( #29732 )
4 years ago
Chen Weihang
6cfa59de1b
[Complex] Add real & imag op and api for complex tensor ( #29672 )
...
* add complex real op & api & unittest
* add imag op & api & unittest
* refactor op impl
* revert simplify writing due to compile failed
* polish details
* polish grad op code
4 years ago
LiuChiachi
572810eecb
Update EarlyStopping sample code ( #29723 )
...
* update EarlyStopping doc
* update EarlyStopping doc, test=document_fix
4 years ago
TTerror
af8ded773a
update activation op on kunlun ( #29577 )
...
* fix expand && concat/transpose to new api
* update xpu_header
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* add nearest_interp on kunlun
* update error message
4 years ago
ceci3
cc387159f3
add pad and concat double grad ( #29549 )
...
* add constant pad double grad
4 years ago
liuyuhui
f13c3a9cd7
[Kunlun] PR1:Support one Kunlun card training in parallel executor ( #29337 )
4 years ago
huangxu96
b96dada4f0
add static.amp into setup.py.in ( #29621 )
...
* add static.amp into setup.py.in
* add unittest for api
4 years ago
YUNSHEN XIE
d0b789d27f
disable ut test_cumsum_op ( #29613 )
4 years ago
Jack Zhou
84bae27779
fix wmt14 doc, remove backward, add bidirect direction in rnn api ( #29633 )
...
* fix wmt14 doc, remove backward, add bidirect direction in rnn api
* fix rnn unittest
* fix test_rnn_nets_static.py bug
4 years ago
YUNSHEN XIE
2926e74326
New UT should not exceed 15s ( #29492 )
...
* added UT should not exceed 15s
* fix error
* UT limit of 15s is the first to be executed
* fix error
* fix error with CI_SKIP_CPP_TEST
* modified timeout setting
* fix error
4 years ago
Chen Weihang
f02aece1f0
Add complex dtype op (add) test example ( #29603 )
...
* add op test case for complex
* polish code details
* add xpu set constant support
* fix argument error
* remove useless pyc file
4 years ago
AshburnLee
efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS ( #28732 )
4 years ago
lijianshe02
7779768b53
add transpose double grad test=develop ( #29600 )
...
* add transpose double grad test=develop
4 years ago
huangxu96
c05170d3d8
add alias for fluid.contrib.mixed_precision ( #29562 )
...
* add alias for fluid.contrib.mixed_precision
4 years ago
ShenLiang
fb6697b424
Fix the download bug in the case of multiple machines ( #29551 )
...
* fix the download bug
* add sort for ips
4 years ago
ShenLiang
1efef8baed
Fix bug of matmul_v2 for broadcast case ( #29599 )
...
* fix bug of matmul_v2 for broadcast
4 years ago
qingqing01
8d549fc85d
Add clip double grad ( #29590 )
4 years ago
Tao Luo
81acc3278c
disable test_parallel_executor_profiler in cuda 10.1 ( #29581 )
...
* disable test_parallel_executor_profiler in cuda 10.1
* update set_tests_properties
4 years ago
wangchaochaohu
ac4bae8ee9
elementwise_add_grad Op optimization ( #29575 )
4 years ago
huangxu96
2cb6f94888
add float16 into adaptive_avg_pool2d check list. ( #29547 )
4 years ago
yukavio
ee1a7d020c
add some feature for paddle.flops ( #29572 )
4 years ago
WangXi
467c716963
gen nccl id use socket ( #29431 )
4 years ago
Bai Yifan
d72604cd46
fix unittest unstable issue on ci machine ( #29588 )
...
* fix unittest unstable issue on ci machine
* fix unittest unstable issue on ci machine
* fix unittest unstable issue on ci machine
4 years ago
QingshuChen
79a41a9ed6
support roi_align & affine_channel for kunlun ( #29561 )
...
* support roi_align & affine_channel for kunlun
* minor
4 years ago
liym27
0cad1152f4
[Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step value is negative in for-range stmts ( #29519 )
...
1. Fix error in _build_cond_stmt of for-range stmts.
2. Support that step value is negative in for-range stmts
3. Fix code because of the diff between Py2 and Py3
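As a rough illustration of why the loop condition has to depend on the sign of the step when a for-range statement is rewritten as a while loop, consider the following sketch. It is not the `_build_cond_stmt` implementation from the PR; `for_range_as_while` is a hypothetical helper that mirrors Python's `range` semantics.

```python
def for_range_as_while(start, end, step):
    """Emulate `for i in range(start, end, step)` with a while loop."""
    if step == 0:
        raise ValueError("step must not be zero")
    visited = []
    i = start
    # A positive step counts up towards `end`; a negative step counts
    # down, so the comparison has to flip from `<` to `>`.
    while (i < end) if step > 0 else (i > end):
        visited.append(i)
        i += step
    return visited
```

Using `i < end` unconditionally would make a loop such as `range(5, 0, -2)` exit before its first iteration, which is the kind of behavior a negative-step fix has to avoid.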
4 years ago
Huihuang Zheng
831e9135b9
Fix Windows Unittest ( #29543 )
...
Fix 3 Windows Unittests
test_fuse_all_reduce_pass: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_feed_data_check_shape_type: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_tsm: Windows GPU size is not enough so decrease batch size and data size.
4 years ago
GeminiCarrie
08f24a3108
Fix precision problem ( #29567 )
...
* Fix a bug when running on an operating system without "bash."
* add execution condition
* for ci-coverage
* get cpu information to check the precision problem
* Update compilation environment for musl version
* update dependencies
* remove test code
check cpu info
remove test code
review
* update alpine and third_party dependencies
* add newline for ci Code format
4 years ago
JZ-LIANG
d33d468f02
[Sharding] add hybrid-dp feature ( #29518 )
...
* Sharding add hybrid-dp feature
* update sharding in distributed_strategy
* update sharding unittest
* revise code format for sharding
4 years ago
Chen Weihang
c1a26e2a05
fix train eval set error in static mode ( #29540 )
4 years ago
taixiurong
760d015c14
add xpu ops for training transformer in kunlun ( #29539 )
...
* 1.fix matmul bug 2. add one hot
* add xpu error msg
4 years ago
Leo Chen
0fdd365665
Add fast path for dropout when p == 0 ( #29553 )
...
* add fast path for p == 0 in dropout
* add ut
4 years ago
Wojciech Uss
917a11495f
fix infinite scale values ( #29386 )
4 years ago
lijianshe02
bd29052e33
fix random seed in nll_loss unittest test=develop ( #29538 )
...
* fix random seed in nll_loss unittest test=develop
4 years ago
joanna.wozna.intel
0ce6d7fa77
Fix bf16 activations test for softmax and gelu ( #29502 )
...
* Fix bf16 activations test for softmax and gelu
* Resolve conflict
4 years ago
huangxu96
4001979309
Add ReserveSpace in dygraph batch_norm. ( #29221 )
...
* Add ReserveSpace in dygraph batch_norm.
* Add unittest for reservespace
4 years ago
arlesniak
b781953ef5
[oneDNN] Fix flags use test for #29080 , assert condition more general ( #29493 )
...
* Flags assert condition more general, print output if pattern not found
* removed test_flags_use_mkldnn from skip list regarding #29080 descr
4 years ago
Zhen Wang
5ac71b36fb
Remove tensor copy in the update_loss_scaling op. ( #29426 )
...
* remove tensor copy in the update_loss_scaling op
* not use thrust.
* fix some cuda memory access error.
4 years ago
Zhou Wei
e74e1a226c
support deepcopy for Layer/Tensor/ParamBase ( #29387 )
...
* support deepcopy for Layer/Tensor/ParamBase
* fix some code
4 years ago
joejiong
50d3117d30
Add random_split and Subset dataset ( #29291 )
...
As the title
4 years ago
joejiong
87e75a77c2
Add tangent operator ( #29207 )
...
As the title
4 years ago
Wei Shengyu
dc8bb76c68
remove addcmul ( #28937 )
...
* remove addcmul
* remove unittest and other related code of addcmul
* fix bug
* fix merge conflict
4 years ago
Zhong Hui
f459dd9634
fix abs double grad unittest ( #29478 )
...
fix abs double grad unittest & define the data range for the abs double grad
4 years ago
huangxu96
576d0d938b
add fp16 check into max and avg pool ( #29479 )
4 years ago
ShenLiang
2ef9e0e23c
Rebuild group automatically in dynamic graph distributed ( #29255 )
...
* add tensor_indices in AssignGroupBySize
* add rebuild group in reducer
4 years ago
procr
3a0558339d
support mobilenet for kunlun ( #29458 )
4 years ago
Aurelius84
5d530c9319
fix amp support fleet ( #29491 )
4 years ago
ShenLiang
311b3b44fc
Fix the bug where embedding can't be processed correctly in reducer ( #29485 )
...
* fix the bug of reducer in embedding
* add comment
4 years ago
Pei Yang
2480bdef6c
change hard_swish from plugin to layer ( #29177 )
...
* change hard_swish from plugin to layer
* add ut when threshold != scale
4 years ago
lilong12
b122d0bb76
Fix bug in gloo that gloo initialization hangs ( #29447 )
...
* update, test=develop
4 years ago
taixiurong
ecca6585cd
1. fix elementwise ops' bug 2. fix softmax_with_cross_entropy_op 3. add bilinear_interp_op ( #29448 )
...
Co-authored-by: root <root@bjhw-sys-rpm0223.bjhw.baidu.com>
4 years ago
LoveAn
03b42d9fa7
fix unittest on windows, test=develop ( #29365 )
4 years ago
ShenLiang
22e6b9e373
Fix the ut of matmulv2 for broadcast case ( #29461 )
...
* fix the ut of matmulv2 for broadcast
4 years ago
TTerror
a5fcc4b545
update reduce_sum op on xpu ( #29367 )
...
* update reduce_sum op on xpu
* update reduce_sum op on xpu
* support running on xpu
4 years ago
chentianyu03
acce962133
remove complex module direction ( #29419 )
4 years ago
Zhang Ting
6296f4ed09
revert cast eigen kernel ( #29427 )
4 years ago
Leo Chen
a040c055a5
fix layer_norm accuracy ( #29434 )
4 years ago
Shang Zhizhou
225a9c4ed8
Fix unittest ( #29412 )
...
* fix tensorrt unittest precision error
* fix unittest precision error. test_trt_subgraph_pass && test_trt_dynamic_shape_transformer_prune
4 years ago
Pei Yang
f860de4af7
support clip op trt converter ( #29411 )
4 years ago
Bai Yifan
87bb726258
Add deform_conv2d,DeformConv2D ( #29364 )
...
* add deform_conv2d,DeformConv2D
4 years ago
chentianyu03
64e4e17f0c
remove complexvariable ( #29390 )
...
* rm complexvariable
* modify test_var_base unittest
* remove duplicated codes
4 years ago
chajchaj
79e6086743
change shape of output in cross_entropy, test=develop ( #29220 )
4 years ago
liuyuhui
2ee7a6b08c
[paddle v2.0.0rc1: API fixes] assign/conv2d/conv2d_transpose/cast/ParamAttr ( #29171 )
...
* fix DLTP-15151, paddle.ParamAttr API
* fix DLTP-15083/DLTP-15274, paddle.nn.functional.assign paddle.cast API
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
* fix DLTP-15083, paddle.nn.functional.assign API
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
* support in_dygraph_mode for cast op, test=develop
* fix bug,test=develop
* fix doc
* fix DLTP-15431/DLTP-15432, paddle.static.nn.conv2d paddle.static.nn.conv2d_transpose API
4 years ago
Guo Sheng
8fc7f1b66a
Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr ( #29235 )
...
* Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr.
test=develop
* Fix api doc for print in label_smooth.
test=develop
* Update api docs according to review comments.
Add name argument in RNN back.
test=develop
4 years ago
Chen Long
c940f842ca
remove rarfile from requirements ( #29319 )
4 years ago
yongqiangma
7c508d8668
update unbind norm add CUDAPlace api doc information ( #29322 )
...
* enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop
* fix format. test=develop
* format fix. test=develop
* add lod_rank_table. test=develop
* fix format. test=develop
* fix doc info. test=develop
* fix np error
* add unbind dygraph api. test=develop
* fix unbind doc.test=develop
4 years ago
chentianyu03
879e913b6d
Make transpose, trace, kron, reshape, sum op support complex type ( #29321 )
...
* add complex64 and complex128 type; add +-*/@ and slice operator for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* kron, reshape, transpose support complex types
* sum and trace op support complex types
* add test case of sum and trace op
* fix the bug of imag part of complex not initialized
* format file
* format code style
* kron support type promotion; modify test cases
4 years ago
Chen Long
66fd1c00a0
fix some docs test=develop;test=document_fix ( #29374 )
4 years ago
liym27
5f84d0b375
Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray ( #29306 )
4 years ago
Feiyu Chan
f7cdcefa65
fix multiple documentation errors, test=document_fix ( #29210 )
...
* fix multiple documentation error, test=document_fix
* fix more rst syntax errors, test=document_fix
* fix format issues in docstring, test=document_fix
4 years ago
卖鱼的哲学
074065e5de
fix expand/uniform_random && concat/transpose to new api on xpu ( #29280 )
...
* fix expand && concat/transpose to new api
* update uniform_random_op
* update xpu_header
4 years ago
ShenLiang
4064354a01
support dp run single card ( #29358 )
4 years ago
gongweibao
8989053443
Fix bug of test_fleet_launch_async.sh ( #29332 )
4 years ago
Huihuang Zheng
8f7627907c
[Dy2stat] Reduce Exception Type for Better Error Message ( #29268 )
...
Reduce exception type so that if convert_to_static fails, it reports the right error message.
4 years ago
liym27
61a8f2874f
[Dy2Stat] Fix bug: Do not use gast.Subscript to replace gast.Name when transforming for_enumerate_loop ( #29310 )
4 years ago
liym27
b10ecd9d3a
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows ( #29267 )
4 years ago
Chen Weihang
9ad800ebb2
Support type promote for basic math ops (quantum required) ( #29265 )
...
* basic impl of type promote
* add comment & another testcase
* fix complex bugs & support python op promote type
* fix failed unittests & polish code
* add unittest for coverage
* change to only promote complex type
* polish code details
* polish several comments
4 years ago
LielinJiang
f31e5adab5
fix typo in ProgBarLogger ( #29329 )
4 years ago
tangwei12
8358791607
fix gpu outofrange ( #29238 )
...
* fix gpu emb out of range
Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf
* fix doc
Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf
4 years ago
YUNSHEN XIE
28164b266f
disable test_rnn_decode_api and test_complex_matmul on windows ( #29252 )
4 years ago
Leo Chen
b58cfff89d
use has_grad instead of train_mode ( #29309 )
...
* use has_grad instead of train_mode
* add vlog for debug
* fix ut
* fix ut
4 years ago
Aurelius84
67c700b479
[Dy2Stat] Add cache for Executor and Context in run_program_op ( #28421 )
4 years ago
ShenLiang
d6753e1e6d
fix matmulv2 for windows ( #29327 )
4 years ago
gongweibao
96de8b008f
cleanup enum test=develop ( #29294 )
4 years ago
liym27
b9a8ebd50f
[Dy2stat] Add a decorator paddle.jit.not_to_static to support that not to convert a function in Dynamic-to-Static. ( #29253 )
...
Usage scenarios: if a function can already run successfully in static mode, you can use this decorator on it in the following cases:
1. An unknown error occurs in the dynamic-to-static conversion process of the function;
2. In the internal implementation of the function, it has two branches: dynamic branch and static branch;
3. Users don't want to convert the function in the process of dynamic to static.
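The idea of an opt-out decorator can be sketched with a toy converter in plain Python. This is not Paddle's implementation: the function names below and the `_skip_conversion` attribute are hypothetical and only show how a marker set by a decorator can make a conversion step leave a function untouched.

```python
def not_to_static(func):
    """Toy decorator: mark `func` so the toy converter skips it."""
    func._skip_conversion = True  # hypothetical marker attribute
    return func


def convert(func):
    """Toy stand-in for a dynamic-to-static conversion step."""
    if getattr(func, "_skip_conversion", False):
        # Marked functions are returned unchanged.
        return func

    def converted(*args, **kwargs):
        # A real converter would rewrite the function's AST; here the
        # result is only tagged so the difference is visible.
        return ("converted", func(*args, **kwargs))

    return converted


@not_to_static
def already_static(x):
    return x + 1


def dynamic(x):
    return x + 1
```

Calling `convert(already_static)` hands back the original function object, while `convert(dynamic)` returns a wrapped one.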
4 years ago
ShenLiang
2d6aa1a5bb
fix warning of fleet ( #29317 )
4 years ago
ShenLiang
2cd0bf5764
Fix doc of fleet api ( #29282 )
...
* fix doc, test=document_fix
4 years ago
ShenLiang
c00af94435
fix matmulv2 for windows ( #29302 )
4 years ago
Steffy-zxf
41f17aeb8b
fix DATA_HOME path in win ( #29222 )
...
* fix DATA_HOME path in win
4 years ago
Jack Zhou
cf43322139
fix nll_loss doc;test=document_fix; ( #29247 )
...
* fix nll_loss doc;test=document_fix;
* remove numpy and set_device;test=document_fix;
* remove numpy;test=document_fix;
4 years ago
LielinJiang
b9f1f4343b
Move temporal_shift to paddle.nn.functional ( #29261 )
...
* move temporal_shift to functional
4 years ago
Chen Weihang
a2e9d95a4a
change test_imperative_signal_handler_to_exclusive ( #29283 )
4 years ago
Zhen Wang
be3777a50a
Add pure fp16 training with master weights. ( #27712 )
...
* add the weight decay func for the momentum op
* Add the multi_precision function in Momentum Optimizer.
* Make sure that the initial values of the master weights are the same as the fp16 weights.
* add static loss scaling.
* add the rescale_grad function in the pure fp16 training.
* use the original momentum updating method.
* Polish some codes, such as variable names.
* add docstring for apis.
* update the var creation details of _create_master_weight.
* not modify codes about imperative momentum updating.
* Fix the error of test_dist_sparse_tensor_load_momentum UT.
* add unit test for multi precision fp16 training.
* add more unit tests for CI.
* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.
* For CI Coverage Checking.
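Why master weights help in pure fp16 training can be shown with a small standard-library sketch. It is not the Momentum optimizer code from this PR; the helper names and the constant update size are assumptions chosen so that a single update is smaller than the fp16 rounding step near 1.0.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def to_fp32(x):
    """Round a Python float to the nearest IEEE single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def train(steps=10, update=1e-4):
    """Apply the same small update to an fp16 weight and to an fp32 master copy."""
    fp16_only = to_fp16(1.0)  # weight stored and updated in fp16 only
    master = to_fp32(1.0)     # fp32 master weight, same initial value
    for _ in range(steps):
        # In fp16 the update is rounded away on every step.
        fp16_only = to_fp16(fp16_only - update)
        # The master copy accumulates the updates in fp32.
        master = to_fp32(master - update)
    # The model would use an fp16 cast of the master weight.
    return fp16_only, to_fp16(master)
```

In this sketch the fp16-only weight never moves away from 1.0, while the fp16 cast of the master weight does once enough updates have accumulated.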
4 years ago
chentianyu03
976961de6d
fix random failed of complex matmul ( #29285 )
4 years ago
furnace
7584bb5096
Layer norm fp16 ( #29169 )
...
* add fp16 for layer_norm op
* revert layernorm api
* fix forward
* fix forward
* fix backward for layernorm with fp16
* fix unit test for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
* fix with_mkldnn compile error for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
4 years ago
mls1999725
a37963b890
Update APIs in text/datasets and dataloader ( #29219 )
...
* Update IterableDataset API
* Update TensorDataset API
* Update APIs in paddle/text/datasets
* Update dataset.py
4 years ago
mls1999725
493568b070
Update Codes of Cifar and VOC2012 ( #29204 )
...
* Update Cifar Codes
* Update VOC2012 Codes
* Update voc2012.py
* Update voc2012.py
* Update cifar.py
* Update cifar.py
* Update voc2012.py
4 years ago
mls1999725
0aedd463ee
Update get_worker_info API ( #29190 )
...
* Update get_worker_info API
* Update dataloader_iter.py
* Update dataloader_iter.py
* Update dataloader_iter.py
4 years ago
mls1999725
6a9a62c3ef
Update conv3d API ( #29205 )
...
* Update conv3d API
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
* Update nn.py
4 years ago
Huihuang Zheng
aec05d811c
[Dy2stat] Fix PaddleGan Deoldify Model Dy2stat Problems ( #29226 )
...
This PR fixes several problems in dy2stat for Deoldify model in PaddleGan.
1. In the model, the software engineer wrote if x.shape == y.shape. The Tensor shape is a tuple in dygraph, so == returns True/False, but in static graph == becomes an element-wise comparison, which is different behavior. In this PR we reduce the element-wise comparison result.
2. If the software engineer writes computations that use parameters in hooks, the static graph can lose the parameter variable because we put param_guard at the forward of a Layer. In this PR we made param_guard cover pre-hook and post-hook.
3. In PaddleGan, the software engineer calculated some parameter values in __init__ by running some dygraph code. That code also runs during dy2stat, so some variables may be assigned as a VarBase (Tensor) first and then as a Variable, which raised an error. We fixed the bug in this PR by handling this case.
TODO: We only added a test case for 1. (shape comparison). Test cases should be added for 2. and 3., but since we are chasing 2.0RC, I will do that in a near-future PR.
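The shape comparison problem described above can be illustrated in plain Python. This sketch does not use Paddle; `shapes_equal` is a hypothetical helper showing how an element-wise comparison is reduced to the single boolean that `if x.shape == y.shape` expects.

```python
def shapes_equal(shape_x, shape_y):
    """Reduce an element-wise shape comparison to one boolean."""
    if len(shape_x) != len(shape_y):
        return False
    # Element-wise comparison: one boolean per dimension, which is what
    # a static-graph `==` would produce instead of a single True/False.
    elementwise = [a == b for a, b in zip(shape_x, shape_y)]
    # Reduction: the shapes are equal only if every dimension matches.
    return all(elementwise)
```

For tuples this gives the same answer as `==`; the point is that the per-dimension result needs an explicit reduction before it can drive an `if` statement.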
4 years ago