Commit Graph

12554 Commits (8c19d7aa2f89a38b3a68e53c73d88af16a3de8ce)

Author SHA1 Message Date
ronnywang 8c19d7aa2f
[ROCM] fix test_conv2d_transpose_op (#31749)
5 years ago
Jacek Czaja 25fc2a1fdb
[oneDNN] Added Elementwise Mul grad fp32/bf16 (#31647)
5 years ago
Chen Weihang 878e117b6d
[CustomOp] Support float16 in custom op (#31725)
5 years ago
ronnywang c9e1d9dc31
[ROCM] fix test_rnn_op (#31735)
5 years ago
Chen Weihang e429deb0c4
[CustomOp] Support attribute in infershape function (#31713)
5 years ago
Adam Osewski a4a2b77def
[oneDNN] lookup_table op with support for BF16 data type. (#31558)
5 years ago
cc 1d197f6c97
[dgraph qat] Refine calculating output scale of dygraph qat (#31710)
5 years ago
ronnywang 420527f0d9
[ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_math_op_patch_var_base (#31709)
5 years ago
Chen Weihang 87852616aa
[CustomOp] Support complex dtype in custom op (#31657)
5 years ago
zlsh80826 fe241fd02f
[Paddle-TRT] gather converter (#31640)
5 years ago
Zhou Wei d4282ea97e
fix multi cuda environment bug (#31694)
5 years ago
Chengmo 09482ddec4
【Paddle.Fleet】Fix one ps gradient clip (#31664)
5 years ago
Kaipeng Deng 740359edaf
remove useless import (#31700)
5 years ago
Zhang Ting 7f50bb7ec1
support NHWC for temporal_shift op (#31642)
5 years ago
liym27 402288ad65
In __getitem__, convert integers to int64 Tensor not int32 to be compatible with Lite(#31658)
5 years ago
cc 19592d2b71
Refine dygraph qat, test=develop (#31680)
5 years ago
gongweibao 9c624b16d5
Extend unittest time of (#31570)
5 years ago
ronnywang da10c5cf8b
[ROCM] fix softmax_with_cross_entropy_op, test=develop (#31629)
5 years ago
LielinJiang 75433126df
Fix summary bug when calaculating output shape (#31549)
5 years ago
ShenLiang c3634c6b0a
fix amp bug of fleet (#31532)
5 years ago
Kaipeng Deng a32e8bf1e7
DataLoader supprot dict str (#31481)
5 years ago
Chen Weihang 30a627aaf3
Normalized function parameter writing (#31588)
5 years ago
Shang Zhizhou 50ac7dbfd0
Trt elementwise plugin serialize (#31587)
5 years ago
guofei ef0dd3efed
Support loading parameters from checkpoint to save quantized model (#31419)
5 years ago
hong 99dcd66508
try to fix imperative orc unitest error; test=develop (#31568)
5 years ago
Qi Li 3d5aa9d10a
[ROCM] fix conv2d and conv3d op, test=develop (#31553)
5 years ago
Chen Weihang 95cceb2dd7
[CustomOp] Support duplicable op input and output (#31535)
5 years ago
Aurelius84 def27bc801
[Dy2stat]Fix bug with static_convert_var_shape in locals scope (#31556)
5 years ago
LielinJiang ac493f2c72
Update comments for API `RandomResizedCrop` (#31539)
5 years ago
jiangcheng 9ed6c895f1
optimize range op by place parameters on cpu rather than gpu, test=develop (#30811)
5 years ago
Thunderbrook 3789a69923
solve bug in heter mode (#31531)
5 years ago
chajchaj 6148b87f9d
add softmax_switch for softmax_with_cross_entropy_op, test=develop (#31428)
5 years ago
Aurelius84 f3959e9ddc
[save/load] Fix bug with input_spec=dict[InputSpec] in jit.save (#31517)
5 years ago
WangXi 83a2fb1f08
Add collective async wait op (#31463)
5 years ago
lilong12 0205e9f84e
remove the send/recv of tensor size (#31460)
5 years ago
Aurelius84 c8ae837d52
[CustomOp]Fix setup_install timeout (#31484)
5 years ago
furnace 910f377fa5
Bugfix rocm (#31490)
5 years ago
Zhou Wei 634a12b368
fix bug of windows chineses msvc (#31493)
5 years ago
wangguanzhong 43d6abf0a5
update conv2d, test=develop (#31480)
5 years ago
wangguanzhong 50af0c2cbb
fix roi_align, test=develop (#31479)
5 years ago
Qi Li b85c8e03be
[ROCM] fix reduce op, test=develop (#31478)
5 years ago
Jacek Czaja 39a5424ed1
[oneDNN] elementwise add bf16 grad kernel with broadcasting (#31385)
5 years ago
Qi Li 133a914bd0
[ROCM] fix test_dist_op ci test, test=develop (#31468)
5 years ago
Aurelius84 fadabbe9b0
[CustomOp] Automatically specify PADDLE_WITH_MKLDNN & Remove Interpreter argument (#31391)
5 years ago
liuyuhui 9ebf05b003
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
5 years ago
liym27 522c91ec67
[Dy2Stat] Remove gast.Index for compatibility of gast 0.4.0 (#31358)
5 years ago
Zhang Ting 7d95e598c1
support float16 for temporal_shift op (#31432)
5 years ago
Zhang Ting dcce54ea76
improve performance of depthwise_conv2d (#31099)
5 years ago
liym27 0fff930667
Fix bug for set_value op when input dtype is not float32 (#31411)
5 years ago
Huihuang Zheng c40b98e068
Fix comment (#31424)
5 years ago