Commit Graph

7987 Commits (develop)

Author SHA1 Message Date
gongweibao 9c624b16d5
Extend unittest time of (#31570)
5 years ago
ronnywang da10c5cf8b
[ROCM] fix softmax_with_cross_entropy_op, test=develop (#31629)
5 years ago
Kaipeng Deng a32e8bf1e7
DataLoader supprot dict str (#31481)
5 years ago
Chen Weihang 30a627aaf3
Normalized function parameter writing (#31588)
5 years ago
Shang Zhizhou 50ac7dbfd0
Trt elementwise plugin serialize (#31587)
5 years ago
guofei ef0dd3efed
Support loading parameters from checkpoint to save quantized model (#31419)
5 years ago
hong 99dcd66508
try to fix imperative orc unitest error; test=develop (#31568)
5 years ago
Qi Li 3d5aa9d10a
[ROCM] fix conv2d and conv3d op, test=develop (#31553)
5 years ago
Chen Weihang 95cceb2dd7
[CustomOp] Support duplicable op input and output (#31535)
5 years ago
Aurelius84 def27bc801
[Dy2stat]Fix bug with static_convert_var_shape in locals scope (#31556)
5 years ago
jiangcheng 9ed6c895f1
optimize range op by place parameters on cpu rather than gpu, test=develop (#30811)
5 years ago
Thunderbrook 3789a69923
solve bug in heter mode (#31531)
5 years ago
chajchaj 6148b87f9d
add softmax_switch for softmax_with_cross_entropy_op, test=develop (#31428)
5 years ago
Aurelius84 f3959e9ddc
[save/load] Fix bug with input_spec=dict[InputSpec] in jit.save (#31517)
5 years ago
WangXi 83a2fb1f08
Add collective async wait op (#31463)
5 years ago
lilong12 0205e9f84e
remove the send/recv of tensor size (#31460)
5 years ago
Aurelius84 c8ae837d52
[CustomOp]Fix setup_install timeout (#31484)
5 years ago
furnace 910f377fa5
Bugfix rocm (#31490)
5 years ago
wangguanzhong 43d6abf0a5
update conv2d, test=develop (#31480)
5 years ago
wangguanzhong 50af0c2cbb
fix roi_align, test=develop (#31479)
5 years ago
Qi Li b85c8e03be
[ROCM] fix reduce op, test=develop (#31478)
5 years ago
Jacek Czaja 39a5424ed1
[oneDNN] elementwise add bf16 grad kernel with broadcasting (#31385)
5 years ago
Qi Li 133a914bd0
[ROCM] fix test_dist_op ci test, test=develop (#31468)
5 years ago
Aurelius84 fadabbe9b0
[CustomOp] Automatically specify PADDLE_WITH_MKLDNN & Remove Interpreter argument (#31391)
5 years ago
liuyuhui 9ebf05b003
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
5 years ago
liym27 522c91ec67
[Dy2Stat] Remove gast.Index for compatibility of gast 0.4.0 (#31358)
5 years ago
Zhang Ting 7d95e598c1
support float16 for temporal_shift op (#31432)
5 years ago
Zhang Ting dcce54ea76
improve performance of depthwise_conv2d (#31099)
5 years ago
liym27 0fff930667
Fix bug for set_value op when input dtype is not float32 (#31411)
5 years ago
Huihuang Zheng c40b98e068
Fix comment (#31424)
5 years ago
Huihuang Zheng 6bf02a1261
[Dy2stat] Fix Read-Only Attribute as while_loop Output (#31415)
5 years ago
jakpiase 5b4f8aac82
Added LSTM BF16 and fixed GRU BF16 (#31234)
5 years ago
Qi Li db50fb6766
[ROCM] fix softmax with loss and update python scripts, test=develop (#31373)
5 years ago
Pei Yang 32211fe9c4
TRT conv2d converter support SAME padding (#31379)
5 years ago
Zhou Wei 13e4280f82
[Custom OP]polish doc of custom OP (#31369)
5 years ago
Qi Li 946dbdae8c
[ROCM] update fluid operators for rocm (part6), test=develop (#31301)
5 years ago
wangna11BD 1cbccfa594
Add attrs `deformable_groups` for deformable_conv API (#31335)
5 years ago
Pei Yang 2e9e3fad15
add n-d input support for trt scale converter (#31316)
5 years ago
Gradie d79fdc3d62
lamb_op_xpu;test=kunlun (#31012)
5 years ago
Zhou Wei aebf223478
fix test_check_abi (#31288)
5 years ago
Zhou Wei cc89120a2c
[Custom OP]add MSVC compile check on Windows (#31265)
5 years ago
Zhou Wei af9066e89c
[Custom OP]add PD_THROW and PD_CHECK for User Error message (#31253)
5 years ago
Jiabin Yang 038ce70d69
[Custom OP] Support stream set on Custom Op (#31257)
5 years ago
Aurelius84 1dd40870fc
[Dy2Stat] Fix eval_if_exist_else_none bug (#31261)
5 years ago
pangyoki 6fafbdc39e
change np.int to int to fix paddle warning (#31221)
5 years ago
Jiabin Yang 0c38708a90
[Custom Op] Remove unsupport dtypes (#31232)
5 years ago
WangXi b8bce682e0
xpu support fuse allreduce (#31104)
5 years ago
Chen Weihang 126633c50f
[CustomOp] Split build op marco & polish details (#31229)
5 years ago
Aurelius84 e8d24b546a
[CustomOp] Add Modeling with Custom op unittest (#31218)
5 years ago
littletomatodonkey ad50fa710b
add int pad support for Pad1D/2D/3D (#31209)
5 years ago
jakpiase 2f1165342b
OneDNN hardswish integration (#30211)
5 years ago
Aurelius84 912022fa0c
[CustomOp]Add cpp_extension en doc (#31187)
5 years ago
Chen Weihang e8cdb49aa9
[CustomOp] Support attributes as func input in custom op (#31128)
5 years ago
Thunderbrook c4f279fe8d
support multi node in heterps (#31102)
5 years ago
Aurelius84 406f4a7513
[CustomOp] Support to specific extra_cflags and exctra_cuda_flags independently (#31059)
5 years ago
qingqing01 572cc8bd0f
Update doc for 2.0 API and some callback (#31180)
5 years ago
Pei Yang 00b09e86ac
[Paddle-TRT] support group_norm (#31040)
5 years ago
Chen Weihang c209751c8d
change test_multiprocess_reader_exception cmake (#31174)
5 years ago
YUNSHEN XIE 153121457f
fix ut timeout (#31061)
5 years ago
Chen Weihang 1ce96fa118
[CustomOp] Add new paddle custom op so (#31141)
5 years ago
tangwei12 ebbdf52557
fix entry (#31079)
5 years ago
Zhou Wei 4b220550ef
[Custom OP]Fix problem of custom op unitests on Windows CI (#31114)
5 years ago
chentianyu03 70131b475f
add warning message when dtypes of operator are not same (#31136)
5 years ago
Chen Weihang e60fd1f6a8
[CustomOp] Split test and add inference test (#31078)
5 years ago
xiemoyuan edacb6293c
Optimization of Transformer API (#30957)
5 years ago
WeiXin ee1801c1ad
Save load/save pickle protocol (#31044)
5 years ago
Zhou Wei 44ee251fde
fix UNIX cmake problem (#31113)
5 years ago
Thunderbrook 565354f676
support save multi sparse table in one path (#31108)
5 years ago
Huihuang Zheng cf43a321a8
[Dy2stat] Refactoring tensor_shape_transformer.py to Fix Change after Assign Bug (#31082)
5 years ago
tangwei12 0e4b154298
fix dist fleet ctr ut (#31087)
5 years ago
Zhou Wei adaec0073d
[2.0Custom OP]Support New Custom OP on Windows (#31063)
5 years ago
Chen Weihang 2168f08ac8
add optional for param attr args, test=document_fix (#31105)
5 years ago
Chen Weihang 6beeafe797
[CustomOp] Add more dispatch marco for users (#31058)
5 years ago
TTerror d5323dab41
add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056)
5 years ago
123malin 16b4260b2f
test=develop, save/load, shrink (#30625)
5 years ago
Shibo Tao 4424aac608
export paddle.static.normalize_program method. (#31072)
5 years ago
liym27 5b367dab44
[static setitem] Support the index is Tensor; step>1; step<0 .(#30949)
5 years ago
Huihuang Zheng ef627ac5b9
Fix that convert_var_shape doesn't support slice like [0:], test=develop (#31051)
5 years ago
Jacek Czaja f7465641c3
Added reshape grad bf16 (#31035)
5 years ago
Aurelius84 4dbe16c48f
[CustomOp] Refine name argument in setup (#31049)
5 years ago
Aurelius84 f2dc29a9fa
[CustomOp] Support output dtypes in generated Python API (#31045)
5 years ago
ShenLiang 9401173e3a
Remove scale loss before reduce in dygraph (#30807)
5 years ago
Kaipeng Deng c4ddc3ab0d
fix dataloader collate return list mix tensor and numpy array (#30904)
5 years ago
Guanghua Yu 5b267474a9
add offset parameter in roi_align,generate_proposals.etc ops (#30864)
5 years ago
Chen Weihang 75f81233ae
fix regex error & simplify marco name (#31031)
5 years ago
Pei Yang 9b54fe4154
add trt transpose and flatten converter (#31022)
5 years ago
Aurelius84 4c9f96c902
[CustomOp] Support Compile multi ops at same time (#30920)
5 years ago
joanna.wozna.intel caf9d39839
Add Conv Transpose BF16 (#30877)
5 years ago
Huihuang Zheng cbbe127483
Refine fake_interface Error Message (#30981)
5 years ago
Huihuang Zheng c137578341
Add Support for Tuple in for Loop (#30998)
5 years ago
Wojciech Uss 2497f4392f
Handle missing symlink method on Windows (#31006)
5 years ago
Aurelius84 5653c3a488
[CustomOp] Check Compiler ABI compatibility (#30869)
5 years ago
WeiXin 8ab29f4bea
delay timeout of unnittest 'test_static_save_load'. (#30975)
5 years ago
Chen Weihang f649442ddd
New custom operator extension mechanism (#30690)
5 years ago
Chen Weihang 010f2caa23
try to fix reader and signal test failed (#30960)
5 years ago
liym27 12c15bebe4
[Static setitem] Support index is ellipsis for setitem in static mode (#30836)
5 years ago
liuyuhui 87197f8c2e
[kunlun]fix sync in multi kunlun xpu dygraph training. (#30943)
5 years ago
wanghuancoder 823f499a8a
fix a bug of Sequential::__getitem__ (#30899)
5 years ago
Jacek Czaja 9e527d9956
[oneDNN] Added basic changes for elementwise_add_grad bf16 (#30925)
5 years ago
liuyuhui 4a8b8b4547
[Kunlun] add gen_bkcl_id_op, support multi XPU cards training using multiprocess (#30858)
5 years ago