Commit Graph

4673 Commits (976961de6dc4320e97ac931c3f4d49d68266dfca)

Author SHA1 Message Date
joanna.wozna.intel b0d1ac161e
Add bf16 pool2d and unify bf16 unit tests (#29039)
5 years ago
gongweibao 1358397e97
Clean up the redundant files and unify the launch interface. (#28928)
5 years ago
Chen Weihang fea0e294ee
Hide the C++ stack by default and add hints (#29042)
5 years ago
Chen Weihang a1486091f1
Polish load_program_state design to loading file onebyone (#29041)
5 years ago
WeiXin fafadbab70
Rename variables when use 'jit.load' (#28933)
5 years ago
ceci3 a3faa520ec
Fix syncbn (#29013)
5 years ago
joejiong 582c0a0468
add uint8 for reshape op (#28996)
5 years ago
liym27 682cc17f53
[Dynamic-to-Static] Fix bug: support pop from a dict and polish code of convert_pop (#29023)
5 years ago
Zhou Wei 8ca0a8a859
fix tensor detach to zero copy (#27921)
5 years ago
Aurelius84 8af0d85ea4
fix unittest failed on windows GPU (#29072)
5 years ago
taixiurong a5aa4dc7a9
add xpu elementwise ops (#29031)
5 years ago
tianshuo78520a e9acd9c941
Update CI Python3 Docker for Cuda11 (#28401)
5 years ago
Chen Weihang bb16c2515d
Polish parallel api impl & doc details (#28980)
5 years ago
joejiong 4b05a8be88
delete axis parameter in multiply api (#28647)
5 years ago
YUNSHEN XIE c098a2e159
fixing cmake error for WITH_GPU=ON and WITH_DISTRIBUTE=OFF (#29030)
5 years ago
chajchaj b52427327d
add soft_label and axis for CrossEntropyLoss and improve performance (#29024)
5 years ago
Huihuang Zheng 5b339262bc
[Dy2stat] Add Tuple as Assign Target for Tensor Shape (#28775)
5 years ago
YUNSHEN XIE 5cb8e17a18
restore timeout value (#29027)
5 years ago
Jiawei Wang f21513307a
add lamb optimizer and unittest (#28772) TODO:FIX BUGS LATER
5 years ago
Leo Chen 3815d7aa40
Upgrade string literals to raw string (#28989)
5 years ago
liym27 85292e0b46
[Dynamic-to-Static] Fix bug of convert_logical_and/convert_logical_or: the operands are executed sequentially(#28993)
5 years ago
Huihuang Zheng 96126532cd
Fix Incorrect After Node Vars in IfElseTransformer, test=develop (#28992)
5 years ago
lilong12 f77a78cdee
enable pipeline to run with Executor.run() (#28373)
5 years ago
furnace 8ff3550658
refactor momentum op to combine weight (#27414)
5 years ago
YUNSHEN XIE 50113f9268
fix error for Can not find test to add properties to:test_parallel_dygraph_transformer,test_parallel_dygraph_sparse_embedding (#28968)
5 years ago
yaoxuefeng 6369463a5d
fix test datagenerator ut fail (#28767)
5 years ago
Kaipeng Deng ed7aa8f9c3
Fix conv transpose (#28629)
5 years ago
Kaipeng Deng 91bab752a9
fix dataloader default value and doc (#28728)
5 years ago
Chen Weihang 0ed80e09fc
Fix param base trainable set failed (#28756)
5 years ago
Aurelius84 d6aee7597c
[Dy2Stat]Set buff.persistable=False when it's not initialized (#28749)
5 years ago
joejiong 1a532d5133
add uint8 support for squeeze operator (#28734)
5 years ago
joanna.wozna.intel 8c0ea4bffe
Add bf16 matmul, fc, elementwise add and mul (#28729)
5 years ago
Zhou Wei 3b0dd5f620
fix bug that to_tensor not support paddle.Place (#28717)
5 years ago
Aurelius84 e1c8d6bce5
Fix wrong out dtype inferred from helper.input_dtype (#28715)
5 years ago
taixiurong d3d1a6b6e0
add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun (#28542)
5 years ago
Jack Zhou 9362d85e0e
Add LSTM, Simple RNN and GRU CPU kernel (#28577)
5 years ago
QingshuChen 30ef3815b3
adjust kunlun header file (#28536)
5 years ago
liym27 269470d62e
[Dynamic-to-Static] Remove unnecessary variables of the arguments in true_func/false_func (#28722)
5 years ago
liym27 e5f0e6b003
[Dynamic-to-Static] Fix bug in loop_transformer: loop vars should contain the var from ancestor-for-node (#28735)
5 years ago
Wojciech Uss 04bcc13fac
Add multi_gru op and tests (#28591)
5 years ago
YUNSHEN XIE 16a808149a
modfied the timeout value for 5 ut (#28714)
5 years ago
liym27 9881738e13
[Dynamic-to-Static ErrorMessage] Support dy2stat error message when call jit.save and polish error message (#28713)
5 years ago
joejiong 32b90b1c2d
add log10 (#28576)
5 years ago
Chen Weihang 3c5f2cac68
fix save parse error for dict input (#28712)
5 years ago
wawltor 20b1276598
faster the compare ops dygraph model speed
5 years ago
JZ-LIANG 5a9f6889c1
[Sharding] add new features (#28568)
5 years ago
xiaoting 8c75b2554a
Support Tensor for attr_scale and attr_size (#28677)
5 years ago
YUNSHEN XIE e880c90c5a
fix error when setting ut timeout value (#28696)
5 years ago
Shibo Tao db2e6cee62
add two paddle-2.0 apis: paddle.static.io.save_inference_model and paddle.static.io.load_inference_model (#28606)
5 years ago
Huihuang Zheng b6f86b8491
Fix Using "isinstance" in Loop, test=develop (#28641)
5 years ago
Jacek Czaja 6d8d3d4c22
[oneDNN] Layer norm bf16 kernel (#28619)
5 years ago
wangchaochaohu 912a5c30b4
fix the matmul_v2 test for cuda11 (#28635)
5 years ago
littletomatodonkey 82f0b5ea5c
adapt pad const (#28585)
5 years ago
Aurelius84 8040fa2bca
Fix output dtype inconsistent with input (#28649)
5 years ago
Chen Weihang a083c76ab4
adjust signal failed wait time (#28640)
5 years ago
wawltor 8f2656ef5c
fix the gradient bug for the topk v2
5 years ago
Kaipeng Deng ece1e4cd9d
Add weighted random sampler (#28545)
5 years ago
joanna.wozna.intel 2cb71c0cde
Add checkpoint to quantize (#28612)
5 years ago
lidanqing 804271cff9
Op version python mkldnn_inplace test (#28354)
5 years ago
pangyoki 72e068f1ba
fix test_multinomial (#28558)
5 years ago
Kaipeng Deng 89d27de90f
DataLoader support not auto collate batch (#28425)
5 years ago
Huihuang Zheng c5c273c13e
[Dy2stat] Fix Using Tuple for Transpose in Dy2stat (#28574)
5 years ago
Guo Sheng 110febdc54
Fix gradients with ignore_idx in softmax_with_cross_entropy (#28622)
5 years ago
YUNSHEN XIE c4d22c845b
modified timeout value for some ut (#28616)
5 years ago
Huihuang Zheng 1adc09b56a
Make Batch Size Smaller on Mac Because of CI Machine (#28569)
5 years ago
lilong12 ed9dd7c9f0
add send and recv ops (#28590)
5 years ago
Yang Zhang c47bfe98cf
Reduce input data size for syncbn unit test by 8x (#28571)
5 years ago
channings c545b9b673
Add ONNX Exporter (#27831)
5 years ago
Shang Zhizhou 8699f38d08
裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
5 years ago
joejiong 08d2413142
add log2 operator (#28319)
5 years ago
lidanqing 0fc181dbd0
[Fix bug] If the pass name is not found, IsCompatible should return false (#28475)
5 years ago
Leo Chen 543ff333cd
Refine the format of printing tensor 3 (support scaler tensor) (#28544)
5 years ago
YUNSHEN XIE 621b31c526
modified timeout value for test_resnet_v2 and test_resnet (#28532)
5 years ago
YUNSHEN XIE 2159646796
modified timeout value for test_conv3d_mkldnn_op,test=document_fix (#28496)
5 years ago
Huihuang Zheng eb0855615c
Set Exclusive for test_yolov3 to Avoid Memory Segmentation Fault in CI Random Failure (#28485)
5 years ago
YUNSHEN XIE e1a5fc449d
fix ut exec timeout notest,test=kunlun (#28495)
5 years ago
YUNSHEN XIE d3b2d07d6e
modified timeout value on windows (#28499)
5 years ago
YUNSHEN XIE 1e698c600e
fix cmake error when setting ut timeout properity (#28492)
5 years ago
YUNSHEN XIE faa6549877
setting timeout properity for test_flags_mkldnn_ops_on_off (#28489)
5 years ago
Tao Luo 7fd2077211
set NCCL_SHM_DISABLE=1 for test_parallel_executor_profilery.py (#28484)
5 years ago
YUNSHEN XIE 72c78e4dbb
exec ut no more than 15s 2 (#28441)
5 years ago
YUNSHEN XIE ba0756325a
exec ut no more than 15s 1 (#28439)
5 years ago
Chen Weihang 155b4f9b6c
Remove selected rows all reduce over height check (#28460)
5 years ago
WeiXin b5e662f837
refine jit.save/load to add support for other method, not only forward (#28376)
5 years ago
Zhang Ting 7fe5f9ccad
Fix unittest random failure (#28363)
5 years ago
joanna.wozna.intel 7821759d48
Add bfloat16 softmax and gelu (#28394)
5 years ago
zhupengyang 0155f91679
enable softmax unittest (#28362)
5 years ago
Aurelius84 2500dca878
[Dy2Stat] Fix bug in convert_call (#28368)
5 years ago
Jacek Czaja ca41541472
[oneDNN]Sum bf16 kernel (#28382)
5 years ago
Kaipeng Deng 463075a82b
add paddle.io.ComposeDataset & paddle.io.ChainDataset (#28311)
5 years ago
Leo Chen 44a476c2ab
support cuda pinned place (#28416)
5 years ago
lidanqing 12b9587be5
Add conv_bias pass version python test (#28278)
5 years ago
Leo Chen 8b2436a776
Add broadcast_shape api (#28257)
5 years ago
石晓伟 21a63f6f90
enhance the op_version_registry, test=develop (#28347)
5 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
Jacek Czaja 84cc61b2cd
[oneDNN] sum op refactor (#28318)
5 years ago
Leo Chen 6115c14fca
Pool2d cuda kernel supports fp16 (#28316)
5 years ago
Guo Sheng 9a600df373
Add rnn_op (#28197)
5 years ago
Huihuang Zheng acc11c2a62
Retry CUDA Initialization to Fix Random Failure, test=develop (#28323)
5 years ago
wangguanzhong 5262b02585
add generate_proposals_v2 op (#28214)
5 years ago