Commit Graph

5691 Commits (8f2656ef5ca4ab16f06d94b8ca9392d3f0f760ae)

Author SHA1 Message Date
wawltor 8f2656ef5c
fix the gradient bug for the topk v2
5 years ago
wangchaochaohu a972c33fd7
refine gather OP performance for dynamic mode (#28587)
5 years ago
joanna.wozna.intel 2cb71c0cde
Add checkpoint to quantize (#28612)
5 years ago
pangyoki b889a0cee2
add gaussian_random op_version (#28602)
5 years ago
Guo Sheng 110febdc54
Fix gradients with ignore_idx in softmax_with_cross_entropy (#28622)
5 years ago
Leo Chen f962bd3432
Fix cudnn workspace limit in cudnn-8 (#28611)
5 years ago
Leo Chen 90805e2df7
Register op_version for new attribute use_addto (#28463)
5 years ago
lilong12 ed9dd7c9f0
add send and recv ops (#28590)
5 years ago
Zhong Hui a829357e4d
register the op version for some ops
5 years ago
Zhou Wei bf6e7cba7a
updata 2.0 API english doc (#28525)
5 years ago
Shang Zhizhou 8699f38d08
裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517)
5 years ago
joejiong 08d2413142
add log2 operator (#28319)
5 years ago
wangchaochaohu c52fe48f6f
fix the GetKernelTypeForVar of input for fluid.gather (#28534)
5 years ago
wangchaochaohu d7cfee9b31
Checkout point add (#28488)
5 years ago
zhupengyang 47cbf61dd4
fix softmax unittest float16 random error (#28480)
5 years ago
wangchaochaohu e14ed71cc2
refine the performance of gather Op (#28458)
5 years ago
YUNSHEN XIE ba0756325a
exec ut no more than 15s 1 (#28439)
5 years ago
taixiurong fad4744aa4
fix crash in adam in xpu, *test=kunlun (#28433)
5 years ago
QingshuChen 6bba8e57b1
fix batch_norm_xpu bug & remove xpusimulator dependence (#28430)
5 years ago
joanna.wozna.intel 7821759d48
Add bfloat16 softmax and gelu (#28394)
5 years ago
石晓伟 c41fd033e5
check op_version_registry in CI test, test=develop (#28402)
5 years ago
Jacek Czaja ca41541472
[oneDNN]Sum bf16 kernel (#28382)
5 years ago
Leo Chen 8b2436a776
Add broadcast_shape api (#28257)
5 years ago
石晓伟 21a63f6f90
enhance the op_version_registry, test=develop (#28347)
5 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
Jacek Czaja 84cc61b2cd
[oneDNN] sum op refactor (#28318)
5 years ago
Wilber 09fd2b2aab
Paddle support compile on sw (#27858)
5 years ago
Leo Chen 6115c14fca
Pool2d cuda kernel supports fp16 (#28316)
5 years ago
Guo Sheng 9a600df373
Add rnn_op (#28197)
5 years ago
wangguanzhong 5262b02585
add generate_proposals_v2 op (#28214)
5 years ago
joanna.wozna.intel 571a63e7ec
Add bf16 transpose2, reshape2, concat ops (#28195)
5 years ago
Guanghua Yu e8f2614da5
Enhance multiclass_nms op to support LoD for dygraph mode (#28276)
5 years ago
Leo Chen 8953038400
Fix transpose in conv cudnn kernel when addto enabled (#28295)
5 years ago
Tao Luo e1e666a05f
fix conv mkldnn build error (#28288)
5 years ago
Jacek Czaja 0b678d401b
- sum (#28233)
5 years ago
Jacek Czaja c11d9b3035
[oneDNN ] conv2d fwd&bwd optimization (#27871)
5 years ago
wangxinxin08 41d26a8287
update matrix nms op to api 2.0 (#28265)
5 years ago
Leo Chen 7fcb32ddf3
fill_constant op supports NINF (#28270)
5 years ago
wangchaochaohu 6905608cea
refine yolo box Op for performace optimization (#28155)
5 years ago
wangchaochaohu cdadc8f019
refine temporal_shift_op for performance optimization using gpu kernel config (#28114)
5 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
5 years ago
Chen Weihang 2babd6ff67
Add compile limit for PADDLE_ENFORCE without error message (#28221)
5 years ago
Double_V 2db77be423
fix wrong data type, test=develop (#28203)
5 years ago
Feiyu Chan efe6e2840c
fix strided_slice_op's GetExpectedKernelType (#28192)
5 years ago
WangXi e450823b8b
Fix nccl op test failed, test=develop (#28172)
5 years ago
wangguanzhong 5cd97a1cb0
support multiclass nms for multi-batch, test=develop (#28154)
5 years ago
Double_V 5289b72acc
fix Wmaybe-uninitialized warning in pooling.cc, test=develop (#28126)
5 years ago
wangguanzhong d1e1f17482
fix generate_proposal_labels in cascade-rcnn series model, test=develop (#27892)
5 years ago
Leo Chen a911c19eb0
fill_constant op supports NaN and Inf (#28109)
5 years ago
zhupengyang 6dd64b0a30
randperm run error in multi-gpus (#27942)
5 years ago