Commit Graph

17844 Commits (ba0756325a8a64eedc5586cace20d9e2768d1f06)

Author SHA1 Message Date
YUNSHEN XIE ba0756325a
exec ut no more than 15s 1 (#28439)
4 years ago
Chen Weihang 155b4f9b6c
Remove selected rows all reduce over height check (#28460)
4 years ago
taixiurong fad4744aa4
fix crash in adam in xpu, *test=kunlun (#28433)
4 years ago
QingshuChen 6bba8e57b1
fix batch_norm_xpu bug & remove xpusimulator dependence (#28430)
4 years ago
Wilber ced5c40c41
Update memory release interface. (#28456)
4 years ago
joanna.wozna.intel 7821759d48
Add bfloat16 softmax and gelu (#28394)
4 years ago
iducn ba0fe0a812
revert the modified shell script (#28453)
4 years ago
Chen Weihang c42e656179
Add retry for dygraph parallel socket bind (#28404)
4 years ago
石晓伟 c41fd033e5
check op_version_registry in CI test, test=develop (#28402)
4 years ago
Jacek Czaja ca41541472
[oneDNN]Sum bf16 kernel (#28382)
4 years ago
Chen Weihang 23439b1688
show cpp stack when catch signal (#28415)
4 years ago
Leo Chen 44a476c2ab
support cuda pinned place (#28416)
4 years ago
lidanqing 12b9587be5
Add conv_bias pass version python test (#28278)
4 years ago
Wilber 05114693cf
[Inference] Memory modification for ShrinkMemory. (#28355)
4 years ago
Leo Chen 8b2436a776
Add broadcast_shape api (#28257)
4 years ago
石晓伟 21a63f6f90
enhance the op_version_registry, test=develop (#28347)
4 years ago
YUNSHEN XIE c1c3e21726
retry will not be executed when the number of failed ut is greater than 20 (#28374)
4 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
4 years ago
Jacek Czaja 84cc61b2cd
[oneDNN] sum op refactor (#28318)
4 years ago
Wilber 6f0f45f69c
copy_to_cpu support uint8 (#28372)
4 years ago
Wilber 09fd2b2aab
Paddle support compile on sw (#27858)
4 years ago
chen zhiyu 953302d9eb
add musl docker build script (#28027)
4 years ago
Leo Chen 6115c14fca
Pool2d cuda kernel supports fp16 (#28316)
4 years ago
Zhou Wei f41104efa3
fix compile out of memory temporary (#28346)
4 years ago
Guo Sheng 9a600df373
Add rnn_op (#28197)
4 years ago
wangchaochaohu 0f4b6247c8
refine the gpu config for performance optimization (#28291)
4 years ago
Huihuang Zheng acc11c2a62
Retry CUDA Initialization to Fix Random Failure, test=develop (#28323)
4 years ago
wangguanzhong 5262b02585
add generate_proposals_v2 op (#28214)
4 years ago
石晓伟 d9b5f1261c
update the version of pybind, test=develop (#28284)
4 years ago
Leo Chen 18c86fb2fb
hide some logs of p2p (#28307)
4 years ago
lidanqing 8cd1c102d9
Enable GRU infer model running CAPI (#28313)
4 years ago
wangguanzhong 1c385e26f9
add op_function_generator for box_coder (#28303)
4 years ago
iducn f763cb81a6
Modify the shell script according to the specification (#28302)
4 years ago
joanna.wozna.intel 571a63e7ec
Add bf16 transpose2, reshape2, concat ops (#28195)
4 years ago
Guanghua Yu e8f2614da5
Enhance multiclass_nms op to support LoD for dygraph mode (#28276)
4 years ago
石晓伟 842a4e5abd
fix analyzer_capi_tester, test=develop (#28289)
4 years ago
Leo Chen 8953038400
Fix transpose in conv cudnn kernel when addto enabled (#28295)
4 years ago
Tao Luo e1e666a05f
fix conv mkldnn build error (#28288)
4 years ago
Jacek Czaja 0b678d401b
- sum (#28233)
4 years ago
Jacek Czaja c11d9b3035
[oneDNN ] conv2d fwd&bwd optimization (#27871)
4 years ago
Zhou Wei 8f87c7eac4
fix judge bug of errorlevel on cmd (#28271)
4 years ago
wangxinxin08 41d26a8287
update matrix nms op to api 2.0 (#28265)
4 years ago
Leo Chen 7fcb32ddf3
fill_constant op supports NINF (#28270)
4 years ago
wangchaochaohu 6905608cea
refine yolo box Op for performace optimization (#28155)
4 years ago
wangchaochaohu cdadc8f019
refine temporal_shift_op for performance optimization using gpu kernel config (#28114)
4 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
4 years ago
Chen Weihang 813b2ade34
Enrich the python error types of paddle & polish format (#28124)
4 years ago
Adam Osewski 7db747d9e8
oneDNN BatchNorm + Act fusion pass. (#27912)
4 years ago
Zhou Wei fb7f85291b
fix print tensor place,add cpu/cuda/pin_memory API for Tensor (#28200)
4 years ago
tianshuo78520a 11089cacdb
Fix xpu notest (#28204)
4 years ago