Commit Graph

5319 Commits (e5bbffa84cf0c0768dd681b9aa0133169cca6c58)

Author SHA1 Message Date
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
cnn 70cee22fde
New features, add sinh and cosh op, test=develop (#25495)
5 years ago
Zhang Ting a1350744eb
register fp16 kernel, test=develop (#25630)
5 years ago
mapingshuo 5453a912fe
add fp64 support in sequence_pool, test=develop (#25662)
5 years ago
GaoWei8 6e86fd3750
fix concat dimension (#25606)
5 years ago
donproc 95fa383df2
optimize embedding cuda kernel lookup_table_v2,test=develop (#25587)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Zhang Ting 30d1ff3bb4
call cublasGemmStridedBatchedEx when using fp16, test=develop (#25553)
5 years ago
Aurelius84 ca1185d06b
[Dy2Stat] Fix scope in run_program_op (#25579)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
hong e362095e45
fix softmax with cross entropy out of bound; test=develop (#25549)
5 years ago
Huihuang Zheng d8fe517bf8
Add Support for SelectedRows for Transpose OP and Fix a Bug That SelectedRows Cannot be Supported in SimNet (#25536)
5 years ago
Wilber 848aca7ae8
[CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
5 years ago
Shibo Tao 71c71e684c
fix logical_* ops' doc (#25479)
5 years ago
Aurelius84 4717bdbcfb
Fix hang in seq_topk_avg_pooling op (#25522)
5 years ago
LielinJiang 7129f544f0
Add bilateral_slice op (#25401)
5 years ago
Zhang Ting ca725c82f2
improve fp16 performance of slice_grad, test=develop (#25523)
5 years ago
yaoxuefeng 5d3766ff3d
modify flip test=develop (#25312)
5 years ago
Chen Weihang 41d2247275
[Dy2static] Refactor ProgramTranslator save_inference_model API (#24989)
5 years ago
yaoxuefeng aaa7cbd56f
modify trace api test=develop (#25397)
5 years ago
Huihuang Zheng f9ac5fb992
[Dy2stat] Fix Memory Optimization in run_program_op and Add SimNet as Unit Test (#25383)
5 years ago
yaoxuefeng c42d662e2a
modify roll test=develop (#25321)
5 years ago
Zhen Wang 548cdbc544
Quantization-aware training for dygraph (#24634)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
zlsh80826 e528392de9
[Paddle-TRT] SkipLayernorm vectorized memory optimization (#25117)
5 years ago
zhupengyang 5b573c58e2
randperm API: remove out, devive, stop_gradient; add name (#25410)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
Jacek Czaja 050a9bf79d
[oneDNN] LRN cleanup (#25416)
5 years ago
GaoWei8 1974aadcf0
fix concat shape error (#25414)
5 years ago
tangwei12 4b3778a3ee
Revert/barrier for sync (#25417)
5 years ago
ceci3 52be62c5ae
fix instance norm in dy (#24717)
5 years ago
zhupengyang eb3173e2b6
rand API: remove out, device, stop_gradient; add name (#25246)
5 years ago
zhupengyang 6de75082cb
fix test_hsigmoid windows ci (#25311)
5 years ago
WuHaobo f593c3fb2f
fix the formula of floor OP and ceil OP (#25292)
5 years ago
Zhang Ting bc7610583b
use eval() to improve CPU performance (#25243)
5 years ago
Kaipeng Deng 74468bf428
add mish op. (#24565)
5 years ago
Yang Zhang 6d6efafeeb
Add `matrix_nms_op` (#24400)
5 years ago
Chengmo e85fcaa712
Fix fluid.embedding in Distributed Training (#25174)
5 years ago
Yiqun Liu c00f827843
Avoid data transforming ShapeTensor from CPU to GPU in fill_constant op. (#25267)
5 years ago
123malin f1a9593d69
test=develop, bug fix for index_select and roll op (#25251)
5 years ago
FDInSky c2e072587c
test=develop fix generate_proposals's error (#25227)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
liym27 1458cc0c68
Fix bug: Don't check dims if contain_unknown_dim of cross_entropy_grad_op in compile time (#25221)
5 years ago
liu zhengxi 68e93d8a17
Fix beam_search InferShape (#25169)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
zlsh80826 479c8834f7
[Paddle-TRT] Fixes #24731, opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834)
5 years ago
Leo Chen 028de857d4
fix dtype error of compare op, test=develop (#25059)
5 years ago