17184 Commits (20ee36bdeab5b7a77a8f8d30e07558febdbaa079)

Author SHA1 Message Date
liym27 1458cc0c68
Fix bug: Don't check dims if contain_unknown_dim of cross_entropy_grad_op in compile time (#25221)
5 years ago
liu zhengxi 68e93d8a17
Fix beam_search InferShape (#25169)
5 years ago
Chen Weihang 353ea9e8ad
Add default cudnn lib path (#25175)
5 years ago
Leo Chen ff5be2fb77
Refine error message in memory folder (#25095)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Tao Luo 2996315fc9
fix profiler_test on win32 (#25073)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
iducn f282599229
disable unittest for gcc8 (#25134)
5 years ago
tianshuo78520a 1eb9ee242b
delete buddy_allocator_test_data to make repo clean (#25046)
5 years ago
Chen Weihang b23801a262
polish tensor set error message, test=develop (#25113)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
Zhaolong Xing 843581154f
fix emb eltwise layernorm (#24873)
5 years ago
石晓伟 9ab3cf039c
remove useless test_dot, test=develop (#24957)
5 years ago
石晓伟 6783441e70
fix repeat definitions in liengine.cc, test=develop (#25020)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
zlsh80826 479c8834f7
[Paddle-TRT] Fixes #24731, opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Leo Chen 028de857d4
fix dtype error of compare op, test=develop (#25059)
5 years ago
Jeng Bai-Cheng bef4afa6de
bugfix for unique_ptr of IOptimizationProfile (#23917)
5 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
5 years ago
tianshuo78520a 770c11a117
fix make device_context error (#25045)
5 years ago
tangwei12 be6a315fbd
Fix/sync barrier (#25016)
5 years ago
ceci3 8db66fc3f6
fix cos_sim, test=develop (#25017)
5 years ago
Leo Chen 25a4dac4c2
Use allow list instead of white list (#25002)
5 years ago
Zhang Ting 621b638550
improve performance of instance_norm, test=develop (#25005)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
Zhou Wei ff8ca52f88
windows publish package scripts (#24851)
5 years ago
Leo Chen bfa46c38d5
bn supports reverse_space, test=develop (#24988)
5 years ago
wangchaochaohu 613303dbf6
refine the slice Op to improve the performance of xlnet for fp16 training (#24967)
5 years ago
silingtong123 37bdb5269f
test=develop, add log message in the function UpdateDllFlag (#24937)
5 years ago
Chen Weihang d152d7231e
clear old var in scope, test=develop (#24976)
5 years ago
Sylwester Fraczek 53d563a0fe
Reshape transpose matmul coverage (#24970)
5 years ago
wawltor 0eb1b0bc01
Add support the 5d, 6d tensor support for the reduce ops
5 years ago
liuwei1031 8603b5fb72
fix randomly hang issue of PaddleDetection training task on windows (#24977)
5 years ago
silingtong123 640196c446
test=develop, remove the tensorrt dll file from windows package (#24922)
5 years ago
wangchaochaohu feba131893
fix the segment fault error of profiler in seqseq model test=develop (#24952)
5 years ago
Sylwester Fraczek a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free (#24929)
5 years ago
mapingshuo 24e24987f0
fixes the place info in the Print op (#24934)
5 years ago
Aurelius84 6be0ee159e
Support LoDTensorArray in reverse_op (#24797)
5 years ago
Leo Chen 6190023ac9
Refine error message in pybind folder (#24886)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Leo Chen a7cb97a1a5
Fix/isfinite on windows (#24927)
5 years ago
silingtong123 ef9b36873d
test=develop, remove the gflags/gflags.h from paddle_api.h (#24921)
5 years ago
whs 4c01d6d53e
Enhance checking in some operator. (#24473)
5 years ago
Chen Weihang 4a702ef361
Support SelectedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Pei Yang 14b8540551
add default ctor for AnalysisConfig python api. test=develop (#24924)
5 years ago
silingtong123 fc4435174b
test=develop, fix the bug of tensorrt package can't compile on windows (#24860)
5 years ago
lilong12 29de0d97a5
add the support to specify device index for device_guard (#24555)
5 years ago
lilong12 6e10022781
add queue_generator_op, dequeue_op, enqueue_op and ut (#24481)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
石晓伟 76cdbb84b0
ignore warnings of external libraries, test=develop (#24193)
5 years ago
Leo Chen 1e818158f5
Feature/add amp_checkout_finite_and_scale op (#24875)
5 years ago
zhangchunle 576d68083e
generate ci index (#24792)
5 years ago
leesusu a6beb96dd0
FTRL with sparse update, test=develop (#22092)
5 years ago
Chen Weihang 6aae034f8d
add dep for fs.cc, test=develop, test=document_fix (#24881)
5 years ago
Jacek Czaja 40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor (#24756)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Chen Weihang a4f6003404
append try-catch to opbase run, test=develop (#24870)
5 years ago
Michał Gallus 23a85f030c
Remove old mkldnn_elementwise_mul test (#24855)
5 years ago
123malin 9d2bd0ac38
Add try_catch mechanism to downpour_worker and print all program parameters (#24700)
5 years ago
Michał Gallus b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail (#24853)
5 years ago
Zhang Ting 7d0cbfd045
fix negative framework overhead in Profiling Report (#24850)
5 years ago
Yanghello aa47356b74
Add crypto python (#24836)
5 years ago
Leo Chen b67ded04f2
Support gradient accumulation of fp16 in imperative mode (#24823)
5 years ago
Wilber 1e190a9e02
[Inference] [unittest] Inference unit tests rely on dynamic libraries (2) (#24859)
5 years ago
Chen Weihang 0aed095188
The third time to simplify the C ++ error stack (#24831)
5 years ago
Qi Li 704cad6a66
Add histc op (#24562)
5 years ago
Qi Li a125697e2c
fix ut test_recognize_digits (#24794)
5 years ago
Yi Liu 12bffdc086
Enhance error message of checkpoint_notify_op, fake_init_op gen_nccl_id_op and listen_and_serv_op (#24554)
5 years ago
Wojciech Uss 78d4f0cc91
add option to exclude ops by id from quantization (#24689)
5 years ago
Yanghello 62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed (#24816)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
zhangchunle f9c7f58ef6
remove duplicated mac_test build (#24793)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
silingtong123 126d3d693b
support C++ inference shared library on windows (#24672)
5 years ago
Zhen Wang 23d253e1be
Fix out of range error for outs map. test=develop (#24774)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
Aurelius84 a7e21cbed3
Move input_size check into RunTime phase of gru_unit_op and refine error message (#24776)
5 years ago
Zhou Wei f66594a558
fix bug that diag API can't use on Windows(#24762)
5 years ago
Leo Chen c0911fdd32
rename inplace/no_need_buffer inferer, part4, test=develop (#24781)
5 years ago
Chen Weihang be82de4c79
polish two error message, test=develop (#24778)
5 years ago
Yanghello 5a7a517cde
Add crypto api (#24694)
5 years ago
Leo Chen b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop (#24733)
5 years ago
Leo Chen a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop (#24734)
5 years ago
wangchaochaohu 355caee18b
fix conv_transpose Op fp16 error test=develop (#24695)
5 years ago
Chen Weihang 19e5f7879c
Append error op hint for GradOpMaker (#24750)
5 years ago
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
5 years ago
wangchaochaohu 79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop (#24744)
5 years ago
Adam 56a714a19b
Add isCached() mechanism to oneDNN pooling primitive (#24724)
5 years ago
randytli d15fc95ef5
Delete unused Cmake variables(#24707)
5 years ago
Aurelius84 6db0e2b1a9
[Dy2stat] Support len syntax (#24638)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
Wilber dbe2497768
fix mkldnn concat bug. test=develop (#24722)
5 years ago
Leo Chen de8b4f42cf
rename inplace/no_need_buffer inferer, part 1, test=develop (#24711)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
Chen Weihang 7fa9f16c17
Polish reader folder error message (#24698)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Liufang Sang 55b664a131
remove pow to speed up in dequantize_log op (#24607)
5 years ago