Commit Graph

17268 Commits (36027490d8d6935ee3695dca1a751da86e34b693)

Author SHA1 Message Date
zlsh80826 479c8834f7
[Paddle-TRT] Fixes #24731, opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Leo Chen 028de857d4
fix dtype error of compare op, test=develop (#25059)
5 years ago
Jeng Bai-Cheng bef4afa6de
bugfix for unique_ptr of IOptimizationProfile (#23917)
5 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
5 years ago
tianshuo78520a 770c11a117
fix make device_context error (#25045)
5 years ago
tangwei12 be6a315fbd
Fix/sync barrier (#25016)
5 years ago
ceci3 8db66fc3f6
fix cos_sim, test=develop (#25017)
5 years ago
Leo Chen 25a4dac4c2
Use allow list instead of white list (#25002)
5 years ago
Zhang Ting 621b638550
improve performance of instance_norm, test=develop (#25005)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
Zhou Wei ff8ca52f88
windows publish package scripts (#24851)
5 years ago
Leo Chen bfa46c38d5
bn supports reverse_space, test=develop (#24988)
5 years ago
wangchaochaohu 613303dbf6
refine the slice Op to improve the performance of xlnet for fp16 training (#24967)
5 years ago
silingtong123 37bdb5269f
test=develop, add log message in the function UpdateDllFlag (#24937)
5 years ago
Chen Weihang d152d7231e
clear old var in scope, test=develop (#24976)
5 years ago
Sylwester Fraczek 53d563a0fe
Reshape transpose matmul coverage (#24970)
5 years ago
wawltor 0eb1b0bc01
Add support the 5d, 6d tensor support for the reduce ops
5 years ago
liuwei1031 8603b5fb72
fix randomly hang issue of PaddleDetection training task on windows (#24977)
5 years ago
silingtong123 640196c446
test=develop, remove the tensorrt dll file from windows package (#24922)
5 years ago
wangchaochaohu feba131893
fix the sgement fault error of profiler in seqseq model test=develop (#24952)
5 years ago
Sylwester Fraczek a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free (#24929)
5 years ago
mapingshuo 24e24987f0
fixes the place info in the Print op (#24934)
5 years ago
Aurelius84 6be0ee159e
Support LoDTensorArray in reverse_op (#24797)
5 years ago
Leo Chen 6190023ac9
Refine error message in pybind folder (#24886)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Leo Chen a7cb97a1a5
Fix/isfinite on windows (#24927)
5 years ago
silingtong123 ef9b36873d
test=develop, remove the gflags/gflags.h form paddle_api.h (#24921)
5 years ago
whs 4c01d6d53e
Enhance checking in some operator. (#24473)
5 years ago
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Pei Yang 14b8540551
add default ctor for AnalysisConfig python api. test=develop (#24924)
5 years ago
silingtong123 fc4435174b
test=develop, fix the bug of tensorrt package can't compile on windows (#24860)
5 years ago
lilong12 29de0d97a5
add the support to specify device index for device_guard (#24555)
5 years ago
lilong12 6e10022781
add queue_generator_op, dequeue_op, enqueue_op and ut (#24481)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
石晓伟 76cdbb84b0
ignore warnings of external libraries, test=develop (#24193)
5 years ago
Leo Chen 1e818158f5
Feature/add amp_checkout_finite_and_scale op (#24875)
5 years ago
zhangchunle 576d68083e
generate ci index (#24792)
5 years ago
leesusu a6beb96dd0
FTRL with sparse update, test=develop (#22092)
5 years ago
Chen Weihang 6aae034f8d
add dep for fs.cc, test=develop, test=document_fix (#24881)
5 years ago
Jacek Czaja 40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor (#24756)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Chen Weihang a4f6003404
append try-catch to opbase run, test=develop (#24870)
5 years ago
Michał Gallus 23a85f030c
Remove old mkldnn_elementwise_mul test (#24855)
5 years ago
123malin 9d2bd0ac38
downpour_worker增加try_catch机制,打印program所有参数 (#24700)
5 years ago
Michał Gallus b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail (#24853)
5 years ago
Zhang Ting 7d0cbfd045
fix negative framework overhead in Profiling Report (#24850)
5 years ago
Yanghello aa47356b74
Add crypto python (#24836)
5 years ago
Leo Chen b67ded04f2
Support gradient accumulation of fp16 in imperative mode (#24823)
5 years ago
Wilber 1e190a9e02
[Inference] [unittest] Inference unit tests rely on dynamic libraries (2) (#24859)
5 years ago
Chen Weihang 0aed095188
The third time to simplify the C ++ error stack (#24831)
5 years ago
Qi Li 704cad6a66
Add histc op (#24562)
5 years ago
Qi Li a125697e2c
fix ut test_recognize_digits (#24794)
5 years ago
Yi Liu 12bffdc086
Enhance error message of checkpoint_notify_op, fake_init_op gen_nccl_id_op and listen_and_serv_op (#24554)
5 years ago
Wojciech Uss 78d4f0cc91
add option to exclude ops by id from quantization (#24689)
5 years ago
Yanghello 62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed (#24816)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
zhangchunle f9c7f58ef6
remove duplicated mac_test build (#24793)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
silingtong123 126d3d693b
support C++ inference shared library on windows (#24672)
5 years ago
Zhen Wang 23d253e1be
Fix out of range error for outs map. test=develop (#24774)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
Aurelius84 a7e21cbed3
Move input_size check into RunTime phrase of gru_unit_op and refine error message (#24776)
5 years ago
Zhou Wei f66594a558
fix bug that diag API can't use on Windows(#24762)
5 years ago
Leo Chen c0911fdd32
rename inplace/no_need_buffer inferer, part4, test=develop (#24781)
5 years ago
Chen Weihang be82de4c79
polish two error message, test=develop (#24778)
5 years ago
Yanghello 5a7a517cde
Add crypto api (#24694)
5 years ago
Leo Chen b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop (#24733)
5 years ago
Leo Chen a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop (#24734)
5 years ago
wangchaochaohu 355caee18b
fix conv_transpose Op fp16 error test=develop (#24695)
5 years ago
Chen Weihang 19e5f7879c
Append error op hint for GradOpMaker (#24750)
5 years ago
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
5 years ago
wangchaochaohu 79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop (#24744)
5 years ago
Adam 56a714a19b
Add isCached() machinism to oneDNN pooling primitive (#24724)
5 years ago
randytli d15fc95ef5
Delete unused Cmake variables(#24707)
5 years ago
Aurelius84 6db0e2b1a9
[Dy2stat] Support len syntax (#24638)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
Wilber dbe2497768
fix mkldnn concat bug. test=develop (#24722)
5 years ago
Leo Chen de8b4f42cf
rename inplace/no_need_buffer inferer, part 1, test=develop (#24711)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
Chen Weihang 7fa9f16c17
Polish reader folder error message (#24698)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Liufang Sang 55b664a131
remove pow to speed up in dequantize_log op (#24607)
5 years ago
tianshuo78520a 37ae661ca1
conda install cuda10 cudnn7.6 update (#24665)
5 years ago
Jacek Czaja ca68b13f85
[oneDNN] Fix to elementwise_add grad (#24639)
5 years ago
Adam 586b587519
Add isCached() check in Softmax handler (#24637)
5 years ago
Leo Chen 9c9e635c00
support tensor to varbase, test=develop (#24660)
5 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
5 years ago
Leo Chen 14dd6388c5
fix bug of varbase.__getitem__, test=develop (#24642)
5 years ago
jinyuKING 6ca44cbaa3
API/OP(sequence_first_step, sequence_last_step, sequence_mask, beam_search, beam_search_decode) error message enhancement (#24590)
5 years ago
Wilber ba2f8f0ce4
fix embedding_eltwise_layernorm_fuse_pass. test=develop (#24592)
5 years ago
Huihuang Zheng f831f61216
Make buddy_allocator_test Exclusive (#24636)
5 years ago
hutuxian 0ec3a42e97
Random Dump (#24477)
5 years ago
Leo Chen 1d03469685
use vector instead of pointer, test=develop (#24620)
5 years ago
Yiqun Liu 6b464f969a
Add an operator node in unittest to make the fusing result unique. (#24617)
5 years ago
ShenLiang 837dd47a96
Add lod in gather/scatter (#24613)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
Guo Sheng fe8d006f7e
API/OP(sequence_expand_as) error message enhancement (#23712)
5 years ago
Leo Chen d980d251f0
specify outs, test=develop (#24537)
5 years ago
yaoxuefeng 16817c70fc
OP(datanorm lookupsparsetable lookuptable) error message enhancement (#24506)
5 years ago
liu zhengxi fff9faae72
API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement (#24513)
5 years ago
wangchaochaohu c4dd596de6
cusum op optimization for GPU kernel (#24321)
5 years ago
Leo Chen d43e4047ff
refine err msg, test=develop (#24550)
5 years ago
liym27 615a8bfcf4
Support int32 int64 and fix bug (#24407)
5 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
5 years ago
lilong12 a13a4dbc7b
Improving error reporting messages for ops (#24438)
5 years ago
123malin 45ef6ff338
test=develop, error info improvement (#24496)
5 years ago
Adam dcf17f4813
Add isCached() mechanism to elementwise_add DNNL (#24563)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jiawei Wang 4a105f803e
SamplingID Op fix error print (#24521)
5 years ago
ShenLiang 53e3c53423
fix error message, test=develop (#24425)
5 years ago
ruri ea2c498712
Fix err message (#24507)
5 years ago
Wilber 4ec7287602
fix compile when with_nccl=off. test=develop (#24444)
5 years ago
chajchaj e5a624935f
Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op (#24485)
5 years ago
FDInSky 526a211774
update conv error info (#24430)
5 years ago
Yibing Liu a9520db24e
Format error message for ops (#24482)
5 years ago
Yang Zhang 2644cb819d
Remove extraneous comma in error messages (#24478)
5 years ago
suytingwan 05c9642dc3
Update paddle enforce message (#24498)
5 years ago
lijianshe02 9f83f0fe69
API/OP (group_norm, layer_norm, random_crop, unpool) error message enhancement (#24413)
5 years ago
hutuxian 7f85dfeb18
Upgrade Error Message for AucOP & MultiplexOP (#24458)
5 years ago
WangXi 027f9953bc
Optimize error message, include dgc, nccl, size op (#24456)
5 years ago
gongweibao f1c57d648c
Enhance error message of prefetch_op, proximal_adagrad_op, proximal_gd_op (#24436)
5 years ago
danleifeng d98e1182a1
fix check and error message for flatten hash is_empty op (#24434)
5 years ago
liu zhengxi 417b576c34
API(dynamic_lstm, dynamic_lstmp) error message enhancement (#24450)
5 years ago
wangchaochaohu 53bdee64e6
add tensor support for gaussian_random_op test=develop (#24389)
5 years ago
qingqing01 da4a1db7bb
Refine error message in some OPs (#24443)
5 years ago
Xing Wu 479c47f316
fix error info for transpose sequence_conv_pool max_sequence_len sequ… (#24437)
5 years ago
WuHaobo 1c00732dd0
Polish PADDLE_ENFORCE of unfold_op (#24423)
5 years ago
MRXLT 71ff32b65d
update error message for unstack op and lamb op; test=develop (#24439)
5 years ago
ForFishes 05c3bc3bf6
fix error message, test=develop (#24447)
5 years ago
ceci3 633aaea07e
Update err message (#24422)
5 years ago
Chen Weihang 05d20e5793
API/OP (Some SL API) error message enhancement (#24441)
5 years ago
tangwei12 a97d5a6153
fix op error, test=develop (#24451)
5 years ago
Yang Zhang 7c17ed57e0
Refine `squared_l2_distance_grad` error message (#24409)
5 years ago
huangjun12 046b7ebc03
refine Error message of modified_huber_loss (#24417)
5 years ago
zhang wenhui 621a4085b9
enhance cvm bpr_loss adam adagrad adamax ftrl error message, test=develop (#24452)
5 years ago
Li Fuchen e58619295e
OP(rank_loss, similarity_focus, squeeze) error message enhancement (#24448)
5 years ago
wangguanzhong abb49df26f
Enhance yolo_box & yolov3_loss (#24370)
5 years ago
mapingshuo d1bb76a2f7
fix error log, test=develop (#24419)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
5 years ago
Bai Yifan f1c4c14c7a
Fix fsp_op error message,test=develop (#24405)
5 years ago
Jacek Czaja d0307145a3
[oneDNN] disabling oneDNN inplace pass (#24406)
5 years ago
Kaipeng Deng 6435d1b4b8
fix PADDLE_THROW in spectral_norm_op.h. test=develop (#24414)
5 years ago
wangguanzhong cd327e66df
optimize error message, test=develop (#24420)
5 years ago
FDInSky 7fedf26b87
add linear interpolate operator (#23357)
5 years ago
Liufang Sang e24575c8ba
fix error message in accuracy_op.h test=develop (#24416)
5 years ago
wawltor 2de5075a6d
Fix the elementwise ops in broadcast in the process of backward (#24319)
5 years ago
Wojciech Uss fddf442470
add batch size to the mkldnn matmul cache key (#24408)
5 years ago
Tao Luo 72c370c8d2
remove unused test_multi_thread_helper.h (#24399)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
qingqing01 43625bdabd
Add a high-level API with traning and inference into Paddle. (#24293)
5 years ago
Tao Luo 63da846de0
remove old inference C++ tests (#24368)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Pei Yang 9c073bbd53
skip pyramid_hash on jetson platforms to avoid compile error. test=develop (#24371)
5 years ago
guofei 70bc488979
Fix the error of recurrnet op in multithreading in eval process (#24357)
5 years ago
wangchaochaohu 01e45a06f6
add dependency for conda test=develop (#24318)
5 years ago
hong 67f66f0904
Fix get item out of range error (#24339)
5 years ago
Wojciech Uss db052009c7
Enabled quantize all and skip missing in QAT (#24281)
5 years ago
Aurelius84 1ed8baf9c8
[dy2static] Support for static graph training with @declarative decorator (#24259)
5 years ago
Huihuang Zheng 8a1a2af82e
Add Assert Op (#24280)
5 years ago
Pei Yang 8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop (#24329)
5 years ago
Guo Sheng 4a5de14426
Remove cusolver potrfBatched support on Windows. (#24338)
5 years ago
Tao Luo c1df7048c7
add UT for mkldnn_cache_capacity (#24336)
5 years ago
Feiyu Chan 420707c26e
API/OP (margin_rank_loss, nce, row_conv, positive_negative_pair) erro… (#24246)
5 years ago
joanna.wozna.intel 356f5ee220
[Refactoring] Unify op-dequant squashes (#24277)
5 years ago
liym27 ac9a7eeea4
[Dy2Stat]Support list pop (#24250)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
Zhang Ting b71abeee1b
use 32 bit index to improve activation ops (#24206)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
Guo Sheng 1fc6cc502a
Fix cusolver loader for Windows (#24157)
5 years ago
wawltor d1e1d85881
add the graph batch reader for pslib mode (#24178)
5 years ago
liym27 803559499d
[Dy2Stat]Support LoDTensorArray for slice op (#23091)
5 years ago
Feiyu Chan 78cd3dd507
fix kron_op: when only one input needs gradient, test=develop (#24269)
5 years ago
Li Fuchen 5dc069d050
OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement (#24261)
5 years ago
liym27 19511dfa25
Fix bug in assign op: support to infer shape for LOD_TENSOR_ARRAY. (#24268)
5 years ago
joanna.wozna.intel b43b46e619
[INT8] Add requant-op squash (#24143)
5 years ago
石晓伟 17ac6e2580
update the analysis predictor for multi-stream support, test=develop (#24046)
5 years ago
hutuxian 3e2bc8715f
Try to fix UT Random Fail (#24223)
5 years ago
lidanqing 61ec30f030
Update QAT INT8 2.0 doc (#24127)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
Tao Luo d7850dcdc0
add noavx_axpy and noavx_axpy_noadd (#24207)
5 years ago
Huihuang Zheng 3047b69f94
Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure (#24205)
5 years ago
Chen Weihang 9b851ba216
[dy2static] Add print transformer and unify print format (#24068)
5 years ago
Li Fuchen 077e5a0fe5
Add trace op (#23873)
5 years ago
wangchaochaohu fa43d74a3a
fix the intermediate node of graph for fusion group test=develop (#24184)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
HappyAngel 8ea325aa53
c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop (#23631)
5 years ago
Tao Luo e3179ea2f5
refine ccache statistics show (#24167)
5 years ago
ShenLiang 0fb9b208ab
Add batch_fc op in contrib (#24017)
5 years ago
liuwei1031 9a93f6aae0
improve efficiency of runtime InferVarType (#22778)
5 years ago
hong bfb60efb1f
change compare forece_cpu default value; test=develop (#23888)
5 years ago
wangchaochaohu 2270864019
Fusion group optimize for cuda codegen(#23940)
5 years ago
ShenLiang 94dfb7d770
opt the postprocess, test=develop (#24155)
5 years ago
Chengmo 28a558e867
Update index sample (#24109)
5 years ago
Zhang Ting ab8f8fa70d
fix example code, test=develop, test=document_fix (#24139)
5 years ago
Feiyu Chan e01262e691
add kron op (#24105)
5 years ago
Jacek Czaja eb411613e9
[DNNL] refine activations Inplace support (#24145)
5 years ago
wangchaochaohu 6bf26ef156
fix warning mac compiler (#24138)
5 years ago