Commit Graph

5451 Commits (7f3e6ca59622673c96e62d23973a99d554c19313)

Author SHA1 Message Date
Chen Weihang 3c8daa9b89
Add pin memory control for BufferedReader (#26026)
5 years ago
Chen Weihang ad4a0466a5
Add cuda pinned place branch in slice op GetExpectedKernelType (#26027)
5 years ago
Feiyu Chan e853ece0a2
update document template for unary elementwise layers (#25896)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Leo Chen 5258d53d65
refine unsqueeze, test=develop (#25470)
5 years ago
yaoxuefeng 224620071b
add new flatten op test=develop (#25393)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhong Hui dca56f47f5
fix invalid read of pnorm gradient function
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Zhaolong Xing 5970871a64
add eltwise clip cuda impl. (#25689)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
wawltor a697e94693
Update the code of the compare ops for the broadcast function
5 years ago
wangchaochaohu ff717d5158
Add support for tuple of concat Op test=develop (#25800)
5 years ago
tangwei12 253fd407e8
Fix/distibuted heart beat (#25902)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
Zhang Ting 6486fe8a94
improve GPU performance of transpose, test=develop (#25862)
5 years ago
Zhang Ting 2d24f56a7a
avoid data transfer, test=develop (#25810)
5 years ago
ShenLiang bca303165a
fix inverse bug (#25641)
5 years ago
Aurelius84 e52dae6ef6
Using input.place() in GetExpectedKernel in slice_op (#25595)
5 years ago
wawltor 595a719795
Update the api for the compare_ops
5 years ago
wangchaochaohu 32b9577b2a
refine the split op for API 2.0 test=develop (#25320)
5 years ago
lilong12 ce506930c3
Fix the bug that Input(Offsets) and attr(offsets) cannot be set at the same time. (#24975)
5 years ago
tangwei12 2d9dbd31ad
Fix/mkl dnn (#25835)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
cc 42189be67b
[Quant] Remove the output for moving_average_abs_max_scale op (#25697)
5 years ago
Chen Weihang 23d1228c4d
remove ProgramTranslator.save_inference_model (#25740)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Sylwester Fraczek 1aaa26f102
add dnnl sigmoid (logistic) activation (#25745)
5 years ago
wangchaochaohu 1e4ab728fb
refine the concat Op for API 2.0 test=develop (#25307)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
Leo Chen 4ec1251a1e
Refine squeeze, test=develop (#25281)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
cnn 70cee22fde
New features, add sinh and cosh op, test=develop (#25495)
5 years ago
Zhang Ting a1350744eb
register fp16 kernel, test=develop (#25630)
5 years ago
mapingshuo 5453a912fe
add fp64 support in sequence_pool, test=develop (#25662)
5 years ago
GaoWei8 6e86fd3750
fix concat dimension (#25606)
5 years ago
donproc 95fa383df2
optimize embedding cuda kernel lookup_table_v2,test=develop (#25587)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Zhang Ting 30d1ff3bb4
call cublasGemmStridedBatchedEx when using fp16, test=develop (#25553)
5 years ago
Aurelius84 ca1185d06b
[Dy2Stat] Fix scope in run_program_op (#25579)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
hong e362095e45
fix softmax with cross entropy out of bound; test=develop (#25549)
5 years ago
Huihuang Zheng d8fe517bf8
Add Support for SelectedRows for Transpose OP and Fix a Bug That SelectedRows Cannot be Supported in SimNet (#25536)
5 years ago
Wilber 848aca7ae8
[CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
5 years ago
Shibo Tao 71c71e684c
fix logical_* ops' doc (#25479)
5 years ago
Aurelius84 4717bdbcfb
Fix hang in seq_topk_avg_pooling op (#25522)
5 years ago
LielinJiang 7129f544f0
Add bilateral_slice op (#25401)
5 years ago
Zhang Ting ca725c82f2
improve fp16 performance of slice_grad, test=develop (#25523)
5 years ago
yaoxuefeng 5d3766ff3d
modify flip test=develop (#25312)
5 years ago
Chen Weihang 41d2247275
[Dy2static] Refactor ProgramTranslator save_inference_model API (#24989)
5 years ago
yaoxuefeng aaa7cbd56f
modify trace api test=develop (#25397)
5 years ago
Huihuang Zheng f9ac5fb992
[Dy2stat] Fix Memory Optimization in run_program_op and Add SimNet as Unit Test (#25383)
5 years ago
yaoxuefeng c42d662e2a
modify roll test=develop (#25321)
5 years ago
Zhen Wang 548cdbc544
Quantization-aware training for dygraph (#24634)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
zlsh80826 e528392de9
[Paddle-TRT] SkipLayernorm vectorized memory optimization (#25117)
5 years ago
zhupengyang 5b573c58e2
randperm API: remove out, devive, stop_gradient; add name (#25410)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
Jacek Czaja 050a9bf79d
[oneDNN] LRN cleanup (#25416)
5 years ago
GaoWei8 1974aadcf0
fix concat shape error (#25414)
5 years ago
tangwei12 4b3778a3ee
Revert/barrier for sync (#25417)
5 years ago
ceci3 52be62c5ae
fix instance norm in dy (#24717)
5 years ago
zhupengyang eb3173e2b6
rand API: remove out, device, stop_gradient; add name (#25246)
5 years ago
zhupengyang 6de75082cb
fix test_hsigmoid windows ci (#25311)
5 years ago
WuHaobo f593c3fb2f
fix the formula of floor OP and ceil OP (#25292)
5 years ago
Zhang Ting bc7610583b
use eval() to improve CPU performance (#25243)
5 years ago
Kaipeng Deng 74468bf428
add mish op. (#24565)
5 years ago
Yang Zhang 6d6efafeeb
Add `matrix_nms_op` (#24400)
5 years ago
Chengmo e85fcaa712
Fix fluid.embedding in Distributed Training (#25174)
5 years ago
Yiqun Liu c00f827843
Avoid data transforming ShapeTensor from CPU to GPU in fill_constant op. (#25267)
5 years ago
123malin f1a9593d69
test=develop, bug fix for index_select and roll op (#25251)
5 years ago
FDInSky c2e072587c
test=develop fix generate_proposals's error (#25227)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
liym27 1458cc0c68
Fix bug: Don't check dims if contain_unknown_dim of cross_entropy_grad_op in compile time (#25221)
5 years ago
liu zhengxi 68e93d8a17
Fix beam_search InferShape (#25169)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
zlsh80826 479c8834f7
[Paddle-TRT] Fixes #24731, opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834)
5 years ago
Leo Chen 028de857d4
fix dtype error of compare op, test=develop (#25059)
5 years ago
tianshuo78520a 770c11a117
fix make device_context error (#25045)
5 years ago
tangwei12 be6a315fbd
Fix/sync barrier (#25016)
5 years ago
ceci3 8db66fc3f6
fix cos_sim, test=develop (#25017)
5 years ago
Zhang Ting 621b638550
improve performance of instance_norm, test=develop (#25005)
5 years ago
wangchaochaohu 613303dbf6
refine the slice Op to improve the performance of xlnet for fp16 training (#24967)
5 years ago
Chen Weihang d152d7231e
clear old var in scope, test=develop (#24976)
5 years ago
wawltor 0eb1b0bc01
Add support the 5d, 6d tensor support for the reduce ops
5 years ago
mapingshuo 24e24987f0
fixes the place info in the Print op (#24934)
5 years ago
Aurelius84 6be0ee159e
Support LoDTensorArray in reverse_op (#24797)
5 years ago
Leo Chen a7cb97a1a5
Fix/isfinite on windows (#24927)
5 years ago
whs 4c01d6d53e
Enhance checking in some operator. (#24473)
5 years ago
lilong12 6e10022781
add queue_generator_op, dequeue_op, enqueue_op and ut (#24481)
5 years ago
Leo Chen 1e818158f5
Feature/add amp_checkout_finite_and_scale op (#24875)
5 years ago
leesusu a6beb96dd0
FTRL with sparse update, test=develop (#22092)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Michał Gallus 23a85f030c
Remove old mkldnn_elementwise_mul test (#24855)
5 years ago
Leo Chen b67ded04f2
Support gradient accumulation of fp16 in imperative mode (#24823)
5 years ago
Qi Li 704cad6a66
Add histc op (#24562)
5 years ago
Yi Liu 12bffdc086
Enhance error message of checkpoint_notify_op, fake_init_op gen_nccl_id_op and listen_and_serv_op (#24554)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
Aurelius84 a7e21cbed3
Move input_size check into RunTime phrase of gru_unit_op and refine error message (#24776)
5 years ago
Zhou Wei f66594a558
fix bug that diag API can't use on Windows(#24762)
5 years ago
Leo Chen c0911fdd32
rename inplace/no_need_buffer inferer, part4, test=develop (#24781)
5 years ago
Chen Weihang be82de4c79
polish two error message, test=develop (#24778)
5 years ago
Leo Chen b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop (#24733)
5 years ago
Leo Chen a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop (#24734)
5 years ago
wangchaochaohu 355caee18b
fix conv_transpose Op fp16 error test=develop (#24695)
5 years ago
Adam 56a714a19b
Add isCached() machinism to oneDNN pooling primitive (#24724)
5 years ago
Aurelius84 6db0e2b1a9
[Dy2stat] Support len syntax (#24638)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
Wilber dbe2497768
fix mkldnn concat bug. test=develop (#24722)
5 years ago
Leo Chen de8b4f42cf
rename inplace/no_need_buffer inferer, part 1, test=develop (#24711)
5 years ago
Chen Weihang 7fa9f16c17
Polish reader folder error message (#24698)
5 years ago
Liufang Sang 55b664a131
remove pow to speed up in dequantize_log op (#24607)
5 years ago
Jacek Czaja ca68b13f85
[oneDNN] Fix to elementwise_add grad (#24639)
5 years ago
Adam 586b587519
Add isCached() check in Softmax handler (#24637)
5 years ago
jinyuKING 6ca44cbaa3
API/OP(sequence_first_step, sequence_last_step, sequence_mask, beam_search, beam_search_decode) error message enhancement (#24590)
5 years ago
ShenLiang 837dd47a96
Add lod in gather/scatter (#24613)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Guo Sheng fe8d006f7e
API/OP(sequence_expand_as) error message enhancement (#23712)
5 years ago
yaoxuefeng 16817c70fc
OP(datanorm lookupsparsetable lookuptable) error message enhancement (#24506)
5 years ago
liu zhengxi fff9faae72
API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement (#24513)
5 years ago
wangchaochaohu c4dd596de6
cusum op optimization for GPU kernel (#24321)
5 years ago
Leo Chen d43e4047ff
refine err msg, test=develop (#24550)
5 years ago
liym27 615a8bfcf4
Support int32 int64 and fix bug (#24407)
5 years ago
lilong12 a13a4dbc7b
Improving error reporting messages for ops (#24438)
5 years ago
123malin 45ef6ff338
test=develop, error info improvement (#24496)
5 years ago
Adam dcf17f4813
Add isCached() mechanism to elementwise_add DNNL (#24563)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jiawei Wang 4a105f803e
SamplingID Op fix error print (#24521)
5 years ago
ShenLiang 53e3c53423
fix error message, test=develop (#24425)
5 years ago
ruri ea2c498712
Fix err message (#24507)
5 years ago
Wilber 4ec7287602
fix compile when with_nccl=off. test=develop (#24444)
5 years ago
chajchaj e5a624935f
Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op (#24485)
5 years ago
FDInSky 526a211774
update conv error info (#24430)
5 years ago
Yibing Liu a9520db24e
Format error message for ops (#24482)
5 years ago
Yang Zhang 2644cb819d
Remove extraneous comma in error messages (#24478)
5 years ago
suytingwan 05c9642dc3
Update paddle enforce message (#24498)
5 years ago
lijianshe02 9f83f0fe69
API/OP (group_norm, layer_norm, random_crop, unpool) error message enhancement (#24413)
5 years ago
hutuxian 7f85dfeb18
Upgrade Error Message for AucOP & MultiplexOP (#24458)
5 years ago
WangXi 027f9953bc
Optimize error message, include dgc, nccl, size op (#24456)
5 years ago
gongweibao f1c57d648c
Enhance error message of prefetch_op, proximal_adagrad_op, proximal_gd_op (#24436)
5 years ago
danleifeng d98e1182a1
fix check and error message for flatten hash is_empty op (#24434)
5 years ago
liu zhengxi 417b576c34
API(dynamic_lstm, dynamic_lstmp) error message enhancement (#24450)
5 years ago
wangchaochaohu 53bdee64e6
add tensor support for gaussian_random_op test=develop (#24389)
5 years ago
qingqing01 da4a1db7bb
Refine error message in some OPs (#24443)
5 years ago
Xing Wu 479c47f316
fix error info for transpose sequence_conv_pool max_sequence_len sequ… (#24437)
5 years ago