Commit Graph

17735 Commits (263a9e97fd02489a8b3d3006417c120df6021e2a)

Author SHA1
Message
Date
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
4 years ago
Qi Li b8d2a021f0
fix ut error of test_recognize_digits, test=develop (#27791)
4 years ago
Jacek Czaja 631c1f3018
- Fix to 27398 (#27770)
4 years ago
Feiyu Chan 0a7bab4e34
fix error mesage for negative_positive_pair_op and nce_op (#27779)
4 years ago
zhupengyang 395cb561aa
refine logsumexp error message and docs (#27713)
4 years ago
smallv0221 057e28bc8f
API(lstm_unit, lstmp, sequence_mask, sequence_enumerate, sequence_conv) error message enhancement (#27572)
4 years ago
Jacek Czaja 606611d351
[oneDNN] GRU BF16 kernel (#27731)
4 years ago
xiemoyuan 6c1acf34ed
Optimize the error message for OP (#27617)
4 years ago
cc ec7d11a492
refine fused_elemwise_activation error message (#27734)
4 years ago
Zhen Wang 365c2c9c89
fix error message showing in UpdateLossScalingOp (#27596)
4 years ago
LielinJiang 9089841b6e
Fix bilateral inference shape bug (#26822)
4 years ago
Yiqun Liu 65207b4560
Polish the error message of fc, fused_fc_elementwise_layernorm and fused_embedding_seq_pool. (#27692)
4 years ago
Wojciech Uss f399bed8d9
Add an option to set number of warmup iterations (#27739)
4 years ago
Jacek Czaja b9fda2ff09
Fix to issue #25537 (#27546)
4 years ago
Wojciech Uss 966447e338
Added support for quantization of fusion_gru (#27518)
4 years ago
joanna.wozna.intel 0cd4907eba
Add avx512 core instructions check (#27732)
4 years ago
hong19860320 7a96d5788d
Optimize the error messages of the CUDA implementation of activation ops (#27741)
4 years ago
tangwei12 fd616fadc2
repen heartbeat ut (#27684)
4 years ago
Qi Li f373269df0
update histogram op for performance optimization, test=develop (#24912)
4 years ago
tianshuo78520a 4d5ddbf106
add xpu test (#27622)
4 years ago
MRXLT 20fb01fb00
fix distributed error info (#27206)
4 years ago
pangyoki 7cd2c13f1b
add multinomial op (#27219)
4 years ago
Zhang Ting d2369dd91f
modify docs of CPUPlace and CUDAPinnedPlace, test=document_fix (#27587)
4 years ago
iducn 7c69e36131
add pip new requirements to windows (#27697)
4 years ago
Wojciech Uss 42d175385d
Add support for (de/re)quantization with shift (#27481)
4 years ago
123malin cc780b1977
test=develop, optimize geo communicator (#26857)
4 years ago
Pei Yang 8a4f85feb9
Add unittests and OP version registry for quant_conv2d_dequant_fuse_pass (#27689)
4 years ago
yukavio 7b46fb0f14
fix generate_proposals and affine grid error info (#27636)
4 years ago
Chen Weihang b14ecb8632
Polish api BuildStrategy/ExecutionStrategy doc & code example (#27662)
4 years ago
AshburnLee c3a3df6466
Add cuda support for unique op (#27646)
4 years ago
lilong12 bbc2add703
Initialize gloo for low level collective apis (#27672)
4 years ago
wawltor 29f4922906
optimize the error meesage for detetion_map_op
4 years ago
whs daf5aa9b8b
Fix round in grid sample op (#27657)
4 years ago
arlesniak 0ecf441af1
Add support for mkldnn ops types selection with FLAGS in dygraph (#27482)
4 years ago
Wilber 2bc70ab2e2
Fix lite_resnet50 unit test. (#27611)
4 years ago
ysh329 2f9cdd9038
API/OP clip_by_norm_op error message enhancement. test=develop (#27614)
4 years ago
yongqiangma aac57159c9
enhance array_to_lod_tensor_op lod_tensor_to_array_op errors informaiton (#27386)
4 years ago
lilong12 36c0410223
Revert "Initialize gloo for low level collective apis (#27356)", test=document_fix (#27665)
4 years ago
xiemoyuan 99e3337368
Optimize the error message of OP. (#27478)
4 years ago
ShenLiang e8f873df88
optimize the speed&memory of matmul op (#27610)
4 years ago
Pei Yang ae6e40a7fd
Add unittests and OP version registry for tensorrt_subgraph_pass (#27544)
4 years ago
tangwei12 9704582eef
fix op error (#27599)
4 years ago
wanghuancoder c68a0313a5
add paddle.fluid._cuda_synchronize (#27595)
4 years ago
yaoxuefeng c9a8801325
enhance error messages of lookup_tale, merge_ids, data_norm (#27619)
4 years ago
whs 9cc5603d56
Make grid support stopping graients. (#27630)
4 years ago
liym27 074a71bd25
Support assignment to a Variable in dynamic mode but not deal with backward. (#27471)
4 years ago
lilong12 5218b7af6b
add ncclSend and ncclRecv (#27621)
4 years ago
lilong12 fa73e4a284
Initialize gloo for low level collective apis (#27356)
4 years ago
furnace d01f626944
update mv op according PR#27024 (#27474)
4 years ago
Double_V 9d783aeddd
Error message opt, test=develop (#27467)
4 years ago
Li Fuchen 1501a80f74
add support to float64 input of warpctc op. (#27399)
4 years ago
QingshuChen 6b727e08b1
support elementwise add, activation, matmul on Baidu Kunlun (#27143)
4 years ago
Jack Zhou d37b3774fd
register log double grad kernel for cpu and cuda
4 years ago
Chengmo d014e29fc6
fix error message (#27318)
4 years ago
Leo Chen 35074963e3
Refine error msg in paddle/fluid/framework/details [part 2] (#27429)
4 years ago
Chengmo 0e101c4f6f
Fix test dist fleet heter ctr (#27513)
4 years ago
Zhong Hui a85592bcbf
fix cpplint error for the autmic max/min
4 years ago
joanna.wozna.intel b0ee1405f7
Add conv2d bfloat16 support (#27325)
4 years ago
Leo Chen a5b3263782
Refine error msg in paddle/fluid/imperative (#27521)
4 years ago
chalsliu 09f1953296
Revert "Disable ut quickly."
4 years ago
Thunderbrook 6f69a4cb05
add xpu in heter mode (#27000)
4 years ago
ceci3 8daccc9ea7
Fix batch norm double grad compute (#27549)
4 years ago
ShenLiang 6fc74bbaf6
add fp16 for matmul (#27523)
4 years ago
Zhong Hui fab4e6d08f
add abs support double grad
4 years ago
GaoWei8 36ed83d270
Refine PADDLE_ENFORCE (#27360)
4 years ago
liym27 effd51b6be
Fix error message in operator/utils.h (#27532)
4 years ago
Leo Chen 6bb02e8e3c
increase retry time (#27553)
4 years ago
Shang Zhizhou 77a36f8997
[buf fix]:fix some unittests error (#27540)
4 years ago
Zhong Hui 597345d17b
fix cuda atomic for ARCH<350 for the automic_max
4 years ago
WangXi e550fc02ae
fleet2.0 add fp16 grad compression (#27480)
4 years ago
cc c5c13473c6
Add compatibility check for four mkldnn pass (#27364)
4 years ago
mapingshuo c83ade6d6b
add AsDuplicable for sync_comm op(#27515)
4 years ago
Zhou Wei d20349b548
add unittest count ,install check on windows (#27492)
4 years ago
Wilber 3d5522146e
register seq_concat_fc_fuse pass. (#27479)
4 years ago
Wilber df7fabeedc
Fix memory leak for mkldnn. (#27493)
4 years ago
ruri b7319ef518
fix err msg in pixel shuffle op (#27503)
4 years ago
Kaipeng Deng d7f422c984
fix error message in conv/conv_transpose. test=develop (#27464)
4 years ago
Wilber ec4155d7d0
windows lib size crop from 5.4G to 3.9G (#27477)
4 years ago
ruri e1fb77d123
[2.0RC]refine error message in shuffle channel OP (#27505)
4 years ago
Aurelius84 f91c37e665
Refine error message of MatchMatrix and PyramidHash (#27484)
4 years ago
Shibo Tao 8f7bb52bd2
fix tensorrt 6 build error. test=develop (#27511)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
chalsliu 29f1560d8f
Disable ut quickly.
4 years ago
wangchaochaohu dc713116e0
refine the error message for bath size like OP (#27446)
4 years ago
Zhong Hui 4a9d21de49
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
4 years ago
YUNSHEN XIE 66951ab2ea
modified timeout value for 4 ut (#27462)
4 years ago
Shang Zhizhou c17f9cf25f
[bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436)
4 years ago
Zhou Wei 1e1ae5c54d
Make the Bind Method of Tensor more automatic (#27270)
4 years ago
LutaoChu 5508c78744
Fix bug: The calculation result of Diag_v2 Op under large size input is wrong (#27447)
4 years ago
tangwei12 bc5f0246a8
large scale kv speedup (#26510)
4 years ago
Qi Li d7b7dcd10e
fix cmake dependencies of test_recognize_digits, test=develop (#27475)
4 years ago
Zhou Wei 292b24aa6d
fix bug MD of compile, And add MD/STATIC/OPENBLAS inference lib check on windows (#27051)
4 years ago
Chen Weihang 41b5955538
Polish no onwer ops error message (#27448)
4 years ago
Zhang Ting 906e7f921e
add fuse_bn_act op (#27230)
4 years ago
Wilber 5034d181f3
update for 2.0 inference api. (#27473)
4 years ago
Chen Weihang 765064476b
Polish some lost invalid error message (#27445)
4 years ago
wangchaochaohu 76fb95fe76
avoid data transform for linspace OP (#27444)
4 years ago
123malin a04524759e
Enhance Op's Error Message (#27455)
4 years ago
wangchaochaohu 0a862fd356
refine the precious of linspace Op using half way (#27452)
4 years ago
Pei Yang fda54c0212
errmsg refine of trt plugin (#27309)
4 years ago