Chengmo
d014e29fc6
fix error message ( #27318 )
...
* fix sgd/momentum/dpsgd/rmsprop error message
4 years ago
Leo Chen
35074963e3
Refine error msg in paddle/fluid/framework/details [part 2] ( #27429 )
...
* refine broadcast_op_handle
* refine some error messages
* refine some files
* fix bug
* fix bug
* fix bug
* follow comments
* follow comments
4 years ago
Chengmo
0e101c4f6f
Fix test dist fleet heter ctr ( #27513 )
...
* fix test_dist_fleet_heter_ctr & peformance update
4 years ago
Zhong Hui
a85592bcbf
fix cpplint error for the autmic max/min
...
fix cpplint error for the autmic max/min
4 years ago
joanna.wozna.intel
b0ee1405f7
Add conv2d bfloat16 support ( #27325 )
4 years ago
Leo Chen
a5b3263782
Refine error msg in paddle/fluid/imperative ( #27521 )
...
* refine err msg
* follow comments
4 years ago
chalsliu
09f1953296
Revert "Disable ut quickly."
...
This reverts commit 29f1560d8f
.
4 years ago
Thunderbrook
6f69a4cb05
add xpu in heter mode ( #27000 )
...
* add xpu in heter mode
test=develop
* BOOST_CONST_GET; PADDLE_THROW
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* refine
test=develop
* refine
test=develop
* refine
test=develop
* refine code
test=develop
4 years ago
ceci3
8daccc9ea7
Fix batch norm double grad compute ( #27549 )
...
* fix bn double grad, test=develop
* update, test=develop
4 years ago
ShenLiang
6fc74bbaf6
add fp16 for matmul ( #27523 )
...
* add fp16 for matmul
4 years ago
Zhong Hui
fab4e6d08f
add abs support double grad
...
add abs support double grad for the api 2.0
4 years ago
GaoWei8
36ed83d270
Refine PADDLE_ENFORCE ( #27360 )
...
* refine PADDLE_ENFORCE
4 years ago
liym27
effd51b6be
Fix error message in operator/utils.h ( #27532 )
4 years ago
Leo Chen
6bb02e8e3c
increase retry time ( #27553 )
4 years ago
Shang Zhizhou
77a36f8997
[buf fix]:fix some unittests error ( #27540 )
...
* [buf fix]:fix unittest test_activation_op error
* split long-time unittests to smaller ones
* rename some unittests
4 years ago
Zhong Hui
597345d17b
fix cuda atomic for ARCH<350 for the automic_max
...
fix cuda atomic for ARCH<350 for the automic_max
4 years ago
WangXi
e550fc02ae
fleet2.0 add fp16 grad compression ( #27480 )
4 years ago
cc
c5c13473c6
Add compatibility check for four mkldnn pass ( #27364 )
...
* Add pass compatibility check for four mkldnn pass, test=develop
4 years ago
mapingshuo
c83ade6d6b
add AsDuplicable for sync_comm op( #27515 )
4 years ago
Zhou Wei
d20349b548
add unittest count ,install check on windows ( #27492 )
...
* add unittest count of windows
* Reduce the number of retries
4 years ago
Wilber
3d5522146e
register seq_concat_fc_fuse pass. ( #27479 )
4 years ago
Wilber
df7fabeedc
Fix memory leak for mkldnn. ( #27493 )
4 years ago
ruri
b7319ef518
fix err msg in pixel shuffle op ( #27503 )
4 years ago
Kaipeng Deng
d7f422c984
fix error message in conv/conv_transpose. test=develop ( #27464 )
...
* fix error message in conv/conv_transpose. test=develop
4 years ago
Wilber
ec4155d7d0
windows lib size crop from 5.4G to 3.9G ( #27477 )
4 years ago
ruri
e1fb77d123
[2.0RC]refine error message in shuffle channel OP ( #27505 )
...
* refine err msg in shuffle channel op
4 years ago
Aurelius84
f91c37e665
Refine error message of MatchMatrix and PyramidHash ( #27484 )
4 years ago
Shibo Tao
8f7bb52bd2
fix tensorrt 6 build error. test=develop ( #27511 )
...
* fix tensorrt 6 build error. test=develop
* fix. test=develop
* bug fix
* test=develop
4 years ago
wanghuancoder
df43905f12
use iwyu clean include ( #27267 )
...
* use iwyu clean include, test=develop, test=win
* compilation error, test=develop
* fix compilation error2, test=develop
* fix compilation error3, test=develop
* fix compilation error4, test=develop
* fix compilation error5, test=develop
* fix compilation error6, test=develop
* fix compilation error7, test=develop
* fix compilation error8, test=develop
* fix compilation error8, test=develop
* fix compilation error10, test=develop
* fix compilation error11, test=develop
4 years ago
chalsliu
29f1560d8f
Disable ut quickly.
4 years ago
wangchaochaohu
dc713116e0
refine the error message for bath size like OP ( #27446 )
...
* refine the error message for bath size like
4 years ago
Zhong Hui
4a9d21de49
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
...
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
4 years ago
YUNSHEN XIE
66951ab2ea
modified timeout value for 4 ut ( #27462 )
4 years ago
Shang Zhizhou
c17f9cf25f
[bug fix]:Memory increases after adapting the cudnn version to cudnn8 ( #27436 )
...
* [bug fix]:Memory increases after adapting the cudnn version to 8
* [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
4 years ago
Zhou Wei
1e1ae5c54d
Make the Bind Method of Tensor more automatic ( #27270 )
...
* Makes the Bind Method more intelligent
* Makes the Bind Method more intelligent
* fix unittest
* fix unittest
* fix conflict
4 years ago
LutaoChu
5508c78744
Fix bug: The calculation result of Diag_v2 Op under large size input is wrong ( #27447 )
...
The calculation result of Diag_v2 Op under large size input is wrong
4 years ago
tangwei12
bc5f0246a8
large scale kv speedup ( #26510 )
...
* rename communicator meet->BatchesCounter
* fix parame recv for sparse
* geo sparse init from pserver
* optimize init from pserver
* add large scale optimizer fuse(SGD/ADAM)
* rectification init_worker and exe.run startup program
4 years ago
Qi Li
d7b7dcd10e
fix cmake dependencies of test_recognize_digits, test=develop ( #27475 )
4 years ago
Zhou Wei
292b24aa6d
fix bug MD of compile, And add MD/STATIC/OPENBLAS inference lib check on windows ( #27051 )
4 years ago
Chen Weihang
41b5955538
Polish no onwer ops error message ( #27448 )
...
* polish no onwer op error message
* fix unittest failed
* polish details based reviewer comment
4 years ago
Zhang Ting
906e7f921e
add fuse_bn_act op ( #27230 )
...
* add fused_bn_add_relu op
4 years ago
Wilber
5034d181f3
update for 2.0 inference api. ( #27473 )
4 years ago
Chen Weihang
765064476b
Polish some lost invalid error message ( #27445 )
...
* polish some lost error msg
* add some math file to white list
* polish detail based reviewer commnet
4 years ago
wangchaochaohu
76fb95fe76
avoid data transform for linspace OP ( #27444 )
4 years ago
123malin
a04524759e
Enhance Op's Error Message ( #27455 )
...
* test=develop, update error message
4 years ago
wangchaochaohu
0a862fd356
refine the precious of linspace Op using half way ( #27452 )
4 years ago
Pei Yang
fda54c0212
errmsg refine of trt plugin ( #27309 )
4 years ago
tianshuo78520a
9f3a9be76a
update python 2.7.15 ( #27435 )
4 years ago
石晓伟
dd4c2d86a5
enhance error messages, test=develop ( #27423 )
4 years ago
Zhou Wei
b7371fa55d
judge whether remove build dir to accelerate compile,test=develop ( #27334 )
4 years ago
Zhong Hui
f4c750d721
Add the cpu version of segment sum mean max min op
...
Add the cpu version of segment sum mean max min op
4 years ago
Wilber
afe94903c3
Rename fluid_inference to paddle_inference. ( #27422 )
4 years ago
Pei Yang
8182337096
clear pass logs ( #27434 )
4 years ago
furnace
13a4c74efd
add mv op(c++, python, unit test) ( #27024 )
4 years ago
LutaoChu
f11a53ee76
Optimize argsort Op performance on GPU
...
* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension
4 years ago
ceci3
1d3b27cae8
add double grad compute for batch norm ( #27296 )
...
* add double grad compute for batch norm,test=develop
* fix unittest, test=develop
* remove unuse tensor,test=develop
* add format,test=develop
* update, test=develop
4 years ago
Shang Zhizhou
d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass ( #27404 )
...
* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0
* fix seqconv_eltadd_relu_fuse_pass unitest error
4 years ago
Leo Chen
aba759ba16
[Feature] Enhance inplace addto strategy for gradient accumulation in static graph ( #27112 )
...
* support use add instead of sum to do gradient accumulation
* add inplace addto pass
* add grad_add op and inplace addto pass
* remove debug code
* code refine
* fix bug when sereral sum ops inserts at same op_idx
* fix Flags type
* add addto attribute for conv3d
* fix ut
* code clean
* fix type
4 years ago
LutaoChu
669efb98de
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
...
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
4 years ago
Wilber
39546aa2f3
Add pass compatible and unit test. ( #27377 )
4 years ago
huangxu96
02606d45ef
Quant op dev ( #25932 )
...
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.
* Finished channel-wise quantize strategy in imperative quantization.
* Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
Add Cuda code of ChannelWiseQuantDequantMaxAbsOp
* Add quant_axis for channel_wise quant.
* fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.
* Added some assert infomation and fixed some coding style mistakes.
4 years ago
Leo Chen
bbc84e0fe0
Refine error msg in paddle/fluid/framework/details [part 1] ( #25631 )
...
* refine error msg in var_handle.h, test=develop
* refine all_reduce_op_handle
* fix some error msg
* refine variable_visitor
* refine threaded_ssa_graph_executor
* refine inplace related files
* refine executor related files
* refine fetch_op_handle.cc
* fix bug
* follow comments
4 years ago
MRXLT
f936adbd2d
fix adam ( #27343 )
...
* fix adam
* rmsprop support double
4 years ago
tangwei12
99626502f7
【paddle.fleet】gloo and util ( #27213 )
...
* fix worker endpoints
* fix gloo wrapper for hdfs
* GPU fleetrun support gloo
* parameterserver fleetrun support gloo
* fix get server endpoint
4 years ago
Pei Yang
a5ef246cac
Optimize emb_eltwise_layernorm_plugin and support fp16 ( #27128 )
4 years ago
yaoxuefeng
d726fd5e86
enhance dataset err msg ( #27363 )
4 years ago
guofei
9fdcfe8981
Support python3.8 ( #26850 )
...
* Support python3.8
test=notest
4 years ago
Pei Yang
fd7ab4e63c
register pass compatibility ( #27357 )
...
* pass compatibility
* add compatibility registry
* add unittests for different padding
* add assert
* drop errmsg
4 years ago
haozech
7e6dfcf9b2
Add 3 pass version check ( #27283 )
4 years ago
GaoWei8
1a7559718e
fix cudnn dyload ( #27308 )
...
* fix cudnn dyload error
4 years ago
wawltor
b6a4349dd4
fix the error message for the math dir
...
https://github.com/PaddlePaddle/Paddle/pull/27332
4 years ago
HappyAngel
01659a6961
Polish operators error message in average_accumlate OP ( #27268 )
...
* fix op print error info problem. test=develop
* fix build error
* fix format
* fix error msg info
* fix format
4 years ago
Shang Zhizhou
3c11717988
add op version checker to ir passes ( #27329 )
4 years ago
furnace
515efe4240
add empty_like op (python, and unit test), use c++ implementation of empty op, ( #27287 )
...
and optimize the c++ implmentation of empty op as PR#26659 reviews,
and add bool for shape op.
4 years ago
Yi Liu
e9a0fbfff2
OP报错信息优化 ( #27301 )
...
paddle/fluid/operators/distributed_ops OP报错信息优化
4 years ago
tianshuo78520a
da583edf6d
Fix paddle build install requirements ( #27378 )
...
* Fix install pr requirements.txt
* test=document_fix
4 years ago
Jack Zhou
63203c4abc
enhance reduce op which can reduce tensor with arbitrary rank
...
enhance reduce op which can reduce tensor with arbitrary rank
4 years ago
YUNSHEN XIE
fd496fa9a3
modified the ut name with test_dist ( #27315 )
4 years ago
lilong12
9f9d15e285
fix the bug of non-exit, test=develop ( #27350 )
4 years ago
chalsliu
d773c6c94e
Support precision test
4 years ago
ShenLiang
9ee77b1f41
Fix elementwise_floordiv op ( #27352 )
...
* fix floordiv
4 years ago
Zhou Wei
ebc6d54446
fix cache file judge ( #27369 )
4 years ago
ShenLiang
54b81fa32c
add adaptivelsgd in meta_optimizer ( #27289 )
...
* add adaptivelsgd
* Todo fix the code to avoid the conflict.
4 years ago
Jack Zhou
6e29c2da05
Error description optimize for the math dir
...
Error description optimize for the math dir
4 years ago
Zhou Wei
f992f8d7ef
fix judge cache file of inference api more accurate ( #27175 )
...
fix judge cache file of inference api more accurate
4 years ago
Jacek Czaja
4582f697b6
- Fix to concat oneDNN overwritting data ( #27273 )
...
test=develop
4 years ago
ShenLiang
c296618c94
fix error message in broadcast/allreduce/gather ( #27302 )
...
* fix error message
4 years ago
Chen Weihang
4f9d6529fe
Polish framework error message part 7 ( #27266 )
...
* polish framework error message part 7
* fix typo
* polish by reviewes comment
4 years ago
wawltor
4e8582fe5a
update the error message check for the some ops
...
update the error message check for the some ops
4 years ago
wawltor
d003573f90
add the error message check for the some operator
...
add the error message check for the some operator
4 years ago
Wilber
dae62556cb
Enhance infer error info message ( #26731 )
4 years ago
Leo Chen
4c8ea492cd
use shared dev_ctx ( #27313 )
4 years ago
wanghuancoder
696a39e2f3
use clcache in windows ( #27279 )
4 years ago
Shang Zhizhou
47fdc60ecc
Optimize slice trt plugin ( #26970 )
...
* optimize slice TRT plugin
This patch removes unnecessary barrier for data transfer of needed offset,
so data transfer can be overlap with GPU kernel execution.
This patch also fixes incorrect name of slice plugin. That is, replaces
"layernorm" with "slice"
test=develop
* add serialize/deserialize to slice plugin
* add static shape slice trt plugin
* fix slice trt op convertor dynamic shape bug
* fix format by clang-format
* fix pylint format error
* fix problems commented by peiyang
Co-authored-by: Ryan Jeng <rjeng@nvidia.com>
4 years ago
Wilber
f827665ae6
[Pass Compatible] Bind python compatible. ( #27262 )
4 years ago
石晓伟
bd77a4258d
error messages of inference/tests, test=develop ( #27259 )
4 years ago
Chen Weihang
dafb0e3bb7
Polish framework error message part 6 ( #27257 )
...
* polish framework error msg part 6
* polish lossed item
* fix failed unittest
* polish by reviewer comments
4 years ago
Shang Zhizhou
e6e2e53782
Optimize error report ( #27254 )
...
* optimize errror report
* add test case for pad op converter
* fix some spelling mistake commented by peiyang
4 years ago
GaoWei8
ee1ed42c99
change sequence length attribute to input ( #27193 )
...
* replace sequence length attr to input
4 years ago
Pei Yang
3ae3b86489
fix trt_dynamic_shape_ernie_deserialize_test ( #27290 )
...
* fix trt_dynamic_shape_ernie_deserialize_test
* support when opt cache dir does not exist
4 years ago
joanna.wozna.intel
1483ea2304
Add bfloat16 passes ( #26999 )
4 years ago
lilong12
bf461fa524
Improving error report message for sequence_expand op ( #27245 )
...
* improve err report, test=develop
4 years ago
Zhong Hui
bbad3414e8
Enhance the error messages for files in operators/math
...
Enhance the error messages for files in operators/math
4 years ago
Chen Weihang
79149c8ee6
polish framework error message part 8 ( #27269 )
4 years ago
Pei Yang
aae41c6fca
refine error message related to paddle-TRT ( #27256 )
4 years ago
Zhen Wang
d708b21074
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. ( #26240 )
...
* update amp_check_finite_and_scale_op for static_amp.
* use amp_check_finite_and_scale in static graph amp.
* update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op).
* add update_loss_scaling op in cpp.
* add update_loss_scaling_op unit test.
* update the doc of the check_finite_and_unscale op
* Update the process of gradients updating skipping if the gradients have infinite values.
* update the way to zero grads.
* update test_update_loss_scaling_op.py
* add log info when find infinite grads.
* add the unit test for UpdateLossScaling Layer.
4 years ago
ShenLiang
2b6a5793fe
remove auto mode from localsgd optimizer ( #27237 )
...
* rm auto from localsgd
4 years ago
Adam
cc3f4b813a
Add int8 GRU kernel ( #27220 )
...
* Add int8 GRU kernel with UTs
* Lint fixes
* More lint fixes
4 years ago
石晓伟
255e0cf978
error messages of inference/capi, test=develop ( #27258 )
4 years ago
Jack Zhou
9437ce36c4
Error description optimize for math dir
...
Error description optimize for math dir
4 years ago
Zhang Ting
5c1bafbbc6
use eval to improve performance, test=develop ( #25459 )
4 years ago
lidanqing
5c4eed66fd
Fix GRU mkldnn kernel fail on look_table_v2 ( #27198 )
...
* Fix the lookup_table_v2 failed on GRU mkldnn kernel issue
test=develop
* fix according to reviews, removed x_num_col_dims
test=develop
* update gru model. change according to reviews
test=develop
* change according to reviews
test=develop
4 years ago
LoveAn
7745ad55ed
Add details to the summary for show more error informations ( #27165 )
...
* Add details to the summary and test it, test=document_fix
* Add set +e before example, test=document_fix
* Remove test code, test=document_fix
* Optimize summary information and test it, test=document_fix
* Remove test code, test=document_fix
4 years ago
Chen Weihang
33ff833af2
fix loaded no params layer run error ( #27241 )
4 years ago
Wilber
f1ab288201
enhance inference error info. ( #27251 )
4 years ago
Wilber
1b84c0bf43
Lite subgraph refine predictor ( #27167 )
4 years ago
furnace
2e59769612
add empty op (c++, python, unit test) ( #26659 )
4 years ago
Zhou Wei
f6be5989fd
Reduce the parallel compile count ( #27187 )
4 years ago
lilong12
c5f957ae38
add double grad for tile op and expand_v2 op ( #27114 )
...
* add double grad for tile, test=develop
* add double grad for expand_v2 op, test=develop
4 years ago
lilong12
58a88ba9af
add double grad for expand ( #27183 )
...
* add double grad for expand, test=develop
4 years ago
Qi Li
7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop ( #27231 )
4 years ago
Qi Li
78446ecdba
[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop ( #27218 )
4 years ago
Jacek Czaja
e005861598
[oneDNN]Introducing oneDNN 1.6 ( #27137 )
...
* - introducing oneDNN 1.6
test=develop
* - Removed redundant code
test=develop
4 years ago
ShenLiang
5bd84b22c4
revert divide ( #27202 )
4 years ago
wawltor
fde5cfe881
fix the CudaPinMemory bug for the equal op ( #27176 )
...
fix the CudaPinMemory bug for the equal op and add the test case for the equal op
4 years ago
zhupengyang
cc3306f7c8
restruct logsumexp to speed up compiling ( #27191 )
4 years ago
Steffy-zxf
50e60e8779
update error info for selected_rows_functor
...
update error info for selected_rows_functor
4 years ago
Wilber
edd962b1d0
Add 2.0 inference api doc. ( #27125 )
4 years ago
JZ-LIANG
5d039f4086
modified the implement of Lars optimizer ( #26733 )
...
add lars to fleet meta optimizer
4 years ago
wangchaochaohu
c71d79b1d2
[cuda11 support] change the CMakeLists to support the cuda11 ( #27124 )
4 years ago
Qinghe JING
43b0445b29
Add double grad in reduce sum ( #27115 )
...
* set default value to strategy in distributed_optimizer test=develop
4 years ago
kinghuin
ed292695c5
optimize the error message for math dir
...
optimize the error message for math dir
4 years ago
yongqiangma
4558d395e9
fix Norm op error ( #26771 )
...
* fix frobenius_norm error, rm p=0 2-axis support. test=develop
4 years ago
LielinJiang
4d7d661249
Fix kl and summary bug ( #27132 )
...
* fix summary rnn
* fix kl_div bug when input shape is [1] and reduction is batchmean
4 years ago
WeiXin
13804ed80c
Error msg/polish tensor error msg ( #26976 )
...
* polish one line error message in tensor.cc
* polish error messages in tensor.cc,tensor.h tensor_impl.h
* polish error messages in tensor.cc tensor.h tensor_impl.h
* polish error messages in tensor.cc,tensor.h tensor_impl.h
* polish error messages in tensor.cc tensor.h tensor_impl.h tensor_test.cc
* polish error messages in tensor.cc tensor.h tensor_impl.h
4 years ago
whs
eb01976037
[2.0 API]Add checker in grid_sample_grad op ( #27126 )
4 years ago
wangguanzhong
a28ae86e11
Enhance ops to support LoD as input for dygraph detection models. ( #25316 )
...
* enhance collect_op for dygraph, test=develop
* enhance detection ops with lod, test=develop
* support none bbox left in generate_proposals, test=develop
* unfiy MultiLevelRoisNum, test=develop
* update core.ops, test=develop
* add op register for new input & output, test=develop
4 years ago
Zhou Wei
753a0748ee
Temporarily turn off WITH_INFERENCE_API_TEST ( #27170 )
4 years ago
YUNSHEN XIE
d4710163eb
add timeout unittests retry ( #27152 )
...
* add timeout unittests retry
* modifed parameter use
4 years ago
LielinJiang
8df5b4d608
Add correlation api to contrib ( #27015 )
...
* add correlation api to contrib
4 years ago
LoveAn
cbcd5e407a
Fix problem that target name already exists when there isn't model data cache, test=develop ( #27142 )
4 years ago
kinghuin
1b102dd552
optimize the error message for unpooling.cc
...
fix the error message for the unpooling.cc
4 years ago
Pei Yang
5fb8c92054
fix multihead matmul shared params ( #27121 )
4 years ago
xiaoting
58f3ef982a
fix typo for interp_v2,test=develop ( #26843 )
...
* fix typo for interp_v2,test=develop
* align with torch, test=develop
* add area mode, test=develop
* fix bug, test=develop
* format notes, test=develop
* update for converage, test=develop
* fix bilinear, test=develop
* fix bicubic, test=develop
* fix typo, test=develop
* fix coverage, test=develop
* fix helper.input_dtype, test=develop
* polish notes, test=develop
* polish notes, test=develop
* polish notes, test=develop
4 years ago
LoveAn
ed2f57cc42
Restore file changes caused by pre-commit ( #27105 )
...
* Restore file changes caused by pre-commit and test it, test=document_fix
* Change argument of checkout, test=document_fix
* Remove test code, test=document_fix
4 years ago
YUNSHEN XIE
9fd5eae81d
add failed unittests retry on mac system ( #26813 )
...
* add retry on mac
* fix some error
* fix with some errors
4 years ago
YUNSHEN XIE
92bf0d47e3
add failed unittests retry on win system ( #26823 )
...
* add failed unittests retry on win system
* modified the value of retry times
4 years ago
wangchaochaohu
5af81f833c
fix gpu kernel for numel Op ( #27085 )
4 years ago
Wilber
632125415c
Refine python inference api ( #26958 )
4 years ago
YUNSHEN XIE
b150f2b3a6
disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix ( #27059 )
4 years ago
zhupengyang
19ca6d9dd2
add .part to speed up compile ( #27044 )
4 years ago
LoveAn
fab8bbf25b
Modify data download function and support unittests of inference APIs on windows ( #26988 )
...
* Modify data download function, and support unittests of inference APIs on windows, test=develop
* The import error compatible with py2 and py3, and fix unittests problems of inference APIs on Windows, test=develop
4 years ago
GaoWei8
4ff16eb201
Add padding cudnn interface ( #26370 )
...
* add lstm cudnn of padding data and refine cudnn codes
4 years ago
wawltor
8857e3911f
add the dynamic dtype check for the argmin/argma
...
update the check for the dtype check for the argmin, argmax
4 years ago
wangchaochaohu
041f4ab842
refine linspace Op for dtype setting( #27071 )
5 years ago
yaoxuefeng
9aa39584fe
fix cuda generator hard-coded offset step ( #27027 )
5 years ago
Jacek Czaja
f6653c71e9
[oneDNN] Fix to conv2d grad with groups ( #27006 )
...
* - Added fix to mobilenet
* - compilation fix
* - Fix to conv2d grad oneDNN with groups
test=develop
5 years ago
Chengmo
a72752263b
support heter-xpu-ps ( #27018 )
...
support heter-xpu-ps
5 years ago
whs
2660ea379d
Fix cuda kernel of affine grid ( #27003 )
...
test=develop
5 years ago
Zhou Wei
4204ceaed9
kill op_function_generator.exe ( #27005 )
5 years ago
Zhou Wei
5a48952a54
remove rmdir build ( #26965 )
5 years ago
zhangchunle
5866cde758
mac tests failed ( #26928 )
5 years ago
ShenLiang
ff3dc8ac73
fix the remainder ( #26995 )
5 years ago
yaoxuefeng
7f3e6ca596
add cuda generator ( #26786 )
5 years ago
iducn
35ae10272e
add shell of CPU's version info ( #26937 )
5 years ago
Feiyu Chan
c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatability ( #26985 )
5 years ago
wangchaochaohu
3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) ( #26963 )
5 years ago
Chen Weihang
209273e605
Support load state dict form `inference model` format save result ( #26718 )
...
* support load infer model format state dict
* add unittests
* remove keep name table
* recolve circle inport
* fix compatible problem
* recover unittest
* polish doc and comment
5 years ago
joanna.wozna.intel
95e1434bb2
Add bfloat16 data type ( #25402 )
5 years ago
Yang Zhang
29b844ad5e
Fix clip op attr ( #26924 )
5 years ago
LoveAn
26c698e2c9
Fix catch exit code failed caused by ( #26934 )
5 years ago
Shang Zhizhou
61fc7a3e45
Pass version check ( #26887 )
5 years ago
Zhou Wei
f772540d80
add time when test failed ( #26935 )
...
show unittest time even if unittest failed
5 years ago
huangjun12
e480168fae
fix dropout bug in backward when input is 1d tensor ( #26837 )
...
* fix dropout bug in backward when input is 1d tensor, test=develop
* add test case and refine error message, test=develop
* refine error message, test=develop
5 years ago
YUNSHEN XIE
d8984a6b90
limit timeout value setting on linux ( #26923 )
5 years ago
Zhou Wei
1771d9f880
fix cache judge more safe ( #26910 )
5 years ago
joanna.wozna.intel
0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " ( #26882 )
...
* Add mkldnn bfloat16 option to C-API
* Add test for bfloat16 gpu
* Change coverage test
* Repair capi_gpu test
5 years ago
Jacek Czaja
5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines ( #26891 )
...
test=develop
5 years ago
wanghuancoder
2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor ( #26643 )
...
* optimized transformation form tensor to numpy, test=develop
* Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop
* modify CUDAPinnedPlace to CPUPlace, test=develop
* modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop
* revert fetch_op_handle, add fetch_async_op_handle, test=develop
* revert fetch_op_handle, add fetch_async_op_handle, test=develop
* fix error msg report, test=develop
* fix bug in cpuplace, test=develop
* fix bug in unmerge and tensorarray modle, test=develop
* fix bug, double copy gpu memory, test=develop
* fix chenweihang¡¯s review advice, test=develop
5 years ago
Thunderbrook
5205748481
fix eigen in push sparse; fix hadoop command ( #26872 )
...
* fix eigen in push sparse; fix hadoop command
test=develop
* add log in load_combine_op
test=develop
5 years ago
Zhaolong Xing
932bbe955b
fix pool trt plugin bug ( #26463 )
...
test=develop
5 years ago
wawltor
0a29fc85d6
fix the argmin,argmax op for the paddlepaddle 2.0
...
* fix the argmin,argmax op for the paddlepaddle 2.0, add checkPoint for the argmax/argmin
5 years ago
LoveAn
d067e66d39
Show more possible problems with build_and_check in file paddle_build.sh ( #26846 )
...
* Show more possible problems with build_and_check in file paddle_build.sh, test=develop
* Remove test codes modified in file device.py for build_and_check, test=document_fix
* Fix missing blank space in file device.py, test=document_fix
* Final process via summary_check_problems function, test=document_fix
5 years ago
Chengmo
d0962abd20
supplement bug fix of parameter server ( #26217 )
...
* fix fluid.embedding
5 years ago
zlsh80826
ad6e3dd69c
[Paddle-TRT] Stack op plugin ( #25605 )
...
* add stack_op to CMakeLists
* add dim=3 support for scale op
* add trt stack op, test=develop
* remove debug message
* add stack plugin serialize
* remove slice, scale op, will add later
* enhence error message
* revise trt ernie test to conver the stack op CI testi, test=develop
* add stack op serialization
* fix test shape after adding stack op
* remove slice op, will add after implementing serialization
* roll back to min_graph=5 to avoid using slice op
* fix scale op output layer
* implement stack op createPlugin
* use workspace and move the defination to .cu
* move stack plugin creator definition to .cu, test=develop
5 years ago
Leo Chen
60ffc22026
Refine bernoulli and unsqueeze op ( #26842 )
...
* add check for bernoulli and register bool for unsqueeze
* follow comments
5 years ago
YUNSHEN XIE
1e50b2a635
fix retry error with blank ( #26835 )
5 years ago
石晓伟
ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API ( #26676 )" ( #26854 )
...
This reverts commit 02083bda40
.
5 years ago
tangwei12
ebc5f99789
add embedding 2.0 ( #26649 )
...
* add embedding 2.0
* add embedding support input int32
5 years ago
Zhou Wei
d85410109d
Count the time and packet size for Windows monitor ( #26678 )
...
* turn on WITH_INFERENCE_API_TEST
* Count the time and packet size on windows
* fix conflit
* fix conflit
* fix conflit
* fix date-time funciton
5 years ago
hong19860320
40378edfa8
Add the AddCheckpoint macro to softplus op ( #26809 )
5 years ago
GaoWei8
11fb8a1c10
Refine cudnn softmax ( #25757 )
...
* refine cudnn softmax
5 years ago
arlesniak
885c61f086
Add use of global flag 'use_mkldnn' to layer_helper ( #26497 )
...
* get use of global 'use_mkldnn' in layer_helper
* update for CI
* update for CI, relu test
* update for CI, relu test added, make FLAGS_use_mkldnn a public flag
* added more strict tests, fixes after review
* fixes after review
* fixes after review, CI stuff
5 years ago
swtkiwi
f44420c874
test=develop ( #26710 )
5 years ago
Pei Yang
78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models ( #26536 )
...
* support trt dynamic shape int8
* add unittest
* add support for sigmoid; adapt to trt6+ api
5 years ago
wawltor
7ee70a47b8
update the doc for the some ops
...
update the doc for the some ops, ceil asin, atan
5 years ago
yaoxuefeng
a47d92d868
fleet add save with whitelist test=develop ( #23376 )
5 years ago
zhupengyang
0f1ad9b06c
leaky_relu and hardshrink add checkpoint for behavior changed ( #26802 )
5 years ago
Chengmo
7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server ( #25998 )
...
* Support Heter Parameter Server
5 years ago
zlsh80826
ac63c7cdef
fix a skip_layernorm bug, test=develop ( #26800 )
5 years ago