Paddle

Commit Graph

Author	SHA1	Message	Date
Aurelius84	f91c37e665	Refine error message of MatchMatrix and PyramidHash (#27484 )	4 years ago
Shibo Tao	8f7bb52bd2	fix tensorrt 6 build error. test=develop (#27511 ) * fix tensorrt 6 build error. test=develop * fix. test=develop * bug fix * test=develop	4 years ago
wanghuancoder	df43905f12	use iwyu clean include (#27267 ) * use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop	4 years ago
chalsliu	29f1560d8f	Disable ut quickly.	4 years ago
wangchaochaohu	dc713116e0	refine the error message for bath size like OP (#27446 ) * refine the error message for bath size like	4 years ago
Zhong Hui	4a9d21de49	Add GPU Kernels of Segment Ops, support, sum, max, min, mean Add GPU Kernels of Segment Ops, support, sum, max, min, mean	4 years ago
YUNSHEN XIE	66951ab2ea	modified timeout value for 4 ut (#27462 )	4 years ago
Shang Zhizhou	c17f9cf25f	[bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436 ) * [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined	4 years ago
Zhou Wei	1e1ae5c54d	Make the Bind Method of Tensor more automatic (#27270 ) * Makes the Bind Method more intelligent * Makes the Bind Method more intelligent * fix unittest * fix unittest * fix conflict	4 years ago
LutaoChu	5508c78744	Fix bug: The calculation result of Diag_v2 Op under large size input is wrong (#27447 ) The calculation result of Diag_v2 Op under large size input is wrong	4 years ago
tangwei12	bc5f0246a8	large scale kv speedup (#26510 ) * rename communicator meet->BatchesCounter * fix parame recv for sparse * geo sparse init from pserver * optimize init from pserver * add large scale optimizer fuse(SGD/ADAM) * rectification init_worker and exe.run startup program	4 years ago
Qi Li	d7b7dcd10e	fix cmake dependencies of test_recognize_digits, test=develop (#27475 )	4 years ago
Zhou Wei	292b24aa6d	fix bug MD of compile, And add MD/STATIC/OPENBLAS inference lib check on windows (#27051 )	4 years ago
Chen Weihang	41b5955538	Polish no onwer ops error message (#27448 ) * polish no onwer op error message * fix unittest failed * polish details based reviewer comment	4 years ago
Zhang Ting	906e7f921e	add fuse_bn_act op (#27230 ) * add fused_bn_add_relu op	4 years ago
Wilber	5034d181f3	update for 2.0 inference api. (#27473 )	4 years ago
Chen Weihang	765064476b	Polish some lost invalid error message (#27445 ) * polish some lost error msg * add some math file to white list * polish detail based reviewer commnet	4 years ago
wangchaochaohu	76fb95fe76	avoid data transform for linspace OP (#27444 )	4 years ago
123malin	a04524759e	Enhance Op's Error Message (#27455 ) * test=develop, update error message	4 years ago
wangchaochaohu	0a862fd356	refine the precious of linspace Op using half way (#27452 )	4 years ago
Pei Yang	fda54c0212	errmsg refine of trt plugin (#27309 )	4 years ago
tianshuo78520a	9f3a9be76a	update python 2.7.15 (#27435 )	4 years ago
石晓伟	dd4c2d86a5	enhance error messages, test=develop (#27423 )	4 years ago
Zhou Wei	b7371fa55d	judge whether remove build dir to accelerate compile,test=develop (#27334 )	4 years ago
Zhong Hui	f4c750d721	Add the cpu version of segment sum mean max min op Add the cpu version of segment sum mean max min op	4 years ago
Wilber	afe94903c3	Rename fluid_inference to paddle_inference. (#27422 )	4 years ago
Pei Yang	8182337096	clear pass logs (#27434 )	4 years ago
furnace	13a4c74efd	add mv op(c++, python, unit test) (#27024 )	4 years ago
LutaoChu	f11a53ee76	Optimize argsort Op performance on GPU * argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension	4 years ago
ceci3	1d3b27cae8	add double grad compute for batch norm (#27296 ) * add double grad compute for batch norm,test=develop * fix unittest, test=develop * remove unuse tensor,test=develop * add format,test=develop * update, test=develop	4 years ago
Shang Zhizhou	d93661942e	fix bug sequececonv_eltadd_relu_fuse_pass (#27404 ) * fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unitest error	4 years ago
Leo Chen	aba759ba16	[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112 ) * support use add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when sereral sum ops inserts at same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type	4 years ago
LutaoChu	669efb98de	Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor	4 years ago
Wilber	39546aa2f3	Add pass compatible and unit test. (#27377 )	4 years ago
huangxu96	02606d45ef	Quant op dev (#25932 ) * Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement. * Added some assert infomation and fixed some coding style mistakes.	4 years ago
Leo Chen	bbc84e0fe0	Refine error msg in paddle/fluid/framework/details [part 1] (#25631 ) * refine error msg in var_handle.h, test=develop * refine all_reduce_op_handle * fix some error msg * refine variable_visitor * refine threaded_ssa_graph_executor * refine inplace related files * refine executor related files * refine fetch_op_handle.cc * fix bug * follow comments	4 years ago
MRXLT	f936adbd2d	fix adam (#27343 ) * fix adam * rmsprop support double	4 years ago
tangwei12	99626502f7	【paddle.fleet】gloo and util (#27213 ) * fix worker endpoints * fix gloo wrapper for hdfs * GPU fleetrun support gloo * parameterserver fleetrun support gloo * fix get server endpoint	4 years ago
Pei Yang	a5ef246cac	Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128 )	4 years ago
yaoxuefeng	d726fd5e86	enhance dataset err msg (#27363 )	4 years ago
guofei	9fdcfe8981	Support python3.8 (#26850 ) * Support python3.8 test=notest	4 years ago
Pei Yang	fd7ab4e63c	register pass compatibility (#27357 ) * pass compatibility * add compatibility registry * add unittests for different padding * add assert * drop errmsg	4 years ago
haozech	7e6dfcf9b2	Add 3 pass version check (#27283 )	4 years ago
GaoWei8	1a7559718e	fix cudnn dyload (#27308 ) * fix cudnn dyload error	4 years ago
wawltor	b6a4349dd4	fix the error message for the math dir https://github.com/PaddlePaddle/Paddle/pull/27332	4 years ago
HappyAngel	01659a6961	Polish operators error message in average_accumlate OP (#27268 ) * fix op print error info problem. test=develop * fix build error * fix format * fix error msg info * fix format	4 years ago
Shang Zhizhou	3c11717988	add op version checker to ir passes (#27329 )	4 years ago
furnace	515efe4240	add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287 ) and optimize the c++ implmentation of empty op as PR#26659 reviews, and add bool for shape op.	4 years ago
Yi Liu	e9a0fbfff2	OP报错信息优化 (#27301 ) paddle/fluid/operators/distributed_ops OP报错信息优化	4 years ago
tianshuo78520a	da583edf6d	Fix paddle build install requirements (#27378 ) * Fix install pr requirements.txt * test=document_fix	4 years ago
Jack Zhou	63203c4abc	enhance reduce op which can reduce tensor with arbitrary rank enhance reduce op which can reduce tensor with arbitrary rank	4 years ago
YUNSHEN XIE	fd496fa9a3	modified the ut name with test_dist (#27315 )	4 years ago
lilong12	9f9d15e285	fix the bug of non-exit, test=develop (#27350 )	4 years ago
chalsliu	d773c6c94e	Support precision test	4 years ago
ShenLiang	9ee77b1f41	Fix elementwise_floordiv op (#27352 ) * fix floordiv	4 years ago
Zhou Wei	ebc6d54446	fix cache file judge (#27369 )	4 years ago
ShenLiang	54b81fa32c	add adaptivelsgd in meta_optimizer (#27289 ) * add adaptivelsgd * Todo fix the code to avoid the conflict.	4 years ago
Jack Zhou	6e29c2da05	Error description optimize for the math dir Error description optimize for the math dir	4 years ago
Zhou Wei	f992f8d7ef	fix judge cache file of inference api more accurate (#27175 ) fix judge cache file of inference api more accurate	4 years ago
Jacek Czaja	4582f697b6	- Fix to concat oneDNN overwritting data (#27273 ) test=develop	4 years ago
ShenLiang	c296618c94	fix error message in broadcast/allreduce/gather (#27302 ) * fix error message	4 years ago
Chen Weihang	4f9d6529fe	Polish framework error message part 7 (#27266 ) * polish framework error message part 7 * fix typo * polish by reviewes comment	4 years ago
wawltor	4e8582fe5a	update the error message check for the some ops update the error message check for the some ops	4 years ago
wawltor	d003573f90	add the error message check for the some operator add the error message check for the some operator	4 years ago
Wilber	dae62556cb	Enhance infer error info message (#26731 )	4 years ago
Leo Chen	4c8ea492cd	use shared dev_ctx (#27313 )	4 years ago
wanghuancoder	696a39e2f3	use clcache in windows (#27279 )	4 years ago
Shang Zhizhou	47fdc60ecc	Optimize slice trt plugin (#26970 ) * optimize slice TRT plugin This patch removes unnecessary barrier for data transfer of needed offset, so data transfer can be overlap with GPU kernel execution. This patch also fixes incorrect name of slice plugin. That is, replaces "layernorm" with "slice" test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op convertor dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: Ryan Jeng <rjeng@nvidia.com>	4 years ago
Wilber	f827665ae6	[Pass Compatible] Bind python compatible. (#27262 )	4 years ago
石晓伟	bd77a4258d	error messages of inference/tests, test=develop (#27259 )	4 years ago
Chen Weihang	dafb0e3bb7	Polish framework error message part 6 (#27257 ) * polish framework error msg part 6 * polish lossed item * fix failed unittest * polish by reviewer comments	4 years ago
Shang Zhizhou	e6e2e53782	Optimize error report (#27254 ) * optimize errror report * add test case for pad op converter * fix some spelling mistake commented by peiyang	4 years ago
GaoWei8	ee1ed42c99	change sequence length attribute to input (#27193 ) * replace sequence length attr to input	4 years ago
Pei Yang	3ae3b86489	fix trt_dynamic_shape_ernie_deserialize_test (#27290 ) * fix trt_dynamic_shape_ernie_deserialize_test * support when opt cache dir does not exist	4 years ago
joanna.wozna.intel	1483ea2304	Add bfloat16 passes (#26999 )	4 years ago
lilong12	bf461fa524	Improving error report message for sequence_expand op (#27245 ) * improve err report, test=develop	4 years ago
Zhong Hui	bbad3414e8	Enhance the error messages for files in operators/math Enhance the error messages for files in operators/math	4 years ago
Chen Weihang	79149c8ee6	polish framework error message part 8 (#27269 )	4 years ago
Pei Yang	aae41c6fca	refine error message related to paddle-TRT (#27256 )	4 years ago
Zhen Wang	d708b21074	Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240 ) * update amp_check_finite_and_scale_op for static_amp. * use amp_check_finite_and_scale in static graph amp. * update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op). * add update_loss_scaling op in cpp. * add update_loss_scaling_op unit test. * update the doc of the check_finite_and_unscale op * Update the process of gradients updating skipping if the gradients have infinite values. * update the way to zero grads. * update test_update_loss_scaling_op.py * add log info when find infinite grads. * add the unit test for UpdateLossScaling Layer.	4 years ago
ShenLiang	2b6a5793fe	remove auto mode from localsgd optimizer (#27237 ) * rm auto from localsgd	4 years ago
Adam	cc3f4b813a	Add int8 GRU kernel (#27220 ) * Add int8 GRU kernel with UTs * Lint fixes * More lint fixes	4 years ago
石晓伟	255e0cf978	error messages of inference/capi, test=develop (#27258 )	4 years ago
Jack Zhou	9437ce36c4	Error description optimize for math dir Error description optimize for math dir	4 years ago
Zhang Ting	5c1bafbbc6	use eval to improve performance, test=develop (#25459 )	4 years ago
lidanqing	5c4eed66fd	Fix GRU mkldnn kernel fail on look_table_v2 (#27198 ) * Fix the lookup_table_v2 failed on GRU mkldnn kernel issue test=develop * fix according to reviews, removed x_num_col_dims test=develop * update gru model. change according to reviews test=develop * change according to reviews test=develop	4 years ago
LoveAn	7745ad55ed	Add details to the summary for show more error informations (#27165 ) * Add details to the summary and test it, test=document_fix * Add set +e before example, test=document_fix * Remove test code, test=document_fix * Optimize summary information and test it, test=document_fix * Remove test code, test=document_fix	4 years ago
Chen Weihang	33ff833af2	fix loaded no params layer run error (#27241 )	4 years ago
Wilber	f1ab288201	enhance inference error info. (#27251 )	4 years ago
Wilber	1b84c0bf43	Lite subgraph refine predictor (#27167 )	4 years ago
furnace	2e59769612	add empty op (c++, python, unit test) (#26659 )	4 years ago
Zhou Wei	f6be5989fd	Reduce the parallel compile count (#27187 )	4 years ago
lilong12	c5f957ae38	add double grad for tile op and expand_v2 op (#27114 ) * add double grad for tile, test=develop * add double grad for expand_v2 op, test=develop	4 years ago
lilong12	58a88ba9af	add double grad for expand (#27183 ) * add double grad for expand, test=develop	4 years ago
Qi Li	7c7fbd3218	fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231 )	4 years ago
Qi Li	78446ecdba	[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop (#27218 )	4 years ago
Jacek Czaja	e005861598	[oneDNN]Introducing oneDNN 1.6 (#27137 ) * - introducing oneDNN 1.6 test=develop * - Removed redundant code test=develop	4 years ago
ShenLiang	5bd84b22c4	revert divide (#27202 )	4 years ago
wawltor	fde5cfe881	fix the CudaPinMemory bug for the equal op (#27176 ) fix the CudaPinMemory bug for the equal op and add the test case for the equal op	4 years ago
zhupengyang	cc3306f7c8	restruct logsumexp to speed up compiling (#27191 )	4 years ago
Steffy-zxf	50e60e8779	update error info for selected_rows_functor update error info for selected_rows_functor	4 years ago
Wilber	edd962b1d0	Add 2.0 inference api doc. (#27125 )	4 years ago
JZ-LIANG	5d039f4086	modified the implement of Lars optimizer (#26733 ) add lars to fleet meta optimizer	4 years ago
wangchaochaohu	c71d79b1d2	[cuda11 support] change the CMakeLists to support the cuda11 (#27124 )	4 years ago
Qinghe JING	43b0445b29	Add double grad in reduce sum (#27115 ) * set default value to strategy in distributed_optimizer test=develop	4 years ago
kinghuin	ed292695c5	optimize the error message for math dir optimize the error message for math dir	4 years ago
yongqiangma	4558d395e9	fix Norm op error (#26771 ) * fix frobenius_norm error, rm p=0 2-axis support. test=develop	4 years ago
LielinJiang	4d7d661249	Fix kl and summary bug (#27132 ) * fix summary rnn * fix kl_div bug when input shape is [1] and reduction is batchmean	4 years ago
WeiXin	13804ed80c	Error msg/polish tensor error msg (#26976 ) * polish one line error message in tensor.cc * polish error messages in tensor.cc,tensor.h tensor_impl.h * polish error messages in tensor.cc tensor.h tensor_impl.h * polish error messages in tensor.cc,tensor.h tensor_impl.h * polish error messages in tensor.cc tensor.h tensor_impl.h tensor_test.cc * polish error messages in tensor.cc tensor.h tensor_impl.h	4 years ago
whs	eb01976037	[2.0 API]Add checker in grid_sample_grad op (#27126 )	4 years ago
wangguanzhong	a28ae86e11	Enhance ops to support LoD as input for dygraph detection models. (#25316 ) * enhance collect_op for dygraph, test=develop * enhance detection ops with lod, test=develop * support none bbox left in generate_proposals, test=develop * unfiy MultiLevelRoisNum, test=develop * update core.ops, test=develop * add op register for new input & output, test=develop	4 years ago
Zhou Wei	753a0748ee	Temporarily turn off WITH_INFERENCE_API_TEST (#27170 )	4 years ago
YUNSHEN XIE	d4710163eb	add timeout unittests retry (#27152 ) * add timeout unittests retry * modifed parameter use	4 years ago
LielinJiang	8df5b4d608	Add correlation api to contrib (#27015 ) * add correlation api to contrib	4 years ago
LoveAn	cbcd5e407a	Fix problem that target name already exists when there isn't model data cache, test=develop (#27142 )	4 years ago
kinghuin	1b102dd552	optimize the error message for unpooling.cc fix the error message for the unpooling.cc	4 years ago
Pei Yang	5fb8c92054	fix multihead matmul shared params (#27121 )	4 years ago
xiaoting	58f3ef982a	fix typo for interp_v2,test=develop (#26843 ) * fix typo for interp_v2,test=develop * align with torch, test=develop * add area mode, test=develop * fix bug, test=develop * format notes, test=develop * update for converage, test=develop * fix bilinear, test=develop * fix bicubic, test=develop * fix typo, test=develop * fix coverage, test=develop * fix helper.input_dtype, test=develop * polish notes, test=develop * polish notes, test=develop * polish notes, test=develop	4 years ago
LoveAn	ed2f57cc42	Restore file changes caused by pre-commit (#27105 ) * Restore file changes caused by pre-commit and test it, test=document_fix * Change argument of checkout, test=document_fix * Remove test code, test=document_fix	4 years ago
YUNSHEN XIE	9fd5eae81d	add failed unittests retry on mac system (#26813 ) * add retry on mac * fix some error * fix with some errors	4 years ago
YUNSHEN XIE	92bf0d47e3	add failed unittests retry on win system (#26823 ) * add failed unittests retry on win system * modified the value of retry times	4 years ago
wangchaochaohu	5af81f833c	fix gpu kernel for numel Op (#27085 )	4 years ago
Wilber	632125415c	Refine python inference api (#26958 )	4 years ago
YUNSHEN XIE	b150f2b3a6	disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix (#27059 )	4 years ago
zhupengyang	19ca6d9dd2	add .part to speed up compile (#27044 )	4 years ago
LoveAn	fab8bbf25b	Modify data download function and support unittests of inference APIs on windows (#26988 ) * Modify data download function, and support unittests of inference APIs on windows, test=develop * The import error compatible with py2 and py3, and fix unittests problems of inference APIs on Windows, test=develop	4 years ago
GaoWei8	4ff16eb201	Add padding cudnn interface (#26370 ) * add lstm cudnn of padding data and refine cudnn codes	4 years ago
wawltor	8857e3911f	add the dynamic dtype check for the argmin/argma update the check for the dtype check for the argmin, argmax	4 years ago
wangchaochaohu	041f4ab842	refine linspace Op for dtype setting(#27071 )	5 years ago
yaoxuefeng	9aa39584fe	fix cuda generator hard-coded offset step (#27027 )	5 years ago
Jacek Czaja	f6653c71e9	[oneDNN] Fix to conv2d grad with groups (#27006 ) * - Added fix to mobilenet * - compilation fix * - Fix to conv2d grad oneDNN with groups test=develop	5 years ago
Chengmo	a72752263b	support heter-xpu-ps (#27018 ) support heter-xpu-ps	5 years ago
whs	2660ea379d	Fix cuda kernel of affine grid (#27003 ) test=develop	5 years ago
Zhou Wei	4204ceaed9	kill op_function_generator.exe (#27005 )	5 years ago
Zhou Wei	5a48952a54	remove rmdir build (#26965 )	5 years ago
zhangchunle	5866cde758	mac tests failed (#26928 )	5 years ago
ShenLiang	ff3dc8ac73	fix the remainder (#26995 )	5 years ago
yaoxuefeng	7f3e6ca596	add cuda generator (#26786 )	5 years ago
iducn	35ae10272e	add shell of CPU's version info (#26937 )	5 years ago
Feiyu Chan	c8cc094576	add template specialization for bfloat16 for gcc 4.8 compatability (#26985 )	5 years ago
wangchaochaohu	3eacced950	[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963 )	5 years ago
Chen Weihang	209273e605	Support load state dict form `inference model` format save result (#26718 ) * support load infer model format state dict * add unittests * remove keep name table * recolve circle inport * fix compatible problem * recover unittest * polish doc and comment	5 years ago
joanna.wozna.intel	95e1434bb2	Add bfloat16 data type (#25402 )	5 years ago
Yang Zhang	29b844ad5e	Fix clip op attr (#26924 )	5 years ago
LoveAn	26c698e2c9	Fix catch exit code failed caused by (#26934 )	5 years ago
Shang Zhizhou	61fc7a3e45	Pass version check (#26887 )	5 years ago
Zhou Wei	f772540d80	add time when test failed (#26935 ) show unittest time even if unittest failed	5 years ago
huangjun12	e480168fae	fix dropout bug in backward when input is 1d tensor (#26837 ) * fix dropout bug in backward when input is 1d tensor, test=develop * add test case and refine error message, test=develop * refine error message, test=develop	5 years ago
YUNSHEN XIE	d8984a6b90	limit timeout value setting on linux (#26923 )	5 years ago
Zhou Wei	1771d9f880	fix cache judge more safe (#26910 )	5 years ago
joanna.wozna.intel	0627a319b0	Restore "Add mkldnn bfloat16 option to C-API " (#26882 ) * Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test * Repair capi_gpu test	5 years ago
Jacek Czaja	5e874cc333	- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891 ) test=develop	5 years ago
wanghuancoder	2d2c31a63a	Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643 ) * optimized transformation form tensor to numpy, test=develop * Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop * modify CUDAPinnedPlace to CPUPlace, test=develop * modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop * revert fetch_op_handle, add fetch_async_op_handle, test=develop * revert fetch_op_handle, add fetch_async_op_handle, test=develop * fix error msg report, test=develop * fix bug in cpuplace, test=develop * fix bug in unmerge and tensorarray modle, test=develop * fix bug, double copy gpu memory, test=develop * fix chenweihang¡¯s review advice, test=develop	5 years ago
Thunderbrook	5205748481	fix eigen in push sparse; fix hadoop command (#26872 ) * fix eigen in push sparse; fix hadoop command test=develop * add log in load_combine_op test=develop	5 years ago
Zhaolong Xing	932bbe955b	fix pool trt plugin bug (#26463 ) test=develop	5 years ago
wawltor	0a29fc85d6	fix the argmin,argmax op for the paddlepaddle 2.0 * fix the argmin,argmax op for the paddlepaddle 2.0， add checkPoint for the argmax/argmin	5 years ago
LoveAn	d067e66d39	Show more possible problems with build_and_check in file paddle_build.sh (#26846 ) * Show more possible problems with build_and_check in file paddle_build.sh, test=develop * Remove test codes modified in file device.py for build_and_check, test=document_fix * Fix missing blank space in file device.py, test=document_fix * Final process via summary_check_problems function, test=document_fix	5 years ago
Chengmo	d0962abd20	supplement bug fix of parameter server (#26217 ) * fix fluid.embedding	5 years ago
zlsh80826	ad6e3dd69c	[Paddle-TRT] Stack op plugin (#25605 ) * add stack_op to CMakeLists * add dim=3 support for scale op * add trt stack op, test=develop * remove debug message * add stack plugin serialize * remove slice, scale op, will add later * enhence error message * revise trt ernie test to conver the stack op CI testi, test=develop * add stack op serialization * fix test shape after adding stack op * remove slice op, will add after implementing serialization * roll back to min_graph=5 to avoid using slice op * fix scale op output layer * implement stack op createPlugin * use workspace and move the defination to .cu * move stack plugin creator definition to .cu, test=develop	5 years ago
Leo Chen	60ffc22026	Refine bernoulli and unsqueeze op (#26842 ) * add check for bernoulli and register bool for unsqueeze * follow comments	5 years ago
YUNSHEN XIE	1e50b2a635	fix retry error with blank (#26835 )	5 years ago
石晓伟	ced6e87eee	Revert "Add mkldnn bfloat16 option to C-API (#26676 )" (#26854 ) This reverts commit `02083bda40`.	5 years ago
tangwei12	ebc5f99789	add embedding 2.0 (#26649 ) * add embedding 2.0 * add embedding support input int32	5 years ago
Zhou Wei	d85410109d	Count the time and packet size for Windows monitor (#26678 ) * turn on WITH_INFERENCE_API_TEST * Count the time and packet size on windows * fix conflit * fix conflit * fix conflit * fix date-time funciton	5 years ago
hong19860320	40378edfa8	Add the AddCheckpoint macro to softplus op (#26809 )	5 years ago
GaoWei8	11fb8a1c10	Refine cudnn softmax (#25757 ) * refine cudnn softmax	5 years ago
arlesniak	885c61f086	Add use of global flag 'use_mkldnn' to layer_helper (#26497 ) * get use of global 'use_mkldnn' in layer_helper * update for CI * update for CI, relu test * update for CI, relu test added, make FLAGS_use_mkldnn a public flag * added more strict tests, fixes after review * fixes after review * fixes after review, CI stuff	5 years ago
swtkiwi	f44420c874	test=develop (#26710 )	5 years ago
Pei Yang	78a530c219	[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536 ) * support trt dynamic shape int8 * add unittest * add support for sigmoid; adapt to trt6+ api	5 years ago
wawltor	7ee70a47b8	update the doc for the some ops update the doc for the some ops, ceil asin, atan	5 years ago
yaoxuefeng	a47d92d868	fleet add save with whitelist test=develop (#23376 )	5 years ago
zhupengyang	0f1ad9b06c	leaky_relu and hardshrink add checkpoint for behavior changed (#26802 )	5 years ago
Chengmo	7f2aa2db3c	【paddle.fleet】Support Heter Parameter Server (#25998 ) * Support Heter Parameter Server	5 years ago
zlsh80826	ac63c7cdef	fix a skip_layernorm bug, test=develop (#26800 )	5 years ago
Jiawei Wang	a1b99fae07	Adadelta Optimizer (#26590 ) * add doc; notest * fix doc; notest * update doc; notest * refine optimizer && adam * refine optimizer; notest * add adam * fix doc * fix doc && add adamw; notest * add error message * bug fix * refine rmsprop && adamax * fix ci * buf fix * update comment * unify arguments place; notest * fix ut, test=develop * bug fix * fix conflicts, test=develop * add examples code * bug fix * fix comments * fix sample code * add sample code for Optimizer * add adamax ut, test=develop * fix rmsprop ut, test=develop * add ut for optimizer.py and adamw.py * first commit of adadelta optimizer * fix learning rate * fix adadelta doc and add sgd momentum * remove unused fluid * fix codestyle * Update test_adam_op.py * Update test_adam_op.py * fix SGD in 2 unittests * fix SGD in 2 unittests * fix ci * fix ut Co-authored-by: MRXLT <xlt2024@gmail.com> Co-authored-by: mapingshuo <mps2012@yeah.net>	5 years ago
LielinJiang	346689c6f1	Register conv_transpose Op version for compatible Op upgrades (#26745 ) * fix bug * add version check * fix docs, test=document_fix * fix formula, test=document_fix	5 years ago
Adam	8bcb1f29d9	Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779 )	5 years ago
Wilber	68e0560c2f	refine paddle inference api (#26774 ) * refine paddle inference api Co-authored-by: nhzlx <nhzlx.dragon@gmail.com>	5 years ago
iducn	64df9b99a9	add shell of GPU version (#26589 )	5 years ago
Wojciech Uss	7afb1df11e	Decouple weights and bias from fc primitive in MKLDNN cache (#26708 ) * decouple weights and bias from fc primitive in cache * removed reduntant update of pointers	5 years ago
Zhen Wang	f32ae272ec	Remove `sorted_sum_gradient_` form BasicEngine and PartialGradTask. (#26766 ) Use `Tensor` instead of `Variable` in the doc of paddle.grad.	5 years ago
Leo Chen	844583c8fd	Refine paddle.manual_seed (#26496 ) * refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed * 🐞fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci	5 years ago
Zhou Wei	2d88b9ffe7	turn on WITH_INFERENCE_API_TEST (#26746 )	5 years ago
Pei Yang	e3f8e5cf5c	trt int8 support conv2d_transpose (#26636 )	5 years ago
ShenLiang	29494d703d	fix remainder, floor_div (#26732 ) * fix remainder, floordiv	5 years ago
zhangchunle	623a4c2e56	fix ci coverage build error (#26761 )	5 years ago
lilong12	5f524efe56	modify error report message, test=develop (#26743 )	5 years ago
wangchaochaohu	4561fc37e2	Add check point for gather Op (#26696 )	5 years ago
joanna.wozna.intel	eb097d64f6	Fix int8 performace drop cpu_quantize_placement_pass (#26715 ) * Fix cpu quantize placement pass * Include string lib	5 years ago
joanna.wozna.intel	02083bda40	Add mkldnn bfloat16 option to C-API (#26676 ) * Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test	5 years ago
LutaoChu	1ec30cb160	register cumsum Op version for compatible Op upgrades (#26734 ) register cumsum Op version for compatible Op upgrades	5 years ago
Jack Zhou	c282db3a93	add broadcast feature for elementwise logical op add broadcast feature for elementwise logical op	5 years ago
Yang Zhang	63eef7632e	Fix clip input check (#26683 ) * Fix clip input check * Fix default min/max value * Allow both max and min to be None * Register op change * Revert OP signature change	5 years ago
Zhen Wang	f9066e6a6f	Update the demo code and the doc of varbase.backward. (#26506 ) * update the demo code and the doc of varbase.backward. * update the doc of the fake interface `paddle.fluid.Variable`. * remove BackwardStrategy.	5 years ago
Wilber	1c898b66d6	add bug fix enum. (#26736 )	5 years ago
Zhou Wei	8071d23073	fix bug that can't print int8_t (#26712 ) fix bug that can't print int8_t	5 years ago
joejiong	f311d3c1cf	Fix pow api type error with python side method, merge elementwise_pow and pow. (#26163 ) As the title	5 years ago
yongqiangma	e4cc6a28b0	Norm op support 2-axis (#26492 )	5 years ago
chalsliu	dc56c89822	Add the option to execute unit tests only at night (#26669 ) * Add the option to execute unit tests only at night * set ut nightly label for 3 cases.	5 years ago
xiaoting	89d7d86684	add intepolte_v2 (#26520 ) * add intepolte_v2 * fix linear interp * polish unittest, test=develop * update code samples to 2.0 API, test=develop * remove warning, test_develop * add name in attrs, test=develop * polish code, test=develop * change Align to align, test=develop * fix unittest in py3,test=develop * fix coverage, test=develop * fix coverage, test=develop * fix for windows ci, test=develop * fix coverage, test=develop	5 years ago
Adam Osewski	c2c689582e	Update Paddle-Lite commit hash. (#26413 ) * Update Paddle-Lite commit hash. * Add BF16 data type to VarTyp protobuf message.	5 years ago
Zhang Ting	97cebfa4d3	add dtype for unique (#26655 ) * update doc, test=document_fix * add attr(dtype) * refine code	5 years ago
lilong12	1c68138327	[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552 ) add collective op for cpu using gloo and paddle.distributed.* apis	5 years ago
joanna.wozna.intel	559e43eee4	Small change in conv2d and quantize pass (#26671 )	5 years ago
Bai Yifan	8986a82131	fix adaptive gpu grad bug, add doc refine (#26660 )	5 years ago
wawltor	286eca2d9e	update the code for the topk v2 add the top v2 for the paddlepaddle api 2.0	5 years ago
whs	f82384113b	Fix atomicAdd in grid sample op and affine grid op (#26647 ) test=develop	5 years ago
Wilber	32ba8602c6	Enhance py_func error info message. (#26557 )	5 years ago
chalsliu	cb3f131f1c	Set timeout properity for a few unitests	5 years ago
石晓伟	32ceacf317	update op_version_registry, test=develop (#26644 )	5 years ago
RandyLi	2f5bdd8dc7	Remove WOBOQ, gen_html() and sphinx (#26128 )	5 years ago
Dong Daxiang	08d736ad78	【paddle.fleet】add cudnn related strategies to DistributedStrategy (#26598 ) * add cudnn related strategies to DistributedStrategy	5 years ago
Zhang Ting	0a895bc0df	improve unique op (#26537 ) * add unique_v2 op * remove unique_v2 op * update doc	5 years ago
whs	a004dfde3d	Use atomicAdd defined in paddle fromework (#26631 ) test=develop	5 years ago
LoveAn	02fc1fef8b	Fix the cmake-function named inference_download_and_uncompress on Windows (#26512 ) * Fix the cmake-function named inference_download_and_uncompress with Windows, test=develop * Fix some problems when remove limit of unittests on Windows, test=develop * Using URL to download file instead of DOWNLOAD_COMMAND. test=develop	5 years ago
YUNSHEN XIE	a8b5741fb4	add a few unittests for setting timeout properity (#26630 )	5 years ago
zhangchunle	ef317b4b14	add mac tests failed exitcode (#26611 )	5 years ago
wanghuancoder	c1f5df5269	optimized transformation form tensor to numpy (#26447 ) * optimized transformation form tensor to numpy, test=develop * optimized transformation form tensor to numpy, pass pre-commit, test=develop * modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop * modify py:array construct, test=develop * fix _fetch_var to use deep copy, test=develop	5 years ago
zhupengyang	c80fcf901e	reduce_mean error if keepdim=True and reduce_all=True (#26614 )	5 years ago
whs	a065a24232	【2.0 API】Enhance affine grid operator (#26385 ) * Enhance affine grid operator: 1. Add cuda kernel 2. Add align corners options test=develop * Move new affine_grid api to functional test=develop * Add CUDA kernel for affine_grid. test=develop * Add more unitest for grid sample API test=develop	5 years ago
Qi Li	6f69fbc8ea	fix elu grad whne alpha less then zero, test=develop (#26543 )	5 years ago
whs	786373ba29	Use atomicAdd defined in paddle framework (#26628 ) test=develop	5 years ago
ruri	1f82c0cd62	[Api2.0] add pixel shuffle (#26071 )	5 years ago
Zhou Wei	1ed74aae7c	fix msbuild log level (#26607 )	5 years ago
wanghuancoder	422a162019	api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399 ) * api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear, test=develop * api2.0 fix code examples, test=develop * modify test_bilinear_api, about place,to_tensor , test=develop * re pass pre-commit, test=develop * Update common.py * fix BilinearTensorProduct ci error, test=develop	5 years ago
wanghuancoder	6e823cfec3	add op_function_generator.exe retry in windows, test=develop (#26591 ) add op_function_generator.exe retry in windows	5 years ago
石晓伟	fa08a834be	update op_version_registry, test=develop (#26592 )	5 years ago
whs	79539cf198	【2.0 API】Add CUDA kernel and enhance options for grid_sample (#26576 ) This PR enhance CPU kernel and add new CUDA kernel to make grid_sample support: - align_corners: with bool type. - padding mode: which can be in ['zeros', 'reflect', 'border'] - Interpolation mode: which ca be in ['bilinear', 'nearest'] The old CPU and CUDNN version only support align_corners=true, padding_mode='zeros' and interpolation_mode='bilinear'. The behavior of the new version op in default mode is compatible with the old version.	5 years ago
Guanghua Yu	8645591d66	support fp64 in huber_loss cuda kernel (#26583 )	5 years ago
yaoxuefeng	efee426742	support generator seed in related kernals test=develop (#26495 )	5 years ago
Zhong Hui	bf4a4636f1	change to use bce_loss op, add shape check for bce_loss change to use bce_loss op, add numel check for bce_loss.	5 years ago
ShenLiang	0e81626081	add div, floor_div, remainder (#26562 ) * add div, floor_div, remainder	5 years ago
石晓伟	656e60b18f	new class: op_version_registry, test=develop (#26542 )	5 years ago
qingqing01	24566e951c	Support empty bbox in bipartite math op (#26488 )	5 years ago
Jack Zhou	199b0c7c1b	Add isfinite v2 op (#26344 ) add the isnan, isfinite, isinf api for the paddle 2.0	5 years ago
Zhou Wei	28554c3f85	add --user for pip (#26440 )	5 years ago
wangchaochaohu	ebf9b2125e	add paddle.gather for API2.0 (#26455 )	5 years ago
wangchaochaohu	9219b79104	gather_nd Op for API 2.0 refine (#26540 )	5 years ago
zhupengyang	9b14117cac	logsumexp: impl kernel, refine docs (#26307 )	5 years ago
Wojciech Uss	5c2b9258a6	Fix (de/re)quantize cache keys (#26549 )	5 years ago
YUNSHEN XIE	df7fe1fe23	fix unittests run with error of Expression too big (#26573 )	5 years ago
wawltor	6b28456ed0	add the argmax, argmin for the api2.0 * add the new api and op for the argmax, argmin	5 years ago
LielinJiang	d26ae9ad87	Update conv_transpose api (#26427 ) * update conv_transpose api	5 years ago
lilong12	faa9b97b78	fix cscatter, test=develop (#26554 )	5 years ago
WangXi	45711dade7	【API】rename div to divide, add floor_divide, remainder (#26434 )	5 years ago
LutaoChu	4e0c6d91aa	add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel.	5 years ago
zhupengyang	f8863e0603	leaky_relu and LeakyReLU: alpha->negative_slope (#26216 )	5 years ago
ShenLiang	c609066074	Add Matmul op (#26411 ) * add matmul_v2	5 years ago
Leo Chen	aa2a9b5d89	add bernoulli op (#26511 ) * add bernoulli op * fix cuda kernel and add unit test * refine doc * fix uniform	5 years ago
Adam	f3909020de	Add mechanism for blocking oneDNN cache clearing (#26502 ) * Add mechanism for blocking oneDNN cache clearing * Review changes and Add thread guards	5 years ago

... 3 4 5 6 7 ...

17806 Commits (4dc8c44ba163733eddf9edbae36b93a4f7374501)