Paddle

Commit Graph

Author	SHA1	Message	Date
Pei Yang	3ae3b86489	fix trt_dynamic_shape_ernie_deserialize_test (#27290 ) * fix trt_dynamic_shape_ernie_deserialize_test * support when opt cache dir does not exist	4 years ago
joanna.wozna.intel	1483ea2304	Add bfloat16 passes (#26999 )	4 years ago
lilong12	bf461fa524	Improving error report message for sequence_expand op (#27245 ) * improve err report, test=develop	4 years ago
Zhong Hui	bbad3414e8	Enhance the error messages for files in operators/math Enhance the error messages for files in operators/math	4 years ago
Chen Weihang	79149c8ee6	polish framework error message part 8 (#27269 )	4 years ago
Pei Yang	aae41c6fca	refine error message related to paddle-TRT (#27256 )	4 years ago
Zhen Wang	d708b21074	Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240 ) * update amp_check_finite_and_scale_op for static_amp. * use amp_check_finite_and_scale in static graph amp. * update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op). * add update_loss_scaling op in cpp. * add update_loss_scaling_op unit test. * update the doc of the check_finite_and_unscale op * Update the process of gradients updating skipping if the gradients have infinite values. * update the way to zero grads. * update test_update_loss_scaling_op.py * add log info when find infinite grads. * add the unit test for UpdateLossScaling Layer.	4 years ago
ShenLiang	2b6a5793fe	remove auto mode from localsgd optimizer (#27237 ) * rm auto from localsgd	4 years ago
Adam	cc3f4b813a	Add int8 GRU kernel (#27220 ) * Add int8 GRU kernel with UTs * Lint fixes * More lint fixes	4 years ago
石晓伟	255e0cf978	error messages of inference/capi, test=develop (#27258 )	4 years ago
Jack Zhou	9437ce36c4	Error description optimize for math dir Error description optimize for math dir	4 years ago
Zhang Ting	5c1bafbbc6	use eval to improve performance, test=develop (#25459 )	4 years ago
lidanqing	5c4eed66fd	Fix GRU mkldnn kernel fail on look_table_v2 (#27198 ) * Fix the lookup_table_v2 failed on GRU mkldnn kernel issue test=develop * fix according to reviews, removed x_num_col_dims test=develop * update gru model. change according to reviews test=develop * change according to reviews test=develop	4 years ago
LoveAn	7745ad55ed	Add details to the summary for show more error informations (#27165 ) * Add details to the summary and test it, test=document_fix * Add set +e before example, test=document_fix * Remove test code, test=document_fix * Optimize summary information and test it, test=document_fix * Remove test code, test=document_fix	4 years ago
Chen Weihang	33ff833af2	fix loaded no params layer run error (#27241 )	4 years ago
Wilber	f1ab288201	enhance inference error info. (#27251 )	4 years ago
Wilber	1b84c0bf43	Lite subgraph refine predictor (#27167 )	4 years ago
furnace	2e59769612	add empty op (c++, python, unit test) (#26659 )	4 years ago
Zhou Wei	f6be5989fd	Reduce the parallel compile count (#27187 )	4 years ago
lilong12	c5f957ae38	add double grad for tile op and expand_v2 op (#27114 ) * add double grad for tile, test=develop * add double grad for expand_v2 op, test=develop	4 years ago
lilong12	58a88ba9af	add double grad for expand (#27183 ) * add double grad for expand, test=develop	4 years ago
Qi Li	7c7fbd3218	fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231 )	4 years ago
Qi Li	78446ecdba	[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop (#27218 )	4 years ago
Jacek Czaja	e005861598	[oneDNN]Introducing oneDNN 1.6 (#27137 ) * - introducing oneDNN 1.6 test=develop * - Removed redundant code test=develop	4 years ago
ShenLiang	5bd84b22c4	revert divide (#27202 )	4 years ago
wawltor	fde5cfe881	fix the CudaPinMemory bug for the equal op (#27176 ) fix the CudaPinMemory bug for the equal op and add the test case for the equal op	4 years ago
zhupengyang	cc3306f7c8	restruct logsumexp to speed up compiling (#27191 )	4 years ago
Steffy-zxf	50e60e8779	update error info for selected_rows_functor update error info for selected_rows_functor	4 years ago
Wilber	edd962b1d0	Add 2.0 inference api doc. (#27125 )	4 years ago
JZ-LIANG	5d039f4086	modified the implement of Lars optimizer (#26733 ) add lars to fleet meta optimizer	4 years ago
wangchaochaohu	c71d79b1d2	[cuda11 support] change the CMakeLists to support the cuda11 (#27124 )	4 years ago
Qinghe JING	43b0445b29	Add double grad in reduce sum (#27115 ) * set default value to strategy in distributed_optimizer test=develop	4 years ago
kinghuin	ed292695c5	optimize the error message for math dir optimize the error message for math dir	4 years ago
yongqiangma	4558d395e9	fix Norm op error (#26771 ) * fix frobenius_norm error, rm p=0 2-axis support. test=develop	4 years ago
LielinJiang	4d7d661249	Fix kl and summary bug (#27132 ) * fix summary rnn * fix kl_div bug when input shape is [1] and reduction is batchmean	4 years ago
WeiXin	13804ed80c	Error msg/polish tensor error msg (#26976 ) * polish one line error message in tensor.cc * polish error messages in tensor.cc,tensor.h tensor_impl.h * polish error messages in tensor.cc tensor.h tensor_impl.h * polish error messages in tensor.cc,tensor.h tensor_impl.h * polish error messages in tensor.cc tensor.h tensor_impl.h tensor_test.cc * polish error messages in tensor.cc tensor.h tensor_impl.h	4 years ago
whs	eb01976037	[2.0 API]Add checker in grid_sample_grad op (#27126 )	4 years ago
wangguanzhong	a28ae86e11	Enhance ops to support LoD as input for dygraph detection models. (#25316 ) * enhance collect_op for dygraph, test=develop * enhance detection ops with lod, test=develop * support none bbox left in generate_proposals, test=develop * unfiy MultiLevelRoisNum, test=develop * update core.ops, test=develop * add op register for new input & output, test=develop	4 years ago
Zhou Wei	753a0748ee	Temporarily turn off WITH_INFERENCE_API_TEST (#27170 )	4 years ago
YUNSHEN XIE	d4710163eb	add timeout unittests retry (#27152 ) * add timeout unittests retry * modifed parameter use	4 years ago
LielinJiang	8df5b4d608	Add correlation api to contrib (#27015 ) * add correlation api to contrib	4 years ago
LoveAn	cbcd5e407a	Fix problem that target name already exists when there isn't model data cache, test=develop (#27142 )	4 years ago
kinghuin	1b102dd552	optimize the error message for unpooling.cc fix the error message for the unpooling.cc	4 years ago
Pei Yang	5fb8c92054	fix multihead matmul shared params (#27121 )	4 years ago
xiaoting	58f3ef982a	fix typo for interp_v2,test=develop (#26843 ) * fix typo for interp_v2,test=develop * align with torch, test=develop * add area mode, test=develop * fix bug, test=develop * format notes, test=develop * update for converage, test=develop * fix bilinear, test=develop * fix bicubic, test=develop * fix typo, test=develop * fix coverage, test=develop * fix helper.input_dtype, test=develop * polish notes, test=develop * polish notes, test=develop * polish notes, test=develop	4 years ago
LoveAn	ed2f57cc42	Restore file changes caused by pre-commit (#27105 ) * Restore file changes caused by pre-commit and test it, test=document_fix * Change argument of checkout, test=document_fix * Remove test code, test=document_fix	4 years ago
YUNSHEN XIE	9fd5eae81d	add failed unittests retry on mac system (#26813 ) * add retry on mac * fix some error * fix with some errors	4 years ago
YUNSHEN XIE	92bf0d47e3	add failed unittests retry on win system (#26823 ) * add failed unittests retry on win system * modified the value of retry times	4 years ago
wangchaochaohu	5af81f833c	fix gpu kernel for numel Op (#27085 )	4 years ago
Wilber	632125415c	Refine python inference api (#26958 )	4 years ago
YUNSHEN XIE	b150f2b3a6	disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix (#27059 )	4 years ago
zhupengyang	19ca6d9dd2	add .part to speed up compile (#27044 )	4 years ago
LoveAn	fab8bbf25b	Modify data download function and support unittests of inference APIs on windows (#26988 ) * Modify data download function, and support unittests of inference APIs on windows, test=develop * The import error compatible with py2 and py3, and fix unittests problems of inference APIs on Windows, test=develop	4 years ago
GaoWei8	4ff16eb201	Add padding cudnn interface (#26370 ) * add lstm cudnn of padding data and refine cudnn codes	4 years ago
wawltor	8857e3911f	add the dynamic dtype check for the argmin/argma update the check for the dtype check for the argmin, argmax	4 years ago
wangchaochaohu	041f4ab842	refine linspace Op for dtype setting(#27071 )	5 years ago
yaoxuefeng	9aa39584fe	fix cuda generator hard-coded offset step (#27027 )	5 years ago
Jacek Czaja	f6653c71e9	[oneDNN] Fix to conv2d grad with groups (#27006 ) * - Added fix to mobilenet * - compilation fix * - Fix to conv2d grad oneDNN with groups test=develop	5 years ago
Chengmo	a72752263b	support heter-xpu-ps (#27018 ) support heter-xpu-ps	5 years ago
whs	2660ea379d	Fix cuda kernel of affine grid (#27003 ) test=develop	5 years ago
Zhou Wei	4204ceaed9	kill op_function_generator.exe (#27005 )	5 years ago
Zhou Wei	5a48952a54	remove rmdir build (#26965 )	5 years ago
zhangchunle	5866cde758	mac tests failed (#26928 )	5 years ago
ShenLiang	ff3dc8ac73	fix the remainder (#26995 )	5 years ago
yaoxuefeng	7f3e6ca596	add cuda generator (#26786 )	5 years ago
iducn	35ae10272e	add shell of CPU's version info (#26937 )	5 years ago
Feiyu Chan	c8cc094576	add template specialization for bfloat16 for gcc 4.8 compatability (#26985 )	5 years ago
wangchaochaohu	3eacced950	[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963 )	5 years ago
Chen Weihang	209273e605	Support load state dict form `inference model` format save result (#26718 ) * support load infer model format state dict * add unittests * remove keep name table * recolve circle inport * fix compatible problem * recover unittest * polish doc and comment	5 years ago
joanna.wozna.intel	95e1434bb2	Add bfloat16 data type (#25402 )	5 years ago
Yang Zhang	29b844ad5e	Fix clip op attr (#26924 )	5 years ago
LoveAn	26c698e2c9	Fix catch exit code failed caused by (#26934 )	5 years ago
Shang Zhizhou	61fc7a3e45	Pass version check (#26887 )	5 years ago
Zhou Wei	f772540d80	add time when test failed (#26935 ) show unittest time even if unittest failed	5 years ago
huangjun12	e480168fae	fix dropout bug in backward when input is 1d tensor (#26837 ) * fix dropout bug in backward when input is 1d tensor, test=develop * add test case and refine error message, test=develop * refine error message, test=develop	5 years ago
YUNSHEN XIE	d8984a6b90	limit timeout value setting on linux (#26923 )	5 years ago
Zhou Wei	1771d9f880	fix cache judge more safe (#26910 )	5 years ago
joanna.wozna.intel	0627a319b0	Restore "Add mkldnn bfloat16 option to C-API " (#26882 ) * Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test * Repair capi_gpu test	5 years ago
Jacek Czaja	5e874cc333	- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891 ) test=develop	5 years ago
wanghuancoder	2d2c31a63a	Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643 ) * optimized transformation form tensor to numpy, test=develop * Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop * modify CUDAPinnedPlace to CPUPlace, test=develop * modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop * revert fetch_op_handle, add fetch_async_op_handle, test=develop * revert fetch_op_handle, add fetch_async_op_handle, test=develop * fix error msg report, test=develop * fix bug in cpuplace, test=develop * fix bug in unmerge and tensorarray modle, test=develop * fix bug, double copy gpu memory, test=develop * fix chenweihang¡¯s review advice, test=develop	5 years ago
Thunderbrook	5205748481	fix eigen in push sparse; fix hadoop command (#26872 ) * fix eigen in push sparse; fix hadoop command test=develop * add log in load_combine_op test=develop	5 years ago
Zhaolong Xing	932bbe955b	fix pool trt plugin bug (#26463 ) test=develop	5 years ago
wawltor	0a29fc85d6	fix the argmin,argmax op for the paddlepaddle 2.0 * fix the argmin,argmax op for the paddlepaddle 2.0， add checkPoint for the argmax/argmin	5 years ago
LoveAn	d067e66d39	Show more possible problems with build_and_check in file paddle_build.sh (#26846 ) * Show more possible problems with build_and_check in file paddle_build.sh, test=develop * Remove test codes modified in file device.py for build_and_check, test=document_fix * Fix missing blank space in file device.py, test=document_fix * Final process via summary_check_problems function, test=document_fix	5 years ago
Chengmo	d0962abd20	supplement bug fix of parameter server (#26217 ) * fix fluid.embedding	5 years ago
zlsh80826	ad6e3dd69c	[Paddle-TRT] Stack op plugin (#25605 ) * add stack_op to CMakeLists * add dim=3 support for scale op * add trt stack op, test=develop * remove debug message * add stack plugin serialize * remove slice, scale op, will add later * enhence error message * revise trt ernie test to conver the stack op CI testi, test=develop * add stack op serialization * fix test shape after adding stack op * remove slice op, will add after implementing serialization * roll back to min_graph=5 to avoid using slice op * fix scale op output layer * implement stack op createPlugin * use workspace and move the defination to .cu * move stack plugin creator definition to .cu, test=develop	5 years ago
Leo Chen	60ffc22026	Refine bernoulli and unsqueeze op (#26842 ) * add check for bernoulli and register bool for unsqueeze * follow comments	5 years ago
YUNSHEN XIE	1e50b2a635	fix retry error with blank (#26835 )	5 years ago
石晓伟	ced6e87eee	Revert "Add mkldnn bfloat16 option to C-API (#26676 )" (#26854 ) This reverts commit `02083bda40`.	5 years ago
tangwei12	ebc5f99789	add embedding 2.0 (#26649 ) * add embedding 2.0 * add embedding support input int32	5 years ago
Zhou Wei	d85410109d	Count the time and packet size for Windows monitor (#26678 ) * turn on WITH_INFERENCE_API_TEST * Count the time and packet size on windows * fix conflit * fix conflit * fix conflit * fix date-time funciton	5 years ago
hong19860320	40378edfa8	Add the AddCheckpoint macro to softplus op (#26809 )	5 years ago
GaoWei8	11fb8a1c10	Refine cudnn softmax (#25757 ) * refine cudnn softmax	5 years ago
arlesniak	885c61f086	Add use of global flag 'use_mkldnn' to layer_helper (#26497 ) * get use of global 'use_mkldnn' in layer_helper * update for CI * update for CI, relu test * update for CI, relu test added, make FLAGS_use_mkldnn a public flag * added more strict tests, fixes after review * fixes after review * fixes after review, CI stuff	5 years ago
swtkiwi	f44420c874	test=develop (#26710 )	5 years ago
Pei Yang	78a530c219	[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536 ) * support trt dynamic shape int8 * add unittest * add support for sigmoid; adapt to trt6+ api	5 years ago
wawltor	7ee70a47b8	update the doc for the some ops update the doc for the some ops, ceil asin, atan	5 years ago
yaoxuefeng	a47d92d868	fleet add save with whitelist test=develop (#23376 )	5 years ago
zhupengyang	0f1ad9b06c	leaky_relu and hardshrink add checkpoint for behavior changed (#26802 )	5 years ago
Chengmo	7f2aa2db3c	【paddle.fleet】Support Heter Parameter Server (#25998 ) * Support Heter Parameter Server	5 years ago
zlsh80826	ac63c7cdef	fix a skip_layernorm bug, test=develop (#26800 )	5 years ago
Jiawei Wang	a1b99fae07	Adadelta Optimizer (#26590 ) * add doc; notest * fix doc; notest * update doc; notest * refine optimizer && adam * refine optimizer; notest * add adam * fix doc * fix doc && add adamw; notest * add error message * bug fix * refine rmsprop && adamax * fix ci * buf fix * update comment * unify arguments place; notest * fix ut, test=develop * bug fix * fix conflicts, test=develop * add examples code * bug fix * fix comments * fix sample code * add sample code for Optimizer * add adamax ut, test=develop * fix rmsprop ut, test=develop * add ut for optimizer.py and adamw.py * first commit of adadelta optimizer * fix learning rate * fix adadelta doc and add sgd momentum * remove unused fluid * fix codestyle * Update test_adam_op.py * Update test_adam_op.py * fix SGD in 2 unittests * fix SGD in 2 unittests * fix ci * fix ut Co-authored-by: MRXLT <xlt2024@gmail.com> Co-authored-by: mapingshuo <mps2012@yeah.net>	5 years ago
LielinJiang	346689c6f1	Register conv_transpose Op version for compatible Op upgrades (#26745 ) * fix bug * add version check * fix docs, test=document_fix * fix formula, test=document_fix	5 years ago
Adam	8bcb1f29d9	Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779 )	5 years ago
Wilber	68e0560c2f	refine paddle inference api (#26774 ) * refine paddle inference api Co-authored-by: nhzlx <nhzlx.dragon@gmail.com>	5 years ago
iducn	64df9b99a9	add shell of GPU version (#26589 )	5 years ago
Wojciech Uss	7afb1df11e	Decouple weights and bias from fc primitive in MKLDNN cache (#26708 ) * decouple weights and bias from fc primitive in cache * removed reduntant update of pointers	5 years ago
Zhen Wang	f32ae272ec	Remove `sorted_sum_gradient_` form BasicEngine and PartialGradTask. (#26766 ) Use `Tensor` instead of `Variable` in the doc of paddle.grad.	5 years ago
Leo Chen	844583c8fd	Refine paddle.manual_seed (#26496 ) * refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed * 🐞fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci	5 years ago
Zhou Wei	2d88b9ffe7	turn on WITH_INFERENCE_API_TEST (#26746 )	5 years ago
Pei Yang	e3f8e5cf5c	trt int8 support conv2d_transpose (#26636 )	5 years ago
ShenLiang	29494d703d	fix remainder, floor_div (#26732 ) * fix remainder, floordiv	5 years ago
zhangchunle	623a4c2e56	fix ci coverage build error (#26761 )	5 years ago
lilong12	5f524efe56	modify error report message, test=develop (#26743 )	5 years ago
wangchaochaohu	4561fc37e2	Add check point for gather Op (#26696 )	5 years ago
joanna.wozna.intel	eb097d64f6	Fix int8 performace drop cpu_quantize_placement_pass (#26715 ) * Fix cpu quantize placement pass * Include string lib	5 years ago
joanna.wozna.intel	02083bda40	Add mkldnn bfloat16 option to C-API (#26676 ) * Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test	5 years ago
LutaoChu	1ec30cb160	register cumsum Op version for compatible Op upgrades (#26734 ) register cumsum Op version for compatible Op upgrades	5 years ago
Jack Zhou	c282db3a93	add broadcast feature for elementwise logical op add broadcast feature for elementwise logical op	5 years ago
Yang Zhang	63eef7632e	Fix clip input check (#26683 ) * Fix clip input check * Fix default min/max value * Allow both max and min to be None * Register op change * Revert OP signature change	5 years ago
Zhen Wang	f9066e6a6f	Update the demo code and the doc of varbase.backward. (#26506 ) * update the demo code and the doc of varbase.backward. * update the doc of the fake interface `paddle.fluid.Variable`. * remove BackwardStrategy.	5 years ago
Wilber	1c898b66d6	add bug fix enum. (#26736 )	5 years ago
Zhou Wei	8071d23073	fix bug that can't print int8_t (#26712 ) fix bug that can't print int8_t	5 years ago
joejiong	f311d3c1cf	Fix pow api type error with python side method, merge elementwise_pow and pow. (#26163 ) As the title	5 years ago
yongqiangma	e4cc6a28b0	Norm op support 2-axis (#26492 )	5 years ago
chalsliu	dc56c89822	Add the option to execute unit tests only at night (#26669 ) * Add the option to execute unit tests only at night * set ut nightly label for 3 cases.	5 years ago
xiaoting	89d7d86684	add intepolte_v2 (#26520 ) * add intepolte_v2 * fix linear interp * polish unittest, test=develop * update code samples to 2.0 API, test=develop * remove warning, test_develop * add name in attrs, test=develop * polish code, test=develop * change Align to align, test=develop * fix unittest in py3,test=develop * fix coverage, test=develop * fix coverage, test=develop * fix for windows ci, test=develop * fix coverage, test=develop	5 years ago
Adam Osewski	c2c689582e	Update Paddle-Lite commit hash. (#26413 ) * Update Paddle-Lite commit hash. * Add BF16 data type to VarTyp protobuf message.	5 years ago
Zhang Ting	97cebfa4d3	add dtype for unique (#26655 ) * update doc, test=document_fix * add attr(dtype) * refine code	5 years ago
lilong12	1c68138327	[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552 ) add collective op for cpu using gloo and paddle.distributed.* apis	5 years ago
joanna.wozna.intel	559e43eee4	Small change in conv2d and quantize pass (#26671 )	5 years ago
Bai Yifan	8986a82131	fix adaptive gpu grad bug, add doc refine (#26660 )	5 years ago
wawltor	286eca2d9e	update the code for the topk v2 add the top v2 for the paddlepaddle api 2.0	5 years ago
whs	f82384113b	Fix atomicAdd in grid sample op and affine grid op (#26647 ) test=develop	5 years ago
Wilber	32ba8602c6	Enhance py_func error info message. (#26557 )	5 years ago
chalsliu	cb3f131f1c	Set timeout properity for a few unitests	5 years ago
石晓伟	32ceacf317	update op_version_registry, test=develop (#26644 )	5 years ago
RandyLi	2f5bdd8dc7	Remove WOBOQ, gen_html() and sphinx (#26128 )	5 years ago
Dong Daxiang	08d736ad78	【paddle.fleet】add cudnn related strategies to DistributedStrategy (#26598 ) * add cudnn related strategies to DistributedStrategy	5 years ago
Zhang Ting	0a895bc0df	improve unique op (#26537 ) * add unique_v2 op * remove unique_v2 op * update doc	5 years ago
whs	a004dfde3d	Use atomicAdd defined in paddle fromework (#26631 ) test=develop	5 years ago
LoveAn	02fc1fef8b	Fix the cmake-function named inference_download_and_uncompress on Windows (#26512 ) * Fix the cmake-function named inference_download_and_uncompress with Windows, test=develop * Fix some problems when remove limit of unittests on Windows, test=develop * Using URL to download file instead of DOWNLOAD_COMMAND. test=develop	5 years ago
YUNSHEN XIE	a8b5741fb4	add a few unittests for setting timeout properity (#26630 )	5 years ago
zhangchunle	ef317b4b14	add mac tests failed exitcode (#26611 )	5 years ago
wanghuancoder	c1f5df5269	optimized transformation form tensor to numpy (#26447 ) * optimized transformation form tensor to numpy, test=develop * optimized transformation form tensor to numpy, pass pre-commit, test=develop * modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop * modify py:array construct, test=develop * fix _fetch_var to use deep copy, test=develop	5 years ago
zhupengyang	c80fcf901e	reduce_mean error if keepdim=True and reduce_all=True (#26614 )	5 years ago
whs	a065a24232	【2.0 API】Enhance affine grid operator (#26385 ) * Enhance affine grid operator: 1. Add cuda kernel 2. Add align corners options test=develop * Move new affine_grid api to functional test=develop * Add CUDA kernel for affine_grid. test=develop * Add more unitest for grid sample API test=develop	5 years ago
Qi Li	6f69fbc8ea	fix elu grad whne alpha less then zero, test=develop (#26543 )	5 years ago
whs	786373ba29	Use atomicAdd defined in paddle framework (#26628 ) test=develop	5 years ago
ruri	1f82c0cd62	[Api2.0] add pixel shuffle (#26071 )	5 years ago
Zhou Wei	1ed74aae7c	fix msbuild log level (#26607 )	5 years ago
wanghuancoder	422a162019	api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399 ) * api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear, test=develop * api2.0 fix code examples, test=develop * modify test_bilinear_api, about place,to_tensor , test=develop * re pass pre-commit, test=develop * Update common.py * fix BilinearTensorProduct ci error, test=develop	5 years ago
wanghuancoder	6e823cfec3	add op_function_generator.exe retry in windows, test=develop (#26591 ) add op_function_generator.exe retry in windows	5 years ago
石晓伟	fa08a834be	update op_version_registry, test=develop (#26592 )	5 years ago
whs	79539cf198	【2.0 API】Add CUDA kernel and enhance options for grid_sample (#26576 ) This PR enhance CPU kernel and add new CUDA kernel to make grid_sample support: - align_corners: with bool type. - padding mode: which can be in ['zeros', 'reflect', 'border'] - Interpolation mode: which ca be in ['bilinear', 'nearest'] The old CPU and CUDNN version only support align_corners=true, padding_mode='zeros' and interpolation_mode='bilinear'. The behavior of the new version op in default mode is compatible with the old version.	5 years ago
Guanghua Yu	8645591d66	support fp64 in huber_loss cuda kernel (#26583 )	5 years ago
yaoxuefeng	efee426742	support generator seed in related kernals test=develop (#26495 )	5 years ago
Zhong Hui	bf4a4636f1	change to use bce_loss op, add shape check for bce_loss change to use bce_loss op, add numel check for bce_loss.	5 years ago
ShenLiang	0e81626081	add div, floor_div, remainder (#26562 ) * add div, floor_div, remainder	5 years ago
石晓伟	656e60b18f	new class: op_version_registry, test=develop (#26542 )	5 years ago
qingqing01	24566e951c	Support empty bbox in bipartite math op (#26488 )	5 years ago
Jack Zhou	199b0c7c1b	Add isfinite v2 op (#26344 ) add the isnan, isfinite, isinf api for the paddle 2.0	5 years ago
Zhou Wei	28554c3f85	add --user for pip (#26440 )	5 years ago
wangchaochaohu	ebf9b2125e	add paddle.gather for API2.0 (#26455 )	5 years ago
wangchaochaohu	9219b79104	gather_nd Op for API 2.0 refine (#26540 )	5 years ago
zhupengyang	9b14117cac	logsumexp: impl kernel, refine docs (#26307 )	5 years ago
Wojciech Uss	5c2b9258a6	Fix (de/re)quantize cache keys (#26549 )	5 years ago
YUNSHEN XIE	df7fe1fe23	fix unittests run with error of Expression too big (#26573 )	5 years ago
wawltor	6b28456ed0	add the argmax, argmin for the api2.0 * add the new api and op for the argmax, argmin	5 years ago
LielinJiang	d26ae9ad87	Update conv_transpose api (#26427 ) * update conv_transpose api	5 years ago
lilong12	faa9b97b78	fix cscatter, test=develop (#26554 )	5 years ago
WangXi	45711dade7	【API】rename div to divide, add floor_divide, remainder (#26434 )	5 years ago
LutaoChu	4e0c6d91aa	add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel.	5 years ago
zhupengyang	f8863e0603	leaky_relu and LeakyReLU: alpha->negative_slope (#26216 )	5 years ago
ShenLiang	c609066074	Add Matmul op (#26411 ) * add matmul_v2	5 years ago
Leo Chen	aa2a9b5d89	add bernoulli op (#26511 ) * add bernoulli op * fix cuda kernel and add unit test * refine doc * fix uniform	5 years ago
Adam	f3909020de	Add mechanism for blocking oneDNN cache clearing (#26502 ) * Add mechanism for blocking oneDNN cache clearing * Review changes and Add thread guards	5 years ago
ShenLiang	b6eb37f5b3	add error message for cholesky (#26444 ) * add error message	5 years ago
QingshuChen	138ecf24aa	support Baidu Kunlun AI Accelerator (#25959 ) * support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun	5 years ago
yaoxuefeng	4f259354d2	mod cvm test=develop (#25146 ) * mod cvm test=develop * mod code format test=develop	5 years ago
wangchaochaohu	e167e87974	【API2.0】add masked_select Op for API2.0 (#26374 )	5 years ago
Pei Yang	379222c3f1	add output scale and trt op teller support for hard_swish and hard_sigmoid (#26499 )	5 years ago
zhupengyang	6e5670b8bd	mean: not support int32, int64; add check for axis (#26401 )	5 years ago
zhupengyang	4ad504e7c7	hardshrink: support threshold < 0 (#26403 )	5 years ago
lilong12	e92f770c42	Add collective ops (reduce) (#26340 )	5 years ago
wangchaochaohu	bdb805505e	【API2.0】add numel API for paddle test=develop (#26311 )	5 years ago
wangchaochaohu	2073ffc04d	Enhance the data type of linspace API (#26247 )	5 years ago
hong19860320	40d193ed17	Add the ReLU6, Tanhshrink, SELU, Softplus, Softshrink and Softsign for the api 2.0 (#26376 )	5 years ago
Chen Weihang	9108282883	Polish framework error message part 5 (#26204 ) * polish framework error msg part 5 * revert enforce change * refine error type * trigger ci check * polish details by review comment	5 years ago
Zhaolong Xing	f00f982a02	add cub impl for arg max, min (#25941 ) test=develop	5 years ago
Zhang Ting	6914a12f82	rename the inputs of allclose (#26360 ) * rename input * add unittest, test=develop * use paddle.data instead of fluid.data, test=develop	5 years ago
YUNSHEN XIE	e3612de8d7	add failed unittests retry (#26342 )	5 years ago
littletomatodonkey	bcf03273f6	add pad func (#26106 ) * add pad func * add pad * test=develop, add pad op and apis * restore pad2d * test=develop, fix paddl declare * fix pad interface * test=develop, fix pad * test=develop, add all pad api and cos_sim * test=develop, remove padding default value * test=develop, rename var to tensor * test=develop, add more tests * test=develop, rename tovar to totensor * test=develop, fix init * test=develop, add more test * test=develop, add more tests	5 years ago
Chengmo	eeeef957c7	Fix ps gpu (#26218 ) * support ps-gpu	5 years ago
Zhong Hui	6cbeafb6c0	add zero norm, inf norm support for p_norm op (#26364 ) * add zero norm, inf norm support for p_norm op * fix the invalid argument check, fix the dtype problem in test case.	5 years ago
tianshuo78520a	029390b1d2	fix ci bug (#26276 )	5 years ago
Tao Luo	1b03ab3899	set opencv-python <=4.2.0.32 (#26415 )	5 years ago
Zhaolong Xing	b7a86e92a8	fix dy shape bug in trt7.1 (#26273 ) test=develop	5 years ago
ceci3	56890dc729	Add SyncBatchNorm (#26032 ) * add SyncBatchNorm,test=develop	5 years ago
GaoWei8	1fbee267d4	remove scope in cudnn lstm (#25188 )	5 years ago
Zhou Wei	da29760d58	add msvc log from quiet to minimal (#26383 )	5 years ago
Pei Yang	b757466b0d	fix trt dynamic ernie serialization unit test (#26228 )	5 years ago
Wilber	3ec0bcbbb8	[Bug] Fix prune for save_inference_model about transformer (#25347 )	5 years ago
cc	3f816bc8b4	[Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639 ) * Conv2d_transpose and mul support channnelwise quantization, test=develop * Skip collecting out threshold for output tensor of which the type is not fp32 or fp64, test=develop * Fix error in test_user_defined_quantization, test=develop * Add depthwise_conv_bn_fuse, test=develop * Add conv_transpose_bn_fuse_pass for post_training_quant, test=develop	5 years ago
lilong12	638bbb6153	Improve expand as (#26290 ) align expand_as op to expand.	5 years ago
Thunderbrook	a83e0f264c	fix heter proto (#26093 ) test=develop	5 years ago
Leo Chen	049ac56c08	Print user-friendly error message in core.ops [part 2] (#26377 )	5 years ago
zhupengyang	586a6dd358	log_softmax and LogSoftmax: impl kernel and refind docs (#26088 )	5 years ago
yaoxuefeng	23261ff44b	add cpu random Generator (#26013 )	5 years ago
Sylwester Fraczek	69742bd9a4	Enable mkldnn layout conversion (#25778 ) * enable mkldnn layout conversion * review fix: remove tmp_place * fix test mkldnn swish * add UT for PrepareData CPU->MKLDNN * add #ifdef PADDLE_WITH_MKLDNN * Force-push commit Co-authored-by: grygielski <adam.grygielski@gmail.com>	5 years ago
Leo Chen	672578a797	Print user-friendly error message in core.ops (#26261 ) * print user-friendly error message * adjust error sumary	5 years ago
Zhou Wei	5017aa76e6	set default python3,fix incompatible,cache dir for third party,unify error code,for windows (#26178 ) * set default python3 for paddle windows,test=win * set default python3,cache dir for third party,error code,test=win * fix some incompatible * fix some error * set virtual environment,test=win	5 years ago
Jack Zhou	6d22f5c73e	Add PADDLE_ENFORCE in nll loss cuda kernel (#26294 ) * add nll loss API, update demo code of the comment	5 years ago
wangchaochaohu	0b81d76310	[API2.0] add op for cudnn version query test=develop (#26180 )	5 years ago
lilong12	241b44db14	[API 2.0] adaptive expand op to use shape instead of expand_times (#26206 ) * adaptive expand op to 2.0 (align to torch.expand) , test=develop	5 years ago
wangchaochaohu	bb11cbc250	[API2.0] add Device api (set_device and get_device)(#26103 )	5 years ago
Zhou Wei	6de463d3d1	expose and unify the Tensor concepts to the user (#25978 ) * expose and unify the Tensor concepts to the user * expose tensor to user * add copy place for Tensor * add copy place for Tensor * add note * add macro PADDLE_WITH_CUDA * remove RUN_TYPE=DIST * fix some error	5 years ago
lilong12	fbd4d3cc97	[API 2.0] add paddle.tile op (#26245 ) * add tile_op, test=develop	5 years ago
Zhou Wei	20147ace3f	fix_copy_if_different (#25868 )	5 years ago
Wilber	c84aa9c61f	update diff val. (#26242 )	5 years ago
Yang Zhang	a2d3e5c03b	Fix `paddle.abs` docstring (#25942 ) test=document_fix remove activation wording	5 years ago
Yang Zhang	22165934bc	Fix `paddle.acos` docstring (#25958 ) test=develop,test=document_fix remove activation wording	5 years ago
Yang Zhang	a5b5b00e02	Fix `paddle.asin` docstring (#25967 ) test=develop,test=document_fix remove activation wording	5 years ago
Yang Zhang	c758765769	Fix `paddle.atan` docstring (#25968 ) test=develop,test=document_fix remove activation wording tanh -> tan	5 years ago
Yang Zhang	c4e480efc5	Fix `paddle.cos` docstring (#25969 ) test=develop,test=document_fix explain input/out put range and out of boundary behavior	5 years ago
liuyuhui	935da32d25	【paddle.fleet】upgrade fleet: modify role_maker (#26038 ) * add unittest for paddlerolemaker with gloo	5 years ago
wawltor	2d6cc0b125	support the tuple for attribute of axis in min, max for api2.0 Update the code for the min,max, test=develop	5 years ago
Dong Daxiang	50a5bcfc9d	【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. (#26186 ) * move paddle.fleet to paddle.distributed.fleet	5 years ago
Leo Chen	ffe52b4452	[OpDevOptimize] Add common infershape functions (#26096 ) * add unchaged infershape function * add broadcast infershape function * fix bug * rename infershape functions * add UnaryOpUnchangedInferShapeCheckAxis * add error message * add test for common infer shape functions * dont update existed ops * dont update op_desc.h * add more test * add error check, refine error message	5 years ago
Leo Chen	2d95280e1f	Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903 ) * add auto_cast, test=develop * add loss scaler, test=develop * add comments, test=develop * refine code, test=develop * refine code, test=develop * do not set flags automatically, test=develop * fix custom op bug, test=develop * add more test, test=develop * refine enable logic, test=develop * enable amp test with GPU, test=develop * add unittest * add test for found_inf * follow comments * follow comments * remove global variable, use singleton * add some notes * update comments * update comments * update comments * add use_dynamic_loss_scaling argument * refine found_inf * refine found_inf	5 years ago
Chen Weihang	838e36e9ed	Fix loaded variable suffix repeat error (#26169 ) * fix loaded var suffix repeat error * use new dygraph name for loaded param	5 years ago
Jack Zhou	dea41da715	add nll loss API for the paddlepaddle api2.0 * add nll loss API, update demo code of the comment	5 years ago
Wilber	fb72b192e7	[DOC] Fix dead link (#26154 )	5 years ago
wawltor	9c17b3c9f8	Add the max, min, maximum, minimum api for the API 2.0 * Add the max, min, maximum, minimum api for the API 2.0, test=develop	5 years ago
JZ-LIANG	54003b873e	【paddle.fleet】add lamb to fleet meta optimizer (#26025 ) add lamb to fleet meta optimizer	5 years ago
Yiqun Liu	1be6bf45ae	Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121 )	5 years ago
lidanqing	65b97d6215	GRU model xnli dataset C++ tester (#25534 ) * Add laxical GRU unit test performance works * Get model accuracy * model and data name to be confirmed test=develop * update model name and output format test=develop * update according to reviews test=develop * add accuracy check * accuracy check between native and analysis test=develop * fix a reading bug, fix gru passes sequence test=develop * fix passes sequence test=develop	5 years ago
Zhen Wang	a86e8c0eef	add more error info for these ops without double grad ops. (#25987 )	5 years ago
tianshuo78520a	75a1311400	Fix inference CI bug (#26080 ) * Fix inference bug * fix inference lib	5 years ago
MRXLT	6559229b7e	fix encryption infer (#25979 ) * add encrypt for inference lib * fix code;test=develop * fix test; test=develop * bug fix; test=develop * add MakeCipher;test=develop * fix bug;test=develop * move MakeCipher to paddle space; test=develop * fix include dir ;test=develop * add include dir; test=develop * move include; test=develop * move include; test=develop * fix for windows ci * fix cmake; test=develop * fix bug bug fix	5 years ago
lilong12	8caee2ad51	【paddle.fleet】add the support for multi-node training for pipeline (#25907 ) * add the support for multi-node training	5 years ago
LutaoChu	bf2db646de	fix cumsum op for API 2.0, optimize performance update cumsum api and fix up the cumsum op	5 years ago
Adam	1893cd6bb8	Add oneDNN relu6 op (#26037 ) * Add oneDNN relu6 op * Lint fixes	5 years ago
Zhaolong Xing	50f149a48e	fix cudnn workspace size problem during inference. (#26021 ) test=develop	5 years ago
Zhou Wei	1f74b94d3f	fix compile warning on windows MSVC, fix paddle_build.bat more safe (#25933 ) * Fixed compile warning about incorrect compile options,fix paddle_build.bat * fix paddle_build.bat to more safe	5 years ago
tangwei12	c14ec8782b	【paddle.fleet】Feature/fleet ps api 2.0 (#25857 ) * add paddle.fleet.AsyncOptimizer Co-authored-by: dongdaxiang <dongdaxiang@baidu.com>	5 years ago
Chen Weihang	3c8daa9b89	Add pin memory control for BufferedReader (#26026 ) * add pin memory control * fix buffered reader init problem * fix unittest error * add unittest for coverage	5 years ago
Chen Weihang	ad4a0466a5	Add cuda pinned place branch in slice op GetExpectedKernelType (#26027 ) * add cuda pinned place branch * add unittest * add skip when not gpu	5 years ago
zhangchunle	86794cccbd	separate approve (#26035 )	5 years ago
Feiyu Chan	e853ece0a2	update document template for unary elementwise layers (#25896 ) 1. update document template for unary elementwise layers(a.k.a. activation layer); 2. remove generate_op_noattr and use generate_activation instead; remove redundant function copies; 3. minor update for docstring to fix rst format errors. 4. fix doc for Rsqrt OP 5. add sample code for each activation separately; 6. remove the unused deprecated decorator.	5 years ago

... 3 4 5 6 7 ...

17733 Commits (766b35152713d4410d4c5b05a3fa5e5a64a6fa60)