Paddle

Commit Graph

Author	SHA1	Message	Date
iducn	f1074e3b19	hide the token output to safely (#28716 )	4 years ago
joejiong	32b90b1c2d	add log10 (#28576 ) Add new operator log10	4 years ago
Leo Chen	3d09929b1f	Add check for non-dispensable input (#28666 ) * Add check for non-dispensable input * fix typo	4 years ago
Chen Weihang	7eeb99fe02	Add basic hook classes for dygraph & implement reduce hook (#28584 ) * add base hook classes and reduce hook impl * fix constructor typo * polish comment format * refactor baisc hook class design * polish design details	4 years ago
Guo Sheng	858ffa0c8b	Fix the dropout setting when not initialized in rnn_op. (#28561 ) test=develop	4 years ago
Jacek Czaja	6d8d3d4c22	[oneDNN] Layer norm bf16 kernel (#28619 )	4 years ago
lilong12	80d2024644	bug fix, test=develop (#28674 )	4 years ago
Zhou Wei	bf143652ac	fix lstm OP compile error on windows (#28667 ) * add unittest and check unittest for windows * fix lstm OP compile error on windows	4 years ago
石晓伟	57dab959ca	add datanorm op new scale_w register (#28657 ) Co-authored-by: yaoxuefeng6 <yaoxuefeng@baidu.com>	4 years ago
cc	65aac81191	Fix fake_quant error when cout > 1024, test=develop (#28603 )	4 years ago
lilong12	b2f7ab6636	bug fix, test=develop (#28648 )	4 years ago
wawltor	8f2656ef5c	fix the gradient bug for the topk v2	4 years ago
wangchaochaohu	a972c33fd7	refine gather OP performance for dynamic mode (#28587 )	4 years ago
joanna.wozna.intel	2cb71c0cde	Add checkpoint to quantize (#28612 ) * Add checkpoint to quantize * Change bfloat16 option	4 years ago
lidanqing	804271cff9	Op version python mkldnn_inplace test (#28354 ) * add mkldnn inplace op version test * update mkldnn_inplace fuse pass * update the inplace test	4 years ago
pangyoki	b889a0cee2	add gaussian_random op_version (#28602 )	4 years ago
YUNSHEN XIE	cf2c42a937	fix exec nightly error on mac (#28567 )	4 years ago
Guo Sheng	110febdc54	Fix gradients with ignore_idx in softmax_with_cross_entropy (#28622 ) * Fix gradients with ignore_idx in softmax_with_cross_entropy. test=develop * Fix gradients with ignore_idx in softmax_with_cross_entropy on cpu. Remove softmax_with_cross_entropy from op_threshold_white_list. test=develop * Fix test_softmax_cross_entropy_op.py. test=develop	4 years ago
Wilber	8b97bb2e1f	Update cmake for arm ft and fix a bug for Predictor dtor. (#28586 )	4 years ago
Leo Chen	f962bd3432	Fix cudnn workspace limit in cudnn-8 (#28611 )	4 years ago
Leo Chen	90805e2df7	Register op_version for new attribute use_addto (#28463 ) * register op_version for addto * upgrade pass capability * change eq to le * change eq to le * fix merge	4 years ago
danleifeng	a24d186814	fix nccl init failed in parallel dygraph mode (#28497 )	4 years ago
Zhou Wei	93c39779b4	open a part of GPU unittest for windows (#28378 ) * open a part of GPU unittest for windows * open a part of GPU unittest for windows	4 years ago
lilong12	ed9dd7c9f0	add send and recv ops (#28590 ) * update, test=develop	4 years ago
Zhong Hui	a829357e4d	register the op version for some ops	4 years ago
Zhou Wei	bf6e7cba7a	updata 2.0 API english doc (#28525 ) * make Numpy version is below 1.19.3 * fix 2.0 doc	4 years ago
YUNSHEN XIE	7b1619e69b	disable test_trt_dynamic_shape_transformer_prune,test=document_fix (#28588 )	4 years ago
Zhou Wei	849467b5aa	fix user set CUDA_VISIBLE_DEVICES start/end with quotation marks (#28547 )	4 years ago
Shang Zhizhou	8699f38d08	TRT support for pruned transformer models; fix the bug that TensorRT does not support DeletePass (#28517 ) * skip_layernorm_op done * add unittest * slice op convertor support trt < 6 * skip_layernorm only work in ernie	4 years ago
joejiong	08d2413142	add log2 operator (#28319 ) As the title	4 years ago
lidanqing	0fc181dbd0	[Fix bug] If the pass name is not found, IsCompatible should return false (#28475 )	4 years ago
Wilber	1bf4836580	[Inference] Add TryShrinkMemory interface. (#28409 )	4 years ago
wangchaochaohu	c52fe48f6f	fix the GetKernelTypeForVar of input for fluid.gather (#28534 )	4 years ago
wangchaochaohu	d7cfee9b31	Checkout point add (#28488 ) * upgrade pass capability	4 years ago
YUNSHEN XIE	98dc11bb6a	add monitoring for executive ut at night (#28377 ) * add monitoring for executive ut at night * fix some error for paddle_build.bat * fix some error * fix some error in windows * fix some error on windows	4 years ago
Pei Yang	75196cda40	Paddle-TRT int8 support mul op channelwise quant (#28422 ) * paddle-trt support mul channelwise quant * add support for depthwise_conv2d * add errmsg for unsupported op type	4 years ago
zhupengyang	47cbf61dd4	fix softmax unittest float16 random error (#28480 )	4 years ago
Zhou Wei	53e9aa948d	remove diff with develop (#28504 )	4 years ago
YUNSHEN XIE	369605be1d	fix cmake error when execute build_inference_lib (#28503 )	4 years ago
Wilber	645e999afc	fix api_impl test. (#28483 )	4 years ago
YUNSHEN XIE	1e698c600e	fix cmake error when setting ut timeout properity (#28492 )	4 years ago
wangchaochaohu	e14ed71cc2	refine the performance of gather Op (#28458 )	4 years ago
wanghuancoder	e29ab5eacb	clear clcache cache file and reopen clcache (#28384 ) * clear clcache cache file and reopen clcache, test=develop * reopen clcache, test=develop	4 years ago
YUNSHEN XIE	ba0756325a	exec ut no more than 15s 1 (#28439 ) * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix * test for limiting ut exec time as 15S * fix an error caused by cannot find ut * fix some error * can not find test_transformer * fix error caused by ut not run in windows * fix error caused by Compiler Options * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt * setting timeout value to 120s for old ut * add the timeout value setting * fix error caused by ut only run in coverage_ci * add analyzer_transformer_profile_tester * fix some error * fix some error * fix error with inference option * fix error with inference option setting as ON_INFER * add some ut to set timeout * modified some option * fix error * fix some timeout error * fix error * fix error * fix timeout for test_analyzer_bfloat16_resnet50 * fix error * setting timeout properity for some ut * first pr for new ut timeout as 15S	4 years ago
Chen Weihang	155b4f9b6c	Remove selected rows all reduce over height check (#28460 ) * remove slelected rows all reduce over height check * polish unittest	4 years ago
taixiurong	fad4744aa4	fix crash in adam in xpu, *test=kunlun (#28433 )	4 years ago
QingshuChen	6bba8e57b1	fix batch_norm_xpu bug & remove xpusimulator dependence (#28430 ) *test=kunlun	4 years ago
Wilber	ced5c40c41	Update memory release interface. (#28456 )	4 years ago
joanna.wozna.intel	7821759d48	Add bfloat16 softmax and gelu (#28394 ) * Add bfloat16 softmax and gelu * Add pass attr bfloat16_enabled_op_types * Changes from review	4 years ago
iducn	ba0fe0a812	revert the modified shell script (#28453 )	4 years ago
Chen Weihang	c42e656179	Add retry for dygraph parallel socket bind (#28404 ) * add retry for dygraph parallel socket bind * change to loop always * fix writing error	4 years ago
石晓伟	c41fd033e5	check op_version_registry in CI test, test=develop (#28402 )	4 years ago
Jacek Czaja	ca41541472	[oneDNN]Sum bf16 kernel (#28382 ) * - Added sum bf16 oneDNN test=develop * - Fix to UT of sum bf16 test=develop	4 years ago
Chen Weihang	23439b1688	show cpp stack when catch signal (#28415 )	4 years ago
Leo Chen	44a476c2ab	support cuda pinned place (#28416 )	4 years ago
lidanqing	12b9587be5	Add conv_bias pass version python test (#28278 ) * add conv_bias pass version test * update according to reviews	4 years ago
Wilber	05114693cf	[Inference] Memory modification for ShrinkMemory. (#28355 )	4 years ago
Leo Chen	8b2436a776	Add broadcast_shape api (#28257 ) * add broadcast_shape api * add ut * follow comments * add example code, test=dodument_fix * update example code, test=document_fix	4 years ago
石晓伟	21a63f6f90	enhance the op_version_registry, test=develop (#28347 ) * enhance the op_version_registry, test=develop * add unittests, test=develop * enhance the op_version_registry, test=develop * fix bugs, test=develop * revert pybind_boost_headers.h, test=develop * fix a attribute bug, test=develop	4 years ago
YUNSHEN XIE	c1c3e21726	retry will not be executed when the number of failed ut is greater than 20 (#28374 ) * retry will not be executed when the number of failed ut is greater than 20 * add log display * fix some error * fix some error * fix some error * fix some error	4 years ago
Shang Zhizhou	ea851796e5	Optimize ernie model inference performance in TensorRT; support variable-length input (#28367 ) * fp16 result ok * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS * auto detect special slice op converter for ernie with trt oss * ernie oss only support fp16 * fix special_slice_plugin serialize bug * matmul in tensorrt ok * ernie unittest ok * add matmul tensorrt unittest * remove demo code	4 years ago
Jacek Czaja	84cc61b2cd	[oneDNN] sum op refactor (#28318 )	4 years ago
Wilber	6f0f45f69c	copy_to_cpu support uint8 (#28372 )	4 years ago
Wilber	09fd2b2aab	Paddle support compile on sw (#27858 )	4 years ago
chen zhiyu	953302d9eb	add musl docker build script (#28027 ) * add musl docker build script * rm space test=document_fix * fix some docs and types errors test=document_fix	4 years ago
Leo Chen	6115c14fca	Pool2d cuda kernel supports fp16 (#28316 ) * pool2d cuda kernel supports fp16 * fix compile issue of template * add ut	4 years ago
Zhou Wei	f41104efa3	fix compile out of memory temporary (#28346 )	4 years ago
Guo Sheng	9a600df373	Add rnn_op (#28197 ) * Add rnn_op. test=develop * Fix rnn_op grad maker's drop_empty_grad. test=develop	4 years ago
wangchaochaohu	0f4b6247c8	refine the gpu config for performance optimization (#28291 )	4 years ago
Huihuang Zheng	acc11c2a62	Retry CUDA Initialization to Fix Random Failure, test=develop (#28323 ) This PR is follow up of #28213. On that PR we tried to decrease GPU usage, however the CI still randomly failed. So I added retry logic for the initialization of nccl and cusolver. If the initialization failed, we can retry to avoid the random failure.	4 years ago
wangguanzhong	5262b02585	add generate_proposals_v2 op (#28214 ) * add generate_proposals_v2 op	4 years ago
石晓伟	d9b5f1261c	update the version of pybind, test=develop (#28284 ) * update version pybind to v2.4.3, test=develop * update unittests, test=develop	4 years ago
Leo Chen	18c86fb2fb	hide some logs of p2p (#28307 )	4 years ago
lidanqing	8cd1c102d9	Enable GRU infer model running CAPI (#28313 ) * enable infer model running CAPI * output size should bigger than 0	4 years ago
wangguanzhong	1c385e26f9	add op_function_generator for box_coder (#28303 ) * add op_function_generator for box_coder * fix format	4 years ago
iducn	f763cb81a6	Modify the shell script according to the specification (#28302 ) * 01:Modify the shell script according to the specification * 01:Modify the shell script according to the specification	4 years ago
joanna.wozna.intel	571a63e7ec	Add bf16 transpose2, reshape2, concat ops (#28195 )	4 years ago
Guanghua Yu	e8f2614da5	Enhance multiclass_nms op to support LoD for dygraph mode (#28276 ) * Enhance multiclass_nms to support LoD for dygraph mode * fix some error in multiclass_nms * update GetLodFromRoisNum to GetNmsLodFromRoisNum	4 years ago
石晓伟	842a4e5abd	fix analyzer_capi_tester, test=develop (#28289 )	4 years ago
Leo Chen	8953038400	Fix transpose in conv cudnn kernel when addto enabled (#28295 )	4 years ago
Tao Luo	e1e666a05f	fix conv mkldnn build error (#28288 )	4 years ago
Jacek Czaja	0b678d401b	- sum (#28233 ) test=develop	4 years ago
Jacek Czaja	c11d9b3035	[oneDNN ] conv2d fwd&bwd optimization (#27871 )	4 years ago
Zhou Wei	8f87c7eac4	fix judge bug of errorlevel on cmd (#28271 ) * fix judge bug of errorlevel * fix some error	4 years ago
wangxinxin08	41d26a8287	update matrix nms op to api 2.0 (#28265 ) * update matrix nms op to api 2.0 * modify code according to review	4 years ago
Leo Chen	7fcb32ddf3	fill_constant op supports NINF (#28270 )	4 years ago
wangchaochaohu	6905608cea	refine yolo box Op for performace optimization (#28155 )	4 years ago
wangchaochaohu	cdadc8f019	refine temporal_shift_op for performance optimization using gpu kernel config (#28114 )	4 years ago
Zhang Ting	fdc06f2158	add Fuse bn add act pass (#28196 ) * add fuse_bn_add_act pass	4 years ago
Chen Weihang	813b2ade34	Enrich the python error types of paddle & polish format (#28124 ) * add multiple exception type * define all exception & polish compile pystack * mapping paddle error to python exception * polish static mode error format * fix failed unittests * fix dytostatic test_error * fix check_nan_inf failed * add unittest for coverage * revert some code try to solve compile error * refactor enforce & error change * polish code & add unittest	4 years ago
Adam Osewski	7db747d9e8	oneDNN BatchNorm + Act fusion pass. (#27912 )	4 years ago
Zhou Wei	fb7f85291b	fix print tensor place,add cpu/cuda/pin_memory API for Tensor (#28200 )	4 years ago
tianshuo78520a	11089cacdb	Fix xpu notest (#28204 ) * Fix xpu notest;test=kunlun * fix * test=kunlun * test=kunlun	4 years ago
mapingshuo	81244fbfab	add sharding strategy in fleet(#27900 ) * add sharding	4 years ago
Chen Weihang	2babd6ff67	Add compile limit for PADDLE_ENFORCE without error message (#28221 ) * add compile limit for paddle enforce * polish elementwise_op_function.cu.h * fix failed unittest * fix windows compile failed * detail polish * revert no type constructor	4 years ago
lidanqing	4ea2330759	use FLAGS_use_mkldnn to prevent unnecessary attrs copy (#28146 )	4 years ago
tianshuo78520a	d835118dbd	Hide log message (#28220 )	4 years ago
Double_V	2db77be423	fix wrong data type, test=develop (#28203 )	4 years ago
Feiyu Chan	efe6e2840c	fix strided_slice_op's GetExpectedKernelType (#28192 ) * fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace * add unittest for tensors in cuda pinned place * skip test for cuda pinned place on cpu machines	4 years ago
Zhou Wei	271ee58f5c	Enhance build detection (#28123 ) * fix optimizer init * Enhance the detection of whether to keep the build directory * Enhance the detection of whether to keep the build directory	4 years ago
Leo Chen	1f3be85914	Fix bug of fetch_async_op_handle when fetching the feed variable (#28194 ) * fix bug of fetch_async_op_handle * revert some changes of test_buffer_shared_memory_reuse_pass * revert some changes of test_buffer_shared_memory_reuse_pass	4 years ago
WangXi	e450823b8b	Fix nccl op test failed, test=develop (#28172 )	4 years ago
tianshuo78520a	c226b2e45a	update dockerfile (#27589 ) * update dockerfile * update dockerfile * update dockerfile * update dockerfile * add opencv in ci * update cidockerfile * test nccl * fix diff * fix dockerfile * update ubuntu nccl2.7.8 * update ubuntu nccl2.7.8	4 years ago
Wilber	f935ca8a50	[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932 )	4 years ago
Zhou Wei	68c473e3e0	fix Automatic GPU detection failed on windows (#28148 )	4 years ago
danleifeng	f29fb396df	dygraph nccl init support host domain name (#28107 ) * nccl init support hostname and ip; test=develop	4 years ago
wangguanzhong	5cd97a1cb0	support multiclass nms for multi-batch, test=develop (#28154 )	4 years ago
Pei Yang	602d2ce5c9	change avg pooling from trt plugin to trt layer (#28032 )	4 years ago
Double_V	5289b72acc	fix Wmaybe-uninitialized warning in pooling.cc, test=develop (#28126 )	4 years ago
Zhou Wei	5d7000215a	fix dynamic_loader more safe and error message on windows (#28117 )	4 years ago
tianshuo78520a	d87d286707	Add build paddle inference (#28131 ) * Add build paddle inference;test=document_fix * Add build paddle inference;test=document_fix	4 years ago
wangguanzhong	d1e1f17482	fix generate_proposal_labels in cascade-rcnn series model, test=develop (#27892 ) * fix generate_proposal_labels in cascade-rcnn series model, test=develop * fix example code & unittest, test=develop * update code from review comments, test=develop	4 years ago
Leo Chen	a911c19eb0	fill_constant op supports NaN and Inf (#28109 ) * fill_constant supports nan and inf * add ut	4 years ago
zhupengyang	6dd64b0a30	randperm run error in multi-gpus (#27942 )	4 years ago
Double_V	d43f75e4cc	add rois_num for roi_align xpu OP (#28077 ) * add stack pool2d roi_align xpu op,test=kunlun * error message opt, test=kunlun * add xpu unittest,test=kunlun * skip check grad,test=kunlun * fix boostget , test=kunlun * error message opt for XPU, test=kunlun * add rois_num for roi_align xpu OP, test=develop	4 years ago
xiaoting	e3d02c9574	rm max_input in conv2d for kunlun, test=kunlun (#28062 )	4 years ago
joanna.wozna.intel	a21b57109c	Add AVX512 instruction check for C-API (#28087 ) * Add AVX512 instruction check for C-API * Fix formatting	4 years ago
wangchaochaohu	463c72c2d9	refine gpu kernel config for Paddle (#28085 )	4 years ago
yinhaofeng	2cb1ecb99e	lookup_table_v2_op_xpu report errors;test=kunlun (#28064 ) * lookup_table_v2_op_xpu report errors;test=kunlun * lookup_table_v2_op_xpu report errors;test=kunlun	4 years ago
yinhaofeng	6f0c3d1f06	xpu adam op (#28031 ) * lookup_table_xpu op report errors;test=kunlun * add adam xpu op;test=kunlun * reset lookup * change adam wrong;test=kunlun	4 years ago
TeslaZhao	a5c95cd588	Add xpu transpose2 op.test=kunlun (#28086 )	4 years ago
Chengmo	5f04875c30	Fix xpu error message (#28061 ) * fix error message,test=kunlun * fix, test=kunlun	4 years ago
LutaoChu	c8d32c8c10	Fix diag OP bug on Windows Python3.8, remove the std::min	4 years ago
Pei Yang	a0b2f93689	reduce trt warning message (#28011 )	4 years ago
huangxu96	d466893820	Allclose op (#27891 ) * Still has bugs. * Fixed allclose_op bug, which cannot deal with some cases of fp64 inputs. * improved CUDA kernel performance. * Changed CUDA code. * Fixed a bug in cuda kernel which cannot deal with large dimension input, and added an unittest for it. * Add a test case for float32 input.	4 years ago
pangyoki	975bd8873b	Fix error message of multinomial op (#27946 ) * fix multinomial doc * fix multinomial error message * little doc change * fix Categorical class doc * optimize format of error message * fix CPU Kernel error message format * fix isinf and isnan error in WindowsOPENBLAS CI * delete inf and nan * add manual_seed in sample code * little error message change * change error message to InvalidArgument * add full point for error message and add manual_seed in CPU environment	4 years ago
Kaipeng Deng	b6eff4427c	update yolo_box support h != w. test=develop (#27327 )	4 years ago
Double_V	c1eed1fa24	error message opt for XPU, test=kunlun (#27972 ) * add stack pool2d roi_align xpu op,test=kunlun * error message opt, test=kunlun * add xpu unittest,test=kunlun * skip check grad,test=kunlun * fix boostget , test=kunlun * error message opt for XPU, test=kunlun	4 years ago
pangyoki	4c5b779a99	Add truncated_gaussian_random XPU kernel (#27861 ) * Add truncated_gaussian_random_op XPU kernel * Add truncated_gaussian_random_op XPU kernel, test=kunlun * little change, test=kunlun * change boost_get to BOOST_GET_CONST * change boost_get to BOOST_GET_CONST, test=kunlun * little change, test=kunlun * use Generator to generate random number and optimize format, test=kunlun * little change, test=kunlun * add TODO, test=kunlun	4 years ago
pangyoki	5b8e500135	Add gaussian_random XPU kernels (#27853 ) * Add gaussian_random XPU kernels * commit kunlun, test=kunlun * new version, test=kunlun * change boost_get to BOOST_GET_CONST, test=kunlun * use Generator to generate random number and optimize format, test=kunlun * add TODO, test=kunlun	4 years ago
pangyoki	74ce039743	Add uniform_random XPU kernel (#27846 ) * support uniform_random op on Baidu Kunlun * change dtype of attr shape from int to int64_t * kunlun ci, test=kunlun * new version, test=kunlun * change boost_get to BOOST_GET_CONST * change boost_get to BOOST_GET_CONST, test=kunlun * use Generator to generate random number and optimize format * run Kunlun CI, test=kunlun * add TODO, test=kunlun	4 years ago
xiaoting	abf4d52a74	Polish kunlun error (#27974 ) * polish error message,test=kunlun * polish error,test=kunlun * polish error,test=kunlun * polish error,test=kunlun	4 years ago
liuyuhui	3e9568653b	add cast/concat/assign xpu op (#27911 ) * addd * add cast_op_xpu, test=kunlun * fix bug for cast_op_xpu,test=kunlun * add concat_op_xpu, test=kunlun * slove conflicts, test=kunlun * fix bug,test=kunlun * add assign_op_xpu, test=kunlun * fix bug,test=kunlun * test=kunlun;test=develop * fix concat bug,test=kunlun * fix check_dygraph set in test_concat_op_xpu.py,test=kunlun * fix error message,test=kunlun Co-authored-by: mapingshuo <mps2012@yeah.net>	4 years ago
Guo Sheng	fa9d3fa5bf	Incorporate cudnn_lstm into LSTM api (#27217 ) * Incorporate cudnn_lstm into LSTM api. test=develop * Make coalesce_tensor support alignment optionally. test=develop * Reorganize RNN apis. test=develop * Fix cudnn rnn layout conversion. test=develop * Add sequence_length support for RNN cudnn implement. Add optional init_h and init_c gradient for cudnn_lstm_op. test=develop * Use create_parameter for rnn cudnn impl. test=develop * Move `self._flat_weight = self.create_parameter()` in RNNBase to main_program. test=develop * Update RNN api unittest to use set_device. test=develop * Fix set_place for unit tests of RNN apis. test=develop * Fix use_align in coalesce_tensor_op. test=develop * Adjust RNN apis arguments according to comments. test=develop * Polish documents for SimpleRNN apis. test=develop * Refine random seed in cudnn_lstm_op. Expose rnn params from sublayers to RNN. test=develop * Fix RNN saving for jit.save. Refine cudnn_lstm dropout behavior. test=develop * Fix doc of GRU. test=develop * Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop * Remove updates on cudnn_lstm temporarily. test=develop * Use ShareDataWith to avoid copying for cudnn_lstm_op test. test=develop * Refine random seed in cudnn_lstm_op. test=develop * Fix test_lstm by adjust ConcreteProgram buffer getter. test=develop * Use create_parameter instead of create_var for rnn._flat_weight for static graph usage. test=develop * Remove W input for cudnn_lstm to pass unused_var_check. test=develop * Add test_predict for RNN unit tests coverage. test=develop * Fix code style of rnn. test=develop * Fix F.rnn usage in rnn.py. test=develop	4 years ago
chentianyu03	05fd49e974	change paddle.fluid.layers.reduce_sum to paddle.sum in sample codes (#27998 ) * change paddle.fluid.layers.reduce_sum to paddle.sum in sample codes * format codes	4 years ago
Guanghua Yu	f94d053705	error message optimization in mean_xpu,softmax_with_cross_entropy_op_xpu,test=kunlun (#27967 )	4 years ago
Jack Zhou	d330cf66cc	Fix xpu enforce (#27978 ) * test=kunlun; Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast): * elementwise_div op * elementwise_max op * elementwise_mul op (with grad op) * elementwise_sub op (with grad op) * 0.05->0.01 * add xpu error message description;test=kunlun	4 years ago
lidanqing	7cb4a8b8f2	[oneDNN] Conv dilation support (#27914 ) * conv dilated mkldnn support: forward and backward pass * add mkldnn conv_transpose dilation UT test=develop * remove unnecessary PADDLE_ENFORCE * add int8 and bf16 dilated conv UT * update according to reviews	4 years ago
mapingshuo	64c2634995	fix kunlun kernel of reshape op (#27988 )	4 years ago
tangwei12	202bfab1be	Feature/large scale kv save base/delta (#27470 ) * add size method for large scale * add large scale UT * add ut for checkpoint	4 years ago
123malin	aa3b4ed717	【paddle.fleet】geo send sparse optimize (#27719 ) * test=develop, fix geo sgd communicator * test=develop, gloo_init_method * test=develop, bug fix for gloo http_init	4 years ago
Zhou Wei	2ac6c6c3af	fix bug of tensor copy of CUDAPinnedPlace (#27966 )	4 years ago
joanna.wozna.intel	840c521b77	Fix problem with flags fp32 and int8 (#27954 )	4 years ago
mapingshuo	5ccaaab8aa	reshape support bool, test=develop (#27944 )	4 years ago
Qinghe JING	4a4f773658	Add reduce sum and reduce mean xpu op (#27939 ) * add reduce xpu op test=develop;test=kunlun * add reduce xpu op test=develop;test=kunlun * add reduce xpu op test=develop;test=kunlun * add reduce xpu op test=develop;test=kunlun * add reduce xpu op test=develop;test=kunlun	4 years ago
Zhou Wei	bf412f4665	add tensor clone (#27953 ) * add tensor clone * fix unittest test_var_base	4 years ago
Feiyu Chan	2e845182d9	support channel last in BatchNormd 1. support channel last in BatchNormd (#27875) 2. fix a bug in batch_norm_op cuda kernel by extracting ResizeToChannelFist(Last), TransToChannelFirst(Last) to operators/layer_utils.h	4 years ago
guofei	6bbb6e7f45	Implement the function of OutScaleForTraining/OutScaleForInference in dygraph (#26601 ) * Implement the function of OueScaleForTraining/OutScaleForInference in dygraph test=develop	4 years ago
YUNSHEN XIE	fea09fe534	disable ut quickly (#27793 ) * disable ut quickly * fix some error * fix some error * install urllib2 package * use requests package instead of urllib2 * fix error caused by windows regular parameter * fix error on windows * fix some error * fix with format error * show disable ut in log * fix some error * fix some error * add the handling of error in executing get_quickly_disable_ut	4 years ago
chentianyu03	d05058d268	Remove and reorganize the alias of APIs (#27717 ) * modify cond while_loop to paddle.static.nn.cond * modify crop_tensor to paddle.crop * modify Variable to paddle.static.Variable * remove nn.beam_search, nn.beam_search_decode, nn.gather_tree * remove bpr_loss, center_loss, rank_loss, smooth_l1, teacher_student_sigmoid_loss, edit_distance, sampled_softmax_with_cross_entropy in nn.functional * remove apis in nn.functional.learn_rate.py * remove pool2d, pool3d, adaptive_pool2d, adaptive_pool3d in nn.functional * remove apis in nn.functional.vision * remove erf, soft_relu in nn.functional.activation * remove apis in nn.functional.extension * remove nn.functional.rnn * remove hash from nn.functional.lod * remove row_conv from nn.functional.extension * remove one_hot, pad2d, pad_constant_like from nn.functional.common * remove nn.gather_tree, nn.BilinearTensorProduct, nn.Pool2D, nn.Pad2D * remove apis from optimizer.__init * remove tensor.creation.fill_constant * remove elementwise_mul in nn.functional.common and modify to paddle.multiply * remove tensor.stat.reduce_mean * remove reduce_all, reduce_any in tensor.logic * remove apis in tensor.math * remove apis in tensor.__init__ * remove has_inf, has_nan in tensor.search * remove apis in framework.__init__ * remove apis in paddle.__init__ * remove apis in nn.functional.__init__ * modify removed alias apis to raw api in doc and unittests * fix remove grid_sample bug * modify removed alias apis to raw api in doc and unittests * modify removed alias apis to raw api in doc and unittests * modify removed alias apis to raw api in doc and unittests * modify removed alias apis to raw api in doc and unittests * modify removed alias apis to raw api in doc and unittests * modify removed alias apis to raw api in doc and unittests * delete alias api relastions in doc * reserve paddle.compat, paddle.sysconfig * remove unittest for paddle.reduce_all, paddle.reduce_any * modify removed alias apis to raw api in doc and unittests * recover paddle.save and paddle.load * resolve conflicts * fix sample code missing paddle.enable_static() bug * fix sample code missing paddle.enable_static() bug * fix to_string sample code error	4 years ago

17987 Commits (ebf689197d61af28110fa6b45e91527c47f68076)