Paddle

Commit Graph

Author	SHA1	Message	Date
zhupengyang	3997743a5b	add input type and dtype check, enhance shape error message for concat_op (#20101 ) * add input type and dtype check, enhance shape error message for concat_op test=develop * enhance shape check test=develop * improve coverage test=develop	5 years ago
zhupengyang	95524a4d30	fix APIs: relu, relu6, hash (#20416 ) * fix APIs: relu, relu6, hash test=develop test=document_fix * fix relu6 doc test=develop test=document_fix * fix API.spec test=develop test=document_fix * add description link for hash test=develop test=document_fix	5 years ago
JesseyXujin	843bdbaae1	add input type and dtype check for accuracy_op (#20399 ) * add input type and dtype check for accuracy_op * add input type and dtype check for accuracy_op * modify python error on accuracy_op,add test=develop * modify details on accuracy_op, test=develop * test float16, test=develop * add warning, test=develop	5 years ago
lijianshe02	211f5b0319	enhance mul_op input error message test=develop (#20414 ) * enhance mul_op input error message test=develop	5 years ago
GaoWei8	5ea2cc6733	fix API:cos, exp, ceil, elu, brelu English doc (#20032 ) * fix API:cos, exp, ceil, elu, brelu English doc test=develop test=document_fix	5 years ago
wopeizl	3044a62f2a	fix the precise roi poop op test=develop (#20126 ) * fix the precise roi poop op test=develop add roi backward implementation, fix the output-channel	5 years ago
Wilber	2893cd1ae0	modify english api (#20159 ) * modify english api test=develop test=document_fix - leaky_relu - less_than - log - logical_and - logical_or - logical_xor - logical_not	5 years ago
zhouwei25	b1218d056b	fix English Doc of API:layers.py_func/sum (#20329 ) * fix English Doc of API:layers.py_func/sum	5 years ago
qingqing01	63194d6e67	Enhance InferShape in deformable_conv and prior_box op (#20372 )	5 years ago
tangwei12	a010d883b4	doc fix, test=develop, test=document_fix (#20239 ) * doc fix, test=develop, test=document_fix	5 years ago
huzhiqiang	6a8e54047f	fix reorder_lod_tensor_by_rank doc en (#20256 ) fix reorder_lod_tensor_by_rank doc en	5 years ago
Yibing Liu	899ab30df0	Fix several api docs (#20282 ) * Fix several api docs test=develop, test=document_fix	5 years ago
wangchaochaohu	1288ac2983	fix expand bug (#20340 ) * fix expand bug test=develop * fix style test=develop * fix style test=develop * fix style test=develop * fix style test=develop	5 years ago
SunGaofeng	a73e1f68b4	fix document of 11 APIs (#20278 ) * modify document of 11 APIs test=develop test=document_fix * fix dtype to data type and description of name parameter	5 years ago
Pei Yang	057d782d51	fix en api doc of [round, sin, sqrt], test=develop, test=document_fix (#20296 )	5 years ago
Kaipeng Deng	3833b511a6	refine en API doc (#20206 ) * refine en doc. test=develop. test=document_fix	5 years ago
wangchaochaohu	bc6126dd07	fix the reduce bug test=develop (#20102 )	5 years ago
FDInSky	e2c7b6821a	test=develop enhance uniform_random op python api (#20295 )	5 years ago
danleifeng	3a0f93b3f9	fix error message for elementwise_add/mul (#20283 )	5 years ago
liym27	670937e11d	add input type and dtype check for reshape op. (#20099 ) enhance shape error messages for reshape op. test=develop	5 years ago
Zeng Jinle	48029ab06c	Remove some DefaultGradOpDescMaker (#20185 ) * remove fc_grad, test=develop * remove fsp op since no unittests, test=develop	5 years ago
Aurelius84	729f5846cc	enhance shape error message of fc API (#20172 ) * add api check in fc test=develop * enforce shape error info of sum op test=develop * fix spelling test=develop * print x_dims info test=develop * enhance shape error info test=develop	5 years ago
wangguanzhong	6fbf441001	enhance input check for roi_align, test=develop (#20238 )	5 years ago
Yibing Liu	d849e9835f	Add detailed error messages for nce layer (#20231 ) * Add detailed error messages for nce layer test=develop * Fix some problems test=develop * Fix unit test coverage test=develop	5 years ago
Double_V	98da70f63f	fix API en doc (#20261 ) * test=develop,test=document_preview, test=document_fix * test=develop,test=document_preview, test=document_fix * fix API.spec, test=develop,test=document_preview, test=document_fix	5 years ago
zhaoyuchen2018	5ebf4078dc	add input type and dtype check for squeeze (#20100 ) * Add input check and refine error message * Refine test case and comments test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
liuwei1031	e03c1d8a9e	fix conv_op compilation issue on windows (#20230 )	5 years ago
JesseyXujin	407efcf7b1	fix API doc, solve conflict, test=develop, test=document_fix (#20196 ) * fix APIs,test=develop,test=document_fix * fix conflict, test=develop, test=document_fix * fix confict, test=develop, test=document_fix * fix confict, test=develop, test=document_fix * fix API.spec, test=develop, test=document_fix * change fluid.layers.data to fluid.data,test=develop, test=document_fix * fix bug on example code, test=develop, test=document_fix * fix API.spec, test=develop, test=document_fix	5 years ago
liym27	ad60b3b8ac	mv two function in conv op for good code style (#20116 ) * Delete PadFuntion, include padding.h instead. test=develop * move function(IsSymmetricPadding) from conv_cudnn_op.cu/conv_transpose_cudnn_op.cu to padding.h, test=develop	5 years ago
liym27	869cef6dc0	fix bug of infer shape in pool op. test=develop (#20213 )	5 years ago
lvmengsi	59a7c222ea	refine en doc (#20088 ) * update en doc	5 years ago
Zeng Jinle	3eebd5b391	refine sequence_softmax grad maker, test=develop (#20127 )	5 years ago
Chengmo	eb05db7104	Speed GEO-SGD (#20158 ) * delete debug vlog & add rpc function & fix word2vec bug & speed GEO-SGD	5 years ago
Zhang Ting	cf6919bf6e	conv_transpose supports channel_last input, test=develop, test=document_preview (#20072 )	5 years ago
tangwei12	c9139c3db3	trainer from dataset fetch targets (#19760 ) add executor.FetchHandler for train/infer from the dataset	5 years ago
tangwei12	b5a410466c	Trainer heartbeat for async mode (#19600 ) Heartbeat for distributed async training.	5 years ago
lvmengsi	76ba55e891	add error log for python api and c++ (#20061 ) * add error log	5 years ago
Yibing Liu	01ad8d2e06	Refactor linear chain crf op & crf decoding op (#19982 ) * Update crf_decoding api & example test=develop * Update api spec test=develop * Fix linear chain crf api test=develop * Avoid sharing data pointer with input test=develop * Simplify the logic in linear_chain_crf_decoding * Add unittest for crf_decoding when label & path both are set test=develop * Update API spec test=develop * Add unittest for layers && correct infer_shape in chunk_eval test=develop	5 years ago
wangchaochaohu	6e73e90bfb	fix the error message for reduce_mean and reduce_sum op (#20063 ) * fix the error message for reduce_mean and reduce_sum op test=develop * fix typo test=develop * fix according review advice test=develop * fix the test test=develop * fix test=develop	5 years ago
wangchaochaohu	9a76f3f916	Fill constant error message fix (#20075 ) * fix the constant error message test=develop * fix typo test=develop * fix typo test=develop * fix code style test=develop * fix comment and bugs test=develop * fix the bug test=develop * fix and add unittest test=develop * fix the typo test=develop * add support for the fill_constant op test=develop * add test for ci coverage test=develop	5 years ago
zhaoyuchen2018	e867366805	Add multihead op for ernie opt (#19933 ) * Add multihead op for ernie opt test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine softmax test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine kernel. test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine cuda kernel test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine cuda version test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine cmake test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Chengmo	728ec1b43d	Add GEO-SGD distribute training algorithm (#20018 ) * refector geo sgd & communicator	5 years ago
Li Fuchen	5365cd2f14	Set lod level of sequence_unpad's output to 1 in compile time (#20068 ) * Set lod level of sequence_unpad's output to 1 in compile time	5 years ago
danleifeng	425279a57b	Improve elementwise operators performance in same dimensions. (#19763 ) Improve elementwise operators performance in same dimensions	5 years ago
liuwei1031	292aae4385	fix windows compilation issue when compile with VS2015, test=release/1.6 (#20114 )	5 years ago
Wilber	276b5e3440	fix compile paddle with anakin bug * fix compile with anakin bug * remove useless deps test=develop - 修复了联编anakin时，遇到的bug. - 编译test_anakin_activate 不通过 - 编译test_anakin_engine 不通过	5 years ago
silingtong123	649bcd5fe2	Modify the style of function names (#20071 )	5 years ago
liym27	3aa331d97e	fix conv2d and conv3d: (#20042 ) 1.support asymmetric padding; 2.support padding algorithm:"SAME" and "VALID"; 3.support channel_last: data_format NHWC and NDHWC; 4.change doc of python API and c++; test=develop, test=document_preview	5 years ago
chengjuntao	6f184775e8	Fix compling warning in deformable conv. (#20036 )	5 years ago
wangguanzhong	da892cafd5	Refine api doc (#20037 ) * refine doc, test=document_fix * add API.spec,test=develop,test=document_fix	5 years ago
silingtong123	f1eebf75aa	improve op uniform_random, argument shape support tensor and tensor in list (#19786 ) * test=develop, argument shape support tensor and tensor in list * test=develop,Increasing the coverage of CI tests * test=develop, modify the document and update API.spec * test=develop, modify the doc and update API.spec * test=develop, modify the doc and update API.spec * test=develop, modify the interface of UniformInitializer * test=develop, modify the interface of XavierInitializer and MSRAInitializer * test=develop, modify based on review's comments * test=develop, modify based on review's comments * test=develop, modify based on review's comments	5 years ago
liym27	24010472d4	fix pool2d pool3d,support asymmetric padding and channel_last (#19739 ) * fix pool2d pool3d: 1. support asymmetric padding; 2. support padding algorithm:"SAME" and "VALID"; 3. support channel_last: data_format NHWC and NDHWC; 4. support inferring shape when input with negative dims in compile time; 5. change doc of python API and c++; 6. fix bug in cuda kernel when Attr(adaptive) is true. test=develop,test=document_preview * fix 'tensors' to 'Tensors'. test=develop,test=document_preview * add test for converage ValueError.test=develop,test=document_preview * resolve conflict in test_pool2d. test=develop	5 years ago
Adam	fe581b0e8a	Minor GetMKLDNNFormat changes (#20055 ) test=develop	5 years ago
lvmengsi	c92348c3b9	fix conv_grad_grad (#20054 )	5 years ago
Kaipeng Deng	e7a6567be5	polish pool infer shape (#20038 ) * fix pool infershape. test=develop * fix unittest converage. test=develop * fix format. test=develop	5 years ago
chengduo	fb2a9cdf83	Add fp16 support for pad and split (#19881 ) * make pad and split support fp16 test=develop	5 years ago
lvmengsi	647ff784e2	fix mul double grad (#20040 )	5 years ago
tangwei12	8f0b3c0516	the integrated communicator (#19849 ) * add a base class for the Communicator * add AsyncCommunicator Impl for async distributed training	5 years ago
danleifeng	5cef7a2f25	Polish English docs of elementwise_add/sub/mul/div (#20027 ) Polish English docs of elementwise_add/sub/mul/div	5 years ago
Li Fuchen	c8e125872c	Fixed warpctc, test=develop (#20011 ) Use AllocateTmpTensor() for creating temporary tensors in warpctc.	5 years ago
wangchaochaohu	3409db950c	fix reduce bug test=develop (#19971 )	5 years ago
Adam	4b65af7719	MKLDNN BatchNorm operator refactor (#20012 ) test=develop	5 years ago
joanna.wozna.intel	1d32897c5c	Fix test pool2d int8 mkldnn (#19976 ) * Fix conv2d+dequantize squash for residual fusion test=develop * Correct int8 input test=develop * Add if exclude or include padding in pool2d mkldnn test=develop	5 years ago
Aurelius84	f58c8db668	Require x.dims=label.dims in huber_loss (#20017 ) * x.dims == y.dims test=develop * refine comment	5 years ago
Aurelius84	137e6336ef	Remove constraint that last dimension is forced to be 1 in rank_loss (#19997 ) * fix input shape check test=develop * move PADDLE_ENFORCE test=develop	5 years ago
chengduo	101a2b610a	Add dtype for coalesce_tensor_op (#20016 ) Add dtype for coalesce_tensor_op	5 years ago
Zhaolong Xing	f04f2b232a	fix if else error info (#19974 ) test=develop test=document_fix	5 years ago
gongweibao	a7512db2bc	Polish elementwise max min pow document to add more examples. (#19946 ) Polish elementwise max min pow document to add more examples	5 years ago
Aurelius84	2b5b4b3c5e	fix dataType in C++ comment in embedding op (#20004 )	5 years ago
Tao Luo	bcb2903e60	enhance shape error message of mul_op (#19998 ) test=develop	5 years ago
Chen Weihang	1409586eaa	Add LoD empty check for all related sequence ops (#19980 ) * add lod check for sequence op, test=develop * delete unnecessary check in expend op, test=develop	5 years ago
zhongpu	b1bb23841e	add kernel for fill_op, test=develop (#19719 ) * add kernel for fill_op, test=develop * modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop * add op test for fill_op, test=develop * REGISTER COP CUDA KERNEL, test=develop * update test_fill_op.py, test=develop * change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop * fix op test, test=develop * add head file, test=develop	5 years ago
wangchaochaohu	382d099dcb	add support tensor and tensorlist for strided_slice OP (#19929 ) * add support tensor and tensorlist for strided_slice OP test=develop * fix the commnet test=develop * fix test=develop * fix the bug test=develop * delete log test=develop * fix API.spec test=develop * fix test=develop	5 years ago
lvmengsi	619a241bd0	Fix OpTest of bn (#19062 ) * fix bn	5 years ago
Bob Zhu	c670058a8d	add support of matmul with multiple head even different width and height (#19708 ) * add support of matmul with multiple head even different width and height Original matmul with multiple head supports only the mat_a.width == mat_b.height, in that case, mat_b will be horizontally split. In this patch, we extend the support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height, in this case, mab_b will be vertically split. One example is A is [3, 8], B is [2, 16], head_number is 4. In this case, A will be split as [3, 2], B will be (vertically) split as [2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16] test=develop * add support of matmul with multiple head even different width and height Original matmul with multiple head supports only the mat_a.width == mat_b.height, in that case, mat_b will be horizontally split. In this patch, we extend the support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height, in this case, mab_b will be vertically split. One example is A is [3, 8], B is [2, 16], head_number is 4. In this case, A will be split as [3, 2], B will be (vertically) split as [2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16] test=develop * refactor the code of matmul with multiple head even different width and height test=develop	5 years ago
Liufang Sang	6884dc800a	refine ctc align op with padding (#19926 ) * refine ctc align op with padding * refine api sample code	5 years ago
Aurelius84	99a9615a4b	Removing length dims constraints of seq_pad and seq_unpad (#19497 ) * Removing last dims constraints of seq_pad and seq_unpad test=develop * fix test_layer api code test=develop * fix sequence_pad_op.cc conflict test=develop * remove test_analyzer_mm_dnn test=develop * fix vectorize bug test=develop * fix vectorize<int> test=develop	5 years ago
jhjiangcs	766bd529d1	add optimizer:dpsgd,test=develop (#19915 )	5 years ago
Yang Zhang	ebff68fa74	Add float16 support to `sync_batch_norm_op` (#19681 ) * Add float16 support to `sync_batch_norm_op` test=develop * Add test for sync_bn with FP16 input test=develop	5 years ago
Aurelius84	039b9710d5	Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735 ) * Remove constraint that last dimension is forced to be 1 by add lookup_table_v2 test=develop * modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop * Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop" This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9. * move api into fluid.embedding test=develop * fix example code test=develop * move one_hot into fluid.one_hot * modify api.spec test=develop * fix loss shape test=develop	5 years ago
xujiaqi01	cedc04775c	support change shuffle and train thread num (#19841 ) * support change shuffle thread num * support change train thread num * fix receive shuffle data of each channel * data norm stop gradient * add check thread_tensor type and root_tensor type when merge metric * remove sleep in shuffle, add config * add config of pslib client to client communication * fix xbox str * add data norm op testcase * add flush in trainer finalize	5 years ago
Kaipeng Deng	14625ffe9e	add elementwise mod support float/double. test=develop (#19570 )	5 years ago
Jacek Czaja	5b07ca9cdd	- ReImplemented pooling fwd mkldnn (#19911 ) - First implementation of BWD and FWD of pooling mkl-dnn - Compilation fix - Fix - Fix - Fix - Fix to crash - Compilation fix - Combined AcquireBacward with Fwd test=develop	5 years ago
Zeng Jinle	b1e83b33b0	fix huber loss op attr type, test=develop (#19937 )	5 years ago
Zeng Jinle	cc157d5990	add inplace to assign op, test=develop (#19927 )	5 years ago
Leo Chen	57606205f5	Make OpTest check grad inplace even if forward has no inplace (#19847 ) * make OpTest check grad inplace even if forward has no inplace, test=develop * do not run PE when enable_inplace is False, test=develop * add conv3d cuda kernel for float16 type, test=develop * refactor OpTest for inplace, test=develop * add comments, test=develop	5 years ago
Zhang Ting	cb8f3c03a7	resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914 )	5 years ago
Kaipeng Deng	3f021781a1	fix softmax CE time limit check failed (#19846 ) * fix softmax ce time limit check failed. test=develop * refine softmax calc. test=develop	5 years ago
石晓伟	30adea0a23	tensor_array_to_tensor_op.cc, test=develop (#19289 )	5 years ago
lvmengsi	4155e62559	add instance norm (#19500 ) * add instance norm op	6 years ago
Adam	cb65439da8	Add support for other axes in MKLDNN softmax op (#19907 ) * Initial, functional commit * Clean commit related files test=develop	6 years ago
Pei Yang	baccd7e2ca	Add TRT input shape check between model and runtime (#19864 ) * add TRT shape check, test=develop * model_input_shape == runtime_input_shape, refine message, test=develop	6 years ago
Aurelius84	fcf53e55ff	support 2-level lod of input in sequence_pool (#19839 ) * support 2-level lod of input in sequence_pool test=develop * fix lod level bug in .cu test=develop	6 years ago
Zhang Ting	93364b45c1	group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614 ) 1. group_norm support data_layout=NHWC 2. modified doc of group_norm	6 years ago
Jacek Czaja	619c797a7f	[MKL-DNN] LRN refactoring (#19798 ) - LRN mkl-dnn kernel refactor test=develop - compilation fix - Another compilation fix - Compilation fix - another compilation fix - compilation fix - Crash fix - optional LRN mkldnn workspace - Added mid allocation - Workaround for tests - Removed gradient from is_test ut - Removed mid for inference - Reverted LRN mid removal for is_test - PADDLE_ENFORCE adjusted - Rebase to templatization commit - Compilation fix - compilation fix test=develop - lint test=develop - Fix to crash - Rebase to recent codebase - lin - lint - compilation fix	6 years ago
Zhang Ting	439d95e157	modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287 ) modified interpolate_op to support tensor attribute 1. the parameter out_shape of image_resize、resize_nearest/bilinear/trilinear can be a list or a 1-D tensor variable. If a list, each element can be an integer or a tensor variable with shape: [1]. 2. the parameter scale of above Ops can be a 1-D tensor variable. modified document of image_resize, resize_nearest, resize_bilinear, resize_trilinear and add some code example.	6 years ago
Zhang Ting	b38889413d	add crop_tensor_op, test=develop, test=document_preview (#19314 ) add crop_tensor op. The main difference with crop is : 1. If the argument shape is a list, each element is an integer or a tensor variable with shape: [1]. This way is suitable for the case that the shape may be changed each iteration. 2. If the argument shape is a variable. Its rank must be 1. In crop op, the rank of shape must be the same as x offsets can be a list, in which each element is an integer or a tensor variavle with shape: [1].	6 years ago
lidanqing	2c32c2d649	Refactor conv computeINT8 (#19574 ) * fix conflicts test=develop * change mask_bias_reorder test=develop * add ComputeMask function to make code clear test=develop * change according to reviews test=develop * change according to reviews test=develop	6 years ago
Adam	c7e688921b	Add template functions for Acquire primitive/primitive_desc (#19867 ) * Add template functions for Acquire primitive/primitive_desc test=develop * Move acquire primitive descriptor to protected section test=develop	6 years ago
Aurelius84	b125e327aa	Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606 ) * Remove constraint that last dimension is forced to be 1 in cross_entropy test=develop * modify labels last dims test=develop	6 years ago
wopeizl	a7c440d303	add precise roi pooling op test=develop (#18960 ) * add precise roi pooling op test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * detail the description test=develop * test=develop * elaborate the doc for return type test=develop * test=develop	6 years ago
Yiqun Liu	3cd985a669	Add a pass to fuse fc+elementwise_add+layernorm (#19776 ) * Add fc_elementwise_layernorm_fuse pass and unittest. * Add fused_fc_elementwise_layernorm op and its GPU kernel. test=develop * Apply fc_elementwise_layernorm_fuse_pass to GPU inference. * Add the setting of attrs in the definition of binary_op. test=develop * Add comment. * Implement the unittest. test=develop * Change the unittest name of layer_norm. test=develop	6 years ago
wangchaochaohu	47af618f70	Strided slice (#19642 ) * strided_slice op basic function test=develop * test=develop rewrite and fix * fix bug test=develop * fix for the PADDLE_ENFORCE usage * add some unit testw * fix for the aip test and copright and fix test=develop * fix API.spec test=develop * fix API.spec test=develop * add axis parameter test=develop * fix for the build error test=develop * fix python api test=develop * fix the build test=develop * fix build test=develop * fix API spec test=develop * test=develop add some comment and single op test * fix API spece test=develop * fix test=develop * fix test=develop * fix api test=develop * fix api test=develop * fix API.spec test=develop * fix typo test=develop * fix API.spec test=develop * fix API typo test=develop * fix doc and API.spec test=develop	6 years ago
123malin	1bc285a53a	add retry function to try to solve grpc error code 14 (#19661 ) * rpc retry for asycsend/get/prefetch * test=develop, change retry vlog level to 3 * test=develop, set default grpc_retry_times is 3	6 years ago
LielinJiang	6d72a86b14	fix_roi_transform_bug (#19785 )	6 years ago
Zeng Jinle	3fd3b663a8	fix gc bug in controlflow ops, test=develop (#19827 )	6 years ago
Leo Chen	982e61f5ff	Update elementwise double grad to save gpu memory (#19509 ) * update elementwise double grad to save gpu memory, test=develop * update elementwise_mul/div_grad_grad to save memory, test=develop * remove eval function in eigen statement to save memory, test=develop * add unittest for elementwise_div_grad_grad without dout, test=develop * add unittest for elementwise_add_grad_grad without ddx, test=develop * add float16 cuda kernel for elementwise double grad op, test=develop	6 years ago
Adam	dfdd73cbc0	Add MKLDNNhandlerT templatized class (#19801 ) test=develop	6 years ago
Zeng Jinle	cabb9501bd	fix leaky_relu op when alpha is zero, test=develop (#19833 )	6 years ago
chengjuntao	00efd1d8a9	add deformable conv v1 op and cpu version of deformable conv v2 (#18500 ) * add deformable conv v1 op, test=develop	6 years ago
liym27	677e714425	fix pow op, support tensor for agument factor. (#19313 ) improve pow op according to reviews: 1. Delete unnecessary judgement statements in PowGradOpDescMaker; 2. Improve test of test_api; overload GetKernelTypeForVar add stop_gradient=True when attr(factor) is tensor Variable, change examples in API pow. test=develop,test=document_preview	6 years ago
liym27	bd89a27308	add tensor support for argument shape in reshape op; (#19268 ) add support parameter inference when argument shape is a list containing integer and tensor variable; test=develop fix reshape op according to reviews: 1. improve or message; 2. improve test of test_api. test=develop,test=document_preview fix reshape op: Add error message in nn.py, test=develop add stop_gradient=True when attr(shape) is tensor Variable. change examples in API reshape. test=develop,test=document_preview	6 years ago
liym27	88628016b2	add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208 ) add support parameter inference when arguments starts or ends is a list containing integer and tensor variable; test=develop,test=document_preview improve slice op according to review(from hongyu). test=develop fix slice op according to review: infer_flags, test=develop fix slice op: improve overload operator __getitem__ to support attrs(starts and ends) are Variable. test=develop,test=document_preview fix test_slice_op: add TestSliceOp_decs_dim_6 to resolve conflict with test_slice_ngraph_op. test=develop add stop_gradient=True when attr(starts) or attr(ends) is tensor Variable. test=develop,test=document_preview	6 years ago
liym27	e9e3c08777	fix expand op: (#19302 ) 1. add tensor support for argument expand_times in expand op; 2. add support parameter inference when argument expand_times is a list containing integer and tensor variable; improve expand op according to reviews: 1. add doc of ExpandTimes in expand_op.cc; 2. improve the test of test_api. add stop_gradient=True when attr(expand_times) is tensor Variable, change code examples. test=develop,test=document_preview	6 years ago
lvmengsi	b76343c3b7	cpu Conv double grad (#19672 ) * cpu conv_grad_grad	6 years ago
翟飞跃	93c85c930a	Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop * tmp add fused_emb_seq layer * Add the support of padding_idx attribute. test=develop * add padding_idx support test=develop * implement grad refer lego test=develop	6 years ago
Yiqun Liu	c67c8758cb	Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733 ) * Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop * Enhance fc_fuse_pass to enable fusing relu. * Allow print the shapes of var_desc in graph. test=develop * Enhance fc_fuse_pass_tester. * Remove the use of PADDLE_ENFORCE. test=develop * Correct the number of ops after fusing. test=develop * Fix a typo. test=develop * Set activation_type to null when there is no relu in fc. test=develop * Refine fc_fuse_pass's codes. * Enable the set of shape for tensor. * Refine repeated_fc_relu_pass and add unittest. test=develop	6 years ago
zhongpu	52673956de	add kernel for squeeze_op, test=develop (#19656 ) * add kernel for squeeze_op, test=develop * delete comment, test=develop	6 years ago
zhongpu	2a81c3679a	add kernel for unstack_op, test=develop (#19538 ) * add kernel for unstack_op, test=develop * add kernel for unstack_op, test=develop * add kernel for unstack_op, test=develop * adjust the code format, test=develop * modify some comment, test=develop	6 years ago
Kaipeng Deng	99c78b772a	fix softmax axis!=-1. test=develop (#19800 )	6 years ago
Adam	d4413a54bc	Add common CreateKey for mkldnn handlers (#19767 ) test=develop	6 years ago
Aurelius84	8c7e411908	Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716 ) * add one_hot_v2_op to remove last_dims==1 test=develop * add api unittest code for CI_Coverage test=develop * improve CI_Coverage rate by adding test_with_depth test=develop	6 years ago
Jacek Czaja	9e4c958552	Refactoring activation mkldnn op (#19748 ) test=develop - fix to BWD test=develop	6 years ago
Huihuang Zheng	12542320c5	Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989 ) TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation	6 years ago
Zeng Jinle	0daa5c9772	Make leaky relu inplacable (#19676 ) * make leaky relu inplacable, test=develop * force add unittests to pass coverage, test=develop	6 years ago
Zeng Jinle	078a678219	refine math_op_patch, test=develop (#19727 )	6 years ago
Jacek Czaja	47f670d58c	- Softmax mkl-dnn refactoring (#19615 ) test=develop - Cosmetic fixes test=develop	6 years ago
Yiqun Liu	a65c728e5d	Implement the GPU kernel of fc operator (#19687 ) * Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop	6 years ago
Aurelius84	22301115d0	Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562 ) * Remove constraint that last dimension is forced to be 1 in huber_loss test=develop * add y[rank-1] == 1 when x_rank=y_rank test=develop * modify into contain_unknown_dim test=develop	6 years ago
Tao Luo	ec9bc1bd9f	paddle::framework::vectorize() templatization (#19730 ) remove unused accuracy-diff warpctc-cudnn implementation test=develop	6 years ago
Adam	428b2b9e17	MKLDNN handler cleanup (#19713 ) * MKLDNN handler cleanup * MKLDNN handler cleanup test=develop	6 years ago
Zeng Jinle	1c25c88aba	refine memory usage of some operators, test=develop (#19700 )	6 years ago
wangguanzhong	25dcd74d34	merge empty lod tensor, test=develop (#19228 ) * merge_empty_lod_tensor, test=develop * fix multiclass_nms, test=develop * refine API.spec, test=develop * add unittest case for fetch, test=develop * add lod tensor test, test=develop * return index for multiclass_nms, test=develop * add api for multiclass_nms2 * update API.spc, test=develop * refine api doc, test=develop * fix test_detection.py, test=develop * polish code, test=develop * add more unittest case, test=develop	6 years ago
yaoxuefeng	c6756ed225	fix instag op (#19591 ) * fix instag op * fix instag bug: Some tiny logical error, occurring when ins_tag (2nd input) is multiple. test=develop	6 years ago
zhongpu	5f627488db	add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436 ) * add kernel for unsqueeze_op, test=develop * add kernel for unsqueeze_op, test=develop * add kernel for unsqueeze_op, test=develop	6 years ago
Tao Luo	f05d2c519d	paddle::framework::vectorize() templatization [PART3] (#19643 ) * paddle::framework::vectorize() templatization test=develop * update pybind/imperative.cc test=develop * revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc test=develop	6 years ago
hutuxian	1ca6ea0318	fix cmakelist deps (#19668 ) fix cmakelist deps: remove unnecessary deps and add proper op deps	6 years ago
Tao Luo	bcddbc78d4	remove -Wmaybe-uninitialized warning (#19653 ) * remove -Wmaybe-uninitialized warning test=develop * remove uninitialized op_handle_ in scale_loss_grad_op_handle.cc test=develop	6 years ago
wangchaochaohu	4440d7ced0	test=develop cuda realization of label smooth op (#19175 )	6 years ago
chengduo	31c5a5ee26	Remove linear_chain_crf_op.cu (#19645 ) test=develop	6 years ago
wangchaochaohu	ed8f44ea21	codegen for fused elementwise operation (#19520 ) * test=develop codegen for fused elementwise operation * fix test=develop	6 years ago
Chen Weihang	73daa3d6c0	Code Cleanup: delete three useless raw variables in Conv2D (#19644 ) * delete useless raw variables in Conv2D, test=develop * adjust the vars number in test_graph_wrapper to pass unittest, test=develop	6 years ago
123malin	2f037c3189	fix the diff between async mode and async_half mode (#19535 ) * test=develop, communicator merge add => merge average	6 years ago
tangwei12	f45cb1c2ca	fix bug of communicator flag, test=develop (#19635 )	6 years ago
Tao Luo	3ae939e48a	unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631 ) * remove assert.h * change PADDLE_ASSERT_MSG to PADDLE_ENFORCE test=develop * fix tensorrt paddle_enforce test=develop	6 years ago
Leo Chen	af692c9140	update reduce_sum and reduce_mean to save memory, test=develop (#19608 )	6 years ago
Zeng Jinle	710767d894	Enable inplace support for some ops (#19612 ) * enable inplace for affine_channel op, dropout op, test=develop * remove dropout inplace for ngraph fails, test=develop	6 years ago
Tao Luo	d6c85c96dc	paddle::framework::vectorize() templatization (#19627 ) test=develop	6 years ago
danleifeng	8672e15363	elementwise broadcast function enhancement (#19536 ) elementwise broadcast function enhancement	6 years ago
Chen Weihang	8cb54ede8c	Add user-friendly error message in optimizer ops to give a hint about the position sensitive problem of run(startup_program) (#19605 ) * add extra error message hint in optimizer ops * polish format & delete useless change, test=develop * extract init judue from shape compare, test=develop	6 years ago

1 2 3 4 5 ...

4764 Commits (a5fc291fe5861417f0648b9cbd061ea0212ce2d5)