Paddle

Commit Graph

Author	SHA1	Message	Date
Michał Gallus	ed9ceb9f98	Refactor MKL-DNN ElementwiseMul (#21061 ) * Refactor MKL-DNN ElementwiseMul remove manual fallback, remove format attrs test=develop * Refine PADDLE_ENFORCEs in eltwise_mul_op.h test=develop * Make ElementwiseMulOp inherit from ElementwiseOp * Change type of simd_width to int test=develop * Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp test=develop * Restore attributes test=develop * Fix test coverage for mkldnn eltwise mul test=develop * Conform to new is_run_common_broadcast API test=develop * Add UT for AreDimsAndFormatCorrect test=develop	5 years ago
zhouwei25	345b67b5e2	remove warning LNK4006 and warning LNK4221 (#21226 )	5 years ago
wangchaochaohu	6514f52e46	fix the fill_constant op precious problem (#21322 ) * fix the fill_constant op precious problem test=develop	5 years ago
zhaoyuchen2018	08c19c585d	Improve argsort performance. (#21267 ) * Improve argsort performance. - Give 200000 data to compute argsort on v100, can speed up ~190x before opt cost: 0.53s after opt cost:0.0027s - Add fp16 support * Refine error message * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
WangXi	8ac7687e36	Fix dgc accuracy by mv regularization to local (#21278 )	5 years ago
Leo Zhao	b19e1a1b56	use prefetch to load next mem into cache (#21206 ) * use prefetch to load next mem into cache test=develop * remove hard code memcpy om pyramid_hash_ff test=develop	5 years ago
gongweibao	ed2a185248	optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597 )	5 years ago
Yihua Xu	69dd5152cf	Fix the crash issue when scale or bias was null-pointer. (#21284 ) * Fix the crash issue when scale or bias was null-pointer. test=develop * Add the error message for passing CI. test=develop	5 years ago
Zhang Ting	698b8b73ad	optimize lod_reset op to avoid data transform	5 years ago
Liufang Sang	f0b1518438	add dequantize_abs_max op and modify lookup_table op (#20899 ) * add int8 kernel to lookup_table op and add dequantize op test=develop * change paddle_enforce to paddle_enforce_eq test=develop * change copyright and change some not suitable code test=develop * remove debug log test=develop * replace GetInputType with IndicateVarDataType test=develop * fix EmptyGradMaker test=develop * fix diff between cpu and gpu test=develop * use memcopy when int8_t test=develop	5 years ago
hutuxian	a6ce2306f9	support cvm_op run in gpu (#21300 ) Previously, CVM OP was only able to run in CPU. This PR implements its GPU kernel. What's more, we improve the UTs about CVM OP.	5 years ago
Yihua Xu	b085ecc258	Avoid the string as the key of map to improve the jit performance (#21292 ) * Avoid the string as the key of map to improve the jit performance. test=develop * Use map to replace unordered_map. test=develop	5 years ago
zhongpu	c4ede95c74	open dygraph op test, test=develop (#19787 ) * open dygraph op test, test=develop * modify to_variable, test=develop * modify input and output for dygraph, test=develop * modify input and output for dygraph(fix bug), test=develop * fix input processing of dygraph op test, test=develop * fix bug, test=develop * fix op test, test=develop * fix forward bug for dygraph, test=develop * fix mkldnn op test for forward, test=develop * update nn.py for dygraph, test=develop * fix crop_tensor_op, test=develop * fix elementwise_mul_op, test=develop * fix fill_op, test=develop * fix some mkldnn op, test=develop * open backward op test for dygraph, test=develop * delete log, test=develop * close backward op test for dygraph, test=develop * fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop * fix optest backward bug for dygraph, test=develop * fix optest backward bug for dygraph, test=develop * close backward op test for dygraph, test=develop * close backward op test for dygraph, test=develop * open dygraph op test, test=develop * fix op test for dygraph, fix GradOpDescMaker, test=develop * fix bug for linear_chain_crf_op.h, test=develop * remove log, test=develop * remove log, test=develop * remove log for op_test.py, test=develop * remove log for op_test.py, test=develop * fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop * fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop * fix bug for test_increment_ngraph_op.py, test=develop * fix lod for op test in dygraph, test=develop * refactor op_test.py to reduce redundant code, test=develop * fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop * remove debug log, test=develop * remove redundant code in base.py, test=develop * fix some error in optest, test=develop * fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop * refactor op_test.py, test=develop * remove redundant writing, test=develop * fix error(get tensor of the grad variable), test=develop * fix test_concat_mkldnn test_conv2d_mkldnn, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix some redundant code, test=develop * reslove conflict and rewrite paddle error message, test=develop	5 years ago
danleifeng	6fc3e8ec84	edit elementwise_mul doublegrad inplace (#21245 )	5 years ago
zhaoyuchen2018	3ff5cc2d5e	Fix topk compile failed on windows (#21243 ) * Fix topk compile failed on windows * Use explicit cast for assign data	5 years ago
Zhang Ting	01a9646323	optimize assign op to avoid copy data from GPU to GPU (#21181 ) * optimize assign op to avoid copy data from GPU to GPU, test=develop * modified GetkernelTypeForVar and just avoid device transform, test=develop	5 years ago
danleifeng	0e7baabe59	extend elementwise broadcast function (#20957 )	5 years ago
Adam	d623e863c9	Fix GELU grad error (#21204 ) test=develop	5 years ago
yaoxuefeng	b5d8ba8394	fix data_norm op to avoid impractical normalization result test=develop (#21152 ) * fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop	5 years ago
Zhang Ting	9cbe7bccba	modified error message and API doc for channel_last supported Op (#21002 ) * modified error message for conv and conv_transpose, test=develop * modified doc of conv and conv_transpose op, test=develop * modified the expression for error message, test=develop * modified error message for group_norm op, test=develop * modified detail of Attr(data_format) or Attr(data_layout) * add ValueError in API doc for maxout op, test=develop	5 years ago
guofei	56b5d14704	Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118 )	5 years ago
WangXi	3c98ec90ce	Fix INF bug of softmax_cross_entropy_op (#21165 )	5 years ago
Yihua Xu	eec9c9cbe7	Fix jit tls issue (#21151 )	5 years ago
ruri	aeb887911f	Refine edit distance cn (#21121 )	5 years ago
Kaipeng Deng	98b59cb82c	fix elementwise_mod float point kernel. test=develop (#21183 )	5 years ago
whs	cfdd1fc2cd	Fix warpctc in padding mode. (#21033 )	5 years ago
Chen Weihang	8da0cd537a	Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134 ) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_*, test=develop add more already exists examples, test=develop	5 years ago
zhaoyuchen2018	b93870e696	Improve topk performance. (#21087 ) * Improve topk performance. give 200000 data to compute topk, before opt: cost 1s after opt: cost 0.0028s. * Refine return value. * Add cuda util funtions. * Fix ComputeBlockSize bug & refine comments. Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Chen Weihang	8414575b78	Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137 ) * add examples for error spec, test=develop * change ENFORCE to ENFORCE_**, test=develop	5 years ago
Chen Weihang	7e5f74b825	Add examples for error message writing specification - InvalidArgument (#21132 ) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_*, test=develop fix error, test=develop	5 years ago
zhaoyuchen2018	4a544762a2	Add Asypadding for conv fusion. (#21041 ) * Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.	5 years ago
WangXi	de5d3ff688	Fix dgc buffer illegal & reuse velocity (#21012 )	5 years ago
ceci3	f62a929151	fix instance norm (#21042 ) * fix instance norm * update unitest,test=develop	5 years ago
lilong12	e249d9a3e2	fix the computation for dx (grad for x) for prelu operation. (#20949 ) * set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop	5 years ago
Zhang Ting	e0285eae64	add check for input channels and Attr(groups), test=develop (#21095 )	5 years ago
Yiqun Liu	35f17ae28f	Add the check of lod_level between compile-time and runtime. (#20961 ) * Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensiable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop	5 years ago
Chen Weihang	826254f664	Add pre-condition check for fuse optimizer op pass (#21005 ) * add pre condition check for fuse optimizer op pass, test=develop * add log & set init to zero, test=develop * fix test_fuse_all_reduce_pass failed, test=develop * polish details, test=develop * refine PADDLE_ENFORCE & remove needless VLOG, test=develop * refactor op check method, test=develop	5 years ago
Aurelius84	1cd6721873	Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099 )	5 years ago
joanna.wozna.intel	77c2083586	Add transpose2 INT8 for mkl-dnn (#19424 ) * Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdba4c859abb945e062ab13124f70508054, reversing changes made to 2ce6473f144da298aba4a43d46918f27d463cf7c. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd78ca47ae56881161172b2aacd349aba90. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop	5 years ago
LielinJiang	06063b7001	add op locality_aware_nms, test=develop (#20976 )	5 years ago
wangchaochaohu	fc385777e4	fix the compile cost long time test=develop (#21064 )	5 years ago
Chen Weihang	2f27b10331	Add dependency for error_codes.proto (#21084 ) * fix activation_functions deps, test=develop, test=document_fix * add error_codes_proto deps, test=develop, test=document_fix * try delete enforce.h, test=develop, test=document_fix	5 years ago
wangchaochaohu	149a1e3124	Expand refine (#21063 ) * fix the expand op compile time cost long time test=develop * add tag for just copy test=develop	5 years ago
Wojciech Uss	af3ff422cc	Fix dst memory allocation in elementwise_add (#21059 ) test=develop	5 years ago
liym27	26a6e27afe	fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997 ) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop	5 years ago
Chen Weihang	7ee25189c3	Enrich the type of error and declare the error type interfaces (#21024 ) * Enrich the type of error and declare the error type interfaces, test=develop * adjust tests to adapt new form, test=develop * add inference deps with error_codes.pb.h, test=develop * restore stack iter start pos, test=develop * polish code based review comments, test=develop	5 years ago
Adam	3fda695bb0	Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062 ) * Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop	5 years ago
Huihuang Zheng	1957192f05	Add select_input_op and select_output_op (#21016 ) These ops are useful in control flow.	5 years ago
Liufang Sang	e5e699ecc0	set lod level for compile time test=develop (#21022 )	5 years ago
liym27	f0e95a6049	Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017 )	5 years ago
zhaoyuchen2018	0059404e77	Fix ce ocr_recognition test fails (#20987 ) ocr_recognition fails, so add a path to handle small frame_size. test=develop	5 years ago
Chengmo	bc8e600ce5	Fix rpc not wait in GEO communicator (#20967 ) * test=develop,fix rpc not wait in geo	5 years ago
Tao Luo	25ffa8445d	refine murmurhash3_x64_128 for bloom_filter (#20996 ) test=develop	5 years ago
Zeng Jinle	878a40f57d	Support NoNeedBufferVarsInference in dygraph backward (#20868 ) * support no need buffer vars in dygraph, test=develop * fix inference compilation error, test=develop * update no_need_buffer_vars_inference, test=develop * add unittests for no_need_buffer_vars_context, test=develop * refine no_need_buffer_vars by return ref, test=develop * polish some codes, test=develop	5 years ago
wangchaochaohu	bf379fef96	refine code for code reuse test=develop (#20988 )	5 years ago
Zhang Ting	de9bec607e	lrn supports channel_last input, test=develop (#20954 )	5 years ago
Liufang Sang	9b666cae67	fix diff in dequantize op between cpu and gpu test=develop (#20953 )	5 years ago
Zhang Ting	f4f85831d3	fix the bug of conv_transpose cudnn kernel, test=develop (#20958 ) fix the bug of conv_transpose cudnn kernel: before version 1.6, the data_format is AnyLayout in inference model. When use version 1.6 and load the model which is saved by previous version, the error occurs. This is because the cudnn kernel in version 1.6 is not compitable with Anylayout setting.	5 years ago
zhaoyuchen2018	7f3a445e9a	Fix gru as small frame_size has error. (#20922 ) seems shuffle_sync cannot handle small size test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
123malin	20cdff0e02	Optimize decay (#20816 ) * update pserver decay blocks * update distributed notify handler	5 years ago
Chengmo	16596f6498	Fix Paddle Cloud role maker (#20860 ) * fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times	5 years ago
liym27	59de8e1214	Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912 )	5 years ago
Zhang Ting	8d1e9f0f7e	maxout supports channel_last input (#20846 ) * maxout support channel_last input, test=develop * modified details of Input(X) and Attr(groups, axis) in doc, test=develop	5 years ago
Yihua Xu	b6260f3866	Optimize the kernel implementation of layernorm with openmp (#20895 )	5 years ago
hong	8c4573a3cb	GradMaker for dygraph (#19706 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop	5 years ago
Chen Weihang	768551b25d	Add parameter init check add run_startup_progrom error message for fc(mul) (#20906 )	5 years ago
Zhang Ting	c18f1bd716	fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897 )	5 years ago
Wilber	b489760099	fix jit_matmul bug test=develop (#20886 ) * fix jit_matmul bug * update jit matmul and add test	5 years ago
Yiqun Liu	03ba0fdae6	Move the codes of fused operators to operators/fused directory. (#20881 ) * Move the codes of fused operators to operators/fused directory. test=develop * Correct the op name in cmake. * Change the use of PADDLE_ENFORCE. test=develop	5 years ago
zhang wenhui	d428912503	fix select_rows mergeadd bug, test=develop (#20876 )	5 years ago
liym27	6802539a2e	support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780 ) * improve split and concat op: 1. support Tensor for argument 'dim' in split op. 2. support Tensor for argument 'axis' in concat op. test=develop * redefine function GetDataFromTensor and set unknown output shape to - 1. test=develop * add check: Attr(sections) match Input(X). test=develop * support Tensor for attr(sections) and attr(sections) can contain -1. add check for attr(sections). test=develop * modify error message for concat and call Resize only when necessary. test=develop	5 years ago
wangchaochaohu	28ca2e5ffa	strided_slice perforamnce improvement test=develop (#20852 )	5 years ago
Yiqun Liu	6fcfd32e6c	Check and correct the output's lod_level in DynamicRNN related operators (#19144 ) * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime. test=develop * Add comment for ReorderLoDTensorByRank op. * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time. test=develop * ShrinkRNNMemory op should call ShareLoD for compile time. test=develop * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool. test=develop * Refine the unittest of DynamicRNN. test=develop * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE. test=develop	5 years ago
liym27	84d221b667	improve unsqueeze op to support int, Tensor for argument axes (#20824 ) * improve unsqueeze op to support int, Tensor and Tensor list for argument axes. test=develop * call Resize only when necessary. test=develop	5 years ago
silingtong123	03d7f3ddb2	Make shape tensor support int32 (#20757 ) * Make shape tensor support int32	5 years ago
Huihuang Zheng	95ba4bd2ab	Add shape and type check at read_op (#20754 )	5 years ago
Aurelius84	aacd16dbb4	add pyramid_hash_op (#20698 )	5 years ago
Chen Weihang	8b59ac3ad0	delete paddle infershape enforce marco (#20832 )	5 years ago
whs	c8e49be2f1	Fix roi_perspective_transform op (#20764 )	5 years ago
Chen Weihang	26cc1fe508	Replace risky GetInputType method with secure IndicateVarDataType interface (#20668 ) * replace part of the old implementation, test=develop * restore concat op, test=develop * update all ops implemention & delete GetDataTypeOfVar func, test=develop	5 years ago
Yamei-Lee	cf717fd6dd	fix bug in reshape: (#20781 ) consider the situation that shape of input can contain more than one -1. test=develop	5 years ago
Zhang Ting	5a8d885d72	All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756 ) * All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview * fix the bug that attr(offsets) should be initialized, test=develop	5 years ago
danleifeng	9171f73714	fix fp16 grid_size for size=1; test=develop (#20812 )	5 years ago
Tao Luo	efbdad0596	make search_compute support avx default (#20779 ) * make search_compute support avx only * clean search_compute.h * rename sse_axpy to avx_axpy test=develop * update CMakeLists.txt test=develop	5 years ago
WangXi	250e72d254	Fix DGC algorithm flow to make it the same as paper (#20758 )	5 years ago
zhaoyuchen2018	6e6eab07e8	Fix multihead op bug. (#20783 ) The op should handle k=1024 test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
lvmengsi	dfa0549f87	Revert "fix_depthwise_conv_cudnn, test=develop (#20712 )" (#20782 ) This reverts commit `dc229b4195`.	5 years ago
whs	4c7d196d83	Add norm_by_time for warpctc op in padding mode. (#17580 )	5 years ago
Pei Yang	e89c16b90d	Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733 ) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop	5 years ago
石晓伟	37cd43545a	update the infer shape of matmul, test=develop (#20717 ) * update the infer shape of matmul, test=release/1.6 * add unittests of matmul, test=release/1.6 * change func names, test=develop	5 years ago
Adam	67b59ddb38	Minor MKL-DNN conv int8 performance fixes (#20753 ) test=develop	5 years ago
wangchaochaohu	0687bcd64f	Refine getitem of Variable (#20729 ) * add support for __get_item__ of Variable test=develop	5 years ago
danleifeng	79e08ecebf	add assertions on whether elementwise_div divison is zero (#20618 )	5 years ago
123malin	95e90aa102	test=develop, add communicator_is_sgd_optimizer flag (#20677 ) * test=develop, communicator_is_sgd_optimizer flags	5 years ago
Aurelius84	74a28f5ea4	fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722 )	5 years ago
lvmengsi	dc229b4195	fix_depthwise_conv_cudnn, test=develop (#20712 )	5 years ago
gongweibao	c1710e91b2	Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690 )	5 years ago
lidanqing	46e93f7c86	Revert "Refactor conv computeINT8" (#20640 ) * Revert "Refactor conv computeINT8 (#19574)" This reverts commit `2c32c2d649`. test=develop * replace PADDLE_ENFORCE test=develop	5 years ago
Zeng Jinle	ab575de725	Fix op run log when memory optimization strategy is enabled (#20695 )	5 years ago
Jacek Czaja	a1cd27f13f	[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241 ) * - Flushing mkl-dnn cache test=develop - Disabled clearing cache for LoadModel - Added clearing of mkl-dnn cache when Executor is created test=develop - Do not clear for GPU places test=develop - compilation fix test=develop * - Moved clearing of mkl-dnn cache in destructor of executor test=develop * - Compilation fix test=develop - Reverted conditional clearing of mkl-dnn cache in Executors's destructor test=develop - compilation fix	5 years ago
Zeng Jinle	10505faf4e	polish codes, test=develop (#20672 )	5 years ago
Zeng Jinle	34e3adaece	Refine reduce codes to save compiling time and binary size (#20676 ) * refine reduce code to save compiling time and binary sizes, test=develop * add reduce rank check to avoid bug, test=develop	5 years ago
whs	a3e641e93c	Fix infer shape of warpctc op. (#20653 ) test=develop	5 years ago
Zeng Jinle	4922eb6da5	make_conv_workspace_size_configurable, test=develop (#20662 )	5 years ago
zhongpu	efa10937bd	fix elementwise_floordiv_op and elementwise_mod_op (#20534 ) * fix elementwise_floordiv_op and elementwise_mod_op, test=develop * fix API.spec, test=develop * fix API.spec, test=develop	5 years ago
tangwei12	04384502a8	fix bug with heart beat , test=develop (#20654 )	5 years ago
wangchaochaohu	7783d3bd43	Conv refine (#20644 ) * add condition judgement for performance improvement test=develop * add condition judgement for performance improvement test=develop * refine code style test=develop	5 years ago
Chen Weihang	003f369bb2	Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044 ) * add indicate_var_data_type inferface, test=develop * add unittests & polish error message, test=develop * remove needless include, test=develop * extract public function & polish message, test=develop * delete empty var check, test=develop * change data_type to pointer parameter, test=develop * polish details, test=develop	5 years ago
gongweibao	f3f52fc1e2	Retry when failed to bind address. (#20642 )	5 years ago
qingqing01	01eddc1a04	Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636 ) * Support fp16 in fused_elemwise_activation_op. * Fix unit testing in ONLY-CPU mode.	5 years ago
Chengmo	940c6ff1c8	Fix communicator slow bug & fix communicator stop bug (#20366 ) * test=develop,Fix communicator slow bug * test=develop, delete if() in stop_worker() * test=develop * fix UT, test=develop * fix bug in fetch handler, test=develop * fix bug in fetch handler, test=develop * test=develop, fix fetch barrier bug * test=develop, bug fix * test=develop, bug fix * test=develop, fix bug	5 years ago
zhaoyuchen2018	8314e64a8b	Fix sum op fails as no memory in tensor(#20602 ) test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Yibing Liu	ee2869cae9	Remove redundant infershape in linear chain crf grad, test=develop (#20629 )	5 years ago
123malin	b4a3b75002	bug fix: invalid learning rate decay in pserver async mode (#20325 ) * bug fix: invalid learning rate decay in pserver async mode	5 years ago
石晓伟	a4753f3a79	Optimize error message of mean_op and matmul_op (#20413 ) * add data type check, test=develop * polish error messages, test=develop * polish error messages, test=develop * Remove support for the CPU architecture matmul, test=develop * fix syntax bug, test=develop	5 years ago
Leo Chen	d6c1d6ca56	update class name, test=develop (#20578 )	5 years ago
Double_V	0b39218749	memory optimizer for reshape op,test=develop (#20569 )	5 years ago
chengduo	36c85ef492	Add sub-scope check in RecurrentOp (#20468 ) * fix recurrent bug test=develop	5 years ago
JesseyXujin	2ff18e537f	add expand_as op, test=develop (#20565 ) * add expand_as op, test=develop * add expand_as op,test=develop * add expand_as op,test=develop * add nn.py, test=develop * delele paddle_enforce, test=develop	5 years ago
Zeng Jinle	40effc61af	Refine py_reader exit (#20331 ) * refine py_reader exit, test=develop * fix multiprocess_reader exception unittest, test=develop * increase code coverage for legacy fluid.layers.py_reader, test=develop	5 years ago
Zhang Ting	78910480c1	fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589 )	5 years ago
Yuan Shuai	172e91c008	Refine error message of transpose_op (#20437 ) * Refine error message of transpose. * Fix transpose, multiplex, unsqueeze, unstack. test=develop, test=document_preview, test=document_fix	5 years ago
liym27	fc6ec3b9f6	fill_constant support Tensor; (#20521 ) 2. fix bug in backward.py: using fill_constant instead of fill_constant_batch_size_like 3. fix bug in ExpandGradOp. test=develop	5 years ago
Zhang Ting	0130cc969c	fixed group_norm's bug and modified unittest (#20506 ) * modified group_norm's unittest for pass statement, test=develop * fix group_norm's bug: scale or bias is None which causes segmentation fault, test=develop	5 years ago
zhaoyuchen2018	8fb569e5b9	Fix api doc example bug and polish square doc (#20491 ) * Refine create_array api en doc test=develop test=document_fix * Fix api doc example bug and polish square test=develop test=document_fix * Refine comment test=develop test=document_fix * refine API.spec test=develop test=document_fix	5 years ago
Guo Sheng	dfd1eee7f7	Add seq2seq api related code (#19820 )	5 years ago
lvmengsi	2384589383	Fix conv_grad_grad (#20469 ) * fix_conv_grad_grad * fix_bug, test=develop	5 years ago
Double_V	8299203370	Support reshape_op double gradient (#20304 ) * support reshape doubel grad, test=develop * fix reshape double grad, pass converage, test=develop * fix review, test=develop	5 years ago
hong19860320	4d0d5e4cc7	refine eng doc for hard_sigmoid op (#20442 ) * refine eng doc for hard_sigmoid op test=develop test=document_fix * refine the description of hard_sigmoid test=develop test=document_fix * update API.spec test=document_fix * Refine the decription of parameters of HardSigmoid op test=develop, test=document_fix * Update API.spec for hard_sigmoid op test=develop, test=document_fix	5 years ago
Aurelius84	22823df2e2	enhance embedding error message test=develop (#20246 ) * enhance embedding error message test=develop * enforce .h error test=develop * fix unittest code test=develop * Fix fp16 dtype in embedding test=develop * add import warnings test=develop	5 years ago
zhupengyang	3997743a5b	add input type and dtype check, enhance shape error message for concat_op (#20101 ) * add input type and dtype check, enhance shape error message for concat_op test=develop * enhance shape check test=develop * improve coverage test=develop	5 years ago
zhupengyang	95524a4d30	fix APIs: relu, relu6, hash (#20416 ) * fix APIs: relu, relu6, hash test=develop test=document_fix * fix relu6 doc test=develop test=document_fix * fix API.spec test=develop test=document_fix * add description link for hash test=develop test=document_fix	5 years ago
JesseyXujin	843bdbaae1	add input type and dtype check for accuracy_op (#20399 ) * add input type and dtype check for accuracy_op * add input type and dtype check for accuracy_op * modify python error on accuracy_op,add test=develop * modify details on accuracy_op, test=develop * test float16, test=develop * add warning, test=develop	5 years ago
lijianshe02	211f5b0319	enhance mul_op input error message test=develop (#20414 ) * enhance mul_op input error message test=develop	5 years ago
GaoWei8	5ea2cc6733	fix API:cos, exp, ceil, elu, brelu English doc (#20032 ) * fix API:cos, exp, ceil, elu, brelu English doc test=develop test=document_fix	5 years ago
wopeizl	3044a62f2a	fix the precise roi poop op test=develop (#20126 ) * fix the precise roi poop op test=develop add roi backward implementation, fix the output-channel	5 years ago
Wilber	2893cd1ae0	modify english api (#20159 ) * modify english api test=develop test=document_fix - leaky_relu - less_than - log - logical_and - logical_or - logical_xor - logical_not	5 years ago
zhouwei25	b1218d056b	fix English Doc of API:layers.py_func/sum (#20329 ) * fix English Doc of API:layers.py_func/sum	5 years ago
qingqing01	63194d6e67	Enhance InferShape in deformable_conv and prior_box op (#20372 )	5 years ago
tangwei12	a010d883b4	doc fix, test=develop, test=document_fix (#20239 ) * doc fix, test=develop, test=document_fix	5 years ago
huzhiqiang	6a8e54047f	fix reorder_lod_tensor_by_rank doc en (#20256 ) fix reorder_lod_tensor_by_rank doc en	5 years ago
Yibing Liu	899ab30df0	Fix several api docs (#20282 ) * Fix several api docs test=develop, test=document_fix	5 years ago
wangchaochaohu	1288ac2983	fix expand bug (#20340 ) * fix expand bug test=develop * fix style test=develop * fix style test=develop * fix style test=develop * fix style test=develop	5 years ago
SunGaofeng	a73e1f68b4	fix document of 11 APIs (#20278 ) * modify document of 11 APIs test=develop test=document_fix * fix dtype to data type and description of name parameter	5 years ago
Pei Yang	057d782d51	fix en api doc of [round, sin, sqrt], test=develop, test=document_fix (#20296 )	5 years ago
Kaipeng Deng	3833b511a6	refine en API doc (#20206 ) * refine en doc. test=develop. test=document_fix	5 years ago
wangchaochaohu	bc6126dd07	fix the reduce bug test=develop (#20102 )	5 years ago
FDInSky	e2c7b6821a	test=develop enhance uniform_random op python api (#20295 )	5 years ago
danleifeng	3a0f93b3f9	fix error message for elementwise_add/mul (#20283 )	5 years ago
liym27	670937e11d	add input type and dtype check for reshape op. (#20099 ) enhance shape error messages for reshape op. test=develop	5 years ago
Zeng Jinle	48029ab06c	Remove some DefaultGradOpDescMaker (#20185 ) * remove fc_grad, test=develop * remove fsp op since no unittests, test=develop	5 years ago
Aurelius84	729f5846cc	enhance shape error message of fc API (#20172 ) * add api check in fc test=develop * enforce shape error info of sum op test=develop * fix spelling test=develop * print x_dims info test=develop * enhance shape error info test=develop	5 years ago
wangguanzhong	6fbf441001	enhance input check for roi_align, test=develop (#20238 )	5 years ago
Yibing Liu	d849e9835f	Add detailed error messages for nce layer (#20231 ) * Add detailed error messages for nce layer test=develop * Fix some problems test=develop * Fix unit test coverage test=develop	5 years ago
Double_V	98da70f63f	fix API en doc (#20261 ) * test=develop,test=document_preview, test=document_fix * test=develop,test=document_preview, test=document_fix * fix API.spec, test=develop,test=document_preview, test=document_fix	5 years ago
zhaoyuchen2018	5ebf4078dc	add input type and dtype check for squeeze (#20100 ) * Add input check and refine error message * Refine test case and comments test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
liuwei1031	e03c1d8a9e	fix conv_op compilation issue on windows (#20230 )	5 years ago
JesseyXujin	407efcf7b1	fix API doc, solve conflict, test=develop, test=document_fix (#20196 ) * fix APIs,test=develop,test=document_fix * fix conflict, test=develop, test=document_fix * fix confict, test=develop, test=document_fix * fix confict, test=develop, test=document_fix * fix API.spec, test=develop, test=document_fix * change fluid.layers.data to fluid.data,test=develop, test=document_fix * fix bug on example code, test=develop, test=document_fix * fix API.spec, test=develop, test=document_fix	5 years ago
liym27	ad60b3b8ac	mv two function in conv op for good code style (#20116 ) * Delete PadFuntion, include padding.h instead. test=develop * move function(IsSymmetricPadding) from conv_cudnn_op.cu/conv_transpose_cudnn_op.cu to padding.h, test=develop	5 years ago
liym27	869cef6dc0	fix bug of infer shape in pool op. test=develop (#20213 )	5 years ago
lvmengsi	59a7c222ea	refine en doc (#20088 ) * update en doc	5 years ago
Zeng Jinle	3eebd5b391	refine sequence_softmax grad maker, test=develop (#20127 )	5 years ago
Chengmo	eb05db7104	Speed GEO-SGD (#20158 ) * delete debug vlog & add rpc function & fix word2vec bug & speed GEO-SGD	5 years ago
Zhang Ting	cf6919bf6e	conv_transpose supports channel_last input, test=develop, test=document_preview (#20072 )	5 years ago
tangwei12	c9139c3db3	trainer from dataset fetch targets (#19760 ) add executor.FetchHandler for train/infer from the dataset	5 years ago
tangwei12	b5a410466c	Trainer heartbeat for async mode (#19600 ) Heartbeat for distributed async training.	5 years ago
lvmengsi	76ba55e891	add error log for python api and c++ (#20061 ) * add error log	5 years ago
Yibing Liu	01ad8d2e06	Refactor linear chain crf op & crf decoding op (#19982 ) * Update crf_decoding api & example test=develop * Update api spec test=develop * Fix linear chain crf api test=develop * Avoid sharing data pointer with input test=develop * Simplify the logic in linear_chain_crf_decoding * Add unittest for crf_decoding when label & path both are set test=develop * Update API spec test=develop * Add unittest for layers && correct infer_shape in chunk_eval test=develop	5 years ago
wangchaochaohu	6e73e90bfb	fix the error message for reduce_mean and reduce_sum op (#20063 ) * fix the error message for reduce_mean and reduce_sum op test=develop * fix typo test=develop * fix according review advice test=develop * fix the test test=develop * fix test=develop	5 years ago
wangchaochaohu	9a76f3f916	Fill constant error message fix (#20075 ) * fix the constant error message test=develop * fix typo test=develop * fix typo test=develop * fix code style test=develop * fix comment and bugs test=develop * fix the bug test=develop * fix and add unittest test=develop * fix the typo test=develop * add support for the fill_constant op test=develop * add test for ci coverage test=develop	5 years ago
zhaoyuchen2018	e867366805	Add multihead op for ernie opt (#19933 ) * Add multihead op for ernie opt test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine softmax test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine kernel. test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine cuda kernel test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine cuda version test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine cmake test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Chengmo	728ec1b43d	Add GEO-SGD distribute training algorithm (#20018 ) * refector geo sgd & communicator	5 years ago
Li Fuchen	5365cd2f14	Set lod level of sequence_unpad's output to 1 in compile time (#20068 ) * Set lod level of sequence_unpad's output to 1 in compile time	5 years ago
danleifeng	425279a57b	Improve elementwise operators performance in same dimensions. (#19763 ) Improve elementwise operators performance in same dimensions	5 years ago
liuwei1031	292aae4385	fix windows compilation issue when compile with VS2015, test=release/1.6 (#20114 )	5 years ago
Wilber	276b5e3440	fix compile paddle with anakin bug * fix compile with anakin bug * remove useless deps test=develop - 修复了联编anakin时，遇到的bug. - 编译test_anakin_activate 不通过 - 编译test_anakin_engine 不通过	5 years ago
silingtong123	649bcd5fe2	Modify the style of function names (#20071 )	5 years ago
liym27	3aa331d97e	fix conv2d and conv3d: (#20042 ) 1.support asymmetric padding; 2.support padding algorithm:"SAME" and "VALID"; 3.support channel_last: data_format NHWC and NDHWC; 4.change doc of python API and c++; test=develop, test=document_preview	5 years ago
chengjuntao	6f184775e8	Fix compling warning in deformable conv. (#20036 )	5 years ago
wangguanzhong	da892cafd5	Refine api doc (#20037 ) * refine doc, test=document_fix * add API.spec,test=develop,test=document_fix	5 years ago
silingtong123	f1eebf75aa	improve op uniform_random, argument shape support tensor and tensor in list (#19786 ) * test=develop, argument shape support tensor and tensor in list * test=develop,Increasing the coverage of CI tests * test=develop, modify the document and update API.spec * test=develop, modify the doc and update API.spec * test=develop, modify the doc and update API.spec * test=develop, modify the interface of UniformInitializer * test=develop, modify the interface of XavierInitializer and MSRAInitializer * test=develop, modify based on review's comments * test=develop, modify based on review's comments * test=develop, modify based on review's comments	5 years ago
liym27	24010472d4	fix pool2d pool3d,support asymmetric padding and channel_last (#19739 ) * fix pool2d pool3d: 1. support asymmetric padding; 2. support padding algorithm:"SAME" and "VALID"; 3. support channel_last: data_format NHWC and NDHWC; 4. support inferring shape when input with negative dims in compile time; 5. change doc of python API and c++; 6. fix bug in cuda kernel when Attr(adaptive) is true. test=develop,test=document_preview * fix 'tensors' to 'Tensors'. test=develop,test=document_preview * add test for converage ValueError.test=develop,test=document_preview * resolve conflict in test_pool2d. test=develop	5 years ago
Adam	fe581b0e8a	Minor GetMKLDNNFormat changes (#20055 ) test=develop	5 years ago
lvmengsi	c92348c3b9	fix conv_grad_grad (#20054 )	5 years ago
Kaipeng Deng	e7a6567be5	polish pool infer shape (#20038 ) * fix pool infershape. test=develop * fix unittest converage. test=develop * fix format. test=develop	5 years ago
chengduo	fb2a9cdf83	Add fp16 support for pad and split (#19881 ) * make pad and split support fp16 test=develop	5 years ago
lvmengsi	647ff784e2	fix mul double grad (#20040 )	5 years ago
tangwei12	8f0b3c0516	the integrated communicator (#19849 ) * add a base class for the Communicator * add AsyncCommunicator Impl for async distributed training	5 years ago
danleifeng	5cef7a2f25	Polish English docs of elementwise_add/sub/mul/div (#20027 ) Polish English docs of elementwise_add/sub/mul/div	5 years ago
Li Fuchen	c8e125872c	Fixed warpctc, test=develop (#20011 ) Use AllocateTmpTensor() for creating temporary tensors in warpctc.	5 years ago
wangchaochaohu	3409db950c	fix reduce bug test=develop (#19971 )	5 years ago
Adam	4b65af7719	MKLDNN BatchNorm operator refactor (#20012 ) test=develop	5 years ago
joanna.wozna.intel	1d32897c5c	Fix test pool2d int8 mkldnn (#19976 ) * Fix conv2d+dequantize squash for residual fusion test=develop * Correct int8 input test=develop * Add if exclude or include padding in pool2d mkldnn test=develop	5 years ago
Aurelius84	f58c8db668	Require x.dims=label.dims in huber_loss (#20017 ) * x.dims == y.dims test=develop * refine comment	5 years ago
Aurelius84	137e6336ef	Remove constraint that last dimension is forced to be 1 in rank_loss (#19997 ) * fix input shape check test=develop * move PADDLE_ENFORCE test=develop	5 years ago
chengduo	101a2b610a	Add dtype for coalesce_tensor_op (#20016 ) Add dtype for coalesce_tensor_op	5 years ago
Zhaolong Xing	f04f2b232a	fix if else error info (#19974 ) test=develop test=document_fix	5 years ago
gongweibao	a7512db2bc	Polish elementwise max min pow document to add more examples. (#19946 ) Polish elementwise max min pow document to add more examples	5 years ago
Aurelius84	2b5b4b3c5e	fix dataType in C++ comment in embedding op (#20004 )	5 years ago
Tao Luo	bcb2903e60	enhance shape error message of mul_op (#19998 ) test=develop	5 years ago
Chen Weihang	1409586eaa	Add LoD empty check for all related sequence ops (#19980 ) * add lod check for sequence op, test=develop * delete unnecessary check in expend op, test=develop	5 years ago
zhongpu	b1bb23841e	add kernel for fill_op, test=develop (#19719 ) * add kernel for fill_op, test=develop * modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop * add op test for fill_op, test=develop * REGISTER COP CUDA KERNEL, test=develop * update test_fill_op.py, test=develop * change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop * fix op test, test=develop * add head file, test=develop	5 years ago
wangchaochaohu	382d099dcb	add support tensor and tensorlist for strided_slice OP (#19929 ) * add support tensor and tensorlist for strided_slice OP test=develop * fix the commnet test=develop * fix test=develop * fix the bug test=develop * delete log test=develop * fix API.spec test=develop * fix test=develop	5 years ago
lvmengsi	619a241bd0	Fix OpTest of bn (#19062 ) * fix bn	5 years ago
Bob Zhu	c670058a8d	add support of matmul with multiple head even different width and height (#19708 ) * add support of matmul with multiple head even different width and height Original matmul with multiple head supports only the mat_a.width == mat_b.height, in that case, mat_b will be horizontally split. In this patch, we extend the support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height, in this case, mab_b will be vertically split. One example is A is [3, 8], B is [2, 16], head_number is 4. In this case, A will be split as [3, 2], B will be (vertically) split as [2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16] test=develop * add support of matmul with multiple head even different width and height Original matmul with multiple head supports only the mat_a.width == mat_b.height, in that case, mat_b will be horizontally split. In this patch, we extend the support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height, in this case, mab_b will be vertically split. One example is A is [3, 8], B is [2, 16], head_number is 4. In this case, A will be split as [3, 2], B will be (vertically) split as [2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16] test=develop * refactor the code of matmul with multiple head even different width and height test=develop	5 years ago
Liufang Sang	6884dc800a	refine ctc align op with padding (#19926 ) * refine ctc align op with padding * refine api sample code	5 years ago
Aurelius84	99a9615a4b	Removing length dims constraints of seq_pad and seq_unpad (#19497 ) * Removing last dims constraints of seq_pad and seq_unpad test=develop * fix test_layer api code test=develop * fix sequence_pad_op.cc conflict test=develop * remove test_analyzer_mm_dnn test=develop * fix vectorize bug test=develop * fix vectorize<int> test=develop	5 years ago
jhjiangcs	766bd529d1	add optimizer:dpsgd,test=develop (#19915 )	5 years ago
Yang Zhang	ebff68fa74	Add float16 support to `sync_batch_norm_op` (#19681 ) * Add float16 support to `sync_batch_norm_op` test=develop * Add test for sync_bn with FP16 input test=develop	5 years ago
Aurelius84	039b9710d5	Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735 ) * Remove constraint that last dimension is forced to be 1 by add lookup_table_v2 test=develop * modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop * Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop" This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9. * move api into fluid.embedding test=develop * fix example code test=develop * move one_hot into fluid.one_hot * modify api.spec test=develop * fix loss shape test=develop	5 years ago
xujiaqi01	cedc04775c	support change shuffle and train thread num (#19841 ) * support change shuffle thread num * support change train thread num * fix receive shuffle data of each channel * data norm stop gradient * add check thread_tensor type and root_tensor type when merge metric * remove sleep in shuffle, add config * add config of pslib client to client communication * fix xbox str * add data norm op testcase * add flush in trainer finalize	5 years ago
Kaipeng Deng	14625ffe9e	add elementwise mod support float/double. test=develop (#19570 )	5 years ago
Jacek Czaja	5b07ca9cdd	- ReImplemented pooling fwd mkldnn (#19911 ) - First implementation of BWD and FWD of pooling mkl-dnn - Compilation fix - Fix - Fix - Fix - Fix to crash - Compilation fix - Combined AcquireBacward with Fwd test=develop	5 years ago
Zeng Jinle	b1e83b33b0	fix huber loss op attr type, test=develop (#19937 )	5 years ago
Zeng Jinle	cc157d5990	add inplace to assign op, test=develop (#19927 )	5 years ago
Leo Chen	57606205f5	Make OpTest check grad inplace even if forward has no inplace (#19847 ) * make OpTest check grad inplace even if forward has no inplace, test=develop * do not run PE when enable_inplace is False, test=develop * add conv3d cuda kernel for float16 type, test=develop * refactor OpTest for inplace, test=develop * add comments, test=develop	5 years ago
Zhang Ting	cb8f3c03a7	resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914 )	5 years ago
Kaipeng Deng	3f021781a1	fix softmax CE time limit check failed (#19846 ) * fix softmax ce time limit check failed. test=develop * refine softmax calc. test=develop	5 years ago
石晓伟	30adea0a23	tensor_array_to_tensor_op.cc, test=develop (#19289 )	5 years ago
lvmengsi	4155e62559	add instance norm (#19500 ) * add instance norm op	5 years ago
Adam	cb65439da8	Add support for other axes in MKLDNN softmax op (#19907 ) * Initial, functional commit * Clean commit related files test=develop	5 years ago
Pei Yang	baccd7e2ca	Add TRT input shape check between model and runtime (#19864 ) * add TRT shape check, test=develop * model_input_shape == runtime_input_shape, refine message, test=develop	5 years ago
Aurelius84	fcf53e55ff	support 2-level lod of input in sequence_pool (#19839 ) * support 2-level lod of input in sequence_pool test=develop * fix lod level bug in .cu test=develop	5 years ago
Zhang Ting	93364b45c1	group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614 ) 1. group_norm support data_layout=NHWC 2. modified doc of group_norm	5 years ago
Jacek Czaja	619c797a7f	[MKL-DNN] LRN refactoring (#19798 ) - LRN mkl-dnn kernel refactor test=develop - compilation fix - Another compilation fix - Compilation fix - another compilation fix - compilation fix - Crash fix - optional LRN mkldnn workspace - Added mid allocation - Workaround for tests - Removed gradient from is_test ut - Removed mid for inference - Reverted LRN mid removal for is_test - PADDLE_ENFORCE adjusted - Rebase to templatization commit - Compilation fix - compilation fix test=develop - lint test=develop - Fix to crash - Rebase to recent codebase - lin - lint - compilation fix	5 years ago
Zhang Ting	439d95e157	modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287 ) modified interpolate_op to support tensor attribute 1. the parameter out_shape of image_resize、resize_nearest/bilinear/trilinear can be a list or a 1-D tensor variable. If a list, each element can be an integer or a tensor variable with shape: [1]. 2. the parameter scale of above Ops can be a 1-D tensor variable. modified document of image_resize, resize_nearest, resize_bilinear, resize_trilinear and add some code example.	5 years ago
Zhang Ting	b38889413d	add crop_tensor_op, test=develop, test=document_preview (#19314 ) add crop_tensor op. The main difference with crop is : 1. If the argument shape is a list, each element is an integer or a tensor variable with shape: [1]. This way is suitable for the case that the shape may be changed each iteration. 2. If the argument shape is a variable. Its rank must be 1. In crop op, the rank of shape must be the same as x offsets can be a list, in which each element is an integer or a tensor variavle with shape: [1].	5 years ago
lidanqing	2c32c2d649	Refactor conv computeINT8 (#19574 ) * fix conflicts test=develop * change mask_bias_reorder test=develop * add ComputeMask function to make code clear test=develop * change according to reviews test=develop * change according to reviews test=develop	5 years ago
Adam	c7e688921b	Add template functions for Acquire primitive/primitive_desc (#19867 ) * Add template functions for Acquire primitive/primitive_desc test=develop * Move acquire primitive descriptor to protected section test=develop	5 years ago
Aurelius84	b125e327aa	Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606 ) * Remove constraint that last dimension is forced to be 1 in cross_entropy test=develop * modify labels last dims test=develop	5 years ago
wopeizl	a7c440d303	add precise roi pooling op test=develop (#18960 ) * add precise roi pooling op test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * detail the description test=develop * test=develop * elaborate the doc for return type test=develop * test=develop	5 years ago
Yiqun Liu	3cd985a669	Add a pass to fuse fc+elementwise_add+layernorm (#19776 ) * Add fc_elementwise_layernorm_fuse pass and unittest. * Add fused_fc_elementwise_layernorm op and its GPU kernel. test=develop * Apply fc_elementwise_layernorm_fuse_pass to GPU inference. * Add the setting of attrs in the definition of binary_op. test=develop * Add comment. * Implement the unittest. test=develop * Change the unittest name of layer_norm. test=develop	5 years ago
wangchaochaohu	47af618f70	Strided slice (#19642 ) * strided_slice op basic function test=develop * test=develop rewrite and fix * fix bug test=develop * fix for the PADDLE_ENFORCE usage * add some unit testw * fix for the aip test and copright and fix test=develop * fix API.spec test=develop * fix API.spec test=develop * add axis parameter test=develop * fix for the build error test=develop * fix python api test=develop * fix the build test=develop * fix build test=develop * fix API spec test=develop * test=develop add some comment and single op test * fix API spece test=develop * fix test=develop * fix test=develop * fix api test=develop * fix api test=develop * fix API.spec test=develop * fix typo test=develop * fix API.spec test=develop * fix API typo test=develop * fix doc and API.spec test=develop	5 years ago
123malin	1bc285a53a	add retry function to try to solve grpc error code 14 (#19661 ) * rpc retry for asycsend/get/prefetch * test=develop, change retry vlog level to 3 * test=develop, set default grpc_retry_times is 3	5 years ago
LielinJiang	6d72a86b14	fix_roi_transform_bug (#19785 )	5 years ago
Zeng Jinle	3fd3b663a8	fix gc bug in controlflow ops, test=develop (#19827 )	5 years ago
Leo Chen	982e61f5ff	Update elementwise double grad to save gpu memory (#19509 ) * update elementwise double grad to save gpu memory, test=develop * update elementwise_mul/div_grad_grad to save memory, test=develop * remove eval function in eigen statement to save memory, test=develop * add unittest for elementwise_div_grad_grad without dout, test=develop * add unittest for elementwise_add_grad_grad without ddx, test=develop * add float16 cuda kernel for elementwise double grad op, test=develop	5 years ago
Adam	dfdd73cbc0	Add MKLDNNhandlerT templatized class (#19801 ) test=develop	5 years ago
Zeng Jinle	cabb9501bd	fix leaky_relu op when alpha is zero, test=develop (#19833 )	5 years ago
chengjuntao	00efd1d8a9	add deformable conv v1 op and cpu version of deformable conv v2 (#18500 ) * add deformable conv v1 op, test=develop	5 years ago
liym27	677e714425	fix pow op, support tensor for agument factor. (#19313 ) improve pow op according to reviews: 1. Delete unnecessary judgement statements in PowGradOpDescMaker; 2. Improve test of test_api; overload GetKernelTypeForVar add stop_gradient=True when attr(factor) is tensor Variable, change examples in API pow. test=develop,test=document_preview	5 years ago
liym27	bd89a27308	add tensor support for argument shape in reshape op; (#19268 ) add support parameter inference when argument shape is a list containing integer and tensor variable; test=develop fix reshape op according to reviews: 1. improve or message; 2. improve test of test_api. test=develop,test=document_preview fix reshape op: Add error message in nn.py, test=develop add stop_gradient=True when attr(shape) is tensor Variable. change examples in API reshape. test=develop,test=document_preview	5 years ago
liym27	88628016b2	add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208 ) add support parameter inference when arguments starts or ends is a list containing integer and tensor variable; test=develop,test=document_preview improve slice op according to review(from hongyu). test=develop fix slice op according to review: infer_flags, test=develop fix slice op: improve overload operator __getitem__ to support attrs(starts and ends) are Variable. test=develop,test=document_preview fix test_slice_op: add TestSliceOp_decs_dim_6 to resolve conflict with test_slice_ngraph_op. test=develop add stop_gradient=True when attr(starts) or attr(ends) is tensor Variable. test=develop,test=document_preview	5 years ago
liym27	e9e3c08777	fix expand op: (#19302 ) 1. add tensor support for argument expand_times in expand op; 2. add support parameter inference when argument expand_times is a list containing integer and tensor variable; improve expand op according to reviews: 1. add doc of ExpandTimes in expand_op.cc; 2. improve the test of test_api. add stop_gradient=True when attr(expand_times) is tensor Variable, change code examples. test=develop,test=document_preview	5 years ago
lvmengsi	b76343c3b7	cpu Conv double grad (#19672 ) * cpu conv_grad_grad	5 years ago
翟飞跃	93c85c930a	Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop * tmp add fused_emb_seq layer * Add the support of padding_idx attribute. test=develop * add padding_idx support test=develop * implement grad refer lego test=develop	5 years ago
Yiqun Liu	c67c8758cb	Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733 ) * Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop * Enhance fc_fuse_pass to enable fusing relu. * Allow print the shapes of var_desc in graph. test=develop * Enhance fc_fuse_pass_tester. * Remove the use of PADDLE_ENFORCE. test=develop * Correct the number of ops after fusing. test=develop * Fix a typo. test=develop * Set activation_type to null when there is no relu in fc. test=develop * Refine fc_fuse_pass's codes. * Enable the set of shape for tensor. * Refine repeated_fc_relu_pass and add unittest. test=develop	5 years ago
zhongpu	52673956de	add kernel for squeeze_op, test=develop (#19656 ) * add kernel for squeeze_op, test=develop * delete comment, test=develop	5 years ago
zhongpu	2a81c3679a	add kernel for unstack_op, test=develop (#19538 ) * add kernel for unstack_op, test=develop * add kernel for unstack_op, test=develop * add kernel for unstack_op, test=develop * adjust the code format, test=develop * modify some comment, test=develop	5 years ago
Kaipeng Deng	99c78b772a	fix softmax axis!=-1. test=develop (#19800 )	5 years ago

... 3 4 5 6 7 ...

4994 Commits (05c00af5f16da64d1e8953711c647512121ef3d2)