Paddle

Commit Graph

Author	SHA1	Message	Date
liu zhengxi	3cb6c0a059	Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240 ) * fix the CAPI ZeroCopy shape error and reconstruct the output obtain * use an anonymous namespace to cover the functor * fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop	6 years ago
Yiqun Liu	6b1e1f0dda	Enable generating code for a given subgraph. (#21126 ) * Enable generating code for a given subgraph. * Support sorting the subgraph. * Remove the rearange of expressions because we use the sorted subgraph directly. * Enable generating code for a subgraph which is composed of grad ops. * Use expression information to check the accuracy in unittest. * Separate load and store from computation expressions. test=develop * Improve the loading statements in generated codes. test=develop * Remove unused arguments from formal list. test=develop	6 years ago
zhaoyuchen2018	3ff5cc2d5e	Fix topk compile failed on windows (#21243 ) * Fix topk compile failed on windows * Use explicit cast for assign data	6 years ago
Pei Yang	2e2f92a5b1	fix trt weight bug (#21231 ) added splitter "__" between weight name and suffix number to avoid conflicts.	6 years ago
Zhang Ting	01a9646323	optimize assign op to avoid copy data from GPU to GPU (#21181 ) * optimize assign op to avoid copy data from GPU to GPU, test=develop * modified GetkernelTypeForVar and just avoid device transform, test=develop	6 years ago
zhouwei25	c0dcb090a3	Determine whether to copy and link inference lib by ON_INFER (#20931 )	6 years ago
danleifeng	0e7baabe59	extend elementwise broadcast function (#20957 )	6 years ago
Adam	d623e863c9	Fix GELU grad error (#21204 ) test=develop	6 years ago
Zeng Jinle	a152315be7	refine Tensor method, test=develop (#21031 )	6 years ago
yaoxuefeng	b5d8ba8394	fix data_norm op to avoid impractical normalization result test=develop (#21152 ) * fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop	6 years ago
Zeng Jinle	67e88424e5	Polish jit trace codes (#21218 ) * polish jit trace codes, test=develop * polish codes again by removing var_id, test=develop	6 years ago
Zeng Jinle	cdb3d27985	Fix warn of gcc8 (#21205 ) * fix warnings oof gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop	6 years ago
liuwei1031	d8b6cf2bcd	fix sporadically hang issue on windows(#21201 ) cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows	6 years ago
Zhang Ting	9cbe7bccba	modified error message and API doc for channel_last supported Op (#21002 ) * modified error message for conv and conv_transpose, test=develop * modified doc of conv and conv_transpose op, test=develop * modified the expression for error message, test=develop * modified error message for group_norm op, test=develop * modified detail of Attr(data_format) or Attr(data_layout) * add ValueError in API doc for maxout op, test=develop	6 years ago
Zhaolong Xing	65f7052554	TRT int8: refine trt int8 for dynamic range set (#21112 ) * refine trt int8 for dynamic range set test=develop * refine trt int8 test=develop	6 years ago
guofei	56b5d14704	Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118 )	6 years ago
WangXi	3c98ec90ce	Fix INF bug of softmax_cross_entropy_op (#21165 )	6 years ago
xujiaqi01	23876de55b	fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052 ) * fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop	6 years ago
Yihua Xu	eec9c9cbe7	Fix jit tls issue (#21151 )	6 years ago
GaoWei8	a9d4eed3a8	fix cmake fails on inference_download_and_uncompress (#21185 ) * solve cmake fails on inference_download_and_uncompress test=develop * solve cmake fails on inference_download_and_uncompress test=develop	6 years ago
xujiaqi01	9e045170c0	add copy table (#21086 ) * copy some feasigns and corresponding embeddings from one sparse table to another * copy all feasigns and corresponding embeddings from one sparse table to another * copy all dense params from one table to another * copy some local vars to other local vars	6 years ago
ruri	aeb887911f	Refine edit distance cn (#21121 )	6 years ago
Kaipeng Deng	98b59cb82c	fix elementwise_mod float point kernel. test=develop (#21183 )	6 years ago
Zeng Jinle	5fdfbe3413	Add friendly dygraph trace API (#21091 ) * friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop	6 years ago
Chen Weihang	4bd9463630	fix detail error message error, test=develop (#21170 )	6 years ago
whs	cfdd1fc2cd	Fix warpctc in padding mode. (#21033 )	6 years ago
Chen Weihang	8da0cd537a	Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134 ) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_*, test=develop add more already exists examples, test=develop	6 years ago
zhaoyuchen2018	b93870e696	Improve topk performance. (#21087 ) * Improve topk performance. give 200000 data to compute topk, before opt: cost 1s after opt: cost 0.0028s. * Refine return value. * Add cuda util funtions. * Fix ComputeBlockSize bug & refine comments. Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	6 years ago
Adam	d74ea0855f	Add relative error measure when (value > 1) (#21144 ) * Add relative error measure when value > 1 test=develop * Move code to CheckError function test=develop	6 years ago
Chen Weihang	b3a3e6f60c	change cuda enforce & add example (#21142 )	6 years ago
Chen Weihang	8414575b78	Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137 ) * add examples for error spec, test=develop * change ENFORCE to ENFORCE_**, test=develop	6 years ago
Chen Weihang	7e5f74b825	Add examples for error message writing specification - InvalidArgument (#21132 ) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_*, test=develop fix error, test=develop	6 years ago
Chen Weihang	27fa9c100b	add examples for resource exhausted error, test=develop (#21140 )	6 years ago
zhaoyuchen2018	4a544762a2	Add Asypadding for conv fusion. (#21041 ) * Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.	6 years ago
WangXi	de5d3ff688	Fix dgc buffer illegal & reuse velocity (#21012 )	6 years ago
ceci3	f62a929151	fix instance norm (#21042 ) * fix instance norm * update unitest,test=develop	6 years ago
Zeng Jinle	d625aaf0c1	remove so many logs of parallel executor, test=develop (#21105 )	6 years ago
lilong12	e249d9a3e2	fix the computation for dx (grad for x) for prelu operation. (#20949 ) * set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop	6 years ago
Chen Weihang	edd6680a71	Further simplify the C++ error info stack (#21093 ) * simplify C++ error stack by rewrite Place, test=develop * polish assignment overload func, test=develop	6 years ago
Zhang Ting	e0285eae64	add check for input channels and Attr(groups), test=develop (#21095 )	6 years ago
Yiqun Liu	35f17ae28f	Add the check of lod_level between compile-time and runtime. (#20961 ) * Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensiable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop	6 years ago
Chen Weihang	826254f664	Add pre-condition check for fuse optimizer op pass (#21005 ) * add pre condition check for fuse optimizer op pass, test=develop * add log & set init to zero, test=develop * fix test_fuse_all_reduce_pass failed, test=develop * polish details, test=develop * refine PADDLE_ENFORCE & remove needless VLOG, test=develop * refactor op check method, test=develop	6 years ago
Yiqun Liu	9091f8cdf9	Support generating code for grad_op (#21066 ) * Add the definition of operation in fusion_group. * Use operations in OperationMap to detect fusion_group of elementwise pattern. * Add namespace fusion_group in code_generator. * Use operations recorded in OperationMap to generate code. * Remove implementation codes to .cc file. * Refine Operation and CodeGenerator to make it easier to generate code for grad_op. Refine the unittest for better reuse. * Avoid recording the template's keyword in a array. * Support the generating of code for grad_op and add unittest. test=develop * Remove replaced_element_in_order and use use number instead. test=develop	6 years ago
Aurelius84	1cd6721873	Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099 )	6 years ago
joanna.wozna.intel	77c2083586	Add transpose2 INT8 for mkl-dnn (#19424 ) * Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdba4c859abb945e062ab13124f70508054, reversing changes made to 2ce6473f144da298aba4a43d46918f27d463cf7c. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd78ca47ae56881161172b2aacd349aba90. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop	6 years ago
LielinJiang	06063b7001	add op locality_aware_nms, test=develop (#20976 )	6 years ago
wangchaochaohu	fc385777e4	fix the compile cost long time test=develop (#21064 )	6 years ago
Chen Weihang	2f27b10331	Add dependency for error_codes.proto (#21084 ) * fix activation_functions deps, test=develop, test=document_fix * add error_codes_proto deps, test=develop, test=document_fix * try delete enforce.h, test=develop, test=document_fix	6 years ago
wangchaochaohu	149a1e3124	Expand refine (#21063 ) * fix the expand op compile time cost long time test=develop * add tag for just copy test=develop	6 years ago
Wojciech Uss	af3ff422cc	Fix dst memory allocation in elementwise_add (#21059 ) test=develop	6 years ago
liym27	26a6e27afe	fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997 ) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop	6 years ago
GaoWei8	829bf871d7	Add ernie c++ inference test (#21015 ) * Add ernie unit test test=develop * Add ernie unit test test=develop * Add ernie unit test test=develop * remove ngraph * optimize gpu test test=develop * optimize codes test=develop	6 years ago
mapingshuo	b592deec90	add dlpack to imdb demo, test=develop (#21069 )	6 years ago
Chen Weihang	7ee25189c3	Enrich the type of error and declare the error type interfaces (#21024 ) * Enrich the type of error and declare the error type interfaces, test=develop * adjust tests to adapt new form, test=develop * add inference deps with error_codes.pb.h, test=develop * restore stack iter start pos, test=develop * polish code based review comments, test=develop	6 years ago
Adam	3fda695bb0	Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062 ) * Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop	6 years ago
Huihuang Zheng	1957192f05	Add select_input_op and select_output_op (#21016 ) These ops are useful in control flow.	6 years ago
Liufang Sang	e5e699ecc0	set lod level for compile time test=develop (#21022 )	6 years ago
liym27	f0e95a6049	Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017 )	6 years ago
Zeng Jinle	a710ccc0cb	refine error message of allocator again, test=develop (#21023 )	6 years ago
zhaoyuchen2018	0059404e77	Fix ce ocr_recognition test fails (#20987 ) ocr_recognition fails, so add a path to handle small frame_size. test=develop	6 years ago
Zeng Jinle	f56967c483	refine error message of gpu allocator, test=develop (#21008 )	6 years ago
Chengmo	bc8e600ce5	Fix rpc not wait in GEO communicator (#20967 ) * test=develop,fix rpc not wait in geo	6 years ago
Leo Chen	008ed65fd5	Add c++ global current tracer for dygraph (#20882 ) * Add c++ global current tracer for dygraph, test=develop * add tracer property in c++, test=develop * support different place, test=develop * add unittest for tracer, test=develop	6 years ago
Zeng Jinle	5aae595902	fix no_need_buffer_vars_dep, test=develop, test=document_fix (#21007 )	6 years ago
xujiaqi01	1d1a07937a	simplify master+patch，remove ins when size != merge_size or has conflict slot (#20913 ) * remove duplicate code and duplicate config of master+patch * drop all ins which has conflict slot or size < merge_size * user only need to set merge size，if ins num of same id is not equal to merge size, just drop these ins * user must make sure master data and patch data has no same slot whose feasigns are both non-zero, otherwise these ins will be dropped. (slot list should still be the same of both master and patch) * test=develop	6 years ago
Tao Luo	25ffa8445d	refine murmurhash3_x64_128 for bloom_filter (#20996 ) test=develop	6 years ago
Zeng Jinle	878a40f57d	Support NoNeedBufferVarsInference in dygraph backward (#20868 ) * support no need buffer vars in dygraph, test=develop * fix inference compilation error, test=develop * update no_need_buffer_vars_inference, test=develop * add unittests for no_need_buffer_vars_context, test=develop * refine no_need_buffer_vars by return ref, test=develop * polish some codes, test=develop	6 years ago
wangchaochaohu	bf379fef96	refine code for code reuse test=develop (#20988 )	6 years ago
Zhang Ting	de9bec607e	lrn supports channel_last input, test=develop (#20954 )	6 years ago
Liufang Sang	9b666cae67	fix diff in dequantize op between cpu and gpu test=develop (#20953 )	6 years ago
zhongpu	065804d39e	fix bug in grad_op compute for dygraph, test=develop (#20975 )	6 years ago
Wilber	c534149642	fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960 ) fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param	6 years ago
Zhang Ting	f4f85831d3	fix the bug of conv_transpose cudnn kernel, test=develop (#20958 ) fix the bug of conv_transpose cudnn kernel: before version 1.6, the data_format is AnyLayout in inference model. When use version 1.6 and load the model which is saved by previous version, the error occurs. This is because the cudnn kernel in version 1.6 is not compitable with Anylayout setting.	6 years ago
wangchaochaohu	7695b713e1	gpu info query refine test=develop (#20904 )	6 years ago
Leo Chen	2c3c579b9b	tensor.set() supports array list and remove unused code, test=develop (#20959 )	6 years ago
WangXi	eec4fa9099	And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935 )	6 years ago
Leo Chen	9974e40787	Update Tensor.set() to support float16 (#19964 ) * don't expose numerous Tensor.set(), test=develop * fix condition, test=develop * fix float16 bug, test=develop * feed should be Tensor or np.array, not Variable or number, test=develop * use forcecast to copy numpy slice to new array, test=develop * remove float16-uint16 hacking, test=develop	6 years ago
zhaoyuchen2018	7f3a445e9a	Fix gru as small frame_size has error. (#20922 ) seems shuffle_sync cannot handle small size test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	6 years ago
Zeng Jinle	b0c0ffb9ae	refine pe when exception raises, test=develop (#20894 )	6 years ago
123malin	20cdff0e02	Optimize decay (#20816 ) * update pserver decay blocks * update distributed notify handler	6 years ago
Chengmo	16596f6498	Fix Paddle Cloud role maker (#20860 ) * fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times	6 years ago
liym27	59de8e1214	Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912 )	6 years ago
Zhang Ting	8d1e9f0f7e	maxout supports channel_last input (#20846 ) * maxout support channel_last input, test=develop * modified details of Input(X) and Attr(groups, axis) in doc, test=develop	6 years ago
Yihua Xu	b6260f3866	Optimize the kernel implementation of layernorm with openmp (#20895 )	6 years ago
hong	8c4573a3cb	GradMaker for dygraph (#19706 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop	6 years ago
Thunderbrook	59bcdc8a19	support dump param of model into afs (#20302 ) * support dump param to afs test=develop * code style test=develop * code style test=develop * dump param test=develop * dump param test=develop * dump param test=develop * dump param test=develop	6 years ago
Chen Weihang	768551b25d	Add parameter init check add run_startup_progrom error message for fc(mul) (#20906 )	6 years ago
Zhang Ting	c18f1bd716	fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897 )	6 years ago
Chen Weihang	3358455c86	Polish and arrange code in enforce.h (#20901 )	6 years ago
Yiqun Liu	16e4d02675	Refine the cache of program, context and scope in executor. (#18483 ) * Refine the cache of program, context and scope in executor. test=develop * Refine the unittest test_executor_and_use_program_cache. * Add the test the PaddingRNN with use_program_cache=True. test=develop * Remove a check. test=develop * Refine the unittest to check whether it is correct when setting use_program_cache=True. test=develop	6 years ago
Wilber	b489760099	fix jit_matmul bug test=develop (#20886 ) * fix jit_matmul bug * update jit matmul and add test	6 years ago
Yiqun Liu	03ba0fdae6	Move the codes of fused operators to operators/fused directory. (#20881 ) * Move the codes of fused operators to operators/fused directory. test=develop * Correct the op name in cmake. * Change the use of PADDLE_ENFORCE. test=develop	6 years ago
Leo Chen	a9bc92c314	add c++ unique_name_generator, test=develop (#20871 )	6 years ago
zhang wenhui	d428912503	fix select_rows mergeadd bug, test=develop (#20876 )	6 years ago
Zeng Jinle	c51722c820	refine err msg of allocator, test=develop (#20879 )	6 years ago
hong	ff0886a92a	save load problem fix and new feature add (#20823 ) * fix persistable; * fix save load bugs; test=develop * fix bug; test=develop * add example for new io api; test=develop * addd example; test=develop	6 years ago
liym27	6802539a2e	support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780 ) * improve split and concat op: 1. support Tensor for argument 'dim' in split op. 2. support Tensor for argument 'axis' in concat op. test=develop * redefine function GetDataFromTensor and set unknown output shape to - 1. test=develop * add check: Attr(sections) match Input(X). test=develop * support Tensor for attr(sections) and attr(sections) can contain -1. add check for attr(sections). test=develop * modify error message for concat and call Resize only when necessary. test=develop	6 years ago
wangchaochaohu	28ca2e5ffa	strided_slice perforamnce improvement test=develop (#20852 )	6 years ago
Yiqun Liu	6fcfd32e6c	Check and correct the output's lod_level in DynamicRNN related operators (#19144 ) * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime. test=develop * Add comment for ReorderLoDTensorByRank op. * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time. test=develop * ShrinkRNNMemory op should call ShareLoD for compile time. test=develop * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool. test=develop * Refine the unittest of DynamicRNN. test=develop * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE. test=develop	6 years ago
Yiqun Liu	b5f3be8330	Implement a pass detect fusion group of elementwise op (#19884 ) * Add fusion_group_pass and elementwise pattern. * Rewrite the detector of elementwise group. test=develop * Add a comment in codegen. * Add more unittest cases. test=develop * Move code_generator related code to fusion_group directory. * Correct the including path. * Add the definition of SubGraph and finish the insert of fusion_group op in pass. * Insert graph_vis_pass in tester to visualize the graph for debug.	6 years ago
liym27	84d221b667	improve unsqueeze op to support int, Tensor for argument axes (#20824 ) * improve unsqueeze op to support int, Tensor and Tensor list for argument axes. test=develop * call Resize only when necessary. test=develop	6 years ago
silingtong123	03d7f3ddb2	Make shape tensor support int32 (#20757 ) * Make shape tensor support int32	6 years ago
Huihuang Zheng	95ba4bd2ab	Add shape and type check at read_op (#20754 )	6 years ago
Zeng Jinle	bb8d778358	lazy init of allocators, test=develop (#20854 )	6 years ago
Aurelius84	aacd16dbb4	add pyramid_hash_op (#20698 )	6 years ago
Zeng Jinle	98103d3003	remove some unnecessary logs in pe, test=develop (#20848 )	6 years ago
Chen Weihang	8b59ac3ad0	delete paddle infershape enforce marco (#20832 )	6 years ago
whs	c8e49be2f1	Fix roi_perspective_transform op (#20764 )	6 years ago
Chen Weihang	26cc1fe508	Replace risky GetInputType method with secure IndicateVarDataType interface (#20668 ) * replace part of the old implementation, test=develop * restore concat op, test=develop * update all ops implemention & delete GetDataTypeOfVar func, test=develop	6 years ago
xujiaqi01	48669aa8f0	fix several sparse table issuses (#20686 ) * no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto. * add find_distributed_lookup_table_grads instead of hard code GRAD * support embedding stop gradient. push sparse has error before fix this.* * fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this. * fix pull sparse, skip slots which do not have embedding. * fix collect feasign label info, skip slots which do not have embedding. * support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables. * test=develop	6 years ago
Yamei-Lee	cf717fd6dd	fix bug in reshape: (#20781 ) consider the situation that shape of input can contain more than one -1. test=develop	6 years ago
Chen Weihang	1d1552d106	Make formatted ENFORCE stack adapt to more situations (#20826 ) * Make formatted ENFORCE stack adapt to more situations and polish details, test=develop * restore template message position, test=develop	6 years ago
Zeng Jinle	378fc4fb1c	add some docs to jit.trace, test=develop (#20811 )	6 years ago
Zhang Ting	5a8d885d72	All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756 ) * All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview * fix the bug that attr(offsets) should be initialized, test=develop	6 years ago
danleifeng	9171f73714	fix fp16 grid_size for size=1; test=develop (#20812 )	6 years ago
Zeng Jinle	cd1c404353	refine err msg of allocator, test=develop (#20804 )	6 years ago
Zeng Jinle	ac813bbaf4	Add more error debug message to Operator::Run (#20793 ) * add more err msg, test=develop * add more unittests, test=develop	6 years ago
Tao Luo	efbdad0596	make search_compute support avx default (#20779 ) * make search_compute support avx only * clean search_compute.h * rename sse_axpy to avx_axpy test=develop * update CMakeLists.txt test=develop	6 years ago
zhongpu	3556514e97	add PADDLE_ENFORCE for dygraph to optimize error throw (#19783 ) * add PADDLE_ENFORCE for dygraph to optimize error throw, test=develop * fix some error, test=develop * delete PADDLE_ENFORCE_EQ in VarBase::NewVarBase, test=develop	6 years ago
WangXi	250e72d254	Fix DGC algorithm flow to make it the same as paper (#20758 )	6 years ago
wangchaochaohu	ba45dce35d	fix codetest for windows make test=develop (#20796 )	6 years ago
Zeng Jinle	8ff6b289bd	[Dygraph to static graph]JIT/Trace (#20775 ) * jit/trace 1st version, test=develop * add more unittests, test=develop	6 years ago
zhaoyuchen2018	6e6eab07e8	Fix multihead op bug. (#20783 ) The op should handle k=1024 test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	6 years ago
lvmengsi	dfa0549f87	Revert "fix_depthwise_conv_cudnn, test=develop (#20712 )" (#20782 ) This reverts commit `dc229b4195`.	6 years ago
whs	4c7d196d83	Add norm_by_time for warpctc op in padding mode. (#17580 )	6 years ago
Pei Yang	e89c16b90d	Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733 ) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop	6 years ago
石晓伟	37cd43545a	update the infer shape of matmul, test=develop (#20717 ) * update the infer shape of matmul, test=release/1.6 * add unittests of matmul, test=release/1.6 * change func names, test=develop	6 years ago
石晓伟	e742760f8e	optimize version error, test=develop (#20715 )	6 years ago
Adam	67b59ddb38	Minor MKL-DNN conv int8 performance fixes (#20753 ) test=develop	6 years ago
wangchaochaohu	0687bcd64f	Refine getitem of Variable (#20729 ) * add support for __get_item__ of Variable test=develop	6 years ago
zhongpu	72d1d72c09	fix ExecutionContext::HasInput and ExecutionContext::HasOutput depend on the scope structure, test=develop (#20721 )	6 years ago
danleifeng	79e08ecebf	add assertions on whether elementwise_div divison is zero (#20618 )	6 years ago
bingyanghuang	fd49ebcbd8	update int8 benchmark with 6271 data, test=develop test=document_fix (#20736 )	6 years ago
123malin	95e90aa102	test=develop, add communicator_is_sgd_optimizer flag (#20677 ) * test=develop, communicator_is_sgd_optimizer flags	6 years ago
Aurelius84	74a28f5ea4	fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722 )	6 years ago
石晓伟	48a774c713	fix ts_sort's bug, test=develop (#20720 )	6 years ago
lvmengsi	dc229b4195	fix_depthwise_conv_cudnn, test=develop (#20712 )	6 years ago
石晓伟	d8f4f4239d	Ensure backward compatibility with the anakin interface, test=develop (#20691 ) * support MLU nums, test=develop * change anakin apis, test=develop	6 years ago
wopeizl	9e5948230e	add support to gcc8, add docker env test=develop (#19807 ) * add support to gcc8, add docker env test=develop	6 years ago
xujiaqi01	5223b0dd9d	add check nan / inf in downpour worker (#20694 ) * add check nan / inf in downpour worker during training * test=develop	6 years ago
WangXi	507afa8a8a	Fix dgc nan by stripping nccl from sparseReduce. (#20630 )	6 years ago
gongweibao	c1710e91b2	Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690 )	6 years ago
lidanqing	46e93f7c86	Revert "Refactor conv computeINT8" (#20640 ) * Revert "Refactor conv computeINT8 (#19574)" This reverts commit `2c32c2d649`. test=develop * replace PADDLE_ENFORCE test=develop	6 years ago
liu zhengxi	d39777fefa	alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697 ) modify the way to pass parameter out_size in function.	6 years ago
Zeng Jinle	4eeda9d676	fix tensor_util, test=develop (#20699 )	6 years ago
Zeng Jinle	ab575de725	Fix op run log when memory optimization strategy is enabled (#20695 )	6 years ago
Jacek Czaja	a1cd27f13f	[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241 ) * - Flushing mkl-dnn cache test=develop - Disabled clearing cache for LoadModel - Added clearing of mkl-dnn cache when Executor is created test=develop - Do not clear for GPU places test=develop - compilation fix test=develop * - Moved clearing of mkl-dnn cache in destructor of executor test=develop * - Compilation fix test=develop - Reverted conditional clearing of mkl-dnn cache in Executors's destructor test=develop - compilation fix	6 years ago
Zeng Jinle	10505faf4e	polish codes, test=develop (#20672 )	6 years ago
liuwei1031	569951c418	improve the efficiency of BuddyAllocator (#19888 ) * improve save and load behaviour, test=develop * code cleaning, test=develop * disable check_guards and update_guards in release version, test=develop * fix compilation issue, test=develop * add buddy_allocator speed test data, test=develop * fix compilation issue, test=develop * fix comment, test=develop * update function names according to the google C++ style guide, test=develop * tweak the test data format, test=develop * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop * add accessor and mutator for Desc, test=develop	6 years ago
Zeng Jinle	34e3adaece	Refine reduce codes to save compiling time and binary size (#20676 ) * refine reduce code to save compiling time and binary sizes, test=develop * add reduce rank check to avoid bug, test=develop	6 years ago

1 2 3 4 5 ...

9086 Commits (4f81d1bd5f9be4a564cfc8bd7e2a2bc3b21c24d1)