Paddle

Commit Graph

Author	SHA1	Message	Date
WangXi	3c98ec90ce	Fix INF bug of softmax_cross_entropy_op (#21165 )	5 years ago
xujiaqi01	23876de55b	fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052 ) * fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop	5 years ago
Yihua Xu	eec9c9cbe7	Fix jit tls issue (#21151 )	5 years ago
GaoWei8	a9d4eed3a8	fix cmake fails on inference_download_and_uncompress (#21185 ) * solve cmake fails on inference_download_and_uncompress test=develop * solve cmake fails on inference_download_and_uncompress test=develop	5 years ago
xujiaqi01	9e045170c0	add copy table (#21086 ) * copy some feasigns and corresponding embeddings from one sparse table to another * copy all feasigns and corresponding embeddings from one sparse table to another * copy all dense params from one table to another * copy some local vars to other local vars	5 years ago
ruri	aeb887911f	Refine edit distance cn (#21121 )	5 years ago
Kaipeng Deng	98b59cb82c	fix elementwise_mod float point kernel. test=develop (#21183 )	5 years ago
Zeng Jinle	5fdfbe3413	Add friendly dygraph trace API (#21091 ) * friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop	5 years ago
Chen Weihang	4bd9463630	fix detail error message error, test=develop (#21170 )	5 years ago
whs	cfdd1fc2cd	Fix warpctc in padding mode. (#21033 )	5 years ago
Chen Weihang	8da0cd537a	Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134 ) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_*, test=develop add more already exists examples, test=develop	5 years ago
zhaoyuchen2018	b93870e696	Improve topk performance. (#21087 ) * Improve topk performance. give 200000 data to compute topk, before opt: cost 1s after opt: cost 0.0028s. * Refine return value. * Add cuda util funtions. * Fix ComputeBlockSize bug & refine comments. Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Adam	d74ea0855f	Add relative error measure when (value > 1) (#21144 ) * Add relative error measure when value > 1 test=develop * Move code to CheckError function test=develop	5 years ago
Chen Weihang	b3a3e6f60c	change cuda enforce & add example (#21142 )	5 years ago
Chen Weihang	8414575b78	Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137 ) * add examples for error spec, test=develop * change ENFORCE to ENFORCE_**, test=develop	5 years ago
Chen Weihang	7e5f74b825	Add examples for error message writing specification - InvalidArgument (#21132 ) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_*, test=develop fix error, test=develop	5 years ago
Chen Weihang	27fa9c100b	add examples for resource exhausted error, test=develop (#21140 )	5 years ago
zhaoyuchen2018	4a544762a2	Add Asypadding for conv fusion. (#21041 ) * Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.	5 years ago
WangXi	de5d3ff688	Fix dgc buffer illegal & reuse velocity (#21012 )	5 years ago
ceci3	f62a929151	fix instance norm (#21042 ) * fix instance norm * update unitest,test=develop	5 years ago
Zeng Jinle	d625aaf0c1	remove so many logs of parallel executor, test=develop (#21105 )	5 years ago
lilong12	e249d9a3e2	fix the computation for dx (grad for x) for prelu operation. (#20949 ) * set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop	5 years ago
Chen Weihang	edd6680a71	Further simplify the C++ error info stack (#21093 ) * simplify C++ error stack by rewrite Place, test=develop * polish assignment overload func, test=develop	5 years ago
Zhang Ting	e0285eae64	add check for input channels and Attr(groups), test=develop (#21095 )	5 years ago
Yiqun Liu	35f17ae28f	Add the check of lod_level between compile-time and runtime. (#20961 ) * Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensiable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop	5 years ago
Chen Weihang	826254f664	Add pre-condition check for fuse optimizer op pass (#21005 ) * add pre condition check for fuse optimizer op pass, test=develop * add log & set init to zero, test=develop * fix test_fuse_all_reduce_pass failed, test=develop * polish details, test=develop * refine PADDLE_ENFORCE & remove needless VLOG, test=develop * refactor op check method, test=develop	5 years ago
Yiqun Liu	9091f8cdf9	Support generating code for grad_op (#21066 ) * Add the definition of operation in fusion_group. * Use operations in OperationMap to detect fusion_group of elementwise pattern. * Add namespace fusion_group in code_generator. * Use operations recorded in OperationMap to generate code. * Remove implementation codes to .cc file. * Refine Operation and CodeGenerator to make it easier to generate code for grad_op. Refine the unittest for better reuse. * Avoid recording the template's keyword in a array. * Support the generating of code for grad_op and add unittest. test=develop * Remove replaced_element_in_order and use use number instead. test=develop	5 years ago
Aurelius84	1cd6721873	Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099 )	5 years ago
joanna.wozna.intel	77c2083586	Add transpose2 INT8 for mkl-dnn (#19424 ) * Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdba4c859abb945e062ab13124f70508054, reversing changes made to 2ce6473f144da298aba4a43d46918f27d463cf7c. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd78ca47ae56881161172b2aacd349aba90. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop	5 years ago
LielinJiang	06063b7001	add op locality_aware_nms, test=develop (#20976 )	5 years ago
wangchaochaohu	fc385777e4	fix the compile cost long time test=develop (#21064 )	5 years ago
Chen Weihang	2f27b10331	Add dependency for error_codes.proto (#21084 ) * fix activation_functions deps, test=develop, test=document_fix * add error_codes_proto deps, test=develop, test=document_fix * try delete enforce.h, test=develop, test=document_fix	5 years ago
wangchaochaohu	149a1e3124	Expand refine (#21063 ) * fix the expand op compile time cost long time test=develop * add tag for just copy test=develop	5 years ago
Wojciech Uss	af3ff422cc	Fix dst memory allocation in elementwise_add (#21059 ) test=develop	5 years ago
liym27	26a6e27afe	fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997 ) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop	5 years ago
GaoWei8	829bf871d7	Add ernie c++ inference test (#21015 ) * Add ernie unit test test=develop * Add ernie unit test test=develop * Add ernie unit test test=develop * remove ngraph * optimize gpu test test=develop * optimize codes test=develop	5 years ago
mapingshuo	b592deec90	add dlpack to imdb demo, test=develop (#21069 )	5 years ago
Chen Weihang	7ee25189c3	Enrich the type of error and declare the error type interfaces (#21024 ) * Enrich the type of error and declare the error type interfaces, test=develop * adjust tests to adapt new form, test=develop * add inference deps with error_codes.pb.h, test=develop * restore stack iter start pos, test=develop * polish code based review comments, test=develop	5 years ago
Adam	3fda695bb0	Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062 ) * Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop	5 years ago
Huihuang Zheng	1957192f05	Add select_input_op and select_output_op (#21016 ) These ops are useful in control flow.	5 years ago
Liufang Sang	e5e699ecc0	set lod level for compile time test=develop (#21022 )	5 years ago
liym27	f0e95a6049	Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017 )	5 years ago
ddokupil	c98712d56e	add ending message in paddle_build.sh (#20334 ) * add ending message Add ending message to indicate script execution properly finished * Update paddle_build.sh test=develop * Update paddle_build.sh test=document_fix * Update paddle_build.sh test=develop test=document_fix	5 years ago
Zeng Jinle	a710ccc0cb	refine error message of allocator again, test=develop (#21023 )	5 years ago
tianshuo78520a	d89ca2ffb5	split api_spec document (#20999 ) * split api_spec document;test=document_fix * change tools * test=develop;test=document_fix * test=develop;test=document_fix	5 years ago
zhaoyuchen2018	0059404e77	Fix ce ocr_recognition test fails (#20987 ) ocr_recognition fails, so add a path to handle small frame_size. test=develop	5 years ago
Zeng Jinle	f56967c483	refine error message of gpu allocator, test=develop (#21008 )	5 years ago
Chengmo	bc8e600ce5	Fix rpc not wait in GEO communicator (#20967 ) * test=develop,fix rpc not wait in geo	5 years ago
Leo Chen	008ed65fd5	Add c++ global current tracer for dygraph (#20882 ) * Add c++ global current tracer for dygraph, test=develop * add tracer property in c++, test=develop * support different place, test=develop * add unittest for tracer, test=develop	5 years ago
Zeng Jinle	5aae595902	fix no_need_buffer_vars_dep, test=develop, test=document_fix (#21007 )	5 years ago
xujiaqi01	1d1a07937a	simplify master+patch，remove ins when size != merge_size or has conflict slot (#20913 ) * remove duplicate code and duplicate config of master+patch * drop all ins which has conflict slot or size < merge_size * user only need to set merge size，if ins num of same id is not equal to merge size, just drop these ins * user must make sure master data and patch data has no same slot whose feasigns are both non-zero, otherwise these ins will be dropped. (slot list should still be the same of both master and patch) * test=develop	5 years ago
Tao Luo	25ffa8445d	refine murmurhash3_x64_128 for bloom_filter (#20996 ) test=develop	5 years ago
Zeng Jinle	878a40f57d	Support NoNeedBufferVarsInference in dygraph backward (#20868 ) * support no need buffer vars in dygraph, test=develop * fix inference compilation error, test=develop * update no_need_buffer_vars_inference, test=develop * add unittests for no_need_buffer_vars_context, test=develop * refine no_need_buffer_vars by return ref, test=develop * polish some codes, test=develop	5 years ago
wangchaochaohu	bf379fef96	refine code for code reuse test=develop (#20988 )	5 years ago
Yucheng	98f1cebd38	add sample code test under python3 and enabled multi-thread (#20950 ) * test=develop	5 years ago
Zhang Ting	de9bec607e	lrn supports channel_last input, test=develop (#20954 )	5 years ago
Liufang Sang	9b666cae67	fix diff in dequantize op between cpu and gpu test=develop (#20953 )	5 years ago
zhongpu	065804d39e	fix bug in grad_op compute for dygraph, test=develop (#20975 )	5 years ago
Wilber	c534149642	fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960 ) fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param	5 years ago
Zhang Ting	f4f85831d3	fix the bug of conv_transpose cudnn kernel, test=develop (#20958 ) fix the bug of conv_transpose cudnn kernel: before version 1.6, the data_format is AnyLayout in inference model. When use version 1.6 and load the model which is saved by previous version, the error occurs. This is because the cudnn kernel in version 1.6 is not compitable with Anylayout setting.	5 years ago
wangchaochaohu	7695b713e1	gpu info query refine test=develop (#20904 )	5 years ago
Leo Chen	2c3c579b9b	tensor.set() supports array list and remove unused code, test=develop (#20959 )	5 years ago
WangXi	eec4fa9099	And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935 )	5 years ago
Leo Chen	9974e40787	Update Tensor.set() to support float16 (#19964 ) * don't expose numerous Tensor.set(), test=develop * fix condition, test=develop * fix float16 bug, test=develop * feed should be Tensor or np.array, not Variable or number, test=develop * use forcecast to copy numpy slice to new array, test=develop * remove float16-uint16 hacking, test=develop	5 years ago
zhaoyuchen2018	7f3a445e9a	Fix gru as small frame_size has error. (#20922 ) seems shuffle_sync cannot handle small size test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Zeng Jinle	b0c0ffb9ae	refine pe when exception raises, test=develop (#20894 )	5 years ago
123malin	20cdff0e02	Optimize decay (#20816 ) * update pserver decay blocks * update distributed notify handler	5 years ago
Chengmo	16596f6498	Fix Paddle Cloud role maker (#20860 ) * fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times	5 years ago
liym27	59de8e1214	Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912 )	5 years ago
Zhang Ting	8d1e9f0f7e	maxout supports channel_last input (#20846 ) * maxout support channel_last input, test=develop * modified details of Input(X) and Attr(groups, axis) in doc, test=develop	5 years ago
Yihua Xu	b6260f3866	Optimize the kernel implementation of layernorm with openmp (#20895 )	5 years ago
hong	8c4573a3cb	GradMaker for dygraph (#19706 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop	5 years ago
Thunderbrook	59bcdc8a19	support dump param of model into afs (#20302 ) * support dump param to afs test=develop * code style test=develop * code style test=develop * dump param test=develop * dump param test=develop * dump param test=develop * dump param test=develop	5 years ago
Chen Weihang	768551b25d	Add parameter init check add run_startup_progrom error message for fc(mul) (#20906 )	5 years ago
Zhang Ting	c18f1bd716	fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897 )	5 years ago
Chen Weihang	3358455c86	Polish and arrange code in enforce.h (#20901 )	5 years ago
Yiqun Liu	16e4d02675	Refine the cache of program, context and scope in executor. (#18483 ) * Refine the cache of program, context and scope in executor. test=develop * Refine the unittest test_executor_and_use_program_cache. * Add the test the PaddingRNN with use_program_cache=True. test=develop * Remove a check. test=develop * Refine the unittest to check whether it is correct when setting use_program_cache=True. test=develop	5 years ago
Wilber	b489760099	fix jit_matmul bug test=develop (#20886 ) * fix jit_matmul bug * update jit matmul and add test	5 years ago
Yiqun Liu	03ba0fdae6	Move the codes of fused operators to operators/fused directory. (#20881 ) * Move the codes of fused operators to operators/fused directory. test=develop * Correct the op name in cmake. * Change the use of PADDLE_ENFORCE. test=develop	5 years ago
Leo Chen	a9bc92c314	add c++ unique_name_generator, test=develop (#20871 )	5 years ago
zhang wenhui	d428912503	fix select_rows mergeadd bug, test=develop (#20876 )	5 years ago
Zeng Jinle	c51722c820	refine err msg of allocator, test=develop (#20879 )	5 years ago
hong	ff0886a92a	save load problem fix and new feature add (#20823 ) * fix persistable; * fix save load bugs; test=develop * fix bug; test=develop * add example for new io api; test=develop * addd example; test=develop	5 years ago
liym27	6802539a2e	support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780 ) * improve split and concat op: 1. support Tensor for argument 'dim' in split op. 2. support Tensor for argument 'axis' in concat op. test=develop * redefine function GetDataFromTensor and set unknown output shape to - 1. test=develop * add check: Attr(sections) match Input(X). test=develop * support Tensor for attr(sections) and attr(sections) can contain -1. add check for attr(sections). test=develop * modify error message for concat and call Resize only when necessary. test=develop	5 years ago
wangchaochaohu	28ca2e5ffa	strided_slice perforamnce improvement test=develop (#20852 )	5 years ago
Yiqun Liu	6fcfd32e6c	Check and correct the output's lod_level in DynamicRNN related operators (#19144 ) * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime. test=develop * Add comment for ReorderLoDTensorByRank op. * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time. test=develop * ShrinkRNNMemory op should call ShareLoD for compile time. test=develop * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool. test=develop * Refine the unittest of DynamicRNN. test=develop * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE. test=develop	5 years ago
Yiqun Liu	b5f3be8330	Implement a pass detect fusion group of elementwise op (#19884 ) * Add fusion_group_pass and elementwise pattern. * Rewrite the detector of elementwise group. test=develop * Add a comment in codegen. * Add more unittest cases. test=develop * Move code_generator related code to fusion_group directory. * Correct the including path. * Add the definition of SubGraph and finish the insert of fusion_group op in pass. * Insert graph_vis_pass in tester to visualize the graph for debug.	5 years ago
liym27	84d221b667	improve unsqueeze op to support int, Tensor for argument axes (#20824 ) * improve unsqueeze op to support int, Tensor and Tensor list for argument axes. test=develop * call Resize only when necessary. test=develop	5 years ago
silingtong123	03d7f3ddb2	Make shape tensor support int32 (#20757 ) * Make shape tensor support int32	5 years ago
Huihuang Zheng	95ba4bd2ab	Add shape and type check at read_op (#20754 )	5 years ago
Zeng Jinle	bb8d778358	lazy init of allocators, test=develop (#20854 )	5 years ago
Aurelius84	aacd16dbb4	add pyramid_hash_op (#20698 )	5 years ago
Zeng Jinle	98103d3003	remove some unnecessary logs in pe, test=develop (#20848 )	5 years ago
Chen Weihang	8b59ac3ad0	delete paddle infershape enforce marco (#20832 )	5 years ago
whs	c8e49be2f1	Fix roi_perspective_transform op (#20764 )	5 years ago
Chen Weihang	26cc1fe508	Replace risky GetInputType method with secure IndicateVarDataType interface (#20668 ) * replace part of the old implementation, test=develop * restore concat op, test=develop * update all ops implemention & delete GetDataTypeOfVar func, test=develop	5 years ago
xujiaqi01	48669aa8f0	fix several sparse table issuses (#20686 ) * no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto. * add find_distributed_lookup_table_grads instead of hard code GRAD * support embedding stop gradient. push sparse has error before fix this.* * fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this. * fix pull sparse, skip slots which do not have embedding. * fix collect feasign label info, skip slots which do not have embedding. * support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables. * test=develop	5 years ago
Yamei-Lee	cf717fd6dd	fix bug in reshape: (#20781 ) consider the situation that shape of input can contain more than one -1. test=develop	5 years ago
Chen Weihang	1d1552d106	Make formatted ENFORCE stack adapt to more situations (#20826 ) * Make formatted ENFORCE stack adapt to more situations and polish details, test=develop * restore template message position, test=develop	5 years ago
Zeng Jinle	378fc4fb1c	add some docs to jit.trace, test=develop (#20811 )	5 years ago
Zhang Ting	5a8d885d72	All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756 ) * All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview * fix the bug that attr(offsets) should be initialized, test=develop	5 years ago
danleifeng	9171f73714	fix fp16 grid_size for size=1; test=develop (#20812 )	5 years ago
Zeng Jinle	cd1c404353	refine err msg of allocator, test=develop (#20804 )	5 years ago
Zeng Jinle	ac813bbaf4	Add more error debug message to Operator::Run (#20793 ) * add more err msg, test=develop * add more unittests, test=develop	5 years ago
Tao Luo	efbdad0596	make search_compute support avx default (#20779 ) * make search_compute support avx only * clean search_compute.h * rename sse_axpy to avx_axpy test=develop * update CMakeLists.txt test=develop	5 years ago
zhongpu	3556514e97	add PADDLE_ENFORCE for dygraph to optimize error throw (#19783 ) * add PADDLE_ENFORCE for dygraph to optimize error throw, test=develop * fix some error, test=develop * delete PADDLE_ENFORCE_EQ in VarBase::NewVarBase, test=develop	5 years ago
WangXi	250e72d254	Fix DGC algorithm flow to make it the same as paper (#20758 )	5 years ago
wangchaochaohu	ba45dce35d	fix codetest for windows make test=develop (#20796 )	5 years ago
Zeng Jinle	8ff6b289bd	[Dygraph to static graph]JIT/Trace (#20775 ) * jit/trace 1st version, test=develop * add more unittests, test=develop	5 years ago
zhaoyuchen2018	6e6eab07e8	Fix multihead op bug. (#20783 ) The op should handle k=1024 test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
lvmengsi	dfa0549f87	Revert "fix_depthwise_conv_cudnn, test=develop (#20712 )" (#20782 ) This reverts commit `dc229b4195`.	5 years ago
whs	4c7d196d83	Add norm_by_time for warpctc op in padding mode. (#17580 )	5 years ago
Pei Yang	e89c16b90d	Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733 ) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop	5 years ago
tianshuo78520a	1105b93288	del uninstall protobuf (#20769 )	5 years ago
Tao Luo	2f5f19dfb5	mv sampcd_processor.py to tools/ (#20761 ) * mv sampcd_processor.py to tools test=develop test=document_fix * update example script test=develop test=document_fix	5 years ago
石晓伟	37cd43545a	update the infer shape of matmul, test=develop (#20717 ) * update the infer shape of matmul, test=release/1.6 * add unittests of matmul, test=release/1.6 * change func names, test=develop	5 years ago
石晓伟	e742760f8e	optimize version error, test=develop (#20715 )	5 years ago
Adam	67b59ddb38	Minor MKL-DNN conv int8 performance fixes (#20753 ) test=develop	5 years ago
wangchaochaohu	0687bcd64f	Refine getitem of Variable (#20729 ) * add support for __get_item__ of Variable test=develop	5 years ago
zhongpu	72d1d72c09	fix ExecutionContext::HasInput and ExecutionContext::HasOutput depend on the scope structure, test=develop (#20721 )	5 years ago
danleifeng	79e08ecebf	add assertions on whether elementwise_div divison is zero (#20618 )	5 years ago
bingyanghuang	fd49ebcbd8	update int8 benchmark with 6271 data, test=develop test=document_fix (#20736 )	5 years ago
123malin	95e90aa102	test=develop, add communicator_is_sgd_optimizer flag (#20677 ) * test=develop, communicator_is_sgd_optimizer flags	5 years ago
Aurelius84	74a28f5ea4	fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722 )	5 years ago
石晓伟	48a774c713	fix ts_sort's bug, test=develop (#20720 )	5 years ago
lvmengsi	dc229b4195	fix_depthwise_conv_cudnn, test=develop (#20712 )	5 years ago
石晓伟	d8f4f4239d	Ensure backward compatibility with the anakin interface, test=develop (#20691 ) * support MLU nums, test=develop * change anakin apis, test=develop	5 years ago
wopeizl	9e5948230e	add support to gcc8, add docker env test=develop (#19807 ) * add support to gcc8, add docker env test=develop	5 years ago
xujiaqi01	5223b0dd9d	add check nan / inf in downpour worker (#20694 ) * add check nan / inf in downpour worker during training * test=develop	5 years ago
WangXi	507afa8a8a	Fix dgc nan by stripping nccl from sparseReduce. (#20630 )	5 years ago
gongweibao	c1710e91b2	Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690 )	5 years ago
lidanqing	46e93f7c86	Revert "Refactor conv computeINT8" (#20640 ) * Revert "Refactor conv computeINT8 (#19574)" This reverts commit `2c32c2d649`. test=develop * replace PADDLE_ENFORCE test=develop	5 years ago
liu zhengxi	d39777fefa	alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697 ) modify the way to pass parameter out_size in function.	5 years ago
Zeng Jinle	4eeda9d676	fix tensor_util, test=develop (#20699 )	5 years ago
Zeng Jinle	ab575de725	Fix op run log when memory optimization strategy is enabled (#20695 )	5 years ago
Jacek Czaja	a1cd27f13f	[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241 ) * - Flushing mkl-dnn cache test=develop - Disabled clearing cache for LoadModel - Added clearing of mkl-dnn cache when Executor is created test=develop - Do not clear for GPU places test=develop - compilation fix test=develop * - Moved clearing of mkl-dnn cache in destructor of executor test=develop * - Compilation fix test=develop - Reverted conditional clearing of mkl-dnn cache in Executors's destructor test=develop - compilation fix	5 years ago
Zeng Jinle	10505faf4e	polish codes, test=develop (#20672 )	5 years ago
Tao Luo	dd3d8997cf	remove deprecated contrib/float16 directory (#20685 ) test=develop test=document_fix	5 years ago
liuwei1031	569951c418	improve the efficiency of BuddyAllocator (#19888 ) * improve save and load behaviour, test=develop * code cleaning, test=develop * disable check_guards and update_guards in release version, test=develop * fix compilation issue, test=develop * add buddy_allocator speed test data, test=develop * fix compilation issue, test=develop * fix comment, test=develop * update function names according to the google C++ style guide, test=develop * tweak the test data format, test=develop * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop * add accessor and mutator for Desc, test=develop	5 years ago
tianshuo78520a	eafc7023c1	test=develop test=document_fix (#20682 )	5 years ago
Zeng Jinle	34e3adaece	Refine reduce codes to save compiling time and binary size (#20676 ) * refine reduce code to save compiling time and binary sizes, test=develop * add reduce rank check to avoid bug, test=develop	5 years ago
liu zhengxi	dbc2bb3376	improve the performance of capi in PD_PredictorRun (#20665 )	5 years ago
whs	a3e641e93c	Fix infer shape of warpctc op. (#20653 ) test=develop	5 years ago
Zeng Jinle	4922eb6da5	make_conv_workspace_size_configurable, test=develop (#20662 )	5 years ago
zhongpu	efa10937bd	fix elementwise_floordiv_op and elementwise_mod_op (#20534 ) * fix elementwise_floordiv_op and elementwise_mod_op, test=develop * fix API.spec, test=develop * fix API.spec, test=develop	5 years ago
tangwei12	04384502a8	fix bug with heart beat , test=develop (#20654 )	5 years ago
wangchaochaohu	7783d3bd43	Conv refine (#20644 ) * add condition judgement for performance improvement test=develop * add condition judgement for performance improvement test=develop * refine code style test=develop	5 years ago
lidanqing	57b656f956	Add document for int8 object detection quantization (#19356 )	5 years ago
Chen Weihang	003f369bb2	Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044 ) * add indicate_var_data_type inferface, test=develop * add unittests & polish error message, test=develop * remove needless include, test=develop * extract public function & polish message, test=develop * delete empty var check, test=develop * change data_type to pointer parameter, test=develop * polish details, test=develop	5 years ago
Tao Luo	dfa239253c	reduce make install time in CI (#20643 ) test=develop	5 years ago
gongweibao	f3f52fc1e2	Retry when failed to bind address. (#20642 )	5 years ago
qingqing01	01eddc1a04	Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636 ) * Support fp16 in fused_elemwise_activation_op. * Fix unit testing in ONLY-CPU mode.	5 years ago
Chengmo	940c6ff1c8	Fix communicator slow bug & fix communicator stop bug (#20366 ) * test=develop,Fix communicator slow bug * test=develop, delete if() in stop_worker() * test=develop * fix UT, test=develop * fix bug in fetch handler, test=develop * fix bug in fetch handler, test=develop * test=develop, fix fetch barrier bug * test=develop, bug fix * test=develop, bug fix * test=develop, fix bug	5 years ago
gongweibao	69e0b98fea	Cleanup the debug lines in paddle_build.sh (#20631 )	5 years ago
zhaoyuchen2018	8314e64a8b	Fix sum op fails as no memory in tensor(#20602 ) test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
Yibing Liu	ee2869cae9	Remove redundant infershape in linear chain crf grad, test=develop (#20629 )	5 years ago
123malin	b4a3b75002	bug fix: invalid learning rate decay in pserver async mode (#20325 ) * bug fix: invalid learning rate decay in pserver async mode	5 years ago
WangXi	cadc6a9704	fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617 )	5 years ago
Zeng Jinle	40c258a77b	Refine API.spec mechanism (#20574 ) * refine API.spec, test=develop * add nn.py to test,test=develop * fix by removing new.spec, test=develop, test=document_fix * refine scripts when PYTHON_ABI is empty, follow partial comments, add extra cpp api doc, test=develop, test=document_fix * remove extra doc changes, remove API.spec, test=develop	5 years ago
石晓伟	a4753f3a79	Optimize error message of mean_op and matmul_op (#20413 ) * add data type check, test=develop * polish error messages, test=develop * polish error messages, test=develop * Remove support for the CPU architecture matmul, test=develop * fix syntax bug, test=develop	5 years ago
Leo Chen	d6c1d6ca56	update class name, test=develop (#20578 )	5 years ago
gongweibao	1d82025e89	Add interface so user can get scaled loss when they use customized loss. (#20571 )	5 years ago
liu zhengxi	922d432477	fix the PD_ZeroCopyPredictorRun output problem (#20612 ) * fix the PD_ZeroCopyPredictorRun output problem and add some checks and logs for users * modify the cmakelists depends and fix the cmakelists problem	5 years ago
翟飞跃	36acfaeeda	Add fused_embedding_seq layer into fluid.contrib (#19771 )	5 years ago
Double_V	0b39218749	memory optimizer for reshape op,test=develop (#20569 )	5 years ago
bingyanghuang	85e1f2150b	Modify the helper information in full_pascalvoc_test_preprocess.py (#20475 )	5 years ago
Thunderbrook	f76a32df4a	dump fix dov vec file num (#20539 ) * support dump multi file test=develop * dump fix num file test=develop	5 years ago
chengduo	36c85ef492	Add sub-scope check in RecurrentOp (#20468 ) * fix recurrent bug test=develop	5 years ago
633WHU	12e4be0382	Dlpack support (#20039 ) * support dlpack to tensor and implement python interface test=develop * add unittest for _to_dlpack and from_dlpack test=develop	5 years ago
Pei Yang	443f604c3b	add DisableGlogInfo() to AnalysisConfig, test=develop (#20581 )	5 years ago
JesseyXujin	2ff18e537f	add expand_as op, test=develop (#20565 ) * add expand_as op, test=develop * add expand_as op,test=develop * add expand_as op,test=develop * add nn.py, test=develop * delele paddle_enforce, test=develop	5 years ago
Zeng Jinle	40effc61af	Refine py_reader exit (#20331 ) * refine py_reader exit, test=develop * fix multiprocess_reader exception unittest, test=develop * increase code coverage for legacy fluid.layers.py_reader, test=develop	5 years ago
Zeng Jinle	a9c8bdad7b	refine pe codes, test=develop (#20479 )	5 years ago
Zeng Jinle	76b321872a	fix cuda dev_ctx by event, test=develop (#20553 )	5 years ago
Guo Sheng	bd99df715a	Fix basic_gru and docs of gru_unit and dynamic_gru (#19393 ) * Fix docs of gru_unit and dynamic_gru. Fix basic_gru in rnn_impl.py. Add error messages for param_attr setting in layer_norm api. Add int64 dtype for expand. test=develop * Reopen unit-tests of basic_gru/basic_lstm in rnn_impl.py. test=develop * Add unit test for layer_norm api. test=develop * Remove the deprecated gru doc fix. test=develop * Fix basic_gru test coverage. test=develop * Update API.spec. test=develop * Update API.spec. test=develop * Fix test_basic_gru coverage test. test=develop * Update test_basic_gru in test_layers to use fluid.data test=develop * Update test_basic_gru for coverage. test=develop	5 years ago
Zhang Ting	78910480c1	fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589 )	5 years ago
Yuan Shuai	172e91c008	Refine error message of transpose_op (#20437 ) * Refine error message of transpose. * Fix transpose, multiplex, unsqueeze, unstack. test=develop, test=document_preview, test=document_fix	5 years ago
zhaoyuchen2018	b8333edef6	Add Multihead matmul fuse pass (#20167 ) * Add multihead fuse pass for ernie opt * Refine softmax test=develop * Refine cuda kernel * Refine cuda version * Refine cmake test=develop * refine header file * refine test case and pass * refine comments	5 years ago
liym27	fc6ec3b9f6	fill_constant support Tensor; (#20521 ) 2. fix bug in backward.py: using fill_constant instead of fill_constant_batch_size_like 3. fix bug in ExpandGradOp. test=develop	5 years ago
Zhang Ting	0130cc969c	fixed group_norm's bug and modified unittest (#20506 ) * modified group_norm's unittest for pass statement, test=develop * fix group_norm's bug: scale or bias is None which causes segmentation fault, test=develop	5 years ago
Adam	7faa3e9555	Add ConvTranspose + BatchNorm fuse pass (#20161 ) * Add ConvTranspose + BatchNorm fuse pass test=develop * Add tests for conv+bn and conv_transpose+bn passes test=develop	5 years ago
Diego Zhang	27d1ef6081	Refine seq enum expand mask pad (#20344 ) * disable nccl test * Update version. * fix term core only * fix transpiler error * fix protobuf memory leak (#11177) fix protobuf memory leak * "change eigen mirror" * refine en doc sequence enum pad expand mask d2s * refine seq enum expand mask pad test=develop, test=document_fix * remove cn char test=document_fix * spec test=document_fix * code style test=document_fix * test=document_fix * test=document_fix * test=document_fix * test=document_fix * test=document_fix * test=document_fix	5 years ago
Youwei Song	9a09ff14a5	fix en docs of Layer and guard (#20512 ) * fix en docs of Layer and guard test=document_fix, test=develop * fix en docs of Layer and guard test=document_fix, test=develop * minor fix test=document_fix, test=develop * minor fix test=document_fix, test=develop * fix api.spec test=document_fix, test=develop * fix api.spec test=document_fix, test=develop * fix docs test=document_fix, test=develop * fix docs test=document_fix, test=develop * fix docs test=document_fix, test=develop * fix api.spec test=document_fix, test=develop * fix api.spec test=document_fix, test=develop * add forward doc test=document_fix, test=develop * add "s" for parameters test=document_fix, test=develop	5 years ago
guofei	9b85f40140	Modify English documents (#20452 )	5 years ago
liuwei1031	9dc83dda7a	update data feeder API sample, change fluid.layers.data to fluid.data (#20568 ) * update data feeder API sample, fluid.layers.data => fluid.data * update API.spec	5 years ago
liuwei1031	9d6ee5eb1f	fix doc of default_main_program, multiprocess_reader (#20536 ) * fix doc of default_main_program, multiprocess_reader * update API.spec * fix comment	5 years ago
zhongpu	ece611b028	update paddle_build.sh, test=develop (#20443 ) This PR fix the tag error in paddle_build.sh	5 years ago
Yiqun Liu	ce1b25cc8b	Polish the English documentation of sums (#20495 ) * Refine the documentation of sums. * Remove Chinese comments and update API.spec. * Refine the description of input argument. * Update API.spec. test=develop test=document_fix	5 years ago
lanxianghit	22ecaef03b	Add API 'fluid.requird_version(min_version, max_version=None)' to check if the installed PaddlePaddle version is satisfied, test=develop (#20263 ) 添加API：fluid.version_required(min_version, max_version=None)，用于检查已安装的PaddlePaddle版本是否符合要求无返回值，如果已安装的版本不在区间[min_version, max_version]，则抛出异常。例：安装的版本为1.6.0，调用：fluid.require_version('1.5.0', '1.5.1')，则抛出异常： Exception: VersionError: PaddlePaddle version in [1.5.0, 1.5.1] required, but 1.6.0 installed.	5 years ago
zhaoyuchen2018	8fb569e5b9	Fix api doc example bug and polish square doc (#20491 ) * Refine create_array api en doc test=develop test=document_fix * Fix api doc example bug and polish square test=develop test=document_fix * Refine comment test=develop test=document_fix * refine API.spec test=develop test=document_fix	5 years ago
hong19860320	512c0bb04d	refine the en api doc of ones, zeros, reverse, assign, increment, hsigmoid and create_py_reader_by_data ops (#20343 ) * refine the en api doc of ones, zeros, reverse, increment, hsigmoid and create_py_reader_by_data ops test=develop, test=document_preview, test=document_fix * refine eng doc for hsigmoid and create_py_reader_by_data ops test=develop, test=document_preview, test=document_fix * update API.spec test=document_fix * Fix the parameter name axis of reverse op in eng doc test=develop, test=document_fix * Update API.spec test=develop, test=document_fix * Refine eng doc of zeros, ones, reverse and assign op test=develop, test=document_fix * Update API.spec for assign, ones, zeros and reverse test=develop, test=document_fix * Fix data type of reverse op in eng doc test=develop, test=document_fix * Update API.spec for reverse op test=develop, test=document_fix	5 years ago
Guo Sheng	dfd1eee7f7	Add seq2seq api related code (#19820 )	5 years ago
silingtong123	e87cabb7f2	updates document of sequence_softmax, sequence_scatter, sequence_unpad (#20269 )	5 years ago
silingtong123	d5aa2dd818	fix doc, updates API documents of uniform_random and uniform_random_batch_size_like (#20316 )	5 years ago
Wilber	751812a674	enable cpu machine to run paddle in gpu lib enable cpu machine to run paddle model in gpu lib	5 years ago
lvmengsi	2384589383	Fix conv_grad_grad (#20469 ) * fix_conv_grad_grad * fix_bug, test=develop	5 years ago
Double_V	8299203370	Support reshape_op double gradient (#20304 ) * support reshape doubel grad, test=develop * fix reshape double grad, pass converage, test=develop * fix review, test=develop	5 years ago
LielinJiang	faa8e30a14	Set batch norm and data norm argument 'do_model_average_for_mean_and_var' default as True (#20421 ) * fix_norm_model_average_bug * test=develop * refine comment test=develop * refine comment test=develop	5 years ago
hong19860320	4d0d5e4cc7	refine eng doc for hard_sigmoid op (#20442 ) * refine eng doc for hard_sigmoid op test=develop test=document_fix * refine the description of hard_sigmoid test=develop test=document_fix * update API.spec test=document_fix * Refine the decription of parameters of HardSigmoid op test=develop, test=document_fix * Update API.spec for hard_sigmoid op test=develop, test=document_fix	5 years ago
Aurelius84	22823df2e2	enhance embedding error message test=develop (#20246 ) * enhance embedding error message test=develop * enforce .h error test=develop * fix unittest code test=develop * Fix fp16 dtype in embedding test=develop * add import warnings test=develop	5 years ago

... 2 3 4 5 6 ...

16382 Commits (642b33564e9d4d132f87bd94cf87361fefff66d4)