Paddle

Commit Graph

Author	SHA1	Message	Date
Kaipeng Deng	ebfb720a63	add Adam beta1/beta2 support Variable (#21234 ) * add Adam beta1/beta2 support Variable. test=develop	5 years ago
Zeng Jinle	09696d5df8	Use system allocator in OpTest (#21335 ) * use system allocator in unittests, test=develop * fix op bugs, test=develop * fix tensor copy bug when src and dst are the same, test=develop	5 years ago
ruri	007c997572	Add masked select api (#21172 )	5 years ago
Kaipeng Deng	67c836fb5c	batch_norm momentum support variable (#21246 ) * batch_norm momentum support variable. test=develop * fix format. test=develop * add batch_norm momentum variable example. test=develop * move MomentumTensor to training branch. test=develop * split example. test=develop * fix doc. test=develop * fix PADDLE_ENFORCE ci. test=develop * fix format. test=develop	5 years ago
lidanqing	c0aa13672e	Fp32 vs int8 qat C++ performance (#21244 ) * add ut for comparing FP32 and QAT INT8 * add save qat transformed model python script test=develop * updated * added missing file * add "with_label" test=develop * performance benchmark as unit test test=develop * change names of unnecessary thing * Change CMakeList.txt for model downloading and UT test=develop * change names of functions and params for more readable code test=develop * Change PADDLE_ENFORCE messages test=develop * fix indent problems test=develop * indent problems test=develop	5 years ago
xujiaqi01	f1178e9d79	fix fleet save bug (#21362 ) * fix fleet save bug of save_infernece_model * test=develop	5 years ago
Liufang Sang	1840c1652c	add config file to avoid load checkpoint test=develop (#21373 )	5 years ago
Zeng Jinle	b97fc16d21	fix lod_reset bug, test=develop (#21392 )	5 years ago
hutuxian	47a82e38e3	Support data_norm gpu kernel (#21325 ) * support data_norm_op run in CUDA * add two parameters sync_stats & summary_decay_rate * add UT	5 years ago
Youwei Song	d5ff79e55e	Support numpy bridge (enabled by default in dygraph mode) (#20983 ) * add numpy bridge * fix template compile * add unittest, add default test=develop * fix unittest test=develop * fix unittest test=develop * zero_copy=True for to_variable, test=develop * bug fix test=develop * disable deprecated NumPy API test=develop * use better design of NumpyAllocator test=develop * fix Py_None check test=develop * reset c++ tracer when jump out dygraph guard test=develop * refine PADDLE_ENFORCE_xx format test=develop * bug fix of tracer switch test=develop * update decref test=develop	5 years ago
Michał Gallus	5d7d548275	INT8 Fully-connected (#17641 ) * Implement Int8 FC * Integrate FC into INT8v2 test=develop * int8 FC: transpose weights before computing scales test=develop * Add support for activation_type string in FC test=develop * Disable MKL-DNN's FC in VGG16 and 19 test=develop * Disable FC quantization when mkldnn FC is disabled test=develop * Solve PADDLE_ENFORCES in FC int8 * Fix Paddle enforces and remove const cast test=develop * Fix style changes test=develop * Fix quantizer_tester test and add fc quantization test=develop * Fix FC test fail on CUDA * Remove unnecessary log from quantize placement pass test=develop * Add Thread ID to FC hash key test=develop * Add comments to MKL-DNN FC Kernel test=develop * Refactor quantizer test=develop * Fix linter issues test=develop * Fix crash in slim googlenet test=develop * Fix PADDLE_ENFORCE messages test=develop	5 years ago
itminner	07e6a94268	paddleslim quantization skip pattern support list of string (#21141 )	5 years ago
Zhen Wang	be2e3e67d9	Fix some typos in AMP. (#21354 ) * fix some typos in AMP. test=develop * delete useless codes. test=develop	5 years ago
lilong12	41d13209d7	add the framework support for distfc (#21197 ) * add the framework support for distfc and ut, test=develop * fix the implementation of shard_index_op, test=develop	5 years ago
hong	a214a3081b	change download log format (#21290 ) * change download log formate; test=develop * add unittest for data download; test=develop * remove cache before download; test=develop	5 years ago
GaoWei8	234060f88f	Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972 ) * Add fc padding to solve mkl performance test=develop * fix gpu pass and error information test=develop * fix fc_fuse_pass_test test=develop * fix error information test=develop * fix error information test=develop * fix name and add fc op padding test test=develop * fix attributes test=develop * optimize fc padding test=develop * fix test test=develop	5 years ago
ruri	6cfcbe0510	reduce interp op input size to pass CI, test=develop (#21341 )	5 years ago
Jacek Czaja	f4cf028a8c	[MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207 )	5 years ago
Michał Gallus	ed9ceb9f98	Refactor MKL-DNN ElementwiseMul (#21061 ) * Refactor MKL-DNN ElementwiseMul remove manual fallback, remove format attrs test=develop * Refine PADDLE_ENFORCEs in eltwise_mul_op.h test=develop * Make ElementwiseMulOp inherit from ElementwiseOp * Change type of simd_width to int test=develop * Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp test=develop * Restore attributes test=develop * Fix test coverage for mkldnn eltwise mul test=develop * Conform to new is_run_common_broadcast API test=develop * Add UT for AreDimsAndFormatCorrect test=develop	5 years ago
Dong Daxiang	0a93635b5f	fix logger problem (#21342 ) * fix logger problem test=develop * refine logger test=develop	5 years ago
wangchaochaohu	6514f52e46	fix the fill_constant op precious problem (#21322 ) * fix the fill_constant op precious problem test=develop	5 years ago
zhaoyuchen2018	08c19c585d	Improve argsort performance. (#21267 ) * Improve argsort performance. - Give 200000 data to compute argsort on v100, can speed up ~190x before opt cost: 0.53s after opt cost:0.0027s - Add fp16 support * Refine error message * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
lijianshe02	7fcaa39b36	fix Print_op input dtype list error test=develop (#21326 )	5 years ago
juncaipeng	84865b806b	add resnet50 test for post trainint quantization, test=develop (#21272 )	5 years ago
Thunderbrook	9a7832f8be	print table stat info for pslib (#21296 ) * print table stat test=develop * notes test=develop * notes test=develop	5 years ago
WangXi	8ac7687e36	Fix dgc accuracy by mv regularization to local (#21278 )	5 years ago
Zeng Jinle	b9f8ae8494	Add global value getter setter (#21285 ) * add global value getter setter, test=develop * fix error messages, test=develop	5 years ago
Dong Daxiang	691ced87c0	Refactor fetch handler (#21264 ) * fix fetch handler problem and refactor when a user define FetchHandler class, he or she should initialize a handler with variable dict. the key of a variable dict is a user defined name, the value of a variable dict is a Varaible generated from python API. For each fetching, a user should implement handler function in which fetched_result_dict will be available and the user can access the fetched value with user defined keys.	5 years ago
Yi Liu	f1b09ba30e	adapt test_collective_base.py for only two GPU cards available. (#21307 ) * adapt test_collective_base.py for only two GPU cards available. test=develop * fix bug of issue #21259 test=develop	5 years ago
gongweibao	ed2a185248	optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597 )	5 years ago
Liufang Sang	f0b1518438	add dequantize_abs_max op and modify lookup_table op (#20899 ) * add int8 kernel to lookup_table op and add dequantize op test=develop * change paddle_enforce to paddle_enforce_eq test=develop * change copyright and change some not suitable code test=develop * remove debug log test=develop * replace GetInputType with IndicateVarDataType test=develop * fix EmptyGradMaker test=develop * fix diff between cpu and gpu test=develop * use memcopy when int8_t test=develop	5 years ago
hutuxian	a6ce2306f9	support cvm_op run in gpu (#21300 ) Previously, CVM OP was only able to run in CPU. This PR implements its GPU kernel. What's more, we improve the UTs about CVM OP.	5 years ago
Chen Weihang	952508527a	Polish some PE code details (#21274 ) * polish code details, test=develop * futher polish hint msg, test=develop	5 years ago
Yi Liu	0fd1281ef8	fix bug of issue #21259 (#21287 ) pass the argument `allow_out_of_range` of one_hot op to c++ back end.	5 years ago
xujiaqi01	319d2ba925	fix fs_client_param bug (#21212 ) * fix fs_client_param bug， user can set this config through fleet_desc_file or fleet config * test=develop	5 years ago
Thunderbrook	0d17c1b816	solve pslib core in stop worker (#21263 ) * general table * add sparse table test=develop * no cvm test=develop * add no_cvm test=develop * add note test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * add key of optimizer test=develop * solve pslib stop core test=develop * barrier test=develop * add notes test=develop	5 years ago
zhongpu	fa4d055098	fix bug for python/paddle/fluid/tests/unittests/test_elementwise_mul_op.py, test=develop (#21289 )	5 years ago
zhongpu	c4ede95c74	open dygraph op test, test=develop (#19787 ) * open dygraph op test, test=develop * modify to_variable, test=develop * modify input and output for dygraph, test=develop * modify input and output for dygraph(fix bug), test=develop * fix input processing of dygraph op test, test=develop * fix bug, test=develop * fix op test, test=develop * fix forward bug for dygraph, test=develop * fix mkldnn op test for forward, test=develop * update nn.py for dygraph, test=develop * fix crop_tensor_op, test=develop * fix elementwise_mul_op, test=develop * fix fill_op, test=develop * fix some mkldnn op, test=develop * open backward op test for dygraph, test=develop * delete log, test=develop * close backward op test for dygraph, test=develop * fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop * fix optest backward bug for dygraph, test=develop * fix optest backward bug for dygraph, test=develop * close backward op test for dygraph, test=develop * close backward op test for dygraph, test=develop * open dygraph op test, test=develop * fix op test for dygraph, fix GradOpDescMaker, test=develop * fix bug for linear_chain_crf_op.h, test=develop * remove log, test=develop * remove log, test=develop * remove log for op_test.py, test=develop * remove log for op_test.py, test=develop * fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop * fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop * fix bug for test_increment_ngraph_op.py, test=develop * fix lod for op test in dygraph, test=develop * refactor op_test.py to reduce redundant code, test=develop * fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop * remove debug log, test=develop * remove redundant code in base.py, test=develop * fix some error in optest, test=develop * fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop * refactor op_test.py, test=develop * remove redundant writing, test=develop * fix error(get tensor of the grad variable), test=develop * fix test_concat_mkldnn test_conv2d_mkldnn, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix some redundant code, test=develop * reslove conflict and rewrite paddle error message, test=develop	5 years ago
xujiaqi01	eca66f317e	fix fleet util bug (#21254 ) * fix fleet util bug in save paddle inference model * test=develop	5 years ago
ShenLiang	1f39a9f17e	fix the bug of scatter_nd, test=develop (#21257 )	5 years ago
lijianshe02	382cf5d7e3	add input type and input data type check for Print_op test=develop (#21250 ) * add input type and input data type check for Print_op test=develop	5 years ago
Thunderbrook	349e82d669	support general embedding params (#21217 ) * general table * add sparse table test=develop * no cvm test=develop * add no_cvm test=develop * add note test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * add key of optimizer test=develop	5 years ago
liym27	b0fc822747	Add control flow api: case (#21114 ) * add control flow API: case. test=develop * delete 'raise TypeError' in _error_message() and return a string. test=develop * polish API document. test=develop	5 years ago
juncaipeng	29b63f0aa1	support set model_filename and params_filename in post_training_quantization, test=develop (#21213 ) * support set model_filename and params_filename in post_training_quantization, test=develop	5 years ago
Dong Daxiang	ccbdd7aad0	update worker_num for MPISymetricRoleMaker (#20798 ) test=develop	5 years ago
Liufang Sang	c91cb6c550	fix load checkpoint error in test_reader (#20924 )	5 years ago
danleifeng	0e7baabe59	extend elementwise broadcast function (#20957 )	5 years ago
yaoxuefeng	b5d8ba8394	fix data_norm op to avoid impractical normalization result test=develop (#21152 ) * fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop	5 years ago
Zeng Jinle	67e88424e5	Polish jit trace codes (#21218 ) * polish jit trace codes, test=develop * polish codes again by removing var_id, test=develop	5 years ago
Zeng Jinle	cdb3d27985	Fix warn of gcc8 (#21205 ) * fix warnings oof gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop	5 years ago
danleifeng	3fe63d6780	add store_true to use_paddlecloud argument in launch.py (#21168 )	5 years ago
Zhang Ting	9cbe7bccba	modified error message and API doc for channel_last supported Op (#21002 ) * modified error message for conv and conv_transpose, test=develop * modified doc of conv and conv_transpose op, test=develop * modified the expression for error message, test=develop * modified error message for group_norm op, test=develop * modified detail of Attr(data_format) or Attr(data_layout) * add ValueError in API doc for maxout op, test=develop	5 years ago
liym27	9247528252	Control flow API: switch_case (#21103 ) * add API switch_case. test=develop add Nest * modify code according to reviews: 1.Attr(branch_index) support 'uint8' and 'int64' besides 'int32'. 2.remove useless code. test=develop * replace fluid.layers.data with fluid.data and polish API document. test=develop	5 years ago
guofei	56b5d14704	Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118 )	5 years ago
WangXi	3c98ec90ce	Fix INF bug of softmax_cross_entropy_op (#21165 )	5 years ago
Zeng Jinle	0f30d3a213	fix dygraph trace bug, test=develop (#21193 )	5 years ago
juncaipeng	00b11a4a1e	Support more ops in post training quantization, test=develop (#21073 ) * Support more ops in post training quantization, and save the output scale in quantized op. * Update docs in post training quantization and qat	5 years ago
xujiaqi01	23876de55b	fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052 ) * fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop	5 years ago
xujiaqi01	9e045170c0	add copy table (#21086 ) * copy some feasigns and corresponding embeddings from one sparse table to another * copy all feasigns and corresponding embeddings from one sparse table to another * copy all dense params from one table to another * copy some local vars to other local vars	5 years ago
ruri	aeb887911f	Refine edit distance cn (#21121 )	5 years ago
Kaipeng Deng	98b59cb82c	fix elementwise_mod float point kernel. test=develop (#21183 )	5 years ago
hong	835119c777	disable reshape inplace in dygraph model; test=develop (#21157 )	5 years ago
Zeng Jinle	5fdfbe3413	Add friendly dygraph trace API (#21091 ) * friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop	5 years ago
whs	cfdd1fc2cd	Fix warpctc in padding mode. (#21033 )	5 years ago
Tao Luo	3976bbe2ce	add input type and dtype check template, and update some APIs check (#21161 ) * add input type and dtype check template, and update some APIs check * refine check template, and update some APIs check in nn.py * update some APIs check in loss.py test=develop	5 years ago
joanna.wozna.intel	37e0e7a96b	QAT int8 accuracy little improvement (#21074 ) test=develop	5 years ago
gongweibao	a5fc291fe5	Use 2 cards for hallreduce unit test. (#21085 ) use 2 cards test=develop	5 years ago
Tao Luo	8f659d4345	Split some APIs from nn.py to loss.py (#21117 ) * Split some APIs from nn.py to loss.py test=develop * fix test_detection unit-test test=develop	5 years ago
zhaoyuchen2018	4a544762a2	Add Asypadding for conv fusion. (#21041 ) * Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.	5 years ago
WangXi	de5d3ff688	Fix dgc buffer illegal & reuse velocity (#21012 )	5 years ago
lilong12	53148e0696	modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802 ) * modify the implementation of save_persistables and save_inference_model functions for fleet collective, test=develop * add ut, test=develop	5 years ago
Bai Yifan	bd8b0ebaba	fix distiller typo, test=develop (#21070 )	5 years ago
ceci3	f62a929151	fix instance norm (#21042 ) * fix instance norm * update unitest,test=develop	5 years ago
lilong12	e249d9a3e2	fix the computation for dx (grad for x) for prelu operation. (#20949 ) * set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop	5 years ago
Huihuang Zheng	e64d55f04e	Add basic Python Cond Layer (#21050 )	5 years ago
Huihuang Zheng	dcf371b685	Disable cudnn_conv in unit tests. (#21080 )	5 years ago
Yiqun Liu	35f17ae28f	Add the check of lod_level between compile-time and runtime. (#20961 ) * Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensiable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop	5 years ago
Tao Luo	78cc1ca616	Split some APIs from nn.py to rnn.py and sequence_lod.py (#21030 ) * split some APIs from nn.py to rnn.py * split some APIs from nn.py to sequence_lod.py test=develop * fix unit-test bug test=develop * fix test_layers unit-test bug test=develop	5 years ago
joanna.wozna.intel	77c2083586	Add transpose2 INT8 for mkl-dnn (#19424 ) * Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdba4c859abb945e062ab13124f70508054, reversing changes made to 2ce6473f144da298aba4a43d46918f27d463cf7c. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd78ca47ae56881161172b2aacd349aba90. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop	5 years ago
juncaipeng	2c07727fb0	delete test resnet50 in post train quantization to avoid timeout error, test=develop (#21081 )	5 years ago
LielinJiang	06063b7001	add op locality_aware_nms, test=develop (#20976 )	5 years ago
liym27	26a6e27afe	fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997 ) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop	5 years ago
Adam	3fda695bb0	Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062 ) * Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop	5 years ago
Huihuang Zheng	1957192f05	Add select_input_op and select_output_op (#21016 ) These ops are useful in control flow.	5 years ago
hong	72e0969b27	fix uniform random (#21009 ) * fix uniform random; test=develop * add uniform random test; test=develop	5 years ago
Wojciech Uss	226bc22a29	Remove fuse_with_relu argument from batch_norm constructor (#21028 ) test=develop	5 years ago
liym27	f0e95a6049	Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017 )	5 years ago
juncaipeng	fa522dffa0	Fix bug in add_quant_dequant_pass, test=develop (#21018 ) * Fix bug for inserting add_quant_dequant_op to same variable repetitively in add_quant_dequant_pass, test=develop	5 years ago
juncaipeng	175ba39c03	Add post_training_quantization (#20800 ) * add post training quantization, test=develop * specify the quantizable op type, test=develop	5 years ago
Leo Chen	008ed65fd5	Add c++ global current tracer for dygraph (#20882 ) * Add c++ global current tracer for dygraph, test=develop * add tracer property in c++, test=develop * support different place, test=develop * add unittest for tracer, test=develop	5 years ago
Huihuang Zheng	4cf96cd307	Add grad_name Property for Class Variable (#20991 )	5 years ago
xujiaqi01	1d1a07937a	simplify master+patch，remove ins when size != merge_size or has conflict slot (#20913 ) * remove duplicate code and duplicate config of master+patch * drop all ins which has conflict slot or size < merge_size * user only need to set merge size，if ins num of same id is not equal to merge size, just drop these ins * user must make sure master data and patch data has no same slot whose feasigns are both non-zero, otherwise these ins will be dropped. (slot list should still be the same of both master and patch) * test=develop	5 years ago
Thunderbrook	5970e8ac5e	find lookup table in order (#20932 ) test=develop	5 years ago
Zhang Ting	de9bec607e	lrn supports channel_last input, test=develop (#20954 )	5 years ago
tangwei12	3b96e3d20a	fix FetchHandler (#20900 ) * bug fix, test=develop	5 years ago
Dong Daxiang	a6747a6ef1	add launch_ps module so that we can launch a parameter server trainin… (#20936 ) * add launch_ps module so that we can launch a parameter server training job 1) a user can specify worker_num and server_num 2) parameter server can be killed after all workers exit 3) unit test is added test=develop	5 years ago
Leo Chen	2c3c579b9b	tensor.set() supports array list and remove unused code, test=develop (#20959 )	5 years ago
Leo Chen	9974e40787	Update Tensor.set() to support float16 (#19964 ) * don't expose numerous Tensor.set(), test=develop * fix condition, test=develop * fix float16 bug, test=develop * feed should be Tensor or np.array, not Variable or number, test=develop * use forcecast to copy numpy slice to new array, test=develop * remove float16-uint16 hacking, test=develop	5 years ago
123malin	20cdff0e02	Optimize decay (#20816 ) * update pserver decay blocks * update distributed notify handler	5 years ago
Chengmo	16596f6498	Fix Paddle Cloud role maker (#20860 ) * fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times	5 years ago
liym27	59de8e1214	Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912 )	5 years ago
liym27	7b4cb655bb	keep the size of symmetric padding is 2 for 2d and 3 for 3d. test=develop (#20903 )	5 years ago
Zhang Ting	8d1e9f0f7e	maxout supports channel_last input (#20846 ) * maxout support channel_last input, test=develop * modified details of Input(X) and Attr(groups, axis) in doc, test=develop	5 years ago
WangXi	9d8ec42353	launch.py remove setting for nccl sync, test=develop (#20909 )	5 years ago
hong	8c4573a3cb	GradMaker for dygraph (#19706 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop	5 years ago
Bai Yifan	ac87d4e6e1	fix hdfs.download, test=develop (#20907 )	5 years ago
Thunderbrook	59bcdc8a19	support dump param of model into afs (#20302 ) * support dump param to afs test=develop * code style test=develop * code style test=develop * dump param test=develop * dump param test=develop * dump param test=develop * dump param test=develop	5 years ago
Yiqun Liu	16e4d02675	Refine the cache of program, context and scope in executor. (#18483 ) * Refine the cache of program, context and scope in executor. test=develop * Refine the unittest test_executor_and_use_program_cache. * Add the test the PaddingRNN with use_program_cache=True. test=develop * Remove a check. test=develop * Refine the unittest to check whether it is correct when setting use_program_cache=True. test=develop	5 years ago
Wilber	b489760099	fix jit_matmul bug test=develop (#20886 ) * fix jit_matmul bug * update jit matmul and add test	5 years ago
gongweibao	3255fe69bb	Add custom black variable name set in amp interface. (#20875 ) * add custom black varname test=develop * fix dtype test=develop * fix num test=develop * fix ut test=develop * fix coverage test=develop * fix blackvar names test=develop	5 years ago
lvmengsi	aadd81b662	Fix gradients (#20857 ) * fix_gradients * fix_gradients, test=develop	5 years ago
hong	ff0886a92a	save load problem fix and new feature add (#20823 ) * fix persistable; * fix save load bugs; test=develop * fix bug; test=develop * add example for new io api; test=develop * addd example; test=develop	5 years ago
Youwei Song	2058bab1c0	Add Sequential api (#20789 ) * add Sequential api test=develop * fix unittest test=develop * refine code sample * test=develop	5 years ago
liym27	6802539a2e	support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780 ) * improve split and concat op: 1. support Tensor for argument 'dim' in split op. 2. support Tensor for argument 'axis' in concat op. test=develop * redefine function GetDataFromTensor and set unknown output shape to - 1. test=develop * add check: Attr(sections) match Input(X). test=develop * support Tensor for attr(sections) and attr(sections) can contain -1. add check for attr(sections). test=develop * modify error message for concat and call Resize only when necessary. test=develop	5 years ago
Yiqun Liu	6fcfd32e6c	Check and correct the output's lod_level in DynamicRNN related operators (#19144 ) * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime. test=develop * Add comment for ReorderLoDTensorByRank op. * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time. test=develop * ShrinkRNNMemory op should call ShareLoD for compile time. test=develop * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool. test=develop * Refine the unittest of DynamicRNN. test=develop * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE. test=develop	5 years ago
Zeng Jinle	da9e9dd07f	fix py_reader combination ut, test=develop (#20861 )	5 years ago
liym27	84d221b667	improve unsqueeze op to support int, Tensor for argument axes (#20824 ) * improve unsqueeze op to support int, Tensor and Tensor list for argument axes. test=develop * call Resize only when necessary. test=develop	5 years ago
silingtong123	03d7f3ddb2	Make shape tensor support int32 (#20757 ) * Make shape tensor support int32	5 years ago
Huihuang Zheng	95ba4bd2ab	Add shape and type check at read_op (#20754 )	5 years ago
Aurelius84	aacd16dbb4	add pyramid_hash_op (#20698 )	5 years ago
Yang Zhang	cf670ec9ce	Serialize to pickle format (#20820 ) test=develop	5 years ago
whs	c8e49be2f1	Fix roi_perspective_transform op (#20764 )	5 years ago
Bai Yifan	6bdf99d37a	fix dcn doc about Mask introduction, test=develop, test=document_fix (#20836 )	5 years ago
Bai Yifan	fd5321b3f3	modify slim print precision to round(,6), test=develop (#20833 )	5 years ago
WangXi	e78d7f57bb	Print the rank which trainer is error in launch.py, test=develop (#20838 )	5 years ago
xujiaqi01	48669aa8f0	fix several sparse table issuses (#20686 ) * no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto. * add find_distributed_lookup_table_grads instead of hard code GRAD * support embedding stop gradient. push sparse has error before fix this.* * fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this. * fix pull sparse, skip slots which do not have embedding. * fix collect feasign label info, skip slots which do not have embedding. * support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables. * test=develop	5 years ago
whs	fa67e6e83e	Fix unitest of pruning in python3 env. (#20825 ) test=develop	5 years ago
Zeng Jinle	378fc4fb1c	add some docs to jit.trace, test=develop (#20811 )	5 years ago
Zhang Ting	5a8d885d72	All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756 ) * All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview * fix the bug that attr(offsets) should be initialized, test=develop	5 years ago
pkpk	370f0345b6	fix the bug in data_feeder.py (#20791 ) * test=develop * test=develop * test=develop * test=develop	5 years ago
Tao Luo	efbdad0596	make search_compute support avx default (#20779 ) * make search_compute support avx only * clean search_compute.h * rename sse_axpy to avx_axpy test=develop * update CMakeLists.txt test=develop	5 years ago
WangXi	250e72d254	Fix DGC algorithm flow to make it the same as paper (#20758 )	5 years ago
Zeng Jinle	8ff6b289bd	[Dygraph to static graph]JIT/Trace (#20775 ) * jit/trace 1st version, test=develop * add more unittests, test=develop	5 years ago
Aurelius84	28dd2a58df	refine Categorical and MultivariateNormalDiag en doc (#20723 ) * refine Categorical and MultivariateNormalDiag en doc test=develop, test=document_fix * refine Categorical and MultivariateNormalDiag en doc test=develop, test=document_fix	5 years ago
Tao Luo	2f5f19dfb5	mv sampcd_processor.py to tools/ (#20761 ) * mv sampcd_processor.py to tools test=develop test=document_fix * update example script test=develop test=document_fix	5 years ago
石晓伟	37cd43545a	update the infer shape of matmul, test=develop (#20717 ) * update the infer shape of matmul, test=release/1.6 * add unittests of matmul, test=release/1.6 * change func names, test=develop	5 years ago
gongweibao	e425124041	Wait pserver to complete initialization. (#20777 )	5 years ago
zhongpu	702aad5a0a	remove assert statement to support sqeeze op in drgraph, test=develop (#20763 )	5 years ago
gongweibao	8088395a84	Set unique port to every distribute test to avoid potential port conflicts (#20759 )	5 years ago
wangchaochaohu	0687bcd64f	Refine getitem of Variable (#20729 ) * add support for __get_item__ of Variable test=develop	5 years ago
Zhang Ting	483d0512ce	Add choice of CUDA Place and remove fluid.layers.data for python API test of resize Ops (#20689 ) * add cuda place and remove fluid.layers.data for test of python API, test=develop * add cuda place and remove fluid.layers.data for test of python API, test=develop * modified batch size for Python API test, test=develop	5 years ago
danleifeng	79e08ecebf	add assertions on whether elementwise_div divison is zero (#20618 )	5 years ago
bingyanghuang	fd49ebcbd8	update int8 benchmark with 6271 data, test=develop test=document_fix (#20736 )	5 years ago
123malin	95e90aa102	test=develop, add communicator_is_sgd_optimizer flag (#20677 ) * test=develop, communicator_is_sgd_optimizer flags	5 years ago
Aurelius84	74a28f5ea4	fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722 )	5 years ago
wangguanzhong	9a3e22aad4	move nms2 to contrib, test=develop (#20709 )	5 years ago
Zhang Ting	80c97e560d	fix bias_attr's bug of conv and conv_transpose, test=develop (#20704 )	5 years ago
xujiaqi01	5223b0dd9d	add check nan / inf in downpour worker (#20694 ) * add check nan / inf in downpour worker during training * test=develop	5 years ago
WangXi	507afa8a8a	Fix dgc nan by stripping nccl from sparseReduce. (#20630 )	5 years ago
gongweibao	c1710e91b2	Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690 )	5 years ago

1 2 3 4 5 ...

9588 Commits (eab124ba98174b8b67117a1fa6e06b8c6a24c1c2)