Paddle

Commit Graph

Author	SHA1	Message	Date
Wojciech Uss	666c3bb9b0	handle multi-inputs with empty inputs for mkldnn_concat_op (#21827 ) test=develop	5 years ago
GaoWei8	a9af87edbc	Remove self-set accuracy parameters of op tests: max_relative_error (#21823 ) * Remove self-set accuracy parameters of op tests: max_relative_error test=develop * fix error test=develop	5 years ago
GaoWei8	e53d5967c9	Remove self-set accuracy parameters of op tests: max_relative_error (#21816 ) * Remove self-set accuracy parameters of op tests: max_relative_erro test=develop * fix error test=develop	5 years ago
songyouwei	1395828408	Add dygraph Linear layer (#21265 ) * add Linear layer test=develop * update unittest for coverage test=develop	5 years ago
GaoWei8	d683b65b1a	Remove self-set accuracy parameters of op tests: atol (#21711 ) * Remove self-set accuracy parameters of op tests:atol test=develop * keep smaller parameters test=develop * fix error test=develop	5 years ago
juncaipeng	8b74fc4fa7	Fix post training quantization (#21745 ) * fix post training quantization bug of memory constrained, support the input be different, test=develop	5 years ago
Zeng Jinle	aa4d6a5d6c	Add some debug flags to auto growth allocator (#21766 ) * add some debug flags to auto growth allocator, test=develop * add comments about auto growth, test=develop	5 years ago
Leo Chen	c50ebeac12	add comments of inplace_atol (#21819 )	5 years ago
guofei	8b7c50f49a	Make While Op could run on GPU place and add while_loop unittest (#21672 ) 1. Make while_op accept GPU conditional data 2. Add more complex test cases for while_loop API	5 years ago
GaoWei8	a3a3558dd0	Remove self-set accuracy parameters of op tests: max_relative_error (#21817 ) test=develop	5 years ago
GaoWei8	187d1c38ef	Remove self-set accuracy parameters of op tests: max_relative_error (#21744 ) * Remove self-set accuracy parameters of op tests: max_relative_error test=develop * fix errors test=develop	5 years ago
silingtong123	3c33417905	modify the method of skipping CI in distributed unittests (#21764 )	5 years ago
juncaipeng	fb067fa4d2	Modify test framework, test=develop (#21789 ) *use dtype to determine whether check_grade is needed, and delete useless class	5 years ago
Huihuang Zheng	557bce77da	Fix Backward Bugs in Conditional Block (#21809 ) The fixed bugs: 1. The condition sub-graph is not pruned 2. When backward graph is extremely simple, the whole backward ops are pruned.	5 years ago
GaoWei8	eab124ba98	fix accuracy parameters of op tests (#21813 ) test=develop	5 years ago
juncaipeng	642b33564e	Update test precision from fp32 to fp64 (#21805 )	5 years ago
Zeng Jinle	04909137f5	fix lr assert, test=develop (#21780 )	5 years ago
juncaipeng	9894a4fb35	update test precision from fp32 to fp64, test=develop (#21783 )	5 years ago
Leo Chen	c96f06f2f6	add unary operator __neg__, test=develop (#21787 ) adds unary operator __neg__ for VarBase in dygraph mode, and for Variable in static graph mode.	5 years ago
GaoWei8	65c8eac9fe	Remove self-set accuracy parameters of op tests: max_relative_error (#21741 ) * Remove self-set accuracy parameters of op tests: max_relative_error test=develop * Remove self-set accuracy parameters of op tests: max_relative_error test=develop * Remove self-set parameters of op tests: max_relative_error test=develop	5 years ago
GaoWei8	579a159a7b	Remove self-set accuracy parameters of op tests: max_relative_error (#21739 ) * Remove self-set accuracy parameters of op tests: max_relative_error test=develop * Remove self-set accuracy parameters of op tests test=develop * keep smaller parameters test=develop	5 years ago
zhouwei25	ecb2419e5c	increase the explanation doc of py_func (#21631 ) * increase example code of py_func, fix some wrong description of English API doc	5 years ago
Youwei Song	b976ba3e83	fix unittests (#21786 ) test=develop	5 years ago
Zhang Ting	73e97d39b4	add check for check_grad in Op unittests (#21383 )	5 years ago
Huihuang Zheng	0677a1c1c1	Fix That conditional_block_op Doesn't Have InferShape (#21733 )	5 years ago
GaoWei8	ac666b8ac3	Remove self-set accuracy parameters of op tests: atol (#21731 ) * Remove self-set accuracy parameters of op tests test=develop * add cast test=develop * Remove self-set accuracy parameters of op tests:atol test=develop * lrn_ngraph_op test=develop * Keep smaller parameters test=develop	5 years ago
GaoWei8	c924353d4f	Remove self-set accuracy parameters of op tests: max_relative_error (#21740 )	5 years ago
Youwei Song	f6144d8463	remove build_once & name_scope (#21131 ) * remove build_once & name_scope (Conv2D) test=develop * fix unittest test=develop * Conv2DTranspose * Conv3D & Conv3DTranspose test=develop * Pool2D & BatchNorm * Embedding * LayerNorm * GRUUnit & NCE * PRelu * BilinearTensorProduct * GroupNorm & SpectralNorm * TreeConv test=develop * fix LayerNorm in transformer unnittest test=develop * disable LayerNorm or BatchNorm in multicard test=develop * refine Layer.create_parameter api test=develop * refine LayerNorm, remove begin_norm_axis param, add normed shape check test=develop * LayerNorm bug fix test=develop	5 years ago
WangXi	0fe16539ef	Fix dgc & launch tests in cpu ci (#21759 )	5 years ago
zhaoyuchen2018	a5a8d14414	Fix softmax cuda bug (#21720 ) * Fix softmax cuda bug * Refine multihead log and softmax logic	5 years ago
Kaipeng Deng	943a44492b	yolo_box OP add Attr(clip_bbox). (#21620 ) * yolo_box OP add Attr(clip_bbox). test=develop	5 years ago
zhupengyang	c4f8f3bddc	use large input shape for accuracy test (#21758 ) - ngarph: elementwise_sub, elementwise_mul - mkldnn: transpose, sum - others: scatter_nd test=develop	5 years ago
GaoWei8	92f83e46aa	fix accuracy parameters of op tests (#21729 ) test=develop	5 years ago
zhouwei25	e92d113590	fix bug that tuple(Variable) is converted to list(Variable) uncorrectly (#21687 )	5 years ago
Leo Chen	7181afd75c	Fix elementwise_pow bug on CUDA place with integer (#21675 ) * fix elementwise_pow bug on integer, test=develop * use llrint to support elementwise_pow_grad, test=develop * add some tests, test=develop * revert grad functor, test=develop	5 years ago
Chen Weihang	68999b6c7a	simplify dygraph data loader code, test=develop (#21722 )	5 years ago
Leo Chen	9c481e12ba	Patch math method for VarBase using auto-generated op functions (#21656 ) * patch math method for varbase using auto-generated op functions, test=develop * clean code that handles batch_size, test=develop * follow comments, test=develop * follow comments, test=develop * code clean, test=develop	5 years ago
lilong12	aa287e19fe	remove the dependency on ssl (#21712 )	5 years ago
Aurelius84	1ee5ba1c58	Enhance checking on sub branch of backward (#21582 ) * add backward unittest test=develop * add sub-branch in test_backward.py test=develop * refine code, add comment test=develop * reconstruct TestBackward Class test=develop * fix typo of comment test=develop	5 years ago
WangXi	4570295122	fix assert in nan inf tests, test=develop (#21742 )	5 years ago
gongweibao	549f24b5f1	run dist tests parallel(#21751 )	5 years ago
juncaipeng	a6e935f424	Update op test framework (#21599 ) * update op test framework	5 years ago
juncaipeng	7c38612347	disable op test of kldiv_loss (#21749 )	5 years ago
zhupengyang	d528ffaa04	use large input shape for accuracy test (#21716 ) affine_grid, label_smooth, spectral_norm, warpctc, nearest_interp, data_norm, match_matrix_tensor, var_conv_2d, fused_embedding_seq_pool test=develop	5 years ago
WangXi	a2175cfc96	Tmp fix fleet bug in py35 gcc8 CI, test=develop (#21703 )	5 years ago
zhupengyang	227edfe9da	use large input shape for op tests (#21705 ) sequence_expand_as, squared_l2_distance, gather_nd, center_loss, rank_loss, conv_shift, spp, modified_huber_loss, smooth_l1_loss, multiplex, sequence_softmax, nce, huber_loss, group_norm, kldiv_loss, hinge_loss, expand_as test=develop	5 years ago
joanna.wozna.intel	d419b859c0	Add reshape int8 mkldnn op (#21428 ) * Add reshape int8 op test=develop * Change test to CPUPlace test=develop * Correct tests test=develop	5 years ago
baojun	c047e713b0	make nGraph ut py3 compatible (#21679 )	5 years ago
baojun	3b915741b2	Fix nGraph UT for PY3 - Part I (#21678 ) * fix test for PY3 test=develop * reduce file changes test=develop	5 years ago
WangXi	8a0f611b64	Rewrite check nan inf tools (#21076 )	5 years ago
zhupengyang	019147eb8b	use large input shape for accuracy test, (#21693 ) sequence_unpad, expand, pad, pad_constant_like, norm, bilinear_tensor_product, flatten2, im2sequence, unpool, cos_sim, strided_slice, flatten, elementwise_min, abs, acos test=develop	5 years ago
juncaipeng	5c4106023c	disable qat int8 test for now, test=develop (#21696 )	5 years ago
tangwei12	9ad940fdfe	memory leak for cpu (#21174 ) * add fake init for the trainer, fix large memory hold in the trainer * do not merge recv vars from a remote endpoint, test=develop * add recv and save op, merge slice var in one op, save memory * remove hsigmoid with pull sparse, test=develop	5 years ago
zhupengyang	4c987a6003	fix input shape of op tests (#21682 ) * fix input shape of op tests for elementwise_sub, gather, pad2d, transpose, softmax, scale, elementwise_max, hierarchical_sigmoid, reshape2, sign, squeeze, reduce_sum, sum, squeeze2, unsqueeze, unsqueeze2, cast, reverse test=develop * fix cast, elementwise_mul, gather, scale, sign, softmax, transpose test=develop	5 years ago
juncaipeng	f64d006622	Change several tests to inherit the right parent class, test=develop (#21652 ) * change several tests to use the right parent class, test=develop * add dtype for TestLoDTensorAndSelectedRowsOp, test=develop	5 years ago
mapingshuo	686f0ecb6a	add `no_need_buffer_slots` interface to pybind (#21575 ) * add no_need_buffer_slots interface to pybind	5 years ago
juncaipeng	52f38877e4	Fix ci bug for deleting data files when other test is running (#21661 ) * fix ci bug for deleting data files, test=develop * update, test=develop	5 years ago
Huihuang Zheng	99331fa113	Fix current block in math_patch_op (#21189 )	5 years ago
Chen Weihang	d96acc3363	Refine dygraph DataLoader implementation (#21634 ) * refine dygraph dataloader & polish related code, test=develop * refine code based review comment, test=develop	5 years ago
mapingshuo	e2d849b989	Dropout with seed (#21590 ) * add seed op	5 years ago
Adam	e81f0228df	MKL-DNN 1.0 Update (#20162 ) * MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop	5 years ago
xujiaqi01	f404157205	fix master patch when slot is dense (#21580 ) * fix master patch when slot is dense * test=develop	5 years ago
Leo Chen	48600d7f17	Add op function generator for dygraph (#21569 ) * add op function generator, test=develop * add unittest, test=develop * follow comments, test=develop * fix windows compilation problem, test=develop	5 years ago
zhongpu	9a4dd1bc25	support float64 for GradClipByGlobalNorm in dygraph, test=develop (#21401 ) * support float64 for GradClipByGlobalNorm in dygraph, test=develop * fix comment for GradClipByGlobalNorm, test=develop	5 years ago
zhongpu	8777e8c1e9	fix Conv2DTranspose API, test=develop (#21403 )	5 years ago
lidanqing	fbf9eca0d3	QAT Int8 document (#21360 ) * update benchmark for int8v2, QAT1, QAT2 accuracy and performance test=document_fix * change according to reviews test=develop test=document_fix * improve some descriptions and some models test=develop test=document_fix * update models benchmark data test=develop test=document_fix * update int8v2 and qat2 performance test=develop test=document_fix	5 years ago
Huihuang Zheng	a1a5adc9b8	Refine English Doc of cond API (#21609 ) As the title	5 years ago
Zhang Ting	548efcd2e4	Fix unit tests to avoid check_grad checking failures (#21554 ) * fix python API tests that do not need to inherit OpTest, test=develop * fix fp16 cases that will only be enabled in GPU mode, test=develop * remove TestSoftmaxFP16Op from test cases of softmax_mkldnn_op, test=develop * fix tests so that the cases are only created in GPU mode, test=develop	5 years ago
zhongpu	4ad9b755c7	fix paddle compile errors under some python versions (#21616 ) * fix compile error in some python version, test=develop * remove redudant code, test=develop	5 years ago
liym27	be6a639655	Add CI for checking Input/Output/Attr of modified Ops (#21522 ) * add shell scripts. test=develop * rename test_pybind_inference to test_pybind_interface and print repeat process in check_op_desc.py. test=develop * add approval RD. test=develop	5 years ago
guofei	835ea12cde	Control flow API: while_loop (#21276 ) Add basic while_loop	5 years ago
Leo Chen	4f81d1bd5f	Refine VarBase init function (#21587 ) * refine init function, test=develop * add tests, test=develop * remove extern, which may cause symbol error in gcc-4.8, test=develop	5 years ago
lijianshe02	56882ce432	change input data type and decrease max_relative_error value in test_check_grad for grop_nom_op test test=develop (#21608 )	5 years ago
Huihuang Zheng	1dcf6a7212	Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532 ) Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests. Fix bugs: 1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op. 2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var. This PR also did some code clean up: 1. Print the var name when sgd_op catches shape error so that it is easier to debug 2. Fix a typo: dicta -> dict	5 years ago
hutuxian	c5aec2fe68	Paddlebox Related to Framework (#21586 ) * Add a single_process_multi_thread transpiler. * Add some UTs. * Fix some API description.	5 years ago
liym27	9da7e6b4d4	add file check_op_desc.py and add interface to get default value. (#21530 ) * add file check_op_desc.py and add interface to get default value. test=develop * add test for c++ coverage rate. test=develop * Correct typo. test=develop	5 years ago
Feiyu Chan	2057df7ac0	add fluid.layers.gelu & doc (#21515 ) Add a python interface for Gelu. Add documentation for fluid.layers.gelu.	5 years ago
wangchaochaohu	29c3844585	fix doc typo test=develop (#21566 )	5 years ago
Jacek Czaja	9ce0e29dc3	[MKL-DNN] Batch norm mkl-dnn NHWC support (#21553 ) * - BAtch norm mkl-dnn NHWC test=develop - compilation fix test=develop - UT fix - cosmetics test=develop - Fix to Batch Norm MKL-DNN NHWC UT test=develop Conflicts: paddle/fluid/operators/batch_norm_op.h * - Lint fixes test=develop	5 years ago
danleifeng	657053f262	remove elementwise x_should_larger_than_y restriction;test=develop (#21517 )	5 years ago
hong	08483a6819	Add dygraph linear warm up decay (#21186 ) * dygraph mode support linear lr warm up; test=develop * add unitest for linear warmup; test=develop * add input type check; test=develop * fix type check assert error; test=develop * change type error; test=develop	5 years ago
lilong12	da75ac8b6c	bugfix: construct a DistributedStrategy instance if the passed one is None (#21545 )	5 years ago
lilong12	de46b15951	Unify the rank of prelu alpha to 4, corresponding to [N, C, H, W], except for the all mode	5 years ago
Leo Chen	cdd46d7e02	Split VarBase from Python Variable for Dygraph (#21359 ) * test=develop, fix docker with paddle nccl problem * don't expose numerous Tensor.set(), test=develop * fix condition, test=develop * fix float16 bug, test=develop * feed should be Tensor or np.array, not Variable or number, test=develop * use forcecast to copy numpy slice to new array, test=develop * remove float16-uint16 hacking, test=develop * add variable method to varbase and refactor to_variable to support return varbase * support kwargs in varbase constructor * add VarBase constructor to support default python args * refine varbase initial method * reset branch * fix ut for change VarBase error info to PaddleEnforce * cherry is parameter change before * overload isinstance to replace too many change of is_variable * rm useless files * rm useless code merged by git * test=develop, fix some ut failed error * test=develop, fix test_graph_wrapper * add some tests, test=develop * refine __getitem__, test=develop * add tests, test=develop * fix err_msg, test=develop	5 years ago
Youwei Song	cdba41af4d	dygraph Embedding layer use lookuptable v2 (#21209 ) * dygraph Embedding layer use lookuptable v2 test=develop * fix test_nce test=develop	5 years ago
wangchaochaohu	4c9b3dafa7	fill_constant_batch_size_like OP precious problem fix (#21337 ) * fix fill_constant_batch_size_like_op precious problem test=develop	5 years ago
Zhang Ting	b1da35261b	fix unit tests that do not need to inherit OpTest (#21460 ) * fix PythonAPI test in Op unittest, test=develop * fix unit tests that do not need to inherit OpTest, test=develop	5 years ago
WangXi	768f9242e9	Fix dgc clip & rampup step, test=develop (#21491 )	5 years ago
Aurelius84	54382ce497	Add get_all_kernels api of registered data_type in pybind.cc (#21499 ) * add _get_all_register_op_kernels api test=develop * refine usage of check_op_register_type test=develop * add import in core test=develop	5 years ago
lilong12	e75ded08a0	fix the compatiable problem between PY2 and PY3 (issue#20749) (#20942 ) * fix the compatiable problem between PY2 and PY3. * add ut, test=develop * add proxy, test=develop * download dataset before test, test=develop	5 years ago
Zeng Jinle	a3535812f6	add _use_system_allocator to some op tests, test=develop (#21504 )	5 years ago
Jacek Czaja	18a5d30754	[MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466 )	5 years ago
ruri	2445fef386	Fix density sample (#21506 )	5 years ago
zhongpu	6ebf0f47b8	support SelectedRows in dygraph, test=develop (#21078 ) * support SelectedRows in dygraph, test=develop * fix bug of _grad_ivar interface, test=develop * add optest for support seletedrows, test=develop * fix bug for gradient_accumulator in GPU mode, test=develop * fix error when Selectedrows addto LodTensor in sorted_gradient mdoe in dygraph, test=develop * refine and simplify gradient accumulator code, test=develop * add optest, test=develop * add optest and simplify code, test=develop * fix bug for test_imperative_selected_rows, test=develop * add optest for Coverage, test=develop * fix gradient interface and simplify code, test=develop * update api for gradient, test=develop * fix ShareDim's bug in DygraphExecutionContext class, test=develop * add optest, test=develop	5 years ago
lilong12	0bc8bdf724	set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402 ) * set dim[0] to -1 if dim[0] < 0 and remove assertion to runtime, test=develop * modify ENFORCE message, test=develop * add validation for x.shape[0] > 0, test=develop * add ut, test=develop	5 years ago
Aurelius84	4bf115b42d	Fix AdamOptimizer and Scale sample code Bug (#21478 ) * fix adam sample code bug test=document_fix * fix sample code bug in scale test=document_fix	5 years ago
ruri	94bef03539	Revert "Add masked select api (#21172 )" (#21456 ) This reverts commit `007c997572`.	5 years ago
ruri	3706ea67f8	fix sample code in density prior box	5 years ago
Zeng Jinle	87ab93af01	fix adam fp64, test=develop (#21423 )	5 years ago
liym27	beec87b911	fix bug in example codes of API case and switch_case. test=develop,test=document_fix (#21477 )	5 years ago
hutuxian	7e68bc896b	refactor AUC OP and add its CUDA Kernel (#21336 ) * refactor AUC OP and add its CUDA Kernel * the layout of global auc doesn't change	5 years ago
juncaipeng	1f57ac1241	delete concat in AddQuantDequantPass, test=develop (#21454 )	5 years ago
Zeng Jinle	2a54c359f0	add fraction of cpu memory to use, test=develop (#21453 )	5 years ago
Zhang Ting	101240d2c1	fix PythonAPI test in Op unittest, test=develop (#21462 ) There are PythonAPI tests in Op's unittest which don't need to inherit OpTest class.	5 years ago
wawltor	dbbe6e9cb6	fix the device supported of the op unique and unique_with_counts. (#21395 ) * fix the device supported of the op unique and unique_with_counts. test=develop test=document_fix * Fix the precision of test in the op of unique and unique_with_counts. test=develop test=document_fix	5 years ago
Huihuang Zheng	32959e031e	Add English Document for cond API (#21452 ) Add English doc for cond	5 years ago
Zhang Ting	3df13ab40c	fix PythonAPI test in Op unittest, test=develop (#21455 ) There are PythonAPI tests in Op's unittest which don't need to inherit OpTest class.	5 years ago
Jie Fang	5e813b53c5	nhwc optimization for batchnorm (#21090 )	5 years ago
Leo Chen	e0c9d856fb	add unused input vars check for OpWithKernel, test=develop (#21169 ) * add unused input vars check for OpWithKernel, test=develop * remove unused vars in some ops, test=develop * fix batch_norm, test=develop * add white list, test=develop * add CI check for white list, test=develop * :ove white list to c++, test=develop * solve failure of CI, test=develop * add unittest for unused_var_check, test=develop * refine code, enable check in operator_test, test=develop * skip mkldnn, test=develop * extend white list, test=develop * refine condition of mkldnn, test=develop * fix paddle_build, test=develop * follow comments, test=develop * fix GetExpectedKernelType * add wiki ref to err_msg, test=develop * follow comment, test=develop	5 years ago
Chen Weihang	664f958a02	Fix optimizer op infershape failed in dygraph multi-cards mode (#21374 ) * add param & grad shape check for sgd op * add _reshape_inplece interface for dygraph parallel * refine unittest based paddle/models scripts, test=develop * add unittest for parallel grad fuse, test=develop	5 years ago
Huihuang Zheng	630be31952	Fix Cond Bug for Nested Control Flow (#21340 ) * Commit before merging develop test=develop * Backup after working with Huihuang logs * Commit before deleting Huihuang debug loggings * Commit before debug test=develop * Fix bug commit test=develop * Backup of fixing bugs test=develop * Clean up code test=develop * Fix a bug in sum_op test=develop	5 years ago
Jacek Czaja	cd43c4440e	[MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375 )	5 years ago
xujiaqi01	ca879e5a77	fix skip_op bug (#21418 ) test=develop	5 years ago
zhaoyuchen2018	b16274556a	Add dscending for argsort (#21400 ) * Add ascending for argsort * Refine api doc description. * Refine descending description * Add int32 logic to speedup when data is small size. * Remove int32 opt as not support in python	5 years ago
hong	ac8546701d	Add dygraph execution context (#20157 ) * add_dygraph_execution_context * add dygraph infershape context and execution context; test=develop * fix imperative bug; test=develop * remove inputs outputs interface from execution context, because it have same function with inputNames; test=develop * remove tracer_test ctest; test=develop * fix split op bug; test=develop * fix unitests bug; test=develop * fix distribute test bug; test=develop * fix ngraph compile bug; test=develop * fix grad maker bug; test=develop * fix load op bugs; test=develop * fix operator.cc construct bug; test=develop * remove useless name find in operator; test=develop * add tracer_test; test=develop * fix concat, split bug; test=develop * remove tracer_test unitest; test=develop * fix attribute check bug; test=develop * add test code to fix converage; test=develop * remove useless code, change check backward input in engin; test=develop * unlock var type infer shape;test=develop * add ShareAllLoD api; test=develop * add dygraph infershape context unitest; test=develop * remove increase and decrease lod in dygraph; test=develop * addd override; test=develop * fix increase descrease lod; test=develop * fix paddle_enforce; test=develop * disable lod op dygraph check; test=develop * fix paddle enforce error; test=develop * add comment for op_registry and OperatorBase; test=develop * optimize the comment of op_registry; test=develop * fix format of comment; test=develop * fix format of comment; test=develop * optimize the format of comment; test=develop * optimize the format of the comment; test=develop * optimize comment of op_registry; test=develop	5 years ago
hutuxian	a6b089c614	add macro to ban windows (#21422 ) remove nccl related code in windows	5 years ago
Kaipeng Deng	ebfb720a63	add Adam beta1/beta2 support Variable (#21234 ) * add Adam beta1/beta2 support Variable. test=develop	5 years ago
Zeng Jinle	09696d5df8	Use system allocator in OpTest (#21335 ) * use system allocator in unittests, test=develop * fix op bugs, test=develop * fix tensor copy bug when src and dst are the same, test=develop	5 years ago
ruri	007c997572	Add masked select api (#21172 )	5 years ago
Kaipeng Deng	67c836fb5c	batch_norm momentum support variable (#21246 ) * batch_norm momentum support variable. test=develop * fix format. test=develop * add batch_norm momentum variable example. test=develop * move MomentumTensor to training branch. test=develop * split example. test=develop * fix doc. test=develop * fix PADDLE_ENFORCE ci. test=develop * fix format. test=develop	5 years ago
lidanqing	c0aa13672e	Fp32 vs int8 qat C++ performance (#21244 ) * add ut for comparing FP32 and QAT INT8 * add save qat transformed model python script test=develop * updated * added missing file * add "with_label" test=develop * performance benchmark as unit test test=develop * change names of unnecessary thing * Change CMakeList.txt for model downloading and UT test=develop * change names of functions and params for more readable code test=develop * Change PADDLE_ENFORCE messages test=develop * fix indent problems test=develop * indent problems test=develop	5 years ago
xujiaqi01	f1178e9d79	fix fleet save bug (#21362 ) * fix fleet save bug of save_infernece_model * test=develop	5 years ago
Liufang Sang	1840c1652c	add config file to avoid load checkpoint test=develop (#21373 )	5 years ago
Zeng Jinle	b97fc16d21	fix lod_reset bug, test=develop (#21392 )	5 years ago
hutuxian	47a82e38e3	Support data_norm gpu kernel (#21325 ) * support data_norm_op run in CUDA * add two parameters sync_stats & summary_decay_rate * add UT	5 years ago
Youwei Song	d5ff79e55e	Support numpy bridge (enabled by default in dygraph mode) (#20983 ) * add numpy bridge * fix template compile * add unittest, add default test=develop * fix unittest test=develop * fix unittest test=develop * zero_copy=True for to_variable, test=develop * bug fix test=develop * disable deprecated NumPy API test=develop * use better design of NumpyAllocator test=develop * fix Py_None check test=develop * reset c++ tracer when jump out dygraph guard test=develop * refine PADDLE_ENFORCE_xx format test=develop * bug fix of tracer switch test=develop * update decref test=develop	5 years ago
Michał Gallus	5d7d548275	INT8 Fully-connected (#17641 ) * Implement Int8 FC * Integrate FC into INT8v2 test=develop * int8 FC: transpose weights before computing scales test=develop * Add support for activation_type string in FC test=develop * Disable MKL-DNN's FC in VGG16 and 19 test=develop * Disable FC quantization when mkldnn FC is disabled test=develop * Solve PADDLE_ENFORCES in FC int8 * Fix Paddle enforces and remove const cast test=develop * Fix style changes test=develop * Fix quantizer_tester test and add fc quantization test=develop * Fix FC test fail on CUDA * Remove unnecessary log from quantize placement pass test=develop * Add Thread ID to FC hash key test=develop * Add comments to MKL-DNN FC Kernel test=develop * Refactor quantizer test=develop * Fix linter issues test=develop * Fix crash in slim googlenet test=develop * Fix PADDLE_ENFORCE messages test=develop	5 years ago
itminner	07e6a94268	paddleslim quantization skip pattern support list of string (#21141 )	5 years ago
Zhen Wang	be2e3e67d9	Fix some typos in AMP. (#21354 ) * fix some typos in AMP. test=develop * delete useless codes. test=develop	5 years ago
lilong12	41d13209d7	add the framework support for distfc (#21197 ) * add the framework support for distfc and ut, test=develop * fix the implementation of shard_index_op, test=develop	5 years ago
hong	a214a3081b	change download log format (#21290 ) * change download log formate; test=develop * add unittest for data download; test=develop * remove cache before download; test=develop	5 years ago
GaoWei8	234060f88f	Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972 ) * Add fc padding to solve mkl performance test=develop * fix gpu pass and error information test=develop * fix fc_fuse_pass_test test=develop * fix error information test=develop * fix error information test=develop * fix name and add fc op padding test test=develop * fix attributes test=develop * optimize fc padding test=develop * fix test test=develop	5 years ago
ruri	6cfcbe0510	reduce interp op input size to pass CI, test=develop (#21341 )	5 years ago
Jacek Czaja	f4cf028a8c	[MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207 )	5 years ago
Michał Gallus	ed9ceb9f98	Refactor MKL-DNN ElementwiseMul (#21061 ) * Refactor MKL-DNN ElementwiseMul remove manual fallback, remove format attrs test=develop * Refine PADDLE_ENFORCEs in eltwise_mul_op.h test=develop * Make ElementwiseMulOp inherit from ElementwiseOp * Change type of simd_width to int test=develop * Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp test=develop * Restore attributes test=develop * Fix test coverage for mkldnn eltwise mul test=develop * Conform to new is_run_common_broadcast API test=develop * Add UT for AreDimsAndFormatCorrect test=develop	5 years ago
Dong Daxiang	0a93635b5f	fix logger problem (#21342 ) * fix logger problem test=develop * refine logger test=develop	5 years ago
wangchaochaohu	6514f52e46	fix the fill_constant op precious problem (#21322 ) * fix the fill_constant op precious problem test=develop	5 years ago
zhaoyuchen2018	08c19c585d	Improve argsort performance. (#21267 ) * Improve argsort performance. - Give 200000 data to compute argsort on v100, can speed up ~190x (before opt cost: 0.53s, after opt cost: 0.0027s) - Add fp16 support * Refine error message * Refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
lijianshe02	7fcaa39b36	fix Print_op input dtype list error test=develop (#21326 )	5 years ago
juncaipeng	84865b806b	add resnet50 test for post trainint quantization, test=develop (#21272 )	5 years ago
Thunderbrook	9a7832f8be	print table stat info for pslib (#21296 ) * print table stat test=develop * notes test=develop * notes test=develop	5 years ago
WangXi	8ac7687e36	Fix dgc accuracy by mv regularization to local (#21278 )	5 years ago
Zeng Jinle	b9f8ae8494	Add global value getter setter (#21285 ) * add global value getter setter, test=develop * fix error messages, test=develop	5 years ago
Dong Daxiang	691ced87c0	Refactor fetch handler (#21264 ) * fix fetch handler problem and refactor when a user define FetchHandler class, he or she should initialize a handler with variable dict. the key of a variable dict is a user defined name, the value of a variable dict is a Varaible generated from python API. For each fetching, a user should implement handler function in which fetched_result_dict will be available and the user can access the fetched value with user defined keys.	5 years ago
Yi Liu	f1b09ba30e	adapt test_collective_base.py for only two GPU cards available. (#21307 ) * adapt test_collective_base.py for only two GPU cards available. test=develop * fix bug of issue #21259 test=develop	5 years ago
gongweibao	ed2a185248	optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597 )	5 years ago
Liufang Sang	f0b1518438	add dequantize_abs_max op and modify lookup_table op (#20899 ) * add int8 kernel to lookup_table op and add dequantize op test=develop * change paddle_enforce to paddle_enforce_eq test=develop * change copyright and change some not suitable code test=develop * remove debug log test=develop * replace GetInputType with IndicateVarDataType test=develop * fix EmptyGradMaker test=develop * fix diff between cpu and gpu test=develop * use memcopy when int8_t test=develop	5 years ago
hutuxian	a6ce2306f9	support cvm_op run in gpu (#21300 ) Previously, CVM OP was only able to run in CPU. This PR implements its GPU kernel. What's more, we improve the UTs about CVM OP.	5 years ago
Chen Weihang	952508527a	Polish some PE code details (#21274 ) * polish code details, test=develop * futher polish hint msg, test=develop	5 years ago
Yi Liu	0fd1281ef8	fix bug of issue #21259 (#21287 ) pass the argument `allow_out_of_range` of one_hot op to c++ back end.	5 years ago
xujiaqi01	319d2ba925	fix fs_client_param bug (#21212 ) * fix fs_client_param bug， user can set this config through fleet_desc_file or fleet config * test=develop	5 years ago
Thunderbrook	0d17c1b816	solve pslib core in stop worker (#21263 ) * general table * add sparse table test=develop * no cvm test=develop * add no_cvm test=develop * add note test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * add key of optimizer test=develop * solve pslib stop core test=develop * barrier test=develop * add notes test=develop	5 years ago
zhongpu	fa4d055098	fix bug for python/paddle/fluid/tests/unittests/test_elementwise_mul_op.py, test=develop (#21289 )	5 years ago
zhongpu	c4ede95c74	open dygraph op test, test=develop (#19787 ) * open dygraph op test, test=develop * modify to_variable, test=develop * modify input and output for dygraph, test=develop * modify input and output for dygraph(fix bug), test=develop * fix input processing of dygraph op test, test=develop * fix bug, test=develop * fix op test, test=develop * fix forward bug for dygraph, test=develop * fix mkldnn op test for forward, test=develop * update nn.py for dygraph, test=develop * fix crop_tensor_op, test=develop * fix elementwise_mul_op, test=develop * fix fill_op, test=develop * fix some mkldnn op, test=develop * open backward op test for dygraph, test=develop * delete log, test=develop * close backward op test for dygraph, test=develop * fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop * fix optest backward bug for dygraph, test=develop * fix optest backward bug for dygraph, test=develop * close backward op test for dygraph, test=develop * close backward op test for dygraph, test=develop * open dygraph op test, test=develop * fix op test for dygraph, fix GradOpDescMaker, test=develop * fix bug for linear_chain_crf_op.h, test=develop * remove log, test=develop * remove log, test=develop * remove log for op_test.py, test=develop * remove log for op_test.py, test=develop * fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop * fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop * fix bug for test_increment_ngraph_op.py, test=develop * fix lod for op test in dygraph, test=develop * refactor op_test.py to reduce redundant code, test=develop * fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop * remove debug log, test=develop * remove redundant code in base.py, test=develop * fix some error in optest, test=develop * fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop * refactor op_test.py, test=develop * remove redundant writing, test=develop * fix error(get tensor of the grad variable), test=develop * fix test_concat_mkldnn test_conv2d_mkldnn, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix optest.py for get tensor of LoDTensor, test=develop * fix some redundant code, test=develop * reslove conflict and rewrite paddle error message, test=develop	5 years ago
Kaipeng Deng	3ab60f5bf9	fix mkldnn include. test=develop (#21247 ) * fix mkldnn include. test=develop * fix mkldnn inlcude. test=develop	5 years ago
xujiaqi01	eca66f317e	fix fleet util bug (#21254 ) * fix fleet util bug in save paddle inference model * test=develop	5 years ago
ShenLiang	1f39a9f17e	fix the bug of scatter_nd, test=develop (#21257 )	5 years ago
lijianshe02	382cf5d7e3	add input type and input data type check for Print_op test=develop (#21250 ) * add input type and input data type check for Print_op test=develop	5 years ago
Thunderbrook	349e82d669	support general embedding params (#21217 ) * general table * add sparse table test=develop * no cvm test=develop * add no_cvm test=develop * add note test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * add key of optimizer test=develop	5 years ago
liym27	b0fc822747	Add control flow api: case (#21114 ) * add control flow API: case. test=develop * delete 'raise TypeError' in _error_message() and return a string. test=develop * polish API document. test=develop	5 years ago
juncaipeng	29b63f0aa1	support set model_filename and params_filename in post_training_quantization, test=develop (#21213 ) * support set model_filename and params_filename in post_training_quantization, test=develop	5 years ago
Dong Daxiang	ccbdd7aad0	update worker_num for MPISymetricRoleMaker (#20798 ) test=develop	5 years ago
Liufang Sang	c91cb6c550	fix load checkpoint error in test_reader (#20924 )	5 years ago
Kaipeng Deng	4747940b08	add custom_op include: imperative, error_codes.pb.h, mkldnn.h. test=develop (#21227 )	5 years ago
danleifeng	0e7baabe59	extend elementwise broadcast function (#20957 )	5 years ago
yaoxuefeng	b5d8ba8394	fix data_norm op to avoid impractical normalization result test=develop (#21152 ) * fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop	5 years ago
Zeng Jinle	67e88424e5	Polish jit trace codes (#21218 ) * polish jit trace codes, test=develop * polish codes again by removing var_id, test=develop	5 years ago
Zeng Jinle	cdb3d27985	Fix warn of gcc8 (#21205 ) * fix warnings oof gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop	5 years ago
Jeng Bai-Cheng	330b173c38	Better TensorRT support (#20858 ) * Fix TensorRT detection bug 1. Add new search path for TensorRT at tensorrt.cmake 2. Add better debug message 3. Fix the bug of detection of TensorRT version In NVIDIA official docker image, TensorRT headers are located at `/usr/include/x86_64-linux-gnu` and TensorRT libraries are located at `/usr/lib/x86_64-linux-gnu`, so using `-DTENSORRT_ROOT` will fail to detect TensorRT. There is no debug/warning message to tell developer that TensorRT is failed to be detected. In later version of TensorRT (e.g. v6), `NV_TENSORRT_MAJOR` is defined at `NvInferVersion.h` instead of `NvInfer.h`, so add compatibility fix. * Fix TensorRT variables in CMake 1. Replace `${TENSORRT_ROOT}/include` with `${TENSORRT_INCLUDE_DIR}` 2. Replace `${TENSORRT_ROOT}/lib` with `${TENSORRT_LIBRARY}` Manually type path may locate incorrect path of TensorRT. Use the paths detected by system instead. * Fix TensorRT library path 1. Add new variable - `${TENSORRT_LIBRARY_DIR}` 2. Fix TensorRT library path inference_lib.cmake and setup.py.in need the path of TensorRT library instead of the file of TensorRT library, so add new variable to fix it. * Add more general search rule for TensoRT Let system detect architecture instead of manually assign it, so replace `x86_64-linux-gnu` with `${CMAKE_LIBRARY_ARCHITECTURE}`. * Add more general search rule for TensorRT Remove duplicate search rules for TensorRT libraries. Use `${TENSORRT_LIBRARY_DIR}` to get full path of libnvinfer.so test=develop	5 years ago
danleifeng	3fe63d6780	add store_true to use_paddlecloud argument in launch.py (#21168 )	5 years ago
Zhang Ting	9cbe7bccba	modified error message and API doc for channel_last supported Op (#21002 ) * modified error message for conv and conv_transpose, test=develop * modified doc of conv and conv_transpose op, test=develop * modified the expression for error message, test=develop * modified error message for group_norm op, test=develop * modified detail of Attr(data_format) or Attr(data_layout) * add ValueError in API doc for maxout op, test=develop	5 years ago
liym27	9247528252	Control flow API: switch_case (#21103 ) * add API switch_case. test=develop add Nest * modify code according to reviews: 1.Attr(branch_index) support 'uint8' and 'int64' besides 'int32'. 2.remove useless code. test=develop * replace fluid.layers.data with fluid.data and polish API document. test=develop	5 years ago
guofei	56b5d14704	Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118 )	5 years ago
WangXi	3c98ec90ce	Fix INF bug of softmax_cross_entropy_op (#21165 )	5 years ago
Zeng Jinle	0f30d3a213	fix dygraph trace bug, test=develop (#21193 )	5 years ago
juncaipeng	00b11a4a1e	Support more ops in post training quantization, test=develop (#21073 ) * Support more ops in post training quantization, and save the output scale in quantized op. * Update docs in post training quantization and qat	5 years ago
xujiaqi01	23876de55b	fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052 ) * fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop	5 years ago
xujiaqi01	9e045170c0	add copy table (#21086 ) * copy some feasigns and corresponding embeddings from one sparse table to another * copy all feasigns and corresponding embeddings from one sparse table to another * copy all dense params from one table to another * copy some local vars to other local vars	5 years ago
ruri	aeb887911f	Refine edit distance cn (#21121 )	5 years ago
Kaipeng Deng	98b59cb82c	fix elementwise_mod float point kernel. test=develop (#21183 )	5 years ago
hong	835119c777	disable reshape inplace in dygraph model; test=develop (#21157 )	5 years ago
Zeng Jinle	5fdfbe3413	Add friendly dygraph trace API (#21091 ) * friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop	5 years ago
whs	cfdd1fc2cd	Fix warpctc in padding mode. (#21033 )	5 years ago
Tao Luo	3976bbe2ce	add input type and dtype check template, and update some APIs check (#21161 ) * add input type and dtype check template, and update some APIs check * refine check template, and update some APIs check in nn.py * update some APIs check in loss.py test=develop	5 years ago
joanna.wozna.intel	37e0e7a96b	QAT int8 accuracy little improvement (#21074 ) test=develop	5 years ago
gongweibao	a5fc291fe5	Use 2 cards for hallreduce unit test. (#21085 ) use 2 cards test=develop	5 years ago
Tao Luo	8f659d4345	Split some APIs from nn.py to loss.py (#21117 ) * Split some APIs from nn.py to loss.py test=develop * fix test_detection unit-test test=develop	5 years ago
zhaoyuchen2018	4a544762a2	Add Asypadding for conv fusion. (#21041 ) * Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.	5 years ago
WangXi	de5d3ff688	Fix dgc buffer illegal & reuse velocity (#21012 )	5 years ago
lilong12	53148e0696	modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802 ) * modify the implementation of save_persistables and save_inference_model functions for fleet collective, test=develop * add ut, test=develop	5 years ago
Bai Yifan	bd8b0ebaba	fix distiller typo, test=develop (#21070 )	5 years ago
ceci3	f62a929151	fix instance norm (#21042 ) * fix instance norm * update unitest,test=develop	5 years ago
lilong12	e249d9a3e2	fix the computation for dx (grad for x) for prelu operation. (#20949 ) * set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop	5 years ago
Huihuang Zheng	e64d55f04e	Add basic Python Cond Layer (#21050 )	5 years ago
Huihuang Zheng	dcf371b685	Disable cudnn_conv in unit tests. (#21080 )	5 years ago
Yiqun Liu	35f17ae28f	Add the check of lod_level between compile-time and runtime. (#20961 ) * Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensiable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop	5 years ago
Tao Luo	78cc1ca616	Split some APIs from nn.py to rnn.py and sequence_lod.py (#21030 ) * split some APIs from nn.py to rnn.py * split some APIs from nn.py to sequence_lod.py test=develop * fix unit-test bug test=develop * fix test_layers unit-test bug test=develop	5 years ago
joanna.wozna.intel	77c2083586	Add transpose2 INT8 for mkl-dnn (#19424 ) * Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdba4c859abb945e062ab13124f70508054, reversing changes made to 2ce6473f144da298aba4a43d46918f27d463cf7c. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd78ca47ae56881161172b2aacd349aba90. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop	5 years ago
juncaipeng	2c07727fb0	delete test resnet50 in post train quantization to avoid timeout error, test=develop (#21081 )	5 years ago
LielinJiang	06063b7001	add op locality_aware_nms, test=develop (#20976 )	5 years ago
liym27	26a6e27afe	fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997 ) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop	5 years ago
Adam	3fda695bb0	Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062 ) * Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop	5 years ago
Huihuang Zheng	1957192f05	Add select_input_op and select_output_op (#21016 ) These ops are useful in control flow.	5 years ago
hong	72e0969b27	fix uniform random (#21009 ) * fix uniform random; test=develop * add uniform random test; test=develop	5 years ago
Wojciech Uss	226bc22a29	Remove fuse_with_relu argument from batch_norm constructor (#21028 ) test=develop	5 years ago
liym27	f0e95a6049	Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017 )	5 years ago
juncaipeng	fa522dffa0	Fix bug in add_quant_dequant_pass, test=develop (#21018 ) * Fix bug for inserting add_quant_dequant_op to same variable repetitively in add_quant_dequant_pass, test=develop	5 years ago
juncaipeng	175ba39c03	Add post_training_quantization (#20800 ) * add post training quantization, test=develop * specify the quantizable op type, test=develop	5 years ago
Leo Chen	008ed65fd5	Add c++ global current tracer for dygraph (#20882 ) * Add c++ global current tracer for dygraph, test=develop * add tracer property in c++, test=develop * support different place, test=develop * add unittest for tracer, test=develop	5 years ago
Huihuang Zheng	4cf96cd307	Add grad_name Property for Class Variable (#20991 )	5 years ago
xujiaqi01	1d1a07937a	simplify master+patch，remove ins when size != merge_size or has conflict slot (#20913 ) * remove duplicate code and duplicate config of master+patch * drop all ins which has conflict slot or size < merge_size * user only need to set merge size，if ins num of same id is not equal to merge size, just drop these ins * user must make sure master data and patch data has no same slot whose feasigns are both non-zero, otherwise these ins will be dropped. (slot list should still be the same of both master and patch) * test=develop	5 years ago
Thunderbrook	5970e8ac5e	find lookup table in order (#20932 ) test=develop	5 years ago
Zhang Ting	de9bec607e	lrn supports channel_last input, test=develop (#20954 )	5 years ago
tangwei12	3b96e3d20a	fix FetchHandler (#20900 ) * bug fix, test=develop	5 years ago
liuwei1031	2dd9bc7159	fix windows python setup issue caused by (#20641 ) (#20928 ) * fix windows python setup issue caused by #20641, test=develop * tweak python/CMakeLists.txt, test=develop * redirect log inside setup.py, test=develop * fix typo, test=develop	5 years ago
Dong Daxiang	a6747a6ef1	add launch_ps module so that we can launch a parameter server trainin… (#20936 ) * add launch_ps module so that we can launch a parameter server training job 1) a user can specify worker_num and server_num 2) parameter server can be killed after all workers exit 3) unit test is added test=develop	5 years ago
Leo Chen	2c3c579b9b	tensor.set() supports array list and remove unused code, test=develop (#20959 )	5 years ago
Leo Chen	9974e40787	Update Tensor.set() to support float16 (#19964 ) * don't expose numerous Tensor.set(), test=develop * fix condition, test=develop * fix float16 bug, test=develop * feed should be Tensor or np.array, not Variable or number, test=develop * use forcecast to copy numpy slice to new array, test=develop * remove float16-uint16 hacking, test=develop	5 years ago
123malin	20cdff0e02	Optimize decay (#20816 ) * update pserver decay blocks * update distributed notify handler	5 years ago
Chengmo	16596f6498	Fix Paddle Cloud role maker (#20860 ) * fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times	5 years ago
liym27	59de8e1214	Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912 )	5 years ago
liym27	7b4cb655bb	keep the size of symmetric padding is 2 for 2d and 3 for 3d. test=develop (#20903 )	5 years ago
Zhang Ting	8d1e9f0f7e	maxout supports channel_last input (#20846 ) * maxout support channel_last input, test=develop * modified details of Input(X) and Attr(groups, axis) in doc, test=develop	5 years ago
WangXi	9d8ec42353	launch.py remove setting for nccl sync, test=develop (#20909 )	5 years ago
hong	8c4573a3cb	GradMaker for dygraph (#19706 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop	5 years ago
Bai Yifan	ac87d4e6e1	fix hdfs.download, test=develop (#20907 )	5 years ago
Thunderbrook	59bcdc8a19	support dump param of model into afs (#20302 ) * support dump param to afs test=develop * code style test=develop * code style test=develop * dump param test=develop * dump param test=develop * dump param test=develop * dump param test=develop	5 years ago
qingqing01	728099d023	Simplify third_party path in Python package (#20776 ) * Simplify third_party path in Python package * Fix typo	5 years ago
Yiqun Liu	16e4d02675	Refine the cache of program, context and scope in executor. (#18483 ) * Refine the cache of program, context and scope in executor. test=develop * Refine the unittest test_executor_and_use_program_cache. * Add the test the PaddingRNN with use_program_cache=True. test=develop * Remove a check. test=develop * Refine the unittest to check whether it is correct when setting use_program_cache=True. test=develop	5 years ago
Wilber	b489760099	fix jit_matmul bug test=develop (#20886 ) * fix jit_matmul bug * update jit matmul and add test	5 years ago
gongweibao	3255fe69bb	Add custom black variable name set in amp interface. (#20875 ) * add custom black varname test=develop * fix dtype test=develop * fix num test=develop * fix ut test=develop * fix coverage test=develop * fix blackvar names test=develop	5 years ago
lvmengsi	aadd81b662	Fix gradients (#20857 ) * fix_gradients * fix_gradients, test=develop	5 years ago
hong	ff0886a92a	save load problem fix and new feature add (#20823 ) * fix persistable; * fix save load bugs; test=develop * fix bug; test=develop * add example for new io api; test=develop * addd example; test=develop	5 years ago
Youwei Song	2058bab1c0	Add Sequential api (#20789 ) * add Sequential api test=develop * fix unittest test=develop * refine code sample * test=develop	5 years ago
liym27	6802539a2e	support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780 ) * improve split and concat op: 1. support Tensor for argument 'dim' in split op. 2. support Tensor for argument 'axis' in concat op. test=develop * redefine function GetDataFromTensor and set unknown output shape to - 1. test=develop * add check: Attr(sections) match Input(X). test=develop * support Tensor for attr(sections) and attr(sections) can contain -1. add check for attr(sections). test=develop * modify error message for concat and call Resize only when necessary. test=develop	5 years ago
Yiqun Liu	6fcfd32e6c	Check and correct the output's lod_level in DynamicRNN related operators (#19144 ) * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime. test=develop * Add comment for ReorderLoDTensorByRank op. * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time. test=develop * ShrinkRNNMemory op should call ShareLoD for compile time. test=develop * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool. test=develop * Refine the unittest of DynamicRNN. test=develop * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE. test=develop	5 years ago
Zeng Jinle	da9e9dd07f	fix py_reader combination ut, test=develop (#20861 )	5 years ago
liym27	84d221b667	improve unsqueeze op to support int, Tensor for argument axes (#20824 ) * improve unsqueeze op to support int, Tensor and Tensor list for argument axes. test=develop * call Resize only when necessary. test=develop	5 years ago
silingtong123	03d7f3ddb2	Make shape tensor support int32 (#20757 ) * Make shape tensor support int32	5 years ago
Huihuang Zheng	95ba4bd2ab	Add shape and type check at read_op (#20754 )	5 years ago
Aurelius84	aacd16dbb4	add pyramid_hash_op (#20698 )	5 years ago
Yang Zhang	cf670ec9ce	Serialize to pickle format (#20820 ) test=develop	5 years ago
whs	c8e49be2f1	Fix roi_perspective_transform op (#20764 )	5 years ago
Bai Yifan	6bdf99d37a	fix dcn doc about Mask introduction, test=develop, test=document_fix (#20836 )	5 years ago
Bai Yifan	fd5321b3f3	modify slim print precision to round(,6), test=develop (#20833 )	5 years ago
WangXi	e78d7f57bb	Print the rank which trainer is error in launch.py, test=develop (#20838 )	5 years ago
xujiaqi01	48669aa8f0	fix several sparse table issuses (#20686 ) * no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto. * add find_distributed_lookup_table_grads instead of hard code GRAD * support embedding stop gradient. push sparse has error before fix this.* * fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this. * fix pull sparse, skip slots which do not have embedding. * fix collect feasign label info, skip slots which do not have embedding. * support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables. * test=develop	5 years ago
whs	fa67e6e83e	Fix unitest of pruning in python3 env. (#20825 ) test=develop	5 years ago
Zeng Jinle	378fc4fb1c	add some docs to jit.trace, test=develop (#20811 )	5 years ago
Zhang Ting	5a8d885d72	All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756 ) * All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview * fix the bug that attr(offsets) should be initialized, test=develop	5 years ago

10017 Commits (a90fa540926d832056aa1a0581372ad677dd960f)