* reopen python coverage --include for test, test=develop
* if no .py file is modified, do not use coverage run, test=develop
* remove test code, test=develop
* add WITH_INCREMENTAL_COVERAGE, test=develop
* refine if else, test=develop
1. Type of index: int, slice (step must be 1).
2. Type of value:
(1) int32, int64, float32, bool;
(2) numpy.array(int32, int64, float32, bool); (note: float64 is not supported)
(3) paddle.Tensor(int32, int64, float32, float64, bool);
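As a sketch, the index/value combinations listed above behave like their numpy counterparts (numpy is used here purely for illustration; Paddle's `Tensor.__setitem__` is the real subject):

```python
import numpy as np

x = np.zeros([4, 4], dtype=np.float32)

# 1. int index
x[0] = 1.0
# 1. slice index -- the rules above require step == 1
x[1:3] = 2.0

# 2. value may be a scalar of a supported dtype, or a numpy array
# (float64 arrays are noted above as unsupported)
x[3] = np.ones([4], dtype=np.float32)

assert float(x[0, 0]) == 1.0
assert float(x[2, 1]) == 2.0
assert float(x[3, 3]) == 1.0
```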
* add heter box
* add trainer, worker, wrapper...
* format
* for ci
* format
* remove boost get
* boost & copyright
* rename
* rename
* format
* format
* format
Co-authored-by: yaoxuefeng6 <yaoxuefeng@baidu.com>
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* use user-defined grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
* delete no-need-to-calculate inputs in dygraph op_test
* delete no-need-to-calculate inputs in dygraph op_test
* modify grad of mul for complex types
* fix the grads of inputs args order not match bug
* merge amp-related functions in Momentum from paddle.fluid.contrib.optimizer into paddle.optimizer.
* Add unittest for 2.0 Momentum API.
* fix some bugs in weight_decay.
Before this commit, test_slice used the old API `dygraph_to_static_func` for dynamic-to-static conversion and used Executor explicitly, which is not recommended for users.
After the fix, it uses the recommended API `paddle.jit.to_static` to replace `dygraph_to_static_func`, which won't trigger the random exception on coverage CI.
* add conj op for complex types
* add conj for complex types
* add more test case
* add conj_op test
* modify conj api and impl
* add complex type for fill_constant_op xpu
* add setConstant for complex type
* remove complex conj test file
* use user-defined grad for test_conj_op
* add test case for static mode of conj api
* modify conj doc
* change input args name to x
* remove useless codes
* conj support real types
* add conj test case for real number
* add complex real op & api & unittest
* add imag op & api & unittest
* refactor op impl
* revert simplified writing due to compile failure
* polish details
* polish grad op code
* fix expand && concat/transpose to new api
* update xpu_header
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* update activation op on kunlun
* add nearest_interp on kunlun
* update error message
* newly added UTs should not exceed 15s
* fix error
* the 15s UT limit check is executed first
* fix error
* fix error with CI_SKIP_CPP_TEST
* modified timeout setting
* fix error
1. Fix an error in _build_cond_stmt for for-range statements.
2. Support negative step values in for-range statements.
3. Fix code to handle the differences between Py2 and Py3.
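The negative-step fix in item 2 can be illustrated in plain Python: the loop condition built for a for-range statement must depend on the sign of the step (the helper names below are illustrative, not Paddle's actual functions):

```python
def build_cond(stop, step):
    # For a positive step the loop runs while i < stop;
    # for a negative step it must run while i > stop.
    if step > 0:
        return lambda i: i < stop
    return lambda i: i > stop

def run_range(start, stop, step):
    i, out = start, []
    cond = build_cond(stop, step)
    while cond(i):
        out.append(i)
        i += step
    return out

# Matches Python's range() for both step signs
assert run_range(0, 5, 1) == list(range(0, 5, 1))
assert run_range(5, 0, -2) == list(range(5, 0, -2))
```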
Fix 3 Windows Unittests
test_fuse_all_reduce_pass: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_feed_data_check_shape_type: Paddle cannot run multiple-GPU on Windows so set single visible GPU flag
test_tsm: Windows GPU memory is not enough, so decrease batch size and data size.
* Fix a bug when running on an operating system without "bash."
* add execution condition
* for ci-coverage
* get cpu information to check the precision problem
* Update compilation environment for musl version
* update dependencies
* remove test code
check cpu info
remove test code
review
* update alpine and third_party dependencies
* add newline for ci Code format
* Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr.
test=develop
* Fix api doc for print in label_smooth.
test=develop
* Update api docs according to review comments.
Add name argument in RNN back.
test=develop
* add complex64 and complex128 types; add +-*/@ and slice operators for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* kron, reshape, transpose support complex types
* sum and trace op support complex types
* add test case of sum and trace op
* fix the bug that the imaginary part of complex tensors was not initialized
* format file
* format code style
* kron support type promotion; modify test cases
* basic impl of type promote
* add comment & another testcase
* fix complex bugs & support python op promote type
* fix failed unittests & polish code
* add unittest for coverage
* change to only promote complex type
* polish code details
* polish several comments
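The "only promote complex type" rule above can be illustrated with numpy's analogous promotion behavior (numpy is only a stand-in for illustration here; Paddle's promotion logic is the actual subject):

```python
import numpy as np

# Mixing a real operand with a complex operand promotes the
# result to the complex type, preserving precision width.
a = np.ones(2, dtype=np.float32)
b = np.ones(2, dtype=np.complex64)
assert (a + b).dtype == np.complex64
assert (a * b).dtype == np.complex64
```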
Usage scenarios: if a function can run successfully in static mode, you can use this to decorate that function in the following cases:
1. An unknown error occurs in the dynamic-to-static conversion process of the function;
2. In the internal implementation of the function, it has two branches: dynamic branch and static branch;
3. Users don't want to convert the function in the process of dynamic to static.
* add the weight decay func for the momentum op
* Add the multi_precision function in Momentum Optimizer.
* Make sure that the initial values of master weights are the same as the fp16 weights.
* add static loss scaling.
* add the rescale_grad function in the pure fp16 training.
* use the original momentum updating method.
* Polish some codes, such as variable names.
* add docstring for apis.
* update the var creation details of _create_master_weight.
* not modify codes about imperative momentum updating.
* Fix the error of test_dist_sparse_tensor_load_momentum UT.
* add unit test for multi precision fp16 training.
* add more unit tests for CI.
* Use lower threshold values for allclose comparison in test_multi_precision_fp16_train UT.
* For CI Coverage Checking.
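The master-weight idea behind the multi-precision training above can be sketched in numpy (illustrative only; `momentum_update` below is a hypothetical helper, not Paddle's actual kernel):

```python
import numpy as np

def momentum_update(param16, master32, velocity, grad16, lr=0.1, mu=0.9):
    # Keep an fp32 "master" copy of each fp16 parameter, apply the
    # momentum update in fp32, then cast back to fp16.
    grad32 = grad16.astype(np.float32)
    velocity[:] = mu * velocity + grad32
    master32[:] = master32 - lr * velocity
    param16[:] = master32.astype(np.float16)

param16 = np.ones(4, dtype=np.float16)
# Master weights are initialized from the fp16 weights, as described above
master32 = param16.astype(np.float32)
velocity = np.zeros(4, dtype=np.float32)
grad16 = np.full(4, 0.5, dtype=np.float16)

momentum_update(param16, master32, velocity, grad16)
assert param16.dtype == np.float16 and master32.dtype == np.float32
# velocity = 0.9*0 + 0.5 = 0.5; master = 1 - 0.1*0.5 = 0.95
assert np.allclose(master32, 0.95)
```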
* add fp16 for layer_norm op
* revert layernorm api
* fix forward
* fix forward
* fix backward for layernorm with fp16
* fix unit test for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
* 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
* fix with_mkldnn compile error for layernorm with fp16
* fix with_mkldnn compile error for layernorm with fp16
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
This PR fixes several problems in dy2stat for the Deoldify model in PaddleGan.
1. In the model, the engineer wrote `if x.shape == y.shape`. The Tensor shape is a tuple in dygraph, so `==` returns True/False, but in static graph `==` becomes an element-wise comparison, which is a different behavior. In this PR we reduce the element-wise comparison result to a single boolean.
2. If the engineer writes computations that use parameters in hooks, the static graph can lose the parameter variable, because we put param_guard at the forward of a Layer. In this PR we made param_guard cover pre-hooks and post-hooks.
3. In PaddleGan, the engineer calculated some parameter values in __init__ by running dygraph code. That code also runs during dy2stat, so some variables may be assigned as a VarBase (Tensor) first and then as a Variable, which raised an error. We fixed this bug in this PR by handling the case.
TODO: We only added a test case for 1 (shape comparison). Test cases for 2 and 3 should be added, but since we are chasing 2.0RC, I will do it in a near-future PR.
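The shape-comparison pitfall (item 1) can be sketched in plain Python/numpy, where numpy arrays stand in for static-graph shape tensors (illustrative only, not Paddle's actual implementation):

```python
import numpy as np

# In dygraph, x.shape is a Python tuple/list, so == yields a single bool.
assert (2, 3) == (2, 3)

# In a static graph, shapes become tensors and == is element-wise,
# so the result must be reduced back to a single boolean.
shape_x = np.array([2, 3])
shape_y = np.array([2, 3])
elementwise = shape_x == shape_y   # array([ True,  True])
assert bool(elementwise.all())     # reduce, as the fix described above does
```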
* add complex64 and complex128 types; add +-*/@ and slice operators for complex types
* add test cases for complex elementwise, matmul and getitem unittest
* add test cases for complex types
* add test cases for complex matmul unittest
* Expose the leaf tensor concept and the gradient accumulation of leaf tensors
* fix coverage
* fix api doc
* fix CI unittest
* fix CI unittest
* fix unittest
* empty tensor doesn't need inner_var_
* fix some error message
* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.
* Add a new attribute `_inplace_version` for VarBase.
* Raise exception if an inplace operation can result in incorrect gradient computation.
* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.
* For api assign, call _bump_inplace_version() when it's an inplace operation in dynamic mode.
* Use original var_wrapper if the inplace_version is not changed.
* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performance.
* Fixed a variable name error
* Add comments
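A minimal Python sketch of the inplace-version bookkeeping described above (illustrative only; Paddle's TensorInplaceVersion lives in C++, and the classes below are simplified stand-ins):

```python
class TensorInplaceVersion:
    """Counts how many times a tensor has been modified inplace."""
    def __init__(self):
        self._version = 0

    def bump(self):
        self._version += 1

    @property
    def version(self):
        return self._version


class VarBase:
    def __init__(self, data):
        self.data = list(data)
        self._counter = TensorInplaceVersion()
        # Version observed when the backward graph was built
        self._snapshot = self._counter.version

    @property
    def _inplace_version(self):
        return self._counter.version

    def _bump_inplace_version(self):
        self._counter.bump()

    def assign_(self, data):
        # Inplace op: modify the data, then bump the version
        self.data[:] = data
        self._bump_inplace_version()

    def check_grad_safe(self):
        # Raise if an inplace op happened after the snapshot,
        # since the gradient computation would then be wrong.
        if self._inplace_version != self._snapshot:
            raise RuntimeError("tensor was modified by an inplace operation; "
                               "gradient computation would be incorrect")


v = VarBase([1, 2])
assert v._inplace_version == 0
v.assign_([3, 4])
assert v._inplace_version == 1
try:
    v.check_grad_safe()
    raised = False
except RuntimeError:
    raised = True
assert raised
```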
* Move member functions of TranslatedLayer out of function
* edit code according to review
* Edit input argument of '_run_static_graph'
* reset due to Segmentation fault
* rename variables when stitching graph
* modify code according CI
* Add comments to '__i_m_p_l__'
* remove blanks before 'Get...'
* edit code according to review
* Add a comment to '_execution_method_creator'
* Edit a comment to '_execution_method_creator'
* Generate code coverage reports only for incremental files, test=develop
* Generate code coverage reports only for incremental files, test=develop
* Generate code coverage reports only for incremental files, test=develop
* test for diff python file, test=develop
* fix no python diff report, test=develop
* add cc test file, test=develop
* fix bug in generic.cmake, test=develop
* for debug no cc report, test=develop
* change compare branch from test_pr to test, test=develop
* fix bug, test=develop
* test for h file changed, test=develop
* debug for redefinition of argument optimize error, test=develop
* close -O3 for test, test=develop
* remove -O3 for test, test=develop
* remove coverage option for nvcc, test=develop
* use CMAKE_CXX_FLAGS open coverage option when header file changed, test=develop
* reopen -O3, test=develop
* remove debug code, test=develop
* remove unused code, test=develop
test_mnist failed on CUDA11. After debugging, we found that it is due to PaddleInference IR Optimization. We disable it in this PR and will re-enable it after PaddleInference fixes it.
GridGenerator model failed because the output shape of `linspace` is (-1). The reason is that C++ InferShape fixes the shape to (-1):
5da3d514eb/paddle/fluid/operators/linspace_op.cc (L49)
We cannot set the shape in C++ InferShape because this Tensor may not be initialized during compile time, but when input `num` of `linspace` is an integer, we know the shape at compile time. This PR simply sets the shape in Python and adds GridGenerator as a unittest.
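The Python-side shape fix can be sketched as follows (`infer_linspace_shape` is a hypothetical helper for illustration, not Paddle's actual code):

```python
def infer_linspace_shape(num):
    # C++ InferShape cannot know the value of `num` at compile time,
    # so it leaves the output shape as [-1]. On the Python side, when
    # `num` is a plain integer the shape is known and can be set.
    if isinstance(num, int):
        return [num]
    return [-1]   # num is a Tensor whose value is unknown at compile time

assert infer_linspace_shape(5) == [5]
assert infer_linspace_shape(object()) == [-1]
```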
* add reducer
* refine event for memory copy
* add concat&split for allreduce
* apply concat & split for fuse tensor
* fix nccl dep
* fix the unittest, compile problem and ddp initialize problem
* fix unittest for mac & add some comments & solve the repeated param in sublayers
* fix unittest for windows & fix document