Paddle

Commit Graph

Author	SHA1	Message	Date
Huihuang Zheng	1cbb282d77	Add Retry Logic to CublasHandlerHolder Add Retry Logic to CublasHandlerHolder to avoid random unittest failure.	4 years ago
yukavio	96934b7430	fix flops (#29758 ) * fix flops * fix flops	4 years ago
liym27	41a7b07159	[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created (#29769 )	4 years ago
LielinJiang	e5af650b71	Add double grad for conv_transpose (#29706 ) * add double grad for conv_transpose	4 years ago
Leo Chen	224f3bcbb1	format code (#29714 )	4 years ago
huangxu96	97e29411eb	fix a bug in multi_precision_fp16 unittest. (#29756 )	4 years ago
LoveAn	2e5b4a216c	Optimize compilation time with Unity Build (#29733 ) * Test compilation time with less parallel count, notest, test=windows_ci * optimize rules of Unity Build, notest, test=windows_ci, test=windows_op * limit parallel counts used only on GPU, test=develop * remove limit of argument /m:8 on Windows, test=develop	4 years ago
Zhang Jun	0c23ba95d8	enable MakeCiper api for inference;test=develop (#29692 )	4 years ago
wangchaochaohu	7b2dc4e6b1	optimization for fp16 elementwise add (#29744 )	4 years ago
chalsliu	27bdbec7fc	Refine precision test print message	4 years ago
chalsliu	e63a68feac	Retry when download failed for precision test	4 years ago
Jacek Czaja	07790ba13e	[oneDNN] Reimplemented elementwise_add grad (#29747 ) * - Reimplemented elementwise_add grad - lint * - fix after review * - Fix to fix after review	4 years ago
Wojciech Uss	6ef8129dcc	upgrade oneDNN with GRU INT8 optimizations (#28420 ) * upgrade oneDNN with GRU INT8 optimizations * fix test	4 years ago
Huihuang Zheng	dfffee8a5d	[Dy2stat] Enable jit.save to Save Without Running (#29579 ) Enable jit.save to Save Without Running.	4 years ago
Aurelius84	17c8e3adfe	Polish code in gpu_launch_config.h (#29730 )	4 years ago
wangchaochaohu	068d905e1e	fix the shape choose of vectorize for cuda	4 years ago
liym27	a0b60716f1	[Dy2Stat] Support grammar: for ele in var[idx] (#29541 ) Support to transform "for ele in var" stms in which var is a slice of Tensor.	4 years ago
chentianyu03	b59b6d7ae6	Complex op test (#29753 ) * delete no need to calculate inputs in dygraph op_test * delete no need to calculate inputs in dygraph op_test	4 years ago
liym27	096c048b45	Fix unittest test_slice (#29740 ) Before this commit, test_slice used the old API `dygraph_to_static_func` for Dynamic-to-Static and used Executor explicitly, which is not recommended to users. After the fix, it uses the recommended API `paddle.jit.to_static` in place of `dygraph_to_static_func`, which won't trigger the random exception on coverage CI.	4 years ago
syyxsxx	7c2affaa26	fix isfinite_v2_op OpProtoAndCheckerMaker AddComment bug (#29626 ) fix isfinite_v2_op OpProtoAndCheckerMaker AddComment bug	4 years ago
Huihuang Zheng	2e788bd81e	Reduce batch size ot fix CPU memory, test=develop (#29736 ) Unit test reported memory not enough on CPU machines. Reduce batch size again.	4 years ago
石晓伟	8bd2879ef7	update the operator registration for incompatible upgrade, test=develop (#29720 )	4 years ago
LielinJiang	10edfb6f21	Update en docs of to_tensor (#29718 ) * update to_tensor en docs	4 years ago
chentianyu03	71063b8137	add conj op for complex types (#29527 ) * add conj op for complex types * add conj for complex types * add more test case * add conj_op test * modify conj api and impl * add complex type for fill_constant_op xpu * add setConstant for complex type * remove complex conj test file * user define grad for test_conj_op * add test case for static mode of conj api * modify conj doc * change input args name to x * remove useless codes * conj support real types * add conj test case for real number	4 years ago
Wilber	b593d588aa	[Inference] EnableUseGpu has higher priority than flags (#29697 ) * enable_use_gpu has higher priority than FLAGS * update.	4 years ago
WangXi	9cbcc6cadc	fleet sync build strategy, test=develop (#29732 )	4 years ago
tianshuo78520a	638ccaabf4	fix ubuntu docker error (#29719 )	4 years ago
wanghuancoder	0c59ad2a1a	Windows generate pdb and dump, for debug (#29628 ) * Windows generate pdb and dump, for debug * fix code style, test=develop * modify cmakelist	4 years ago
Huihuang Zheng	4c4d4ba5e0	Modify CublasHandleHolder to Fix Random Unittest Failure. test=develop (#29617 ) Modify CublasHandleHolder from using PADDLE_ENFORCE_CUDA_SUCCESS to PADDLE_RETRY_CUDA_SUCCESS to fix random unittest failure. We checked that the unittest log showed CUDA allocation error at this file, which may due to GPU not enough. We fixed similar failure in the past, so we applied PADDLE_RETRY_CUDA_SUCCESS here.	4 years ago
Chen Weihang	6cfa59de1b	[Complex] Add real & imag op and api for complex tensor (#29672 ) * add complex real op & api & unittest * add imag op & api & unittest * refactor op impl * revert simplify writing due to complile failed * polish details * polish grad op code	4 years ago
Jacek Czaja	9eff1a674f	Added missing format of oneDNN (#29670 )	4 years ago
LiuChiachi	572810eecb	Update EarlyStopping sample code (#29723 ) * update EarlyStopping doc * update EarlyStopping doc, test=document_fix	4 years ago
wangchaochaohu	2e0d1ed00f	delete the code for fp16 optimization because it is not faster than common template code (#29715 )	4 years ago
LoveAn	bb5a7854f3	Add approval monitor for unity_build_rule.cmake (#29701 ) * Add approval monitor for unity_build_rule.cmake, test=develop * fix words spell error, test=document_fix	4 years ago
Qi Li	7684b91817	[GO] add two cgo api, test=develop (#29659 )	4 years ago
TTerror	af8ded773a	update activation op on kunlun (#29577 ) * fix expand && concat/transpose to new api * update xpu_header * update activation op on kunlun * update activation op on kunlun * update activation op on kunlun * update activation op on kunlun * update activation op on kunlun * add nearest_interp on kunlun * update error message	4 years ago
ceci3	cc387159f3	add pad and concat double grad (#29549 ) * add constant pad double grad	4 years ago
liuyuhui	f13c3a9cd7	[Kunlun] PR1:Support one Kunlun card training in parallel executor (#29337 )	4 years ago
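Y_Xuan	76738504ad	Add ROCm platform support code (#29342 ) * Add ROCm platform support code * Fix some issues * Resolve some ambiguities and add comments * Fix code formatting * Modify code after resolving conflicts * Modify operators.cmake * Fix formatting * Fix errors * Unify interfaces * Update dates	4 years ago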
huangxu96	b96dada4f0	add static.amp into setup.pu.in (#29621 ) * add static.amp into setup.pu.in * add unittest for api	4 years ago
Zhang Ting	1e9127f688	improve dropout grad (#29605 ) * improve grad perf	4 years ago
wangchaochaohu	eab44e1f32	refine (#29622 )	4 years ago
YUNSHEN XIE	d0b789d27f	disable ut test_cumsum_op (#29613 )	4 years ago
Jack Zhou	84bae27779	fix wmt14 doc, remove backward, add bidirect direction in rnn api (#29633 ) * fix wmt14 doc, remove backward, add bidirect direction in rnn api * fix rnn unittest * fix test_rnn_nets_static.py bug	4 years ago
WangXi	613c46bc07	fix gen_nccl_id_op_helper compile failed, test=develop (#29614 )	4 years ago
chen zhiyu	f5f8809c1a	1. add python version selection 2.add dynamic flags setting. (#29612 )	4 years ago
YUNSHEN XIE	2926e74326	New UT should not exceed 15s (#29492 ) * added UT should not exceed 15s * fix error * UT limit of 15s is the first to be executed * fix error * fix error with CI_SKIP_CPP_TEST * modfied tiemout setting * fix error	4 years ago
Chen Weihang	f02aece1f0	Add complex dtype op (add) test example (#29603 ) * add op test case for complex * polish code details * add xpu set constant support * fix argument rror * remove useless pyc file	4 years ago
AshburnLee	efea540ca9	Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732 )	4 years ago
lijianshe02	7779768b53	add transpose double grad test=develop (#29600 ) * add transpose double grad test=develop	4 years ago

29479 Commits (1cbb282d7774539a809d32f45bb9b443f56485a7)
