Paddle

Commit Graph

Author	SHA1	Message	Date
zhaoyuchen2018	4a544762a2	Add Asypadding for conv fusion. (#21041 ) * Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.	6 years ago
hong	8c4573a3cb	GradMaker for dygraph (#19706 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * optimize grad maker; test=develop * optimize grad maker * test * grad make optim; test=develop * fix unittest bugs; test=develop * add dygraph grad op maker and split_op * grad op maker refactor; test=develop * add dygraph grad maker; test=develop * fix op deformable_conv_v1_op bug; test=develop * fix deformable_conv prroi pool bugs; * fix new op grad op maker bug; test=develop * fix split by ref bug; test=develop * fix dygraph auto prune bug; test=develop * fix test_trace bug; test=develop * fix fused emb seq pool bug; test=develop * remove useless code in op_desc file; test=develop * remove useless code, StrVarBaseNode; test=develop * fix review issues; test=develop * fix rank_loss grad maker; test=develop * remove flag in VarBase; test=develop * fix distributed_notify_op compile bug ; test=develop * fix reshape op double grad; test=develop * fix expand as op; test=develop * add impertive type_defs.h for demo_train; test=develop * fix inference lib cmake; test=develop * fix inference lib; test=develop * fix infernce_lib; test=develop * fix inference cmake; test=develop * fix inference lib; test=develop * fix inference lib; test=develop * remove condition dygraph grad maker, modify local name; test=develop * fix split grad maker bug; test=develop * fix pyramid_op bug; test=develop * change travis time out limit; test=develop * restore travis; test=develop * change timeout limit; test=develop	6 years ago
Yiqun Liu	03ba0fdae6	Move the codes of fused operators to operators/fused directory. (#20881 ) * Move the codes of fused operators to operators/fused directory. test=develop * Correct the op name in cmake. * Change the use of PADDLE_ENFORCE. test=develop	6 years ago
Chen Weihang	26cc1fe508	Replace risky GetInputType method with secure IndicateVarDataType interface (#20668 ) * replace part of the old implementation, test=develop * restore concat op, test=develop * update all ops implemention & delete GetDataTypeOfVar func, test=develop	6 years ago
Zeng Jinle	4922eb6da5	make_conv_workspace_size_configurable, test=develop (#20662 )	6 years ago
qingqing01	01eddc1a04	Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636 ) * Support fp16 in fused_elemwise_activation_op. * Fix unit testing in ONLY-CPU mode.	6 years ago
Zeng Jinle	48029ab06c	Remove some DefaultGradOpDescMaker (#20185 ) * remove fc_grad, test=develop * remove fsp op since no unittests, test=develop	6 years ago
Yiqun Liu	3cd985a669	Add a pass to fuse fc+elementwise_add+layernorm (#19776 ) * Add fc_elementwise_layernorm_fuse pass and unittest. * Add fused_fc_elementwise_layernorm op and its GPU kernel. test=develop * Apply fc_elementwise_layernorm_fuse_pass to GPU inference. * Add the setting of attrs in the definition of binary_op. test=develop * Add comment. * Implement the unittest. test=develop * Change the unittest name of layer_norm. test=develop	6 years ago
翟飞跃	93c85c930a	Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop * tmp add fused_emb_seq layer * Add the support of padding_idx attribute. test=develop * add padding_idx support test=develop * implement grad refer lego test=develop	6 years ago
Yiqun Liu	a65c728e5d	Implement the GPU kernel of fc operator (#19687 ) * Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop	6 years ago
Tao Luo	d6c85c96dc	paddle::framework::vectorize() templatization (#19627 ) test=develop	6 years ago
翟飞跃	2e3ee57954	Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop	6 years ago
Yihua Xu	b920395842	Use sparse matrix to implement fused emb_seq_pool operator (#19064 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * Ignore the deprecated status for windows test=develop	6 years ago
Leo Chen	80eab822c1	Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166 ) * remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR(), test=develop * remove SplitIdsOpGradMaker since it is buggy and not tested, update spec file, test=develop	6 years ago
石晓伟	ee2f296ef8	Fusion: seqpool_cvm_concat (#18471 ) * add fusion_seqpool_cvm_concat test=develop * simplify pass, test=develop * fix code style, test=develop	6 years ago
翟飞跃	802ea50956	fix spelling errors (#17941 ) * fix spelling errors; test=develop * Update API.spec update md5 * Update API.spec * change the order of api;test=develop	6 years ago
Yiqun Liu	8fd39f3e99	Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236 ) * Enhance fused_elementwise_activation op. test=develop * Move the api fused_elementwise_activation to contrib. test=develop * Add including files. test=develop * Add the support of sigmoid in fused_elementwise_activetion op. * Update API.spec. test=develop	6 years ago
Zeng Jinle	0c335dcd2c	Make conv cudnn workspace size configurable (#17036 ) * make_conv_cudnn_ws_size_configurable, test=develop * change std::max to std::min test=develop	6 years ago
sneaxiy	2c836ff914	check default grad maker test=develop	6 years ago
Qiyang Min	c7f1f3ed0c	Merge pull request #16214 from velconia/imperative_infer_var_type Implement imperative infer var type	6 years ago
minqiyang	b40e41fbd1	Polish code style test=develop	6 years ago
luotao1	d9f0e7252a	refine with comments test=develop	6 years ago
luotao1	721c2c00ef	refine fc_infershape test=develop	6 years ago
minqiyang	ca392c7e97	Implement infer var type context	6 years ago
luotao1	fe78a92e6e	refine with comments test=develop	6 years ago
luotao1	5d20954ac4	add runtime shape for fuse_emb_seq_pool_grad test=develop	6 years ago
luotao1	8f6597aa0e	Merge branch 'develop' into infershape_example	6 years ago
luotao1	31ccaf0916	add all_kernels_must_compute_runtime_shape example for speedup infershape test=develop	6 years ago
tensor-tang	14a764c930	simplify the jitkernel templates and tests test=develop	6 years ago
tensor-tang	802f362ac4	unify the kernelfuncs cache and add unit test test=develop	6 years ago
tensor-tang	41a1270856	add vbroadcast jitkernel refer code and use it test=develop	6 years ago
luotao1	34404f9c31	refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool test=develop	6 years ago
tensor-tang	e1c707fe9c	fix warnings (#15790 ) * fix warnings test=develop * fix enforce test test=develop	6 years ago
tensor-tang	a3a3d3d861	add embseqpool jitkernel mkl impl and use it test=develop	6 years ago
tensor-tang	18bff5298d	extract fused_emb_seq_pool forward function test=develop	6 years ago
tensor-tang	ba02ac4692	use mat attr and refine test (#15448 ) * use mat attr and refine test test=develop * add matmul jitcode test=develop * fix mac compile test=develop	6 years ago
chengduo	f8f91fb4b3	Revert conv transpose cudnn (#15514 ) * Revert "set constant for loss" This reverts commit 167933f678ccbb3563e949710279efe004a27731. * Revert "remove workspace_handle" test=develop This reverts commit b4aca8ede9e685bce1dfb1c59e63919f33432572.	6 years ago
chengduo	5a8bd82c0c	Remove workspace_handle (#15376 ) * remove workspace_handle test=develop * set constant for loss test=develop	6 years ago
tensor-tang	d618e48309	fix fuse square mat order and refine test test=develop	7 years ago
tensor-tang	38de1ff472	add fusion squared mat sub op	7 years ago
tensor-tang	f347d6e4a1	add repeated fc relu unit test test=develop	7 years ago
tensor-tang	99010e6eae	init repeated fc relu op	7 years ago
tensor-tang	8e086a8521	follow comment and fix typo test=develop	7 years ago
tensor-tang	f702f8fd10	Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat	7 years ago
tensor-tang	316636404f	add seqpool concat unit test	7 years ago
tensor-tang	7923d7271f	add fusion seqpool concat op	7 years ago
minqiyang	0f94c1ac14	Polish code test=develop	7 years ago
minqiyang	c09a379015	remove const_cast test=develop	7 years ago
minqiyang	db8eb9b688	Polish code test=develop	7 years ago
minqiyang	39b98709b1	Move fused ops to fused dir test=develop	7 years ago

67 Commits (b6ce4f8b2fa85304cc3b95299d82212e90c663d7)