Paddle

Commit Graph

Author	SHA1	Message	Date
hjchen2	2c2a192eb1	Resolve merge conflicts test=develop	7 years ago
Yiqun Liu	8bc1c5d2ab	Implement the Tensorrt plugin for elementwise op (#14487 ) * Initialize the elementwise plugin. * Implement the basic CUDA kernel of elementwise plugin. test=develop	7 years ago
tensor-tang	7aa3aff338	Merge pull request #14465 from tensor-tang/fea/jit/exp jitcode act support all size	7 years ago
Tao Luo	1b894e495f	Merge pull request #14437 from jczaja/prv-softmax-mkl Introducing MKL to softmax for inference	7 years ago
dengkaipeng	bb2b35c85e	Add python example for resize_nearest. test=develop	7 years ago
chengduo	a94a7355f0	Refine the GraphNum check (#14144 ) * refine GraphCheck test=develop * fix ci fail test=develop	7 years ago
Yihua Xu	a906a361be	Add the macro for NVCC (test=develop)	7 years ago
Yihua Xu	d91740acb1	Revert "Remove the remnant code (test=develop)" This reverts commit `be50670348`.	7 years ago
Yihua Xu	be50670348	Remove the remnant code (test=develop)	7 years ago
minqiyang	a2fce6daf2	Polish code test=develop	7 years ago
minqiyang	6a017d9abe	Remove numpy's requirements or python3.7 will not be supported test=develop	7 years ago
hjchen2	1622cb9937	Fix alpha tensor key	7 years ago
tensor-tang	48be9dc3e1	Merge pull request #14489 from tensor-tang/api/example add api example of brelu, leaky_relu and soft_relu	7 years ago
hjchen2	a8c077df7c	Implement leaky relu tensorRT converter	7 years ago
qingqing01	9eefd2c766	Modify some infer-shape about detection operators in compile-time. (#14483 ) * Modify some infer-shape in compile-time.	7 years ago
Tao Luo	cf685f361b	Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections [WIP] Correcting and extending MKLDNN residual connection fuse pass	7 years ago
tensor-tang	e3645c2708	add api example of brelu, leaky_relu and soft_relu test=develop	7 years ago
Yihua Xu	f4c869d872	Optimize the layer_norm operator with AVX intrinsic function (#14417 ) * Optimize layer_norm operator with AVX intrinsic functions * Revert the wrong modifications * Implement the jit kernel for layer_norm operator * Add math headfile to fix the compile issue (test=develop) * Add math headfile to fix the compile issue (test=develop) * Fixed the intrinsic headfile issue (test=develop) * Fix the conflicts (test=develop) * Revert for CUDA compiler (test=develop) * Fixed the cuda depency (test=develop) * Fix the marco issues (test=develop)	7 years ago
Houjiang Chen	816b464037	Merge pull request #14486 from hjchen2/develop Fix tensorrt plugin cmake dependency, test=develop	7 years ago
Yu Yang	f1a392a5fe	Merge pull request #13804 from sneaxiy/rewrite_allocation Rewrite allocation	7 years ago
Yihua Xu	f418f552df	Merge branch 'develop' into develop_7a64d48f5_stack_opt (test=develop)	7 years ago
minqiyang	a5249385a3	Fix ssl and yum install problem test=develop	7 years ago
hjchen2	2825685f2a	Fix tensorrt plugin cmake dependency, test=develop	7 years ago
Superjomn	e878a8e885	update test=develop	7 years ago
qingqing01	fd7e643153	Convolution fusion operator. (#14449 ) * Convolution fusion operator. * Clean code test=develop	7 years ago
Yu Yang	98bbfc17be	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation test=develop	7 years ago
Yu Yang	38143e5aca	Clean unused changes test=develop	7 years ago
Wojciech Uss	d36491c28a	add allocator.h copy The allocator.h header file is required for C-API inference applications test=develop	7 years ago
Yu Yang	7486b0ddec	fix(Mac): fix unittest of macos test=develop	7 years ago
Yu Yang	d424115f9e	Clean code test=develop	7 years ago
Wu Yi	d7bd0361cb	fix dist deps (#14471 ) * fix dist deps test=develop * update test=develop * update test=develop * update test=develop * update test=develop	7 years ago
Yu Yang	b12c77dae2	Fix unittests test=develop	7 years ago
Jacek Czaja	9b0eae3023	- Removing partial specialization of sotmax for inference for GPU test=develop	7 years ago
Qiao Longfei	05c15a0867	Merge pull request #14467 from jacquesqiao/update-trainer-retry optimize distribute checkport	7 years ago
tensor-tang	a19b3225a1	fix jitcode small size test=develop	7 years ago
Qiao Longfei	fbc529db91	update test=develop	7 years ago
Qiao Longfei	98a0437d70	optimize distribute checkport test=develop	7 years ago
Jacek Czaja	be80bb4f28	- Fix to GPU test=develop	7 years ago
tensor-tang	4dbdfa60ef	sigmoid and tanh support all size test=develop	7 years ago
tensor-tang	ccb8963705	refine exp jitcode with all size test=develop	7 years ago
tensor-tang	d3eae8f61b	refine relu and fix addrelu test	7 years ago
tensor-tang	4e67fe6a12	refine act and vxx with all size	7 years ago
tensor-tang	ba3eaed7a7	exp support all size	7 years ago
tensor-tang	d239801b90	Merge pull request #14463 from tensor-tang/fix/noavx fix build error on noavx	7 years ago
minqiyang	d2c9ddbc02	Polish code test=develop	7 years ago
Michal Gallus	def272cf42	MKLDNN elementwise_mul: Revert changes to eltwise_add tests	7 years ago
tensor-tang	1ffce8c0ae	fix build error on noavx test=develop	7 years ago
superjomn	4bf6817cbc	fix gpu load model the parameters will load from CPUPlace, that will keep copying data between CPU and GPU places. test=develop	7 years ago
Michal Gallus	c69c41604e	MKLDNN elementwise_mul: Move Kernel to KernelPool to avoid segfaults test=develop	7 years ago
Michal Gallus	99e3e36a57	MKLDNN elementwise_mul: Disable UT for CUDA test=develop	7 years ago

1 2 3 4 5 ...

19991 Commits (1f87f263a2906cb1130fdb3cf3c415197cf0d549) All Branches Search

19991 Commits (1f87f263a2906cb1130fdb3cf3c415197cf0d549)

All Branches