Paddle

Commit Graph

Author	SHA1	Message	Date
chengduo	a8d3aaae2a	print output log warning (#14497) test=develop	7 years ago
Tao Luo	5cc7946313	Merge pull request #14499 from luotao1/disable_openblas_test disable two openblas test temporary	7 years ago
Houjiang Chen	10ae3ba486	Merge pull request #14493 from hjchen2/develop Implement leaky relu converter from fluid to tensorRT	7 years ago
Houjiang Chen	33c65517fd	Update CMakeLists.txt test=develop	7 years ago
Tao Luo	1d3e9bde1e	Merge pull request #14488 from yihuaxu/develop_7a64d48f5_stack_opt Optimize the stack operator	7 years ago
Houjiang Chen	01bda73116	Update CMakeLists.txt	7 years ago
Tao Luo	09ee266f8e	disable two openblas test temporary test=develop	7 years ago
hjchen2	2c2a192eb1	Resolve merge conflicts test=develop	7 years ago
Yiqun Liu	8bc1c5d2ab	Implement the Tensorrt plugin for elementwise op (#14487) * Initialize the elementwise plugin. * Implement the basic CUDA kernel of elementwise plugin. test=develop	7 years ago
tensor-tang	7aa3aff338	Merge pull request #14465 from tensor-tang/fea/jit/exp jitcode act support all size	7 years ago
Tao Luo	1b894e495f	Merge pull request #14437 from jczaja/prv-softmax-mkl Introducing MKL to softmax for inference	7 years ago
chengduo	a94a7355f0	Refine the GraphNum check (#14144) * refine GraphCheck test=develop * fix ci fail test=develop	7 years ago
Yihua Xu	a906a361be	Add the macro for NVCC (test=develop)	7 years ago
Yihua Xu	d91740acb1	Revert "Remove the remnant code (test=develop)" This reverts commit `be50670348`.	7 years ago
Yihua Xu	be50670348	Remove the remnant code (test=develop)	7 years ago
hjchen2	1622cb9937	Fix alpha tensor key	7 years ago
hjchen2	a8c077df7c	Implement leaky relu tensorRT converter	7 years ago
qingqing01	9eefd2c766	Modify some infer-shape about detection operators in compile-time. (#14483) * Modify some infer-shape in compile-time.	7 years ago
Tao Luo	cf685f361b	Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections [WIP] Correcting and extending MKLDNN residual connection fuse pass	7 years ago
Yihua Xu	f4c869d872	Optimize the layer_norm operator with AVX intrinsic function (#14417) * Optimize layer_norm operator with AVX intrinsic functions * Revert the wrong modifications * Implement the jit kernel for layer_norm operator * Add math headfile to fix the compile issue (test=develop) * Add math headfile to fix the compile issue (test=develop) * Fixed the intrinsic headfile issue (test=develop) * Fix the conflicts (test=develop) * Revert for CUDA compiler (test=develop) * Fixed the cuda depency (test=develop) * Fix the marco issues (test=develop)	7 years ago
Houjiang Chen	816b464037	Merge pull request #14486 from hjchen2/develop Fix tensorrt plugin cmake dependency, test=develop	7 years ago
Yu Yang	f1a392a5fe	Merge pull request #13804 from sneaxiy/rewrite_allocation Rewrite allocation	7 years ago
Yihua Xu	f418f552df	Merge branch 'develop' into develop_7a64d48f5_stack_opt (test=develop)	7 years ago
hjchen2	2825685f2a	Fix tensorrt plugin cmake dependency, test=develop	7 years ago
qingqing01	fd7e643153	Convolution fusion operator. (#14449) * Convolution fusion operator. * Clean code test=develop	7 years ago
Yu Yang	98bbfc17be	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation test=develop	7 years ago
Yu Yang	d424115f9e	Clean code test=develop	7 years ago
Wu Yi	d7bd0361cb	fix dist deps (#14471) * fix dist deps test=develop * update test=develop * update test=develop * update test=develop * update test=develop	7 years ago
Yu Yang	b12c77dae2	Fix unittests test=develop	7 years ago
Jacek Czaja	9b0eae3023	- Removing partial specialization of sotmax for inference for GPU test=develop	7 years ago
tensor-tang	a19b3225a1	fix jitcode small size test=develop	7 years ago
Jacek Czaja	be80bb4f28	- Fix to GPU test=develop	7 years ago
tensor-tang	4dbdfa60ef	sigmoid and tanh support all size test=develop	7 years ago
tensor-tang	ccb8963705	refine exp jitcode with all size test=develop	7 years ago
tensor-tang	d3eae8f61b	refine relu and fix addrelu test	7 years ago
tensor-tang	4e67fe6a12	refine act and vxx with all size	7 years ago
tensor-tang	ba3eaed7a7	exp support all size	7 years ago
tensor-tang	1ffce8c0ae	fix build error on noavx test=develop	7 years ago
Wu Yi	a2d9b34417	Refine operator cmake (#14413) * wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop	7 years ago
Tomasz Patejko	53da846d1e	MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass test=develop	7 years ago
tensor-tang	7f17e561d7	Merge pull request #14423 from tensor-tang/fea/jit/act jitcode act relu, exp, sigmoid, tanh	7 years ago
Jiabin Yang	28bd5b7bad	fix space_to_depth_op unicode problem (#14430) * fix space_to_depth_op unicode problem * test=develop	7 years ago
Jacek Czaja	513bb6c151	Squashing MKL based softmax for inference test=develop - Added profiling to softmax functors - MKL based softmax inference op - Fix to softmax compuation via MKL - cleaning - Cosmetic fixes to softmax MKL - Fix to ON_INFER lack of propagation	7 years ago
Tomasz Patejko	dbc4fcd722	MKLDNN residual connections fuse pass: unit tests enabled and added	7 years ago
Tomasz Patejko	4224089354	MKLDNN residual connections fuse pass: Maybe removed and boost::optional used where it makes sense	7 years ago
Tomasz Patejko	86fd3b32be	MKLDNN residual connections fuse pass: counting statistics added to the pass	7 years ago
Tomasz Patejko	ee6f778beb	MKLDNN residual connections fuse pass: further refactoring	7 years ago
Tomasz Patejko	7423748e37	MKLDNN residual connections fuse pass: * implements reachability check between identity node and non-identity argument to elementwise_add * implements handling identity node as x and as y argument to elementwise_add	7 years ago
whs	1722678258	Make nce support more distribution. (#13549) * Fix truncated normal. * Fix. * Make nce support more distribution. * Fix API.spec. * Fix python API. * Fix. test=develop * Fix API.spec test=develop * Fix sampler. * Fix order of arguments in python API. test=develop	7 years ago
tensor-tang	1f00723fa3	exp, sigmoid, tanh jitcode support more size test=develop	7 years ago

11905 Commits (a8d3aaae2a648ee552d60869fc5117e61d4ce1b0)