Commit Graph

19981 Commits (6e66fadb951fe02218ab2be2916bc12c4b966e00)
 

Author SHA1 Message Date
qingqing01 fd7e643153 Convolution fusion operator. (#14449)
6 years ago
peizhilin 6e23d6a2d7 disable mkl on windows by default
6 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang 38143e5aca Clean unused changes
6 years ago
Wojciech Uss d36491c28a add allocator.h copy
6 years ago
Yu Yang 7486b0ddec fix(Mac): fix unittest of macos
6 years ago
peizhilin 4a6769da84 re-organize the cmake file
6 years ago
Yu Yang d424115f9e Clean code
6 years ago
peizhilin 1aff40a4c6 exclude warpctc_op on windows
6 years ago
peizhilin 7d51a0e887 disable DSO by default on windows
6 years ago
peizhilin b967e01cbe Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Wu Yi d7bd0361cb fix dist deps (#14471)
6 years ago
Yu Yang b12c77dae2 Fix unittests
6 years ago
Jacek Czaja 9b0eae3023 - Removing partial specialization of sotmax for inference for GPU
6 years ago
peizhilin c59d3e83bc test case fix
6 years ago
peizhilin 2ceb4ae941 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin a3e952f41d add the jit back
6 years ago
Qiao Longfei 05c15a0867 Merge pull request #14467 from jacquesqiao/update-trainer-retry
6 years ago
tensor-tang a19b3225a1 fix jitcode small size
6 years ago
Qiao Longfei fbc529db91 update test=develop
6 years ago
Qiao Longfei 98a0437d70 optimize distribute checkport
6 years ago
Jacek Czaja be80bb4f28 - Fix to GPU
6 years ago
tensor-tang 4dbdfa60ef sigmoid and tanh support all size
6 years ago
tensor-tang ccb8963705 refine exp jitcode with all size
6 years ago
peizhilin 1cc23ef67d merge from paddle:develop
6 years ago
tensor-tang d3eae8f61b refine relu and fix addrelu test
6 years ago
tensor-tang 4e67fe6a12 refine act and vxx with all size
6 years ago
tensor-tang ba3eaed7a7 exp support all size
6 years ago
tensor-tang d239801b90 Merge pull request #14463 from tensor-tang/fix/noavx
6 years ago
Michal Gallus def272cf42 MKLDNN elementwise_mul: Revert changes to eltwise_add tests
6 years ago
peizhilin b942f4760a fix cc_test on windows
6 years ago
tensor-tang 1ffce8c0ae fix build error on noavx
6 years ago
superjomn 4bf6817cbc fix gpu load model
6 years ago
Michal Gallus c69c41604e MKLDNN elementwise_mul: Move Kernel to KernelPool to avoid segfaults
6 years ago
Michal Gallus 99e3e36a57 MKLDNN elementwise_mul: Disable UT for CUDA
6 years ago
Michal Gallus 785066eb8a MKLDNN elementwise_mul: Check if AVX512 is available
6 years ago
Michal Gallus 08f63c4d12 MKLDNN elementwise_mul: Lint changes to UT & integration
6 years ago
Michal Gallus 73b7cd0482 Add Sand3r- to AUTHORS.md
6 years ago
Michal Gallus 49b09327f6 MKLDNN elementwise_mul: Reorder on non-nchw input, fallback on non-16 divisable fm
6 years ago
Michal Gallus f820573b9c MKLDNN elementwise_mul: Add UTs
6 years ago
Michal Gallus d14858e4ba MKLDNN elementwise_mul: Parallelize mul
6 years ago
Michal Gallus ed31936ba1 MKLDNN elementwise_mul: Support NCHW, update UT
6 years ago
Michal Gallus 4e54ab76ec Add HasAttr method to Operator
6 years ago
Tomasz Patejko 700bcbf74f MKLDNN elementwise_mul: h and w loops implemented in xbyak
6 years ago
Tomasz Patejko ad09facafe MKLDNN elementwise_mul: CPU tests initially refactored. MKLDNN mul test for broadcast added
6 years ago
Tomasz Patejko 2d73ad180a MKLDNN elementwise_mul: simple xbyak version for AVX512
6 years ago
Tomasz Patejko 213ec37d6a MKLDNN elementwise_add: simple initial implementation of the operator for MKLDNN format
6 years ago
Wu Yi a2d9b34417 Refine operator cmake (#14413)
6 years ago
peizhilin 695e2aba5e fix the gtest.cmake on windows
6 years ago
Tomasz Patejko 53da846d1e MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass
6 years ago