Commit Graph

18709 Commits (bcb8ea397d594a37dcc271a38b27b0e186f745a4)

Author SHA1 Message Date
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
Qiao Longfei 5428cb9908 Profiler support merge data of all thread (#13811)
7 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
7 years ago
Qiyang Min f99ea99e36 Merge pull request #13720 from velconia/fix_grad_clip
7 years ago
Tao Luo afdc73090f Merge pull request #13816 from PaddlePaddle/revert-13368-mgallus/conv-bias-pass
7 years ago
Tao Luo 9b11a17502 Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo ce248a15d9 Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
7 years ago
whs 7e651c8641 Fix truncated norm (#13785)
7 years ago
Tao Luo 16b1beb244 Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass
7 years ago
Zhaolong Xing 5d5587fff7 Merge pull request #13792 from NHZlX/trt_dy_lib
7 years ago
Michal Gallus 40b17be4b0 Pass: Fuse Conv + Bias
7 years ago
minqiyang 1456b8ec7d Add unittest for clip_by_norm_op with SelectedRows
7 years ago
Tao Luo fd0dd07ab4 Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut
7 years ago
Qiyang Min ac8208b637 Merge pull request #13707 from velconia/fix_ci_random_fail
7 years ago
Qiyang Min 60ea0827d0 Merge pull request #13791 from velconia/fix_mac_14_py3_install
7 years ago
Sylwester Fraczek 3fcca40909 eigen sqrt fix and change 1e-5 to epsilon
7 years ago
minqiyang d8384c8e64 Polish code
7 years ago
Qiao Longfei 5fc305220c Merge pull request #13787 from PaddlePaddle/revert-13637-optimize-opyreader
7 years ago
nhzlx 9445502f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
7 years ago
chengduo e1904ac2c8 Add doc (#13765)
7 years ago
nhzlx d347ea689a fix comments
7 years ago
chengduo e1761709f8 Set the right shape of selected_rows (#13723)
7 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
tensor-tang 2a36f0a3c8 Merge pull request #13784 from tensor-tang/fix/lodtensor
7 years ago
Qiao Longfei 9d087d5139 Revert "optimize pyreader"
7 years ago
tensor-tang b55c247678 add lstm compute unit test
7 years ago
nhzlx f3af90d121 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
7 years ago
nhzlx f569095084 add tensorrt api lib to paddle_fluid
7 years ago
minqiyang 9e3b01264c Make cmake support compile in MacOSX 10.14
7 years ago
tensor-tang dba6928775 fix lod tensor
7 years ago
qingqing01 6094a72308 Fix bug in reduce_op caused by PR #13534 (#13748)
7 years ago
Tao Luo 59c306eb52 Merge pull request #13776 from luotao1/revert_fast_math
7 years ago
Tao Luo 84a55155ec revert with_fast_math to ON
7 years ago
minqiyang 5ae34fb9ac Make code more compatible
7 years ago
Wu Yi cc36bab184 fix manylinux multi arch docker build test=develop (#13770)
7 years ago
Qiao Longfei b1d5135ffb Merge pull request #13637 from jacquesqiao/optimize-opyreader
7 years ago
dzhwinter a46e30aa6d enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
7 years ago
Qiao Longfei 9ff5184fa6 Merge pull request #13732 from jacquesqiao/add-fake-reader
7 years ago
Xin Pan 943e4deb23 Merge pull request #13750 from panyx0718/fix
7 years ago
Jacek Czaja 9f15d8817e - Cleanup as suggessted by reviewers
7 years ago
qiaolongfei 91e8299144 fake reader support python3.x test=develop
7 years ago
Wu Yi 25262ed076 fix cuda9 docker build test=develop (#13701)
7 years ago
Sylwester Fraczek 78f98294c2 conv bn fuse pass
7 years ago
Jacek Czaja ae8b4717cc - Cleaning fused_embedding_fc_lstm op
7 years ago
Jacek Czaja fd31b54cf1 - Removed disabled code
7 years ago