Commit Graph

11061 Commits (d26e4507dac94e0de3a24816541f06082770bc35)

Author SHA1 Message Date
Sylwester Fraczek 55d6950a1a rewrite conv_bn fuse pass to eigen
7 years ago
chengduo 9c77b65c06
Fix layers.uniform_random (#13823)
7 years ago
nhzlx 320c78e16f fix commets
7 years ago
Yibing Liu cbe4292516 Add sequence unpad op
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
nhzlx efa5bac7ad fix demo_ci bug in vis_demo.cc
7 years ago
tensor-tang dc5a7b906d fix default number of threads when inference with or without MKLDNN
7 years ago
Xin Pan 228506618b Avoid GetMutable implicitly reset Var Type.
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
minqiyang 3f6ec90060 Polish code
7 years ago
minqiyang 9878eedbaa Change API.spec
7 years ago
Qiao Longfei 5428cb9908
Profiler support merge data of all thread (#13811)
7 years ago
nhzlx bf7a2789a0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_trt_pad_op
7 years ago
nhzlx 0cb88c34be add op converter
7 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
7 years ago
Qiyang Min f99ea99e36
Merge pull request #13720 from velconia/fix_grad_clip
7 years ago
minqiyang f40848828d Polish code
7 years ago
minqiyang e2e82bde32 Accelerate Reshape op
7 years ago
minqiyang 0385b0a1ea Accelerate SequencePool Op on SUM mode
7 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
7 years ago
Xin Pan 63b2e98f3d Explain LoD and a few other concepts
7 years ago
Tao Luo 9b11a17502
Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo ce248a15d9
Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
7 years ago
whs 7e651c8641
Fix truncated norm (#13785)
7 years ago
Tao Luo 16b1beb244
Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass
7 years ago
Zhaolong Xing 5d5587fff7
Merge pull request #13792 from NHZlX/trt_dy_lib
7 years ago
Michal Gallus 40b17be4b0 Pass: Fuse Conv + Bias
7 years ago
minqiyang 1456b8ec7d Add unittest for clip_by_norm_op with SelectedRows
7 years ago
Tao Luo fd0dd07ab4
Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut
7 years ago
Sylwester Fraczek 3fcca40909 eigen sqrt fix and change 1e-5 to epsilon
7 years ago
Qiao Longfei 5fc305220c
Merge pull request #13787 from PaddlePaddle/revert-13637-optimize-opyreader
7 years ago
nhzlx 9445502f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
7 years ago
chengduo e1904ac2c8
Add doc (#13765)
7 years ago
nhzlx d347ea689a fix comments
7 years ago
chengduo e1761709f8
Set the right shape of selected_rows (#13723)
7 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
Qiao Longfei 9d087d5139 Revert "optimize pyreader"
7 years ago
tensor-tang b55c247678 add lstm compute unit test
7 years ago
nhzlx f3af90d121 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
7 years ago
nhzlx f569095084 add tensorrt api lib to paddle_fluid
7 years ago
qingqing01 6094a72308
Fix bug in reduce_op caused by PR #13534 (#13748)
7 years ago
Tao Luo 59c306eb52
Merge pull request #13776 from luotao1/revert_fast_math
7 years ago
Tao Luo 84a55155ec revert with_fast_math to ON
7 years ago
Wu Yi cc36bab184
fix manylinux multi arch docker build test=develop (#13770)
7 years ago
Qiao Longfei b1d5135ffb
Merge pull request #13637 from jacquesqiao/optimize-opyreader
7 years ago
dzhwinter a46e30aa6d
enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
7 years ago
Xin Pan 943e4deb23
Merge pull request #13750 from panyx0718/fix
7 years ago
Jacek Czaja 9f15d8817e - Cleanup as suggessted by reviewers
7 years ago
Wu Yi 25262ed076
fix cuda9 docker build test=develop (#13701)
7 years ago
Sylwester Fraczek 78f98294c2 conv bn fuse pass
7 years ago
Jacek Czaja ae8b4717cc - Cleaning fused_embedding_fc_lstm op
7 years ago
Jacek Czaja fd31b54cf1 - Removed disabled code
7 years ago
Jacek Czaja f9da2d6416 - Removed disabled diagnostic code
7 years ago
Jacek Czaja 809dbc5c17 - Added file for fused_embedded_fc_lstm_op unit test
7 years ago
Tao Luo 75bd0f188b
Merge pull request #13754 from luotao1/fast_math
7 years ago
qiaolongfei 5238a7f5b9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-opyreader
7 years ago
tensor-tang e6d8aca3bf refine code and fix
7 years ago
qingqing01 41e4f7ea0a
Optimize Topk when height is large. (#13710)
7 years ago
xuezhong 65ed45a154
Merge pull request #13717 from chengduoZH/enhance_while
7 years ago
minqiyang bcd8c2ccc3 Add unit test
7 years ago
Tao Luo 28889caea5 disable EIGEN_FAST_MATH and use_fast_math
7 years ago
tensor-tang ea7dc9cbf6 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
7 years ago
Xin Pan d2079b1ddb clean unused code and small optimize
7 years ago
tensor-tang 2513b2cc4e fix bug vtanh
7 years ago
chengduoZH e59ab42caa add nodes for drnn
7 years ago
Xin Pan ab798a2832 clarify the fraction_of_gpu_memory flag
7 years ago
Tao Luo d770b9bda3
Merge pull request #13663 from luotao1/resnet50_ut
7 years ago
dzhwinter 32c260cd1f
"fix operators cmake" (#13581)
7 years ago
Tao Luo 6ef6a9180a
Merge pull request #13727 from Sand3r-/mgallus/enable-mkldnn-naive-exe
7 years ago
minqiyang f20fc95539 Resize output ddims and rows
7 years ago
qiaolongfei 91756a5a90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-opyreader
7 years ago
Zeng Jinle 8cd17c04c1
Merge pull request #13718 from sneaxiy/fix_eager_deletion
7 years ago
Michal Gallus 09d9d77a8f Enable MKLDNN in Naive Executor
7 years ago
Jiabin Yang 8e63bc2307
Merge pull request #13700 from JiabinYang/fix/mac_ci_job
7 years ago
tensor-tang 5ef14dd386
Merge pull request #13715 from tensor-tang/fix/op
7 years ago
minqiyang 67308822f8 Add selected_rows merge for clip_by_norm op
7 years ago
sneaxiy 9606b37ce4 test=develop
7 years ago
Xin Pan c0dfd5eee8
Merge pull request #13712 from luotao1/doc_element
7 years ago
tensor-tang ea0b98e007 bugfix: fusion lstm and gru batch,seq mode switch
7 years ago
Tao Luo 69ed75e77c refine elementwise doc
7 years ago
Tao Luo 49cd43000c
Merge pull request #12981 from chenwhql/sequence_erase_op_infershape_note_polish
7 years ago
JiabinYang 248400f43a test=develop
7 years ago
tensor-tang cf8c8e72bd add vtanh and unit test
7 years ago
Tao Luo 067b8582af
Merge pull request #13625 from wanghaoshuang/fix_warning_roi
7 years ago
tensor-tang b37fe30417
Merge pull request #13690 from wangguibao/fix_cpu_lstm_compute_cc
7 years ago
dzhwinter 26771f41ba
"fix compile error" (#13579)
7 years ago
JiabinYang 4cc3c4c976 test=develop
7 years ago
Yu Yang 186b2b138d
Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697)
7 years ago
Tao Luo a89afd4c22
Merge pull request #13685 from luotao1/naive_cmake
7 years ago
tensor-tang d10a9df7b8 add vaddbias and unit test
7 years ago
tensor-tang 3c8b651187 add vsigmoid avx implementations and unit test
7 years ago
luotao1 9cbf2023ab rollback paddle_inference_helper.h to helper.h
7 years ago
sneaxiy 6f748a035d test=develop
7 years ago
tensor-tang 55e44761fb refine code and init vsigmoid
7 years ago
Xin Pan 2f5a7cc470
Merge pull request #13673 from panyx0718/infer
7 years ago
wangguibao 1940bc2d83 Avoid multiple definitions of lstm_compute_ctht when linking libpaddle_fluid.so
7 years ago