Commit Graph

21515 Commits (a5d2a6d1addf918c1f9ea30d677e260c80e201d7)
 

Author SHA1 Message Date
minqiyang 0f94c1ac14 Polish code
6 years ago
minqiyang 00e4de04bf Polish code
6 years ago
minqiyang 4bfa110fd8 Add no lock optimize pass
6 years ago
Qiyang Min 1df2399e00
Merge pull request #15180 from velconia/add_pyramid_dnn_support
6 years ago
chengduo eabb2105fa
Refactor MultiDevSSAGraphBuilder (#15090)
6 years ago
Yan Chunwei 875a07c32d
refactor inference analysis api (#14634)
6 years ago
minqiyang c09a379015 remove const_cast
6 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
6 years ago
tensor-tang 123b98f417 refine heigth and codesize and support all pool
6 years ago
tensor-tang 0145f40f45 use height from params of jitcode
6 years ago
tensor-tang e0591deebc enhance seqpool jitcode
6 years ago
Zeng Jinle 99e6e8b00f
Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
6 years ago
minqiyang db8eb9b688 Polish code
6 years ago
minqiyang dc0ecffd6c Add ut for fused ops
6 years ago
minqiyang f4c990e7b8 Add fused embedding ops
6 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
6 years ago
minqiyang 920d4a8b78 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
6 years ago
minqiyang b2716909b4 Add changes to paddle_build
6 years ago
minqiyang 583f7ce173 Add dynamic jemalloc modules
6 years ago
Tao Luo 5ee596cae5
Merge pull request #15175 from baojun-nervana/intel/mkldnn
6 years ago
乔龙飞 Qiao Longfei 7c891e1ecc
Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
6 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
6 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
6 years ago
sneaxiy be425461a1 fix crf grad lod share
6 years ago
Xin Pan 5f0a0286e0 add doc
6 years ago
Xin Pan 8e2a592be2 fix
6 years ago
乔龙飞 Qiao Longfei e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
6 years ago
baojun-nervana f0cde74564 Update ngraph with elt-wise relu test=develop
6 years ago
tensor-tang 92201d3956 support avg and sqrt pool and add mkl impl
6 years ago
tensor-tang c50060bb26 add jitcode impl and use it
6 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
6 years ago
tensor-tang e58a569c6c use seqpool jitkernel
6 years ago
tensor-tang 3e01a4048f add refer seqpool jitkernel
6 years ago
wopeizl 796322d31a
Merge pull request #15134 from wopeizl/windows/whlsupport
6 years ago
Xin Pan 7526ac14e3 add comments
6 years ago
Xin Pan cb1891f97b polish
6 years ago
Xin Pan f1c7f4b016
Merge pull request #15142 from tianshuo78520a/tools
6 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
6 years ago
Xin Pan 8ae9094e07 polish and resolve conflicts
6 years ago
Xin Pan beaae61a16 polish
6 years ago
Xin Pan 5e928e579a try unify Executor and ParallelExecutor
6 years ago
peizhilin c919b2f31d Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
6 years ago
peizhilin fd4f4d0e5f fix build issue test=develop
6 years ago
Yan Xu a1e60ab19b
Merge pull request #14791 from Yancey1989/parallel_graph_mode
6 years ago
peizhilin 25523bb8e6 test=develop
6 years ago
peizhilin 9ae50dd07d fix gpu buils issue on windows test=develop
6 years ago
qingqing01 c981bf0f9d
Fix compling error with cuDNN v5 (#15148)
6 years ago
peizhilin 8bb513cad4 test=develop
6 years ago
Yancey1989 4ad9de74dd disable sync nccl by default test=develop
6 years ago
Yancey1989 449bf58ea6 disable parallelgraph mode by default test=develop
6 years ago