Commit Graph

3097 Commits (ec59f0d454569ef536c9ac0f7224bc7062b110ce)

Author SHA1 Message Date
tensor-tang ec59f0d454 add cpu vec
7 years ago
tensor-tang cf5ea925c3 fix bugs
7 years ago
tensor-tang 6ed20474d4 refine attention lstm infershape
7 years ago
tensor-tang 508548f897 implement attention lstm cpu forward
7 years ago
tensor-tang 9affc36c89 init attention lstm
7 years ago
tensor-tang 3dd66390b2 add blas vexp
7 years ago
tensor-tang 0ec1f65cf1 fix blas dot and add cblas scal
7 years ago
tensor-tang a2203d0466 add cblas dot
7 years ago
tensor-tang f72ab8961e refine blas gemm
7 years ago
qingqing01 f5d5d7b2d9
Disable in_place in batch_norm API. (#12736)
7 years ago
Tao Luo decda738b0
fea/anakin compile with demo (#12772)
7 years ago
Yan Chunwei 9ee698e605
enhance/ditu rnn with fc fuse (#12831)
7 years ago
Xin Pan 78415f326d
Merge pull request #12838 from panyx0718/infer
7 years ago
Xin Pan a2c0e52f3e speed up while_op
7 years ago
Zhaolong Xing 21ba32b065
Merge pull request #12843 from NHZlX/fix_ssa_bug_for_trt
7 years ago
Michał Gallus cd32ddac12 Fuse Convolution and Eltwise Add into MKLDNN's Conv+Bias (#12669)
7 years ago
nhzlx c999895e93 merge develop
7 years ago
nhzlx 276950291a 1. fix ssa bug with batchnorm, 2. refine the trt
7 years ago
Yan Chunwei 896a37b6e3
fea/link ir to inference analysis and fc fuse support (#12789)
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
Tao Luo d04ef276a5
Merge pull request #12745 from tensor-tang/refine/op/elewise_mul
7 years ago
tangwei12 cbc6e6eb97
Merge pull request #12247 from seiriosPlus/dis_ckpt_fix
7 years ago
Qiyang Min 72965226e6
Merge pull request #12818 from velconia/fix_python3_CI_job
7 years ago
tangwei12 44bade8b17 fix api spec
7 years ago
Zhaolong Xing 470335e8c4
Merge pull request #12786 from NHZlX/add_batch_norm_trt_converter
7 years ago
Qingsheng Li 3d11d018e0
Fix scatter_op python API (#12742)
7 years ago
nhzlx ff052c0e6f merge develop
7 years ago
nhzlx c6a5c4b0c0 add comments for execute in ut_helper
7 years ago
minqiyang beb93bb901 Fix ut bug for graph_test
7 years ago
Tao Luo 8f9f414a14
Merge pull request #12805 from tensor-tang/fix/op/elewise_add
7 years ago
tensor-tang e955361267
Merge pull request #12737 from tensor-tang/feature/op/fusion_lstm
7 years ago
tensor-tang 82bb9170fb Merge remote-tracking branch 'ups/develop' into fix/op/elewise_add
7 years ago
tangwei12 99f74be561
Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes
7 years ago
Tao Luo 2ae885e224
Merge pull request #12811 from luotao1/tensorrt_compiler_bug
7 years ago
Chen Weihang 57b34d9196
Merge pull request #12808 from chenwhql/remove_inplace_param_in_squeeze_and_unsqueeze
7 years ago
Xin Pan daf464af68
Merge pull request #12807 from panyx0718/fix
7 years ago
luotao1 808e5b1748 fix tensorrt compiler bug
7 years ago
Yihua Xu 084d4a9e9e Optimize CRF Decoding with AVX/AVX2/AVX512F instruction (#12767)
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
Xin Pan 4a4c469f61 add test
7 years ago
qingqing01 c62f68cb94
Fix bug in conditional_block_op. (#12246)
7 years ago
nhzlx 1bf9d9e90c fix comments
7 years ago
chenweihang bc471b6ac4 refactor: remove inplace parameter from squeeze and unsqueeze op
7 years ago
Xin Pan 7473d5f735 fix program_desc constructor
7 years ago
tensor-tang 0507f7bc3c fix SEGV elementwise add at debug mode
7 years ago
tangwei12 cfb12f09bf fix some teeny mistakes
7 years ago
Yu Yang c6af7201e9
Merge pull request #12692 from reyoung/feature/fast_executor
7 years ago
Xin Pan e525aa232e
Merge pull request #12780 from panyx0718/ir4
7 years ago
Tao Luo 7decbaaa13
Merge pull request #12762 from luotao1/anakin_cuda_env
7 years ago
nhzlx 324dd16816 merge develop
7 years ago