Commit Graph

26 Commits (27cd3efdd140bb156505b573aa222cfb690d169b)

Author SHA1 Message Date
Kevin c474e7ddf5 fix overflow by int32 mul test=develop (#16794)
6 years ago
Zeng Jinle d658244997
fix some grad op desc maker (#16581)
6 years ago
tensor-tang ead558b7f6
Merge pull request #16256 from tensor-tang/refine/seqenum
6 years ago
tensor-tang 50931dee1d refine seq enum op
6 years ago
luotao1 d9f0e7252a refine with comments
6 years ago
luotao1 721c2c00ef refine fc_infershape
6 years ago
sneaxiy 5a92e4c097 revert revert 16144
6 years ago
Zeng Jinle a91964c8fe Revert "PaddingRNN model memory optimize"
6 years ago
Zeng Jinle 0b49e43d3a
Merge pull request #16144 from sneaxiy/rnn_mem_opt
6 years ago
luotao1 fe78a92e6e refine with comments
6 years ago
sneaxiy b26e9bd232 refine code
6 years ago
luotao1 31ccaf0916 add all_kernels_must_compute_runtime_shape example for speedup infershape
6 years ago
whs 0f99d24083
Make sequence_erase op support for input with multi-level LoD. (#15982)
6 years ago
luotao1 34404f9c31 refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool
6 years ago
Yiqun Liu f4634d76d7
Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)
6 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
6 years ago
Yiqun Liu eaad3e4c3d
Add check of input in sequence_expand op. (#15466)
6 years ago
sneaxiy b56aca82e9 merge develop
6 years ago
tangwei12 dc8eca826e
code style fix, test=develop (#15045)
6 years ago
sneaxiy 045dc12766 merge develop
6 years ago
sneaxiy a500dfa579 rewrite ddim
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
6 years ago
sneaxiy 65867d8989 test=develop
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
Wu Yi a2d9b34417
Refine operator cmake (#14413)
6 years ago