Commit Graph

2063 Commits (94992a990b2716d19427b4758060a5196baf1c56)

Author SHA1 Message Date
zhouxiao-coder 53574e54a1 reslove merge conflict;reimplement ELU activation with functor
7 years ago
武毅 3f874143fe fix grad debug event (#4536)
7 years ago
Luo Tao 4724bdbe68 Merge branch 'develop' into interp
7 years ago
Yi Wang 99895730f7 Merge pull request #4609 from kavyasrinet/tanhshrink
7 years ago
Yan Chunwei 20a6ae7f1f Feature/tensor array add python binding (#4616)
7 years ago
qiaolongfei ffe1b69229 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_compile_time_infershape
7 years ago
kexinzhao 087addaa76 Merge pull request #4558 from kexinzhao/adagrad_op
7 years ago
Kexin Zhao 78f4c803f3 change learning rate and fix format
7 years ago
Kavya Srinet 0336304176 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rmsprop
7 years ago
Kavya Srinet 154a6ed29c Implementing tanhshrink operator
7 years ago
qiaolongfei 628715d602 clean code
7 years ago
qiaolongfei 352af966d7 add python unit test
7 years ago
kavyasrinet 3e2be065b9 Merge pull request #4604 from kavyasrinet/activations
7 years ago
sidgoyal78 c10da26cf5 Modify implementation
7 years ago
Abhinav Arora 828c5b3e1d Adding Adadelta optimization operator (#4576)
7 years ago
Kavya Srinet 11070e5f36 Updated the reltive error
7 years ago
Kavya Srinet 60af56c1b8 Added Leaky Relu activation
7 years ago
qiaolongfei 5917e09cde tmp work
7 years ago
Kavya Srinet fa12e51675 Adding the default attribute test case
7 years ago
Kavya Srinet 94855f4af0 Fixed changes proposed in the review
7 years ago
Abhinav Arora eed2c1e1d6 Changing SGD inputs and outputs to conform to Operator naming convention (#4586)
7 years ago
Abhinav Arora 324876bbbf Changing learning rate from type Input(float) to Input(tensor) (#4578)
7 years ago
zchen0211 94b94e5b68 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
sidgoyal78 d28b3094dd Add momentum operator
7 years ago
Abhinav Arora 42e7fe05a2 Changing learning rate from attribute to input(float) (#4568)
7 years ago
Kavya Srinet 163d287143 Made learning rate the input
7 years ago
Kexin Zhao d1de7ec630 Change learning rate from attribute to input tensor
7 years ago
Kavya Srinet 61c03f9d59 Adding the implementation for rmsprop operator
7 years ago
zchen0211 58174b12f7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
Kexin Zhao 1ac654a69f Implementing the Adagrad optimizer step operator
7 years ago
qiaolongfei 32f5c9dd93 recurrent_op pass the unit test
7 years ago
zchen0211 15941dbd8c solve conflict for cond_op and scatter
7 years ago
qiaolongfei 7163dd0413 revert code
7 years ago
caoying03 be8bef9bdd Merge branch 'develop' into add_config_helper_for_resize_layer
7 years ago
chengduoZH 14b2c98f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Add_maxpool_withIdx_only
7 years ago
qiaolongfei af6f3c0423 use float32 in cond_op
7 years ago
Yu Yang 0900aedfa0 Merge pull request #4514 from reyoung/feature/remove_add_op
7 years ago
caoying03 480154896c add configuration helper for resize layer.
7 years ago
chengduoZH 2ed56df1e6 remove conflict
7 years ago
chengduoZH 6fc44800ed fix unit test
7 years ago
chengduoZH bee95fc891 fix code format and some bug
7 years ago
Yancey1989 a35e82a649 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into seqconcat_op
7 years ago
chengduo 4f5491b2b4 Merge pull request #4146 from chengduoZH/Add_pool_op
7 years ago
chengduoZH 6abcb74c8f fix unit test class name
7 years ago
Yu Yang aa52fa1c64 Merge pull request #4491 from reyoung/feature/stable_lstm
7 years ago
chengduoZH 2d8a5b97cc fix unit test
7 years ago
Qiao Longfei 7fe0297e64 remove Runtime InferShape for cond op (#4518)
7 years ago
Yu Yang 6164b8986e Fix CI
7 years ago
Yu Yang 762a99cc06 Remove add_op since it can be replaced by sum_op
7 years ago
Yancey1989 be3fa7926e add sequence concat op
7 years ago
guosheng 46641c6361 Fix infer when input is empty in v2/inference.py
7 years ago
zhouxiao-coder 601e2317fd update to latest
7 years ago
zhouxiao-coder a815d6abcf elu: Optimize gradient calculation;Add more comments
7 years ago
chengduoZH df59889984 remove conflict
7 years ago
Luo Tao bb7f555803 remove rowwise_add_op
7 years ago
Luo Tao 884e31a59b add interpolation op
7 years ago
Cao Ying 99130c6e94 Merge pull request #4498 from pengli09/fix-random-seed
7 years ago
Peng Li 4dfc10ccf7 a patch for fixing random seeds in gradient checkers
7 years ago
Liu Yiqun 8bafdda0ad Merge branch 'develop' into core_add_sequence_softmax_op
7 years ago
guosheng a53191f12a Add norm_op
7 years ago
Yu Yang f60f0eae11 Using double precision to stablize lstm gradient check
7 years ago
Yu Yang 9fbf94b61a Merge pull request #4487 from abhinavarora/softsign_activation
7 years ago
Abhinav Arora 0c3eee09ff Implementing the SoftSign activation operator
7 years ago
Yu Yang 279178e457 Fix bug in test_prelu and test_xe
7 years ago
Yu Yang 54892c0797 Simplify op_test
7 years ago
Yu Yang 61cc3ae4d1 Stablize elementwise_mul by using double precision
7 years ago
Yu Yang 6efcbc4fcb Fix bug in test_prelu and test_xe
7 years ago
zchen0211 88a8eedda1 scatter gather gpu
7 years ago
Yu Yang 6ed78729b2 Simplify op_test
7 years ago
Yu Yang fd479631e1 Stablize elementwise_mul by using double precision
7 years ago
Abhinav Arora b9336e6f8c Adding support for the sigmoid_cross_entropy_with_logits operator (#4448)
7 years ago
chengduoZH 6326c40d27 Add max pool with index
7 years ago
Guo Sheng ecef2e6b97 Merge pull request #4086 from guoshengCS/add-ReduceOp
7 years ago
Yibing Liu e303897f35 Merge branch 'develop' of upstream into margin_rank_loss_op_dev
7 years ago
Liu Yiqun 03897f251d Finish the SequenceSoftmaxGradKernel, using SoftmaxGradFunctor.
7 years ago
Yu Yang 21f63ec223 Merge pull request #4458 from reyoung/feature/compile_time_infer_shape
7 years ago
Yancey d7db15f3e5 Use StridedMemCpy in Concat/Split Kernel (#4188)
7 years ago
guosheng be58c6327d Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-ReduceOp
7 years ago
Yu Yang 6196209478 Remove OperatorBase::InferShape
7 years ago
Yu Yang ba4b0291ef Follow comments, check exception message
7 years ago
Yu Yang 680c20217e Merge branch 'develop' of github.com:baidu/Paddle into feature/make_python_catch_enforce_not_met
7 years ago
chengduoZH 3c0f079333 remove conflict and fix InferShape function
7 years ago
Liu Yiqun ce3171f3c4 Merge branch 'develop' into core_add_sequence_softmax_op
7 years ago
guosheng 99b8dbb14f Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-ReduceOp
7 years ago
Yu Yang dcfd31d736 Merge pull request #4397 from reyoung/feature/pybind_for_protobuf_desc
7 years ago
Yu Yang de35098779 Fix CI and follow comment
7 years ago
Liu Yiqun c8fc6037fd Merge branch 'develop' into core_add_sequence_softmax_op
7 years ago
Yu Yang 7b385ff206 Merge pull request #4407 from Canpio/fix_huber_loss_test_error
7 years ago
Yiqun Liu 29cb85634c Merge pull request #4144 from lcy-seso/softmax_with_cross_entropy_op
7 years ago
fengjiayi 36f3d0af22 Fix error in unit test of ModifiedHuberLossOp
7 years ago
Yu Yang 49697d9dab Merge branch 'develop' of github.com:baidu/Paddle into feature/make_python_catch_enforce_not_met
7 years ago
Yu Yang 9e5de16719 Merge branch 'feature/pybind_for_protobuf_desc' of github.com:reyoung/Paddle into feature/pybind_for_protobuf_desc
7 years ago
Yu Yang 62d597c176 Merge branch 'develop' of github.com:baidu/Paddle into feature/pybind_for_protobuf_desc
7 years ago
Yu Yang 67cdd5bc61 Make PyBind support C++ exception
7 years ago
Yibing Liu dc186af729 Merge branch 'develop' of upstream into margin_rank_loss_op_dev
7 years ago
Yibing Liu 367a54e08c Merge pull request #4360 from kuke/multiplex_modify_dev
7 years ago
chengduoZH 30a586df0c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Add_pool_op
7 years ago
Tao Luo 0cc85d794a Merge pull request #4331 from tensor-tang/mkldnn_softmax
7 years ago
caoying03 3d77360b89 add negative clipping for softmax.
7 years ago
caoying03 360bde9a70 Merge branch 'develop' into softmax_with_cross_entropy_op
7 years ago
Cao Ying 7d65321620 Merge pull request #4237 from lcy-seso/optimize_cross_entropy_kernel
7 years ago
caoying03 000d75116f fix backward op.
7 years ago
Yibing Liu 089f8e2d37 Merge branch 'develop' of upstream into multiplex_modify_dev
7 years ago
caoying03 8b8ad6b164 fix implementations of supporting soft labels.
7 years ago
fengjiayi 6915c924a4 Fix bug
7 years ago
fengjiayi 4fb106afb0 Merge branch 'feature/pybind_for_protobuf_desc' of https://github.com/reyoung/Paddle into feature/pybind_for_protobuf_desc
7 years ago
fengjiayi 5419f16b38 Add unittests
7 years ago
Yu Yang 16c5f629bd Complete unittest for OP
7 years ago
Yu Yang f9f910a33b Complete op
7 years ago
Yu Yang 1cd2014007 Merge branch 'develop' of github.com:baidu/Paddle into feature/pybind_for_protobuf_desc
7 years ago
Zhuoyuan e5a3c1d2d5 Merge pull request #4372 from reyoung/feature/stable_prelu_grad_test
8 years ago
Zhuoyuan f698a49ce3 Merge pull request #4240 from zchen0211/develop
8 years ago
Yu Yang d54e8420be Stabilize prelu gradient check
8 years ago
Yibing Liu 236af56612 separate index tensor from candidate tensors in multiplex_op
8 years ago
chengduoZH b72854389e Fix (According to the review)
8 years ago
tensor-tang 7483087c8c enable mkldnn_softmax
8 years ago
Cao Ying 14a7399d22 Merge pull request #4329 from ranqiu92/r-doc
8 years ago
ranqiu 732c8973e0 Update annotations of layers.py
8 years ago
Liu Yiqun 9f32c8d896 Merge branch 'develop' into core_add_sequence_softmax_op
8 years ago
Yibing Liu 47fbc96fa1 Merge pull request #4064 from kuke/multiplex_op_dev
8 years ago
Tao Luo 01bec25734 Merge pull request #4193 from luotao1/seq_pool
8 years ago
caoying03 bb58b63b6c Merge branch 'develop' into softmax_with_cross_entropy_op
8 years ago
guosheng 1295e5ef54 Refine reduce_op unit test and add newline at end of file
8 years ago
guosheng c8d877195b Revise the reduce_op unit test accordingly
8 years ago
guosheng 3994e91a67 Add reduce_op
8 years ago
ranqiu 17622b482d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into r-doc
8 years ago
caoying03 201c2bcf20 delete redundant codes.
8 years ago
caoying03 6735585b0f fix cpu kernel with soft labels.
8 years ago
Yu Yang 9fa7c9306c Merge branch 'feature/pybind_for_protobuf_desc' of github.com:reyoung/Paddle into feature/pybind_for_protobuf_desc
8 years ago
fengjiayi 08e9900621 Fix bugs
8 years ago
Yu Yang b941865d44 Merge branch 'feature/simplify_attr_parse' into feature/pybind_for_protobuf_desc
8 years ago
fengjiayi 57c95c7957 Merge branch 'fix_lod_tensor_dim_64' into feature/pybind_for_protobuf_desc
8 years ago
Yu Yang ddf2448484 Update Input/Output of Op
8 years ago
Yu Yang dc643a3352 Hot fix unittest
8 years ago
Yu Yang bddb40609d Buggy code
8 years ago
fengjiayi f5aa8b4d7e Update namespace of pybind/protobuf.cc and .h
8 years ago
fengjiayi 6db6475460 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/pybind_for_protobuf_desc
8 years ago
fengjiayi ee547f6ac9 Add unittests
8 years ago
superjom b545b5b86b Merge branch 'develop' of github.com:PaddlePaddle/Paddle into feature/recurrent_op_backward_fix
8 years ago
ranqiu 0e6466423b Update the annotation of layers.py
8 years ago
caoying03 30bfaab36e Merge branch 'develop' into optimize_cross_entropy_kernel
8 years ago
gongweibao f99841dd2a Elementwise operator. (#4139)
8 years ago
dangqingqing efb56db770 tune max_relative_error in test_cos_sim_op.
8 years ago
qingqing01 7831b1d9ea Merge branch 'develop' into attr_bool
8 years ago
chengduoZH c2c2d610a4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Add_pool_op
8 years ago
Yu Yang 618884dd69 Complete unittest for ProgramDesc
8 years ago
Yu Yang 70f398e207 Update
8 years ago
dangqingqing 0dce16a697 Use bool type for attr in cross_entropy_op.
8 years ago
Yibing Liu 85a5d38446 Merge branch 'develop' of upstream into multiplex_op_dev
8 years ago
chengduoZH 6f61b5df7d fix unit test
8 years ago
Luo Tao 0449b9c89e Merge branch 'develop' into seq_pool
8 years ago
chengduoZH 84a2512b90 fix parameter name and function define
8 years ago
Yibing Liu 756af4e73a regulate comments in margin_rank_loss_op
8 years ago
dangqingqing 58e3ad0a70 Fix conflicts.
8 years ago
caoying03 f1d5fb3b9a support soft labels.
8 years ago
Yibing Liu 6b3e9ccb3a pass unit test for margin_rank_loss_op
8 years ago
chengduoZH 50b8ec0564 fix unit test
8 years ago
Yibing Liu 2f12256186 Merge branch 'develop' of upstream into margin_rank_loss_op_dev
8 years ago
dangqingqing 6e2782e958 update to develop branch.
8 years ago
caoying03 a2a0d6f82a Merge branch 'develop' into softmax_with_cross_entropy_op
8 years ago
chengduoZH 3416f5e0f8 fix function define
8 years ago
Liu Yiqun 4d9293940b Merge branch 'develop' into core_add_sequence_softmax_op
8 years ago
chengduoZH 510f00800a Add pool3d unit test
8 years ago
chengduoZH 33d9999890 Add pool2d unit test
8 years ago
hedaoyuan 0ee967b513 Merge pull request #4288 from hedaoyuan/fix_bug
8 years ago
hedaoyuan ccbb285311 Increase the max_relative_error in TestConv2dOp.
8 years ago
QI JUN 8c3b8af31e Merge pull request #4071 from QiJune/activation_ops
8 years ago
Yibing Liu d827359c71 Merge pull request #4098 from kuke/rank_loss_op_dev
8 years ago
whs da2aabb628 Merge pull request #3906 from wanghaoshuang/crop_op
8 years ago
superjom 27aaee1181 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into feature/recurrent_op_backward_fix
8 years ago
Yibing Liu cf4b2db758 change the dims of input of rank_loss_op
8 years ago
Yibing Liu 79c2d90a7f add margin_rank_loss_op
8 years ago
Liu Yiqun f14a7966b0 Initialize the sequence softmax operator.
8 years ago
whs e53dc8a2e4 Merge pull request #3937 from wanghaoshuang/clip_op
8 years ago
X.Dragon c003895c1c Merge pull request #3920 from NHZlX/op_transpose
8 years ago
superjom 0da8133224 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into feature/recurrent_op_backward_fix
8 years ago
superjom 6a0c342874 make RecurrentOp's backward work
8 years ago
hedaoyuan 7a891a3321 Merge pull request #4042 from hedaoyuan/conv_op
8 years ago
Yan Chunwei b5e67fce70 RNNOp remove alias (#4274)
8 years ago
wanghaoshuang bc632df822 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_op
8 years ago
wanghaoshuang c7b6d2c46d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into clip_op
8 years ago
superjom 68399ab921 Merge remote-tracking branch 'origin/rnn-backward-python' into feature/recurrent_op_backward_fix
8 years ago
Zhuoyuan 40e49c3f8b update python test
8 years ago
superjom 075e0e3c5d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rnn_remove_alias
8 years ago
superjom 3c29224ef3 remove alias
8 years ago
zchen0211 7883227716 lstm
8 years ago
caoying03 a3a8a0900d optimize cross entropy kernel by using reduce.
8 years ago
Yibing Liu 9da5192f77 adapt multiplex_op to the dev of framework
8 years ago
Tao Luo 4400284685 Merge pull request #4224 from tensor-tang/act
8 years ago
tensor-tang eb26fdce46 add python interface for mkldnn_relu and mkldnn_tanh
8 years ago
Yang yaming 51f1148921 Merge pull request #3987 from pkuyym/fix-3923-c
8 years ago
Yang yaming cdda0cf3d4 Merge pull request #3913 from pkuyym/fix-3789
8 years ago
Yibing Liu 18dc201bd9 merge multiplex_op with the latest upstream
8 years ago
dangqingqing 39cf2e217d update to develop branch.
8 years ago
dangqingqing b65709e403 Share LoD between input and output of each opeators.
8 years ago
Yibing Liu ece329100a refine rank_loss_op
8 years ago
yangyaming 308ce9ac55 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-3923-c
8 years ago
yangyaming 4e3ba65f19 Refine doc.
8 years ago
yangyaming 12596a16ec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-3789
8 years ago
ranqiu 37faf49565 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into r-doc
8 years ago
xzl 1792e58f20 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into op_transpose
8 years ago
xzl 0cd9b8c0aa modify the input\output name to X\Out
8 years ago
Yibing Liu f2cfa32411 Merge branch 'develop' of upstream into rank_loss_op_dev
8 years ago
wanghaoshuang 3f3848cdf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into clip_op
8 years ago
dangqingqing 72ba02701b Add bool type for attribute and use it in dropout_op.
8 years ago
superjom b818e64720 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rnn_remove_alias
8 years ago
superjom 0d7e4294fc remove alias
8 years ago
dangqingqing 7ee916b0d3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into attr_bool
8 years ago
dangqingqing 2aa4d326ec Fix unit testint in test_prelu_op.
8 years ago
wanghaoshuang a3c3b7866e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into clip_op
8 years ago
wanghaoshuang ce709b75b3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_op
8 years ago
qingqing01 5b42d2b21b Merge pull request #4081 from xinghai-sun/soft_label_cross_entropy
8 years ago
ranqiu 2ba70f5d36 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into r-doc
8 years ago
ranqiu 62377fd1f3 Update annotations about layer name of layers.py
8 years ago
Tao Luo de8aaf6c00 Merge pull request #4192 from qingqing01/fix_prelu
8 years ago
qingqing01 5882c1f6f0 Remove test_prelu_op since it failed and will be fixed later.
8 years ago
Xinghai Sun 19de8ae141 Fixed a error in mnist unitest.
8 years ago
ranqiu fe2c5936d9 Update annotation of layers.py
8 years ago
ranqiu 93e0183662 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into r-doc
8 years ago
Luo Tao 1b01f1ea7b implement framework of seq_pool_op and its unitest
8 years ago
Xinghai Sun d8046da0cd Use soft_label attribute for cross-entropy.
8 years ago
Xinghai Sun c7f91a94ec Merge pull request #3817 from xinghai-sun/dropout
8 years ago
wanghaoshuang a4b1abe5c4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_op
8 years ago
dangqingqing fad48fa6b1 Add bool type for attr.
8 years ago
Xinghai Sun 8e7fe8cae5 Merge branch 'develop' into soft_label_cross_entropy
8 years ago
Xinghai Sun ffeeef82f3 Remove unnecessary mask operations in test phase for dropout operator.
8 years ago
Zhuoyuan f86c1ccdbe Merge pull request #4121 from zchen0211/develop
8 years ago
wanghaoshuang fa4908dc10 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_op
8 years ago
xzl a9a7ba3cff Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into op_transpose
8 years ago
xzl 9de45e113a fixed bug when dims.size == 1, modify the variable naming, add judgement when input_grad is null
8 years ago
Xinghai Sun a2798ff25f Merge branch 'develop' into dropout
8 years ago
gaoyuan 71b3fbb18a Fix a ssd bug
8 years ago
Tao Luo d4d4580d5e Merge pull request #4140 from tensor-tang/mkldnn_pool
8 years ago
zchen0211 154d88c261 fix gradient not stable
8 years ago
zchen0211 3c3a6d90ae prelu finalize
8 years ago
zchen0211 4a2378845e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
8 years ago
Xinghai Sun 585d12a307 Add is_training attr and testing phrase compuation to dropout operator.
8 years ago
hedaoyuan f3669ca3f1 Support input_grad = null or filter_grad = null.
8 years ago
tensor-tang cc28fb4bb3 Merge remote-tracking branch 'upstream/develop' into mkldnn_pool
8 years ago
ranqiu a0187f1c55 Update the annotation about bias_attr of layers.py
8 years ago
xzl 35967e8658 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into op_transpose
8 years ago
xzl 5ede6fd434 delete cuda impl, complete comments, modify variable naming
8 years ago
ranqiu 82bff6eee3 Update the annotation of layers.py
8 years ago
Liu Yiqun 466d48fd23 Check and only check the output varibles specified by self.outputs.
8 years ago
Yancey 56b1b70142 Split operator with CPU kernel (#4046)
8 years ago
ranqiu c2dea5a877 Update the annotation of layers.py
8 years ago
wanghaoshuang 8d9d537b9f remove op_test_util.py
8 years ago
wanghaoshuang 44224f4b5b remove gradient_checker.py
8 years ago
wanghaoshuang 3102a52a67 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into clip_op
8 years ago
wanghaoshuang a345b7195e 1. Add CUDA stream when launching kernel.
8 years ago