Paddle

Commit Graph

Author	SHA1	Message	Date
jerrywgz	90f39b1123	Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign	6 years ago
Tao Luo	f8874b3cb2	Merge pull request #13884 from luotao1/rename_inference_lib_dist rename inference_lib_dist to fluid_lib_dist	6 years ago
Xin Pan	288a112ffd	Revert "Revert "Revert "Make variable::GetMutable robust"""	6 years ago
sneaxiy	5cedfb60c8	test=develop	6 years ago
Yibing Liu	b785798585	Expose layer's name for sequence pad & unpad test=develop	6 years ago
Yibing Liu	18e1c1e07d	Update API spec for seq slice test=develop	6 years ago
jerrywgz	5e52dafda5	add roi align	6 years ago
jerrywgz	c0e34eebec	add roi align	6 years ago
Tao Luo	c26f2b21eb	Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen rewrite conv_bn fuse pass fully to eigen	6 years ago
Yibing Liu	16b2c6dc78	Add py api for sequence_slice_op test=develop	6 years ago
superjomn	1cfd2b51a7	update test=develop	6 years ago
Xin Pan	fededdda20	Merge pull request #13872 from panyx0718/fix2 Revert "Revert "Make variable::GetMutable robust""	6 years ago
Qiao Longfei	b16e9cd105	a small fix for compile WITH_INFERENCE=OFF (#13869 ) test=develop	6 years ago
Qiao Longfei	ec25a09bd5	revert unused change test=develop	6 years ago
Qiao Longfei	60030e8678	change the use of FLAGS_reader_queue_speed_test_mode test=develop	6 years ago
Tao Luo	323d67cfc1	Merge pull request #13879 from panyx0718/doc fix doc	6 years ago
Sylwester Fraczek	50c5e9b0c6	reshape_2d used from ddim.h test=develop	6 years ago
Qiyang Min	cab29828a5	Merge pull request #13829 from velconia/accelerate_sequence_pool_op Accelerate SequencePool Op on SUM mode of CPU	6 years ago
minqiyang	aeec82acd5	Add unittest for reshape op test=develop	6 years ago
Xin Pan	ddb76d0d09	Make GetMutable more robust test=develop	6 years ago
Qiyang Min	c2842377ce	Merge pull request #13837 from velconia/add_pyramid_dnn_support Add margin_rank_loss_op to python	6 years ago
minqiyang	24c9fbdba3	Polish code test=develop	6 years ago
chengduo	2c9839c847	add cuda version display (#13885 ) test=develop	6 years ago
sneaxiy	d3ed070e10	test=develop	6 years ago
minqiyang	d9b202e717	Move tensor copy src_ptr and dst_ptr check to TensorCopy function test=develop	6 years ago
sneaxiy	fb6201e93e	test=develop	6 years ago
chengduo	8e2fdc54b1	Add check for opt op (#13840 ) * add check for opt op * fix opt op test=develop * fix test fail test=develop * fix optimization doc test=develop * test=develop	6 years ago
Yibing Liu	46e61d81a7	Wrapper py api for sequence_unpad test=develop	6 years ago
superjomn	28459592cc	update test=develop	6 years ago
Qiyang Min	e37c9e6732	Merge pull request #13828 from velconia/accelerate_selected_rows_functor Accelerate SelectedRows Functors:	6 years ago
Tao Luo	3d976f3f18	rename inference_lib_dist to fluid_lib_dist test=develop	6 years ago
sneaxiy	84d9300365	test=develop	6 years ago
chengduo	2562eb92b8	Add strategy doc (#13849 ) * add strategy doc test=develop * fix doc test=develop * add ParallelExecutor arg doc test=develop	6 years ago
Xin Pan	b11372a0af	fix doc test=develop	6 years ago
Zhaolong Xing	7413fa458f	Merge pull request #13838 from NHZlX/add_trt_pad_op Add trt pad op converter	6 years ago
superjomn	e2bd40ca82	update test=develop	6 years ago
dzhwinter	8329a1f139	add sparse update momentum. test=develop	6 years ago
superjomn	049fcbe125	update test=develop	6 years ago
superjomn	f5c0221c17	clean CreatePaddlePredictor test=develop	6 years ago
Xin Pan	d852be7c48	Revert "Make variable::GetMutable robust"	6 years ago
Qiao Longfei	c61e16b181	add reader_queue_speed_test_mode_flag test	6 years ago
Qiao Longfei	8686f7c68e	add reader_queue_speed_test_mode flag for speed test	6 years ago
Xin Pan	ae8b1c32bf	Merge pull request #13821 from panyx0718/fix Make variable::GetMutable robust	6 years ago
Xin Pan	9b3e0df375	Merge pull request #13819 from panyx0718/doc Explain LoD and a few other concepts	6 years ago
Zhaolong Xing	44f37d019a	Merge pull request #13848 from NHZlX/fix_demo_ci_with_trt fix demo_ci bug in vis_demo.cc	6 years ago
Sylwester Fraczek	55d6950a1a	rewrite conv_bn fuse pass to eigen test=develop	6 years ago
chengduo	9c77b65c06	Fix layers.uniform_random (#13823 ) * fix layers.uniform_random * fix uniform_random test=develop * remove var type set test=develop * fix similar error test=develop	6 years ago
nhzlx	320c78e16f	fix commets test=develop	6 years ago
Yibing Liu	cbe4292516	Add sequence unpad op test=develop	6 years ago
tensor-tang	bcb8ea397d	Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole test=develop	6 years ago
tensor-tang	8e182170ba	refine and replace lstm peephole kernel	6 years ago
nhzlx	efa5bac7ad	fix demo_ci bug in vis_demo.cc test=develop	6 years ago
tensor-tang	dc5a7b906d	fix default number of threads when inference with or without MKLDNN test=develop	6 years ago
Xin Pan	228506618b	Avoid GetMutable implicitly reset Var Type. This can cause a lot of problem: 1. Wrong operator implementation, Op can get a wrong type without failure. 2. Anytype can be Get without defined in VarType. Also fix wrong STEP_SCOPE usage. test=develop	6 years ago
jerrywgz	3c963336e4	fix roi pool register	6 years ago
Dun	5f2e837847	optimize depthwise conv by register memory (#13778 ) * optimize depthwise conv by register memory * test=develop	6 years ago
minqiyang	3f6ec90060	Polish code test=develop	6 years ago
minqiyang	9878eedbaa	Change API.spec test=develop	6 years ago
Qiao Longfei	5428cb9908	Profiler support merge data of all thread (#13811 ) * profiler infor merge thread statistic information * update profiler * fix bug * add merge thread msg to report * optimize report * statistic the time of ops in each thread but not all * optimize report format * optimize profile report * optimize profile report test=develop	6 years ago
nhzlx	bf7a2789a0	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_trt_pad_op test=develop	6 years ago
nhzlx	0cb88c34be	add op converter	6 years ago
tensor-tang	7ef2699e18	init peephole runtime kernel	6 years ago
Qiyang Min	f99ea99e36	Merge pull request #13720 from velconia/fix_grad_clip Merge selected_rows for clip_by_norm op	6 years ago
minqiyang	f40848828d	Polish code test=develop	6 years ago
minqiyang	e2e82bde32	Accelerate Reshape op	6 years ago
minqiyang	0385b0a1ea	Accelerate SequencePool Op on SUM mode test=develop	6 years ago
minqiyang	8ec748cfa0	Accelerate SelectedRows Functors: 1. Accelerate SelectedRows MergeAdd functor 2. Add SelectedRowsSumTo functor to support MergeAdd multiple SelectedRows into one test=develop	6 years ago
Xin Pan	63b2e98f3d	Explain LoD and a few other concepts test=develop	6 years ago
Tao Luo	9b11a17502	Revert "[MKLDNN] Pass: Fuse Conv + Bias"	6 years ago
Tao Luo	ce248a15d9	Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass [MKLDNN] Pass: Fuse Conv + Bias	6 years ago
whs	7e651c8641	Fix truncated norm (#13785 ) * Fix truncated normal. * test=develop	6 years ago
Tao Luo	16b1beb244	Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass Sfraczek/conv bn fuse pass	6 years ago
Zhaolong Xing	5d5587fff7	Merge pull request #13792 from NHZlX/trt_dy_lib add trt api lib to paddle_fluid lib	6 years ago
Michal Gallus	40b17be4b0	Pass: Fuse Conv + Bias test=develop	6 years ago
minqiyang	1456b8ec7d	Add unittest for clip_by_norm_op with SelectedRows test=develop	6 years ago
Tao Luo	fd0dd07ab4	Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut Unit test to Fused_embedding_fc_lstm op	6 years ago
Sylwester Fraczek	3fcca40909	eigen sqrt fix and change 1e-5 to epsilon test=develop	6 years ago
Qiao Longfei	5fc305220c	Merge pull request #13787 from PaddlePaddle/revert-13637-optimize-opyreader Revert "optimize pyreader"	6 years ago
nhzlx	9445502f90	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib test=develop	6 years ago
chengduo	e1904ac2c8	Add doc (#13765 ) test=develop	6 years ago
nhzlx	d347ea689a	fix comments	6 years ago
chengduo	e1761709f8	Set the right shape of selected_rows (#13723 ) * set the right shape of selected_rows test=develop * enhance check * fix activation_op * remove cast * use ShareDimInfo replace SetDim and ShareLod * use ShareDimAndLod test=develop * follow comment test=develop * check whether the input has lod test=develop * Split ShareDimAndLod test=develop * checkout clip.py test=develop	6 years ago
tensor-tang	3ee8f2c6cf	thread local jit kernels test=develop	6 years ago
tensor-tang	9131a35676	replace the lstm compute with jitkernel test=develop	6 years ago
Qiao Longfei	9d087d5139	Revert "optimize pyreader" test=develop	6 years ago
tensor-tang	b55c247678	add lstm compute unit test	6 years ago
nhzlx	f3af90d121	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib test=develop	6 years ago
nhzlx	f569095084	add tensorrt api lib to paddle_fluid	6 years ago
qingqing01	6094a72308	Fix bug in reduce_op caused by PR #13534 (#13748 ) * Fix bug in reduce_op caused by PR #13534 * Fix output shape and enhance unit test. test=develop	6 years ago
Tao Luo	84a55155ec	revert with_fast_math to ON test=develop	6 years ago
Qiao Longfei	b1d5135ffb	Merge pull request #13637 from jacquesqiao/optimize-opyreader optimize pyreader	6 years ago
dzhwinter	a46e30aa6d	enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688 ) * "avoid copy back to cpu" * "add infinity support" * "fix ci" * "add cpu macro" * rerun ci; test=develop * "fix api" test=develop * test=develop * test=develop * test=develop * test=develop * test=develop	6 years ago
tensor-tang	2a00969165	optimize lstm jitkernel keq8 test=develop	6 years ago
tensor-tang	f2adaf1c3e	add vrelu and lstm kernel test=develop	6 years ago
Xin Pan	943e4deb23	Merge pull request #13750 from panyx0718/fix clean unused code and small optimize	6 years ago
Jacek Czaja	9f15d8817e	- Cleanup as suggessted by reviewers test=develop	6 years ago
Sylwester Fraczek	78f98294c2	conv bn fuse pass review fix review from hshen14 fix test=develop fix error in broadcast and code cleanup rename bias -> eltwise and added macro to shorten code formatting	6 years ago
Jacek Czaja	ae8b4717cc	- Cleaning fused_embedding_fc_lstm op test=develop	6 years ago
Jacek Czaja	fd31b54cf1	- Removed disabled code test=develop	6 years ago
Jacek Czaja	f9da2d6416	- Removed disabled diagnostic code test=develop	6 years ago
Jacek Czaja	809dbc5c17	- Added file for fused_embedded_fc_lstm_op unit test - Work in progress on unit test for fused_embedding_fc_lstm op - Added bias caching and ref x computing - Small update unit test - temporary storage - Fix to batchcompute - Cosmetic fixes - Style fixes	6 years ago
Tao Luo	75bd0f188b	Merge pull request #13754 from luotao1/fast_math disable EIGEN_FAST_MATH and use_fast_math	6 years ago
qiaolongfei	5238a7f5b9	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-opyreader test=develop	6 years ago
tensor-tang	e6d8aca3bf	refine code and fix	6 years ago
qingqing01	41e4f7ea0a	Optimize Topk when height is large. (#13710 )	6 years ago
xuezhong	65ed45a154	Merge pull request #13717 from chengduoZH/enhance_while Add nodes for Dyrnn	6 years ago
minqiyang	bcd8c2ccc3	Add unit test	6 years ago
Tao Luo	28889caea5	disable EIGEN_FAST_MATH and use_fast_math test=develop	6 years ago
tensor-tang	ea7dc9cbf6	Merge remote-tracking branch 'ups/develop' into fea/jitkernel test=develop	6 years ago
Xin Pan	d2079b1ddb	clean unused code and small optimize test=develop	6 years ago
tensor-tang	2513b2cc4e	fix bug vtanh	6 years ago
chengduoZH	e59ab42caa	add nodes for drnn test=develop	6 years ago
Xin Pan	ab798a2832	clarify the fraction_of_gpu_memory flag test=develop	6 years ago
Tao Luo	d770b9bda3	Merge pull request #13663 from luotao1/resnet50_ut add resnet50 inference unit-test	6 years ago
dzhwinter	32c260cd1f	"fix operators cmake" (#13581 ) * "fix operators cmake" * "rerun ci" test=develop	6 years ago
Tao Luo	6ef6a9180a	Merge pull request #13727 from Sand3r-/mgallus/enable-mkldnn-naive-exe Enable MKL-DNN in Naive Executor and Analysis Predictor	6 years ago
minqiyang	f20fc95539	Resize output ddims and rows	6 years ago
qiaolongfei	91756a5a90	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-opyreader	6 years ago
Zeng Jinle	8cd17c04c1	Merge pull request #13718 from sneaxiy/fix_eager_deletion Fix eager deletion bug in fetch_op_handle	6 years ago
Michal Gallus	09d9d77a8f	Enable MKLDNN in Naive Executor test=develop	6 years ago
tensor-tang	5ef14dd386	Merge pull request #13715 from tensor-tang/fix/op bugfix fusion lstm and gru batch,seq mode switch	6 years ago
minqiyang	67308822f8	Add selected_rows merge for clip_by_norm op test=develop	6 years ago
sneaxiy	9606b37ce4	test=develop	6 years ago
Xin Pan	c0dfd5eee8	Merge pull request #13712 from luotao1/doc_element refine elementwise doc	6 years ago
tensor-tang	ea0b98e007	bugfix: fusion lstm and gru batch,seq mode switch test=develop	6 years ago
Tao Luo	69ed75e77c	refine elementwise doc test=develop	6 years ago
Tao Luo	49cd43000c	Merge pull request #12981 from chenwhql/sequence_erase_op_infershape_note_polish Doc: unified infershape format	6 years ago
tensor-tang	cf8c8e72bd	add vtanh and unit test	6 years ago
Tao Luo	067b8582af	Merge pull request #13625 from wanghaoshuang/fix_warning_roi Fix warning of roi perspective transform op	6 years ago
tensor-tang	b37fe30417	Merge pull request #13690 from wangguibao/fix_cpu_lstm_compute_cc Avoid multiple definitions of lstm_compute_ctht when linking libpaddle_fluid.so	6 years ago
dzhwinter	26771f41ba	"fix compile error" (#13579 ) * "fix compile error" * "fix ci" * rerun ci test=develop * test=develop rerun ci	6 years ago
Yu Yang	186b2b138d	Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697 ) This reverts commit `21bb9e91fc`, reversing changes made to `3fa68dc101`. test=develop	6 years ago
Tao Luo	a89afd4c22	Merge pull request #13685 from luotao1/naive_cmake update libpaddle_fluid.a/so	6 years ago
tensor-tang	d10a9df7b8	add vaddbias and unit test	6 years ago
tensor-tang	3c8b651187	add vsigmoid avx implementations and unit test	6 years ago
luotao1	9cbf2023ab	rollback paddle_inference_helper.h to helper.h test=develop	6 years ago
sneaxiy	6f748a035d	test=develop	6 years ago
tensor-tang	55e44761fb	refine code and init vsigmoid	6 years ago
Xin Pan	2f5a7cc470	Merge pull request #13673 from panyx0718/infer hide all left over kwargs	6 years ago
wangguibao	1940bc2d83	Avoid multiple definitions of lstm_compute_ctht when linking libpaddle_fluid.so test=develop	6 years ago
sneaxiy	584c3f048f	fix sparse rmsprop	6 years ago
chengduo	d6747a9ac2	make check_graph choosable (#13674 ) test=develop	6 years ago
Tao Luo	824a82d728	Merge pull request #13672 from luotao1/gen_fluid_library reduce inference ci time	6 years ago
luotao1	d55d7e04fd	update libpaddle_fluid.so with zeroCopy test=develop	6 years ago
Xin Pan	425a882165	Merge pull request #13643 from panyx0718/ir2 clean up channel	6 years ago
luotao1	a989a4e7c2	refine paddle_inference_helper.h	6 years ago
Xin Pan	642905958a	fix compile error test=develop	6 years ago
Xin Pan	33b68fdf25	fix compile error test=develop	6 years ago
tensor-tang	ede4b230be	Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op Adding fused_embedding_fc_lstm op	6 years ago
Jiabin Yang	618b3297e6	Merge pull request #13668 from JiabinYang/mac/fix_ci_unittest09272 fix random fail on test_api_impl	6 years ago
Xin Pan	6746b1fdf3	add missing header test=develop	6 years ago
Dun	161c3e31f7	Optimization of Kernels that related to DeepLabv3+ (#13534 ) * refine reduce by cub * optimize KernelDepthwiseConvFilterGrad * optimize depthwise conv and reduce mean and reduce sum * fix bug: dilation * cuda arch and cuda 8 compatible	6 years ago
Xin Pan	5fb72d840a	add header test=develop	6 years ago
Xin Pan	ddd60581b7	clean up channel test=develop	6 years ago
tensor-tang	2d0ff6a3c2	add vexp and unit test	6 years ago
tensor-tang	b3c63f40fa	add vscal and unit test	6 years ago
Xin Pan	3d339797fb	clean use_mkldnn options Add API.spec test=develop	6 years ago
Tao Luo	cfbd71c223	reduce inference ci time test=develop	6 years ago
Tao Luo	83ca657f96	Merge branch 'develop' into resnet50_ut	6 years ago
tensor-tang	0987f2b4d9	add vadd unit test	6 years ago
Jacek Czaja	e202f33aa9	- Yet another clarification to comment test=develop	6 years ago
JiabinYang	358b386953	test=develop	6 years ago
tensor-tang	3d928d4f9d	refine and seepdup	6 years ago
qiaolongfei	c5292b181e	change py_reader_by_data to create_py_reader_by_data	6 years ago
Xin Pan	00ca94578c	Merge pull request #13657 from panyx0718/fix Current scope needs to be thread-safe for training	6 years ago
Zeng Jinle	1cbaf71a68	Merge pull request #13620 from sneaxiy/fix_api_kwargs2 Hide out params in elementwise layers and scale layer	6 years ago
Tao Luo	21ee30595b	clean some CMakeLists test=develop	6 years ago
dzhwinter	2d00e65819	namespace issue (#13543 ) * flags * "follow comment"	6 years ago
Jacek Czaja	1df69f7c9d	- Fix to comment test=develop	6 years ago
Tao Luo	b31905c54d	Merge branch 'develop' into resnet50_ut	6 years ago
Tao Luo	1dcd6ee532	add resnet50 inference UT	6 years ago
Xin Pan	2c4b8393ce	Merge pull request #13573 from velconia/fix_api Fix kwargs in clip and logical API	6 years ago
tensor-tang	77fc42d2d1	Merge remote-tracking branch 'ups/develop' into fea/jitkernel	6 years ago
Yu Yang	593ad763cd	refactor(op): polish generate_proposals_op Polish styles in generate_proposals_op. 1. inline lambda functions rathar than use std::function to save var. 2. add `static inline` to template functions .cc * Make them static to prevent generating symbols. * Make them inline to give compiler a hit inline them as possible. * Not if the function is not static, they cannot be inlined since the symbols should be exported. 3. add `static` to global functions in .cc * Make them static to prevent generating symbols. 4. Use Vector<uint64> instead manually manange storage between devices. 5. Prefer to use platform::ForRange, so we can optimize `ForRange` by just changing `for_range.h` if it is needed. 6. Do not change shape of inputs test=develop	6 years ago
Xin Pan	d24f1f0aa4	Current scope needs to be thread-safe for training scope's API modifies its internal state. And scope's API can be called from multiple threads during traing. Hence, we need locks to protect the scope's internal states. We can optimize it in the future. But the current solution is buggy. test=develop	6 years ago
Wu Yi	7a5f3f750b	Fix memory optimization with dist train (#13535 ) * show detail error log on ci * test * fix memopt and dist * update apispec * will fix different batch issue test=develop	6 years ago
Yan Chunwei	c8744d118d	fea/infer executor and concurrency performance issue bug fix (#13451 ) - add naive executor - fix concurrency performance issue	6 years ago
tensor-tang	2937314d8e	refine vmul and test	6 years ago
Dang Qingqing	f189bf6a42	Update API.spec test=develop	6 years ago
Dang Qingqing	e79ad2ea87	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantize_transpiler_update	6 years ago
qiaolongfei	1d618225a7	add py_reader_by_data to API.spec	6 years ago
tensor-tang	6c986e127a	fix macro and add vmul unit test	6 years ago
chengduo	5175b3cb2b	Add GraphChecker (#13580 ) * add GraphNum test=develop * add graph number check in parallelExecutor test=develop * fix transformer_model bug test=develop * fix graph num	6 years ago
Jacek Czaja	910cd415f2	- Disabled embedding_fc_lstm_fuse by defult and extended test_text_classification ot use new op	6 years ago
sneaxiy	31e67b9042	test=develop	6 years ago
minqiyang	7aa0247bd1	Regenerate API.spec test=develop	6 years ago
qiaolongfei	85ddb5c76e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-opyreader	6 years ago
Xin Pan	7cd2761736	Merge pull request #13416 from panyx0718/ir PassBuilder	6 years ago
minqiyang	4c89137427	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api	6 years ago
chengduo	43a3af86be	refine sgd_op (#13626 ) test=develop	6 years ago
qiaolongfei	accf3f7505	optimize pyreader	6 years ago
Xin Pan	2c01c2216a	Merge pull request #13531 from gongweibao/generator2 Hide kwargs	6 years ago
qingqing01	479ad4bb92	Merge branch 'develop' into quantize_transpiler_update	6 years ago
Yu Yang	0be1582df0	Merge pull request #13525 from reyoung/fix_mixed_vector Fix mixed vector	6 years ago
Jacek Czaja	d5114c60b0	- Reviewers suggesstions to fused_embedding_fc_lstm_op	6 years ago
Jacek Czaja	7ab5626dee	- Added initial pass for embedding-fc-lstm - Added draft of new operator - Added fused embedding fc lstm files - First time embedding_fc_lstm_fuse_pass was invoked in test_text_classification - Added Embedding pattern - Not crashing - Enabled draft of embedding_fc_lstm pass (does it job) - First working (Seqcompute only) version - Removed diagnostic comment - First enabling of BatchCompute - Disabling pass for embedding with is_sparse and is_distributed - Cosmetics - Style - Style	6 years ago
chengduo	4e81e22827	add op frequence (#13328 )	6 years ago
qingqing01	fd4c4df93d	Cuda speed for generate_proposals_op. (#13596 ) * Add CUDA implementation for generate_proposals_op. * Clean code. * Update code.	6 years ago
tensor-tang	8c69764d12	add vmul unit tests	6 years ago
tensor-tang	084893a9a9	add vadd kernel	6 years ago

... 2 3 4 5 6 ...

4125 Commits (b58957d9d792b8ec85ad460a02ecc1f13575e7cd)