Paddle

Commit Graph

Author	SHA1	Message	Date
luotao1	b3cd2ae88b	Merge branch 'develop' into ner_ut2	7 years ago
Xin Pan	c709a04ae2	Merge pull request #13018 from panyx0718/name support op_namescope for better debugging	7 years ago
tensor-tang	596213906b	add gru seq mode forward	7 years ago
Yan Chunwei	af15f6f038	fea/refine fuse (#13076 )	7 years ago
Yu Yang	819af27d63	Merge pull request #13079 from reyoung/feature/fix_bug Fix bug when loss@GRAD is reused.	7 years ago
zenghsh3	d7495838b3	refine	7 years ago
zenghsh3	04a05d1d58	merged	7 years ago
luotao1	07cb64adc0	add unit-test for chinese_ner	7 years ago
qiaolongfei	583e21e60e	fix mac compie	7 years ago
zenghsh3	62f8d59d8d	Merge branch 'develop' of https://github.com/zenghsh3/Paddle into develop	7 years ago
zenghsh3	08b73b68c4	fix bug of sampling_id_op	7 years ago
Tao Luo	0148d8d2be	Merge pull request #13098 from luotao1/fix_mkldnn_include fix mkldnn include format	7 years ago
tensor-tang	b0d36c4c3d	add cross vec to speedup gru	7 years ago
Wu Yi	0b8067c0dc	fix dist train reduce mode (#13068 ) * fix dist train reduce mode * fix previous fix	7 years ago
yuyang18	555f03d8f0	Fix bug when loss@GRAD is reused.	7 years ago
luotao1	7169f9378c	fix mkldnn include format	7 years ago
Xin Pan	823c4f87be	Merge pull request #13058 from panyx0718/infer use fast RunPrepareContext for inference	7 years ago
Jiabin Yang	cceffca6bf	Update api_impl.cc	7 years ago
Jiabin Yang	a9be969cf9	Update attention_lstm_fuse_pass.cc	7 years ago
Jiabin Yang	5d5b70ad79	Update CMakeLists.txt	7 years ago
JiabinYang	7c7d3d6172	Fix mac	7 years ago
tensor-tang	038c16eed2	save intermediate data to out buffer	7 years ago
jerrywgz	1a88baae22	add rpn_target_assign api test (#13013 ) * Add unit test for rpn_target_assign API.	7 years ago
Xingyuan Bu	0a97d24b41	Faster RCNN Generate Proposal Labels (#12616 ) * Add generate_proposal_labels for Faster-RCNN.	7 years ago
Yan Chunwei	cfa6bbb755	move nodeid from graph to node (#13065 )	7 years ago
fengjiayi	d5f74b7308	use CudnnHolder in conv_transpose_cudnn_op	7 years ago
Yibing Liu	f88a8ba99a	Merge pull request #12793 from kuke/wrap_squeezes Wrap unsqueeze & squeeze ops	7 years ago
fengjiayi	15cc9128be	fix compile error	7 years ago
fengjiayi	407ff0bdbc	use CudnnHolder in conv_cudnn_op	7 years ago
fengjiayi	04bfd5c10c	add CudnnHolder to manage cudnn_handle and workspace	7 years ago
chengduo	3bd1d22a7d	Enhance fused_elementwise_activation_op (#12837 ) * Enhance the function of fused_elementwise_activation_op * enhance unit test * Clean Code And Add Doc * Add compound functors * Fix doc and enhance unit test * define Dx and Dy for d_binary_func * add mul_scale * add mul_scale * add elementwise_mul * code refine * code refine * add doc * add AsIntermediate	7 years ago
Xin Pan	5adf118ab5	polish	7 years ago
tensor-tang	2d0ddf8c41	refine cpu gru batch mode	7 years ago
tensor-tang	70d3981220	add cpu vec bias sub	7 years ago
Xin Pan	c558f059ad	fix	7 years ago
Xin Pan	4794d9cf70	use fast RunPrepareContext for inference	7 years ago
Xin Pan	7dff0b11c6	fix	7 years ago
tangwei12	515a756a3c	Merge pull request #13048 from TomorrowIsAnOtherDay/develop add python api for sampling_id op	7 years ago
Xin Pan	0aa8135b11	Merge pull request #13045 from panyx0718/ir5 fix graph_to_program_test to be platform-independent	7 years ago
Xin Pan	d1058003fe	fix style	7 years ago
Xin Pan	51ef0ad766	allow to use name_scope for debugging and visiualization	7 years ago
Chen Weihang	b98b744067	Merge branch 'develop' into sequence_enumerate_op	7 years ago
Wu Yi	9ae55dd70d	fix dist transpile with memopt (#12974 ) * fix dist transpile with memopt * update api.spec * polish dist transpile api * update apispec * update apispec	7 years ago
robot	31c90692f7	Merge branch 'develop' of https://github.com/TomorrowIsAnOtherDay/Paddle into develop	7 years ago
robot	e1f17a1e4f	add python op for sampling_id	7 years ago
Yan Chunwei	902f19b46a	fea/fuse attention lstm simplify.with fusion lstm.with sequnce expand (#13006 )	7 years ago
Xin Pan	c720ef972b	fix graph_to_program_test to be platform-independent	7 years ago
Xingyuan Bu	2ad5d91ef8	Faster RCNN Generate Proposals (#12056 ) * Add proposals generation operator for Faster-RCNN.	7 years ago
tensor-tang	89d6d69ce4	Merge pull request #12781 from tensor-tang/feature/op/fusion_gru add fusion gru	7 years ago
tensor-tang	d941192e74	fix gcc53 on cpu vec (#13020 )	7 years ago
Xin Pan	a08f6a17f0	Merge pull request #12886 from panyx0718/ir5 small fix of op_desc	7 years ago
tensor-tang	2328a69157	Merge pull request #13012 from tensor-tang/refine/seq2batch refine seq2batch	7 years ago
Xin Pan	ec5cb3762c	Merge branch 'develop' into ir5	7 years ago
Xin Pan	2bb15f437c	Merge pull request #12791 from panyx0718/ir3 graph to program pass	7 years ago
Qiao Longfei	a22309afe8	clean useless check code in auc_op (#13023 )	7 years ago
Yu Yang	8965cee89f	Polish PrintOp (#12895 ) * Polish PrintOp * Polish PrintOp * Polish PrintOp * Refine test_print_op	7 years ago
Yibing Liu	13509da69e	Merge upstream to branch wrap_squeezes	7 years ago
Yibing Liu	03f6292bef	Update API spec	7 years ago
chengduo	7ad39c4077	Enhance pad_constant_like_op (#12999 ) * enhance pad_constant_like_op * add API * add API	7 years ago
qingqing01	0353eddb51	Improve fake_dequantize_op. (#12877 ) * Improve fake_dequantize_op. * Follow comments.	7 years ago
Qiao Longfei	11e01d9b2d	Scale support selectedrows (#12960 ) * add ScaleOpVarTypeInference for scale op * scale op support scale selected rows * optimize code * use FindVar * use FindVarRecursive in ScaleOpVarTypeInference	7 years ago
fengjiayi	7b84c580e2	Merge pull request #12824 from JiayiFeng/dev_sequence_padding_op Sequence pad op	7 years ago
tensor-tang	fd4f7c3ab5	refine seq2batch	7 years ago
Wu Yi	0ee6fed05b	Refine dist rpc deps (#12899 ) * refine dist train RPC deps * clean up * clean up * fix ut * remove input for fetch_barrier * follow comments	7 years ago
Zhaolong Xing	3a0b6f97cc	Merge pull request #12942 from NHZlX/add_concat_op_converter Add concat op converter	7 years ago
fengjiayi	7e0c9f50ae	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_sequence_padding_op	7 years ago
Wu Yi	9b2b49ff26	test fix release branch api check (#12977 ) * test fix release branch api check * fix reviews > 30 * check approval after test, check api diff before test	7 years ago
Zeng Jinle	599a32641b	Merge pull request #12971 from sneaxiy/unstack_op Add unstack op	7 years ago
Tao Luo	0b77518f3a	Merge pull request #12982 from Sand3r-/mutable-data-check Enforce requested size of tensor to be sufficiently large	7 years ago
Tao Luo	26cac36bfd	Merge pull request #12515 from kbinias/kbinias/bnorm-fwd-reuse Reusing primitives for forward Batch Norm operator	7 years ago
Michal Gallus	b6d261dff5	Enforce requested size of tensor to be sufficiently large	7 years ago
tensor-tang	a481c5e98c	Merge remote-tracking branch 'ups/develop' into feature/op/fusion_expand_concat_fc	7 years ago
tensor-tang	49c31febb5	fix typo and op test	7 years ago
fengjiayi	9cb455fa7d	update function	7 years ago
Krzysztof Binias	fb4b4f8d57	Refactor code	7 years ago
Krzysztof Binias	50d3e6e96b	Reusing primitives for forward Batch Norm operator	7 years ago
Zeng Jinle	ef7bd03a03	Merge pull request #12964 from sneaxiy/fix_concat_sync Fix concat bug	7 years ago
sneaxiy	52a480bb98	Merge develop	7 years ago
tensor-tang	02909335e9	rename fusion seq_concat_fc to fusion seqexpand_concat_fc	7 years ago
Xin Pan	cf547e2714	fix program_desc feed/fetch names' order.	7 years ago
Xin Pan	08352fe56a	fix	7 years ago
Xin Pan	6fdb7f5348	add test	7 years ago
Xin Pan	880cb8c4c3	clean	7 years ago
Xin Pan	1a67061fee	graph to program pass fix a few other things	7 years ago
qingqing01	1f09bc320c	Support data type int8_t . (#12841 ) * Support int8 type.	7 years ago
chenweihang	0c4697f8cd	fix: change to enumerate by sentence	7 years ago
tensor-tang	c45cee0349	refine infershape and forward	7 years ago
sneaxiy	33b4def10a	add api.spec	7 years ago
sneaxiy	24264bc0b8	Merge develop	7 years ago
dzhwinter	0153c21d83	add unstack_op	7 years ago
tensor-tang	c7c2506733	add forward implementation	7 years ago
nhzlx	478eeabdd4	refine uttest of api_tensorrt_subgraph_engine	7 years ago
chengduo	3e1050a2e8	Add pad_constant_like_op (#12943 ) * Add pad_constant_batch_size_like * refine pad_op * optimize memory	7 years ago
dzhwinter	6cc7870517	fix concat synchronization bug	7 years ago
tensor-tang	954b0e113f	init fusion seq expand concat fc op	7 years ago
nhzlx	791aa7f49d	merge develop	7 years ago
dzhwinter	d361624c1d	platform module (#12932 ) * platform module * Update profiler.h	7 years ago
tensor-tang	c488ee96a7	Merge remote-tracking branch 'ups/develop' into refine/op/fusion_lstm	7 years ago
tensor-tang	e61cf3214d	complete reverse seq	7 years ago
Chen Weihang	4ec12496dd	Merge branch 'develop' into sequence_enumerate_op	7 years ago
tensor-tang	4b28fab8c9	enable more acts	7 years ago
tensor-tang	607c41952e	compute gates	7 years ago
dzhwinter	4fcc293617	memory module (#12931 ) * memory module * "fix ci"	7 years ago
Qiao Longfei	3c58b87b45	fix auc layer and add check for auc op (#12954 ) * fix auc layer and add check for auc op * use input to check if states are inited * optimize code	7 years ago
Yibing Liu	43b2438aee	Merge branch 'develop' of upstream into wrap_squeezes	7 years ago
Yibing Liu	c1488b1796	Merge pull request #12940 from sneaxiy/stack_op Speedup stack_op	7 years ago
Yibing Liu	efa6f0df4e	Remove inplace args in unsqueeze & squeeze ops	7 years ago
Yibing Liu	26710ccd08	Update api spec accordingly	7 years ago
dzhwinter	04b1e4dcea	tensor module windows support (#12934 ) * tensor windows support * "fix ci" * "remove utils"	7 years ago
dzhwinter	c790d57cd4	data_type (#12933 ) * data_type * "remove tabs"	7 years ago
Qiao Longfei	d0b713493e	enhance DebugStringEx (#12949 )	7 years ago
dzhwinter	eca4563e5d	operators module (#12938 )	7 years ago
tensor-tang	6be273cbdb	add seq mode lstm	7 years ago
tensor-tang	36363292c3	Merge pull request #12904 from tensor-tang/refine/jit optimize cpu vec activations	7 years ago
nhzlx	368dbc2835	merge develop	7 years ago
nhzlx	3de4556659	concat op && map cnn model support	7 years ago
Zeng Jinle	d189d4dbab	Merge pull request #12884 from sneaxiy/sequence_mask_op Add sequence_mask_op for DAM model	7 years ago
sneaxiy	3b38e5a4fc	speed up stack_op	7 years ago
Qiyang Min	038e2817f0	Merge pull request #12836 from velconia/disable_prelu_test_local Disable prelu_op_test until fixing Python3 issues	7 years ago
tensor-tang	7bdaf09664	Merge remote-tracking branch 'ups/develop' into refine/jit	7 years ago
Tao Luo	989cc2a4f4	Merge pull request #12913 from luotao1/concat enhance the forward of concat op	7 years ago
Tao Luo	8650f6ffae	Merge pull request #12898 from luotao1/expand remove broadcast in sequence_expand	7 years ago
Qiao Longfei	52948a0b50	Merge pull request #12909 from jacquesqiao/fix-sparse-update-bug fix sparse update bug	7 years ago
tensor-tang	ba943d38e3	make runtime avx act	7 years ago
tensor-tang	3462c29940	refine add bias with avx	7 years ago
tangwei12	ef6445ee39	Merge pull request #12908 from seiriosPlus/fill_constant_selectedrows add SelectedRows support in fill_constant_op	7 years ago
tensor-tang	bb9f98e10d	add inplace test	7 years ago
tensor-tang	f269614bcd	further optimize tanh with avx and mkl	7 years ago
chenweihang	733ea0d29b	adjust infershape details	7 years ago
luotao1	e999c74cff	Merge branch 'develop' into concat	7 years ago
luotao1	b61cf7ac4f	Merge branch 'develop' into expand	7 years ago
luotao1	2b4edacca0	enhance the forward of concat op	7 years ago
Tao Luo	3e3b5f4fda	Merge pull request #12675 from Sand3r-/fix-conv-mkldnn-0.15 Update MKLDNN to 0.15, fix convolution integration	7 years ago
tensor-tang	7a4924cd44	further optimize sigmoid with avx and avx512	7 years ago
Xin Pan	abeb71c895	small fix of op_desc	7 years ago
qiaolongfei	fcf20eed0f	fix sparse update bug	7 years ago
tangwei12	ca22586818	code optimize (cherry picked from commit 587cca7)	7 years ago
Xin Pan	557be6fc58	Merge pull request #12902 from PaddlePaddle/revert-12736 Revert "Disable in_place in batch_norm API. (#12736)"	7 years ago
tensor-tang	6bd89ba5b6	fix typo	7 years ago
minqiyang	6c775a0a06	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into disable_prelu_test_local	7 years ago
Chen Weihang	2969aba14f	Merge branch 'develop' into sequence_enumerate_op	7 years ago
chenweihang	219a2369da	feat: wrap sequence enumerate op	7 years ago
tensor-tang	e3bb98eb38	optimize relu with avx and avx512	7 years ago
minqiyang	0eccd59425	Keep APISpec the same with Python2	7 years ago
guochaorong	1f270275a6	Revert "Add Python Callstacks when Op::Run error (#12759 )" This reverts commit `b2df17003f`.	7 years ago
guochaorong	b1fc238694	Revert "Disable in_place in batch_norm API. (#12736 )" This reverts commit `f5d5d7b2d9`.	7 years ago
tensor-tang	25976fe736	optimize the sigmoid and tanh	7 years ago
minqiyang	3fd169daed	Resume all tests	7 years ago
tensor-tang	2eb46c2b06	add cpu vec test	7 years ago
sneaxiy	1083e99520	Merge develop	7 years ago
tensor-tang	0d46f518ae	refine avx condition and warning	7 years ago
minqiyang	23bfdf9987	Port APISpec check	7 years ago
tensor-tang	f0f06992c1	Merge pull request #12878 from tensor-tang/feature/op/attention_lstm Add attention lstm cpu forward	7 years ago
luotao1	83f4edabe9	remove broadcast in sequence_expand	7 years ago
sneaxiy	5ea7bf88ba	Merge pull request #12872 from sneaxiy/stack_op Add stack_op for DAM model	7 years ago
minqiyang	2aac36b3f9	For test	7 years ago
Tao Luo	ef2da86b4f	Merge pull request #12885 from luotao1/test_ditu_rnn enhance test_analyzer to profile ditu inference demo	7 years ago
minqiyang	13686c4474	Change to debug case	7 years ago
sneaxiy	e895c98f0a	add support to max_len is None	7 years ago
fengjiayi	0fb5e351c1	update API.spec	7 years ago
minqiyang	709c37023a	Polish code	7 years ago
fengjiayi	f4a4a4cbd9	add op comment and python layer	7 years ago
tangwei12	acdd95d5ca	bug fix	7 years ago
chenweihang	d2e5395b97	feat: add sequence enumerate op	7 years ago
luotao1	9c7fde45a7	enhance test_analyzer to profile ditu inference demo	7 years ago
chengduo	8ad9055804	Add is_test for while_op (#12874 ) * add is_test for while_op * Change API	7 years ago
minqiyang	80e3ce411d	For test	7 years ago
Wu Yi	c6f212a339	Merge pull request #12847 from typhoonzero/add_scope_info_in_graphviz Add scope info in graphviz debug	7 years ago
Wu Yi	b8da70c370	Resovle multi gpu async deps (#12828 ) * dist transpiler add control dependency var between send and recv * fix async deps * follow comments and refine * fix deps connect for rpc ops	7 years ago
sneaxiy	64464cb1fa	Merge develop	7 years ago
qingqing01	79918a8442	add sequence_mask_op for DAM model	7 years ago
Xin Pan	4a4567fc3d	Merge pull request #12876 from panyx0718/ir5 refine Node	7 years ago
Yu Yang	b2df17003f	Add Python Callstacks when Op::Run error (#12759 ) * Add Python Callstacks when Op::Run error * Skip op with sub-block * refactor: refine callstack info's format * Reshape only support matrix * Polish Python code * Fix UT * Fix Py3	7 years ago
Yu Yang	17fcc4f5d0	Merge pull request #12864 from reyoung/feature/process_lod_grad Feature/process lod grad	7 years ago
Xin Pan	698c926ce5	copy program and fix op_desc	7 years ago
minqiyang	8b8f6487d9	Add debug info for fetch feed	7 years ago
tensor-tang	4e538db14d	refine jit space	7 years ago
tensor-tang	5ca0bb9aad	support more activation type and remove some comments	7 years ago
sneaxiy	ba168bd2d2	modify API.spec	7 years ago
tensor-tang	d9bf73f3ab	Merge remote-tracking branch 'ups/develop' into feature/op/fusion_gru	7 years ago
tensor-tang	dd938d0b94	fix bugs and pass op test	7 years ago
tensor-tang	ec59f0d454	add cpu vec	7 years ago
tensor-tang	cf5ea925c3	fix bugs	7 years ago
tensor-tang	6ed20474d4	refine attention lstm infershape	7 years ago
tensor-tang	508548f897	implement attention lstm cpu forward	7 years ago
tensor-tang	9affc36c89	init attention lstm	7 years ago
tensor-tang	3dd66390b2	add blas vexp	7 years ago
tensor-tang	0ec1f65cf1	fix blas dot and add cblas scal	7 years ago
tensor-tang	a2203d0466	add cblas dot	7 years ago
tensor-tang	f72ab8961e	refine blas gemm	7 years ago
qingqing01	f5d5d7b2d9	Disable in_place in batch_norm API. (#12736 ) * Disable in_place in batch_norm API.	7 years ago
sneaxiy	c73c5ed573	use for_range	7 years ago
Xin Pan	b548ecbc2b	add stack_op	7 years ago
Yu Yang	eb8fd853bc	Fix sequence_softmax_cudnn op	7 years ago
Yu Yang	3768677980	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/process_lod_grad	7 years ago
Tao Luo	decda738b0	fea/anakin compile with demo (#12772 ) * anakin support x86 * fix code style * add anakin ditu cnn demo * add timer * add rnn * fix inference_anakin_cnn/rnn_test compile error * make anakin_rnn_tester run * add anakin_enable_op_time option * update api/CMakeLists.txt * enlarge the max_batch_size in anakin.config * update with comments	7 years ago
Yu Yang	2a36ad1a96	Handle LoD for concat & seq_softmax ops	7 years ago
Yu Yang	211d81863d	Process elemwise grad op's lod. mul_op's lod	7 years ago
Yan Chunwei	9ee698e605	enhance/ditu rnn with fc fuse (#12831 ) * make fc fuse work with ditu rnn * add ditu rnn data download to CMAKE	7 years ago
Xin Pan	78415f326d	Merge pull request #12838 from panyx0718/infer speed up while_op	7 years ago
fengjiayi	ce182d9037	bug fix	7 years ago
Xin Pan	a2c0e52f3e	speed up while_op	7 years ago
typhoonzero	dd7a79158b	add scope info in graphviz debug	7 years ago
tensor-tang	6f78fd7d1e	fuse fc in gru	7 years ago
tensor-tang	300180cc26	init fusion gru op	7 years ago
Zhaolong Xing	21ba32b065	Merge pull request #12843 from NHZlX/fix_ssa_bug_for_trt fix ssa bug with batch_norm and refine the trt	7 years ago
Michał Gallus	cd32ddac12	Fuse Convolution and Eltwise Add into MKLDNN's Conv+Bias (#12669 ) * Fuse Convolution and Eltwise Add into Conv+Bias * Reduce bias branching at conv_mkldnn_op * Add MKLDNN build checks for Conv Bias * Conv-bias: check if bias input exist befor assignment * Conv-bias: Remove Bias dim check from infershape It was causing conv3d test to crash upon calling HasInput(Bias)	7 years ago
nhzlx	c999895e93	merge develop	7 years ago
nhzlx	276950291a	1. fix ssa bug with batchnorm, 2. refine the trt	7 years ago
Yan Chunwei	896a37b6e3	fea/link ir to inference analysis and fc fuse support (#12789 ) * link IR graph to analysis graph * add clean code and update * add infer_clean_pass * add ir_pass_manager * support fc fuse executation * fix ir circle	7 years ago
dzhwinter	e23ddf6ae4	status (#12764 )	7 years ago
Tao Luo	d04ef276a5	Merge pull request #12745 from tensor-tang/refine/op/elewise_mul Refine elementwise mul cpu forward	7 years ago
tangwei12	cbc6e6eb97	Merge pull request #12247 from seiriosPlus/dis_ckpt_fix add load slice_vars in io.py	7 years ago
Qiyang Min	72965226e6	Merge pull request #12818 from velconia/fix_python3_CI_job Fix python3 CI job	7 years ago
minqiyang	656c77e712	Resume cicheck	7 years ago
minqiyang	e1492f19e1	Change the sequence of ci check	7 years ago
tangwei12	44bade8b17	fix api spec	7 years ago
Zhaolong Xing	470335e8c4	Merge pull request #12786 from NHZlX/add_batch_norm_trt_converter Add batch norm trt converter	7 years ago
Qingsheng Li	3d11d018e0	Fix scatter_op python API (#12742 ) * Fix scatter_op python API and remove inconsistency between implementation and doc * API spec change * Change as review comment	7 years ago
nhzlx	ff052c0e6f	merge develop	7 years ago
nhzlx	c6a5c4b0c0	add comments for execute in ut_helper	7 years ago
minqiyang	50d66a0790	Fix prelu_op	7 years ago
minqiyang	beb93bb901	Fix ut bug for graph_test Port dist_transpiler new added codes Port ut for clone desc	7 years ago
Tao Luo	8f9f414a14	Merge pull request #12805 from tensor-tang/fix/op/elewise_add fix SEGV element wise add at debug mode	7 years ago
tensor-tang	e955361267	Merge pull request #12737 from tensor-tang/feature/op/fusion_lstm add fusion lstm	7 years ago
tensor-tang	82bb9170fb	Merge remote-tracking branch 'ups/develop' into fix/op/elewise_add	7 years ago
tangwei12	99f74be561	Merge pull request #12802 from seiriosPlus/inference_teeny_mistakes fix some teeny mistakes	7 years ago
Tao Luo	2ae885e224	Merge pull request #12811 from luotao1/tensorrt_compiler_bug fix tensorrt compiler bug	7 years ago
Chen Weihang	57b34d9196	Merge pull request #12808 from chenwhql/remove_inplace_param_in_squeeze_and_unsqueeze Refactor: remove inplace parameter from squeeze and unsqueeze op	7 years ago
Xin Pan	daf464af68	Merge pull request #12807 from panyx0718/fix fix program_desc constructor	7 years ago
luotao1	808e5b1748	fix tensorrt compiler bug	7 years ago
Yihua Xu	084d4a9e9e	Optimize CRF Decoding with AVX/AVX2/AVX512F instruction (#12767 ) * Optimize CRF decoding with AVX/AVX2 instruction * Enable the AVX2 flags for compiling * Clean the code and decrease the count of multiply calculation * Add the support of AVX512 instruction to optimize CRF Decoding * Clean the code * Enable the AVX512f flags for compiling * Clean the code for the invaluable switch * Fixed the issue to check AVX512F status * Clean the code * Add some explanation of the key points	7 years ago
fengjiayi	34b209cffa	Complete sequence_padding GPU kernel	7 years ago
dzhwinter	00463fdfe3	cudnn windows support (#12757 ) * cudnn widndows * "add comment" * "windows support" * "fix cmake error"	7 years ago
Xin Pan	4a4c469f61	add test	7 years ago
qingqing01	c62f68cb94	Fix bug in conditional_block_op. (#12246 ) * Fix bug in conditional_block_op. * Fix bug and add comments. * Rename arguments.	7 years ago
nhzlx	1bf9d9e90c	fix comments	7 years ago
chenweihang	bc471b6ac4	refactor: remove inplace parameter from squeeze and unsqueeze op	7 years ago
Xin Pan	7473d5f735	fix program_desc constructor	7 years ago
tensor-tang	0507f7bc3c	fix SEGV elementwise add at debug mode	7 years ago
tangwei12	cfb12f09bf	fix some teeny mistakes	7 years ago
Yu Yang	c6af7201e9	Merge pull request #12692 from reyoung/feature/fast_executor Feature/fast executor	7 years ago
Xin Pan	e525aa232e	Merge pull request #12780 from panyx0718/ir4 fix ProgramToGraph	7 years ago
Tao Luo	7decbaaa13	Merge pull request #12762 from luotao1/anakin_cuda_env disable anakin when cuda < 8.0 or cudnn < 7.0	7 years ago
nhzlx	324dd16816	merge develop	7 years ago
yuyang18	b8029fd650	Follow comments	7 years ago
tangwei12	ca1e18c04a	Merge pull request #12469 from seiriosPlus/sum_op_dim_fix sum_op selectedRows dim bug fix	7 years ago
Xin Pan	1d3343240e	fix	7 years ago
nhzlx	144b20c160	add batch norm op converter	7 years ago
nhzlx	14311bb094	merge develop	7 years ago

10520 Commits (5b5fa37fb98bfa05f23e5ad508f6dbf3e7ec9f93)