Paddle

Commit Graph

Author	SHA1	Message	Date
liuwei1031	14fe9219dc	reset unexpected changes, test=develop	6 years ago
tensor-tang	31fd8ce1e1	Merge pull request #15375 from mozga-intel/mozga-intel/batch_norm_ngraph_operator Enable batch_norm operator for a ngraph engine	6 years ago
liuwei1031	b1f97a6fa9	fix security issue 27, 38 test=develop	6 years ago
liuwei1031	1413d83a0f	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop	6 years ago
Tao Luo	882e7ec480	fix generate doc error in activation ops test=develop	6 years ago
Gabor Buella	da9c94da33	Clang build fixes (#15628)
* Remove some superfluous std::move calls
The std::move triggered a build error (with -Werror):
```
[ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move]
            [this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
                            ^
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here
            [this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
                            ^~~~~~~~~~                          ~
1 error generated.
```
See: https://reviews.llvm.org/D7633
* Remove a superfluous lambda capture from framework/operator.h
```
[ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture]
                   [this](Variable* var) { return var; });
                    ^~~~
1 error generated.
```
Changing it to `return it->second;`, as is in the function below.
* Rethrow an exception (instead of copying it)
```
[ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move]
      throw exception;
            ^~~~~~~~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying
      throw exception;
            ^~~~~~~~~
            std::move(exception)
```
See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message.
* Remove an unused variable
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field]
  const Scope& scope_;
               ^
```
* struct ComputationOpHandle -> class ComputationOpHandle
```
[ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags]
class ComputationOpHandle;
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here
struct ComputationOpHandle : public OpHandleBase {
       ^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here?
class ComputationOpHandle;
^~~~~
struct
1 error generated.
```
* Fix name() methods under fluid/operators
```
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15:
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19:
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override]
  virtual const char* name() const = 0;
                      ^
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here
  virtual const char* name() const = 0;
                      ^
```
test=develop	6 years ago
Zeng Jinle	fea7f0de25	Merge pull request #15667 from sneaxiy/fix_decorator_signature Fix decorator signature error	6 years ago
kolinwei	acfe28d5eb	Merge pull request #15684 from PaddlePaddle/revert-15661-fix-cpu-broadcast Revert "cpu reduce mode did not need to broadcast params test=develop"	6 years ago
Xin Pan	d670d8ef1d	Merge pull request #15671 from cjld/fix_graph fix bug CreateControlDepVar duplicate name	6 years ago
乔龙飞 Qiao Longfei	6e0e706198	Revert "cpu reduce mode did not need to broadcast params test=develop"	6 years ago
Qiao Longfei	76c1378a70	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-lookup_table_grad-padding-index test=develop	6 years ago
Qiao Longfei	97b143fb49	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast test=develop	6 years ago
Qiao Longfei	29a4b21bc8	fix problem test=develop	6 years ago
Qiao Longfei	7b673bce6a	lookup_table_grad kernel should consider padding_idx test=develop	6 years ago
dzhwinter	381f2015a5	Merge pull request #15665 from dzhwinter/experiment/refactor_memory refactor optimize pass.	6 years ago
Qiao Longfei	ffd0d1d216	clean need_broadcast_var_ test=develop	6 years ago
Qiao Longfei	fbadd4b60c	follow comment test=develop	6 years ago
xuezhong	eeaa2066e5	add device info to tensor test=develop	6 years ago
xuezhong	9b24ac34dd	remove debug print test=develop	6 years ago
xuezhong	50b48400bb	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
xuezhong	c0b8fd7ca0	update lstmp op api spec test=develop	6 years ago
dzhwinter	04e9776aef	add details. test=develop	6 years ago
dzhwinter	b80bcbb4fd	Merge pull request #15660 from dzhwinter/enhance/memory add elementwise_xxx_grad for inplace optimize	6 years ago
mozga-intel	1198ccae6b	Enable batch_norm operator for a ngraph engine test=develop	6 years ago
xuezhong	58101e6d4d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
xuezhong	4921c2cd02	add api spec change test=develop	6 years ago
baojun	f4a0e68481	Fix ngraph compile WITH_DISTRIBUTE=ON (#15636 ) * fix compile issue with_distribute test=develop * simplified logic test=develop * use ngraph dependency test=develop * set cpu only test=develop * update test and eliminate fp16 test test=develop	6 years ago
Tao Luo	2fbfebc534	Merge pull request #15666 from Superjomn/fix/anakin-api-dependency fix anakin compile dependency	6 years ago
xuezhong	fb261793b9	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op test=develop	6 years ago
Dun Liang	1905f1a108	bug fix && test=develop	6 years ago
xuezhong	fb9a6a2bc6	pass test for lstm op test=develop	6 years ago
xuezhong	1abb0d835e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
sneaxiy	42f6d0f899	modify API.spec test=develop	6 years ago
Chunwei	d85c2e4e5c	fix anakin compile dependency test=develop	6 years ago
xuezhong	2ba256df40	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	6 years ago
dzhwinter	104d3b4e68	add details. test=develop	6 years ago
dzhwinter	3932cd6714	Merge remote-tracking branch 'origin/develop' into enhance/memory	6 years ago
Dun Liang	ceec13562c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_checkpoint test=develop	6 years ago
Dun Liang	bc92192747	Fix Pr #15296 test=develop	6 years ago
Qiao Longfei	76072261f8	fix compiler test=develop	6 years ago
Qiao Longfei	b99db0e2c2	cpu reduce mode did not need to broadcast test=develop	6 years ago
dzhwinter	94dd50c33f	add details. test=develop	6 years ago
Gabor Buella	4975a9050a	Tests - add some missing to_string calls
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: error: adding 'int' to a string does not append to the string [-Werror,-Wstring-plus-int]
    std::string prefix = "seqpool_op_" + i;
                         ~~~~~~~~~~~~~~^~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: note: use array indexing to silence this warning
    std::string prefix = "seqpool_op_" + i;
                                       ^
                         &             [  ]
1 error generated.
```
test=develop	6 years ago
Tao Luo	bec68fa0b3	Merge pull request #15637 from jerrywgz/refine_box_coder speed up box_coder in CPU	6 years ago
乔龙飞 Qiao Longfei	7ddf4e2c55	Merge pull request #15649 from jacquesqiao/fix-ctr-reader-svm fix ctr reader read svm data	6 years ago
peizhilin	883d22093a	fix the lib_any dependency test=develop	6 years ago
Qiao Longfei	fa77186fdc	fix ctr_reader_test test=develop	6 years ago
Qiao Longfei	2afe82fe83	fix ctr reader read svm data test=develop	6 years ago
Kaipeng Deng	9df7bc2c5a	fix exclusive pool doc. test=develop (#15632)	6 years ago
wopeizl	3614dadf23	Merge pull request #15631 from wopeizl/windows/fixci fix ci broken randomly and disable some warnings	6 years ago
tensor-tang	18bff5298d	extract fused_emb_seq_pool forward function test=develop	6 years ago
wopeizl	c1e18b13aa	Merge pull request #15635 from wopeizl/fixbuildissue fix the build issue on gpu mode for win	6 years ago
peizhilin	55510744b5	test=develop	6 years ago
peizhilin	061299be87	fix dependency test=develop	6 years ago
baojun	ac4cde009d	Enable accuracy op for ngraph engine (#15592 ) * Added accuracy ngraph op test=develop * fixed name type test=develop	6 years ago
dzhwinter	488719bac7	Enhance/memory optimize (#15634 ) * add skip send.recv test=develop * enhanced print message. test=develop * rerun ci. test=develop	6 years ago
Gabor Buella	2bf63f4c33	Fix std::abs usage in memory_optimize_pass.cc (#15627 ) test=develop size_t is an unsigned integer, with a conversion rank larger than int, therefore in the following expression the int value was promoted to size_t, making it a subtraction of unsigned values. The result of such a subtraction is also an unsigned value.	6 years ago
peizhilin	db563ec2cd	test=develop	6 years ago
dzhwinter	5d30b55de1	rerun ci. test=develop	6 years ago
kolinwei	6f0f8045f6	Revert "Async double buffered py reader"	6 years ago
jerrywgz	ceb412b0ae	speed up box coder in CPU, test=develop	6 years ago
dzhwinter	4ef34916a4	enhanced print message. test=develop	6 years ago
peizhilin	238ef94702	fix the build issue on gpu mode for win test=develop	6 years ago
xuezhong	e261b60f97	change api spec for adagrad optimizer test=develop	6 years ago
dzhwinter	ce0394bcd0	merge develop branch. test=develop	6 years ago
peizhilin	3a4110f960	fix ci broken randomly and disable some warnings test=develop	6 years ago
Dun	db8fcf6b95	Merge pull request #15296 from cjld/async_double_buffered_py_reader Async double buffered py reader	6 years ago
xuezhong	dff7461ea8	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
Zeng Jinle	1ccbc51218	Merge pull request #15619 from sneaxiy/debug Fix unittest failure in debug mode	6 years ago
Xin Pan	74bc55c2a6	Merge pull request #14975 from dzhwinter/ir_inplace_pass Ir inplace pass	6 years ago
dzhwinter	cca71532eb	add skip send.recv test=develop	6 years ago
dzhwinter	9f001c6525	skip dist. test=develop	6 years ago
xuezhong	2857dac260	add assert for clip and remove print	6 years ago
xuezhong	4028943125	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	6 years ago
Tao Luo	546eefae6f	Merge pull request #15591 from lidanqing-intel/lidanqing/density_prior_box optimize density_prior_box_op.h in detect model for cpu	6 years ago
sneaxiy	f26a1c9077	test=develop	6 years ago
Yan Chunwei	dc5e25fc7f	remove dot marked node (#15606 )	6 years ago
dzhwinter	2561a6fc59	follow comment. test=develop	6 years ago
dzhwinter	2a5ecb68b0	follow comment. test=develop	6 years ago
dzhwinter	9f693fcac4	rerun ci. test=develop	6 years ago
guoshengCS	b6c3b69af8	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-beam-search-size test=develop	6 years ago
dzhwinter	e537634d16	delete graph print pass. test=develop	6 years ago
dzhwinter	4f01de6378	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	6 years ago
dzhwinter	46a6cac91f	fix batch norm. test=develop (#15597 )	6 years ago
qingqing01	943d972878	Fix analysis predictor when loading the persistable RAW type variable. (#15613 )	6 years ago
liuwei1031	6e84eb131f	expose peak gpu memory API to python test=develop (#15529 ) * expose peak gpu memory API to python test=develop * add unittest for peak gpu memory monitoring test=develop * add pybind change test=develop * add mutex to gpu mem usage monitor test=develop * update benchmark flag definition file test=develop * tweak unittest for memory monitoring test=develop	6 years ago
dzhwinter	5cab99a686	fuck windows. rerun windows ci. test=develop	6 years ago
dzhwinter	9c9ad7d40b	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass test=develop	6 years ago
jerrywgz	1743d1a58f	Merge pull request #15356 from jerrywgz/add_clip_op Add box clip op	6 years ago
dzhwinter	0a63234c85	follow comments. test=develop	6 years ago
mozga-intel	43a67a2662	Enable conv2d operator for a ngraph engine (#15269 ) test=develop	6 years ago
tensor-tang	a6a1a92ef7	Merge pull request #15586 from tensor-tang/jit/cache refine bert	6 years ago
jerrywgz	4f18a9b87b	test=develop	6 years ago
guoshengCS	5dfce93101	To make CUDA_LAUNCH_KERNEL_HELPER support large size. test=develop	6 years ago
Yan Chunwei	e887d71958	fix ir debug config (#15571 )	6 years ago
Yan Chunwei	897789b16e	fix save_inferece_model bug (#15365 )	6 years ago
lidanqing-intel	4b3c6612a1	optimize density_prior_box_op.h for cpu test=develop	6 years ago
xuezhong	4c98c2ccc3	remove debug print	6 years ago
WangZhen	2175292634	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes test=develop	6 years ago
xuezhong	58ad40cc15	add sample_logits op	6 years ago
tensor-tang	ba02ac4692	use mat attr and refine test (#15448 ) * use mat attr and refine test test=develop * add matmul jitcode test=develop * fix mac compile test=develop	6 years ago
xuezhong	b0c75f1763	remove debug print	6 years ago
xuezhong	880836329d	add cell clip and proj clip, fix bug for h0	6 years ago
dzhwinter	9e87fbebb7	rerun windows ci. test=develop	6 years ago
dzhwinter	6f9904e99a	rerun windows ci. test=develop	6 years ago
jerrywgz	4eb44380a6	Merge branch 'develop' into add_clip_op	6 years ago
Xin Pan	30cc8b7a92	Merge pull request #15554 from heavengate/yolo_loss_darknet Yolo loss darknet	6 years ago
Tao Luo	1a252f4be6	Merge pull request #15587 from luotao1/bert use embedding=128 bert model for test	6 years ago
Haihao Shen	1b8047b712	Add INT8 calibration support in Paddle package (#15569 ) * Add INT8 calibration support in Paddle package; test=develop	6 years ago
mozga-intel	312500dcb5	Enable pool2d operator for a ngraph engine (#15395 ) * Enable pool2d operator for a ngraph engine test=develop * Update test=develop	6 years ago
Jiabin Yang	b4c24f3f7c	Merge pull request #15575 from JiabinYang/feature/imperative test=develop, polish code and fix some wrong change	6 years ago
dzhwinter	a52be7c081	refine build strategy. test=develop	6 years ago
Tao Luo	ea92905be4	Merge pull request #15478 from kbinias/kbinias/seperate-folders-for-mkldnn Make separate folders for mkldnn codes	6 years ago
dzhwinter	32a2014939	refine build strategy. test=develop	6 years ago
Yibing Liu	170842cbb4	Some improvements to support bert mixed precision training (#15585 ) * Some improvements to support bert mixed precision training test=develop * Revert the cast in layer_norm test=develop	6 years ago
Yiqun Liu	16d54f7f23	Return parent_idx in beam_search op (#15520 ) * Refine beam_search_op to output an extra parent_idx tensor. test=develop * Fix the unittest test_beam_search_op. test=develop * Fix the merging mistake. test=develop	6 years ago
jerrywgz	72ee3c6232	Merge pull request #15398 from jerrywgz/add_axis_for_boxcoder Add axis for boxcoder	6 years ago
jerrywgz	e402c0ec7d	test=develop	6 years ago
Kaipeng Deng	d3eeb92bba	Merge pull request #15491 from tink2123/new_align_corners add align_corners and align_mode for image_resize	6 years ago
jerrywgz	3046799ecd	Merge branch 'develop' into add_clip_op	6 years ago
dzhwinter	1a44b2fbe8	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	6 years ago
Jiabin Yang	2d0ffdc485	test=develop, fix debug mode unitest, hsigmoid (#15574 )	6 years ago
Zhaolong Xing	90ffe74954	Merge pull request #15546 from NHZlX/fix_trt_utest_random_failed fix trt models utest failed.	6 years ago
luotao1	8f0c2b07f2	use embedding=128 bert model for test test=develop	6 years ago
tensor-tang	2b0811c3fb	refine vadd jitkernel choice test=develop	6 years ago
tensor-tang	a18c0d4242	cache fc kernel test=develop	6 years ago
tensor-tang	6e1ee7fb57	cache softmax kernel func test=develop	6 years ago
Krzysztof Binias	69b7c595d6	Small fix test=develop	6 years ago
Krzysztof Binias	b1bdcd4de8	Make separate folders for mkldnn codes test=develop	6 years ago
dzhwinter	06f2448848	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	6 years ago
dzhwinter	8156fedf56	merge develop branch. test=develop	6 years ago
dengkaipeng	23d34d1f7e	move yolov3_loss to detection. test=develop	6 years ago
JiabinYang	16f64b43d4	test=develop, Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative	6 years ago
tensor-tang	c7449227e8	Merge pull request #15563 from tensor-tang/jit/softmax refine softmax kernel	6 years ago
dengkaipeng	733bb82ec0	downsample -> downsample_ratio. test=develop	6 years ago
dengkaipeng	ae0b0d5f93	fix doc. test=develop	6 years ago
dengkaipeng	56e21c558e	add comments and docs. test=develop	6 years ago
dengkaipeng	577424e5ec	use darknet loss and trick	6 years ago
dengkaipeng	042fecefab	use L2Loss. test=develop	6 years ago
dengkaipeng	af124dcdf6	fix API error	6 years ago
dengkaipeng	c945ffa7f8	fix label_smooth and mixup score	6 years ago
tink2123	2b89f59055	add attr use_label_smooth test=develop	6 years ago
dengkaipeng	8218e30176	add gtscore. test=develop	6 years ago
dengkaipeng	3c08f620c2	add label smooth. test=develop	6 years ago
dengkaipeng	cc01db6029	calc valid gt before loss calc. test=develop	6 years ago
dengkaipeng	32d533c2cd	cache obj_mask and gt_match_mask. test=develop	6 years ago
dengkaipeng	6c5a5d0789	format code. test=develop	6 years ago
dengkaipeng	e7e4f084e5	ignore pred overlap gt > 0.7. test=develop	6 years ago
dengkaipeng	bd6deb1a8b	fix API.spec change. test=develop	6 years ago
dengkaipeng	db8ff57a61	remove useless code and update doc. test=develop	6 years ago
dengkaipeng	577a92d992	use typename DeviceContext. test=develop	6 years ago
dengkaipeng	0c4acc8305	improve yolo loss implement. test=develop	6 years ago
dengkaipeng	2fbfef2ec9	fix no box expression. test=develop	6 years ago
dengkaipeng	c0fa8d2eec	use L1Loss for w, h. test=develop	6 years ago
dengkaipeng	3841983aa0	fix division error in mean process. test=develop	6 years ago
dengkaipeng	192d293854	use stable Sigmoid Cross Entropy implement. test=develop	6 years ago
Tao Luo	245b1f0579	Merge pull request #15570 from luotao1/bert fix compiler error, use len20 dataset for bert	6 years ago
tink2123	909f864a9b	remove unnecessary flags test=develop	6 years ago
JiabinYang	bb881199f2	test=develop, polish code and fix wrong change in /paddle/fluid/inference/utils/CMakeLists.txt	6 years ago
tink2123	6961a94e94	avoid out_size less than 1 test=develop	6 years ago
Jiabin Yang	075df09f86	Merge pull request #15470 from JiabinYang/feature/imperative Add simple RNN in imperative	6 years ago
Qiyang Min	b69996c2d3	Merge pull request #15558 from velconia/imperative_resnet Refine Batch Norm	6 years ago
luotao1	5504425eb3	fix compiler error, use len20 dataset for bert test=develop	6 years ago
Yan Chunwei	655179089f	AnalysisConfig remove contrib namespace (#15540 )	6 years ago
jerrywgz	7bc8481c62	Merge pull request #15418 from jerrywgz/refine_nms Refine nms	6 years ago
tensor-tang	d59f733551	refine softmax and use with cache test=develop	6 years ago
tensor-tang	7383eefd2d	add softmax mix and mkl code test=develop	6 years ago
tensor-tang	50945685f2	add hmax, hsum jitcode test=develop	6 years ago
tensor-tang	8117725852	add jit kernel hsum, hmax and softmax refer code test=develop	6 years ago
Tao Luo	67e4450c34	Merge pull request #15485 from luotao1/fc500110-bert_test add bert analyzer test	6 years ago
Qiyang Min	6000a6e76e	Merge pull request #15312 from velconia/add_pyramid_dnn_support Use malloc and free in JeMalloc	6 years ago
Jiabin Yang	fd286f3596	Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative test=develop, fix/multi_output_support_imperative	6 years ago
minqiyang	07822fef2c	Clear all parameters' gradient test=develop	6 years ago
Zeng Jinle	bf7dedcbc7	Merge pull request #15545 from sneaxiy/fix_debug_nccl_error Fix nccl unittest error in debug mode	6 years ago
minqiyang	49a7fba848	Polish code test=develop	6 years ago
minqiyang	159c407328	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet test=develop	6 years ago
minqiyang	5c7768776c	Fix batch_norm's stop_gradient bug test=develop	6 years ago
luotao1	e31aef9f6e	Merge branch 'develop' into fc500110-bert_test test=develop	6 years ago
WangZhen	c67b29c178	fix some bugs of graph.to_program and get_pass.	6 years ago
qingqing01	a6910f900e	Always create variables in analysis_predictor before OptimizeInferenceProgram. (#15533 ) Otherwise, some other persistable variable (like RAW type) will not be created	6 years ago
tink2123	e7eb08febe	fix api.spec test=develop	6 years ago
dzhwinter	ee3aae56cd	merge develop branch. test=develop	6 years ago
dzhwinter	d6d3e6afe2	add more skip strategy	6 years ago
Yan Chunwei	b62b756b28	add version support (#15469 )	6 years ago
Yan Chunwei	526790e652	infer get program (#15511 )	6 years ago
JiabinYang	2e309b11c2	test=develop, merge develop	6 years ago
tensor-tang	3c224e7e79	Merge pull request #15537 from baojun-nervana/rm_ngraph_operator rm ngraph_operator.cc test=develop	6 years ago
jerrywgz	aaf756272f	remove inplace arg, test=develop	6 years ago
jerrywgz	cee2e1b089	refine code, test=develop	6 years ago
Xin Pan	c11afdb5cb	Merge pull request #15516 from panyx0718/imperative3 imperative supports multi grad ops	6 years ago
sneaxiy	ba4f43fd62	fix compile error in distributed mode test=develop	6 years ago
tink2123	a0c63f1106	add align_flag test=develop	6 years ago
nhzlx	95b98f27ae	fix trt models utest failed. test=develop	6 years ago
Tao Luo	b919190232	Merge pull request #15531 from jczaja/prv-googlenet-fix Performance and functional fixes to LRN	6 years ago
JiabinYang	53d558cd41	test=develop, polish code and merge develop	6 years ago
Zhaolong Xing	97b76c94c4	Merge pull request #15242 from NHZlX/trt_int8_ultimate_version add trt int8 support	6 years ago
Jiabin Yang	10bc9ffc2d	Merge pull request #15518 from JiabinYang/fix/refine_error_message test=develop, refine_error_message for data type	6 years ago
Kaipeng Deng	aeca5c50b2	fix grid_sampler PADDLE_ENFORCE error. test=develop (#15542 )	6 years ago
乔龙飞 Qiao Longfei	5f89ce7fcd	Merge pull request #15536 from jacquesqiao/fix-prefetch-one-parameter Fix prefetch one parameter	6 years ago
Jacek Czaja	5885c5cdf6	- Added explanation to LRN MKL-DNN op on alpha modification test=develop	6 years ago
Jacek Czaja	4aa7ef3c13	- Compensation fix to LRN MKL-DNN op test=develop	6 years ago
dzhwinter	2739096eec	compatible with python side mem_opt	6 years ago
Qiao Longfei	806658d72b	add space after colon in comment test=develop	6 years ago
gongweibao	d303270a0e	revert test=develop (#15535 )	6 years ago
Tao Luo	8e2dea5787	Merge pull request #15538 from baojun-nervana/mv_ng_bridge_file move ngraph_bridge to ngraph directory	6 years ago
nhzlx	b43ea40c51	delete the usage of the const_cast test=develop	6 years ago
Yan Chunwei	e2818c8608	add dynamic memory optim (#15457 )	6 years ago
baojun-nervana	8e9308a51a	mv ngraph_bridge to ngraph directory test=develop	6 years ago
baojun-nervana	da3f9cc512	rm ngraph_operator.cc test=develop	6 years ago
WangZhen	c8095eeb82	add freeze pass, and UT is passed.	6 years ago
Qiao Longfei	4d13434443	fix a little problem test=develop	6 years ago
Qiao Longfei	9c3910f390	IncreaseBatchBarrier should be in the right condition test=develop	6 years ago
JiabinYang	a59b7ac73b	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative	6 years ago
JiabinYang	5639f49b16	test=develop, fix/multi_output_support_imperative	6 years ago
JiabinYang	ba981604fd	fix split	6 years ago
ruri	88bd7e1a61	Merge pull request #15027 from shippingwang/shufflechannel Add Shuffle Channel Operator	6 years ago
Jacek Czaja	fa286b1052	LRN reengineering Added reading dst mem pd from lrn pd coding style fixes test=develop	6 years ago
nhzlx	92cf4a4c6b	fix comments test=develop	6 years ago
liuwei1031	0016b6a2df	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop	6 years ago
liuwei1031	ab5442d9cd	Merge branch 'develop' of https://github.com/liuwei1031/Paddle into develop	6 years ago
tensor-tang	e043ea9653	Merge pull request #15515 from tensor-tang/jit/benchmark jit benchmark use tensor with alignment	6 years ago
Qiao Longfei	5a0c6593d5	revert RequestGetHandler	6 years ago
jerrywgz	d9b93962b0	test=develop	6 years ago
jerrywgz	466a10dcdd	refine code, test=develop	6 years ago
乔龙飞 Qiao Longfei	c58555067e	Merge pull request #14731 from jacquesqiao/optimize-cpp-reader Optimize cpp reader	6 years ago
jerrywgz	a39240c3b6	add attr variance for box coder, test=develop	6 years ago
Dun Liang	db9e700ba1	default use pin place && test=develop	6 years ago
gongweibao	d54494ba87	cleanup test=develop (#15347 )	6 years ago
Qiao Longfei	84220765a7	refine code, add more log	6 years ago
JiabinYang	3be8ffad2f	test=develop, polish code and merge conflict	6 years ago
nhzlx	36abc964df	fix pybind problem: add an enum to AnalysisConfig test=develop	6 years ago
JiabinYang	c52f57de5b	test=develop, refine_error_message for data type	6 years ago
Qiao Longfei	c750be6d9d	add some log	6 years ago
JiabinYang	1bf2facecb	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative	6 years ago
JiabinYang	e3a8929cf8	little change	6 years ago
gongweibao	fe8f28c957	Add GetVariableNoBarrier on brpc. (#15488 )	6 years ago
tangwei12	981fc2bdba	fix bug in merge_ids (#15503 ) * fix mistakes in merge_ids, test=develop	6 years ago
Zhaolong Xing	a7ba07d7ef	Merge pull request #15504 from NHZlX/fix_conv2d_fusion Add check: conv_fusion op runs with cudnn version > 7100 .	6 years ago
Xin Pan	42e61af861	polish test=develop	6 years ago
baojun	efce25673c	Adding ngraph_engine_op (#14948 ) * enable ngraph_engine_op test=develop * merge develop test=develop * avoid const_cast test=develop * rm ngraph_operator test=develop * Added TODO to move EnableNgraph test=develop * Add TODO to remove const_cast test=develop	6 years ago
chengduo	f8f91fb4b3	Revert conv transpose cudnn (#15514 ) * Revert "set constant for loss" This reverts commit 167933f678ccbb3563e949710279efe004a27731. * Revert "remove workspace_handle" test=develop This reverts commit b4aca8ede9e685bce1dfb1c59e63919f33432572.	6 years ago
tensor-tang	b67584a6e9	jit benchmark use tensor test=develop	6 years ago
Xin Pan	4d9feb35b9	support multi grad ops test=develop	6 years ago
Yiqun Liu	3008fa1261	Add the CUDA kernel for beam_search op (#15020 ) * Refine the beam_search op and test. * A basic CUDA implementation of beam_search for small batch_size. * Implement CUDA kernel for beam_search_op. * Use multiple CUDA threads in the same block to select the top beam. * Update the python api of beam_search op. * Enable extend function in CPU kernel of beam_search op. * Unify the CUDA codes. test=develop * Unify the CPU kernel of beam_search op. * Ensure the seletced items of beam_search_op's CPU kernel sorted by scores. * Update the description of beam_search in API.spec. * Enable the use of CUDA kernel in beam_search op. * Exclude the beam_search's CUDA unittest when there is no CUDA gpu, and delete some debuging statements. test=develop * Follow comments. test=develop * Call the CPU kernel for beam_search op when batch_size > 4. test=develop * Remove the except of is_empty op in PrepareData. test=develop	6 years ago
nhzlx	0779e35544	fix two bug: 1. graph and program_desc alignment 2. trt stream test=develop	6 years ago
Zeng Jinle	2480a3df7d	Merge pull request #15496 from sneaxiy/lazy_allocator2 Fix bug when user set CUDA_VISIBLE_DEVICES be empty and run CPU-only models	6 years ago
tink2123	78145c7dff	modified some comments test=develop	6 years ago
WangZhen	dde19a0ff8	add quantization freeze pass.	6 years ago
Zeng Jinle	dec89bd7ed	Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock Turn on remove_unnecessary_lock by default	6 years ago
nhzlx	027d24c831	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version	6 years ago

6644 Commits (4d80db838a679b0144f8569fa461fd07b0dc2295)
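
The message of commit 2bf63f4c33 above explains that mixing `size_t` and `int` in a subtraction produces an unsigned result. The sketch below illustrates that effect in isolation. It is not the code from memory_optimize_pass.cc: the helper names `wrapped_diff` and `abs_diff` are hypothetical, and comparing before subtracting is only one possible way to avoid the wrap-around, not necessarily the fix the commit applied.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper showing the problematic pattern: `b` is converted to
// std::size_t before the subtraction, so the result is unsigned and wraps
// around (modulo 2^N) whenever b > a instead of becoming negative.
inline std::size_t wrapped_diff(std::size_t a, int b) {
  return a - b;
}

// Hypothetical alternative: decide which operand is larger first, so the
// subtraction never goes below zero and no absolute-value call is needed.
inline std::size_t abs_diff(std::size_t a, std::size_t b) {
  return a > b ? a - b : b - a;
}
```

With these helpers, `abs_diff(3, 5)` yields 2, while `wrapped_diff(3, 5)` yields a very large value (the same value as `static_cast<std::size_t>(-2)`), which is why taking an absolute value after the subtraction cannot recover the intended distance.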