Paddle

Commit Graph

Author	SHA1	Message	Date
colourful-tree	f2d6473ef8	Merge branch 'develop' into new_develop	6 years ago
heqiaozhi	04f876f5bc	remove mkl & fix commit	6 years ago
dengkaipeng	373cfb0ccf	use kernel size in global_pooling. test=develop	6 years ago
dengkaipeng	60305196b8	fix spell mistakes. test=develop	6 years ago
Tao Luo	8a7efc78f1	Merge pull request #15882 from sfraczek/unique_ptr_dereference Change (smart_ptr.get()) -> smart_ptr	6 years ago
tensor-tang	a0c37662b9	enable sgd jitkernel refer code and test test=develop	6 years ago
xuezhong	1dad36f6aa	Merge pull request #15609 from xuezhong/add_sample_logits_op add sample_logits and sampled_softmax_with_cross_entropy op	6 years ago
Kaipeng Deng	9e524a7b51	Merge pull request #15870 from heavengate/fix_adaptive_pool_doc fix adaptive pool doc.test=develop	6 years ago
dengkaipeng	14df92fe8f	fix spell error. test=develop	6 years ago
dengkaipeng	144016fcfc	fix adaptive_pool and yolov3_loss. test=develop	6 years ago
Sylwester Fraczek	74672d1aff	Change (smart_ptr.get()) -> smart_ptr reason: dereferencing smart pointer is the same as the underlying pointer test=develop	6 years ago
tensor-tang	ee2321debd	Revert 15770 develop `a6910f900` gelu mkl opt (#15872 ) * Revert "Optimze Gelu with MKL Erf function (#15770)" This reverts commit `676995c86c`. * test=develop	6 years ago
xuezhong	81870723c6	Merge pull request #15605 from xuezhong/fix_bug_for_lstmp Fix bug for lstmp	6 years ago
dengkaipeng	eb65b4e47d	\frac -> \frac. test=develop	6 years ago
nhzlx	1d5ef7c9ee	5. add static trt load model 1). add static trt load model 2). fix bug: when device_id is not 0, the trt will have a bug test=develop	6 years ago
dengkaipeng	8167588f14	add blank after math::. test=develop	6 years ago
dengkaipeng	d9ec605873	use math:: instead of 29. test=develop	6 years ago
dengkaipeng	19292ac6a1	fix adaptive pool doc.test=develop	6 years ago
Yiqun Liu	7d96c74ab2	Initialize the benchmark tester for operator. (#15772 ) * Initialize the benchmark tester for operator. test=develop * Rearrange the codes. test=develop	6 years ago
Yihua Xu	676995c86c	Optimze Gelu with MKL Erf function (#15770 ) * Optimize for gelu operator * Set up the low accuracy mode of MKL ERF function. test=develop * Only enable MKLML ERF when OS is linux * Use the speical mklml version included vmsErf function to verify gelu mkl kernel. test=develop * Add the CUDA macro to avoid NVCC's compile issue. test=develop * Add the TODO comments for mklml library modification. test=develop * Clean Code test=develop * Add the comment of marco for NVCC compiler. test=develop	6 years ago
mozga-intel	5d132ecf83	Auto-cmake generator, auto-fill map (#15402 ) test=develop	6 years ago
Krzysztof Binias	1578c60bdd	Add new ut and remove unnecessary code test=develop	6 years ago
Xin Pan	5eb87506bc	add per kernel config and remove const_cast. test=develop	6 years ago
Dun	a83e470405	Profiler refine and add CUDA runtime api tracer (#15301 ) * refine profiler && add runtime tracer * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * fix bug && test=develop * add thread id map && test=develop * test=develop * testing * bug fix * remove cuda event && refine code && test=develop * test=develop * test=develop * test=develop * fix windows temp file && test=develop * test=develop * fix windows bug && test=develop * fix start up issue && test=develop * code polish && test=develop * remove unused code && test=develop * add some cupti cbid && test=develop * add FLAGS_multiple_of_cupti_buffer_size && test=develop * fix compile error && test=develop * add keyword && test=develop * fix && test=develop * code polish && test=develop	6 years ago
mozga-intel	13ec2d331b	Enable momentum operator for a ngraph engine (#15673 ) * Enable momentum operator for a ngraph engine test=develop * Update tests test=develop * Unnecessary line of the code as intended was removed test=develop	6 years ago
xuezhong	eb7bc3e7ea	remove non-ascii charactor test=develop	6 years ago
tensor-tang	e1c707fe9c	fix warnings (#15790 ) * fix warnings test=develop * fix enforce test test=develop	6 years ago
xuezhong	d328660304	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	6 years ago
xuezhong	f2262d7336	update comment test=develop	6 years ago
Tao Luo	6402424f7a	Merge pull request #15773 from chengduoZH/fix_shape_api_doc Fix shape api doc	6 years ago
xuezhong	d12252e6a6	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op test=develop	6 years ago
xuezhong	c5360a3f6b	refine code	6 years ago
tensor-tang	5aea2cd2e0	Merge pull request #15652 from tensor-tang/refine/pyramiddnn refine fused emb seq pool	6 years ago
mozga-intel	df23a6f894	Enable cross_entropy operator for a ngraph engine (#15674 ) * Enable cross_entropy operator for a ngraph engine test=develop * Update tests test=develop * Added PADDLE_ENFORCE for the batch_norm operator test=develop * Update the message about which format are supported right now test=develop	6 years ago
Yiqun Liu	56a5039e24	Correct the doc in Python API (#15725 ) * Correct the comment in control_flow.py. * Correct the argument list of ops. test=develop * Update API.spec. test=develop * Skip op_callstack attr for all op apis. test=develop * Remove use_mkldnn and is_test from python api. test=develop * Remove use_mkldnn and is_test from op_proto_maker and hard-coding them in python when generating doc string. test=develop	6 years ago
baojun	72061b0ac0	Add ngraph op coverage (#15721 )	6 years ago
chengduozh	d79d2f686c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_api_doc test=develop	6 years ago
xuezhong	4424021623	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op	6 years ago
nhzlx	2070fb246d	4. do the trt_engine optim during init. add simple static mode loading test=develop	6 years ago
Yihua Xu	685a20ef56	Add JIT CRF_decoding and Layer_norm unit-test (#15699 ) * Add the CRFDecoding and LayerNorm's test case test=develop * Fix the size checking issue test=develop * Remove the remnant code test=develop * Add TestAllImpls and double support test=develop * Clean Code test=develop * Add benchmark test for LayerNorm & CRFDecoding test=develop	6 years ago
tensor-tang	75fc792d40	fix when table width larger than 64 test=develop	6 years ago
tensor-tang	40402d5e68	add emb seqpool jitcode test=develop	6 years ago
tensor-tang	2ccbcb157d	Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn	6 years ago
chengduozh	3ce12b1b8e	fix shape api doc test=develop	6 years ago
Dun	5e6834d891	inplace group_norm (#15754 ) * inplace group * test=develop	6 years ago
Hongyu Liu	8c0292dead	Merge pull request #15717 from phlrain/fix_leak Fix lstm possible leak	6 years ago
Tao Luo	4da291c6a3	Merge pull request #15726 from qingqing01/fix_api_doc Fix row_conv doc	6 years ago
nhzlx	ecc12fb430	3. when runing in trt mode, do not allocate memory for parameters in fluid. test=develop	6 years ago
Dun	e4b9fcdbd2	More restrict check load_combine_op. (#15479 ) * fix && test=develop * fix && test=develop * test=develop	6 years ago
qingqing01	48a5cccbcd	Fix debug mode in prior_box_op (#15702 ) * Fix debug mode in prior_box_op * Refine code	6 years ago
Dang Qingqing	2868232556	Fix row_conv doc test=develop	6 years ago
tensor-tang	a3a3d3d861	add embseqpool jitkernel mkl impl and use it test=develop	6 years ago
tensor-tang	15da2f9a0d	add embseqpool jitkernel refer code, test and benchmark test=develop	6 years ago
tensor-tang	c2ccf14590	Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn	6 years ago
qingqing01	abcefe7211	Fix debug mode in fake_quantize_op (#15693 ) * Fix debug mode in fake_quantize_op * Remove template specialization	6 years ago
liuhongyu	029be5fda9	fix lstmp bug; test=develop	6 years ago
nhzlx	9cc6249cd6	2. TRTEngine using stream only when execute.	6 years ago
liuhongyu	393fa6021e	set lstm lstmp unsed pointer to nullptr; test=develop	6 years ago
liuhongyu	869f00ffc6	set lstm lstmp unsed pointer to null	6 years ago
nhzlx	034ba1c291	add static model load for trt 1. bind trt input and output to fluid tensors	6 years ago
jerrywgz	6f11f35abe	Merge pull request #15703 from jerrywgz/enhance_expand_op support multiple var types for expand op	6 years ago
Tao Luo	3086502522	Merge pull request #15704 from Sand3r-/mgallus/old-fc-mkldnn-branch-fix-develop Fix old FC backward weights descriptor creation	6 years ago
baojun	c47e258ea4	Add ngraph sum, sigmoid, relu_grad and tanh_grad op (#15642 ) * Added ngraph sum op test=develop * Added sigmoid, relu_grad and tanh_grad test=develop * remove duplicates test=develop	6 years ago
tensor-tang	33d0cebbff	Merge pull request #15695 from tensor-tang/fix/name fix jitcode name, use after free	6 years ago
Michal Gallus	7a8eff36a6	Fix old FC backward weights descriptor creation test=develop	6 years ago
chengduo	ad61e1b22c	fix potential bug (#15688 ) test=develop	6 years ago
dzhwinter	f9ac88e1a0	Merge pull request #15694 from liuwei1031/fix_security_issue Fix security issue	6 years ago
jerrywgz	8fc0fc314a	support multiple var types for expand op, test=develop	6 years ago
tensor-tang	fb2a7b2300	fix aligned-new error in jitkernel (#15626 ) * fix aligned-new error in jitkernel test=develop * override genbase new to fix mis-align test=develop	6 years ago
乔龙飞 Qiao Longfei	08ad72d0b9	Merge pull request #15679 from jacquesqiao/update-lookup_table_grad-padding-index lookup_table_grad kernel should consider padding_idx test=develop	6 years ago
Tao Luo	d9270e34d1	Merge pull request #15691 from luotao1/activation_doc fix generate doc error in activation ops	6 years ago
tensor-tang	15d7220f94	fix jitcode name test=develop	6 years ago
tensor-tang	31fd8ce1e1	Merge pull request #15375 from mozga-intel/mozga-intel/batch_norm_ngraph_operator Enable batch_norm operator for a ngraph engine	6 years ago
liuwei1031	b1f97a6fa9	fix security issue 27, 38 test=develop	6 years ago
Tao Luo	882e7ec480	fix generate doc error in activation ops test=develop	6 years ago
Gabor Buella	da9c94da33	Clang build fixes (#15628 ) * Remove some superfluous std::move calls The std:move triggered a build error (with -Werror): ``` [ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move] [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^ /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^~~~~~~~~~ ~ 1 error generated. ``` See: https://reviews.llvm.org/D7633 * Remove a superfluous lambda capture from framework/operator.h ``` [ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19: /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] [this](Variable* var) { return var; }); ^~~~ 1 error generated. ``` Changing it to `return it->second;`, as is in the function below. * Rethrow an exception (instead of copying it) ``` [ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move] throw exception; ^~~~~~~~~ /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying throw exception; ^~~~~~~~~ std::move(exception) ``` See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message. * Remove an unused variable ``` /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field] const Scope& scope_; ^ ``` * struct ComputationOpHandle -> class ComputationOpHandle ``` [ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21: /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags] class ComputationOpHandle; ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here struct ComputationOpHandle : public OpHandleBase { ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here? class ComputationOpHandle; ^~~~~ struct 1 error generated. ``` * Fix name() methods under fluid/operators ``` In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15: In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19: /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override] virtual const char* name() const = 0; ^ /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here virtual const char* name() const = 0; ^ ``` test=develop	6 years ago
Qiao Longfei	76c1378a70	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-lookup_table_grad-padding-index test=develop	6 years ago
Qiao Longfei	29a4b21bc8	fix problem test=develop	6 years ago
Qiao Longfei	7b673bce6a	lookup_table_grad kernel should consider padding_idx test=develop	6 years ago
jerrywgz	5ce48220f1	change default option related to softmax, test=develop	6 years ago
xuezhong	9b24ac34dd	remove debug print test=develop	6 years ago
xuezhong	50b48400bb	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
dzhwinter	b80bcbb4fd	Merge pull request #15660 from dzhwinter/enhance/memory add elementwise_xxx_grad for inplace optimize	6 years ago
mozga-intel	1198ccae6b	Enable batch_norm operator for a ngraph engine test=develop	6 years ago
xuezhong	58101e6d4d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
xuezhong	4921c2cd02	add api spec change test=develop	6 years ago
baojun	f4a0e68481	Fix ngraph compile WITH_DISTRIBUTE=ON (#15636 ) * fix compile issue with_distribute test=develop * simplified logic test=develop * use ngraph dependency test=develop * set cpu only test=develop * update test and eliminate fp16 test test=develop	6 years ago
xuezhong	fb261793b9	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op test=develop	6 years ago
xuezhong	fb9a6a2bc6	pass test for lstm op test=develop	6 years ago
xuezhong	1abb0d835e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
xuezhong	2ba256df40	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	6 years ago
dzhwinter	104d3b4e68	add details. test=develop	6 years ago
dzhwinter	3932cd6714	Merge remote-tracking branch 'origin/develop' into enhance/memory	6 years ago
Dun Liang	ceec13562c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_checkpoint test=develop	6 years ago
Dun Liang	bc92192747	Fix Pr #15296 test=develop	6 years ago
dzhwinter	94dd50c33f	add details. test=develop	6 years ago
Tao Luo	bec68fa0b3	Merge pull request #15637 from jerrywgz/refine_box_coder speed up box_coder in CPU	6 years ago
乔龙飞 Qiao Longfei	7ddf4e2c55	Merge pull request #15649 from jacquesqiao/fix-ctr-reader-svm fix ctr reader read svm data	6 years ago
Qiao Longfei	fa77186fdc	fix ctr_reader_test test=develop	6 years ago
Qiao Longfei	2afe82fe83	fix ctr reader read svm data test=develop	6 years ago
Kaipeng Deng	9df7bc2c5a	fix exlusive pool doc. test=develop (#15632 )	6 years ago
wopeizl	3614dadf23	Merge pull request #15631 from wopeizl/windows/fixci fix ci broken randomly and disable some warnings	6 years ago
tensor-tang	18bff5298d	extract fused_emb_seq_pool forward function test=develop	6 years ago
peizhilin	55510744b5	test=develop	6 years ago
peizhilin	061299be87	fix dependency test=develop	6 years ago
baojun	ac4cde009d	Enable accuracy op for ngraph engine (#15592 ) * Added accuracy ngraph op test=develop * fixed name type test=develop	6 years ago
kolinwei	6f0f8045f6	Revert "Async double buffered py reader"	6 years ago
jerrywgz	ceb412b0ae	speed up box coder in CPU, test=develop	6 years ago
Dun	db8fcf6b95	Merge pull request #15296 from cjld/async_double_buffered_py_reader Async double buffered py reader	6 years ago
xuezhong	dff7461ea8	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp test=develop	6 years ago
Zeng Jinle	1ccbc51218	Merge pull request #15619 from sneaxiy/debug Fix unittest failure in debug mode	6 years ago
Xin Pan	74bc55c2a6	Merge pull request #14975 from dzhwinter/ir_inplace_pass Ir inplace pass	6 years ago
xuezhong	2857dac260	add assert for clip and remove print	6 years ago
xuezhong	4028943125	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	6 years ago
Tao Luo	546eefae6f	Merge pull request #15591 from lidanqing-intel/lidanqing/density_prior_box optimize density_prior_box_op.h in detect model for cpu	6 years ago
sneaxiy	f26a1c9077	test=develop	6 years ago
dzhwinter	4f01de6378	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	6 years ago
dzhwinter	46a6cac91f	fix batch norm. test=develop (#15597 )	6 years ago
dzhwinter	9c9ad7d40b	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass test=develop	6 years ago
jerrywgz	1743d1a58f	Merge pull request #15356 from jerrywgz/add_clip_op Add box clip op	6 years ago
mozga-intel	43a67a2662	Enable conv2d operator for a ngraph engine (#15269 ) test=develop	6 years ago
tensor-tang	a6a1a92ef7	Merge pull request #15586 from tensor-tang/jit/cache refine bert	6 years ago
lidanqing-intel	4b3c6612a1	optimize density_prior_box_op.h for cpu test=develop	6 years ago
xuezhong	4c98c2ccc3	remove debug print	6 years ago
xuezhong	58ad40cc15	add sample_logits op	6 years ago
tensor-tang	ba02ac4692	use mat attr and refine test (#15448 ) * use mat attr and refine test test=develop * add matmul jitcode test=develop * fix mac compile test=develop	6 years ago
xuezhong	b0c75f1763	remove debug print	6 years ago
xuezhong	880836329d	add cell clip and proj clip, fix bug for h0	6 years ago
jerrywgz	4eb44380a6	Merge branch 'develop' into add_clip_op	6 years ago
Xin Pan	30cc8b7a92	Merge pull request #15554 from heavengate/yolo_loss_darknet Yolo loss darknet	6 years ago
mozga-intel	312500dcb5	Enable pool2d operator for a ngraph engine (#15395 ) * Enable pool2d operator for a ngraph engine test=develop * Update test=develop	6 years ago
Tao Luo	ea92905be4	Merge pull request #15478 from kbinias/kbinias/seperate-folders-for-mkldnn Make separate folders for mkldnn codes	6 years ago
Yibing Liu	170842cbb4	Some improvements to support bert mixed precision training (#15585 ) * Some improvements to support bert mixed precision training test=develop * Revert the cast in layer_norm test=develop	6 years ago
Yiqun Liu	16d54f7f23	Return parent_idx in beam_search op (#15520 ) * Refine beam_search_op to output an extra parent_idx tensor. test=develop * Fix the unittest test_beam_search_op. test=develop * Fix the merging mistake. test=develop	6 years ago
jerrywgz	72ee3c6232	Merge pull request #15398 from jerrywgz/add_axis_for_boxcoder Add axis for boxcoder	6 years ago
jerrywgz	e402c0ec7d	test=develop	6 years ago
Kaipeng Deng	d3eeb92bba	Merge pull request #15491 from tink2123/new_align_corners add align_corners and align_mode for image_resize	6 years ago
jerrywgz	3046799ecd	Merge branch 'develop' into add_clip_op	6 years ago
dzhwinter	1a44b2fbe8	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	6 years ago
Jiabin Yang	2d0ffdc485	test=develop, fix debug mode unitest, hsigmoid (#15574 )	6 years ago
tensor-tang	2b0811c3fb	refine vadd jitkernel choice test=develop	6 years ago
tensor-tang	a18c0d4242	cache fc kernel test=develop	6 years ago
tensor-tang	6e1ee7fb57	cache softmax kernel func test=develop	6 years ago
Krzysztof Binias	69b7c595d6	Small fix test=develop	6 years ago
Krzysztof Binias	b1bdcd4de8	Make separate folders for mkldnn codes test=develop	6 years ago
dzhwinter	06f2448848	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	6 years ago
dengkaipeng	23d34d1f7e	move yolov3_loss to detection. test=develop	6 years ago
tensor-tang	c7449227e8	Merge pull request #15563 from tensor-tang/jit/softmax refine softmax kernel	6 years ago
dengkaipeng	733bb82ec0	downsample -> downsample_ratio. test=develop	6 years ago
dengkaipeng	ae0b0d5f93	fix doc. test=develop	6 years ago
dengkaipeng	56e21c558e	add comments and docs. test=develop	6 years ago
dengkaipeng	577424e5ec	use darknet loss and trick	6 years ago
dengkaipeng	042fecefab	use L2Loss. test=develop	6 years ago
dengkaipeng	af124dcdf6	fix API error	6 years ago
dengkaipeng	c945ffa7f8	fix label_smooth and mixup score	6 years ago
tink2123	2b89f59055	add attr use_label_smooth test=develop	6 years ago
dengkaipeng	8218e30176	add gtscore. test=develop	6 years ago
dengkaipeng	3c08f620c2	add label smooth. test=develop	6 years ago
dengkaipeng	cc01db6029	calc valid gt before loss calc. test=develop	6 years ago
dengkaipeng	32d533c2cd	cache obj_mask and gt_match_mask. test=develop	6 years ago
dengkaipeng	6c5a5d0789	format code. test=develop	6 years ago
dengkaipeng	e7e4f084e5	ignore pred overlap gt > 0.7. test=develop	6 years ago
dengkaipeng	db8ff57a61	remove useless code and update doc. test=develop	6 years ago
dengkaipeng	577a92d992	use typename DeviceContext. test=develop	6 years ago
dengkaipeng	0c4acc8305	imporve yolo loss implement. test=develop	6 years ago
dengkaipeng	2fbfef2ec9	fix no box expression. test=develop	6 years ago
dengkaipeng	c0fa8d2eec	use L1Loss for w, h. test=develop	6 years ago
dengkaipeng	3841983aa0	fix division error in mean process. test=develop	6 years ago
dengkaipeng	192d293854	use stable Sigmoid Cross Entropy implement. test=develop	6 years ago
tink2123	909f864a9b	remove unnecessary flags test=develop	6 years ago
tink2123	6961a94e94	avoid out_size less than 1 test=develop	6 years ago
jerrywgz	7bc8481c62	Merge pull request #15418 from jerrywgz/refine_nms Refine nms	6 years ago
tensor-tang	d59f733551	refine softmax and use with cache test=develop	6 years ago
tensor-tang	7383eefd2d	add softmax mix and mkl code test=develop	6 years ago
tensor-tang	50945685f2	add hmax, hsum jitcode test=develop	6 years ago
tensor-tang	8117725852	add jit kernel hsum, hmax and softmax refer code test=develop	6 years ago
Zeng Jinle	bf7dedcbc7	Merge pull request #15545 from sneaxiy/fix_debug_nccl_error Fix nccl unittest error in debug mode	6 years ago
dzhwinter	ee3aae56cd	merge develop branch. test=develop	6 years ago
jerrywgz	cee2e1b089	refine code, test=develop	6 years ago
sneaxiy	ba4f43fd62	fix compile error in distributed mode test=develop	6 years ago
tink2123	a0c63f1106	add align_flag test=develop	6 years ago
Tao Luo	b919190232	Merge pull request #15531 from jczaja/prv-googlenet-fix Performance and functional fixes to LRN	6 years ago
Zhaolong Xing	97b76c94c4	Merge pull request #15242 from NHZlX/trt_int8_ultimate_version add trt int8 support	6 years ago
Kaipeng Deng	aeca5c50b2	fix grid_sampler PADDLE_ENFORCE error. test=develop (#15542 )	6 years ago
乔龙飞 Qiao Longfei	5f89ce7fcd	Merge pull request #15536 from jacquesqiao/fix-prefetch-one-parameter Fix prefetch one parameter	6 years ago
Jacek Czaja	5885c5cdf6	- Added explanation to LRN MKL-DNN op on alpha modification test=develop	6 years ago
Jacek Czaja	4aa7ef3c13	- Compensation fix to LRN MKL-DNN op test=develop	6 years ago
Qiao Longfei	806658d72b	add space after colon in commnet test=develop	6 years ago
nhzlx	b43ea40c51	delete the usage of the const_cast test=develop	6 years ago
baojun-nervana	8e9308a51a	mv ngraph_bridge to ngraph directory test=develop	6 years ago
Qiao Longfei	4d13434443	fix a little problem test=develop	6 years ago
Qiao Longfei	9c3910f390	IncreaseBatchBarrier should be in the right condition test=develop	6 years ago
ruri	88bd7e1a61	Merge pull request #15027 from shippingwang/shufflechannel Add Shuffle Channel Operator	6 years ago
Jacek Czaja	fa286b1052	LRN reengineering Added reading dst mem pd from lrn pd coding style fixes test=develop	6 years ago
nhzlx	92cf4a4c6b	fix comments test=develop	6 years ago
tensor-tang	e043ea9653	Merge pull request #15515 from tensor-tang/jit/benchmark jit benchmark use tensor with alignment	6 years ago
Qiao Longfei	5a0c6593d5	revert RequestGetHandler	6 years ago
jerrywgz	466a10dcdd	refine code, test=develop	6 years ago
乔龙飞 Qiao Longfei	c58555067e	Merge pull request #14731 from jacquesqiao/optimize-cpp-reader Optimize cpp reader	6 years ago
jerrywgz	a39240c3b6	add attr variance for box coder, test=develop	6 years ago
gongweibao	d54494ba87	cleanup test=develop (#15347 )	6 years ago
Qiao Longfei	84220765a7	refine code, add more log	6 years ago
Qiao Longfei	c750be6d9d	add some log	6 years ago
gongweibao	fe8f28c957	Add GetVariableNoBarrier on brpc. (#15488 )	6 years ago
tangwei12	981fc2bdba	fix bug in merge_ids (#15503 ) * fix mistakes in merge_ids, test=develop	6 years ago
baojun	efce25673c	Adding ngraph_engine_op (#14948 ) * enable ngraph_engine_op test=develop * merge develop test=develop * avoid const_cast test=develop * rm ngraph_operator test=develop * Added TODO to move EnableNgraph test=develop * Add TODO to remove const_cast test=develop	6 years ago
chengduo	f8f91fb4b3	Revert conv transpose cudnn (#15514 ) * Revert "set constant for loss" This reverts commit 167933f678ccbb3563e949710279efe004a27731. * Revert "remove workspace_handle" test=develop This reverts commit b4aca8ede9e685bce1dfb1c59e63919f33432572.	6 years ago
tensor-tang	b67584a6e9	jit benchmark use tensor test=develop	6 years ago
Yiqun Liu	3008fa1261	Add the CUDA kernel for beam_search op (#15020 ) * Refine the beam_search op and test. * A basic CUDA implementation of beam_search for small batch_size. * Implement CUDA kernel for beam_search_op. * Use multiple CUDA threads in the same block to select the top beam. * Update the python api of beam_search op. * Enable extend function in CPU kernel of beam_search op. * Unify the CUDA codes. test=develop * Unify the CPU kernel of beam_search op. * Ensure the seletced items of beam_search_op's CPU kernel sorted by scores. * Update the description of beam_search in API.spec. * Enable the use of CUDA kernel in beam_search op. * Exclude the beam_search's CUDA unittest when there is no CUDA gpu, and delete some debuging statements. test=develop * Follow comments. test=develop * Call the CPU kernel for beam_search op when batch_size > 4. test=develop * Remove the except of is_empty op in PrepareData. test=develop	6 years ago
tink2123	78145c7dff	modified some comments test=develop	6 years ago
nhzlx	027d24c831	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version	6 years ago
chengduo	bf91d11ed5	Clean elementwise_op_function (#15502 ) test=develop	6 years ago
tangwei12	5cfc40dea8	nce add check sample lables, test=develop (#15463 ) * nce add check sample lables, test=develop	6 years ago
tink2123	e448bdb298	modified some comments test=develop	6 years ago
tink2123	88744e4ab8	fixed some errors test=develop	6 years ago
jerrywgz	9eb2d7b3e1	refine code, test=develop	6 years ago
jerrywgz	6dfd789bfc	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_nms	6 years ago
jerrywgz	6928f8318f	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_axis_for_boxcoder	6 years ago
jerrywgz	e60c8438fc	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_clip_op	6 years ago
tink2123	48cc484643	add align_corners and align_mode for image_resize test=develop	6 years ago
jerrywgz	11f1baa406	refine code, test=develop	6 years ago
Zhaolong Xing	b7b68f2a8c	Merge pull request #15461 from NHZlX/fix_trt_stream_bug fix trt stream bug.	6 years ago
tangwei12	8b50ad80ff	checkpoint at distributed training (#14854 ) checkpoint for distributed training.	6 years ago
jerrywgz	57e5f61ec8	add gpu kernel, test=develop	6 years ago
jerrywgz	cc53453057	add comment and refine code, test=develop	6 years ago
qingqing01	07dc5a1506	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 ) * Add generate_mask_labels_op to support Mask-RCNN. * Refine sigmoid_cross_entropy to support nomalize mode. * Fix generator_proposals_label. * Use DeviceTemporaryAllocator in roi_pool and roi_algin. * Remove shape check in data_feeder.	6 years ago
Yiqun Liu	eaad3e4c3d	Add check of input in sequence_expand op. (#15466 ) * Add check of input in sequence_expand op. test=develop * Correct the unittest of sequence_expand op. test=develop	6 years ago
gongweibao	f4dec5cdee	Check collective server's data. (#15449 )	6 years ago
jerrywgz	c12a969bd4	refine comment and unittest, test=develop	6 years ago
chengduo	5a8bd82c0c	Remove workspace_handle (#15376 ) * remove workspace_handle test=develop * set constant for loss test=develop	6 years ago
jerrywgz	1c558ad388	add gpu kernel for box clip, test=develop	6 years ago
nhzlx	5b92ddabe2	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug test=develop	6 years ago
nhzlx	2f4aee361a	fix comments test=develop	6 years ago
nhzlx	ec213730bc	fix trt stream bug. BUG: After continuing to input different data, the output cannot be aligned test=develop	6 years ago
wopeizl	a8aa79130b	Merge pull request #15453 from wopeizl/fix15313 fix pr 15313	6 years ago
gongweibao	7f8b40f68d	Fix brpc complation error. (#15451 )	6 years ago
jerrywgz	0d4b60ab8b	add lod for slice op, test=develop	6 years ago
dzhwinter	8f3b252392	squash commits. test=develop	6 years ago
peizhilin	e6a3a3a31a	fix pr 15313 test=develop	6 years ago
jerrywgz	66bb5dd760	refine infer shape, test=develop	6 years ago
tensor-tang	266e625d2e	Merge pull request #15399 from tensor-tang/refine/seqpool/fc fix cpu jitkernel test and refine benchmark test	6 years ago
Qiao Longfei	45578c1b48	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader	6 years ago
Yan Chunwei	885c4e57ab	fea/infer memory optim2 (#14953 )	6 years ago
jerrywgz	0d91507859	fix share lod, test=develop	6 years ago
Tao Luo	6597ccb01f	Merge pull request #15413 from luotao1/legacy_code remove legacy code	6 years ago
Dun	9f8f0fc2d3	Memory optimization of depthwise conv op and group norm op (#15313 ) * mem opt * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine with cub test=develop * fix mkldnn test && remove comments && test=develop * polish code && test=develop * add only_forward test && test=develop	6 years ago
jerrywgz	5246285e34	test=develop	6 years ago
jerrywgz	b10d84bc5a	fix bug when run on GPU, test=develop	6 years ago
whs	530869f829	Share LoD from Input(Rois). (#15420 ) test=develop	6 years ago
gongweibao	7ab4af2716	Fix brpc compilation. (#15417 )	6 years ago

... 3 4 5 6 7 ...

3805 Commits (084310f536e0849ad04d8391a5563f438ddf69a2)