Paddle

Commit Graph

Author	SHA1	Message	Date
jerrywgz	9eb2d7b3e1	refine code, test=develop	6 years ago
nhzlx	484b3bc801	When cudnn version < 7100, there is problem with conv_fusion. Add check for it. test=develop	6 years ago
jerrywgz	6dfd789bfc	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_nms	6 years ago
jerrywgz	6928f8318f	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_axis_for_boxcoder	6 years ago
tensor-tang	af07118dd7	Merge pull request #15486 from tensor-tang/fix/pass/debug fix debug compile issue of analysis pass	6 years ago
liuwei1031	5d026a881a	Gpu memory monitoring (#15436) * fix github issue 15267 test=develop * fix github issue 15267 test=develop * monitor the GPU usage during runtime * revert allocator_facade.cc change * comments update test=develop	6 years ago
Xin Pan	58cb18d9d9	Merge pull request #15322 from velconia/imperative_resnet Imperative Resnet	6 years ago
sneaxiy	51227bd447	lazy_allocator test=develop	6 years ago
tink2123	48cc484643	add align_corners and align_mode for image_resize test=develop	6 years ago
minqiyang	ac80273686	Change definitions to PADDLE_WITH_JEMALLOC	6 years ago
minqiyang	c8965dc1ab	Polish code test=develop	6 years ago
tensor-tang	5c68dee798	fix debug compile of analysis pass fail test=develop	6 years ago
乔龙飞 Qiao Longfei	d243e555eb	Merge pull request #15080 from jacquesqiao/optimize-assign Optimize assign	6 years ago
Zhaolong Xing	b7b68f2a8c	Merge pull request #15461 from NHZlX/fix_trt_stream_bug fix trt stream bug.	6 years ago
luotao1	353b5f06a7	refine analyzer_bert_test to pass the ci test=develop	6 years ago
tangwei12	8b50ad80ff	checkpoint at distributed training (#14854) checkpoint for distributed training.	6 years ago
luotao1	cc618934c0	Merge branch 'bert_test' of https://github.com/fc500110/Paddle into fc500110-bert_test	6 years ago
jerrywgz	cc53453057	add comment and refine code, test=develop	6 years ago
nhzlx	e6218c1d7b	change the input to a smaller value test=develop	6 years ago
qingqing01	07dc5a1506	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) * Add generate_mask_labels_op to support Mask-RCNN. * Refine sigmoid_cross_entropy to support nomalize mode. * Fix generator_proposals_label. * Use DeviceTemporaryAllocator in roi_pool and roi_algin. * Remove shape check in data_feeder.	6 years ago
Qiao Longfei	6833ec06dc	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-assign test=develop	6 years ago
Yiqun Liu	eaad3e4c3d	Add check of input in sequence_expand op. (#15466) * Add check of input in sequence_expand op. test=develop * Correct the unittest of sequence_expand op. test=develop	6 years ago
sneaxiy	ef788603d4	merge develop test=develop	6 years ago
gongweibao	f4dec5cdee	Check collective server's data. (#15449)	6 years ago
Zhen Wang	58727e8e6d	Merge pull request #15455 from wzzju/graph_quantization Graph quantization pass. TODO(Add public API comments.)	6 years ago
jerrywgz	f44b1507f0	revised API spec, test=develop	6 years ago
fuchang01	4a33a44f45	analyzer bert tester	6 years ago
Tao Luo	fef3fd6d62	Merge pull request #15452 from luotao1/legacy_option remove legacy compiler option	6 years ago
Paddle CI	289aba750a	Polish code test=develop	6 years ago
jerrywgz	c12a969bd4	refine comment and unittest, test=develop	6 years ago
chengduo	5a8bd82c0c	Remove workspace_handle (#15376) * remove workspace_handle test=develop * set constant for loss test=develop	6 years ago
JiabinYang	266e0b63cd	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative simple rnn	6 years ago
JiabinYang	e686818aed	simple RNN	6 years ago
WangZhen	4e91d8d291	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization test=develop	6 years ago
nhzlx	5b92ddabe2	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug test=develop	6 years ago
nhzlx	2f4aee361a	fix comments test=develop	6 years ago
WangZhen	c6f99a1645	Update API.spec. test=develop	6 years ago
WangZhen	b913463e83	Update according to the reviewers' suggestion. test=develop	6 years ago
sneaxiy	d8568acd19	turn on remove_unnecessary_lock test=develop	6 years ago
Qiao Longfei	a71f7ed787	update API.spec test=develop	6 years ago
nhzlx	ec213730bc	fix trt stream bug. BUG: After continuing to input different data, the output cannot be aligned test=develop	6 years ago
wopeizl	a8aa79130b	Merge pull request #15453 from wopeizl/fix15313 fix pr 15313	6 years ago
gongweibao	7f8b40f68d	Fix brpc complation error. (#15451)	6 years ago
WangZhen	3ce6172052	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization	6 years ago
WangZhen	787c5e714c	Update the API.spec. test=develop.	6 years ago
WangZhen	59e5cc51d6	Add quantization transform pass and UT.	6 years ago
flame	d60751fb71	add python inference api (#15248) add python inference api	6 years ago
jerrywgz	0d4b60ab8b	add lod for slice op, test=develop	6 years ago
peizhilin	e6a3a3a31a	fix pr 15313 test=develop	6 years ago
Qiao Longfei	9449844c2a	update ctr_reader in API.spec test=develop	6 years ago
Tao Luo	cf29ea1592	remove legacy ANDROID option	6 years ago
jerrywgz	66bb5dd760	refine infer shape, test=develop	6 years ago
tensor-tang	266e625d2e	Merge pull request #15399 from tensor-tang/refine/seqpool/fc fix cpu jitkernel test and refine benchmark test	6 years ago
Qiao Longfei	45578c1b48	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader	6 years ago
jerrywgz	7d0c5fafa9	add API spec, test=develop	6 years ago
Yan Chunwei	885c4e57ab	fea/infer memory optim2 (#14953)	6 years ago
jerrywgz	0d91507859	fix share lod, test=develop	6 years ago
minqiyang	a21f4e38c3	Polish code test=develop	6 years ago
minqiyang	8ce198b2e1	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet test=develop	6 years ago
minqiyang	31a1cd8ce5	Align the first batch of gpu resnet	6 years ago
Tao Luo	6597ccb01f	Merge pull request #15413 from luotao1/legacy_code remove legacy code	6 years ago
Dun	9f8f0fc2d3	Memory optimization of depthwise conv op and group norm op (#15313) * mem opt * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine with cub test=develop * fix mkldnn test && remove comments && test=develop * polish code && test=develop * add only_forward test && test=develop	6 years ago
whs	530869f829	Share LoD from Input(Rois). (#15420) test=develop	6 years ago
gongweibao	7ab4af2716	Fix brpc compilation. (#15417)	6 years ago
Xin Pan	9a9c690e71	Merge pull request #15343 from panyx0718/imperative3 add a GAN model in imperative mode	6 years ago
WangZhen	e2ff300b02	add UT for quantization.	7 years ago
WangZhen	451896fce4	init quantization.	7 years ago
tensor-tang	316e44b1b7	fix unused warnings test=develop	7 years ago
Wu Yi	7e651a38dd	fix mac cmake version 3.13 build (#15386) * fix mac cmake version 3.13 test=develop * fix again test=develop	7 years ago
jerrywgz	b62a17bbae	add nms api	7 years ago
tensor-tang	579d758254	fix jitkernel tests and refine benchmark test=develop	7 years ago
jerrywgz	f660553d77	enhance nms for mask rcnn, test=develop	7 years ago
shippingwang	14f2a1060d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel	7 years ago
jerrywgz	88ee56d0b2	enhance nms for mask rcnn	7 years ago
zhaozhehao	e2ba9668b4	Tree conv op (#15217) * refactor tree2col operator with new memory mechanism test=develop * test=develop * test=develop * Modified API according to panyx0718 test=develop * fix API change according to heavengate test=develop * Modify API comment test=develop	7 years ago
Tao Luo	3ede8b67e6	update CMakeLists.txt	7 years ago
Tao Luo	8f522c15ed	Merge pull request #15408 from luotao1/mm_dnn test_analyzer_mm_dnn runs in serial	7 years ago
Tao Luo	001827c270	test_analyzer_mm_dnn runs in serial test=develop	7 years ago
Tao Luo	140fc1e92c	Merge pull request #15392 from luotao1/pyramid_dnn add pyramid_dnn c++ inference test	7 years ago
Yan Chunwei	c9e5aa19c1	get tensor API add more comments (#15345)	7 years ago
Yiqun Liu	f413b6892b	Revert the modification of while_op in #14764. (#15372) * Revert the modification of while_op in #14764. test=develop * Remove the dependency of GRPC_DEPS. test=develop	7 years ago
jerrywgz	ab9d6a4f39	add comments, test=develop	7 years ago
jerrywgz	10dd3b37ad	add axis for box coder op	7 years ago
Yan Chunwei	e84234b551	make clone thread safe (#15363)	7 years ago
乔龙飞 Qiao Longfei	adba4384ec	Merge pull request #15161 from jacquesqiao/gru-add-mode gru add origin mode	7 years ago
gongweibao	7cd4dd7ce4	Hide varhandle members. (#15382)	7 years ago
Tao Luo	668563088e	add pyramid_dnn c++ inference test test=develop	7 years ago
Zhaolong Xing	236201c222	Merge pull request #15350 from NHZlX/fix_bug_for_precditor fix analysis config bug	7 years ago
nhzlx	8817841c73	fix unit test bug test=develop	7 years ago
Yan Chunwei	e07900d317	cache tensor ptr in ZeroCopyTensor (#15352)	7 years ago
Yan Chunwei	b7916440ff	hot fix the Native clone (#15344)	7 years ago
minqiyang	dbd4d058af	Add static implementation and fix fc layer	7 years ago
Xin Pan	3ecf6bb338	Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix Fix the exception when tensor format is x	7 years ago
Xin Pan	e395f2c6a3	polish codes test=develop	7 years ago
nhzlx	b95f2ff8fe	fix win build bug test=develop	7 years ago
nhzlx	b938324381	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version test=develop	7 years ago
nhzlx	312fe0ece1	add trt int8 calibration support fix comments test=develop	7 years ago
wopeizl	994e73f685	Merge pull request #15351 from wopeizl/fixbuildissue disable the parallel mode for adam op on windows test=develop	7 years ago
minqiyang	315b133e67	Add single GPU support to imperative	7 years ago
Yiqun Liu	568cc2ffa8	Optimize while_op for test (#14764) * Simplify the compare op for CPU. * Use asynchronous tensor copy in reshape_op's kernel. * Optimize while_op for test, avoiding creating variables every time. test=develop * Enable the cache of kernel type and kernel function. test=develop * Enable profiling with gperftools. * Remove flags for testing, and fix the linking error. test=develop * Delete the codes of ChooseKernel. test=develop * Fix bug when preparing ExecutorPrepareContext for while_op. * Fix missing depending on grpc libraries. * Remove the redundant print. test=develop * Follow comments. * Remove the codes related to prepare the ExecutorPrepareContext for while_op. test=develop	7 years ago
tensor-tang	3759c1db8c	Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph Enable element_wise_add operator for a ngraph engine	7 years ago
tensor-tang	904a39239d	Merge pull request #15254 from mozga-intel/mozga-intel/softmax_operator_ngraph Enable softmax operator for a ngraph engine	7 years ago
nhzlx	e61a1b9514	merge develop test=develop	7 years ago
peizhilin	cd562f8fb7	disable the parallel mode for adam op on windows test=develop	7 years ago
nhzlx	b2ba3471fd	fix analysis config bug.	7 years ago
Xin Pan	01dc15ce32	Merge pull request #15329 from panyx0718/imperative2 add imperative mode design	7 years ago
Xin Pan	16cb3ebd68	Merge pull request #15268 from xiaolil1/pool-int8 Enhance key generation for Pool INT8 test	7 years ago
Xin Pan	9a4314f025	imperative gan test=develop	7 years ago
tensor-tang	a7fc3d42a0	Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub Fuse/second order mul sub and fuse repeated fc relu	7 years ago
bingyanghuang	a152a5c731	Disable conv3d mkldnn in dam (#15335) * disable conv3d mkldnn in dam * Add some comments test=develop	7 years ago
Xin Pan	73093656b8	Merge pull request #15331 from panyx0718/api expose CompiledProgram	7 years ago
Xin Pan	2db6e3ed2a	Merge pull request #15292 from panyx0718/imperative polish imperative codes	7 years ago
乔龙飞 Qiao Longfei	b14d4cdd75	Merge pull request #14890 from jacquesqiao/multithread-sparse-adam adam support multithread	7 years ago
Xin Pan	6b762f6519	add doc test=develop	7 years ago
Xin Pan	d7b159355c	add more doc test=develop	7 years ago
mozga-intel	cba729404d	Enable softmax operator for a ngraph engine test=develop	7 years ago
Qiao Longfei	cd31b90a46	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader test=develop	7 years ago
wopeizl	0fbb76f66b	Merge pull request #15204 from wopeizl/debug/support add the python callstack for debug support test=develop	7 years ago
Xin Pan	24bb6a6aec	expose CompiledProgram test=develop	7 years ago
Xin Pan	783dbe9abb	more doc test=develop	7 years ago
Xin Pan	f997109bb1	polish	7 years ago
Xin Pan	c1fdacd4b4	add imperative mode design test=develop	7 years ago
Qiao Longfei	8c516a24e5	remote min_row_size_to_use_multithread in adam interface test=develop	7 years ago
Tao Luo	9497d43921	Merge pull request #15307 from luotao1/trace_deps fix imperative compile when WITH_PYTHON=OFF	7 years ago
tensor-tang	1a95cd227d	disable seqpool test on mac or without mkl test=develop	7 years ago
Qiao Longfei	9b4fe283e1	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam test=develop	7 years ago
tensor-tang	0b6447a482	Merge pull request #15310 from luotao1/ZeroCopy_omp fix multi-threads in ZeroCopyProfile	7 years ago
peizhilin	5e450833bd	test=develop	7 years ago
Qiyang Min	3f687765e6	Merge pull request #15281 from velconia/fix_expand_op_compile_time Fix expand op compile time bug	7 years ago
peizhilin	eea75a1d93	fix issue when type is invalid test=develop	7 years ago
peizhilin	9adb158e5b	Merge remote-tracking branch 'upstream/develop' into debug/support	7 years ago
minqiyang	29ceb93126	Use malloc and free in JeMalloc test=develop	7 years ago
Tao Luo	2411ed4286	fix multi-threads in ZeroCopyProfile test=develop	7 years ago
minqiyang	c4cf5967db	Change backward op infershape test=develop	7 years ago
tensor-tang	84b0ecdcce	Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub test=develop	7 years ago
tensor-tang	7035f051a8	adjust acc on mac	7 years ago
luotao1	346561a37f	fix imperative compile when WITH_PYTHON=OFF test=develop	7 years ago
Xin Pan	b29eca3b71	code style test=develop	7 years ago
Xin Pan	7bc67c31e5	polish more test=develop	7 years ago
Xin Pan	0c04cac484	polish test=develop	7 years ago
Xin Pan	47ef2df01a	polish test=develop	7 years ago
Xin Pan	0d5819eb4f	polish imperative codes test=develop	7 years ago
Tao Luo	e33427da0d	Merge pull request #15280 from luotao1/random_test fix CompareDeterministic error when test_all_data	7 years ago
chengduo	46d01d798e	Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290) test=develop This reverts commit `358e657f68`.	7 years ago
Qiao Longfei	4d15515c40	fix gru_gpu_kernel test=develop	7 years ago
tensor-tang	93e75c5ae5	refine jitcode of vsub and vsquare test=develop	7 years ago
tensor-tang	d618e48309	fix fuse square mat order and refine test test=develop	7 years ago
tensor-tang	a5d2a6d1ad	add fuse pass of sequared mat sub fusion	7 years ago
tensor-tang	531f4a1578	Merge branch 'fuse/repeatedfcrelu' into fuse/second_order_mul_sub	7 years ago
tensor-tang	84e023eae5	adjust the acc since the refer result is too large test=develop	7 years ago

6272 Commits (3d0ecab41bc62585d52816251098a78b5c65d217)