Paddle

Commit Graph

Author	SHA1	Message	Date
qiaolongfei	2b9ff39f5f	fix the default value prefetch_var_name_to_block_id	7 years ago
qingqing01	19fd071785	Make the normalization operator more general and fix bug in l2_normalize. (#11348) * Add normalization operator. 1. Refine the raw norm_op and let it more general to support to normalize Tensor along any axis. 2. There is a bug in l2_normalize API, which lacks sqrt after `reduce_sum`. 3. Use norm_op to refine the l2_normalize API. 4. Fix bug in test_normalization_wrapper.py.	7 years ago
Lei Wang	24391c76de	Build: add make before make install to catch up Makefile change.	7 years ago
whs	adc09087c1	Add slice op. (#11052) * Add slice op. * Remove using from header file and fix doc. * Fix doc * Small fix.	7 years ago
qiaolongfei	6dd3f3cf27	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids	7 years ago
qiaolongfei	16658f7b59	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch	7 years ago
Xin Pan	1d198494d7	Merge pull request #11370 from panyx0718/dist Make status update thread-safe	7 years ago
chengduo	183377f410	Merge pull request #11306 from chengduoZH/enable_cpu_on_pe Enable CPU on Parallel executor	7 years ago
qiaolongfei	83a577e8ce	fix build problem	7 years ago
qiaolongfei	fe65064827	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch	7 years ago
Luo Tao	7bdb573d79	update with comments	7 years ago
qiaolongfei	506fc8d9e8	optimize code	7 years ago
gongweibao	d9de6b8621	Add brpc surpport. (#11263)	7 years ago
Xin Pan	1509ae3a53	Make status update thread-safe The status is updated in the Process() thread and can be checked in another HandleRequest() thread.	7 years ago
qiaolongfei	ea106c91e0	optimize comment and code	7 years ago
Luo Tao	7694199050	refine docs of elementwise_op etc.	7 years ago
qiaolongfei	7f4b9656a4	set status before Finish in prefetch process	7 years ago
dzhwinter	bfa3fd6f15	add inplace attribute to op_proto_maker (#10665) * "add inplace attribute" * "register inplace attribute" * "change se-next model for memory-reuse" * "fix typo" * repick * fix merge conflict * "fix stupid error"	7 years ago
qiaolongfei	5aba10b585	set the thread pool of prefetch to 1 to fix a bug	7 years ago
gongweibao	9087c6687f	polish (#11363)	7 years ago
qiaolongfei	8fb78f6c07	fix grpc_server_test	7 years ago
chengduoZH	173d72b481	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe	7 years ago
tensor-tang	b3fd9da60e	Merge pull request #11101 from mozga-intel/mozga-intel/Pool_mkldnn_layout MKLDNN layout: Support for pool operator	7 years ago
chengduoZH	aadaadf735	replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.	7 years ago
qiaolongfei	4e36c0ecab	update prefetch logic in grpc_server	7 years ago
gongweibao	627d7a64f8	Clean `sendop` `recv` operator. (#11309)	7 years ago
chengduo	fa29ef0b0d	Merge pull request #11277 from chengduoZH/check_ssa_graph Check SSA Graph	7 years ago
chengduoZH	961fbce8e2	follow comments	7 years ago
Yu Yang	3fd3e500cc	Merge pull request #11346 from reyoung/feature/add_lock_to_device_ctx Add lock to record_event.	7 years ago
yuyang18	2955ff5887	Polish documentation * row_conv * uniform_random * layer_norm * create_parameter * hard_shrink * ssd_loss	7 years ago
qiaolongfei	0d3d4ae775	refine prefetch logic	7 years ago
chengduoZH	7b723839ef	Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests	7 years ago
sneaxiy	831909ce69	Merge pull request #11313 from sneaxiy/argmin_argmax Add argmin and argmax op	7 years ago
chengduoZH	d24e046c1e	fix allReduce bug	7 years ago
yuyang18	a1254a86ba	Add lock to record_event.	7 years ago
Tao Luo	69b5a62c65	Merge pull request #11319 from luotao1/mkldnn add FLAGS_use_mkldnn to global control use_mkldnn	7 years ago
yuyang18	9b43edeae0	Polish arg_min_max_op * Remove unused arg_max/min_op.h * Remove reference parameter. Use pointer insteaded. * undef macro * Always set OutT as int64_t.	7 years ago
chengduoZH	a57e8a4338	add cpu test	7 years ago
Yu Yang	9328c3cf7b	Merge pull request #11308 from reyoung/feature/polish_api_ref Simplize API Reference Documentation	7 years ago
qiaolongfei	0485405b3d	add more debug string	7 years ago
Luo Tao	045589fae4	fix compiler error in high-level api	7 years ago
Luo Tao	79d555b9f2	Merge branch 'develop' into mkldnn	7 years ago
gongweibao	062d5a56b4	Add comments to a singleton. (#11333)	7 years ago
mozga-intel	7d5643562f	MKLDNN layout: Support for batch norm operator	7 years ago
mozga-intel	9908d3cfbc	MKLDNN layout: Support for convolution operator	7 years ago
mozga-intel	36031cb50f	MKLDNN layout: Support for pool operator	7 years ago
qiaolongfei	509cb0bc76	add unit test, pass the unit test	7 years ago
qiaolongfei	7cebec4b7e	init merge_ids_op	7 years ago
chengduoZH	1e731f5964	small fix	7 years ago
chengduoZH	495368c243	ADD CPU_NUM	7 years ago
chengduoZH	27073c284d	nccl_all_reduce_op_handle => all_reduce_op_handle	7 years ago
chengduoZH	2d94697a82	code refine	7 years ago
chengduoZH	5a3c8bf813	fix in c++ side	7 years ago
Wu Yi	7bcc98089a	Merge pull request #11321 from Yancey1989/polish_sparse_update polish sparse update logic	7 years ago
guochaorong	eced973091	Merge pull request #11317 from guochaorong/fix_bad_code Fix bad code in c plus and python	7 years ago
guochaorong	310598f99b	Update device_tracer.cc	7 years ago
fengjiayi	fae3d8d2dc	Merge pull request #11311 from JiayiFeng/a_small_fix fix a small compile error on Mac	7 years ago
sneaxiy	6d32e96096	remove redundant comments	7 years ago
Yancey1989	56964946d4	polish sparse update logic	7 years ago
Luo Tao	c6d230e03e	add FLAGS_use_mkldnn to global control use_mkldnn	7 years ago
guochaorong	04b8d3d03c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into paddle_fix	7 years ago
guochaorong	0fec9469f9	fix some bugs introduced by unfreed memory	7 years ago
yuyang18	8c9041f486	Refine LinearCRF	7 years ago
sneaxiy	568c4e5ec4	recommit using account sneaxiy	7 years ago
Yan Chunwei	145aaa4b49	loose threshold of TRT for CI in different model (#11305)	7 years ago
fengjiayi	d745840a6e	fix a small compile error on Mac	7 years ago
yuyang18	0d29e65924	Add resize_bilinear	7 years ago
yuyang18	b000e0de5d	Simplize API Reference Documentation	7 years ago
chengduoZH	0c851cab22	add SSA graph checker	7 years ago
fengjiayi	b587a7f66e	Merge pull request #11293 from JiayiFeng/update_crop_op Update crop op	7 years ago
Xin Pan	259e63d4a1	Merge pull request #11248 from panyx0718/dist Fix sparse vars usage for dist train	7 years ago
Xin Pan	2d7c836d32	Merge pull request #11295 from panyx0718/doc Refine API doc string	7 years ago
Yu Yang	8deff48db0	Merge pull request #11081 from reyoung/feature/python_doc Add document to random crop operator	7 years ago
fengjiayi	c7bbfb33ad	Fix a GPU bug	7 years ago
Yancey1989	1239fce771	polish sparse update code	7 years ago
chengduoZH	1076e85135	refine logic	7 years ago
Yancey	0aa9546eed	fix dist train error (#11281) * fix dist train error * update by comment	7 years ago
Xin Pan	e80c6b3c24	Refine API doc string	7 years ago
tensor-tang	80e882a398	Merge pull request #11247 from tensor-tang/infer_api Infer multi-threads API Demo and UT	7 years ago
cuichaowen	9141bee1e7	add Anakin api for paddle (#11228)	7 years ago
fengjiayi	24649a780d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_crop_op	7 years ago
dzhwinter	d48172f22a	split reduce op into multiple libraries, accelerate the compiling (#11029 ) * "split into multiple .ccl" * "refine file structure" * "refine files" * "remove the cmakelist" * "fix typo" * "fix typo" * fix ci	7 years ago
fengjiayi	5803115720	Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext fix bugs in the implementation of 'HasInput' and 'HasOutput'	7 years ago
fengjiayi	9c61409a18	Make crop op supporting taking offsets as one of its inputs	7 years ago
dzhwinter	f7c96f079b	Big data op_test benchmark, for checking output consistent in different runs. (#10646 ) * "init benchmark ops" * "untrack outputs" * "delete some usused code" * "benchmark" * "fix ci" * "fix op test" * "fix uint16 missing" * "fix ci" * "follow comments" * "fix ci" * "follow comments" * "conficts. merge develop branch" * repick * "merge develop branch"	7 years ago
fengjiayi	9ce0885067	Merge branch 'fix_bug_of_ExecutionContext' into update_crop_op	7 years ago
fengjiayi	dc8e0b494d	fix bugs in the implementation of 'HasInput' and 'HasOutput'	7 years ago
tensor-tang	e030741df9	fix gpu fraction	7 years ago
fengjiayi	4f46a98fa9	stash	7 years ago
tensor-tang	746a62ebe6	add gpu tests	7 years ago
tensor-tang	35e820dc2b	Merge remote-tracking branch 'ups/develop' into infer_api	7 years ago
mozga-intel	3ff9ba0e6b	Mkldnn layout (#11040 ) * Add MKLDNN layout support in Paddle Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout can be used in MKLDNN enabled OP kernel. Before this commit, NCHW is hardcode to be used in all MKLDNN op kernels. As a result, non-optimized execution path is selected in MKLDNN primitive which bring worse performance. Besides framework change, three MKLDNN OP kernels were updated for using new MKLDNN layout. They are conv/pool2d/batch_norm. Other MKLDNN OP kernels need be also updated in similar way to achieve best performance. * Add MKLDNN layout support in activation OP * Don't populate layout from input to output when kMKLDNN in * Refine pool mkldnn op kernel * MKLDNN layout * Remove the inferitance from tensor file * MKLDNN layout: refactoring * Remove additional #define to register new operator * Prepare mkldnn tests to work with layout	7 years ago
chengduoZH	8291b916d6	replace graph_builder_factory with ssa_graph_builder_factory	7 years ago
chengduoZH	9ac785be39	check graph's validation	7 years ago
fengjiayi	a1e046bfc0	Merge pull request #11270 from JiayiFeng/fix_a_error_on_max fix a compile error on Mac	7 years ago
Yu Yang	03073df182	Merge pull request #11237 from chengduoZH/add_fuse_var_op_handle [Feature] Add fuse vars op handle	7 years ago
Tao Luo	6d80dd5a50	Merge pull request #11222 from luotao1/trt rewrite unittest of trt_activation_op	7 years ago
fengjiayi	499dbe0536	fix a multi-thread bug in readers	7 years ago
fengjiayi	7344210070	Merge branch 'fix_a_error_on_max' into fix_reader_bug	7 years ago
fengjiayi	2f5e310167	fix a compile error	7 years ago

8942 Commits (1958654d6f15087c28b44759c1a8d004826f00ce)