Paddle

Commit Graph

Author	SHA1	Message	Date
Yu Yang	04a570b463	Fix ut test=develop	7 years ago
heqiaozhi	09d669ba40	fix static_cast to const_cast	7 years ago
peizhilin	23dec78772	fix script issue test=develop	7 years ago
heqiaozhi	bd1c1724aa	add ps_instance doc	7 years ago
Yu Yang	62b5772ffb	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type	7 years ago
Wang Guibao	e213050223	Fix multi-threading bug with WItH_MKL=ON (#14882 ) fixes #14884	7 years ago
Yancey1989	4f304eaa6f	fix unittest test=develop	7 years ago
dongdaxiang	d839bd0dd4	simple commit	7 years ago
minqiyang	3bb9b0cac2	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg test=develop	7 years ago
minqiyang	a81495d6f4	Fix code	7 years ago
minqiyang	ad6ae0b071	1. Add SpinLock 2. Seperate the lock of kids and vars in Scope test=develop	7 years ago
Yancey1989	c722b1dcb6	Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode test=develop	7 years ago
Yu Yang	4ecdb6f486	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type test=develop	7 years ago
sneaxiy	f6741df462	merge develop fix bug test=develop	7 years ago
Zeng Jinle	1b564bc49a	Merge pull request #14670 from sneaxiy/refactor_eager_deletion Rewrite eager deletion	7 years ago
dongdaxiang	c9b799896e	fix tag in async_executor	7 years ago
heqiaozhi	f81957a753	refine cmake for pslib & pre_define	7 years ago
sneaxiy	2328bee1cc	fix Windows compile bug test=develop	7 years ago
Yancey1989	23eb8c4299	fix ci test=develop	7 years ago
heqiaozhi	2912d5311b	fix code style bug & change pslib.cmake & change Cmakelist adapt pslib	7 years ago
dongdaxiang	c59cdf3a24	refine executor_thread_worker.h and executor_thread_worker.cc code style	7 years ago
heqiaozhi	3c01cdeff0	refine executor_thread_worker.cc & executor_thread_worker.h code style	7 years ago
dongdaxiang	c71279bc69	refine code style for async_executor.h and async_executor.cc	7 years ago
Yu Yang	162637b64a	Fix ngraph compile test=develop	7 years ago
dongdaxiang	33ee5cad61	format code style of executor_thread_worker.cc	7 years ago
Yu Yang	194e66f785	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type	7 years ago
heqiaozhi	7bd16e3afa	fix some bug & add log	7 years ago
Xin Pan	30aad88449	Merge pull request #14839 from panyx0718/infer3 fix clang	7 years ago
sneaxiy	db2daefe50	merge develop test=develop	7 years ago
Yancey1989	106e285236	add unittest for parllelgraph mode test=develop	7 years ago
minqiyang	a61eb543f5	Add RWLock to Scope	7 years ago
sneaxiy	e82772f425	fix cmake conflict test=develop	7 years ago
sneaxiy	0c554a59fa	merge develop test=develop	7 years ago
minqiyang	1b61021cb3	Polish code	7 years ago
minqiyang	b75bd29c3a	Remove debug info	7 years ago
minqiyang	7a43e51703	Add gperf tools	7 years ago
Yu Yang	c00e07cda0	Fix distribute compile test=develop	7 years ago
tensor-tang	d846fb03a8	Merge pull request #14807 from baojun-nervana/ngraph_follow_up Follow ups on nGraph PR comments	7 years ago
sneaxiy	ca84c2ca8f	merge develop test=develop	7 years ago
sneaxiy	e240ba2918	implement backward test=develop	7 years ago
sneaxiy	06f8aa5b97	remove while_op support temporarily test=develop	7 years ago
Yu Yang	81520a24cf	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor	7 years ago
Yu Yang	9bd70a1e04	Change tensor uses proto::VarType::type test=develop	7 years ago
Yu Yang	8175983ef9	Merge pull request #14814 from reyoung/feature/gprof Add gperftools supports for PE	7 years ago
heqiaozhi	10ed9e0a6e	download & run & instance	7 years ago
Yancey1989	5cc83f79bf	update by comment	7 years ago
Yancey1989	82726402be	exception safe	7 years ago
minqiyang	570338699b	Add debug info	7 years ago
Yu Yang	9f8d86858e	Revert data_type test=develop	7 years ago
sneaxiy	7923042365	merge develop test=develop	7 years ago
Tao Luo	3243b45bb2	Merge pull request #14651 from luotao1/has_attr add Opdesc's HasProtoAttr	7 years ago
Xin Pan	1735022a1b	fix clang test=develop	7 years ago
heqiaozhi	016a068775	stop server	7 years ago
Tao Luo	067ed70f2d	add HasProtoAttr function in op_desc.h, clean node.h test=develop	7 years ago
heqiaozhi	8e3fe2d735	add skip op	7 years ago
Yancey1989	79082c9459	fix pyreader failed	7 years ago
minqiyang	202b2f1fa7	Move the beta pow scale calculation into Adam Op	7 years ago
Yu Yang	b22d638d8f	Speed up SizeOfType test=develop	7 years ago
Xin Pan	4a716a70c6	Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative Imperative	7 years ago
Brian Liu	9623b45f40	Remove unnecessary MKLDNN reorder (#14799 ) When data flow from a MKLDNN OP kernel to a non-MKLDNN OP kernel, data layout transform (via MKLDNN reorder) will occur even when those two OP kernels share same layout. Add code to remove this unnecessary reorder. test=develop	7 years ago
Yancey1989	2dda19f756	Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode	7 years ago
Tao Luo	e99597d35c	Merge branch 'develop' into luotao1-has_attr	7 years ago
Yu Yang	f0c0bf328d	Add gperftools supports for PE	7 years ago
Xin Pan	748549b2e3	Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative" This reverts commit `b1d3a1c8b4`, reversing changes made to `f1fb64b17f`.	7 years ago
bingyanghuang	943ad4781f	One possible solution to add flexibility for mkldnn placement pass (#14768 ) * Choose to turn on use_mkldnn attribute v1 * Fix mkldnn_op empty bug * format change test=develop * fix ci test=develop * fix ci test and add test in dam test=develop * add example to dam compare test test=develop * review changes test=develop	7 years ago
baojun-nervana	fddbd87c0a	Rename argument test=develop	7 years ago
baojun-nervana	22ac2133e4	Rename class test=develop	7 years ago
baojun-nervana	bfde5e10ce	Move ngraph compile control to cmake test=develop	7 years ago
sneaxiy	2c6159a151	fix unittest fix cmake test=develop	7 years ago
Xin Pan	c049fa7cf7	Revert "Revert "Revert "Imperative"""	7 years ago
gongweibao	f1fb64b17f	Add reduce sparse tensor feature. (#14757 )	7 years ago
sneaxiy	eb8252466b	polish code add unittest model containing while_op remove unnecessary codes test=develop	7 years ago
Tao Luo	c83d5b7a16	Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl Implement the fusion of convolution 3D and bias for mkldnn	7 years ago
Yancey1989	47740ace28	fix performance	7 years ago
Yancey1989	220db4f334	clean code	7 years ago
Yancey1989	cb8a24be14	clean code	7 years ago
heqiaozhi	575ae7c6c3	refine pslib inferface & fix some bugs	7 years ago
Yihua Xu	3821fc3950	Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl test=develop	7 years ago
Yihua Xu	240d974ac5	Clean Code test=develop	7 years ago
Xin Pan	2538ef64f1	Revert "Revert "Imperative""	7 years ago
guru4elephant	b82a44ea85	Merge pull request #14778 from wangguibao/async_executor_bugfix Async executor bugfix: Tensor changed to LoDTensor	7 years ago
sneaxiy	8095fb5e68	fix code bug in CPU compilation test=develop	7 years ago
sneaxiy	387bac46b5	refine code test=develop	7 years ago
Tao Luo	cf66133857	Merge pull request #14734 from luotao1/memory_load support loading from memory	7 years ago
Xin Pan	6217f42ab7	Revert "Imperative"	7 years ago
Tao Luo	743cb840f1	update with comments test=develop	7 years ago
Yancey1989	c9de6f1b05	init parallel graph mode	7 years ago
wangguibao	5a2cd4505b	AsyncExecutor bugfix: Tensor to LoDTensor test=develop	7 years ago
wangguibao	5f98d80039	AsyncExecutor bugfix: Tensor change to LoDTensor	7 years ago
Tao Luo	923b18877e	Merge branch 'develop' into memory_load test=develop	7 years ago
Tao Luo	405b2486db	support loading from memory test=develop	7 years ago
Xin Pan	b52f5d2870	Merge pull request #14398 from panyx0718/imperative Imperative	7 years ago
heqiaozhi	d3ca359e44	config init & adapt to interface	7 years ago
Xin Pan	73b4d1aa72	Merge pull request #14742 from panyx0718/infer2 support customized kernel selection	7 years ago
Xin Pan	82d68281c0	follow comments test=develop	7 years ago
Xin Pan	41c28d54c6	allow customize kernel selection test=develop	7 years ago
Xin Pan	439af8d50a	Merge pull request #14717 from panyx0718/infer fix a const_cast and avoid using stale program.	7 years ago
sneaxiy	d0c8b9b9b3	remove timeout unittest test=develop	7 years ago
heqiaozhi	419506f510	refine for compile pslib.so	7 years ago
Wu Yi	29d9fb53fc	[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661 ) * wip multi process multi gpu dist training * workable for p2p * update test=develop * change back env name test=develop * fix alloc init * fix cpu build test=devlop * fix mac tests test=develop * refine code * refine test=develop	7 years ago
sneaxiy	e694d0c2e4	fix while_op eager deletion bug add unittest test=develop	7 years ago
Xin Pan	dc458b1482	Merge pull request #14713 from panyx0718/api add more files to protected file list	7 years ago
heqiaozhi	2301abc481	cc libaray add pslib	7 years ago
sneaxiy	35a2578426	fix bug test=develop	7 years ago
zhang wenhui	abbe382e1e	Revert "Add EstiminateFlops"	7 years ago
Xin Pan	0591ba96ec	fix hack test=develop	7 years ago
sneaxiy	64ad051b9a	merge develop test=develop	7 years ago
sneaxiy	c47c451a00	fix bug	7 years ago
heqiaozhi	a77fa67bbd	async_thread_trainer & libmct & pslib.cmake	7 years ago
Tao Luo	3437e17713	Merge branch 'has_attr' of https://github.com/luotao1/Paddle into luotao1-has_attr	7 years ago
Xin Pan	bcf36d8401	add more files to protected file list test=develop	7 years ago
Tao Luo	0e3048db43	Merge pull request #14659 from luotao1/update_pass update is_test_pass and mkldnn_placement_pass	7 years ago
Xin Pan	7e0801d4ed	Merge pull request #14441 from baojun-nervana/intel/ngraph_op Implementing ngraph engine	7 years ago
Xin Pan	b80fe8264a	polish test=develop	7 years ago
Yihua Xu	64e261c6cd	Implement the fusion of convolution and bias for mkldnn (test=develop)	7 years ago
Tao Luo	8d6984eb9b	change OpHasAttr to RuntimeHasAttr, add some comments test=develop	7 years ago
Tao Luo	a6ac42669c	Merge branch 'develop' into update_pass	7 years ago
Yu Yang	0f0e197914	Merge pull request #14666 from reyoung/feature/estiminate_flops Add EstiminateFlops	7 years ago
Xin Pan	aeb74af54c	allow operator to run imperatively	7 years ago
Xin Pan	b1f6fda5e5	run forward	7 years ago
barrierye	08233beed7	add the comment for CheckFile function. test=develop	7 years ago
barrierye	d62a3dd72d	add the comment for CheckFile function. test=develop	7 years ago
barrierye	d89108766c	update CheckFile function in data_feed for ignore the space at the end of each line of data(for example, it may be added '\t' character to the end of the reduce task output when processes data by hadoop, which does not affect the correctness of the data). test=develop	7 years ago
Tao Luo	61ae88b760	Revert "Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled"	7 years ago
dongdaxiang	52a0be7bb4	add mct into CMakeLists.txt	7 years ago
Tao Luo	99177b424b	Merge pull request #14693 from kbinias/fix-for-accuracy-problem-for-inlplace-operators Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled	7 years ago
heqiaozhi	3c239cd640	pslib	7 years ago
Krzysztof Binias	bc7db6cec9	Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled test=develop	7 years ago
Wang Guibao	41e19eb431	AsyncExecutor (#14627 ) * AsyncExecutor: C++ side * Google naming conventions * Rename MultiExecutor to AsyncExecutor * pybind with async_executor * Naming convention * remove some flags and unused code * add refactored file of async_executor and data_feed * clear async executor interface and add data feed factory * split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto * Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch * refine async_executor_refactor.cc * add some files about datafeed * Revert "add some files about datafeed" This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d. * Interface rework * add MultiSlotDataFeed * Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side * update data_feed for add MultiSlotDataFeed * update datafeed and async_executor to run bow_net demo * fix bug that finish_set_filelist failed in multithread * delete finish_binding_memory_(flag), because it can not be marked under the current interface * Fix bug * update async_executor.py for support set_use_slots * update async_executor.py for support set_use_slots and set set_dense_slots * fix bug that when the number of files is less than the number of threads, it will fetch nan * remove redundant code, and make executor exit when set a illegal queue size * add batch_size check * add MultiSlotDesc * Revert "add MultiSlotDesc" This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e. * add some checkpoint in DataFeedDesc * add CheckFile function in MultiSlotDataFeed * update something error info * fix deaded lock bug * Fix fetch variable * Merge error * fix code style in async_executor * using one lock blocking queue replace two lock blocking queue because of some bugs * update code style * add utest for data_feed * Fix fetch var * update utest for data_feed for multithread * update SetFileList info * fix bug in utest of data_feed * Add comments for python * Add comments for python code * Fix pybind.cc with new pybind11 version * add note for DataFeedDesc's set_use_slots function * Add save_model * update data_feed_test for multi-type * add comment for executor_thread_worker * Remove unused code * update data_feed_test for generate test data file * removed unnecessary interfaces and add comments * c++ style check * update data_feed.cc * AsyncExecutor: C++ side Google naming conventions Rename MultiExecutor to AsyncExecutor pybind with async_executor Naming convention remove some flags and unused code add refactored file of async_executor and data_feed clear async executor interface and add data feed factory split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch refine async_executor_refactor.cc add some files about datafeed Revert "add some files about datafeed" This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d. add MultiSlotDataFeed Interface rework Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side update datafeed and async_executor to run bow_net demo update async_executor.py for support set_use_slots Fix bug update async_executor.py for support set_use_slots and set set_dense_slots fix bug that when the number of files is less than the number of threads, it will fetch nan remove redundant code, and make executor exit when set a illegal queue size add MultiSlotDesc Revert "add MultiSlotDesc" This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e. add some checkpoint in DataFeedDesc Fix fetch variable fix code style in async_executor Fix fetch var add utest for data_feed Add comments for python update utest for data_feed for multithread fix bug in utest of data_feed Add comments for python code Fix pybind.cc with new pybind11 version add note for DataFeedDesc's set_use_slots function update data_feed_test for multi-type Add save_model update data_feed_test for generate test data file removed unnecessary interfaces and add comments add comment for executor_thread_worker Remove unused code update data_feed.cc c++ style check * commit for code style * commit for code style * commit for code style * commit for code style * Comment away __init__ in async_executor.py * clang-format fix test=develop * use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc * commit for update code style * commit for update code style * Add async_executor demo; Remove some methods test=develop * commit for update code style * commit for update code style * commit for update code style * update API.spec * AsyncExecutor test=develop * AsyncExecutor test=develop * AsyncExecutor test=develop * AsyncExecutor test=develop * Fix API.spec test=develop * Fix API.spec test=develop * Fix windows build error test=develop * FIx windows build error test=develop * FIx windows build error test=develop * FIx windows build error test=develop * Fix Windows Build test=develop * Fix Windows Build test=develop * Fix Windows Build test=develop * Fix code style test=develop * Fix code style test=develop * update datafeed * Fix code style test=develop * update data_feed_test for test Tensor test=develop * Fix code style test=develop * Fix windows build failure test=develop * Fix code style and windows build failure test=develop * Fix PYTHON3.5 build failure test=develop * AsyncExecutor API test=develop	7 years ago
baojun-nervana	fc61bf1b16	Renamed methods test=develope	7 years ago
sneaxiy	096673f675	refactor eager deletion test=develop	7 years ago
Tao Luo	c856ac8721	add OpHasAttr in node.h, update is_test_pass and mkldnn_placement_pass test=develop	7 years ago
Qiao Longfei	44debca844	Merge pull request #14589 from jacquesqiao/refactor-prefetch Refactor prefetch	7 years ago
luotao1	5db273d874	enhance HasAttr to fix ci test=develop	7 years ago
Yu Yang	589b863b98	Add EstiminateFlops test=develop	7 years ago
baojun-nervana	d5ee05e6c3	Replaced VarIsTensor test=develop	7 years ago
baojun-nervana	e6bd53be60	Named to RuntimeInferShape test=develop	7 years ago
baojun-nervana	a29696146c	Added annotation test=develop	7 years ago
baojun-nervana	caf4b937b3	Added RunInferShape test=develop	7 years ago
baojun-nervana	1d19eb2bd4	Implemented ngraph engine test=develop	7 years ago
luotao1	fe915901cd	update Opdesc's HasAttr test=develop	7 years ago
chengduo	6776e92846	refine tensor_array_write_read (#14643 ) test=develop	7 years ago
Qiao Longfei	8ebde595c9	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch test=develop	7 years ago
Qiao Longfei	145c535750	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch test=develop	7 years ago
minqiyang	a02ce58f2c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	7 years ago
JiabinYang	7594787deb	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op	7 years ago
Xin Pan	3c77ce3751	Merge pull request #14593 from panyx0718/fix5 Protect important header files.	7 years ago
Tao Luo	e8ef14d2a7	Merge pull request #14610 from Superjomn/revert/cache_fix Revert "fix transfer cache thread_local bug (#14581)"	7 years ago
JiabinYang	87648f8edf	merge develop, test=develop	7 years ago
wopeizl	db9284ecde	Merge pull request #14617 from wopeizl/windows/online Windows/online	7 years ago
JiabinYang	c3c3c0b33c	polish code, test=develop	7 years ago
gongweibao	867c312bc4	Fix allreduce dependency order. (#14586 )	7 years ago
Qiao Longfei	9589babe12	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch test=develop	7 years ago
peizhilin	38715e6fd0	minor fix	7 years ago
JiabinYang	7e4bd695e6	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op	7 years ago
JiabinYang	b10df8bcfa	refine code and add none bias ut, test=develop	7 years ago
superjomn	dc249d3b69	Revert "fix transfer cache thread_local bug (#14581 )" This reverts commit `5c073a4db2`.	7 years ago
minqiyang	be04d99fe4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	7 years ago
Qiao Longfei	af2f5fc824	fix some bugs	7 years ago
Xin Pan	3e665862b8	Protect important header files. test=develop	7 years ago
minqiyang	53433d7f2e	Revert the changes of VLOG test=develop	7 years ago
peizhilin	30849d1f20	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Yan Chunwei	5c073a4db2	fix transfer cache thread_local bug (#14581 )	7 years ago
Xin Pan	87332bb18d	Merge pull request #14579 from Superjomn/fix/transfer-cache-compile-error fix compile	7 years ago
peizhilin	36cd18b549	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
peizhilin	1afa9492af	Recover the profiler	7 years ago
Yiqun Liu	bf222f197d	Use sub scope in tensor_array_to_tensor op. (#14524 ) test=develop	7 years ago
superjomn	4b40c0013b	fix compile test=develop	7 years ago
Yan Chunwei	a7188d5bc7	fix executor transfer cache bug (#14518 )	7 years ago
JiabinYang	42470f14b7	test=develop	7 years ago
qingqing01	36f08eef3b	CUDA kernel for density_prior_box_op. (#14513 ) * CUDA kernel for density_prior_box_op. * Support flatten to 2D.	7 years ago
peizhilin	7c8c9dc9bf	fix unit test cases	7 years ago
JiabinYang	3c6102a367	test=develop	7 years ago
sneaxiy	488610a65a	merge develop test=develop	7 years ago
wopeizl	d9a1f3e58e	Windows/online (#14474 ) * add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop	7 years ago
sneaxiy	3912545ffe	add dlpack support test=develop	7 years ago
Jiabin Yang	f7b55de9e5	Merge branch 'develop' into enhance_hierachical_sigmod_op	7 years ago
JiabinYang	af9a3301da	test=develop	7 years ago
JiabinYang	014e50c284	test=develop	7 years ago
peizhilin	6e66fadb95	clean up the pre-definitions on windows	7 years ago
peizhilin	67562a6fcd	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
peizhilin	703b26e697	add profiler, parallel_executor back	7 years ago
tensor-tang	10fb4ceefc	Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul [MKLDNN][JIT][AVX512] Elementwise Mul	7 years ago
peizhilin	25adf970b2	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
chengduo	a94a7355f0	Refine the GraphNum check (#14144 ) * refine GraphCheck test=develop * fix ci fail test=develop	7 years ago
peizhilin	3a72a634cf	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Tao Luo	cf685f361b	Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections [WIP] Correcting and extending MKLDNN residual connection fuse pass	7 years ago
Yu Yang	98bbfc17be	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation test=develop	7 years ago
Yu Yang	d424115f9e	Clean code test=develop	7 years ago
peizhilin	1cc23ef67d	merge from paddle:develop	7 years ago
Michal Gallus	4e54ab76ec	Add HasAttr method to Operator	7 years ago
Wu Yi	a2d9b34417	Refine operator cmake (#14413 ) * wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop	7 years ago
Tomasz Patejko	53da846d1e	MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass test=develop	7 years ago
Tomasz Patejko	dbc4fcd722	MKLDNN residual connections fuse pass: unit tests enabled and added	7 years ago
Tomasz Patejko	4224089354	MKLDNN residual connections fuse pass: Maybe removed and boost::optional used where it makes sense	7 years ago
Tomasz Patejko	86fd3b32be	MKLDNN residual connections fuse pass: counting statistics added to the pass	7 years ago
Tomasz Patejko	ee6f778beb	MKLDNN residual connections fuse pass: further refactoring	7 years ago
Tomasz Patejko	7423748e37	MKLDNN residual connections fuse pass: * implements reachability check between identity node and non-identity argument to elementwise_add * implements handling identity node as x and as y argument to elementwise_add	7 years ago
peizhilin	6d0d5a76eb	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Yan Chunwei	7796f65f89	fix inference on gpu out of mem (#14414 ) * fix inference on gpu out of mem the transfer logic in operator.cc will keep creating new scopes.	7 years ago
Yu Yang	c8f6e70ab4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation test=develop	7 years ago
Sylwester Fraczek	8a1eeec579	add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278 ) * add is_test to pooling and activations add prop_kind support for layers activation. conv and pooling add a pass that sets is_test to true add transpiler version of is_test pass test=develop * patch test and pass test=develop * add pass to analyzer.h test=develop * add is_test attr description & pass only on mkldnn in: activation_op.cc batch_norm_op.cc conv_op.cc dropout_op.cc lrn_op.cc pool_op.cc sequence_pool_op.cc softmax_op.cc * fix is_test handling for activation pool and conv * change description of is_test for all layers again * remove GetAttr(use_mkldnn) from pass * rename correct_mkldnn_test_phase to is_test and remove dependency on MKLDNN test=develop * review fix magic number * two if(..)s into one * Check is_test once and pass mkldnn forward prop kind * dereference shared_ptr with * (without get()) test=develop * add is_test_pass back test=develop	7 years ago
peizhilin	d1429ac4a5	add recordio support	7 years ago
Yu Yang	0d6718fcbd	Pass compile	7 years ago
Tao Luo	5ef123c778	Merge branch 'develop' into dam_fc	7 years ago
dzhwinter	d3aed98d86	Merge pull request #14320 from wopeizl/windows/online Windows/online	7 years ago
peizhilin	be332a13bc	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Xin Pan	8cfda7ee0c	Merge pull request #14382 from panyx0718/fix4 Refine the pass builder and buildstrategy	7 years ago
Tao Luo	980a6753a8	fix typo to pass the ci test=develop	7 years ago
Tao Luo	c7b3bfcdf1	Merge pull request #14376 from baojun-nervana/intel/ngraph_fusedop Adding fused operator for ngraph	7 years ago
peizhilin	1a9008c420	code style fix test=develop	7 years ago
Tao Luo	8ea13e336a	add in_num_col_dims for fc	7 years ago
peizhilin	08d1dc84a9	fix	7 years ago
peizhilin	30ddc07a7e	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
Qiao Longfei	e65cbd3b06	Merge pull request #14387 from jacquesqiao/lookup_sparse_table_add_test_mode Lookup sparse table add test mode	7 years ago
Zeng Jinle	7066b3850a	Merge pull request #14395 from sneaxiy/fix_num_threads_in_fast_pe Fix num_threads settings in fast_pe	7 years ago
Yan Chunwei	9f252e0032	Combine Inference Analysis with IR (#13914 )	7 years ago
baojun-nervana	51a538e055	Fix style and use enum test=develop	7 years ago
Zeng Jinle	38d32c98b8	merge develop test=develop	7 years ago
sneaxiy	eb18d532a5	fix num_threads in fast_pe test=develop	7 years ago
Qiao Longfei	51f3838f96	add log for not exist code test=develop	7 years ago
Qiao Longfei	7aa8b2ccf2	optimize code	7 years ago
Qiao Longfei	8d205c853c	add is_test for lookup_sparse_table	7 years ago
baojun-nervana	ea3538d8dd	Added fused operator test=develop	7 years ago
Xin Pan	759ffca423	some improvements test=develop	7 years ago
Xin Pan	99dffb91d6	allow to repeatedly share and update BuildStrategy test=develop	7 years ago
Tao Luo	6c32945556	Merge pull request #14372 from luotao1/speedup_analysis speedup DetectPatterns	7 years ago
Tao Luo	668ae523d2	speedup DetectPatterns test=develop	7 years ago
Yan Chunwei	9a6e239281	fix mac graph detector sort (#14356 )	7 years ago
peizhilin	1b75fd2236	revert	7 years ago
peizhilin	7840d181c9	fix style issue	7 years ago
peizhilin	dc339b78d7	fix code style	7 years ago
sneaxiy	d231e55065	merge develop test=develop	7 years ago
JiabinYang	c8801e100f	grad diff problem to be fixed and need api spec change to be done	7 years ago
peizhilin	e23061e0dc	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
chengduo	6c6e638550	Add InferVarType for some op (#14201 ) * add_infer_var_type test=develop * InferVarTypeHelper-> VarTypeInferenceHelper test=develop * PassInputTypeAndDTypeOnOutput test=develop * follow comment test=develop	7 years ago
peizhilin	ca60e1d34d	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
peizhilin	d08334011a	fix merge issue	7 years ago
peizhilin	52f7644f53	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
qingqing01	abe209234f	Exhaustive search for cuDNN conv. (#14286 ) * exhaustive search for cuDNN conv. * Refine code and add unit testing. * Fix model load in fluid/inference and unit testing in conv2d * Follow comments. * Fix compiling test=develop	7 years ago
Yu Yang	fdc689142c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation test=develop	7 years ago
minqiyang	87450b9ad4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	7 years ago
peizhilin	dcfab11193	merge from develop	7 years ago
peizhilin	4ffa92d4f0	Merge branch 'develop' into windows/build	7 years ago
chengduo	c5b6573a5a	Fix input<tensor> (#14208 ) * fix input<tensor> test=develop * fix split_ids test=develop * ElementwiseMul should not support SelectedRows * fix scale op test=develop * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar() * fix operator * refine MultiOutput * fix MultiOutput test=develop * disable test_dist_save_load test=develop * fix elementwise_op test=develop * add get_sparse_as_op test=develop * add info for check test=develop * rename get_sparse_as_op with extract_rows_as_op. test=develop * elementwise doesn't support selected_rows * fix regularizer * remove extract_rows_as test=develop * fix ci test=develop * add test for sum_op * fix regularizer test=develop * test=develop * fix pserver weight decay multi inputs test=develop	7 years ago
Tao Luo	813e54efbd	Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug Revert "cherry picked windows patches."	7 years ago
minqiyang	3db9fad764	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog test=develop	7 years ago
Xin Pan	ff6c809bfc	Merge pull request #14251 from panyx0718/fix Make OpHandle/VarHandle and ir::Node works cleaner	7 years ago
Zhaolong Xing	ba8b5619a3	Revert "cherry picked windows patches."	7 years ago
minqiyang	49710960ef	Revert tensor_util.cu test=develop	7 years ago

... 3 4 5 6 7 ...

1877 Commits (4461a458a58c1bc55fba76256e7d9fd0d5b09486)