Paddle

Commit Graph

Author	SHA1	Message	Date
pkpk	e9c7e218f2	Nan debugger init (#18401) test=develop	6 years ago
Qiao Longfei	58f7695ab2	Async exe support communicator (#17386) Async exe support communicator	6 years ago
Qiao Longfei	728bbaa4e3	add cache_update_mutex_ for operator test=develop (#17124) * add cache_update_mutex_ for operator	6 years ago
Tao Luo	5babcd02dd	Revert "remove unnecessary prepare_data (#17080)" (#17432) This reverts commit `aca60e9a20`.	6 years ago
Zeng Jinle	5e5e7b3305	fix data_type error message (#17312) test=develop	6 years ago
Tao Luo	aca60e9a20	remove unnecessary prepare_data (#17080) test=develop	6 years ago
luotao1	490e746269	fix runtime_context_cache bug when gpu model has an op runs only on cpu test=develop	6 years ago
luotao1	4098ba29ed	reduce hasAttr elapsed time in RunImpl test=develop	6 years ago
luotao1	f89a9c5d95	Merge branch 'develop' into has_attr	6 years ago
luotao1	6afc97ca6b	reduce hasAttr elapsed time in RunImpl test=develop	6 years ago
luotao1	226596a296	Merge branch 'develop' into core_opt_choose_kernel	6 years ago
gongweibao	a61ed9782e	fix log level test=develop (#16554)	6 years ago
liuwei1031	278debab71	fix comments of 16410, test=develop (#16499) * fix comments of 16410, test=develop * modify inplace_op_inference_test according to pass interface change, test=develop	6 years ago
gongweibao	eb83abeac3	Add DGC(Deep Gradient Compression) interface. (#15841)	6 years ago
Zeng Jinle	c7c6eeb44e	Merge pull request #16409 from sneaxiy/feature/advance_gc Enhance gc to support deleting tensor buffer in advance	6 years ago
liuwei1031	8d22bc17a4	Memory optimize (#16410) * fix cdn issue, test=develop * fix memory optimize bugs, test=develop * fix memory optimize bugs, test=develop * remove add/sub_2 op, test=develop * disable memory_optimize by default, test=develop * disable inplace activation in python, test=develop * fix unittests, test=develop * fix unittests, test=develop * bug-fix, test=develop	6 years ago
sneaxiy	7000ec85d9	fix some op grad maker fix ctest eager deletion disable bug test=develop	6 years ago
sneaxiy	a93a9eef8f	add op registry type refine gc code test=develop	6 years ago
luotao1	056599a738	add expected_kernel_cache_pass test=develop	6 years ago
luotao1	bfdab00e5b	Merge branch 'develop' into core_opt_choose_kernel	6 years ago
luotao1	6c6a39222b	Merge branch 'core_opt_choose_kernel' of https://github.com/Xreki/Paddle into core_opt_choose_kernel	6 years ago
luotao1	82af8031d9	add runtime_context_cache_pass test=develop	6 years ago
Tao Luo	7d2740db83	Revert "cache runtime_context"	6 years ago
luotao1	cc0ae1f1a1	refine with comments test=develop	6 years ago
luotao1	46ee6bb1aa	fix distributed unit-tests test=develop	6 years ago
luotao1	b2898c0f57	Merge branch 'develop' into runtime_context test=develop	6 years ago
Tao Luo	4ef6f738c3	Merge pull request #16154 from luotao1/infershape_example add all_kernels_must_compute_runtime_shape example for speedup infershape	6 years ago
luotao1	d94fd97230	add runtime_context_cache_pass test=develop	6 years ago
luotao1	b561ad1e55	Merge branch 'develop' into runtime_context	6 years ago
luotao1	fe78a92e6e	refine with comments test=develop	6 years ago
wopeizl	85709f4378	restore the exception caught since it is necessary for python call stack (#16160) test=develop	6 years ago
luotao1	31ccaf0916	add all_kernels_must_compute_runtime_shape example for speedup infershape test=develop	6 years ago
Tao Luo	f4587789d8	remove legacy function in ExecutionContext test=develop	6 years ago
Liu Yiqun	1041e18c47	Refine codes. test=develop	6 years ago
luotao1	c0b240aa43	try to fix distributed unit-test test=develop	6 years ago
luotao1	784826a4f5	enhance cache runtime_context for different scope test=develop	6 years ago
luotao1	2fb38c108c	Merge branch 'develop' into runtime_context	6 years ago
Liu Yiqun	d4674dab13	Cache the chosen kernel of operators'. test=develop	6 years ago
luotao1	9773f38f99	cache runtime_context test=develop	6 years ago
tangwei12	6d5a04c1e7	add op type in check nan/inf (#15986) * add op name in check nan/inf, test=develop	6 years ago
Xin Pan	5dd281f738	polish test=develop	6 years ago
Xin Pan	5eb87506bc	add per kernel config and remove const_cast. test=develop	6 years ago
Dun	a83e470405	Profiler refine and add CUDA runtime api tracer (#15301 ) * refine profiler && add runtime tracer * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * fix bug && test=develop * add thread id map && test=develop * test=develop * testing * bug fix * remove cuda event && refine code && test=develop * test=develop * test=develop * test=develop * fix windows temp file && test=develop * test=develop * fix windows bug && test=develop * fix start up issue && test=develop * code polish && test=develop * remove unused code && test=develop * add some cupti cbid && test=develop * add FLAGS_multiple_of_cupti_buffer_size && test=develop * fix compile error && test=develop * add keyword && test=develop * fix && test=develop * code polish && test=develop	6 years ago
chengduo	ad61e1b22c	fix potential bug (#15688) test=develop	6 years ago
Gabor Buella	da9c94da33	Clang build fixes (#15628 ) * Remove some superfluous std::move calls The std:move triggered a build error (with -Werror): ``` [ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move] [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^ /home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here [this] { return std::move(CreateAllocatorWithChunk()); }, capacity); ^~~~~~~~~~ ~ 1 error generated. ``` See: https://reviews.llvm.org/D7633 * Remove a superfluous lambda capture from framework/operator.h ``` [ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19: /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] [this](Variable* var) { return var; }); ^~~~ 1 error generated. ``` Changing it to `return it->second;`, as is in the function below. * Rethrow an exception (instead of copying it) ``` [ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move] throw exception; ^~~~~~~~~ /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying throw exception; ^~~~~~~~~ std::move(exception) ``` See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message. 
* Remove an unused variable ``` /home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field] const Scope& scope_; ^ ``` * struct ComputationOpHandle -> class ComputationOpHandle ``` [ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21: /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags] class ComputationOpHandle; ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here struct ComputationOpHandle : public OpHandleBase { ^ /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here? class ComputationOpHandle; ^~~~~ struct 1 error generated. ``` * Fix name() methods under fluid/operators ``` In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15: In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19: /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override] virtual const char* name() const = 0; ^ /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here virtual const char* name() const = 0; ^ ``` test=develop	6 years ago
Jiabin Yang	fd286f3596	Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative test=develop, fix/multi_output_support_imperative	6 years ago
JiabinYang	5639f49b16	test=develop, fix/multi_output_support_imperative	6 years ago
JiabinYang	c52f57de5b	test=develop, refine_error_message for data type	6 years ago
minqiyang	8ce198b2e1	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet test=develop	6 years ago
乔龙飞 Qiao Longfei	b14d4cdd75	Merge pull request #14890 from jacquesqiao/multithread-sparse-adam adam support multithread	6 years ago
peizhilin	5e450833bd	test=develop	6 years ago
peizhilin	eea75a1d93	fix issue when type is invalid test=develop	6 years ago
peizhilin	c1235c935f	add the enable_debug flag test=develop	7 years ago
peizhilin	a6f5ceee74	add the python callstack for debug support test=develop	7 years ago
Qiao Longfei	44b300556d	change min_row_size_to_use_multithread to parameter of adam test=develop	7 years ago
Qiao Longfei	87b4eb1da4	change min_param_size_to_use_multithread to min_row_size_to_use_multithread	7 years ago
Qiao Longfei	17b1b660fc	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam test=develop	7 years ago
Xin Pan	3e8408429d	Merge pull request #15053 from panyx0718/imperative_hold refactor to avoid scope.	7 years ago
Zeng Jinle	c0bcff00dc	Merge pull request #14962 from sneaxiy/rewrite_variable_type Rewrite variable type	7 years ago
Xin Pan	b91a7a9d30	clear operator changes test=develop	7 years ago
Xin Pan	f52b514dcd	call kernel	7 years ago
Xin Pan	4e80e04f23	fix test=develop	7 years ago
Xin Pan	61491ce250	clean test=develop	7 years ago
Xin Pan	ce7e503cbe	refactor to avoid scope. test=develop	7 years ago
Qiyang Min	0238a3bb4f	Merge pull request #14972 from velconia/accelerate_lstm Accelerate PADDLE_ENFORCE	7 years ago
Qiao Longfei	3b294e2e2e	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam	7 years ago
sneaxiy	c4ce2e7b21	merge develop, solve conflict test=develop	7 years ago
peizhilin	2388d0e7d6	Revert "cherry-pick the #12759" test=develop This reverts commit `7f6d8acecb`.	7 years ago
Tao Luo	69659f4ae2	Merge pull request #15037 from jianhang-liu/fix/abnormal_stack_op_time Fix/abnormal stack op time	7 years ago
sneaxiy	dde3afe7b7	Merge develop test=develop	7 years ago
Brian Liu	e821b12f57	Fix issue which cause abnormal CPU usage in stack op Stack OP has much higher CPU cost than expected in release mode. Caused by DebugStringEx() in base class OperatorWithKernel. Actually this issue occur for each OP which hasn't implement it's own GetExpectedKernelType(). test=develop	7 years ago
peizhilin	7f6d8acecb	cherry-pick the #12759 test=develop	7 years ago
minqiyang	53619a79b4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm	7 years ago
sneaxiy	c1f7e54f62	merge develop test=develop	7 years ago
Qiao Longfei	d76bda50c4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam test=develop	7 years ago
Xin Pan	969ad966ba	all converted test=develop	7 years ago
minqiyang	679d1a9e0b	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm	7 years ago
sneaxiy	74a8e6b032	merge develop fix conflict test=develop	7 years ago
Xin Pan	1fe3ac352a	move more and fix while test=develop	7 years ago
sneaxiy	ae6f46a1a9	rewrite variable type test=develop	7 years ago
Xin Pan	9ef8a76873	convert more test=develop	7 years ago
Xin Pan	876993887b	convert more interface to avoid scope test=develop	7 years ago
Xin Pan	8c19f0bfe3	fix test=develop	7 years ago
Xin Pan	4dd61e7260	convert GetInputVarPtrs and GetOutputVarPtrs test=develop	7 years ago
Xin Pan	52d3903a12	fix test=develop	7 years ago
Xin Pan	0e0983cc1d	convert more infer shape	7 years ago
Xin Pan	62eb43ba98	convert more test=develop	7 years ago
Xin Pan	f897bd16c0	clean test=develop	7 years ago
Xin Pan	70981f5d79	clean test=develop	7 years ago
Xin Pan	fb8ae30331	fix test=develop	7 years ago
Xin Pan	eaf8ba35b5	change input test=develop	7 years ago
Xin Pan	840e6729e2	inject context test=develop	7 years ago
Xin Pan	bbff0df320	try cache variables test=develop	7 years ago
Xin Pan	52bc4ee75a	delay infer scope test=develop	7 years ago
Qiao Longfei	3bd54ed769	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam	7 years ago
Qiao Longfei	fac8702269	adam support multithread	7 years ago
minqiyang	3bb9b0cac2	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg test=develop	7 years ago
minqiyang	ad6ae0b071	1. Add SpinLock 2. Seperate the lock of kids and vars in Scope test=develop	7 years ago
Yu Yang	4ecdb6f486	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type test=develop	7 years ago
minqiyang	b75bd29c3a	Remove debug info	7 years ago


240 Commits (a6072055be98fe21c898ad4685faaedd591c6b93)