Paddle

Commit Graph

Author	SHA1	Message	Date
wanghuancoder	35c5b23f68	use iwyu clean include second time, test=develop (#30829 ) * use iwyu clean include second time, test=develop	4 years ago
QingshuChen	5c8455d6ea	try again if kunlun memory malloc failed (#30855 ) * try again if kunlun memory malloc failed * minor	4 years ago
石晓伟	2ac4143b6c	support xpu with analysis predictor, test=develop (#30832 ) * support xpu inference with analysis predictor, test=develop * merge the cmake of the xpu toolchain, test=develop * add c-apis, test=develop * fix a bug in extern_xpu, test=develop	4 years ago
Qi Li	69875dc42c	[ROCM] update fluid memory for rocm35 (part1), test=develop (#30758 )	4 years ago
Leo Chen	81217a94d8	unify calling cudaSetDevice (#30470 ) * unify calling cudaSetDevice * fix compile	4 years ago
石晓伟	715d862868	export global google flags to users, test=develop (#30448 )	4 years ago
QingshuChen	cf786d22ec	fix bug that cann't find mkldnn(kunlun) (#30394 )	4 years ago
QingshuChen	2c1bba02e4	optimize memcpy perf for kunlun (#30291 ) * optimize memcpy perf for kunlun * remove useless unitest for kunlun mean * minor	4 years ago
石晓伟	efa54629fb	fix header file paths of gflags, commit 3, test=develop (#30273 )	4 years ago
Chen Weihang	a6072055be	[Complex] Handle complex to real after type promotion (#29855 ) * try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments	4 years ago
LoveAn	b5d4a1f33d	Add the strategy of skipping cc/cu test compilation and execution in CI (#29499 ) * Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop * fix if error with CI_SKIP_TEST, test=develop * fix add properties to test error on Linux/MAC, test=develop * fix set test properties of test_code_generator error, test=develop * remove test codes and advance judgment of file modification on Linux, test=develop * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix * Add branch judgement on Linux, test=develop	4 years ago
Wojciech Uss	6673fb0565	change import math.h to cmath (#29260 )	4 years ago
gongweibao	1dad8ceaab	Fix gpu memory allocation bug. (#28703 )	4 years ago
Wilber	ced5c40c41	Update memory release interface. (#28456 )	4 years ago
Wilber	05114693cf	[Inference] Memory modification for ShrinkMemory. (#28355 )	4 years ago
Chen Weihang	2babd6ff67	Add compile limit for PADDLE_ENFORCE without error message (#28221 ) * add compile limit for paddle enforce * polish elementwise_op_function.cu.h * fix failed unittest * fix windows compile failed * detail polish * revert no type constructor	4 years ago
Leo Chen	1f3be85914	Fix bug of fetch_async_op_handle when fetching the feed variable (#28194 ) * fix bug of fetch_async_op_handle * revert some changes of test_buffer_shared_memory_reuse_pass * revert some changes of test_buffer_shared_memory_reuse_pass	4 years ago
QingshuChen	6b727e08b1	support elementwise add, activation, matmul on Baidu Kunlun (#27143 ) * support elementwise add, activation, matmul on Baidu Kunlun * test=kunlun * minor * test=kunlun * reconstuct the xpu directory * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun	4 years ago
Leo Chen	6bb02e8e3c	increase retry time (#27553 )	4 years ago
wanghuancoder	df43905f12	use iwyu clean include (#27267 ) * use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop	4 years ago
Leo Chen	4c8ea492cd	use shared dev_ctx (#27313 )	4 years ago
QingshuChen	138ecf24aa	support Baidu Kunlun AI Accelerator (#25959 ) * support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun	5 years ago
joanna.wozna.intel	e5bbffa84c	Add NOMINMAX define due to windows.h max/min macro conflict (#25637 ) test=develop	5 years ago
Leo Chen	417b243968	fix best_fit_allocator_test on windows, test=develop (#25650 ) * fix best_fit_allocator_test on windows, test=develop * enable best_fit_allocator_test and test_math_op_patch_var_base, test=develop	5 years ago
Leo Chen	ff5be2fb77	Refine error message in memory folder (#25095 ) * refine PADDLE_THROW, test=develop * refine error msg, test=develop * refine cuda error, test=develop * follow comments, test=develop * fix compile problem, test=develop * fix bug, test=develop	5 years ago
tianshuo78520a	1eb9ee242b	delete buddy_allocator_test_data to make repo clean (#25046 )	5 years ago
liuwei1031	8603b5fb72	fix randomly hang issue of PaddleDetection training task on windows (#24977 )	5 years ago
Zhou Wei	4058e736ff	temporarily disable these unittests failed on windows (#24942 )	5 years ago
Chen Weihang	d1062d5278	Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759 ) * remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibit LOG(FATAL) using, test=develop * remove ci test case, test=develop * replace all LOG(FATAL) & polish message, test=develop * fix typo, test=develop * polish error info detail, test=develop	5 years ago
Huihuang Zheng	f831f61216	Make buddy_allocator_test Exclusive (#24636 ) The background is that buddy_allocator_test depends on some big GPU allocations and current machine may not have it. It can cause random failure on cloud machine. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.	5 years ago
Chen Weihang	aa0f254fbe	Add macro BOOST_GET to enrich the error information of boost :: get (#24175 ) * add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop	5 years ago
Huihuang Zheng	3047b69f94	Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure (#24205 ) test=develop	5 years ago
石晓伟	d2584a7082	New feature: thread local allocator, test=develop (#23989 ) * add the thread_local_allocator, test=develop * refactor the thread_local_allocator, test=develop * provides option setting strategy, test=develop	5 years ago
Zhou Wei	7817003795	Optimize the error messages of paddle CUDA API (#23816 ) * Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop	5 years ago
Zeng Jinle	d41d802ba3	Add flags to limit gpu memory (#22793 ) * add recorded cuda memory apis, fix typo, test=develop * add more ut, test=develop * follow comments, test=develop * fix py35 incompatible issues, test=develop	5 years ago
Chen Weihang	7d8d573453	Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541 ) * add lodtensor share memory & serialization, test=develop * fix windows compile error, test=develop * deal vartype pickle & fix unittest matching error message, test=develop * update timeout variable name, test=develop * refactor memory map implement, test=develop * clear mmap file discripter when exit unexpectedly, test=develop * remove the child process fd in advance, test=develop * remove mmap fds after Queue.put in child process, test=develop * add hard unittests for register exit func, test=develop * fix python2 compatibility problem in unittest, test=develop * fix exception unittest error, test=develop * polish code based review comment, test=develop	5 years ago
Yiqun Liu	44b45b9f07	Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456 ) * Add log in memory::Copy for debug purpose. * Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one. * Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one. test=develop * Change the type of second_dim from size_t to int64_t. test=develop	5 years ago
zhouwei25	549e6de7ac	faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164 )	5 years ago
Zeng Jinle	1b76e789cf	remove cuda allocator ctor, test=develop (#22212 )	5 years ago
Zeng Jinle	c3bcd3c1e2	fix dygraph non zero gpu bug, test=develop (#22165 )	5 years ago
Zeng Jinle	d9f5d1eb29	ag allocator by default, test=develop (#21837 )	5 years ago
Zeng Jinle	aa4d6a5d6c	Add some debug flags to auto growth allocator (#21766 ) * add some debug flags to auto growth allocator, test=develop * add comments about auto growth, test=develop	5 years ago
Tao Luo	01fa4ead61	fix -Wno-error=sign-compare warning in gcc8 (#21434 ) * fix -Wno-error=sign-compare warning in gcc8 test=develop * fix warning in distributed codes test=develop	5 years ago
Zeng Jinle	09696d5df8	Use system allocator in OpTest (#21335 ) * use system allocator in unittests, test=develop * fix op bugs, test=develop * fix tensor copy bug when src and dst are the same, test=develop	5 years ago
Chen Weihang	b3a3e6f60c	change cuda enforce & add example (#21142 )	5 years ago
Chen Weihang	27fa9c100b	add examples for resource exhausted error, test=develop (#21140 )	5 years ago
Zeng Jinle	a710ccc0cb	refine error message of allocator again, test=develop (#21023 )	5 years ago
Zeng Jinle	f56967c483	refine error message of gpu allocator, test=develop (#21008 )	5 years ago
Zeng Jinle	c51722c820	refine err msg of allocator, test=develop (#20879 )	5 years ago
Zeng Jinle	bb8d778358	lazy init of allocators, test=develop (#20854 )	5 years ago


216 Commits (79fa8fb0df524cc5efbe5cd7a91acac7b721e5cf)