Commit Graph

216 Commits (79fa8fb0df524cc5efbe5cd7a91acac7b721e5cf)

Author SHA1 Message Date
wanghuancoder 35c5b23f68
use iwyu clean include second time, test=develop (#30829)
4 years ago
QingshuChen 5c8455d6ea
try again if kunlun memory malloc failed (#30855)
4 years ago
石晓伟 2ac4143b6c
support xpu with analysis predictor, test=develop (#30832)
4 years ago
Qi Li 69875dc42c
[ROCM] update fluid memory for rocm35 (part1), test=develop (#30758)
4 years ago
Leo Chen 81217a94d8
unify calling cudaSetDevice (#30470)
4 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
4 years ago
QingshuChen cf786d22ec
fix bug that cann't find mkldnn(kunlun) (#30394)
4 years ago
QingshuChen 2c1bba02e4
optimize memcpy perf for kunlun (#30291)
4 years ago
石晓伟 efa54629fb
fix header file paths of gflags, commit 3, test=develop (#30273)
4 years ago
Chen Weihang a6072055be
[Complex] Handle complex to real after type promotion (#29855)
4 years ago
LoveAn b5d4a1f33d
Add the strategy of skipping cc/cu test compilation and execution in CI (#29499)
4 years ago
Wojciech Uss 6673fb0565
change import math.h to cmath (#29260)
4 years ago
gongweibao 1dad8ceaab
Fix gpu memory allocation bug. (#28703)
4 years ago
Wilber ced5c40c41
Update memory release interface. (#28456)
4 years ago
Wilber 05114693cf
[Inference] Memory modification for ShrinkMemory. (#28355)
4 years ago
Chen Weihang 2babd6ff67
Add compile limit for PADDLE_ENFORCE without error message (#28221)
4 years ago
Leo Chen 1f3be85914
Fix bug of fetch_async_op_handle when fetching the feed variable (#28194)
4 years ago
QingshuChen 6b727e08b1
support elementwise add, activation, matmul on Baidu Kunlun (#27143)
4 years ago
Leo Chen 6bb02e8e3c
increase retry time (#27553)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
Leo Chen 4c8ea492cd
use shared dev_ctx (#27313)
4 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
Leo Chen 417b243968
fix best_fit_allocator_test on windows, test=develop (#25650)
5 years ago
Leo Chen ff5be2fb77
Refine error message in memory folder (#25095)
5 years ago
tianshuo78520a 1eb9ee242b
delete buddy_allocator_test_data to make repo clean (#25046)
5 years ago
liuwei1031 8603b5fb72
fix randomly hang issue of PaddleDetection training task on windows (#24977)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Huihuang Zheng f831f61216
Make buddy_allocator_test Exclusive (#24636)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Huihuang Zheng 3047b69f94
Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure (#24205)
5 years ago
石晓伟 d2584a7082
New feature: thread local allocator, test=develop (#23989)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
5 years ago
Chen Weihang 7d8d573453
Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541)
5 years ago
Yiqun Liu 44b45b9f07
Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
5 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
Zeng Jinle 1b76e789cf
remove cuda allocator ctor, test=develop (#22212)
5 years ago
Zeng Jinle c3bcd3c1e2
fix dygraph non zero gpu bug, test=develop (#22165)
5 years ago
Zeng Jinle d9f5d1eb29
ag allocator by default, test=develop (#21837)
5 years ago
Zeng Jinle aa4d6a5d6c
Add some debug flags to auto growth allocator (#21766)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
5 years ago
Chen Weihang b3a3e6f60c change cuda enforce & add example (#21142)
5 years ago
Chen Weihang 27fa9c100b
add examples for resource exhausted error, test=develop (#21140)
5 years ago
Zeng Jinle a710ccc0cb
refine error message of allocator again, test=develop (#21023)
5 years ago
Zeng Jinle f56967c483
refine error message of gpu allocator, test=develop (#21008)
5 years ago
Zeng Jinle c51722c820
refine err msg of allocator, test=develop (#20879)
5 years ago
Zeng Jinle bb8d778358
lazy init of allocators, test=develop (#20854)
5 years ago