Commit Graph

83 Commits (15297a065ce7e87ffef1a9c4d692361000ab27af)

Author SHA1 Message Date
QingshuChen 15297a065c
fix depends of kunlun bkcl (#30945)
4 years ago
石晓伟 2ac4143b6c
support xpu with analysis predictor, test=develop (#30832)
4 years ago
wanghuancoder 90773473a0
use nvtx push pop in timeline (#30567)
4 years ago
WangXi 572c466d19
[Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455)
4 years ago
石晓伟 181ea1870b
flush denormals to zero, test=develop (#29924)
4 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
wangchaochaohu 0b81d76310
[API2.0] add op for cudnn version query test=develop (#26180)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago
Chen Weihang 7be285a66f
remove useless property, test=develop (#25461)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Tao Luo 2996315fc9
fix profiler_test on win32 (#25073)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
石晓伟 2d01cc85c4
DeviceContext Split, test=develop (#23737)
5 years ago
石晓伟 5c59d2139e
reverts the commit 23177, test=develop (#23363)
5 years ago
石晓伟 75ebb48a91
supports thread-binding stream, test=develop (#23177)
5 years ago
Zeng Jinle bba740710d
add cuda resource pool for BufferedReader, test=develop (#23152)
5 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
5 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
5 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
633WHU 12e4be0382 Dlpack support (#20039)
5 years ago
Zeng Jinle 37f76407b0
fix cuda dev_ctx allocator cmake deps, test=develop (#19953)
5 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Yiqun Liu 42b5bec6f9
Integrate NVRTC to support compiling CUDA kernel at runtime (#19422)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
HaoRen 9931bc64f5 add dependecy of collective_helper (#18365)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
guru4elephant 76b49f02ee
Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
dongdaxiang e3107a6ae0 fix windows compile problem
6 years ago
dongdaxiang dc8cf36e4b add more example on datagenerator
6 years ago
dongdaxiang cf1360643f add printer for fetch variable
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
6 years ago
chengduo 8e904d322f
Remove unnecessary dependence for profiler (#15899)
6 years ago
Tao Luo e3dd6970fc disable dam temporarily (#15860)
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Tao Luo c797a1f050 remove legacy any.cmake
6 years ago
peizhilin 883d22093a fix the lib_any dependency
6 years ago
peizhilin 061299be87 fix dependency
6 years ago