Commit Graph

59 Commits (45702951226401a24df501960d7fd9b47152083d)

Author SHA1 Message Date
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
633WHU 12e4be0382 Dlpack support (#20039)
5 years ago
Zeng Jinle 37f76407b0
fix cuda dev_ctx allocator cmake deps, test=develop (#19953)
5 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Yiqun Liu 42b5bec6f9
Integrate NVRTC to support compiling CUDA kernel at runtime (#19422)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
HaoRen 9931bc64f5 add dependecy of collective_helper (#18365)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
guru4elephant 76b49f02ee
Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop
6 years ago
gongweibao fea91164b7 Fix windows compilation error! (#16546)
6 years ago
dongdaxiang e3107a6ae0 fix windows compile problem
6 years ago
dongdaxiang dc8cf36e4b add more example on datagenerator
6 years ago
dongdaxiang cf1360643f add printer for fetch variable
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
6 years ago
chengduo 8e904d322f
Remove unnecessary dependence for profiler (#15899)
6 years ago
Tao Luo e3dd6970fc disable dam temporarily (#15860)
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
Tao Luo c797a1f050 remove legacy any.cmake
6 years ago
peizhilin 883d22093a fix the lib_any dependency
6 years ago
peizhilin 061299be87 fix dependency
6 years ago
Xin Pan 9186451f60 hide GetTensor
6 years ago
dongdaxiang ab2abfc5b2 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang 2e5ebc4594 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang 2dee8f6cd5 add TrainFilesWithTimer in async_executor
6 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
6 years ago
peizhilin 23dec78772 fix script issue
6 years ago
sneaxiy 096673f675 refactor eager deletion
6 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
6 years ago
Yu Yang 524f6e9b36 Refine code
6 years ago
sneaxiy d0b2453ecd merge develop
7 years ago
sneaxiy 24ea39c4c6 feature/eager_delete_tensor
7 years ago
dzhwinter d361624c1d
platform module (#12932)
7 years ago
tensor-tang 0d46f518ae refine avx condition and warning
7 years ago
tensor-tang a50889f523 introduce xbyak
7 years ago
dzhwinter 39ac9e39c2
float16 type support enhance (#12181)
7 years ago
minqiyang 2cc6ca43a0 Add framework_proto to device context deps
7 years ago
tensor-tang 2e418a5227 fix conflicts
7 years ago
tensor-tang 3df99e72ab Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
7 years ago
dzhwinter 4ed0b62476
Move fluid::framework::InitDevices into fluid::platform (#11757)
7 years ago
tensor-tang e3a96300bb move SetNumThreads to platform
7 years ago
dzhwinter 0e4467eee4
"fix compile" (#10657)
7 years ago
yuyang18 dc6ce071d4 Polish cmake
7 years ago
gongweibao 6171705a2c Potential bug in paddle/fluid/platform/CMakeLists.txt (#9723)
7 years ago
Yi Wang 67ba884d2a Update CMakeLists
7 years ago