Paddle/paddle/fluid/platform
Leo Chen 81138239db
[feature] support npu allocator (#30840)
4 years ago
details
dynload [feature] support npu allocator (#30840) 4 years ago
stream use iwyu clean include (#27267) 5 years ago
CMakeLists.txt [feature] support npu allocator (#30840) 4 years ago
ascend_npu_info.cc Add distribution supported (#30578) 4 years ago
ascend_npu_info.h Add distribution supported (#30578) 4 years ago
bfloat16.h use iwyu clean include (#27267) 5 years ago
bfloat16_test.cc use iwyu clean include (#27267) 5 years ago
bkcl_helper.h [Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574) 4 years ago
collective_helper.cc Support dynamic graph distributed (#28997) 5 years ago
collective_helper.h Support dynamic graph distributed (#28997) 5 years ago
complex64.h Make transpose, trace, kron, reshape, sum op support complex type (#29321) 5 years ago
complex128.h Support type promote for basic math ops (quantum required) (#29265) 5 years ago
cpu_helper.cc Paddle support compile on sw (#27858) 5 years ago
cpu_helper.h
cpu_helper_test.cc
cpu_info.cc Support mips arch (#29903) 4 years ago
cpu_info.h Support mips arch (#29903) 4 years ago
cpu_info_test.cc
cuda_device_function.h add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 5 years ago
cuda_device_guard.cc
cuda_device_guard.h
cuda_error.proto Optimize the error messages of paddle CUDA API (#23816) 5 years ago
cuda_helper.h Add Retry Logic to CublasHandlerHolder 4 years ago
cuda_helper_test.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
cuda_primitives.h [Complex] Add support for complex grad accumulated (#29889) 4 years ago
cuda_profiler.h Polish some lost invalid error message (#27445) 5 years ago
cuda_resource_pool.cc refine PADDLE_ENFORCE (#25456) 5 years ago
cuda_resource_pool.h use iwyu clean include (#27267) 5 years ago
cudnn_desc.h Add tf32 switch for cuDNN (#29192) 4 years ago
cudnn_desc_test.cc polish cudnn related code and fix bug. (#15164) 6 years ago
cudnn_helper.h fix rnn_op bug in cudnn_version>= 8 (#29406) 5 years ago
cudnn_helper_test.cc
cudnn_workspace_helper.cc make_conv_workspace_size_configurable, test=develop (#20662) 6 years ago
cudnn_workspace_helper.h make_conv_workspace_size_configurable, test=develop (#20662) 6 years ago
denormal.cc compile the denormal.cc on aarch64, test=develop (#29956) 4 years ago
denormal.h flush denormals to zero, test=develop (#29924) 4 years ago
device_code.cc use iwyu clean include (#27267) 5 years ago
device_code.h use iwyu clean include (#27267) 5 years ago
device_code_test.cc Fix gpu memory allocation bug. (#28703) 5 years ago
device_context.cc [feature] support npu allocator (#30840) 4 years ago
device_context.h [feature] support npu allocator (#30840) 4 years ago
device_context_test.cu Revert "Revert "Remove op handle lock"" 6 years ago
device_context_xpu_test.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
device_memory_aligment.cc fix PADDLE_ENFORCE (#25297) 5 years ago
device_memory_aligment.h use iwyu clean include (#27267) 5 years ago
device_tracer.cc Paddle support compile on sw (#27858) 5 years ago
device_tracer.h use iwyu clean include (#27267) 5 years ago
enforce.cc
enforce.h [feature] support npu allocator (#30840) 4 years ago
enforce_test.cc Refine PADDLE_ENFORCE (#25369) 5 years ago
error_codes.proto Enrich the type of error and declare the error type interfaces (#21024) 6 years ago
errors.cc Enrich the python error types of paddle & polish format (#28124) 5 years ago
errors.h Enrich the python error types of paddle & polish format (#28124) 5 years ago
errors_test.cc Hide the C++ stack by default and add hints (#29042) 5 years ago
event.h Add pe profiler Event (#24611) 5 years ago
flags.cc [feature] support npu allocator (#30840) 4 years ago
float16.h Add ROCm platform support code (#29342) 4 years ago
float16_test.cc use iwyu clean include (#27267) 5 years ago
float16_test.cu Refine PADDLE_ENFORCE (#25369) 5 years ago
for_range.h
gloo_context.cc Initialize gloo for low level collective apis (#27672) 5 years ago
gloo_context.h Initialize gloo for low level collective apis (#27672) 5 years ago
gpu_info.cc [feature] support npu allocator (#30840) 4 years ago
gpu_info.h [API2.0] add op for cudnn version query test=develop (#26180) 5 years ago
gpu_launch_config.h Enable bilateral_slice unittest on windows platform (#29896) 4 years ago
hostdevice.h Add ROCm platform support code (#29342) 4 years ago
init.cc [feature] support npu allocator (#30840) 4 years ago
init.h Fix gpu memory allocation bug. (#28703) 5 years ago
init_test.cc Fix gpu memory allocation bug. (#28703) 5 years ago
lock_guard_ptr.h
lodtensor_printer.cc use iwyu clean include (#27267) 5 years ago
lodtensor_printer.h use iwyu clean include (#27267) 5 years ago
lodtensor_printer_test.cc use iwyu clean include (#27267) 5 years ago
macros.h add musl option (#27798) 5 years ago
mkldnn_helper.h Added missing format of oneDNN (#29670) 4 years ago
mkldnn_reuse.h [oneDNN] Added UT for testing elementwise_mul caching (#30203) 4 years ago
monitor.cc [feature] support npu allocator (#30840) 4 years ago
monitor.h [feature] support npu allocator (#30840) 4 years ago
nccl_helper.h Retry CUDA Initialization to Fix Random Failure, test=develop (#28323) 5 years ago
npu_info.cc [feature] support npu allocator (#30840) 4 years ago
npu_info.h [feature] support npu allocator (#30840) 4 years ago
place.cc [feature] support npu allocator (#30840) 4 years ago
place.h [feature] support npu allocator (#30840) 4 years ago
place_test.cc use iwyu clean include (#27267) 5 years ago
port.h call_statck is turned on default when ON_INFER=ON (#29798) 4 years ago
profiler.cc use iwyu clean include (#27267) 5 years ago
profiler.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
profiler.h use iwyu clean include (#27267) 5 years ago
profiler.proto Add memory profiler (#16137) 6 years ago
profiler_helper.h added internal and external reorders to profiler (#29443) 5 years ago
profiler_test.cc use iwyu clean include (#27267) 5 years ago
resource_pool.h refine PADDLE_ENFORCE (#25456) 5 years ago
stream_callback_manager.cc fix PADDLE_ENFORCE (#25297) 5 years ago
stream_callback_manager.h
test_limit_gpu_memory.cu Add flags to limit gpu memory (#22793) 5 years ago
timer.cc Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer 6 years ago
timer.h use iwyu clean include (#27267) 5 years ago
timer_test.cc Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer 6 years ago
transform.h fix PADDLE_ENFORCE (#25297) 5 years ago
transform_test.cu
variant.h
xpu_header.h add xpu ops for training transformer in kunlun (#29539) 4 years ago
xpu_info.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
xpu_info.h support Baidu Kunlun AI Accelerator (#25959) 5 years ago