Paddle/paddle/fluid/platform
tianshuo78520a 8829a309fe
Delete cudnn6 code (#31835)
4 years ago
details [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
dynload [ROCM] fix test_rnn_op (#31735) 4 years ago
stream [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
CMakeLists.txt [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
bfloat16.h add cmath header for bfloat (#31792) 4 years ago
bfloat16_test.cc [CustomOp] Remove the dependence of the underlying data types on eigen (#31602) 4 years ago
bkcl_helper.h [Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574) 4 years ago
collective_helper.cc Add collective async wait op (#31463) 4 years ago
collective_helper.h Add collective async wait op (#31463) 4 years ago
complex64.h [CustomOp] Remove the dependence of the underlying data types on eigen (#31602) 4 years ago
complex128.h [CustomOp] Remove the dependence of the underlying data types on eigen (#31602) 4 years ago
cpu_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cpu_helper.h move SetNumThreads to platform 7 years ago
cpu_helper_test.cc move SetNumThreads to platform 7 years ago
cpu_info.cc Add bf16 fast performance verification (#30551) 4 years ago
cpu_info.h Add bf16 fast performance verification (#30551) 4 years ago
cpu_info_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cuda_device_function.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_device_guard.cc Refine code 6 years ago
cuda_device_guard.h Refine code 6 years ago
cuda_error.proto Optimize the error messages of paddle CUDA API (#23816) 5 years ago
cuda_helper.h [ROCM] fix gather_op, sigmoid_cross_entropy_with_logits_op, test=develop (#31467) 4 years ago
cuda_helper_test.cu [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_primitives.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_profiler.h use nvtx push pop in timeline (#30567) 4 years ago
cuda_resource_pool.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_resource_pool.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cudnn_desc.h Add tf32 switch for cuDNN (#29192) 4 years ago
cudnn_desc_test.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cudnn_helper.h Delete cudnn6 code (#31835) 4 years ago
cudnn_helper_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_workspace_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_workspace_helper.h make_conv_workspace_size_configurable, test=develop (#20662) 5 years ago
denormal.cc update. (#30585) 4 years ago
denormal.h flush denormals to zero, test=develop (#29924) 4 years ago
device_code.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
device_code.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
device_code_test.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
device_context.cc [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
device_context.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
device_context_test.cu [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
device_context_xpu_test.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
device_memory_aligment.cc xpu support fuse allreduce (#31104) 4 years ago
device_memory_aligment.h [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
device_tracer.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
device_tracer.h use iwyu clean include (#27267) 4 years ago
eigen_ext.h [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
enforce.cc Fix the grammar in copyright. (#8403) 7 years ago
enforce.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
enforce_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
error_codes.proto Enrich the type of error and declare the error type interfaces (#21024) 5 years ago
errors.cc Enrich the python error types of paddle & polish format (#28124) 4 years ago
errors.h Enrich the python error types of paddle & polish format (#28124) 4 years ago
errors_test.cc Hide the C++ stack by default and add hints (#29042) 4 years ago
event.h [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
flags.cc [ROCM] added a cudnn switch of conv2d for rocm platform (#31836) 4 years ago
float16.h [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
float16_test.cc [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
float16_test.cu [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
for_range.h [ROCM] fix softmax with loss and update python scripts, test=develop (#31373) 4 years ago
gen_comm_id_helper.cc c_gen_nccl_id add SocketServer to persit server (#31589) 4 years ago
gen_comm_id_helper.h c_gen_nccl_id add SocketServer to persit server (#31589) 4 years ago
gloo_context.cc Initialize gloo for low level collective apis (#27672) 4 years ago
gloo_context.h Initialize gloo for low level collective apis (#27672) 4 years ago
gpu_info.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
gpu_info.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
gpu_launch_config.h [ROCM] fix reduce op, test=develop (#31478) 4 years ago
hostdevice.h Add ROCm platform support code (#29342) 4 years ago
init.cc [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
init.h Fix gpu memory allocation bug. (#28703) 4 years ago
init_test.cc [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
lock_guard_ptr.h Clean LockGuardPtr 6 years ago
lodtensor_printer.cc use iwyu clean include (#27267) 4 years ago
lodtensor_printer.h use iwyu clean include (#27267) 4 years ago
lodtensor_printer_test.cc use iwyu clean include (#27267) 4 years ago
macros.h add musl option (#27798) 4 years ago
miopen_desc.h [ROCM] fix conv2d and conv3d op, test=develop (#31553) 4 years ago
miopen_helper.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
miopen_helper_test.cc [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
mkldnn_helper.h Add Conv Transpose BF16 (#30877) 4 years ago
mkldnn_reuse.h [oneDNN] Added Elementwise Mul grad fp32/bf16 (#31647) 4 years ago
monitor.cc use iwyu clean include (#27267) 4 years ago
monitor.h use iwyu clean include (#27267) 4 years ago
nccl_helper.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
place.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
place.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
place_test.cc use iwyu clean include (#27267) 4 years ago
port.h call_statck is turned on default when ON_INFER=ON (#29798) 4 years ago
profiler.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler.cu [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler.proto Add memory profiler (#16137) 6 years ago
profiler_helper.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler_test.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
resource_pool.h refine PADDLE_ENFORCE (#25456) 5 years ago
stream_callback_manager.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
stream_callback_manager.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
test_limit_gpu_memory.cu [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
timer.cc Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer 6 years ago
timer.h use iwyu clean include (#27267) 4 years ago
timer_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
transform.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
transform_test.cu Fix bug in uts 6 years ago
type_defs.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
variant.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
xpu_header.h add xpu ops for training transformer in kunlun (#29539) 4 years ago
xpu_info.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
xpu_info.h 【kunlun】dygraph supports multi xpu card training (#30671) 4 years ago