You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/platform
ronnywang 123949eb48
[ROCM] added a cudnn switch of conv2d for rocm platform (#31836)
4 years ago
..
details [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
dynload [ROCM] fix test_rnn_op (#31735) 4 years ago
stream [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
CMakeLists.txt [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
bfloat16.h add cmath header for bfloat (#31792) 4 years ago
bfloat16_test.cc [CustomOp] Remove the dependence of the underlying data types on eigen (#31602) 4 years ago
bkcl_helper.h [Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574) 4 years ago
collective_helper.cc Add collective async wait op (#31463) 4 years ago
collective_helper.h Add collective async wait op (#31463) 4 years ago
complex64.h [CustomOp] Remove the dependence of the underlying data types on eigen (#31602) 4 years ago
complex128.h [CustomOp] Remove the dependence of the underlying data types on eigen (#31602) 4 years ago
cpu_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cpu_helper.h
cpu_helper_test.cc
cpu_info.cc Add bf16 fast performance verification (#30551) 4 years ago
cpu_info.h Add bf16 fast performance verification (#30551) 4 years ago
cpu_info_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cuda_device_function.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_device_guard.cc
cuda_device_guard.h
cuda_error.proto
cuda_helper.h [ROCM] fix gather_op, sigmoid_cross_entropy_with_logits_op, test=develop (#31467) 4 years ago
cuda_helper_test.cu [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_primitives.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_profiler.h use nvtx push pop in timeline (#30567) 4 years ago
cuda_resource_pool.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cuda_resource_pool.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cudnn_desc.h Add tf32 switch for cuDNN (#29192) 4 years ago
cudnn_desc_test.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
cudnn_helper.h fix rnn_op bug in cudnn_version>= 8 (#29406) 4 years ago
cudnn_helper_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_workspace_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_workspace_helper.h
denormal.cc update. (#30585) 4 years ago
denormal.h flush denormals to zero, test=develop (#29924) 4 years ago
device_code.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
device_code.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
device_code_test.cc [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
device_context.cc [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
device_context.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
device_context_test.cu [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
device_context_xpu_test.cc
device_memory_aligment.cc xpu support fuse allreduce (#31104) 4 years ago
device_memory_aligment.h [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
device_tracer.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
device_tracer.h
eigen_ext.h [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
enforce.cc
enforce.h [ROCM] update fluid platform for rocm39 (part3), test=develop (#30913) 4 years ago
enforce_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
error_codes.proto
errors.cc
errors.h
errors_test.cc
event.h [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
flags.cc [ROCM] added a cudnn switch of conv2d for rocm platform (#31836) 4 years ago
float16.h [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
float16_test.cc [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
float16_test.cu [CustomOp] Remove Eigen dependencies of float16 (#31669) 4 years ago
for_range.h [ROCM] fix softmax with loss and update python scripts, test=develop (#31373) 4 years ago
gen_comm_id_helper.cc c_gen_nccl_id add SocketServer to persit server (#31589) 4 years ago
gen_comm_id_helper.h c_gen_nccl_id add SocketServer to persit server (#31589) 4 years ago
gloo_context.cc
gloo_context.h
gpu_info.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
gpu_info.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
gpu_launch_config.h [ROCM] fix reduce op, test=develop (#31478) 4 years ago
hostdevice.h 添加rocm平台支持代码 (#29342) 4 years ago
init.cc [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
init.h
init_test.cc [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
lock_guard_ptr.h
lodtensor_printer.cc
lodtensor_printer.h
lodtensor_printer_test.cc
macros.h
miopen_desc.h [ROCM] fix conv2d and conv3d op, test=develop (#31553) 4 years ago
miopen_helper.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
miopen_helper_test.cc [ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) 4 years ago
mkldnn_helper.h Add Conv Transpose BF16 (#30877) 4 years ago
mkldnn_reuse.h [oneDNN] Added Elementwise Mul grad fp32/bf16 (#31647) 4 years ago
monitor.cc
monitor.h
nccl_helper.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
place.cc
place.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
place_test.cc
port.h call_statck is turned on default when ON_INFER=ON (#29798) 4 years ago
profiler.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler.cu [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler.proto
profiler_helper.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
profiler_test.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
resource_pool.h
stream_callback_manager.cc [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
stream_callback_manager.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
test_limit_gpu_memory.cu [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
timer.cc
timer.h
timer_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
transform.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
transform_test.cu
type_defs.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
variant.h [ROCM] update fluid platform for rocm39 (part2), test=develop (#30774) 4 years ago
xpu_header.h add xpu ops for training transformer in kunlun (#29539) 4 years ago
xpu_info.cc
xpu_info.h 【kunlun】dygraph supports multi xpu card training (#30671) 4 years ago