You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/platform
Jacek Czaja abfa822650
[oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757)
4 years ago
..
details [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
dynload use iwyu clean include second time, test=develop (#30829) 4 years ago
stream [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
CMakeLists.txt support xpu with analysis predictor, test=develop (#30832) 4 years ago
bfloat16.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
bfloat16_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
bkcl_helper.h [Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574) 4 years ago
collective_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
collective_helper.h use iwyu clean include second time, test=develop (#30829) 4 years ago
complex64.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
complex128.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
cpu_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cpu_helper.h
cpu_helper_test.cc
cpu_info.cc Add bf16 fast performance verification (#30551) 4 years ago
cpu_info.h Add bf16 fast performance verification (#30551) 4 years ago
cpu_info_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cuda_device_function.h add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
cuda_device_guard.cc
cuda_device_guard.h
cuda_error.proto Optimize the error messages of paddle CUDA API (#23816) 5 years ago
cuda_helper.h Add Retry Logic to CublasHandlerHolder 4 years ago
cuda_helper_test.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
cuda_primitives.h [Complex] Add support for complex grad accumulated (#29889) 4 years ago
cuda_profiler.h use nvtx push pop in timeline (#30567) 4 years ago
cuda_resource_pool.cc refine PADDLE_ENFORCE (#25456) 5 years ago
cuda_resource_pool.h use iwyu clean include (#27267) 4 years ago
cudnn_desc.h Add tf32 switch for cuDNN (#29192) 4 years ago
cudnn_desc_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_helper.h fix rnn_op bug in cudnn_version>= 8 (#29406) 4 years ago
cudnn_helper_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_workspace_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
cudnn_workspace_helper.h make_conv_workspace_size_configurable, test=develop (#20662) 5 years ago
denormal.cc update. (#30585) 4 years ago
denormal.h flush denormals to zero, test=develop (#29924) 4 years ago
device_code.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
device_code.h use iwyu clean include (#27267) 4 years ago
device_code_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
device_context.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
device_context.h [oneDNN] Cache oneDNN stream not to recreate in each oneDNN op (#30358) 4 years ago
device_context_test.cu Revert "Revert "Remove op handle lock"" 6 years ago
device_context_xpu_test.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
device_memory_aligment.cc fix PADDLE_ENFORCE (#25297) 5 years ago
device_memory_aligment.h use iwyu clean include (#27267) 4 years ago
device_tracer.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
device_tracer.h use iwyu clean include (#27267) 4 years ago
enforce.cc
enforce.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
enforce_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
error_codes.proto Enrich the type of error and declare the error type interfaces (#21024) 5 years ago
errors.cc Enrich the python error types of paddle & polish format (#28124) 4 years ago
errors.h Enrich the python error types of paddle & polish format (#28124) 4 years ago
errors_test.cc Hide the C++ stack by default and add hints (#29042) 4 years ago
event.h Add pe profiler Event (#24611) 5 years ago
flags.cc call_statck is turned on default when ON_INFER=ON (#29798) 4 years ago
float16.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
float16_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
float16_test.cu Refine PADDLE_ENFORCE (#25369) 5 years ago
for_range.h
gen_comm_id_helper.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
gen_comm_id_helper.h 【kunlun】dygraph supports multi xpu card training (#30671) 4 years ago
gloo_context.cc Initialize gloo for low level collective apis (#27672) 4 years ago
gloo_context.h Initialize gloo for low level collective apis (#27672) 4 years ago
gpu_info.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
gpu_info.h [API2.0] add op for cudnn version query test=develop (#26180) 5 years ago
gpu_launch_config.h Enable bilateral_slice unittest on windows platform (#29896) 4 years ago
hostdevice.h 添加rocm平台支持代码 (#29342) 4 years ago
init.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
init.h Fix gpu memory allocation bug. (#28703) 4 years ago
init_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
lock_guard_ptr.h
lodtensor_printer.cc use iwyu clean include (#27267) 4 years ago
lodtensor_printer.h use iwyu clean include (#27267) 4 years ago
lodtensor_printer_test.cc use iwyu clean include (#27267) 4 years ago
macros.h add musl option (#27798) 4 years ago
mkldnn_helper.h [oneDNN] Cache oneDNN stream not to recreate in each oneDNN op (#30358) 4 years ago
mkldnn_reuse.h [oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757) 4 years ago
monitor.cc use iwyu clean include (#27267) 4 years ago
monitor.h use iwyu clean include (#27267) 4 years ago
nccl_helper.h [Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455) 4 years ago
place.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
place.h support Baidu Kunlun AI Accelerator (#25959) 5 years ago
place_test.cc use iwyu clean include (#27267) 4 years ago
port.h call_statck is turned on default when ON_INFER=ON (#29798) 4 years ago
profiler.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
profiler.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
profiler.h use nvtx push pop in timeline (#30567) 4 years ago
profiler.proto Add memory profiler (#16137) 6 years ago
profiler_helper.h use nvtx push pop in timeline (#30567) 4 years ago
profiler_test.cc use iwyu clean include (#27267) 4 years ago
resource_pool.h refine PADDLE_ENFORCE (#25456) 5 years ago
stream_callback_manager.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
stream_callback_manager.h fix bug 6 years ago
test_limit_gpu_memory.cu Add flags to limit gpu memory (#22793) 5 years ago
timer.cc Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer 6 years ago
timer.h use iwyu clean include (#27267) 4 years ago
timer_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
transform.h fix PADDLE_ENFORCE (#25297) 5 years ago
transform_test.cu
type_defs.h [ROCM] update fluid platform for rocm35 (part1), test=develop (#30639) 4 years ago
variant.h
xpu_header.h add xpu ops for training transformer in kunlun (#29539) 4 years ago
xpu_info.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
xpu_info.h 【kunlun】dygraph supports multi xpu card training (#30671) 4 years ago