Paddle

History

Jacek Czaja abfa822650 [oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757 )		4 years ago
..
details	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
dynload	use iwyu clean include second time, test=develop (#30829 )	4 years ago
stream	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
CMakeLists.txt	support xpu with analysis predictor, test=develop (#30832 )	4 years ago
bfloat16.h	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
bfloat16_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
bkcl_helper.h	[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574 )	4 years ago
collective_helper.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
collective_helper.h	use iwyu clean include second time, test=develop (#30829 )	4 years ago
complex64.h	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
complex128.h	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
cpu_helper.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
cpu_helper.h	…
cpu_helper_test.cc	…
cpu_info.cc	Add bf16 fast performance verification (#30551 )	4 years ago
cpu_info.h	Add bf16 fast performance verification (#30551 )	4 years ago
cpu_info_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
cuda_device_function.h	add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199 )	4 years ago
cuda_device_guard.cc	…
cuda_device_guard.h	…
cuda_error.proto	Optimize the error messages of paddle CUDA API (#23816 )	5 years ago
cuda_helper.h	Add Retry Logic to CublasHandlerHolder	4 years ago
cuda_helper_test.cu	Fix index overflow bug of the CUDA kernel loop increment (#25435 )	5 years ago
cuda_primitives.h	[Complex] Add support for complex grad accumulated (#29889 )	4 years ago
cuda_profiler.h	use nvtx push pop in timeline (#30567 )	4 years ago
cuda_resource_pool.cc	refine PADDLE_ENFORCE (#25456 )	5 years ago
cuda_resource_pool.h	use iwyu clean include (#27267 )	4 years ago
cudnn_desc.h	Add tf32 switch for cuDNN (#29192 )	4 years ago
cudnn_desc_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
cudnn_helper.h	fix rnn_op bug in cudnn_version>= 8 (#29406 )	4 years ago
cudnn_helper_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
cudnn_workspace_helper.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
cudnn_workspace_helper.h	make_conv_workspace_size_configurable, test=develop (#20662 )	5 years ago
denormal.cc	update. (#30585 )	4 years ago
denormal.h	flush denormals to zero, test=develop (#29924 )	4 years ago
device_code.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
device_code.h	use iwyu clean include (#27267 )	4 years ago
device_code_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
device_context.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
device_context.h	[oneDNN] Cache oneDNN stream not to recreate in each oneDNN op (#30358 )	4 years ago
device_context_test.cu	Revert "Revert "Remove op handle lock""	6 years ago
device_context_xpu_test.cc	support Baidu Kunlun AI Accelerator (#25959 )	5 years ago
device_memory_aligment.cc	fix PADDLE_ENFORCE (#25297 )	5 years ago
device_memory_aligment.h	use iwyu clean include (#27267 )	4 years ago
device_tracer.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
device_tracer.h	use iwyu clean include (#27267 )	4 years ago
enforce.cc	…
enforce.h	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
enforce_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
error_codes.proto	Enrich the type of error and declare the error type interfaces (#21024 )	5 years ago
errors.cc	Enrich the python error types of paddle & polish format (#28124 )	4 years ago
errors.h	Enrich the python error types of paddle & polish format (#28124 )	4 years ago
errors_test.cc	Hide the C++ stack by default and add hints (#29042 )	4 years ago
event.h	Add pe profiler Event (#24611 )	5 years ago
flags.cc	call_statck is turned on default when ON_INFER=ON (#29798 )	4 years ago
float16.h	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
float16_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
float16_test.cu	Refine PADDLE_ENFORCE (#25369 )	5 years ago
for_range.h	…
gen_comm_id_helper.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
gen_comm_id_helper.h	【kunlun】dygraph supports multi xpu card training (#30671 )	4 years ago
gloo_context.cc	Initialize gloo for low level collective apis (#27672 )	4 years ago
gloo_context.h	Initialize gloo for low level collective apis (#27672 )	4 years ago
gpu_info.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
gpu_info.h	[API2.0] add op for cudnn version query test=develop (#26180 )	5 years ago
gpu_launch_config.h	Enable bilateral_slice unittest on windows platform (#29896 )	4 years ago
hostdevice.h	添加rocm平台支持代码 (#29342 )	4 years ago
init.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
init.h	Fix gpu memory allocation bug. (#28703 )	4 years ago
init_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
lock_guard_ptr.h	…
lodtensor_printer.cc	use iwyu clean include (#27267 )	4 years ago
lodtensor_printer.h	use iwyu clean include (#27267 )	4 years ago
lodtensor_printer_test.cc	use iwyu clean include (#27267 )	4 years ago
macros.h	add musl option (#27798 )	4 years ago
mkldnn_helper.h	[oneDNN] Cache oneDNN stream not to recreate in each oneDNN op (#30358 )	4 years ago
mkldnn_reuse.h	[oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757 )	4 years ago
monitor.cc	use iwyu clean include (#27267 )	4 years ago
monitor.h	use iwyu clean include (#27267 )	4 years ago
nccl_helper.h	[Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455 )	4 years ago
place.cc	support Baidu Kunlun AI Accelerator (#25959 )	5 years ago
place.h	support Baidu Kunlun AI Accelerator (#25959 )	5 years ago
place_test.cc	use iwyu clean include (#27267 )	4 years ago
port.h	call_statck is turned on default when ON_INFER=ON (#29798 )	4 years ago
profiler.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
profiler.cu	refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603 )	6 years ago
profiler.h	use nvtx push pop in timeline (#30567 )	4 years ago
profiler.proto	Add memory profiler (#16137 )	6 years ago
profiler_helper.h	use nvtx push pop in timeline (#30567 )	4 years ago
profiler_test.cc	use iwyu clean include (#27267 )	4 years ago
resource_pool.h	refine PADDLE_ENFORCE (#25456 )	5 years ago
stream_callback_manager.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
stream_callback_manager.h	fix bug	6 years ago
test_limit_gpu_memory.cu	Add flags to limit gpu memory (#22793 )	5 years ago
timer.cc	Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer	6 years ago
timer.h	use iwyu clean include (#27267 )	4 years ago
timer_test.cc	use iwyu clean include second time, test=develop (#30829 )	4 years ago
transform.h	fix PADDLE_ENFORCE (#25297 )	5 years ago
transform_test.cu	…
type_defs.h	[ROCM] update fluid platform for rocm35 (part1), test=develop (#30639 )	4 years ago
variant.h	…
xpu_header.h	add xpu ops for training transformer in kunlun (#29539 )	4 years ago
xpu_info.cc	support Baidu Kunlun AI Accelerator (#25959 )	5 years ago
xpu_info.h	【kunlun】dygraph supports multi xpu card training (#30671 )	4 years ago