You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators/math
Dun 9f8f0fc2d3
Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
..
detail fix gru_gpu_kernel test=develop 6 years ago
CMakeLists.txt Tree conv op (#15217) 6 years ago
algorithm.h test=develop 6 years ago
blas.cc
blas.h Use mkl 6 years ago
blas_impl.cu.h Revert "Revert "Remove op handle lock"" 6 years ago
blas_impl.h Use mkl 6 years ago
compound_functors.h Feature/op_fuse_pass (#12440) 6 years ago
concat.hip.cu
concat_and_split.cc Refine Split op (#13967) 6 years ago
concat_and_split.cu Move GetTensor to tensor_util (#15011) 6 years ago
concat_and_split.h Refine Split op (#13967) 6 years ago
concat_test.cc Refine Split op (#13967) 6 years ago
context_project.cc
context_project.cu Fix Eigen macro when using GPU 6 years ago
context_project.h
cos_sim_functor.cc
cos_sim_functor.cu Fix SelectedRowsAdd bug (#14309) 6 years ago
cos_sim_functor.h
cpu_vec.h add avx support for windows 6 years ago
cpu_vec_test.cc remove jit namespace 6 years ago
cross_entropy.cc
cross_entropy.cu hot fix log (#14293) 6 years ago
cross_entropy.h Add fp16 backward support (#14202) 6 years ago
depthwise_conv.cu Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
depthwise_conv.h Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
fc_compute.h fix enum style 6 years ago
functors.h Feature/op_fuse_pass (#12440) 6 years ago
gru_compute.cc fix build problem test=develop 6 years ago
gru_compute.cu update avx gru grad kernel test=develop 6 years ago
gru_compute.h update gru op forward kernel 6 years ago
im2col.cc
im2col.cu
im2col.h
im2col_cfo_cpu.h
im2col_test.cc fix unit test cases 6 years ago
lstm_compute.cc
lstm_compute.cu
lstm_compute.h
math_function.cc Change tensor uses proto::VarType::type 6 years ago
math_function.cu Change tensor uses proto::VarType::type 6 years ago
math_function.h "fix compile error" (#13579) 6 years ago
math_function_impl.h rewrite ddim 6 years ago
math_function_test.cc
math_function_test.cu
matrix_bit_code.cc fix bug after merge reyoung optimization, test=develop 6 years ago
matrix_bit_code.h fix bug after merge reyoung optimization, test=develop 6 years ago
maxouting.cc
maxouting.cu
maxouting.h
padding.h
pooling.cc fix doc errors. test=develop 6 years ago
pooling.cu fix doc errors. test=develop 6 years ago
pooling.h fix doc errors. test=develop 6 years ago
prelu.cu add prelu gpu inference 6 years ago
prelu.h add prelu gpu inference 6 years ago
sampler.cc Make NCE_OP more efficient and support SelectedRows (#14469) 6 years ago
sampler.h Make NCE_OP more efficient and support SelectedRows (#14469) 6 years ago
selected_rows_functor.cc follow comment test=develop 6 years ago
selected_rows_functor.cu Polish code 6 years ago
selected_rows_functor.h Add sorted_result parameter to SelectedRows Functor 6 years ago
selected_rows_functor_test.cc sum op handle empty input 6 years ago
selected_rows_functor_test.cu.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation 6 years ago
sequence2batch.cc
sequence2batch.cu Fix Eigen macro when using GPU 6 years ago
sequence2batch.h
sequence_padding.cc
sequence_padding.cu
sequence_padding.h
sequence_padding_test.cc
sequence_pooling.cc use height from params of jitcode 6 years ago
sequence_pooling.cu minor fix 6 years ago
sequence_pooling.h Rebase of seqpool-max optimization 6 years ago
sequence_pooling_test.cc fix some compiler warning 6 years ago
sequence_scale.cc
sequence_scale.cu
sequence_scale.h
softmax.cc - Softmax for Inference is enabled when ON_INFER is set 6 years ago
softmax.cu Fix Eigen macro when using GPU 6 years ago
softmax.h [Feature] support mix precision training for resnet (#14899) 6 years ago
softmax_impl.h rewrite ddim 6 years ago
tree2col.cc Tree conv op (#15217) 6 years ago
tree2col.cu Tree conv op (#15217) 6 years ago
tree2col.h Tree conv op (#15217) 6 years ago
unpooling.cc
unpooling.cu
unpooling.h
vol2col.cc
vol2col.cu
vol2col.h
vol2col_test.cc