Paddle/paddle/fluid/operators/math
wopeizl fa78fc60be
Merge pull request #14907 from wopeizl/windows/avx
7 years ago
detail add avx support for windows 7 years ago
CMakeLists.txt add prelu gpu inference 7 years ago
algorithm.h
blas.cc
blas.h - ASUM MKL integration 7 years ago
blas_impl.cu.h Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) 7 years ago
blas_impl.h - Coding style fixes 7 years ago
compound_functors.h
concat.hip.cu
concat_and_split.cc
concat_and_split.cu
concat_and_split.h
concat_test.cc
context_project.cc
context_project.cu Fix Eigen macro when using GPU 7 years ago
context_project.h
cos_sim_functor.cc
cos_sim_functor.cu
cos_sim_functor.h
cpu_vec.h add avx support for windows 7 years ago
cpu_vec_test.cc remove jit namespace 7 years ago
cross_entropy.cc
cross_entropy.cu
cross_entropy.h
depthwise_conv.cu
depthwise_conv.h
fc_compute.h add Set/GetCPUNumThreads api 7 years ago
functors.h
gru_compute.cc
gru_compute.cu
gru_compute.h
im2col.cc
im2col.cu
im2col.h
im2col_cfo_cpu.h
im2col_test.cc fix unit test cases 7 years ago
jit_code.cc add avx support for windows 7 years ago
jit_code.h add avx support for windows 7 years ago
jit_gen.cc remove jit namespace 7 years ago
jit_gen.h
jit_kernel.cc remove jit namespace 7 years ago
jit_kernel.h add gru refer code and remove redundant avx code 7 years ago
jit_kernel_blas.cc remove jit namespace 7 years ago
jit_kernel_crf_decode.cc add avx support for windows 7 years ago
jit_kernel_exp.cc remove jit namespace 7 years ago
jit_kernel_impl.h add gru refer code and remove redundant avx code 7 years ago
jit_kernel_layer_norm.cc add avx support for windows 7 years ago
jit_kernel_macro.h remove jit namespace 7 years ago
jit_kernel_refer.h add comments and follow comments 7 years ago
jit_kernel_rnn.cc enable gru jitcode and refine act and lstm jitcode 7 years ago
jit_kernel_test.cc remove jit namespace 7 years ago
lstm_compute.cc
lstm_compute.cu
lstm_compute.h
math_function.cc Change tensor uses proto::VarType::type 7 years ago
math_function.cu Change tensor uses proto::VarType::type 7 years ago
math_function.h
math_function_impl.h
math_function_test.cc
math_function_test.cu
matrix_bit_code.cc optimize op with blas 7 years ago
matrix_bit_code.h optimize op with blas 7 years ago
maxouting.cc
maxouting.cu
maxouting.h
padding.h
pooling.cc fix doc errors. test=develop 7 years ago
pooling.cu fix doc errors. test=develop 7 years ago
pooling.h fix doc errors. test=develop 7 years ago
prelu.cu add prelu gpu inference 7 years ago
prelu.h add prelu gpu inference 7 years ago
sampler.cc Make NCE_OP more efficient and support SelectedRows (#14469) 7 years ago
sampler.h Make NCE_OP more efficient and support SelectedRows (#14469) 7 years ago
selected_rows_functor.cc Revert the changes of VLOG 7 years ago
selected_rows_functor.cu Revert the changes of VLOG 7 years ago
selected_rows_functor.h
selected_rows_functor_test.cc
selected_rows_functor_test.cu.cc
sequence2batch.cc
sequence2batch.cu Fix Eigen macro when using GPU 7 years ago
sequence2batch.h
sequence_padding.cc
sequence_padding.cu
sequence_padding.h
sequence_padding_test.cc
sequence_pooling.cc
sequence_pooling.cu minor fix 7 years ago
sequence_pooling.h
sequence_pooling_test.cc
sequence_scale.cc
sequence_scale.cu
sequence_scale.h
softmax.cc - Softmax for Inference is enabled when ON_INFER is set 7 years ago
softmax.cu Fix Eigen macro when using GPU 7 years ago
softmax.h - Removing partial specialization of softmax for inference for GPU 7 years ago
softmax_impl.h Add reduce sparse tensor feature. (#14757) 7 years ago
unpooling.cc
unpooling.cu
unpooling.h
vol2col.cc
vol2col.cu
vol2col.h
vol2col_test.cc