You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators/math
Tao Luo ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
6 years ago
..
detail Windows/online (#14474) 6 years ago
CMakeLists.txt Windows/online (#14474) 6 years ago
algorithm.h test=develop 6 years ago
blas.cc
blas.h - ASUM MKL integration 6 years ago
blas_impl.cu.h Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) 6 years ago
blas_impl.h - Coding style fixes 6 years ago
compound_functors.h Feature/op_fuse_pass (#12440) 7 years ago
concat.hip.cu
concat_and_split.cc Refine Split op (#13967) 6 years ago
concat_and_split.cu Refine Split op (#13967) 6 years ago
concat_and_split.h Refine Split op (#13967) 6 years ago
concat_test.cc Refine Split op (#13967) 6 years ago
context_project.cc
context_project.cu
context_project.h
cos_sim_functor.cc
cos_sim_functor.cu Fix SelectedRowsAdd bug (#14309) 6 years ago
cos_sim_functor.h
cpu_vec.h exp, sigmoid, tanh jitcode support more size 6 years ago
cpu_vec_test.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog 6 years ago
cross_entropy.cc
cross_entropy.cu hot fix log (#14293) 6 years ago
cross_entropy.h Add fp16 backward support (#14202) 6 years ago
depthwise_conv.cu optimize depthwise conv by register memory (#13778) 6 years ago
depthwise_conv.h Optimization of Kernels that related to DeepLabv3+ (#13534) 6 years ago
fc_compute.h add Set/GetCPUNumThreads api 6 years ago
functors.h Feature/op_fuse_pass (#12440) 7 years ago
gru_compute.cc
gru_compute.cu
gru_compute.h
im2col.cc
im2col.cu
im2col.h
im2col_cfo_cpu.h
im2col_test.cc fix unit test cases 6 years ago
jit_code.cc enable gru jitcode and refine act and lstm jitcode 6 years ago
jit_code.h enable gru jitcode and refine act and lstm jitcode 6 years ago
jit_gen.cc add jit gencode 6 years ago
jit_gen.h add jit gencode 6 years ago
jit_kernel.cc thread local jit kernels 6 years ago
jit_kernel.h add gru refer code and remove redundant avx code 6 years ago
jit_kernel_blas.cc Merge branch 'develop' into fea/jit/rnn 6 years ago
jit_kernel_crf_decode.cc exp, sigmoid, tanh jitcode support more size 6 years ago
jit_kernel_exp.cc add gru refer code and remove redundant avx code 6 years ago
jit_kernel_impl.h add gru refer code and remove redundant avx code 6 years ago
jit_kernel_layer_norm.cc Optimize the layer_norm operator with AVX intrinsic function (#14417) 6 years ago
jit_kernel_macro.h jitkernel lstm refer support peephole 6 years ago
jit_kernel_refer.h add comments and follow comments 6 years ago
jit_kernel_rnn.cc enable gru jitcode and refine act and lstm jitcode 6 years ago
jit_kernel_test.cc Polish code 6 years ago
lstm_compute.cc
lstm_compute.cu
lstm_compute.h
math_function.cc "fix compile error" (#13579) 6 years ago
math_function.cu
math_function.h "fix compile error" (#13579) 6 years ago
math_function_impl.h
math_function_test.cc
math_function_test.cu
matrix_bit_code.cc polish code, test=develop 6 years ago
matrix_bit_code.h polish code, test=develop 6 years ago
maxouting.cc
maxouting.cu
maxouting.h
padding.h
pooling.cc move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop 6 years ago
pooling.cu fxi avg pool trt bug and fix cpplint 6 years ago
pooling.h add macro for pool2dDirectCUDAFunctor 6 years ago
sampler.cc Make NCE_OP more efficient and support SelectedRows (#14469) 6 years ago
sampler.h Make NCE_OP more efficient and support SelectedRows (#14469) 6 years ago
selected_rows_functor.cc Revert the changes of VLOG 6 years ago
selected_rows_functor.cu Revert the changes of VLOG 6 years ago
selected_rows_functor.h merge from develop 6 years ago
selected_rows_functor_test.cc sum op handle empty input 6 years ago
selected_rows_functor_test.cu.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation 6 years ago
sequence2batch.cc
sequence2batch.cu
sequence2batch.h
sequence_padding.cc
sequence_padding.cu
sequence_padding.h
sequence_padding_test.cc
sequence_pooling.cc Rebase of seqpool-max optimization 6 years ago
sequence_pooling.cu minor fix 6 years ago
sequence_pooling.h Rebase of seqpool-max optimization 6 years ago
sequence_pooling_test.cc fix some compiler warning 6 years ago
sequence_scale.cc
sequence_scale.cu
sequence_scale.h
softmax.cc - Softmax for Inference is enabled when ON_INFER is set 6 years ago
softmax.cu - Softmax for Inference is enabled when ON_INFER is set 6 years ago
softmax.h - Removing partial specialization of sotmax for inference for GPU 6 years ago
softmax_impl.h - ASUM MKL integration 6 years ago
unpooling.cc
unpooling.cu
unpooling.h
vol2col.cc
vol2col.cu
vol2col.h
vol2col_test.cc