You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators/math
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
6 years ago
..
detail improve gru unit performance. (#16338) 6 years ago
CMakeLists.txt Implement the GPU kernel of fc operator (#19687) 6 years ago
algorithm.h
beam_search.cc Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
beam_search.cu Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
beam_search.h Return parent_idx in beam_search op (#15520) 6 years ago
beam_search_test.cc Return parent_idx in beam_search op (#15520) 6 years ago
blas.cc
blas.h Use sparse matrix to implement fused emb_seq_pool operator (#19064) 6 years ago
blas_impl.cu.h change PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS (#19205) 6 years ago
blas_impl.h refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607) 6 years ago
compound_functors.h Optimize fused_elewise_activation_grad op. (#18041) 6 years ago
concat.hip.cu
concat_and_split.cc
concat_and_split.cu Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) 6 years ago
concat_and_split.h
concat_test.cc Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415) 6 years ago
context_project.cc
context_project.cu Fix Eigen macro when using GPU 6 years ago
context_project.h fix security bugs : (#17464) 6 years ago
cos_sim_functor.cc
cos_sim_functor.cu Fix SelectedRowsAdd bug (#14309) 6 years ago
cos_sim_functor.h
cpu_vec.h fix sofmax seg fault in AVX, test=develop (#19487) 6 years ago
cpu_vec_test.cc [CPU] refine cpu softmax bwd (#17534) 6 years ago
cross_entropy.cc Softmax_cross_entropy op add axis (#16806) 6 years ago
cross_entropy.cu unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) 6 years ago
cross_entropy.h unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) 6 years ago
depthwise_conv.cu Fix depthwise conv gpu kernel bug (#18582) 6 years ago
depthwise_conv.h Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
fc.cc Implement the GPU kernel of fc operator (#19687) 6 years ago
fc.cu Implement the GPU kernel of fc operator (#19687) 6 years ago
fc.h Implement the GPU kernel of fc operator (#19687) 6 years ago
functors.h Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236) 6 years ago
gru_compute.cc fix build problem test=develop 6 years ago
gru_compute.cu Add conditional compile for gru opt (#17368) 6 years ago
gru_compute.h update gru op forward kernel 6 years ago
im2col.cc
im2col.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
im2col.h
im2col_cfo_cpu.h
im2col_test.cc fix unit test cases 6 years ago
lstm_compute.cc add cell clip and proj clip, fix bug for h0 6 years ago
lstm_compute.cu add cell clip and proj clip, fix bug for h0 6 years ago
lstm_compute.h add cell clip and proj clip, fix bug for h0 6 years ago
math_function.cc Change tensor uses proto::VarType::type 6 years ago
math_function.cu Sequence mask support tensor (#18249) 6 years ago
math_function.h
math_function_impl.h rewrite ddim 6 years ago
math_function_test.cc
math_function_test.cu
matrix_bit_code.cc fix bug after merge reyoung optimization, test=develop 6 years ago
matrix_bit_code.h fix bug after merge reyoung optimization, test=develop 6 years ago
maxouting.cc
maxouting.cu
maxouting.h
padding.h
pooling.cc fix doc errors. test=develop 6 years ago
pooling.cu fix doc errors. test=develop 6 years ago
pooling.h fix doc errors. test=develop 6 years ago
prelu.cu add prelu gpu inference 6 years ago
prelu.h add prelu gpu inference 6 years ago
sample_prob.cc update comment 6 years ago
sample_prob.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
sample_prob.h update comment 6 years ago
sampler.cc nce add check sample lables, test=develop (#15463) 6 years ago
sampler.h nce add check sample lables, test=develop (#15463) 6 years ago
selected_rows_functor.cc fix the diff between async mode and async_half mode (#19535) 6 years ago
selected_rows_functor.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
selected_rows_functor.h fix the diff between async mode and async_half mode (#19535) 6 years ago
selected_rows_functor_test.cc fix the diff between async mode and async_half mode (#19535) 6 years ago
selected_rows_functor_test.cu.cc Add the CUDA kernel for beam_search op (#15020) 6 years ago
sequence2batch.cc
sequence2batch.cu Fix Eigen macro when using GPU 6 years ago
sequence2batch.h
sequence_padding.cc Optimize the sequence padding op (#17403) 6 years ago
sequence_padding.cu fix sequence pad; test=develop 6 years ago
sequence_padding.h
sequence_padding_test.cc
sequence_pooling.cc Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling.cu Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling.h Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling_test.cc Add the CUDA kernel for beam_search op (#15020) 6 years ago
sequence_scale.cc
sequence_scale.cu
sequence_scale.h
softmax.cc - Softmax for Inference is enabled when ON_INFER is set 6 years ago
softmax.cu paddle::framework::vectorize() templatization (#19627) 6 years ago
softmax.h [CPU] refine cpu softmax bwd (#17534) 6 years ago
softmax_impl.h fix softmax axis!=-1. test=develop (#19800) 6 years ago
tree2col.cc Tree conv op (#15217) 6 years ago
tree2col.cu Tree conv op (#15217) 6 years ago
tree2col.h Tree conv op (#15217) 6 years ago
unpooling.cc
unpooling.cu unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) 6 years ago
unpooling.h
vol2col.cc
vol2col.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
vol2col.h
vol2col_test.cc