You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators/math
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
..
detail improve gru unit performance. (#16338) 6 years ago
CMakeLists.txt Implement the GPU kernel of fc operator (#19687) 6 years ago
algorithm.h
beam_search.cc Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
beam_search.cu Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
beam_search.h
beam_search_test.cc
blas.cc
blas.h Use sparse matrix to implement fused emb_seq_pool operator (#19064) 6 years ago
blas_impl.cu.h change PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS (#19205) 6 years ago
blas_impl.h refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607) 6 years ago
compound_functors.h Optimize fused_elewise_activation_grad op. (#18041) 6 years ago
concat.hip.cu
concat_and_split.cc
concat_and_split.cu Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979) 6 years ago
concat_and_split.h
concat_test.cc Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415) 6 years ago
context_project.cc
context_project.cu
context_project.h fix security bugs : (#17464) 6 years ago
cos_sim_functor.cc
cos_sim_functor.cu
cos_sim_functor.h
cpu_vec.h fix sofmax seg fault in AVX, test=develop (#19487) 6 years ago
cpu_vec_test.cc [CPU] refine cpu softmax bwd (#17534) 6 years ago
cross_entropy.cc Softmax_cross_entropy op add axis (#16806) 6 years ago
cross_entropy.cu unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) 6 years ago
cross_entropy.h unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) 6 years ago
depthwise_conv.cu Fix depthwise conv gpu kernel bug (#18582) 6 years ago
depthwise_conv.h
fc.cc Implement the GPU kernel of fc operator (#19687) 6 years ago
fc.cu Implement the GPU kernel of fc operator (#19687) 6 years ago
fc.h Implement the GPU kernel of fc operator (#19687) 6 years ago
functors.h Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236) 6 years ago
gru_compute.cc
gru_compute.cu Add conditional compile for gru opt (#17368) 6 years ago
gru_compute.h
im2col.cc
im2col.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
im2col.h
im2col_cfo_cpu.h
im2col_test.cc
lstm_compute.cc
lstm_compute.cu
lstm_compute.h
math_function.cc
math_function.cu Sequence mask support tensor (#18249) 6 years ago
math_function.h
math_function_impl.h
math_function_test.cc
math_function_test.cu
matrix_bit_code.cc
matrix_bit_code.h
maxouting.cc
maxouting.cu
maxouting.h
padding.h
pooling.cc
pooling.cu
pooling.h
prelu.cu
prelu.h
sample_prob.cc update comment 6 years ago
sample_prob.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
sample_prob.h update comment 6 years ago
sampler.cc
sampler.h
selected_rows_functor.cc fix the diff between async mode and async_half mode (#19535) 6 years ago
selected_rows_functor.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
selected_rows_functor.h fix the diff between async mode and async_half mode (#19535) 6 years ago
selected_rows_functor_test.cc fix the diff between async mode and async_half mode (#19535) 6 years ago
selected_rows_functor_test.cu.cc
sequence2batch.cc
sequence2batch.cu
sequence2batch.h
sequence_padding.cc Optimize the sequence padding op (#17403) 6 years ago
sequence_padding.cu fix sequence pad; test=develop 6 years ago
sequence_padding.h
sequence_padding_test.cc
sequence_pooling.cc Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling.cu Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling.h Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling_test.cc
sequence_scale.cc
sequence_scale.cu
sequence_scale.h
softmax.cc
softmax.cu paddle::framework::vectorize() templatization (#19627) 6 years ago
softmax.h [CPU] refine cpu softmax bwd (#17534) 6 years ago
softmax_impl.h [CPU] refine cpu softmax bwd (#17534) 6 years ago
tree2col.cc
tree2col.cu
tree2col.h
unpooling.cc
unpooling.cu unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) 6 years ago
unpooling.h
vol2col.cc
vol2col.cu refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) 6 years ago
vol2col.h
vol2col_test.cc