You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators/math
Zeng Jinle f5641000bb
Add a unittest to inplace elementwise_add (#18385)
6 years ago
..
detail improve gru unit performance. (#16338) 6 years ago
CMakeLists.txt Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op 6 years ago
algorithm.h
beam_search.cc Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
beam_search.cu Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
beam_search.h Return parent_idx in beam_search op (#15520) 6 years ago
beam_search_test.cc Return parent_idx in beam_search op (#15520) 6 years ago
blas.cc
blas.h Optimize gelu operation with mkl erf. 6 years ago
blas_impl.cu.h Revert "Revert "Remove op handle lock"" 6 years ago
blas_impl.h Add a unittest to inplace elementwise_add (#18385) 6 years ago
compound_functors.h Optimize fused_elewise_activation_grad op. (#18041) 6 years ago
concat.hip.cu
concat_and_split.cc
concat_and_split.cu Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979) 6 years ago
concat_and_split.h
concat_test.cc Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415) 6 years ago
context_project.cc
context_project.cu Fix Eigen macro when using GPU 6 years ago
context_project.h fix security bugs : (#17464) 6 years ago
cos_sim_functor.cc
cos_sim_functor.cu
cos_sim_functor.h
cpu_vec.h [CPU] refine cpu softmax bwd (#17534) 6 years ago
cpu_vec_test.cc [CPU] refine cpu softmax bwd (#17534) 6 years ago
cross_entropy.cc Softmax_cross_entropy op add axis (#16806) 6 years ago
cross_entropy.cu Softmax_cross_entropy op add axis (#16806) 6 years ago
cross_entropy.h Softmax_cross_entropy op add axis (#16806) 6 years ago
depthwise_conv.cu Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
depthwise_conv.h Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
fc_compute.h simplify the jitkernel templates and tests 6 years ago
functors.h Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236) 6 years ago
gru_compute.cc fix build problem test=develop 6 years ago
gru_compute.cu Add conditional compile for gru opt (#17368) 6 years ago
gru_compute.h update gru op forward kernel 6 years ago
im2col.cc
im2col.cu
im2col.h
im2col_cfo_cpu.h
im2col_test.cc fix unit test cases 6 years ago
lstm_compute.cc add cell clip and proj clip, fix bug for h0 6 years ago
lstm_compute.cu add cell clip and proj clip, fix bug for h0 6 years ago
lstm_compute.h add cell clip and proj clip, fix bug for h0 6 years ago
math_function.cc Change tensor uses proto::VarType::type 6 years ago
math_function.cu Sequence mask support tensor (#18249) 6 years ago
math_function.h
math_function_impl.h rewrite ddim 6 years ago
math_function_test.cc
math_function_test.cu
matrix_bit_code.cc fix bug after merge reyoung optimization, test=develop 6 years ago
matrix_bit_code.h fix bug after merge reyoung optimization, test=develop 6 years ago
maxouting.cc
maxouting.cu
maxouting.h
padding.h
pooling.cc fix doc errors. test=develop 6 years ago
pooling.cu fix doc errors. test=develop 6 years ago
pooling.h fix doc errors. test=develop 6 years ago
prelu.cu add prelu gpu inference 6 years ago
prelu.h add prelu gpu inference 6 years ago
sample_prob.cc update comment 6 years ago
sample_prob.cu update comment 6 years ago
sample_prob.h update comment 6 years ago
sampler.cc nce add check sample lables, test=develop (#15463) 6 years ago
sampler.h nce add check sample lables, test=develop (#15463) 6 years ago
selected_rows_functor.cc optimize merge add if input rows of all selected rows is not duplicated 6 years ago
selected_rows_functor.cu Polish code 6 years ago
selected_rows_functor.h improve communicator 6 years ago
selected_rows_functor_test.cc fix cpplint test=develop 6 years ago
selected_rows_functor_test.cu.cc Add the CUDA kernel for beam_search op (#15020) 6 years ago
sequence2batch.cc
sequence2batch.cu Fix Eigen macro when using GPU 6 years ago
sequence2batch.h
sequence_padding.cc Optimize the sequence padding op (#17403) 6 years ago
sequence_padding.cu fix sequence pad; test=develop 6 years ago
sequence_padding.h
sequence_padding_test.cc
sequence_pooling.cc Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling.cu Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling.h Enable seq_pool op to accept len 0 input (#17284) 6 years ago
sequence_pooling_test.cc Add the CUDA kernel for beam_search op (#15020) 6 years ago
sequence_scale.cc
sequence_scale.cu
sequence_scale.h
softmax.cc
softmax.cu Fix Eigen macro when using GPU 6 years ago
softmax.h [CPU] refine cpu softmax bwd (#17534) 6 years ago
softmax_impl.h [CPU] refine cpu softmax bwd (#17534) 6 years ago
tree2col.cc Tree conv op (#15217) 6 years ago
tree2col.cu Tree conv op (#15217) 6 years ago
tree2col.h Tree conv op (#15217) 6 years ago
unpooling.cc
unpooling.cu
unpooling.h
vol2col.cc
vol2col.cu
vol2col.h
vol2col_test.cc