You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators/math
ShenLiang becf99d2e8
fix error message (#30135)
4 years ago
..
detail Fix gru performace decline in 1.8.5 (#29455) 4 years ago
CMakeLists.txt 添加rocm平台支持代码 (#29342) 4 years ago
algorithm.h
beam_search.cc use iwyu clean include (#27267) 4 years ago
beam_search.cu fix the error message for the math dir 4 years ago
beam_search.h
beam_search_test.cc use iwyu clean include (#27267) 4 years ago
bert_encoder_functor.cu detect tensorRT plugin fp16 in runtime (#27933) 4 years ago
bert_encoder_functor.h detect tensorRT plugin fp16 in runtime (#27933) 4 years ago
blas.cc use iwyu clean include (#27267) 4 years ago
blas.h Paddle support compile on sw (#27858) 4 years ago
blas_impl.cu.h [Complex] Add support for complex grad accumulated (#29889) 4 years ago
blas_impl.h [Complex] Add support for complex grad accumulated (#29889) 4 years ago
bloomfilter.h refine murmurhash3_x64_128 for bloom_filter (#20996) 5 years ago
complex_functors.h complex gradient matmul (#29966) 4 years ago
compound_functors.h
concat.hip.cu
concat_and_split.cc use iwyu clean include (#27267) 4 years ago
concat_and_split.cu Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
concat_and_split.h add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
concat_test.cc use iwyu clean include (#27267) 4 years ago
context_project.cc use iwyu clean include (#27267) 4 years ago
context_project.cu
context_project.h use iwyu clean include (#27267) 4 years ago
cos_sim_functor.cc use iwyu clean include (#27267) 4 years ago
cos_sim_functor.cu
cos_sim_functor.h use iwyu clean include (#27267) 4 years ago
cpu_vec.h use iwyu clean include (#27267) 4 years ago
cpu_vec_test.cc use iwyu clean include (#27267) 4 years ago
cross_entropy.cc use iwyu clean include (#27267) 4 years ago
cross_entropy.cu Enhance the error messages for files in operators/math 4 years ago
cross_entropy.h
depthwise_conv.cu Optimize the depthwise op test=develop (#22265) 5 years ago
depthwise_conv.h conv_transpose supports channel_last input, test=develop, test=document_preview (#20072) 5 years ago
fc.cc optimize fc jit (#21878) 5 years ago
fc.cu [Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494) 5 years ago
fc.h Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) 5 years ago
functors.h Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636) 5 years ago
gru_compute.cc fix gru gcc7.4 bug for the gru compile 4 years ago
gru_compute.cu Fix ce ocr_recognition test fails (#20987) 5 years ago
gru_compute.h Add eigen gru and fix the dropout bug in the rnn 4 years ago
im2col.cc use iwyu clean include (#27267) 4 years ago
im2col.cu Enhance the error messages for files in operators/math 4 years ago
im2col.h conv_transpose supports channel_last input, test=develop, test=document_preview (#20072) 5 years ago
im2col_cfo_cpu.h fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589) 5 years ago
im2col_test.cc
lstm_compute.cc Add LSTM, Simple RNN and GRU CPU kernel (#28577) 4 years ago
lstm_compute.cu Add LSTM, Simple RNN and GRU CPU kernel (#28577) 4 years ago
lstm_compute.h Add LSTM, Simple RNN and GRU CPU kernel (#28577) 4 years ago
math_cuda_utils.h detect tensorRT plugin fp16 in runtime (#27933) 4 years ago
math_function.cc Add complex dtype op (add) test example (#29603) 4 years ago
math_function.cu add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
math_function.h xpu support for fill_constant Op (#27675) 4 years ago
math_function_impl.h adjust kunlun header file (#28536) 4 years ago
math_function_test.cc Error description optimize for the math dir 4 years ago
math_function_test.cu Error description optimize for math dir 4 years ago
matrix_bit_code.cc use iwyu clean include (#27267) 4 years ago
matrix_bit_code.h Add NOMINMAX define due to windows.h max/min macro conflict (#25637) 5 years ago
matrix_inverse.cc Add the implementation of inverse (#23310) 5 years ago
matrix_inverse.cu.cc fix error message (#30135) 4 years ago
matrix_inverse.h Add the implementation of inverse (#23310) 5 years ago
maxouting.cc maxout supports channel_last input (#20846) 5 years ago
maxouting.cu maxout supports channel_last input (#20846) 5 years ago
maxouting.h maxout supports channel_last input (#20846) 5 years ago
padding.h Error description optimize for math dir 4 years ago
pooling.cc fix Wmaybe-uninitialized warning in pooling.cc, test=develop (#28126) 4 years ago
pooling.cu Pool2d cuda kernel supports fp16 (#28316) 4 years ago
pooling.h Pool2d cuda kernel supports fp16 (#28316) 4 years ago
prelu.cu remove duplicated macro (#29563) 4 years ago
prelu.h fix bug of prelu when rank not equal 4, test=develop (#25067) 5 years ago
sample_prob.cc use iwyu clean include (#27267) 4 years ago
sample_prob.cu
sample_prob.h use iwyu clean include (#27267) 4 years ago
sampler.cc Refine paddle.manual_seed (#26496) 5 years ago
sampler.h Error description optimize for the math dir 4 years ago
segment_pooling.cc Add the cpu version of segment sum mean max min op 4 years ago
segment_pooling.cu refine gpu kernel config for Paddle (#28085) 4 years ago
segment_pooling.h Add the cpu version of segment sum mean max min op 4 years ago
selected_rows_functor.cc [Complex] Add support for complex grad accumulated (#29889) 4 years ago
selected_rows_functor.cu [Complex] Add support for complex grad accumulated (#29889) 4 years ago
selected_rows_functor.h
selected_rows_functor_test.cc
selected_rows_functor_test.cu.cc use iwyu clean include (#27267) 4 years ago
sequence2batch.cc use iwyu clean include (#27267) 4 years ago
sequence2batch.cu optimize the error message for math dir 4 years ago
sequence2batch.h optimize the error message for math dir 4 years ago
sequence_padding.cc use iwyu clean include (#27267) 4 years ago
sequence_padding.cu optimize the error message for math dir 4 years ago
sequence_padding.h optimize the error message for math dir 4 years ago
sequence_padding_test.cc use iwyu clean include (#27267) 4 years ago
sequence_pooling.cc optimize the error message for math dir 4 years ago
sequence_pooling.cu optimize the error message for math dir 4 years ago
sequence_pooling.h support 2-level lod of input in sequence_pool (#19839) 5 years ago
sequence_pooling_test.cc optimize the error message for math dir 4 years ago
sequence_scale.cc add support to float64 input of warpctc op. (#27399) 4 years ago
sequence_scale.cu add support to float64 input of warpctc op. (#27399) 4 years ago
sequence_scale.h use iwyu clean include (#27267) 4 years ago
softmax.cc
softmax.cu replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109) 5 years ago
softmax.h
softmax_impl.h remove eval in eigen function when dtype is fp16 (#23845) 5 years ago
tree2col.cc optimize the error message for math dir 4 years ago
tree2col.cu Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
tree2col.h
unpooling.cc optimize the error message for unpooling.cc 4 years ago
unpooling.cu
unpooling.h
vol2col.cc use iwyu clean include (#27267) 4 years ago
vol2col.cu Error description optimize for the math dir 4 years ago
vol2col.h conv_transpose supports channel_last input, test=develop, test=document_preview (#20072) 5 years ago
vol2col_test.cc use iwyu clean include (#27267) 4 years ago