..
detail
fix gru_gpu_kernel test=develop
6 years ago
CMakeLists.txt
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
algorithm.h
test=develop
7 years ago
beam_search.cc
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
beam_search.cu
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
beam_search.h
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
beam_search_test.cc
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
blas.cc
…
blas.h
Use mkl
6 years ago
blas_impl.cu.h
Revert "Revert "Remove op handle lock""
6 years ago
blas_impl.h
Use mkl
6 years ago
compound_functors.h
Feature/op_fuse_pass ( #12440 )
7 years ago
concat.hip.cu
…
concat_and_split.cc
Refine Split op ( #13967 )
7 years ago
concat_and_split.cu
Move GetTensor to tensor_util ( #15011 )
6 years ago
concat_and_split.h
Refine Split op ( #13967 )
7 years ago
concat_test.cc
Refine Split op ( #13967 )
7 years ago
context_project.cc
…
context_project.cu
Fix Eigen macro when using GPU
6 years ago
context_project.h
…
cos_sim_functor.cc
…
cos_sim_functor.cu
Fix SelectedRowsAdd bug ( #14309 )
7 years ago
cos_sim_functor.h
…
cpu_vec.h
add avx support for windows
6 years ago
cpu_vec_test.cc
remove jit namespace
6 years ago
cross_entropy.cc
…
cross_entropy.cu
hot fix log ( #14293 )
7 years ago
cross_entropy.h
Add fp16 backward support ( #14202 )
7 years ago
depthwise_conv.cu
Memory optimization of depthwise conv op and group norm op ( #15313 )
6 years ago
depthwise_conv.h
Memory optimization of depthwise conv op and group norm op ( #15313 )
6 years ago
fc_compute.h
fix enum style
6 years ago
functors.h
Feature/op_fuse_pass ( #12440 )
7 years ago
gru_compute.cc
fix build problem test=develop
6 years ago
gru_compute.cu
update avx gru grad kernel test=develop
6 years ago
gru_compute.h
update gru op forward kernel
6 years ago
im2col.cc
…
im2col.cu
…
im2col.h
…
im2col_cfo_cpu.h
…
im2col_test.cc
fix unit test cases
6 years ago
lstm_compute.cc
…
lstm_compute.cu
…
lstm_compute.h
…
math_function.cc
Change tensor uses proto::VarType::type
6 years ago
math_function.cu
Change tensor uses proto::VarType::type
6 years ago
math_function.h
"fix compile error" ( #13579 )
7 years ago
math_function_impl.h
rewrite ddim
6 years ago
math_function_test.cc
…
math_function_test.cu
…
matrix_bit_code.cc
fix bug after merge reyoung optimization, test=develop
6 years ago
matrix_bit_code.h
fix bug after merge reyoung optimization, test=develop
6 years ago
maxouting.cc
…
maxouting.cu
…
maxouting.h
…
padding.h
…
pooling.cc
fix doc errors. test=develop
6 years ago
pooling.cu
fix doc errors. test=develop
6 years ago
pooling.h
fix doc errors. test=develop
6 years ago
prelu.cu
add prelu gpu inference
6 years ago
prelu.h
add prelu gpu inference
6 years ago
sampler.cc
nce add check sample lables, test=develop ( #15463 )
6 years ago
sampler.h
nce add check sample lables, test=develop ( #15463 )
6 years ago
selected_rows_functor.cc
follow comment test=develop
6 years ago
selected_rows_functor.cu
Polish code
6 years ago
selected_rows_functor.h
Add sorted_result parameter to SelectedRows Functor
6 years ago
selected_rows_functor_test.cc
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
selected_rows_functor_test.cu.cc
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
sequence2batch.cc
…
sequence2batch.cu
Fix Eigen macro when using GPU
6 years ago
sequence2batch.h
…
sequence_padding.cc
…
sequence_padding.cu
…
sequence_padding.h
…
sequence_padding_test.cc
…
sequence_pooling.cc
use height from params of jitcode
6 years ago
sequence_pooling.cu
minor fix
6 years ago
sequence_pooling.h
Rebase of seqpool-max optimization
7 years ago
sequence_pooling_test.cc
Add the CUDA kernel for beam_search op ( #15020 )
6 years ago
sequence_scale.cc
…
sequence_scale.cu
…
sequence_scale.h
…
softmax.cc
- Softmax for Inference is enabled when ON_INFER is set
6 years ago
softmax.cu
Fix Eigen macro when using GPU
6 years ago
softmax.h
[Feature] support mix precision training for resnet ( #14899 )
6 years ago
softmax_impl.h
rewrite ddim
6 years ago
tree2col.cc
Tree conv op ( #15217 )
6 years ago
tree2col.cu
Tree conv op ( #15217 )
6 years ago
tree2col.h
Tree conv op ( #15217 )
6 years ago
unpooling.cc
…
unpooling.cu
…
unpooling.h
…
vol2col.cc
…
vol2col.cu
…
vol2col.h
…
vol2col_test.cc
…