Paddle/paddle/fluid/operators
Hongyu Liu 0701c2db47 Merge pull request #16518 from zhoukunsheng/rsqrt 6 years ago
anakin resolve conflicts with the develop branch test=develop 6 years ago
benchmark Enhance the op benchmark: (#16066) 6 years ago
controlflow fix gc bug in conditional block (#16673) 6 years ago
csp Refine operator cmake (#14413) 6 years ago
detail Merge pull request #14933 from sneaxiy/rewrite_ddim 6 years ago
detection Security issue (#16774) 6 years ago
distributed fix brpc code 6 years ago
distributed_ops fix split_byref_op infer shape 6 years ago
elementwise Fix some grad op desc makers (#16633) 6 years ago
fused check default grad maker 6 years ago
jit fix avx option (#16683) 6 years ago
math fix cpplint test=develop 6 years ago
metrics Fp16 training (#14992) 6 years ago
mkldnn [MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667) 6 years ago
nccl Polish code style 6 years ago
ngraph fix training validation test=develop (#16698) 6 years ago
optimizers Merge pull request #16214 from velconia/imperative_infer_var_type 6 years ago
reader Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator 6 years ago
reduce_ops fix 16823: delete default_grad register for reduce_all, reduce_any 6 years ago
sequence_ops fix some grad op desc maker (#16581) 6 years ago
tensorrt fix trt engine test error. 6 years ago
CMakeLists.txt Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
activation_cudnn.cu.cc polish cudnn related code and fix bug. (#15164) 6 years ago
activation_cudnn_op.cu.cc fix activation grad op desc maker (#16715) 6 years ago
activation_op.cc fix merge conflict 6 years ago
activation_op.cu fix activation grad op desc maker (#16715) 6 years ago
activation_op.h fix merge conflict 6 years ago
add_position_encoding_op.cc fix some op grad maker 6 years ago
add_position_encoding_op.h Exhaustive search for cuDNN conv. (#14286) 6 years ago
affine_channel_op.cc polish the code 6 years ago
affine_channel_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 6 years ago
affine_grid_cudnn_op.cu.cc Add affine grid generator op (#12238) 6 years ago
affine_grid_op.cc fix some grad op desc maker (#16581) 6 years ago
affine_grid_op.h test=develop (#16783) 6 years ago
alloc_continuous_space_op.cc Fuse Adam And SGD ops (#15933) 6 years ago
arg_max_op.cc Change tensor uses proto::VarType::type 6 years ago
arg_max_op.cu Change tensor uses proto::VarType::type 6 years ago
arg_min_max_op_base.h fix min and max bug (#16570) 6 years ago
arg_min_op.cc Change tensor uses proto::VarType::type 6 years ago
arg_min_op.cu Change tensor uses proto::VarType::type 6 years ago
argsort_op.cc Set the right shape of selected_rows (#13723) 6 years ago
argsort_op.cu
argsort_op.h
array_operator.h Revert the changes of VLOG 6 years ago
array_to_lod_tensor_op.cc Change tensor uses proto::VarType::type 6 years ago
assign_op.cc
assign_value_op.cc
assign_value_op.cu.cc Revert ""cherry picked operators changes" (#12184)" (#12747) 7 years ago
assign_value_op.h
attention_lstm_op.cc fix warnings (#15790) 6 years ago
attention_lstm_op.h implement attention lstm cpu forward 7 years ago
average_accumulates_op.cc Change tensor uses proto::VarType::type 6 years ago
average_accumulates_op.cu
average_accumulates_op.h
batch_norm_op.cc test=develop 6 years ago
batch_norm_op.cu Batch norm cudnn accurate (#16545) 6 years ago
batch_norm_op.h Support sync batch norm. (#16121) 6 years ago
batch_size_like.h Fix some grad op desc makers (#16633) 6 years ago
beam_search_decode_op.cc Polish code style 6 years ago
beam_search_decode_op.h Change *(smart_ptr.get()) -> *smart_ptr 6 years ago
beam_search_decode_op_test.cc
beam_search_op.cc Polish code style 6 years ago
beam_search_op.cu.cc Add the CUDA kernel for beam_search op (#15020) 6 years ago
beam_search_op.h Make parent_idx a dispensable output for beam_search op to support models saved by older paddle version. (#16106) 6 years ago
bilinear_tensor_product_op.cc fix some grad op desc maker (#16581) 6 years ago
bilinear_tensor_product_op.cu Fix Eigen macro when using GPU 6 years ago
bilinear_tensor_product_op.h Optimize bilinear tensor product op (#14485) 6 years ago
bpr_loss_op.cc fix grad desc maker 6 years ago
bpr_loss_op.h Add the CUDA kernel for beam_search op (#15020) 6 years ago
cast_op.cc polish the cast op doc (#16078) 6 years ago
cast_op.cu
cast_op.h Revert "cherry picked windows patches." 6 years ago
chunk_eval_op.cc
chunk_eval_op.h
clip_by_norm_op.cc Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
clip_by_norm_op.cu
clip_by_norm_op.h Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
clip_op.cc add op registry type 6 years ago
clip_op.cu
clip_op.h fix sparse gradient clip 6 years ago
concat_op.cc fix concat; test=develop 6 years ago
concat_op.cu.cc
concat_op.h Refine Split op (#13967) 6 years ago
conv_cudnn_op.cu.cc add per kernel config and remove const_cast. 6 years ago
conv_cudnn_op_cache.h add per kernel config and remove const_cast. 6 years ago
conv_fusion_op.cc Inception fusion operator. (#14968) 6 years ago
conv_fusion_op.cu.cc polish 6 years ago
conv_op.cc polish the code 6 years ago
conv_op.cu.cc
conv_op.h Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
conv_shift_op.cc Fix conv_shift_op infershape 6 years ago
conv_shift_op.cu
conv_shift_op.h
conv_transpose_cudnn_op.cu.cc Revert conv transpose cudnn (#15514) 6 years ago
conv_transpose_op.cc fix op grad maker 6 years ago
conv_transpose_op.cu.cc
conv_transpose_op.h Optimization of Kernels that related to DeepLabv3+ (#13534) 6 years ago
cos_sim_op.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into infershape 6 years ago
cos_sim_op.cu Fix Eigen macro when using GPU 6 years ago
cos_sim_op.h refine cos_sim infershape 6 years ago
crf_decoding_op.cc fix warnings (#15790) 6 years ago
crf_decoding_op.h simplify the jitkernel templates and tests 6 years ago
crop_op.cc try to fix ci error 6 years ago
crop_op.cu Fix Eigen macro when using GPU 6 years ago
crop_op.h rewrite ddim 6 years ago
cross_entropy_op.cc fix some op grad maker 6 years ago
cross_entropy_op.cu revert revert 16144 6 years ago
cross_entropy_op.h fix numeric error 6 years ago
ctc_align_op.cc Change tensor uses proto::VarType::type 6 years ago
ctc_align_op.cu
ctc_align_op.h
cudnn_lstm_op.cc fix some op grad maker 6 years ago
cudnn_lstm_op.cu.cc merge develop 6 years ago
cudnn_rnn_cache.h rewrite variable type 6 years ago
cum_op.h fix test issues on windows 6 years ago
cumsum_op.cc
cumsum_op.cu
cvm_op.cc add X to grad 6 years ago
cvm_op.h fix doc 6 years ago
data_norm_op.cc remove mkldnn & fix commit 6 years ago
data_norm_op.h data_norm 6 years ago
delete_var_op.cc [1.1] Load vars on PSERVER (#14037) 6 years ago
dequantize_op.cc Fix comments misunderstanding 6 years ago
dequantize_op.h Add Dequantize OP 6 years ago
detection_map_op.cc modified infer shape 6 years ago
detection_map_op.h Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod"" 6 years ago
dgc_clip_by_norm_op.cc Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
dgc_clip_by_norm_op.cu Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
dgc_clip_by_norm_op.h Fix dgc bug. (#16602) 6 years ago
dgc_op.cc Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
dgc_op.cu Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
dgc_op.h Add DGC(Deep Gradient Compression) interface. (#15841) 6 years ago
dropout_op.cc Merge pull request #16217 from ceci3/doc 6 years ago
dropout_op.cu Some improvements to support bert mixed precision training (#15585) 6 years ago
dropout_op.h modify dropout att; test=develop 6 years ago
dropout_op_test.cc minor fix 6 years ago
edit_distance_op.cc
edit_distance_op.cu
edit_distance_op.h
expand_op.cc revert revert 16144 6 years ago
expand_op.cu support multiple var types for expand op, test=develop 6 years ago
expand_op.h rewrite ddim 6 years ago
fake_dequantize_op.cc rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop 6 years ago
fake_dequantize_op.cu rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop 6 years ago
fake_dequantize_op.h rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop 6 years ago
fake_quantize_op.cc rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop 6 years ago
fake_quantize_op.cu fix the hang bugs of memory copying. test=develop 6 years ago
fake_quantize_op.h rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop 6 years ago
fc_op.cc refine with comments 6 years ago
fc_op.h refine fc_infershape 6 years ago
fill_constant_batch_size_like_op.cc Fix some grad op desc makers (#16633) 6 years ago
fill_constant_batch_size_like_op.cu.cc
fill_constant_batch_size_like_op.h
fill_constant_op.cc Polish code style 6 years ago
fill_constant_op.cu.cc register float16 6 years ago
fill_constant_op.h make fill_constant kernel-based 6 years ago
fill_op.cc Change tensor uses proto::VarType::type 6 years ago
fill_zeros_like_op.cc Fix some grad op desc makers (#16633) 6 years ago
fill_zeros_like_op.cu.cc Fix some grad op desc makers (#16633) 6 years ago
fill_zeros_like_op.h
flatten_op.cc Memory optimize (#16410) 6 years ago
fsp_op.cc [slim] Add quantization strategy and distillation strategy. (#16408) 6 years ago
fsp_op.cu [slim] Add quantization strategy and distillation strategy. (#16408) 6 years ago
fsp_op.h [slim] Add quantization strategy and distillation strategy. (#16408) 6 years ago
gather.cu.h update DeepCF model 6 years ago
gather.h Fix gather & stack op (#14355) 6 years ago
gather_op.cc try to fix ci error 6 years ago
gather_op.cu Some improvements to support bert mixed precision training (#15585) 6 years ago
gather_op.h Return parent_idx in beam_search op (#15520) 6 years ago
gather_test.cc
gaussian_random_batch_size_like_op.cc Fix some grad op desc makers (#16633) 6 years ago
gaussian_random_op.cc clean code test=develop 6 years ago
gaussian_random_op.cu Revert ""cherry picked operators changes" (#12184)" (#12747) 7 years ago
get_tensor_from_selected_rows_op.cc Polish code style 6 years ago
grid_sampler_cudnn_op.cu.cc fix some inappropriate expressions in api doc for grid_sampler. test=develop 6 years ago
grid_sampler_op.cc infer shape compatable -1. test=develop 6 years ago
grid_sampler_op.h code style fix 6 years ago
group_norm_op.cc fix some grad op desc maker (#16581) 6 years ago
group_norm_op.cu fix pr 15313 6 years ago
group_norm_op.h Memory optimization of depthwise conv op and group norm op (#15313) 6 years ago
gru_op.cc update gru op forward kernel 6 years ago
gru_op.cu.cc update gru op forward kernel 6 years ago
gru_op.h update gru op forward kernel 6 years ago
gru_unit_op.cc change interface and api spec for dynamic_gru test=develop 6 years ago
gru_unit_op.cu Fix Eigen macro when using GPU 6 years ago
gru_unit_op.h complete gru_unite_op and test 6 years ago
hash_op.cc refine with comments 6 years ago
hash_op.h refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 6 years ago
hierarchical_sigmoid_op.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator 6 years ago
hierarchical_sigmoid_op.h test=develop, fix hsigmoid dereference nullptr (#16769) 6 years ago
hinge_loss_op.cc Fix some grad op desc makers (#16633) 6 years ago
hinge_loss_op.cu Fix Eigen macro when using GPU 6 years ago
hinge_loss_op.h
huber_loss_op.cc Fix some grad op desc makers (#16633) 6 years ago
huber_loss_op.cu Fix Eigen macro when using GPU 6 years ago
huber_loss_op.h fix the huber loss compile issue on windows test=develop 6 years ago
im2sequence_op.cc fix grad desc maker 6 years ago
im2sequence_op.cu Fix Eigen macro when using GPU 6 years ago
im2sequence_op.h Fix infershape of im2sequence. (#12183) 7 years ago
increment_op.cc
increment_op.cu
increment_op.h
interpolate_op.cc fix for itnerpolate. test=develop 6 years ago
interpolate_op.cu fix format. test=develop 6 years ago
interpolate_op.h round down for scale. test=develop 6 years ago
is_empty_op.cc Rewrite is_empty op to avoid unnecessary data transform. (#15509) 6 years ago
is_empty_op.cu.cc Rewrite is_empty op to avoid unnecessary data transform. (#15509) 6 years ago
is_empty_op.h Rewrite is_empty op to avoid unnecessary data transform. (#15509) 6 years ago
isfinite_op.cc Change tensor uses proto::VarType::type 6 years ago
isfinite_op.cu Fix Eigen macro when using GPU 6 years ago
isfinite_op.h enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688) 6 years ago
kldiv_loss_op.cc infer shape compatable -1. test=develop 6 years ago
kldiv_loss_op.cu fix grad check. test=develop 6 years ago
kldiv_loss_op.h fix doc. test=develop 6 years ago
l1_norm_op.cc fix grad desc maker 6 years ago
l1_norm_op.cu Fix Eigen macro when using GPU 6 years ago
l1_norm_op.h
label_smooth_op.cc fix grad desc maker 6 years ago
label_smooth_op.cu
label_smooth_op.h fix windows compile (#13147) 7 years ago
layer_norm_op.cc fix op grad maker 6 years ago
layer_norm_op.cu Use double to reduce 7 years ago
layer_norm_op.h fix op grad maker 6 years ago
linear_chain_crf_op.cc fix grad desc maker 6 years ago
linear_chain_crf_op.cu
linear_chain_crf_op.h
linspace_op.cc test=develop 6 years ago
linspace_op.cu test=develop 6 years ago
linspace_op.h test=develop 6 years ago
load_combine_op.cc fix mix input type error, test=develop 6 years ago
load_combine_op.cu checkpoint pr be moved here, test=develop 6 years ago
load_combine_op.h checkpoint pr be moved here, test=develop 6 years ago
load_op.cc fix load type, test=develop 6 years ago
load_op.cu checkpoint pr be moved here, test=develop 6 years ago
load_op.h checkpoint pr be moved here, test=develop 6 years ago
lod_array_length_op.cc
lod_rank_table_op.cc Polish code style 6 years ago
lod_reset_op.cc Correct the lod level of compiled time in lod_reset (#16790) 6 years ago
lod_reset_op.cu
lod_reset_op.h Correct the lod level of compiled time in lod_reset (#16790) 6 years ago
lod_tensor_to_array_op.cc Polish code style 6 years ago
log_loss_op.cc fix grad desc maker 6 years ago
log_loss_op.cu Fix Eigen macro when using GPU 6 years ago
log_loss_op.h
lookup_sparse_table_op.cc Change tensor uses proto::VarType::type 6 years ago
lookup_table_op.cc Fix op registry (#16677) 6 years ago
lookup_table_op.cu fix gpu build for lookup_table_op test=develop 6 years ago
lookup_table_op.h remote remote_prefetch in embedding layer test=develop 6 years ago
lrn_op.cc Change tensor uses proto::VarType::type 6 years ago
lrn_op.cu
lrn_op.h refine lrn_op cpu forward and speedup 6 years ago
lstm_op.cc revert some loop op revision 6 years ago
lstm_op.cu.cc
lstm_op.h Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp 6 years ago
lstm_unit_op.cc
lstm_unit_op.cu
lstm_unit_op.h Revert "cherry picked windows patches." 6 years ago
lstmp_op.cc optimize lstmp and sample_logits op, test=develop (#16845) 6 years ago
lstmp_op.cu
lstmp_op.h optimize lstmp and sample_logits op, test=develop (#16845) 6 years ago
margin_rank_loss_op.cc revert some loop op revision 6 years ago
margin_rank_loss_op.cu
margin_rank_loss_op.h
math.h revert revert 16144 6 years ago
matmul_op.cc fix matmul shape check; test=develop 6 years ago
max_sequence_len_op.cc
maxout_op.cc modified error info for maxout op 7 years ago
maxout_op.cu.cc
maxout_op.h
mean_iou_op.cc Change tensor uses proto::VarType::type 6 years ago
mean_iou_op.cu [Feature] Add Temporary Allocator (#14875) 6 years ago
mean_iou_op.h
mean_op.cc revert some loop op revision 6 years ago
mean_op.cu Fix Eigen macro when using GPU 6 years ago
mean_op.h Add fp16 backward support (#14202) 6 years ago
merge_lod_tensor_op.cc fix merge_lod_tensor_op infer shape, test=develop 6 years ago
merge_selected_rows_op.cc Refine merge_selected_rows Doc (#14748) 6 years ago
merge_selected_rows_op.cu.cc Fix clip.py (#14718) 6 years ago
merge_selected_rows_op.h Fix clip.py (#14718) 6 years ago
minus_op.cc
minus_op.cu
minus_op.h
modified_huber_loss_op.cc rewrite ddim 6 years ago
modified_huber_loss_op.cu
modified_huber_loss_op.h
mul_op.cc merge develop 6 years ago
mul_op.cu.cc Add fp16 backward support (#14202) 6 years ago
mul_op.h Process elemwise grad op's lod. mul_op's lod 7 years ago
multiplex_op.cc revert some loop op revision 6 years ago
multiplex_op.cu fix grad desc maker 6 years ago
multiplex_op.h fix grad desc maker 6 years ago
nce_op.cc Merge pull request #16172 from jacquesqiao/add-async-ssa-graph-executor-communicator 6 years ago
nce_op.h update nce and hierarchical_sigmoid remote_prefetch 6 years ago
norm_op.cc fix some grad op desc maker (#16581) 6 years ago
norm_op.cu Implement norm_op by CUDA instead of Eigen. (#13273) 7 years ago
norm_op.h test=develop 6 years ago
one_hot_op.cc
one_hot_op.cu Feature/template (#13093) 7 years ago
one_hot_op.h Feature/template (#13093) 7 years ago
pad2d_op.cc Fix infer_shape in pad2d_op 6 years ago
pad2d_op.cu Make pad2d support for variable paddings. (#14667) 6 years ago
pad_constant_like_op.cc Change tensor uses proto::VarType::type 6 years ago
pad_constant_like_op.cu Fix Eigen macro when using GPU 6 years ago
pad_constant_like_op.h Add pad_constant_like_op (#12943) 7 years ago
pad_op.cc fix grad desc maker 6 years ago
pad_op.cu Fix Eigen macro when using GPU 6 years ago
pad_op.h Add pad_constant_like_op (#12943) 7 years ago
pixel_shuffle_op.cc Add Pixel shuffle OP (#15782) 6 years ago
pixel_shuffle_op.cu Add Pixel shuffle OP (#15782) 6 years ago
pixel_shuffle_op.h Add Pixel shuffle OP (#15782) 6 years ago
pool_cudnn_op.cu.cc Add fp16 backward support (#14202) 6 years ago
pool_op.cc Add cpu_quantize_pass for C-API quantization (#16127) 6 years ago
pool_op.cu.cc
pool_op.h add adaptive pool 2d & 3d. test=develop 6 years ago
pool_with_index_op.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type 6 years ago
pool_with_index_op.cu.cc
pool_with_index_op.h add adaptive pool 2d & 3d. test=develop 6 years ago
positive_negative_pair_op.cc Change tensor uses proto::VarType::type 6 years ago
positive_negative_pair_op.h update CMakeLists.txt 6 years ago
prelu_op.cc clean 6 years ago
prelu_op.cu add prelu gpu inference 6 years ago
prelu_op.h Refine prelu_op 6 years ago
print_op.cc Change tensor uses proto::VarType::type 6 years ago
psroi_pool_op.cc fix grad desc maker 6 years ago
psroi_pool_op.cu this is for psroi_pool op, test=develop (#14796) 6 years ago
psroi_pool_op.h rewrite ddim 6 years ago
py_func_op.cc Fix py_func_op's problem 6 years ago
py_func_op.h try fix py2 6 years ago
quantize_op.cc Add Quantize OP 6 years ago
quantize_op.h Add Quantize OP 6 years ago
random_crop_op.cc Change tensor uses proto::VarType::type 6 years ago
random_crop_op.cu
random_crop_op.h fix security issue 27, 38 test=develop 6 years ago
range_op.cc [Operator] Add range op. (#15431) 6 years ago
range_op.cu [Operator] Add range op. (#15431) 6 years ago
range_op.h [Operator] Add range op. (#15431) 6 years ago
rank_loss_op.cc fix grad desc maker 6 years ago
rank_loss_op.cu
rank_loss_op.h
recurrent_op.cc Refine StaticRnn (#16707) 6 years ago
reorder_lod_tensor_by_rank_op.cc refine tensor_array_write_read (#14643) 6 years ago
requantize_op.cc Add Requantize OP (#15318) 6 years ago
requantize_op.h Add Requantize OP (#15318) 6 years ago
reshape_op.cc Memory optimize (#16410) 6 years ago
reverse_op.cc
reverse_op.cu
reverse_op.h
rnn_memory_helper_op.cc Refine StaticRnn (#16707) 6 years ago
roi_align_op.cc fix grad desc maker 6 years ago
roi_align_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 6 years ago
roi_align_op.h test=develop 6 years ago
roi_pool_op.cc polish the code 6 years ago
roi_pool_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 6 years ago
roi_pool_op.h Refine and fix some code for faster-rcnn. (#13135) 7 years ago
row_conv_op.cc polish the code 6 years ago
row_conv_op.cu
row_conv_op.h
sample_logits_op.cc optimize lstmp and sample_logits op, test=develop (#16845) 6 years ago
sample_logits_op.cu refine code 6 years ago
sample_logits_op.h refine code 6 years ago
sampling_id_op.cc fix 6 years ago
sampling_id_op.cu merge cpu and gpu 7 years ago
sampling_id_op.h refine 7 years ago
save_combine_op.cc fix mix input type error, test=develop 6 years ago
save_combine_op.cu fix mix input type error, test=develop 6 years ago
save_combine_op.h checkpoint pr be moved here, test=develop 6 years ago
save_load_combine_op_test.cc checkpoint pr be moved here, test=develop 6 years ago
save_load_op_test.cc checkpoint pr be moved here, test=develop 6 years ago
save_op.cc checkpoint pr be moved here, test=develop 6 years ago
save_op.cu checkpoint pr be moved here, test=develop 6 years ago
save_op.h checkpoint pr be moved here, test=develop 6 years ago
scale_op.cc Polish code style 6 years ago
scale_op.cu Add fp16 backward support (#14202) 6 years ago
scale_op.h Fix input<tensor> (#14208) 6 years ago
scatter.cu.h Fix gather & stack op (#14355) 6 years ago
scatter.h Fix gather & stack op (#14355) 6 years ago
scatter_op.cc scatter_op bug fix, test=develop (#16866) 6 years ago
scatter_op.cu
scatter_op.h Fix scatter_op python API (#12742) 7 years ago
scatter_test.cc Fix bug in uts 6 years ago
selu_op.cc Add selu (#14415) 6 years ago
selu_op.cu Add selu (#14415) 6 years ago
selu_op.h revert revert 16144 6 years ago
shape_op.cc fix shape api doc 6 years ago
shape_op.cu Add flatten op interface and enhance APIs about detection to support variable-length image. (#12422) 7 years ago
shape_op.h Add flatten op interface and enhance APIs about detection to support variable-length image. (#12422) 7 years ago
shrink_rnn_memory_op.cc refine tensor_array_write_read (#14643) 6 years ago
shuffle_channel_op.cc fix some grad op desc maker (#16581) 6 years ago
shuffle_channel_op.cu fix some grad op desc maker (#16581) 6 years ago
shuffle_channel_op.h fix some grad op desc maker (#16581) 6 years ago
sigmoid_cross_entropy_with_logits_op.cc supprt high rank; test=develop 6 years ago
sigmoid_cross_entropy_with_logits_op.cu revert revert 16144 6 years ago
sigmoid_cross_entropy_with_logits_op.h Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 6 years ago
sign_op.cc "fix op. test=develop" 6 years ago
sign_op.cu "fix op. test=develop" 6 years ago
sign_op.h
similarity_focus_op.cc Change tensor uses proto::VarType::type 6 years ago
similarity_focus_op.h add similarity_focus op 6 years ago
slice_op.cc fix some grad op desc maker (#16581) 6 years ago
slice_op.cu Fix the bug in fp16 backward kernel (#16269) 6 years ago
slice_op.h Implement slice grad operator. #8130 (#12330) 7 years ago
smooth_l1_loss_op.cc
smooth_l1_loss_op.cu Fix Eigen macro when using GPU 6 years ago
smooth_l1_loss_op.h
softmax_cudnn_op.cu.cc refine softmax kernel. test=develop 6 years ago
softmax_op.cc Merge pull request #16057 from heavengate/softmax_axis 6 years ago
softmax_op.cu.cc Add fp16 backward support (#14202) 6 years ago
softmax_op.h fix format. test=develop 6 years ago
softmax_with_cross_entropy_op.cc fix op grad maker 6 years ago
softmax_with_cross_entropy_op.cu Fix cross_entropy bug (#16236) 6 years ago
softmax_with_cross_entropy_op.h fix formax. test=develop 6 years ago
space_to_depth_op.cc Fix op registry (#16677) 6 years ago
space_to_depth_op.cu test=develop 6 years ago
space_to_depth_op.h test=develop 6 years ago
spectral_norm_op.cc infer shape compatable -1. test=develop 6 years ago
spectral_norm_op.cu fix spectral_norm doc. test=develop 6 years ago
spectral_norm_op.h fix format. test=develop 6 years ago
split_lod_tensor_op.cc fix bug in if-else op, test=develop 6 years ago
split_op.cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_split 6 years ago
split_op.cu.cc
split_op.h Refine Split op (#13967) 6 years ago
split_selected_rows_op.cc Polish code style 6 years ago
split_selected_rows_op.cu
split_selected_rows_op.h add some check 6 years ago
spp_op.cc
spp_op.cu.cc
spp_op.h add adaptive pool 2d & 3d. test=develop 6 years ago
squared_l2_distance_op.cc Fix op registry (#16677) 6 years ago
squared_l2_distance_op.cu Fix Eigen macro when using GPU 6 years ago
squared_l2_distance_op.h Security issue (#16774) 6 years ago
squared_l2_norm_op.cc Fix op registry (#16677) 6 years ago
squared_l2_norm_op.cu Fix Eigen macro when using GPU 6 years ago
squared_l2_norm_op.h
squeeze_op.cc fix squeeze shape check; test=develop 6 years ago
stack_op.cc Fix gather & stack op (#14355) 6 years ago
stack_op.cu Some improvements to support bert mixed precision training (#15585) 6 years ago
stack_op.h Add the macro for NVCC (test=develop) 6 years ago
strided_memcpy.h rewrite ddim 6 years ago
strided_memcpy_test.cc refactor(memory): rewrite memory allocation and make it extentable 6 years ago
sum_op.cc Fix sum infershape issue 6 years ago
sum_op.cu Fix Eigen macro when using GPU 6 years ago
sum_op.h rewrite variable type 6 years ago
sync_batch_norm_op.cc Support sync batch norm. (#16121) 6 years ago
sync_batch_norm_op.cu Support sync batch norm. (#16121) 6 years ago
teacher_student_sigmoid_loss_op.cc Fix op registry (#16677) 6 years ago
teacher_student_sigmoid_loss_op.h remove some comments & refine doc & put template class in .h 6 years ago
temporal_shift_op.cc fix some grad op desc maker (#16581) 6 years ago
temporal_shift_op.cu fix format. test=develop 6 years ago
temporal_shift_op.h fix format. test=develop 6 years ago
tensor_array_to_tensor_op.cc Polish code style 6 years ago
top_k_op.cc fix squeeze op shape check; test=develop 6 years ago
top_k_op.cu Make topk op support variable k. (#15044) 6 years ago
top_k_op.h Make topk op support variable k. (#15044) 6 years ago
transpose_op.cc - Added transpose/transpose2 MKLDNN grad ops 6 years ago
transpose_op.cu.cc Some improvements to support bert mixed precision training (#15585) 6 years ago
transpose_op.h
tree_conv_op.cc Fix op registry (#16677) 6 years ago
tree_conv_op.cu Tree conv op (#15217) 6 years ago
tree_conv_op.h Tree conv op (#15217) 6 years ago
truncated_gaussian_random_op.cc Fix truncated norm (#13785) 6 years ago
truncated_gaussian_random_op.cu Fix truncated norm (#13785) 6 years ago
uniform_random_batch_size_like_op.cc Fix some grad op desc makers (#16633) 6 years ago
uniform_random_op.cc Polish code style 6 years ago
uniform_random_op.cu fix shape type in uniform_random_op.cu 6 years ago
unpool_op.cc polish the code 6 years ago
unpool_op.cu.cc
unpool_op.h
unsqueeze_op.cc Refine reshape_grad and transpose_grad (#13074) 7 years ago
unstack_op.cc add unstack_op 7 years ago
unstack_op.h add unstack_op 7 years ago
warpctc_cudnn_op.cu.cc fix formax. test=develop 6 years ago
warpctc_op.cc Fix op registry (#16677) 6 years ago
warpctc_op.cu.cc
warpctc_op.h Complete sequence_padding GPU kernel 7 years ago