You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators
Qiao Longfei 039d783db5
change communicator_recv_wait_ms to communicator_max_send_grad_num_before_recv
7 years ago
..
benchmark Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493) 7 years ago
controlflow Revert "Optimize while_op when is_test is true. (#15811)" (#15968) 7 years ago
csp
detail Merge pull request #14933 from sneaxiy/rewrite_ddim 7 years ago
detection This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909) 7 years ago
distributed change communicator_recv_wait_ms to communicator_max_send_grad_num_before_recv 7 years ago
distributed_ops add some check 7 years ago
elementwise - MKL-DNN pooling updated to set_prim_desc 7 years ago
fused refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 7 years ago
jit fix jitcodekey and refine test 7 years ago
math Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator 7 years ago
metrics
mkldnn Optimize Quantize Op with primitive reuse. (#15929) 7 years ago
nccl
ngraph fix cpplint test=develop (#16028) 7 years ago
optimizers enable sgd jitkernel refer code and test 7 years ago
reader code format test=develop 7 years ago
reduce_ops test=develop 7 years ago
sequence_ops refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 7 years ago
tensorrt delete the usage of the const_cast 7 years ago
CMakeLists.txt Merge pull request #15609 from xuezhong/add_sample_logits_op 7 years ago
activation_cudnn.cu.cc polish cudnn related code and fix bug. (#15164) 7 years ago
activation_cudnn_op.cu.cc polish cudnn related code and fix bug. (#15164) 7 years ago
activation_op.cc polish cudnn related code and fix bug. (#15164) 7 years ago
activation_op.cu
activation_op.h Merge pull request #15931 from yihuaxu/develop_2c5c7b2a7_gelu_mkl_opt 7 years ago
add_position_encoding_op.cc
add_position_encoding_op.h
affine_channel_op.cc
affine_channel_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
affine_grid_cudnn_op.cu.cc
affine_grid_op.cc
affine_grid_op.h
alloc_continuous_space_op.cc Add alloc_continuous_space_op (#15900) 7 years ago
arg_max_op.cc
arg_max_op.cu
arg_min_max_op_base.h
arg_min_op.cc
arg_min_op.cu
argsort_op.cc
argsort_op.cu
argsort_op.h
array_operator.h
array_to_lod_tensor_op.cc
assign_op.cc
assign_value_op.cc
assign_value_op.cu.cc
assign_value_op.h
attention_lstm_op.cc fix warnings (#15790) 7 years ago
attention_lstm_op.h
average_accumulates_op.cc
average_accumulates_op.cu
average_accumulates_op.h
batch_norm_op.cc Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass 7 years ago
batch_norm_op.cu
batch_norm_op.h
batch_size_like.h
beam_search_decode_op.cc add per kernel config and remove const_cast. 7 years ago
beam_search_decode_op.h Change *(smart_ptr.get()) -> *smart_ptr 7 years ago
beam_search_decode_op_test.cc
beam_search_op.cc Return parent_idx in beam_search op (#15520) 7 years ago
beam_search_op.cu.cc Add the CUDA kernel for beam_search op (#15020) 7 years ago
beam_search_op.h Return parent_idx in beam_search op (#15520) 7 years ago
bilinear_tensor_product_op.cc
bilinear_tensor_product_op.cu
bilinear_tensor_product_op.h
bpr_loss_op.cc
bpr_loss_op.h Add the CUDA kernel for beam_search op (#15020) 7 years ago
cast_op.cc
cast_op.cu
cast_op.h
chunk_eval_op.cc
chunk_eval_op.h
clip_by_norm_op.cc
clip_by_norm_op.cu
clip_by_norm_op.h
clip_op.cc
clip_op.cu
clip_op.h
concat_op.cc
concat_op.cu.cc
concat_op.h
conv_cudnn_op.cu.cc add per kernel config and remove const_cast. 7 years ago
conv_cudnn_op_cache.h add per kernel config and remove const_cast. 7 years ago
conv_fusion_op.cc Inception fusion operator. (#14968) 7 years ago
conv_fusion_op.cu.cc polish 7 years ago
conv_op.cc Enable function coverage for U8/S8 ConvMKLDNNOpKernel 7 years ago
conv_op.cu.cc
conv_op.h Memory optimization of depthwise conv op and group norm op (#15313) 7 years ago
conv_shift_op.cc
conv_shift_op.cu
conv_shift_op.h
conv_transpose_cudnn_op.cu.cc Revert conv transpose cudnn (#15514) 7 years ago
conv_transpose_op.cc MKLDNN: Add UT for conv_transpose_mkldnn op. (#16030) 7 years ago
conv_transpose_op.cu.cc
conv_transpose_op.h
cos_sim_op.cc
cos_sim_op.cu
cos_sim_op.h
crf_decoding_op.cc fix warnings (#15790) 7 years ago
crf_decoding_op.h
crop_op.cc
crop_op.cu
crop_op.h
cross_entropy_op.cc loosly check in the InferShape of cross_entropy_op. (#15863) 7 years ago
cross_entropy_op.cu
cross_entropy_op.h
ctc_align_op.cc
ctc_align_op.cu
ctc_align_op.h
cudnn_lstm_op.cc
cudnn_lstm_op.cu.cc merge develop 7 years ago
cudnn_rnn_cache.h
cum_op.h
cumsum_op.cc
cumsum_op.cu
data_norm_op.cc remove mkldnn & fix commit 7 years ago
data_norm_op.h
delete_var_op.cc
dequantize_op.cc
dequantize_op.h
detection_map_op.cc
detection_map_op.h
dropout_op.cc
dropout_op.cu Some improvements to support bert mixed precision training (#15585) 7 years ago
dropout_op.h
dropout_op_test.cc
edit_distance_op.cc
edit_distance_op.cu
edit_distance_op.h
expand_op.cc support multiple var types for expand op, test=develop 7 years ago
expand_op.cu support multiple var types for expand op, test=develop 7 years ago
expand_op.h
fake_dequantize_op.cc
fake_dequantize_op.cu
fake_dequantize_op.h
fake_quantize_op.cc Fix bug in fake_quantize_op and add more unit testing (#15912) 7 years ago
fake_quantize_op.cu
fake_quantize_op.h
fc_op.cc fix warnings (#15790) 7 years ago
fc_op.h
fill_constant_batch_size_like_op.cc
fill_constant_batch_size_like_op.cu.cc
fill_constant_batch_size_like_op.h
fill_constant_op.cc register float16 7 years ago
fill_constant_op.cu.cc register float16 7 years ago
fill_constant_op.h make fill_constant kernel-based 7 years ago
fill_op.cc
fill_zeros_like_op.cc
fill_zeros_like_op.cu.cc
fill_zeros_like_op.h
flatten_op.cc squash commits. test=develop 7 years ago
gather.cu.h
gather.h
gather_op.cc Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
gather_op.cu Some improvements to support bert mixed precision training (#15585) 7 years ago
gather_op.h Return parent_idx in beam_search op (#15520) 7 years ago
gather_test.cc
gaussian_random_batch_size_like_op.cc
gaussian_random_op.cc
gaussian_random_op.cu
get_tensor_from_selected_rows_op.cc
grid_sampler_cudnn_op.cu.cc
grid_sampler_op.cc fix grid_sampler PADDLE_ENFORCE error. test=develop (#15542) 7 years ago
grid_sampler_op.h
group_norm_op.cc inplace group_norm (#15754) 7 years ago
group_norm_op.cu fix pr 15313 7 years ago
group_norm_op.h Memory optimization of depthwise conv op and group norm op (#15313) 7 years ago
gru_op.cc update gru op forward kernel 7 years ago
gru_op.cu.cc update gru op forward kernel 7 years ago
gru_op.h update gru op forward kernel 7 years ago
gru_unit_op.cc change interface and api spec for dynamic_gru test=develop 7 years ago
gru_unit_op.cu
gru_unit_op.h complete gru_unite_op and test 7 years ago
hash_op.cc refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 7 years ago
hash_op.h refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 7 years ago
hierarchical_sigmoid_op.cc code clean 7 years ago
hierarchical_sigmoid_op.h Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator 7 years ago
hinge_loss_op.cc
hinge_loss_op.cu
hinge_loss_op.h
huber_loss_op.cc
huber_loss_op.cu
huber_loss_op.h fix the huber loss compile issue on windows test=develop 7 years ago
im2sequence_op.cc
im2sequence_op.cu
im2sequence_op.h
increment_op.cc
increment_op.cu
increment_op.h
interpolate_op.cc refine image_resize annotation (#15976) 7 years ago
interpolate_op.cu test=develop 7 years ago
interpolate_op.h test=develop 7 years ago
is_empty_op.cc Rewrite is_empty op to avoid unnecessary data transform. (#15509) 7 years ago
is_empty_op.cu.cc Rewrite is_empty op to avoid unnecessary data transform. (#15509) 7 years ago
is_empty_op.h Rewrite is_empty op to avoid unnecessary data transform. (#15509) 7 years ago
isfinite_op.cc
isfinite_op.cu
isfinite_op.h
l1_norm_op.cc
l1_norm_op.cu
l1_norm_op.h
label_smooth_op.cc
label_smooth_op.cu
label_smooth_op.h
layer_norm_op.cc fix warnings (#15790) 7 years ago
layer_norm_op.cu
layer_norm_op.h
linear_chain_crf_op.cc fix warnings (#15790) 7 years ago
linear_chain_crf_op.cu
linear_chain_crf_op.h
load_combine_op.cc More restrict check load_combine_op. (#15479) 7 years ago
load_op.cc fix save and load ops on windows test=develop 7 years ago
lod_array_length_op.cc
lod_rank_table_op.cc
lod_reset_op.cc
lod_reset_op.cu
lod_reset_op.h
lod_tensor_to_array_op.cc
log_loss_op.cc
log_loss_op.cu
log_loss_op.h
lookup_sparse_table_op.cc
lookup_table_op.cc code clean 7 years ago
lookup_table_op.cu Some improvements to support bert mixed precision training (#15585) 7 years ago
lookup_table_op.h Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator 7 years ago
lrn_op.cc
lrn_op.cu
lrn_op.h
lstm_op.cc
lstm_op.cu.cc
lstm_op.h Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp 7 years ago
lstm_unit_op.cc
lstm_unit_op.cu
lstm_unit_op.h
lstmp_op.cc add cell clip and proj clip, fix bug for h0 7 years ago
lstmp_op.cu
lstmp_op.h Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp 7 years ago
margin_rank_loss_op.cc
margin_rank_loss_op.cu
margin_rank_loss_op.h
matmul_op.cc
max_sequence_len_op.cc
maxout_op.cc
maxout_op.cu.cc
maxout_op.h
mean_iou_op.cc
mean_iou_op.cu
mean_iou_op.h
mean_op.cc
mean_op.cu
mean_op.h
merge_lod_tensor_op.cc
merge_selected_rows_op.cc
merge_selected_rows_op.cu.cc
merge_selected_rows_op.h
minus_op.cc
minus_op.cu
minus_op.h
modified_huber_loss_op.cc
modified_huber_loss_op.cu
modified_huber_loss_op.h
mul_op.cc
mul_op.cu.cc
mul_op.h
multiplex_op.cc
multiplex_op.cu
multiplex_op.h
nce_op.cc code clean 7 years ago
nce_op.h Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator 7 years ago
norm_op.cc
norm_op.cu
norm_op.h test=develop 7 years ago
one_hot_op.cc
one_hot_op.cu
one_hot_op.h
pad2d_op.cc
pad2d_op.cu
pad_constant_like_op.cc
pad_constant_like_op.cu
pad_constant_like_op.h
pad_op.cc
pad_op.cu
pad_op.h
pool_cudnn_op.cu.cc
pool_op.cc use kernel size in global_pooling. test=develop 7 years ago
pool_op.cu.cc
pool_op.h
pool_with_index_op.cc
pool_with_index_op.cu.cc
pool_with_index_op.h
positive_negative_pair_op.cc
positive_negative_pair_op.h update CMakeLists.txt 7 years ago
prelu_op.cc
prelu_op.cu
prelu_op.h
print_op.cc
psroi_pool_op.cc
psroi_pool_op.cu
psroi_pool_op.h
py_func_op.cc try fix py2 7 years ago
py_func_op.h try fix py2 7 years ago
quantize_op.cc
quantize_op.h
random_crop_op.cc
random_crop_op.cu
random_crop_op.h fix security issue 27, 38 test=develop 7 years ago
rank_loss_op.cc
rank_loss_op.cu
rank_loss_op.h
recurrent_op.cc
reorder_lod_tensor_by_rank_op.cc
reshape_op.cc Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass 7 years ago
reverse_op.cc
reverse_op.cu
reverse_op.h
rnn_memory_helper_op.cc
roi_align_op.cc
roi_align_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
roi_align_op.h
roi_pool_op.cc
roi_pool_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
roi_pool_op.h
row_conv_op.cc Fix row_conv doc 7 years ago
row_conv_op.cu
row_conv_op.h
sample_logits_op.cc remove non-ascii charactor 7 years ago
sample_logits_op.cu refine code 7 years ago
sample_logits_op.h refine code 7 years ago
sampling_id_op.cc
sampling_id_op.cu
sampling_id_op.h
save_combine_op.cc fix save and load ops on windows test=develop 7 years ago
save_load_combine_op_test.cc
save_load_op_test.cc
save_op.cc fix save and load ops on windows test=develop 7 years ago
scale_op.cc squash commits. test=develop 7 years ago
scale_op.cu
scale_op.h
scatter.cu.h
scatter.h
scatter_op.cc
scatter_op.cu
scatter_op.h
scatter_test.cc
selu_op.cc
selu_op.cu
selu_op.h
shape_op.cc fix shape api doc 7 years ago
shape_op.cu
shape_op.h
shrink_rnn_memory_op.cc
shuffle_channel_op.cc rewrite the comments, test=develop 7 years ago
shuffle_channel_op.cu
shuffle_channel_op.h
sigmoid_cross_entropy_with_logits_op.cc Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
sigmoid_cross_entropy_with_logits_op.cu Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
sigmoid_cross_entropy_with_logits_op.h Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) 7 years ago
sign_op.cc
sign_op.cu
sign_op.h
similarity_focus_op.cc
similarity_focus_op.h
slice_op.cc add lod for slice op, test=develop 7 years ago
slice_op.cu
slice_op.h
smooth_l1_loss_op.cc
smooth_l1_loss_op.cu
smooth_l1_loss_op.h
softmax_cudnn_op.cu.cc
softmax_op.cc squash commits. test=develop 7 years ago
softmax_op.cu.cc
softmax_op.h
softmax_with_cross_entropy_op.cc change default option related to softmax, test=develop 7 years ago
softmax_with_cross_entropy_op.cu [Feature] support mix precision training for resnet (#14899) 7 years ago
softmax_with_cross_entropy_op.h
space_to_depth_op.cc
space_to_depth_op.cu
space_to_depth_op.h
split_lod_tensor_op.cc
split_op.cc
split_op.cu.cc
split_op.h
split_selected_rows_op.cc
split_selected_rows_op.cu
split_selected_rows_op.h add some check 7 years ago
spp_op.cc
spp_op.cu.cc
spp_op.h
squared_l2_distance_op.cc
squared_l2_distance_op.cu
squared_l2_distance_op.h
squared_l2_norm_op.cc
squared_l2_norm_op.cu
squared_l2_norm_op.h
squeeze_op.cc
stack_op.cc
stack_op.cu Some improvements to support bert mixed precision training (#15585) 7 years ago
stack_op.h
strided_memcpy.h
strided_memcpy_test.cc
sum_op.cc fix sum_op selected rows test=develop 7 years ago
sum_op.cu
sum_op.h
teacher_student_sigmoid_loss_op.cc remove mkl & fix commit 7 years ago
teacher_student_sigmoid_loss_op.h
tensor_array_to_tensor_op.cc
top_k_op.cc
top_k_op.cu
top_k_op.h
transpose_op.cc
transpose_op.cu.cc Some improvements to support bert mixed precision training (#15585) 7 years ago
transpose_op.h
tree_conv_op.cc Tree conv op (#15217) 7 years ago
tree_conv_op.cu Tree conv op (#15217) 7 years ago
tree_conv_op.h Tree conv op (#15217) 7 years ago
truncated_gaussian_random_op.cc
truncated_gaussian_random_op.cu
uniform_random_batch_size_like_op.cc
uniform_random_op.cc
uniform_random_op.cu
unpool_op.cc
unpool_op.cu.cc
unpool_op.h
unsqueeze_op.cc
unstack_op.cc
unstack_op.h
warpctc_cudnn_op.cu.cc Revert conv transpose cudnn (#15514) 7 years ago
warpctc_op.cc
warpctc_op.cu.cc
warpctc_op.h