Paddle

History

Qiao Longfei 039d783db5 change communicator_recv_wait_ms to communicator_max_send_grad_num_before_recv		7 years ago
..
benchmark	Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493 )	7 years ago
controlflow	Revert "Optimize while_op when is_test is true. (#15811 )" (#15968 )	7 years ago
csp	…
detail	Merge pull request #14933 from sneaxiy/rewrite_ddim	7 years ago
detection	This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909 )	7 years ago
distributed	change communicator_recv_wait_ms to communicator_max_send_grad_num_before_recv	7 years ago
distributed_ops	add some check	7 years ago
elementwise	- MKL-DNN pooling updated to set_prim_desc	7 years ago
fused	refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool	7 years ago
jit	fix jitcodekey and refine test	7 years ago
math	Merge branch 'add-communicator' of ssh://github.com/jacquesqiao/Paddle into add-async-ssa-graph-executor-communicator	7 years ago
metrics	…
mkldnn	Optimize Quantize Op with primitive reuse. (#15929 )	7 years ago
nccl	…
ngraph	fix cpplint test=develop (#16028 )	7 years ago
optimizers	enable sgd jitkernel refer code and test	7 years ago
reader	code format test=develop	7 years ago
reduce_ops	test=develop	7 years ago
sequence_ops	refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool	7 years ago
tensorrt	delete the usage of the const_cast	7 years ago
CMakeLists.txt	Merge pull request #15609 from xuezhong/add_sample_logits_op	7 years ago
activation_cudnn.cu.cc	polish cudnn related code and fix bug. (#15164 )	7 years ago
activation_cudnn_op.cu.cc	polish cudnn related code and fix bug. (#15164 )	7 years ago
activation_op.cc	polish cudnn related code and fix bug. (#15164 )	7 years ago
activation_op.cu	…
activation_op.h	Merge pull request #15931 from yihuaxu/develop_2c5c7b2a7_gelu_mkl_opt	7 years ago
add_position_encoding_op.cc	…
add_position_encoding_op.h	…
affine_channel_op.cc	…
affine_channel_op.cu	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
affine_grid_cudnn_op.cu.cc	…
affine_grid_op.cc	…
affine_grid_op.h	…
alloc_continuous_space_op.cc	Add alloc_continuous_space_op (#15900 )	7 years ago
arg_max_op.cc	…
arg_max_op.cu	…
arg_min_max_op_base.h	…
arg_min_op.cc	…
arg_min_op.cu	…
argsort_op.cc	…
argsort_op.cu	…
argsort_op.h	…
array_operator.h	…
array_to_lod_tensor_op.cc	…
assign_op.cc	…
assign_value_op.cc	…
assign_value_op.cu.cc	…
assign_value_op.h	…
attention_lstm_op.cc	fix warnings (#15790 )	7 years ago
attention_lstm_op.h	…
average_accumulates_op.cc	…
average_accumulates_op.cu	…
average_accumulates_op.h	…
batch_norm_op.cc	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	7 years ago
batch_norm_op.cu	…
batch_norm_op.h	…
batch_size_like.h	…
beam_search_decode_op.cc	add per kernel config and remove const_cast.	7 years ago
beam_search_decode_op.h	Change (smart_ptr.get()) -> smart_ptr	7 years ago
beam_search_decode_op_test.cc	…
beam_search_op.cc	Return parent_idx in beam_search op (#15520 )	7 years ago
beam_search_op.cu.cc	Add the CUDA kernel for beam_search op (#15020 )	7 years ago
beam_search_op.h	Return parent_idx in beam_search op (#15520 )	7 years ago
bilinear_tensor_product_op.cc	…
bilinear_tensor_product_op.cu	…
bilinear_tensor_product_op.h	…
bpr_loss_op.cc	…
bpr_loss_op.h	Add the CUDA kernel for beam_search op (#15020 )	7 years ago
cast_op.cc	…
cast_op.cu	…
cast_op.h	…
chunk_eval_op.cc	…
chunk_eval_op.h	…
clip_by_norm_op.cc	…
clip_by_norm_op.cu	…
clip_by_norm_op.h	…
clip_op.cc	…
clip_op.cu	…
clip_op.h	…
concat_op.cc	…
concat_op.cu.cc	…
concat_op.h	…
conv_cudnn_op.cu.cc	add per kernel config and remove const_cast.	7 years ago
conv_cudnn_op_cache.h	add per kernel config and remove const_cast.	7 years ago
conv_fusion_op.cc	Inception fusion operator. (#14968 )	7 years ago
conv_fusion_op.cu.cc	polish	7 years ago
conv_op.cc	Enable function coverage for U8/S8 ConvMKLDNNOpKernel	7 years ago
conv_op.cu.cc	…
conv_op.h	Memory optimization of depthwise conv op and group norm op (#15313 )	7 years ago
conv_shift_op.cc	…
conv_shift_op.cu	…
conv_shift_op.h	…
conv_transpose_cudnn_op.cu.cc	Revert conv transpose cudnn (#15514 )	7 years ago
conv_transpose_op.cc	MKLDNN: Add UT for conv_transpose_mkldnn op. (#16030 )	7 years ago
conv_transpose_op.cu.cc	…
conv_transpose_op.h	…
cos_sim_op.cc	…
cos_sim_op.cu	…
cos_sim_op.h	…
crf_decoding_op.cc	fix warnings (#15790 )	7 years ago
crf_decoding_op.h	…
crop_op.cc	…
crop_op.cu	…
crop_op.h	…
cross_entropy_op.cc	loosly check in the InferShape of cross_entropy_op. (#15863 )	7 years ago
cross_entropy_op.cu	…
cross_entropy_op.h	…
ctc_align_op.cc	…
ctc_align_op.cu	…
ctc_align_op.h	…
cudnn_lstm_op.cc	…
cudnn_lstm_op.cu.cc	merge develop	7 years ago
cudnn_rnn_cache.h	…
cum_op.h	…
cumsum_op.cc	…
cumsum_op.cu	…
data_norm_op.cc	remove mkldnn & fix commit	7 years ago
data_norm_op.h	…
delete_var_op.cc	…
dequantize_op.cc	…
dequantize_op.h	…
detection_map_op.cc	…
detection_map_op.h	…
dropout_op.cc	…
dropout_op.cu	Some improvements to support bert mixed precision training (#15585 )	7 years ago
dropout_op.h	…
dropout_op_test.cc	…
edit_distance_op.cc	…
edit_distance_op.cu	…
edit_distance_op.h	…
expand_op.cc	support multiple var types for expand op, test=develop	7 years ago
expand_op.cu	support multiple var types for expand op, test=develop	7 years ago
expand_op.h	…
fake_dequantize_op.cc	…
fake_dequantize_op.cu	…
fake_dequantize_op.h	…
fake_quantize_op.cc	Fix bug in fake_quantize_op and add more unit testing (#15912 )	7 years ago
fake_quantize_op.cu	…
fake_quantize_op.h	…
fc_op.cc	fix warnings (#15790 )	7 years ago
fc_op.h	…
fill_constant_batch_size_like_op.cc	…
fill_constant_batch_size_like_op.cu.cc	…
fill_constant_batch_size_like_op.h	…
fill_constant_op.cc	register float16	7 years ago
fill_constant_op.cu.cc	register float16	7 years ago
fill_constant_op.h	make fill_constant kernel-based	7 years ago
fill_op.cc	…
fill_zeros_like_op.cc	…
fill_zeros_like_op.cu.cc	…
fill_zeros_like_op.h	…
flatten_op.cc	squash commits. test=develop	7 years ago
gather.cu.h	…
gather.h	…
gather_op.cc	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
gather_op.cu	Some improvements to support bert mixed precision training (#15585 )	7 years ago
gather_op.h	Return parent_idx in beam_search op (#15520 )	7 years ago
gather_test.cc	…
gaussian_random_batch_size_like_op.cc	…
gaussian_random_op.cc	…
gaussian_random_op.cu	…
get_tensor_from_selected_rows_op.cc	…
grid_sampler_cudnn_op.cu.cc	…
grid_sampler_op.cc	fix grid_sampler PADDLE_ENFORCE error. test=develop (#15542 )	7 years ago
grid_sampler_op.h	…
group_norm_op.cc	inplace group_norm (#15754 )	7 years ago
group_norm_op.cu	fix pr 15313	7 years ago
group_norm_op.h	Memory optimization of depthwise conv op and group norm op (#15313 )	7 years ago
gru_op.cc	update gru op forward kernel	7 years ago
gru_op.cu.cc	update gru op forward kernel	7 years ago
gru_op.h	update gru op forward kernel	7 years ago
gru_unit_op.cc	change interface and api spec for dynamic_gru test=develop	7 years ago
gru_unit_op.cu	…
gru_unit_op.h	complete gru_unite_op and test	7 years ago
hash_op.cc	refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool	7 years ago
hash_op.h	refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool	7 years ago
hierarchical_sigmoid_op.cc	code clean	7 years ago
hierarchical_sigmoid_op.h	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator	7 years ago
hinge_loss_op.cc	…
hinge_loss_op.cu	…
hinge_loss_op.h	…
huber_loss_op.cc	…
huber_loss_op.cu	…
huber_loss_op.h	fix the huber loss compile issue on windows test=develop	7 years ago
im2sequence_op.cc	…
im2sequence_op.cu	…
im2sequence_op.h	…
increment_op.cc	…
increment_op.cu	…
increment_op.h	…
interpolate_op.cc	refine image_resize annotation (#15976 )	7 years ago
interpolate_op.cu	test=develop	7 years ago
interpolate_op.h	test=develop	7 years ago
is_empty_op.cc	Rewrite is_empty op to avoid unnecessary data transform. (#15509 )	7 years ago
is_empty_op.cu.cc	Rewrite is_empty op to avoid unnecessary data transform. (#15509 )	7 years ago
is_empty_op.h	Rewrite is_empty op to avoid unnecessary data transform. (#15509 )	7 years ago
isfinite_op.cc	…
isfinite_op.cu	…
isfinite_op.h	…
l1_norm_op.cc	…
l1_norm_op.cu	…
l1_norm_op.h	…
label_smooth_op.cc	…
label_smooth_op.cu	…
label_smooth_op.h	…
layer_norm_op.cc	fix warnings (#15790 )	7 years ago
layer_norm_op.cu	…
layer_norm_op.h	…
linear_chain_crf_op.cc	fix warnings (#15790 )	7 years ago
linear_chain_crf_op.cu	…
linear_chain_crf_op.h	…
load_combine_op.cc	More restrict check load_combine_op. (#15479 )	7 years ago
load_op.cc	fix save and load ops on windows test=develop	7 years ago
lod_array_length_op.cc	…
lod_rank_table_op.cc	…
lod_reset_op.cc	…
lod_reset_op.cu	…
lod_reset_op.h	…
lod_tensor_to_array_op.cc	…
log_loss_op.cc	…
log_loss_op.cu	…
log_loss_op.h	…
lookup_sparse_table_op.cc	…
lookup_table_op.cc	code clean	7 years ago
lookup_table_op.cu	Some improvements to support bert mixed precision training (#15585 )	7 years ago
lookup_table_op.h	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator	7 years ago
lrn_op.cc	…
lrn_op.cu	…
lrn_op.h	…
lstm_op.cc	…
lstm_op.cu.cc	…
lstm_op.h	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	7 years ago
lstm_unit_op.cc	…
lstm_unit_op.cu	…
lstm_unit_op.h	…
lstmp_op.cc	add cell clip and proj clip, fix bug for h0	7 years ago
lstmp_op.cu	…
lstmp_op.h	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp	7 years ago
margin_rank_loss_op.cc	…
margin_rank_loss_op.cu	…
margin_rank_loss_op.h	…
matmul_op.cc	…
max_sequence_len_op.cc	…
maxout_op.cc	…
maxout_op.cu.cc	…
maxout_op.h	…
mean_iou_op.cc	…
mean_iou_op.cu	…
mean_iou_op.h	…
mean_op.cc	…
mean_op.cu	…
mean_op.h	…
merge_lod_tensor_op.cc	…
merge_selected_rows_op.cc	…
merge_selected_rows_op.cu.cc	…
merge_selected_rows_op.h	…
minus_op.cc	…
minus_op.cu	…
minus_op.h	…
modified_huber_loss_op.cc	…
modified_huber_loss_op.cu	…
modified_huber_loss_op.h	…
mul_op.cc	…
mul_op.cu.cc	…
mul_op.h	…
multiplex_op.cc	…
multiplex_op.cu	…
multiplex_op.h	…
nce_op.cc	code clean	7 years ago
nce_op.h	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator	7 years ago
norm_op.cc	…
norm_op.cu	…
norm_op.h	test=develop	7 years ago
one_hot_op.cc	…
one_hot_op.cu	…
one_hot_op.h	…
pad2d_op.cc	…
pad2d_op.cu	…
pad_constant_like_op.cc	…
pad_constant_like_op.cu	…
pad_constant_like_op.h	…
pad_op.cc	…
pad_op.cu	…
pad_op.h	…
pool_cudnn_op.cu.cc	…
pool_op.cc	use kernel size in global_pooling. test=develop	7 years ago
pool_op.cu.cc	…
pool_op.h	…
pool_with_index_op.cc	…
pool_with_index_op.cu.cc	…
pool_with_index_op.h	…
positive_negative_pair_op.cc	…
positive_negative_pair_op.h	update CMakeLists.txt	7 years ago
prelu_op.cc	…
prelu_op.cu	…
prelu_op.h	…
print_op.cc	…
psroi_pool_op.cc	…
psroi_pool_op.cu	…
psroi_pool_op.h	…
py_func_op.cc	try fix py2	7 years ago
py_func_op.h	try fix py2	7 years ago
quantize_op.cc	…
quantize_op.h	…
random_crop_op.cc	…
random_crop_op.cu	…
random_crop_op.h	fix security issue 27, 38 test=develop	7 years ago
rank_loss_op.cc	…
rank_loss_op.cu	…
rank_loss_op.h	…
recurrent_op.cc	…
reorder_lod_tensor_by_rank_op.cc	…
reshape_op.cc	Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass	7 years ago
reverse_op.cc	…
reverse_op.cu	…
reverse_op.h	…
rnn_memory_helper_op.cc	…
roi_align_op.cc	…
roi_align_op.cu	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
roi_align_op.h	…
roi_pool_op.cc	…
roi_pool_op.cu	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
roi_pool_op.h	…
row_conv_op.cc	Fix row_conv doc	7 years ago
row_conv_op.cu	…
row_conv_op.h	…
sample_logits_op.cc	remove non-ascii charactor	7 years ago
sample_logits_op.cu	refine code	7 years ago
sample_logits_op.h	refine code	7 years ago
sampling_id_op.cc	…
sampling_id_op.cu	…
sampling_id_op.h	…
save_combine_op.cc	fix save and load ops on windows test=develop	7 years ago
save_load_combine_op_test.cc	…
save_load_op_test.cc	…
save_op.cc	fix save and load ops on windows test=develop	7 years ago
scale_op.cc	squash commits. test=develop	7 years ago
scale_op.cu	…
scale_op.h	…
scatter.cu.h	…
scatter.h	…
scatter_op.cc	…
scatter_op.cu	…
scatter_op.h	…
scatter_test.cc	…
selu_op.cc	…
selu_op.cu	…
selu_op.h	…
shape_op.cc	fix shape api doc	7 years ago
shape_op.cu	…
shape_op.h	…
shrink_rnn_memory_op.cc	…
shuffle_channel_op.cc	rewrite the comments, test=develop	7 years ago
shuffle_channel_op.cu	…
shuffle_channel_op.h	…
sigmoid_cross_entropy_with_logits_op.cc	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
sigmoid_cross_entropy_with_logits_op.cu	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
sigmoid_cross_entropy_with_logits_op.h	Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371 )	7 years ago
sign_op.cc	…
sign_op.cu	…
sign_op.h	…
similarity_focus_op.cc	…
similarity_focus_op.h	…
slice_op.cc	add lod for slice op, test=develop	7 years ago
slice_op.cu	…
slice_op.h	…
smooth_l1_loss_op.cc	…
smooth_l1_loss_op.cu	…
smooth_l1_loss_op.h	…
softmax_cudnn_op.cu.cc	…
softmax_op.cc	squash commits. test=develop	7 years ago
softmax_op.cu.cc	…
softmax_op.h	…
softmax_with_cross_entropy_op.cc	change default option related to softmax, test=develop	7 years ago
softmax_with_cross_entropy_op.cu	[Feature] support mix precision training for resnet (#14899 )	7 years ago
softmax_with_cross_entropy_op.h	…
space_to_depth_op.cc	…
space_to_depth_op.cu	…
space_to_depth_op.h	…
split_lod_tensor_op.cc	…
split_op.cc	…
split_op.cu.cc	…
split_op.h	…
split_selected_rows_op.cc	…
split_selected_rows_op.cu	…
split_selected_rows_op.h	add some check	7 years ago
spp_op.cc	…
spp_op.cu.cc	…
spp_op.h	…
squared_l2_distance_op.cc	…
squared_l2_distance_op.cu	…
squared_l2_distance_op.h	…
squared_l2_norm_op.cc	…
squared_l2_norm_op.cu	…
squared_l2_norm_op.h	…
squeeze_op.cc	…
stack_op.cc	…
stack_op.cu	Some improvements to support bert mixed precision training (#15585 )	7 years ago
stack_op.h	…
strided_memcpy.h	…
strided_memcpy_test.cc	…
sum_op.cc	fix sum_op selected rows test=develop	7 years ago
sum_op.cu	…
sum_op.h	…
teacher_student_sigmoid_loss_op.cc	remove mkl & fix commit	7 years ago
teacher_student_sigmoid_loss_op.h	…
tensor_array_to_tensor_op.cc	…
top_k_op.cc	…
top_k_op.cu	…
top_k_op.h	…
transpose_op.cc	…
transpose_op.cu.cc	Some improvements to support bert mixed precision training (#15585 )	7 years ago
transpose_op.h	…
tree_conv_op.cc	Tree conv op (#15217 )	7 years ago
tree_conv_op.cu	Tree conv op (#15217 )	7 years ago
tree_conv_op.h	Tree conv op (#15217 )	7 years ago
truncated_gaussian_random_op.cc	…
truncated_gaussian_random_op.cu	…
uniform_random_batch_size_like_op.cc	…
uniform_random_op.cc	…
uniform_random_op.cu	…
unpool_op.cc	…
unpool_op.cu.cc	…
unpool_op.h	…
unsqueeze_op.cc	…
unstack_op.cc	…
unstack_op.h	…
warpctc_cudnn_op.cu.cc	Revert conv transpose cudnn (#15514 )	7 years ago
warpctc_op.cc	…
warpctc_op.cu.cc	…
warpctc_op.h	…