You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Paddle/paddle/fluid/operators
niuliling123 a71d72d921
relu forward and backward with vectortype (#31869)
4 years ago
..
amp [ROCM] update fluid collective op for rocm, test=develop (#31075) 4 years ago
benchmark [ROCM] update fluid collective op for rocm, test=develop (#31075) 4 years ago
collective c_gen_nccl_id add SocketServer to persit server (#31589) 4 years ago
controlflow [ROCM] update fluid operators for rocm (part1), test=develop (#31077) 4 years ago
detail [ROCM] update fluid collective op for rocm, test=develop (#31075) 4 years ago
detection Polish two error messages (#31852) 4 years ago
distributed [ROCM] update fluid operators for rocm (part1), test=develop (#31077) 4 years ago
distributed_ops delete include framework.pb.h (#31859) 4 years ago
elementwise [oneDNN] Added Elementwise Mul grad fp32/bf16 (#31647) 4 years ago
fused delete include framework.pb.h (#31859) 4 years ago
jit use iwyu clean include second time, test=develop (#30829) 4 years ago
lite [ROCM] update fluid operators for rocm (part3), test=develop (#31213) 4 years ago
math [oneDNN] lookup_table op with support for BF16 data type. (#31558) 4 years ago
metrics [ROCM] update fluid operators for rocm (part1), test=develop (#31077) 4 years ago
mkldnn fix cache key in concat oneDNN kernel (#31820) 4 years ago
nccl [ROCM] update fluid framework for rocm (part6), test=develop (#31015) 4 years ago
optimizers lamb_op_xpu;test=kunlun (#31012) 4 years ago
pscore fix compilation errors for missing brpc header files, test=develop (#31325) 4 years ago
reader delete include framework.pb.h (#31859) 4 years ago
reduce_ops [ROCM] fix reduce_sum nan in ROCM platform, test=develop (#31780) 4 years ago
sequence_ops [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
tensorrt [Paddle-TRT] Fix engine key in trt int8 calibration (#31513) 4 years ago
CMakeLists.txt [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
abs_op.cc make abs op support complex types (#30375) 4 years ago
abs_op.cu make abs op support complex types (#30375) 4 years ago
abs_op.h make abs op support complex types (#30375) 4 years ago
activation_cudnn.cu.cc [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
activation_cudnn_op.cu.cc [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
activation_op.cc [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
activation_op.cu relu forward and backward with vectortype (#31869) 4 years ago
activation_op.h fix ELU output for nan, test=develop (#31132) 4 years ago
activation_op_xpu.cc update activation op on kunlun (#29577) 4 years ago
add_position_encoding_op.cc Polish no onwer ops error message (#27448) 4 years ago
add_position_encoding_op.h OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement (#24261) 5 years ago
addmm_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
addmm_op.cu test=develop, add addmm op (#23384) 5 years ago
addmm_op.h test=develop, add addmm op (#23384) 5 years ago
affine_channel_op.cc API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … (#24118) 5 years ago
affine_channel_op.cu Bugfix rocm (#31490) 4 years ago
affine_channel_op_xpu.cc support roi_align & affine_channel for kunlun (#29561) 4 years ago
affine_grid_cudnn_op.cu.cc [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
affine_grid_op.cc [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
affine_grid_op.cu Fix cuda kernel of affine grid (#27003) 5 years ago
affine_grid_op.h 【2.0 API】Enhance affine grid operator (#26385) 5 years ago
allclose_op.cc Fix the accuracy problem of allclose op when using float64 data type in static mode. (#29890) 4 years ago
allclose_op.cu [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
allclose_op.h Fix the accuracy problem of allclose op when using float64 data type in static mode. (#29890) 4 years ago
arg_max_op.cc add the op version check for the elementwise ops, test=op_version (#30010) 4 years ago
arg_max_op.cu add cub impl for arg max, min (#25941) 5 years ago
arg_min_max_op_base.cu.h [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
arg_min_max_op_base.h optimize compilation time of argmin/argmax op (#29595) 4 years ago
arg_min_op.cc add the op version check for the elementwise ops, test=op_version (#30010) 4 years ago
arg_min_op.cu add cub impl for arg max, min (#25941) 5 years ago
argsort_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
argsort_op.cu [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
argsort_op.h Correct CPU gradients of the argsort op (#22739) 5 years ago
array_operator.h fix bugs in transformer predict in xpu place (#30730) 4 years ago
array_to_lod_tensor_op.cc [ROCM] update fluid operators for rocm (part5), test=develop (#31258) 4 years ago
ascend_trigger_op.cc Ascend Framework Part1: OP & Wrapper (#30281) 4 years ago
ascend_trigger_op.h Ascend Framework Part1: OP & Wrapper (#30281) 4 years ago
assert_op.cc use iwyu clean include (#27267) 4 years ago
assign_op.cc [ROCM] update fluid operators for rocm (part5), test=develop (#31258) 4 years ago
assign_op.h use iwyu clean include second time, test=develop (#30829) 4 years ago
assign_op_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
assign_op_xpu.cc add cast/concat/assign xpu op (#27911) 4 years ago
assign_value_op.cc use iwyu clean include (#27267) 4 years ago
assign_value_op.cu.cc Add the support of bool list for assign_value op (#23774) 5 years ago
assign_value_op.h use iwyu clean include (#27267) 4 years ago
attention_lstm_op.cc modify error message based on comments (#30189) 4 years ago
attention_lstm_op.h
average_accumulates_op.cc c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop (#23631) 5 years ago
average_accumulates_op.cu Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
average_accumulates_op.h optimize unity build (#30195) 4 years ago
batch_fc_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
batch_fc_op.cu [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
batch_fc_op.h Add batch_fc op in contrib (#24017) 5 years ago
batch_norm_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
batch_norm_op.cu [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
batch_norm_op.h support channel last in BatchNorm*d 4 years ago
batch_norm_op_xpu.cc optimize batch_norm & pool op for kunlun (#30490) 4 years ago
batch_size_like.h refine the error message for bath size like OP (#27446) 4 years ago
bce_loss_op.cc change to use bce_loss op, add shape check for bce_loss 5 years ago
bce_loss_op.cu [ROCM] update fluid operators for rocm (part6), test=develop (#31301) 4 years ago
bce_loss_op.h change to use bce_loss op, add shape check for bce_loss 5 years ago
beam_search_decode_op.cc Optimize the error message for OP (#27617) 4 years ago
beam_search_decode_op.h use iwyu clean include second time, test=develop (#30829) 4 years ago
beam_search_decode_op_test.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
beam_search_op.cc Fix beam_search InferShape (#25169) 5 years ago
beam_search_op.cu.cc
beam_search_op.h API/OP(sequence_first_step, sequence_last_step, sequence_mask, beam_search, beam_search_decode) error message enhancement (#24590) 5 years ago
bernoulli_op.cc Refine paddle.manual_seed (#26496) 5 years ago
bernoulli_op.cu use cuda generator in bernoulli cuda kernel (#30199) 4 years ago
bernoulli_op.h Refine bernoulli and unsqueeze op (#26842) 5 years ago
bilateral_slice_op.cc Fix bilateral inference shape bug (#26822) 4 years ago
bilateral_slice_op.cu Enable bilateral_slice unittest on windows platform (#29896) 4 years ago
bilateral_slice_op.h Add bilateral_slice op (#25401) 5 years ago
bilinear_tensor_product_op.cc modify error message based on comments (#30189) 4 years ago
bilinear_tensor_product_op.cu
bilinear_tensor_product_op.h fix typo words (#22653) 5 years ago
bmm_op.cc create bmm op and move several api from fluid.layers to tensor (#23457) 5 years ago
bmm_op.cu [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
bmm_op.h create bmm op and move several api from fluid.layers to tensor (#23457) 5 years ago
bpr_loss_op.cc enhance cvm bpr_loss adam adagrad adamax ftrl error message, test=develop (#24452) 5 years ago
bpr_loss_op.h enhance cvm bpr_loss adam adagrad adamax ftrl error message, test=develop (#24452) 5 years ago
cast_op.cc [oneDNN] Initial bf16 amp integration (#31093) 4 years ago
cast_op.cu add VecCastCUDAKernel (#30296) 4 years ago
cast_op.h delete include framework.pb.h (#31859) 4 years ago
cast_op_xpu.cc support some shape for matmul and cast in xpu place (#29900) 4 years ago
center_loss_op.cc API/OP (center_loss, fluid.one_hot, prroi_pool, roi_pool, ctc_greed_decoder) error message enhancement (#23794) 5 years ago
center_loss_op.cu Error message opt, test=develop (#27467) 4 years ago
center_loss_op.h Add center Loss Op Support (#18681) 6 years ago
cholesky_op.cc Add cholesky_op (#23543) 5 years ago
cholesky_op.cu [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
cholesky_op.h add error message for cholesky (#26444) 5 years ago
chunk_eval_op.cc API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement (#24513) 5 years ago
chunk_eval_op.h Optimize the error message for OP (#27617) 4 years ago
clip_by_norm_op.cc
clip_by_norm_op.cu
clip_by_norm_op.h Fix the nan bug when passing all zero values into clip_by_norm_op. (#30777) 4 years ago
clip_by_norm_op_xpu.cc add clip_by_norm on kunlun, *test=kunlun (#30862) 4 years ago
clip_op.cc Add clip double grad (#29590) 4 years ago
clip_op.cu add clamp api, test=develop (#23273) 5 years ago
clip_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
coalesce_tensor_op.cc [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
common_infer_shape_functions.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
common_infer_shape_functions.h Add broadcast_shape api (#28257) 4 years ago
concat_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
concat_op.cu.cc Add support for tuple of concat Op test=develop (#25800) 5 years ago
concat_op.h fix concat dimension (#25606) 5 years ago
concat_op_xpu.cc fix bugs in transformer predict in xpu place (#30730) 4 years ago
conj_op.cc add conj op for complex types (#29527) 4 years ago
conj_op.cu add conj op for complex types (#29527) 4 years ago
conj_op.h complex gradient matmul (#29966) 4 years ago
conv_cudnn_helper.h enable exhaustive_search for forward and backward algos when dtype is float16 (#30959) 4 years ago
conv_cudnn_op.cu [ROCM] fix conv2d and conv3d op, test=develop (#31553) 4 years ago
conv_cudnn_op_cache.h Delete cudnn6 code (#31835) 4 years ago
conv_miopen_helper.h [ROCM] fix conv2d and conv3d op, test=develop (#31553) 4 years ago
conv_op.cc update conv2d, test=develop (#31480) 4 years ago
conv_op.cu.cc
conv_op.h support channel last in BatchNorm*d 4 years ago
conv_op_xpu.cc add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056) 4 years ago
conv_shift_op.cc Remove extraneous comma in error messages (#24478) 5 years ago
conv_shift_op.cu
conv_shift_op.h
conv_transpose_cudnn_op.cu [ROCM] fix test_conv2d_transpose_op (#31749) 4 years ago
conv_transpose_op.cc [ROCM] update fluid operators for rocm (part4), test=develop (#31225) 4 years ago
conv_transpose_op.cu Add double grad for conv_transpose (#29706) 4 years ago
conv_transpose_op.h fix bug of DepthwiseConvTransposeGradKernel (#31762) 4 years ago
correlation_op.cc Add correlation api to contrib (#27015) 4 years ago
correlation_op.cu [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
cos_sim_op.cc Fix ops doc for some ops 4 years ago
cos_sim_op.cu
cos_sim_op.h
crf_decoding_op.cc Format error message for ops (#24482) 5 years ago
crf_decoding_op.h Format error message for ops (#24482) 5 years ago
crop_op.cc rename inplace/no_need_buffer inferer, part4, test=develop (#24781) 5 years ago
crop_op.cu fix document of 11 APIs (#20278) 5 years ago
crop_op.h Improving error reporting messages for ops (#24438) 5 years ago
crop_tensor_op.cc Improving error reporting messages for ops (#24438) 5 years ago
crop_tensor_op.cu All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756) 5 years ago
crop_tensor_op.h Fix the bug that Input(Offsets) and attr(offsets) cannot be set at the same time. (#24975) 5 years ago
cross_entropy_op.cc Fix bug: Don't check dims if contain_unknown_dim of cross_entropy_grad_op in compile time (#25221) 5 years ago
cross_entropy_op.cu
cross_entropy_op.h Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op (#24485) 5 years ago
cross_op.cc add new api for Paddle2.0: nonzero, index_selct, roll, cross (#23176) 5 years ago
cross_op.cu add new api for Paddle2.0: nonzero, index_selct, roll, cross (#23176) 5 years ago
cross_op.h add new api for Paddle2.0: nonzero, index_selct, roll, cross (#23176) 5 years ago
ctc_align_op.cc API/OP (center_loss, fluid.one_hot, prroi_pool, roi_pool, ctc_greed_decoder) error message enhancement (#23794) 5 years ago
ctc_align_op.cu Error message opt, test=develop (#27467) 4 years ago
ctc_align_op.h Error message opt, test=develop (#27467) 4 years ago
cudnn_lstm_cache.h Delete cudnn6 code (#31835) 4 years ago
cudnn_lstm_op.cc add REGISTER_OP_VERSION for LSTM (#30038) 4 years ago
cudnn_lstm_op.cu.cc [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
cudnn_rnn_cache.h Delete cudnn6 code (#31835) 4 years ago
cum_op.h fix cumsum op for API 2.0, optimize performance 5 years ago
cumsum_op.cc register cumsum Op version for compatible Op upgrades (#26734) 5 years ago
cumsum_op.cu [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
cvm_op.cc enhance error message, test=develop (#30220) 4 years ago
cvm_op.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
cvm_op.h mod cvm test=develop (#25146) 5 years ago
data_norm_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
data_norm_op.cu [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
data_norm_op.h
deformable_conv_filter.cu.h add deformable conv v1 op and cpu version of deformable conv v2 (#18500) 5 years ago
deformable_conv_func.h add deformable conv v1 op and cpu version of deformable conv v2 (#18500) 5 years ago
deformable_conv_op.cc Refine error message, test=develop (#23823) 5 years ago
deformable_conv_op.cu add deformable conv v1 op and cpu version of deformable conv v2 (#18500) 5 years ago
deformable_conv_op.h Fix compling warning in deformable conv. (#20036) 5 years ago
deformable_conv_op_xpu.cc fix expand/uniform_random && concat/transpose to new api on xpu (#29280) 4 years ago
deformable_conv_v1_op.cc Refine error message, test=develop (#23823) 5 years ago
deformable_conv_v1_op.cu add deformable conv v1 op and cpu version of deformable conv v2 (#18500) 5 years ago
deformable_conv_v1_op.h Fix compling warning in deformable conv. (#20036) 5 years ago
deformable_psroi_pooling_op.cc OP(retinanet_detection_output, retinanet_target_assign, sigmoid_focal_loss, deformable_roi_pooling) error message enhancement. test=develop (#23726) 5 years ago
deformable_psroi_pooling_op.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
deformable_psroi_pooling_op.h OP(retinanet_detection_output, retinanet_target_assign, sigmoid_focal_loss, deformable_roi_pooling) error message enhancement. test=develop (#23726) 5 years ago
delete_var_op.cc use iwyu clean include (#27267) 4 years ago
dequantize_abs_max_op.cc use iwyu clean include (#27267) 4 years ago
dequantize_abs_max_op.cu add dequantize_abs_max op and modify lookup_table op (#20899) 5 years ago
dequantize_abs_max_op.h use iwyu clean include (#27267) 4 years ago
dequantize_log_op.cc use iwyu clean include (#27267) 4 years ago
dequantize_log_op.cu remove pow to speed up in dequantize_log op (#24607) 5 years ago
dequantize_log_op.h use iwyu clean include (#27267) 4 years ago
dequantize_op.cc operator checkpoints for new attributes. (#29832) 4 years ago
dequantize_op.h
dequeue_op.cc add queue_generator_op, dequeue_op, enqueue_op and ut (#24481) 5 years ago
detection_map_op.cc optimize the error meesage for detetion_map_op 4 years ago
detection_map_op.h optimize the error meesage for detetion_map_op 4 years ago
dgc_clip_by_norm_op.cc Optimize error message, include dgc, nccl, size op (#24456) 5 years ago
dgc_clip_by_norm_op.cu
dgc_clip_by_norm_op.h
dgc_op.cc Optimize error message, include dgc, nccl, size op (#24456) 5 years ago
dgc_op.cu
dgc_op.h Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759) 5 years ago
diag_embed_op.cc add diag_embed op (#23385) 5 years ago
diag_embed_op.cu add diag_embed op (#23385) 5 years ago
diag_embed_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
diag_op.cc API(argsort, argmax, argmin, cast, diag) error message enhancement 5 years ago
diag_op.cu
diag_op.h
diag_v2_op.cc Fix diag OP bug on Windows Python3.8 4 years ago
diag_v2_op.cu Fix bug: The calculation result of Diag_v2 Op under large size input is wrong (#27447) 4 years ago
diag_v2_op.h add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel 5 years ago
dist_op.cc add dist op (#23503) 5 years ago
dist_op.cu [ROCM] fix test_dist_op ci test, test=develop (#31468) 4 years ago
dist_op.h use eval to improve performance, test=develop (#25459) 4 years ago
dot_op.cc complex gradient matmul (#29966) 4 years ago
dot_op.cu complex gradient matmul (#29966) 4 years ago
dot_op.h [ROCM] fix test_matmul_v2_op (#31802) 4 years ago
dropout_op.cc API/OP error message enhancement (#23691) 5 years ago
dropout_op.cu [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
dropout_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
dropout_op_test.cc
dropout_op_xpu.cc Polish kunlun error (#27974) 4 years ago
edit_distance_op.cc Format error message for ops (#24482) 5 years ago
edit_distance_op.cu Optimize the error message for OP (#27617) 4 years ago
edit_distance_op.h Optimize the error message for OP (#27617) 4 years ago
empty_op.cc add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287) 4 years ago
empty_op.cu.cc add empty op (c++, python, unit test) (#26659) 4 years ago
empty_op.h add empty op (c++, python, unit test) (#26659) 4 years ago
enqueue_op.cc use iwyu clean include (#27267) 4 years ago
erf_op.cc add approximation for gelu, test=develop (#22961) 5 years ago
erf_op.cu add erf op (#21785) 5 years ago
erf_op.h add erf op (#21785) 5 years ago
expand_as_op.cc Optimize the error message for OP (#27617) 4 years ago
expand_as_op.cu add register op_data_type of pad/expand_as et.al (#21718) 5 years ago
expand_as_op.h Optimize the error message for OP (#27617) 4 years ago
expand_as_v2_op.cc update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020) 4 years ago
expand_as_v2_op.cu Improve expand as (#26290) 5 years ago
expand_as_v2_op.h update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020) 4 years ago
expand_op.cc add double grad for expand (#27183) 4 years ago
expand_op.cu register fp16 kernel for some ops (#22650) (#22696) 5 years ago
expand_op.h fix expand/uniform_random && concat/transpose to new api on xpu (#29280) 4 years ago
expand_v2_op.cc fix the bug in expand_v2 op (#30984) 4 years ago
expand_v2_op.cu [API 2.0] adaptive expand op to use shape instead of expand_times (#26206) 5 years ago
expand_v2_op.h [API 2.0] adaptive expand op to use shape instead of expand_times (#26206) 5 years ago
eye_op.cc support Baidu Kunlun AI Accelerator (#25959) 5 years ago
eye_op.cu add eye op, kernel and unitest test=develop (#18980) 6 years ago
eye_op.h use iwyu clean include (#27267) 4 years ago
fake_dequantize_op.cc add op version for fake_quant and fake_dequant ops, test=op_version (#29923) 4 years ago
fake_dequantize_op.cu Fix fake_quant error when cout > 1024, test=develop (#28603) 4 years ago
fake_dequantize_op.h [Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639) 5 years ago
fake_quantize_op.cc [dygraph qat] Use layer to calculate output scale (#31861) 4 years ago
fake_quantize_op.cu [dygraph qat] Use layer to calculate output scale (#31861) 4 years ago
fake_quantize_op.h [dygraph qat] Use layer to calculate output scale (#31861) 4 years ago
fc_op.cc Polish the error message of fc, fused_fc_elementwise_layernorm and fused_embedding_seq_pool. (#27692) 4 years ago
fc_op.cu.cc Implement the GPU kernel of fc operator (#19687) 6 years ago
fc_op.h Polish the error message of fc, fused_fc_elementwise_layernorm and fused_embedding_seq_pool. (#27692) 4 years ago
fill_any_like_op.cc Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
fill_any_like_op.cu Add the zeros, ones, ones_like, zeros_like for api 2.0, test=develop (#23471) 5 years ago
fill_any_like_op.h Add the error raise for some operators, add some test cases 5 years ago
fill_constant_batch_size_like_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
fill_constant_batch_size_like_op.cu.cc Add seq2seq api related code (#19820) 5 years ago
fill_constant_batch_size_like_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
fill_constant_op.cc Recompute Offload (#30233) 4 years ago
fill_constant_op.cu.cc Add complex dtype op (add) test example (#29603) 4 years ago
fill_constant_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
fill_constant_op_xpu.cc add conj op for complex types (#29527) 4 years ago
fill_op.cc Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
fill_op.cu.cc add kernel for fill_op, test=develop (#19719) 5 years ago
fill_op.h Delete Ref & VectorRef and add GetDataSafely (#22997) 5 years ago
fill_zeros_like_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
fill_zeros_like_op.cu.cc
fill_zeros_like_op.h
filter_by_instag_op.cc Update paddle enforce message (#24498) 5 years ago
filter_by_instag_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
flatten_op.cc add new flatten op test=develop (#25393) 5 years ago
flatten_op.cu.cc add new flatten op test=develop (#25393) 5 years ago
flatten_op.h fix flatten api grad (#30426) 4 years ago
flip_op.cc add op_version for flip op [test=op_version] (#30019) 4 years ago
flip_op.cu modify flip test=develop (#25312) 5 years ago
flip_op.h modify flip test=develop (#25312) 5 years ago
fsp_op.cc Fix fsp_op error message,test=develop (#24405) 5 years ago
fsp_op.cu
fsp_op.h Remove disable flag in test_fsp_op.py (#22171) 5 years ago
gather.cu.h fix error message of gather nd (#29521) 4 years ago
gather.h add paddle.gather for API2.0 (#26455) 5 years ago
gather_nd_op.cc gather_nd Op for API 2.0 refine (#26540) 5 years ago
gather_nd_op.cu fix error message, test=develop (#24425) 5 years ago
gather_nd_op.h fix error message, test=develop (#24425) 5 years ago
gather_op.cc refine gather OP performance for dynamic mode (#28587) 4 years ago
gather_op.cu add paddle.gather for API2.0 (#26455) 5 years ago
gather_op.h add paddle.gather for API2.0 (#26455) 5 years ago
gather_op_xpu.cc add gather_op xpu, test=kunlun (#27822) 4 years ago
gather_test.cc use iwyu clean include (#27267) 4 years ago
gather_tree_op.cc Optimize the error message of OP. (#27478) 4 years ago
gather_tree_op.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
gather_tree_op.h Add seq2seq api related code (#19820) 5 years ago
gaussian_random_batch_size_like_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
gaussian_random_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
gaussian_random_op.cu add empty op (c++, python, unit test) (#26659) 4 years ago
gaussian_random_op_xpu.cc Add gaussian_random XPU kernels (#27853) 4 years ago
gelu_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
gelu_op.cu add approximation for gelu, test=develop (#22961) 5 years ago
gelu_op.h [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
get_tensor_from_selected_rows_op.cc [ROCM] update fluid operators for rocm (part7), test=develop (#31307) 4 years ago
grid_sampler_cudnn_op.cu.cc [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
grid_sampler_op.cc [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
grid_sampler_op.cu Fix round in grid sample op (#27657) 4 years ago
grid_sampler_op.h Make grid support stopping graients. (#27630) 4 years ago
group_norm_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
group_norm_op.cu [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
group_norm_op.h fixed group_norm's bug and modified unittest (#20506) 5 years ago
gru_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
gru_op.cu.cc use iwyu clean include (#27267) 4 years ago
gru_op.h
gru_unit_op.cc Move input_size check into RunTime phrase of gru_unit_op and refine error message (#24776) 5 years ago
gru_unit_op.cu
gru_unit_op.h optimize unity build (#31119) 4 years ago
hash_op.cc use iwyu clean include (#27267) 4 years ago
hash_op.h use iwyu clean include (#27267) 4 years ago
hierarchical_sigmoid_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
hierarchical_sigmoid_op.h [Feature] one ps (3/4) (#29604) 4 years ago
hinge_loss_op.cc Error message enhancement of 6 op (#23759) 5 years ago
hinge_loss_op.cu
hinge_loss_op.h
histogram_op.cc Add histc op (#24562) 5 years ago
histogram_op.cu update histogram op for performance optimization, test=develop (#24912) 4 years ago
histogram_op.h Add histc op (#24562) 5 years ago
huber_loss_op.cc Error message enhancement of 6 op (#23759) 5 years ago
huber_loss_op.cu support fp64 in huber_loss cuda kernel (#26583) 5 years ago
huber_loss_op.h Add dygraph execution context (#20157) 5 years ago
im2sequence_op.cc Enhance checking in some operator. (#24473) 5 years ago
im2sequence_op.cu
im2sequence_op.h
imag_op.cc [Complex] Add real & imag op and api for complex tensor (#29672) 4 years ago
imag_op.cu [Complex] Add real & imag op and api for complex tensor (#29672) 4 years ago
imag_op.h [Complex] Add real & imag op and api for complex tensor (#29672) 4 years ago
increment_op.cc use iwyu clean include (#27267) 4 years ago
increment_op.cu
increment_op.h
index_sample_op.cc Update index sample (#24109) 5 years ago
index_sample_op.cu fix the bug in backward OP of index_sample. (#31026) 4 years ago
index_sample_op.h fix header file paths of gflags, commit 3, test=develop (#30273) 4 years ago
index_select_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
index_select_op.cu [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
index_select_op.h test=develop, bug fix for index_select and roll op (#25251) 5 years ago
inplace_abn_op.cc delete include framework.pb.h (#31859) 4 years ago
inplace_abn_op.cu [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
inplace_abn_op.h Add inplace abn op (#22806) 5 years ago
instance_norm_op.cc register ModifyAttr for instance_norm, test=op_version (#30065) 4 years ago
instance_norm_op.cu [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
instance_norm_op.h improve efficiency of runtime InferVarType (#22778) 5 years ago
interpolate_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
interpolate_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
interpolate_op.h add linear interpolate operator (#23357) 5 years ago
interpolate_op_xpu.cc update activation op on kunlun (#29577) 4 years ago
interpolate_v2_op.cc Polish some error message in opeators (#27876) 4 years ago
interpolate_v2_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
interpolate_v2_op.h fix typo for interp_v2,test=develop (#26843) 4 years ago
interpolate_v2_op_xpu.cc add nearest_interp_v2 on kunlun (#29725) 4 years ago
inverse_op.cc Add the implementation of inverse (#23310) 5 years ago
inverse_op.cu.cc Add the implementation of inverse (#23310) 5 years ago
inverse_op.h Add the implementation of inverse (#23310) 5 years ago
is_empty_op.cc fix check and error message for flatten hash is_empty op (#24434) 5 years ago
is_empty_op.cu.cc
is_empty_op.h
isfinite_op.cc use iwyu clean include (#27267) 4 years ago
isfinite_op.cu
isfinite_op.h use iwyu clean include (#27267) 4 years ago
isfinite_v2_op.cc fix isfinite_v2_op OpProtoAndCheckerMaker AddComment bug (#29626) 4 years ago
isfinite_v2_op.cu Add isfinite v2 op (#26344) 5 years ago
isfinite_v2_op.h use iwyu clean include (#27267) 4 years ago
kldiv_loss_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
kldiv_loss_op.cu
kldiv_loss_op.h optimize unity build (#31119) 4 years ago
kron_op.cc type promotion for grad (#30177) 4 years ago
kron_op.cu Make transpose, trace, kron, reshape, sum op support complex type (#29321) 4 years ago
kron_op.h [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
l1_norm_op.cc API/OP error message enhancement (#23684) 5 years ago
l1_norm_op.cu
l1_norm_op.h API/OP error message enhancement (#23684) 5 years ago
label_smooth_op.cc use iwyu clean include (#27267) 4 years ago
label_smooth_op.cu Fix default label dim of label_smooth_op. test=develop (#21862) 5 years ago
label_smooth_op.h Fix default label dim of label_smooth_op. test=develop (#21862) 5 years ago
layer_norm_op.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
layer_norm_op.cu [ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_math_op_patch_var_base (#31709) 4 years ago
layer_norm_op.h [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
layer_norm_op_xpu.cc support transformer v2.0 (#30381) 4 years ago
layout_utils.h support channel last in BatchNorm*d 4 years ago
linear_chain_crf_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
linear_chain_crf_op.h optimize unity build (#31119) 4 years ago
linspace_op.cc Register op version for linspace,test=op_version (#30025) 4 years ago
linspace_op.cu refine the precious of linspace Op using half way (#27452) 4 years ago
linspace_op.h refine the precious of linspace Op using half way (#27452) 4 years ago
load_combine_op.cc
load_combine_op.cu
load_combine_op.h fix eigen in push sparse; fix hadoop command (#26872) 5 years ago
load_op.cc memory leak for cpu (#21174) 5 years ago
load_op.cu
load_op.h Op (Save/Load) error message enhancement, test=develop (#23650) 5 years ago
load_op_xpu.cc add load_op_xpu for Baidu Kunlun (#27817) 4 years ago
lod_array_length_op.cc use iwyu clean include (#27267) 4 years ago
lod_rank_table_op.cc enhance array_to_lod_tensor_op lod_tensor_to_array_op errors informaiton (#27386) 4 years ago
lod_reset_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
lod_reset_op.cu
lod_reset_op.h Api (lod_append) error message enhancement (#23541) 5 years ago
lod_tensor_to_array_op.cc [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
log_loss_op.cc API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … (#24118) 5 years ago
log_loss_op.cu
log_loss_op.h
log_loss_op_xpu.cc Polish kunlun error (#27974) 4 years ago
log_softmax_op.cc log_softmax and LogSoftmax: impl kernel and refind docs (#26088) 5 years ago
log_softmax_op.cu log_softmax and LogSoftmax: impl kernel and refind docs (#26088) 5 years ago
log_softmax_op.h fix softmax cross entropy integer overflow (#30590) 4 years ago
lookup_table_dequant_op.cc add lookup_table_dequant_op (#22900) 5 years ago
lookup_table_dequant_op.h [Feature] one ps (3/4) (#29604) 4 years ago
lookup_table_op.cc [oneDNN] lookup_table op with support for BF16 data type. (#31558) 4 years ago
lookup_table_op.cu Bugfix rocm (#31490) 4 years ago
lookup_table_op.h [oneDNN] lookup_table op with support for BF16 data type. (#31558) 4 years ago
lookup_table_v2_op.cc enhance error messages of lookup_tale, merge_ids, data_norm (#27619) 4 years ago
lookup_table_v2_op.cu fix gpu outofrange (#29238) 4 years ago
lookup_table_v2_op.h [Feature] one ps (3/4) (#29604) 4 years ago
lookup_table_v2_op_xpu.cc add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056) 4 years ago
lrn_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
lrn_op.cu lrn supports channel_last input, test=develop (#20954) 5 years ago
lrn_op.h Error message enhancement of 6 op (#23759) 5 years ago
lstm_op.cc API(dynamic_lstm, dynamic_lstmp) error message enhancement (#24450) 5 years ago
lstm_op.cu.cc
lstm_op.h API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement (#24513) 5 years ago
lstm_unit_op.cc API(lstm_unit, lstmp, sequence_mask, sequence_enumerate, sequence_conv) error message enhancement (#27572) 4 years ago
lstm_unit_op.cu API(lstm_unit, lstmp, sequence_mask, sequence_enumerate, sequence_conv) error message enhancement (#27572) 4 years ago
lstm_unit_op.h API(lstm_unit, lstmp, sequence_mask, sequence_enumerate, sequence_conv) error message enhancement (#27572) 4 years ago
lstmp_op.cc API(dynamic_lstm, dynamic_lstmp) error message enhancement (#24450) 5 years ago
lstmp_op.cu
lstmp_op.h Modify relu native implementation 2 (#30996) 4 years ago
margin_rank_loss_op.cc API/OP (margin_rank_loss, nce, row_conv, positive_negative_pair) erro… (#24246) 5 years ago
margin_rank_loss_op.cu
margin_rank_loss_op.h
masked_select_op.cc 【API2.0】add masked_select Op for API2.0 (#26374) 5 years ago
masked_select_op.cu 【API2.0】add masked_select Op for API2.0 (#26374) 5 years ago
masked_select_op.h 【API2.0】add masked_select Op for API2.0 (#26374) 5 years ago
match_matrix_tensor_op.cc Refine error message of MatchMatrix and PyramidHash (#27484) 4 years ago
match_matrix_tensor_op.h Add match_matrix_tensor op (#18525) 6 years ago
math.h codegen for fused elementwise operation (#19520) 6 years ago
matmul_op.cc Polish two error messages (#31852) 4 years ago
matmul_op_xpu.cc opt matmul and matmul_v2 on kunlun, *test=kunlun (#31326) 4 years ago
matmul_v2_op.cc type promotion for grad (#30177) 4 years ago
matmul_v2_op.cu add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
matmul_v2_op.h [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
matmul_v2_op_xpu.cc opt matmul and matmul_v2 on kunlun, *test=kunlun (#31326) 4 years ago
max_sequence_len_op.cc use iwyu clean include (#27267) 4 years ago
maxout_op.cc refine APIs: brelu, hardsigmoid, hardswish, maxout (#27658) 4 years ago
maxout_op.cu.cc
maxout_op.h refine APIs: brelu, hardsigmoid, hardswish, maxout (#27658) 4 years ago
mean_iou_op.cc OP Normal, Uniform, Xavier Initializer, smooth_l1, mean_iou error message enhancement (#23751) 5 years ago
mean_iou_op.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
mean_iou_op.h
mean_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
mean_op.cu [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
mean_op.h OP error message enhancement of l2_normalize, matmul, mean, etc 5 years ago
mean_op_xpu.cc error message optimization in mean_xpu,softmax_with_cross_entropy_op_xpu,test=kunlun (#27967) 4 years ago
memcpy_op.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
memcpy_op.h use iwyu clean include second time, test=develop (#30829) 4 years ago
merge_lod_tensor_op.cc [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
merge_selected_rows_op.cc Improving error reporting messages for ops (#24438) 5 years ago
merge_selected_rows_op.cu.cc
merge_selected_rows_op.h
meshgrid_op.cc Add meshgrid op (#23736) 5 years ago
meshgrid_op.cu Add meshgrid op (#23736) 5 years ago
meshgrid_op.h optimize unity build (#30195) 4 years ago
minus_op.cc OP(minus) error message enhancement. test=develop (#23621) 5 years ago
minus_op.cu
minus_op.h
miopen_lstm_cache.h [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
miopen_rnn_cache.h [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
mish_op.cc add mish op. (#24565) 5 years ago
mish_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
mish_op.h add mish op. (#24565) 5 years ago
modified_huber_loss_op.cc Error message enhancement of 6 op (#23759) 5 years ago
modified_huber_loss_op.cu
modified_huber_loss_op.h [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
mul_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
mul_op.cu.cc
mul_op.h Add mkldnn int8 mul-op kernel (#17834) 6 years ago
mul_op_xpu.cc support elementwise add, activation, matmul on Baidu Kunlun (#27143) 4 years ago
multinomial_op.cc Fix error message of multinomial op (#27946) 4 years ago
multinomial_op.cu [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
multinomial_op.h Fix error message of multinomial op (#27946) 4 years ago
multiplex_op.cc Upgrade Error Message for AucOP & MultiplexOP (#24458) 5 years ago
multiplex_op.cu Upgrade Error Message for AucOP & MultiplexOP (#24458) 5 years ago
multiplex_op.h Upgrade Error Message for AucOP & MultiplexOP (#24458) 5 years ago
mv_op.cc update mv op according PR#27024 (#27474) 4 years ago
mv_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
mv_op.h update mv op according PR#27024 (#27474) 4 years ago
nce_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
nce_op.h [Feature] one ps (3/4) (#29604) 4 years ago
nll_loss_op.cc enhance error message of nll_loss op test=develop (#30125) 4 years ago
nll_loss_op.cu [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
nll_loss_op.h Polish two error messages (#31852) 4 years ago
norm_op.cc API/OP error message enhancement (#23684) 5 years ago
norm_op.cu [ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_math_op_patch_var_base (#31709) 4 years ago
norm_op.h
norm_utils.cu.h [ROCM] update fluid operators for rocm (part8), test=develop (#31309) 4 years ago
norm_utils.h add instance norm (#19500) 5 years ago
one_hot_op.cc delete include framework.pb.h (#31859) 4 years ago
one_hot_op.cu supports collective communicated training (#18175) 6 years ago
one_hot_op.h Error message enhancement of 6 op (#23759) 5 years ago
one_hot_op_xpu.cc delete include framework.pb.h (#31859) 4 years ago
one_hot_v2_op.cc delete include framework.pb.h (#31859) 4 years ago
one_hot_v2_op.cu Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716) 5 years ago
one_hot_v2_op.h Update paddle enforce message (#24498) 5 years ago
one_hot_v2_op_xpu.cc delete include framework.pb.h (#31859) 4 years ago
op_debug_string_test.cc use iwyu clean include (#27267) 4 years ago
p_norm_op.cc Add p_norm op version info (#30042) 4 years ago
p_norm_op.cu [ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_math_op_patch_var_base (#31709) 4 years ago
p_norm_op.h Norm op support 2-axis (#26492) 5 years ago
pad2d_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
pad2d_op.cu Fix index overflow bug of the CUDA kernel loop increment (#25435) 5 years ago
pad3d_op.cc add pad and concat double grad (#29549) 4 years ago
pad3d_op.cu add pad func (#26106) 5 years ago
pad_constant_like_op.cc OP(pad, pad2d, pad_constant_like) error message enhancement (#23882) 5 years ago
pad_constant_like_op.cu add register op_data_type of pad/expand_as et.al (#21718) 5 years ago
pad_constant_like_op.h Update the precision of pad, pad2d, pad_constant_like's unit tests from fp32 to fp64 (#22394) 5 years ago
pad_op.cc add pad and concat double grad (#29549) 4 years ago
pad_op.cu add register op_data_type of pad/expand_as et.al (#21718) 5 years ago
pad_op.h Add fp16 support for pad and split (#19881) 5 years ago
partial_concat_op.cc Imperative tracer refactoring (#22457) 5 years ago
partial_concat_op.cu Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
partial_concat_op.h add partial_concat op in contrib (#22528) 5 years ago
partial_sum_op.cc Imperative tracer refactoring (#22457) 5 years ago
partial_sum_op.cu Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
partial_sum_op.h add partial_sum op in contrib (#22292) 5 years ago
pixel_shuffle_op.cc Add version checking, test=op_version (#30129) 4 years ago
pixel_shuffle_op.cu
pixel_shuffle_op.h [Api2.0] add pixel shuffle (#26071) 5 years ago
pool_cudnn_op.cu.cc [ROCM] update fluid operators for rocm (part4), test=develop (#31225) 4 years ago
pool_op.cc [ROCM] update fluid operators for rocm (part4), test=develop (#31225) 4 years ago
pool_op.cu Optimized the adaptive_avg_pool2d op when output_size == 1 (#31197) 4 years ago
pool_op.h [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
pool_op_xpu.cc optimize batch_norm & pool op for kunlun (#30490) 4 years ago
pool_with_index_op.cc Error message opt, test=develop (#27467) 4 years ago
pool_with_index_op.cu.cc
pool_with_index_op.h Error message opt, test=develop (#27467) 4 years ago
positive_negative_pair_op.cc fix error mesage for negative_positive_pair_op and nce_op (#27779) 4 years ago
positive_negative_pair_op.h
prelu_op.cc fix bug of prelu when rank not equal 4, test=develop (#25067) 5 years ago
prelu_op.cu [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
prelu_op.h fix the computation for dx (grad for x) for prelu operation. (#20949) 5 years ago
print_op.cc Register op version for print, test=op_version (#29945) 4 years ago
prroi_pool_op.cc API/OP (center_loss, fluid.one_hot, prroi_pool, roi_pool, ctc_greed_decoder) error message enhancement (#23794) 5 years ago
prroi_pool_op.cu [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
prroi_pool_op.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
psroi_pool_op.cc Update paddle enforce message (#24498) 5 years ago
psroi_pool_op.cu Error message opt, test=develop (#27467) 4 years ago
psroi_pool_op.h Update paddle enforce message (#24498) 5 years ago
pull_box_extended_sparse_op.cc fix conflict, test=develop (#24238) 5 years ago
pull_box_extended_sparse_op.cu fix conflict, test=develop (#24238) 5 years ago
pull_box_extended_sparse_op.h fix conflict, test=develop (#24238) 5 years ago
pull_box_sparse_op.cc heter box (#29734) 4 years ago
pull_box_sparse_op.cu Paddlebox Framework (#18982) 6 years ago
pull_box_sparse_op.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
pull_sparse_op.cc add fleet pslib pull and push sparse op and push dense op (#23139) 5 years ago
pull_sparse_op.h add fleet pslib pull and push sparse op and push dense op (#23139) 5 years ago
pull_sparse_v2_op.cc add fleet pslib pull and push sparse op and push dense op (#23139) 5 years ago
pull_sparse_v2_op.h add fleet pslib pull and push sparse op and push dense op (#23139) 5 years ago
push_dense_op.cc rename inplace/no_need_buffer inferer, part 1, test=develop (#24711) 5 years ago
push_dense_op.h add fleet pslib pull and push sparse op and push dense op (#23139) 5 years ago
py_func_op.cc enhance error info for py_func (#30138) 4 years ago
py_func_op.h
pyramid_hash_op.cc Refine error message of MatchMatrix and PyramidHash (#27484) 4 years ago
quantize_op.cc operator checkpoints for new attributes. (#29832) 4 years ago
quantize_op.h
queue_generator_op.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
randint_op.cc Refine paddle.manual_seed (#26496) 5 years ago
randint_op.cu add cuda generator (#26786) 5 years ago
random_crop_op.cc API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … (#24118) 5 years ago
random_crop_op.cu
random_crop_op.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
randperm_op.cc Fix the formate of raising error in randperm op (#30108) 4 years ago
randperm_op.cu randperm API: remove out, devive, stop_gradient; add name (#25410) 5 years ago
randperm_op.h randperm run error in multi-gpus (#27942) 4 years ago
range_op.cc avoid data transfer, test=develop (#25810) 5 years ago
range_op.cu optimize range op by place parameters on cpu rather than gpu, test=develop (#30811) 4 years ago
range_op.h fix error log, test=develop (#24419) 5 years ago
range_op_xpu.cc dyngraph (#30892) 4 years ago
rank_attention.cu.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
rank_attention_op.cc fix error message (#30135) 4 years ago
rank_attention_op.cu [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
rank_attention_op.h fix conflict, test=develop (#23298) 5 years ago
rank_loss_op.cc use iwyu clean include (#27267) 4 years ago
rank_loss_op.cu
rank_loss_op.h optimize unity build (#30195) 4 years ago
real_op.cc [Complex] Add real & imag op and api for complex tensor (#29672) 4 years ago
real_op.cu [Complex] Add real & imag op and api for complex tensor (#29672) 4 years ago
real_op.h [Complex] Add real & imag op and api for complex tensor (#29672) 4 years ago
recurrent_op.cc fix runtime crash when rnn model inference, test=develop (#31833) 4 years ago
recurrent_op.h use iwyu clean include (#27267) 4 years ago
reorder_lod_tensor_by_rank_op.cc use iwyu clean include (#27267) 4 years ago
requantize_op.cc operator checkpoints for new attributes. (#29832) 4 years ago
requantize_op.h
reshape_op.cc [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
reverse_op.cc Support LoDTensorArray in reverse_op (#24797) 5 years ago
reverse_op.cu
reverse_op.h Support LoDTensorArray in reverse_op (#24797) 5 years ago
rnn_memory_helper_op.cc use iwyu clean include (#27267) 4 years ago
rnn_op.cc Add LSTM, Simple RNN and GRU CPU kernel (#28577) 4 years ago
rnn_op.cu.cc [ROCM] fix test_rnn_op (#31735) 4 years ago
rnn_op.h Modify relu native implementation 2 (#30996) 4 years ago
roi_align_op.cc add offset parameter in roi_align,generate_proposals.etc ops (#30864) 4 years ago
roi_align_op.cu fix roi_align, test=develop (#31479) 4 years ago
roi_align_op.h fix roi_align, test=develop (#31479) 4 years ago
roi_align_op_xpu.cc support roi_align & affine_channel for kunlun (#29561) 4 years ago
roi_pool_op.cc add REGISTER_OP_VERSION for generate_proposals, roi_align, roi_pool test=op_version (#30034) 4 years ago
roi_pool_op.cu Error message opt, test=develop (#27467) 4 years ago
roi_pool_op.h Enhance ops to support LoD as input for dygraph detection models. (#25316) 4 years ago
roll_op.cc test=develop, add op_register_version for roll_op (#30023) 4 years ago
roll_op.cu Roll cuda kernel (#29655) 4 years ago
roll_op.h modify roll test=develop (#25321) 5 years ago
row_conv_op.cc API/OP (margin_rank_loss, nce, row_conv, positive_negative_pair) erro… (#24246) 5 years ago
row_conv_op.cu fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412) 6 years ago
row_conv_op.h
run_program_op.cc fix loaded no params layer run error (#27241) 4 years ago
run_program_op.cu.cc Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171) 5 years ago
run_program_op.h [Dy2Stat] Add cache for Executor and Context in run_program_op (#28421) 4 years ago
sample_logits_op.cc Remove extraneous comma in error messages (#24478) 5 years ago
sample_logits_op.cu fix lod_reset bug, test=develop (#21392) 5 years ago
sample_logits_op.h Remove extraneous comma in error messages (#24478) 5 years ago
sampling_id_op.cc SamplingID Op fix error print (#24521) 5 years ago
sampling_id_op.cu
sampling_id_op.h Refine paddle.manual_seed (#26496) 5 years ago
save_combine_op.cc improve efficiency of runtime InferVarType (#22778) 5 years ago
save_combine_op.cu add register op_data_type of pad/expand_as et.al (#21718) 5 years ago
save_combine_op.h delete include framework.pb.h (#31859) 4 years ago
save_load_combine_op_test.cc
save_load_op_test.cc
save_op.cc Incorporate cudnn_lstm into LSTM api (#27217) 4 years ago
save_op.cu Incorporate cudnn_lstm into LSTM api (#27217) 4 years ago
save_op.h delete include framework.pb.h (#31859) 4 years ago
scale_op.cc [oneDNN] Initial bf16 amp integration (#31093) 4 years ago
scale_op.cu refine math_op_patch, test=develop (#19727) 6 years ago
scale_op.h add the error message check for the some operator 4 years ago
scale_op_xpu.cc support transformer v2.0 (#30381) 4 years ago
scatter.cu.h Fix scatter grad bug (#30604) 4 years ago
scatter.h Fix scatter grad bug (#30604) 4 years ago
scatter_nd_add_op.cc rename inplace/no_need_buffer inferer, part4, test=develop (#24781) 5 years ago
scatter_nd_add_op.cu Fix scatter grad bug (#30604) 4 years ago
scatter_nd_add_op.h Fix scatter grad bug (#30604) 4 years ago
scatter_op.cc Fix scatter grad bug (#30604) 4 years ago
scatter_op.cu Fix scatter grad bug (#30604) 4 years ago
scatter_op.h Fix scatter grad bug (#30604) 4 years ago
scatter_test.cc use iwyu clean include (#27267) 4 years ago
search_compute.h Support mips arch (#29903) 4 years ago
seed_op.cc add cuda kernel for seed, test=develop (#23749) 5 years ago
seed_op.cu [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
seed_op.h Dropout with seed (#21590) 5 years ago
segment_pool_op.cc Add the cpu version of segment sum mean max min op 4 years ago
segment_pool_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
segment_pool_op.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
select_input_op.cc Move input_size check into RunTime phrase of gru_unit_op and refine error message (#24776) 5 years ago
select_op_helper.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
select_output_op.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
selu_op.cc [OpDevOptimize] Add common infershape functions (#26096) 5 years ago
selu_op.cu
selu_op.h
set_value_op.cc Fix bug of set_value op:Decerease axes to do right broadcast (#31875) 4 years ago
set_value_op.cu [setitem] Support Tensor setitem in static mode (#29708) 4 years ago
set_value_op.h Fix bug of set_value op:Decerease axes to do right broadcast (#31875) 4 years ago
shape_op.cc shape op support int8 and uint8 tensor (#30201) 4 years ago
shape_op.cu shape op support int8 and uint8 tensor (#30201) 4 years ago
shape_op.h [Dy2stat] Support len syntax (#24638) 5 years ago
shape_op_xpu.cc add XPU support for shape op and reshape op (#27804) 4 years ago
shard_index_op.cc Improving error reporting messages for ops (#24438) 5 years ago
shard_index_op.cu Improving error reporting messages for ops (#24438) 5 years ago
shard_index_op.h Improving error reporting messages for ops (#24438) 5 years ago
shrink_rnn_memory_op.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
shuffle_batch_op.cc Imperative tracer refactoring (#22457) 5 years ago
shuffle_batch_op.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
shuffle_channel_op.cc [2.0RC]refine error message in shuffle channel OP (#27505) 4 years ago
shuffle_channel_op.cu
shuffle_channel_op.h
sigmoid_cross_entropy_with_logits_op.cc Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op (#24485) 5 years ago
sigmoid_cross_entropy_with_logits_op.cu [ROCM] fix gather_op, sigmoid_cross_entropy_with_logits_op, test=develop (#31467) 4 years ago
sigmoid_cross_entropy_with_logits_op.h
sign_op.cc update error info of ops,add some test cases for raise message (#23750) 5 years ago
sign_op.cu
sign_op.h
sign_op_xpu.cc Polish some error message in opeators (#27876) 4 years ago
similarity_focus_op.cc OP(rank_loss, similarity_focus, squeeze) error message enhancement (#24448) 5 years ago
similarity_focus_op.h OP(rank_loss, similarity_focus, squeeze) error message enhancement (#24448) 5 years ago
size_op.cc fix gpu kernel for numel Op (#27085) 4 years ago
size_op.cu fix gpu kernel for numel Op (#27085) 4 years ago
size_op.h fix gpu kernel for numel Op (#27085) 4 years ago
slice_op.cc Add error message for slice op(#30851) 4 years ago
slice_op.cu add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
slice_op.h Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003) 4 years ago
slice_op_xpu.cc add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun (#28542) 4 years ago
smooth_l1_loss_op.cc update enhance error message for Initializer, smooth_l1 (#23912) 5 years ago
smooth_l1_loss_op.cu
smooth_l1_loss_op.h
softmax_cudnn_op.cu [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
softmax_op.cc [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
softmax_op.cu.cc
softmax_op.h
softmax_op_xpu.cc fix softmax bug for multi_card in kunlun (#30600) 4 years ago
softmax_with_cross_entropy_op.cc add softmax_switch for softmax_with_cross_entropy_op, test=develop (#31428) 4 years ago
softmax_with_cross_entropy_op.cu [ROCM] fix softmax_with_cross_entropy_op, test=develop (#31629) 4 years ago
softmax_with_cross_entropy_op.h add softmax_switch for softmax_with_cross_entropy_op, test=develop (#31428) 4 years ago
softmax_with_cross_entropy_op_xpu.cc 1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448) 4 years ago
space_to_depth_op.cc op error info (#27856) 4 years ago
space_to_depth_op.cu add register op_data_type of pad/expand_as et.al (#21718) 5 years ago
space_to_depth_op.h
spectral_norm_op.cc Update OP_INOUT_CHECK (#23757) 5 years ago
spectral_norm_op.cu
spectral_norm_op.h fix PADDLE_THROW in spectral_norm_op.h. test=develop (#24414) 5 years ago
split_lod_tensor_op.cc [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
split_op.cc op error info (#27856) 4 years ago
split_op.cu.cc refine the split op for API 2.0 test=develop (#25320) 5 years ago
split_op.h op error info (#27856) 4 years ago
split_selected_rows_op.cc test=develop, error info improvement (#24496) 5 years ago
split_selected_rows_op.cu
split_selected_rows_op.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
spp_op.cc Update paddle enforce message (#24498) 5 years ago
spp_op.cu.cc
spp_op.h [ROCM] update fluid operators for rocm (part4), test=develop (#31225) 4 years ago
squared_l2_distance_op.cc rename inplace/no_need_buffer inferer, part4, test=develop (#24781) 5 years ago
squared_l2_distance_op.cu
squared_l2_distance_op.h optimize unity build (#30195) 4 years ago
squared_l2_norm_op.cc API/OP error message enhancement (#23684) 5 years ago
squared_l2_norm_op.cu
squared_l2_norm_op.h API/OP error message enhancement (#23684) 5 years ago
squeeze_op.cc add uint8 support for squeeze operator (#28734) 4 years ago
squeeze_op.cu.cc add uint8 support for squeeze operator (#28734) 4 years ago
squeeze_op.h add uint8 support for squeeze operator (#28734) 4 years ago
squeeze_op_xpu.cc add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056) 4 years ago
stack_op.cc Imperative tracer refactoring (#22457) 5 years ago
stack_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
stack_op.h Refine stack op to improve xlnet performance, test=develop (#22142) 5 years ago
stack_op_xpu.cc feat: support check_nan_inf for kunlun/xpu device (#29694) 4 years ago
strided_memcpy.h [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
strided_memcpy_test.cc [ROCM] update fluid operators for rocm (part9), test=develop (#31338) 4 years ago
strided_slice_op.cc add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
strided_slice_op.cu add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) 4 years ago
strided_slice_op.h Support int32 int64 and fix bug (#24407) 5 years ago
sum_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
sum_op.cu update the error message check for the some ops 4 years ago
sum_op.h update the error message check for the some ops 4 years ago
sum_op_xpu.cc fix enforce msg of sum xpu op (#30113) 4 years ago
sync_batch_norm_op.cc Add macro BOOST_GET to enrich the error information of boost :: get (#24175) 5 years ago
sync_batch_norm_op.cu [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
sync_batch_norm_op.cu.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
tdm_child_op.cc Add Tdm child OP in contrib (#23241) 5 years ago
tdm_child_op.h fix header file paths of gflags, commit 3, test=develop (#30273) 4 years ago
tdm_sampler_op.cc Add Tdm sampler op in Contrib (#23290) 5 years ago
tdm_sampler_op.h fix header file paths of gflags, commit 3, test=develop (#30273) 4 years ago
teacher_student_sigmoid_loss_op.cc Refine error message, test=develop (#23823) 5 years ago
teacher_student_sigmoid_loss_op.h
temporal_shift_op.cc support NHWC for temporal_shift op (#31642) 4 years ago
temporal_shift_op.cu support NHWC for temporal_shift op (#31642) 4 years ago
temporal_shift_op.h support NHWC for temporal_shift op (#31642) 4 years ago
tensor_array_to_tensor_op.cc c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop (#23631) 5 years ago
tensor_formatter.cc use iwyu clean include second time, test=develop (#30829) 4 years ago
tensor_formatter.h use iwyu clean include (#27267) 4 years ago
test_common_infer_shape_functions.cc [OpDevOptimize] Add common infershape functions (#26096) 5 years ago
test_leaky_relu_grad_grad_functor.cc fix leaky_relu op when alpha is zero, test=develop (#19833) 5 years ago
test_leaky_relu_grad_grad_functor.cu fix leaky_relu op when alpha is zero, test=develop (#19833) 5 years ago
test_leaky_relu_grad_grad_functor.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
tile_op.cc fix shape of tile_grad op (#29289) 4 years ago
tile_op.cu [API 2.0] add paddle.tile op (#26245) 5 years ago
tile_op.h fix shape of tile_grad op (#29289) 4 years ago
top_k_function_cuda.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
top_k_op.cc Polish some error message in opeators (#27876) 4 years ago
top_k_op.cu [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
top_k_op.h optimize unity build (#31119) 4 years ago
top_k_op_xpu.cc add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun (#28542) 4 years ago
top_k_v2_op.cc update the code for the topk message optimize 4 years ago
top_k_v2_op.cu optimize topk op through limit SortTopK kernel entrance, test=develop (#30403) 4 years ago
top_k_v2_op.h optimize unity build (#31119) 4 years ago
trace_op.cc Optimize the error message of framework. (#30134) 4 years ago
trace_op.cu [ROCM] fix dropout and remove hipcub, test=develop (#31455) 4 years ago
trace_op.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
transpose_op.cc More precise mkldnn kernel rules in GetExpectedKernelType (#29840) 4 years ago
transpose_op.cu Make transpose, trace, kron, reshape, sum op support complex type (#29321) 4 years ago
transpose_op.h enhance reduce op which can reduce tensor with arbitrary rank 4 years ago
transpose_op_xpu.cc fix expand/uniform_random && concat/transpose to new api on xpu (#29280) 4 years ago
tree_conv_op.cc test=develop, error message of tree_conv OP enhancement (#23574) 5 years ago
tree_conv_op.cu
tree_conv_op.h
tril_triu_op.cc add fp16 support for tril_triu op (#30186) 4 years ago
tril_triu_op.cu add fp16 support for tril_triu op (#30186) 4 years ago
tril_triu_op.h add fp16 support for tril_triu op (#30186) 4 years ago
truncated_gaussian_random_op.cc Add truncated_gaussian_random XPU kernel (#27861) 4 years ago
truncated_gaussian_random_op.cu fix truncated_gaussian seed (#28777) 4 years ago
truncated_gaussian_random_op.h Add truncated_gaussian_random XPU kernel (#27861) 4 years ago
truncated_gaussian_random_op_xpu.cc Add truncated_gaussian_random XPU kernel (#27861) 4 years ago
unbind_op.cc add unbind op (#23359) 5 years ago
unbind_op.cu.cc add unbind op (#23359) 5 years ago
unbind_op.h add unbind op (#23359) 5 years ago
unfold_op.cc rename inplace/no_need_buffer inferer, part3, test=develop (#24734) 5 years ago
unfold_op.cu
unfold_op.h Polish PADDLE_ENFORCE of unfold_op (#24423) 5 years ago
uniform_random_batch_size_like_op.cc rename inplace/no_need_buffer inferer, part2, test=develop (#24733) 5 years ago
uniform_random_op.cc update the error message check for the some ops 4 years ago
uniform_random_op.cu update the error message check for the some ops 4 years ago
uniform_random_op.h update the error message check for the some ops 4 years ago
uniform_random_op_xpu.cc fix expand/uniform_random && concat/transpose to new api on xpu (#29280) 4 years ago
unique_op.cc fix a bug in op_version_registry, test=develop, test=op_version (#29994) 4 years ago
unique_op.cu [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
unique_op.h add dtype for unique (#26655) 5 years ago
unique_with_counts_op.cc Add some error meesage and dtyp, dtyep check for some ops (#23762) 5 years ago
unique_with_counts_op.h Add the op of unique_with_counts, expand count function of the op unique (#18720) 6 years ago
unity_build_rule.cmake optimize unity build (#31119) 4 years ago
unpool_op.cc API/OP (group_norm, layer_norm, random_crop, unpool) error message enhancement (#24413) 5 years ago
unpool_op.cu.cc
unpool_op.h
unsqueeze_op.cc add uint8 support for squeeze operator (#28734) 4 years ago
unsqueeze_op.cu.cc add uint8 support for squeeze operator (#28734) 4 years ago
unsqueeze_op.h Update paddle enforce message (#24498) 5 years ago
unsqueeze_op_xpu.cc add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056) 4 years ago
unstack_op.cc update error message for unstack op and lamb op; test=develop (#24439) 5 years ago
unstack_op.cu add kernel for unstack_op, test=develop (#19538) 5 years ago
unstack_op.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
utils.h xpu support for fill_constant Op (#27675) 4 years ago
var_conv_2d_op.cc Polish no onwer ops error message (#27448) 4 years ago
var_conv_2d_op.h Add var_conv_2d op (#18518) 6 years ago
warpctc_op.cc [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
warpctc_op.cu.cc add support to float64 input of warpctc op. (#27399) 4 years ago
warpctc_op.h [ROCM] update fluid platform for rocm (part5), test=develop (#31315) 4 years ago
where_index_op.cc add new api for Paddle2.0: nonzero, index_selct, roll, cross (#23176) 5 years ago
where_index_op.cu add new api for Paddle2.0: nonzero, index_selct, roll, cross (#23176) 5 years ago
where_index_op.h add new api for Paddle2.0: nonzero, index_selct, roll, cross (#23176) 5 years ago
where_op.cc rename inplace/no_need_buffer inferer, part3, test=develop (#24734) 5 years ago
where_op.cu refine gpu kernel config for Paddle (#28085) 4 years ago
where_op.h Implement a new C++ operator where and API tensor.where (#23220) 5 years ago