Commit Graph

4855 Commits (9894a4fb35fc91646474611864cd933a89fd4a9b)

Author SHA1 Message Date
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
5 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
5 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
5 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
5 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
5 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
5 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
5 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
5 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
5 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
5 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
5 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
5 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
5 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
5 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
5 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
5 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
5 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
5 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
5 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
5 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
5 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
5 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
ceci3 f62a929151
fix instance norm (#21042)
5 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
5 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
5 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
LielinJiang 06063b7001 add op locality_aware_nms, test=develop (#20976)
5 years ago
wangchaochaohu fc385777e4
fix the compile cost long time test=develop (#21064)
5 years ago
Chen Weihang 2f27b10331
Add dependency for error_codes.proto (#21084)
5 years ago
wangchaochaohu 149a1e3124
Expand refine (#21063)
5 years ago
Wojciech Uss af3ff422cc Fix dst memory allocation in elementwise_add (#21059)
5 years ago
liym27 26a6e27afe fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
5 years ago
Huihuang Zheng 1957192f05
Add select_input_op and select_output_op (#21016)
5 years ago
Liufang Sang e5e699ecc0 set lod level for compile time test=develop (#21022)
5 years ago
liym27 f0e95a6049 Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017)
5 years ago
zhaoyuchen2018 0059404e77
Fix ce ocr_recognition test fails (#20987)
5 years ago
Chengmo bc8e600ce5
Fix rpc not wait in GEO communicator (#20967)
5 years ago
Tao Luo 25ffa8445d
refine murmurhash3_x64_128 for bloom_filter (#20996)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
wangchaochaohu bf379fef96
refine code for code reuse test=develop (#20988)
5 years ago
Zhang Ting de9bec607e lrn supports channel_last input, test=develop (#20954)
5 years ago
Liufang Sang 9b666cae67 fix diff in dequantize op between cpu and gpu test=develop (#20953)
5 years ago
Zhang Ting f4f85831d3 fix the bug of conv_transpose cudnn kernel, test=develop (#20958)
5 years ago
zhaoyuchen2018 7f3a445e9a
Fix gru as small frame_size has error. (#20922)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
Chengmo 16596f6498
Fix Paddle Cloud role maker (#20860)
5 years ago
liym27 59de8e1214 Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912)
5 years ago
Zhang Ting 8d1e9f0f7e maxout supports channel_last input (#20846)
5 years ago
Yihua Xu b6260f3866 Optimize the kernel implementation of layernorm with openmp (#20895)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Chen Weihang 768551b25d Add parameter init check add run_startup_progrom error message for fc(mul) (#20906)
5 years ago
Zhang Ting c18f1bd716 fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897)
5 years ago
Wilber b489760099
fix jit_matmul bug test=develop (#20886)
5 years ago
Yiqun Liu 03ba0fdae6
Move the codes of fused operators to operators/fused directory. (#20881)
5 years ago
zhang wenhui d428912503
fix select_rows mergeadd bug, test=develop (#20876)
5 years ago
liym27 6802539a2e support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)
5 years ago
wangchaochaohu 28ca2e5ffa
strided_slice perforamnce improvement test=develop (#20852)
5 years ago
Yiqun Liu 6fcfd32e6c
Check and correct the output's lod_level in DynamicRNN related operators (#19144)
5 years ago
liym27 84d221b667 improve unsqueeze op to support int, Tensor for argument axes (#20824)
5 years ago
silingtong123 03d7f3ddb2 Make shape tensor support int32 (#20757)
5 years ago
Huihuang Zheng 95ba4bd2ab
Add shape and type check at read_op (#20754)
5 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
5 years ago
Chen Weihang 8b59ac3ad0 delete paddle infershape enforce marco (#20832)
5 years ago
whs c8e49be2f1
Fix roi_perspective_transform op (#20764)
5 years ago
Chen Weihang 26cc1fe508
Replace risky GetInputType method with secure IndicateVarDataType interface (#20668)
5 years ago
Yamei-Lee cf717fd6dd fix bug in reshape: (#20781)
5 years ago
Zhang Ting 5a8d885d72 All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)
5 years ago
danleifeng 9171f73714 fix fp16 grid_size for size=1; test=develop (#20812)
5 years ago
Tao Luo efbdad0596
make search_compute support avx default (#20779)
5 years ago
WangXi 250e72d254 Fix DGC algorithm flow to make it the same as paper (#20758)
5 years ago
zhaoyuchen2018 6e6eab07e8
Fix multihead op bug. (#20783)
5 years ago
lvmengsi dfa0549f87
Revert "fix_depthwise_conv_cudnn, test=develop (#20712)" (#20782)
5 years ago
whs 4c7d196d83
Add norm_by_time for warpctc op in padding mode. (#17580)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago