Commit Graph

4994 Commits (05c00af5f16da64d1e8953711c647512121ef3d2)

Author SHA1 Message Date
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
5 years ago
wangchaochaohu 95b95a284b
Mean gpu optimize (#21643)
5 years ago
Zeng Jinle 0f8888360e
Polish op registry codes (#21561)
5 years ago
Aurelius84 3d9dee575e
Set lod_level of Out in compile time of sequence_pool_op (#21604)
5 years ago
Huihuang Zheng 1dcf6a7212
Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532)
5 years ago
Jacek Czaja 8f5a93a07b - Fix to regression in performance of ResNet-50 training (#21588)
5 years ago
Jacek Czaja 9ce0e29dc3 [MKL-DNN] Batch norm mkl-dnn NHWC support (#21553)
5 years ago
Youwei Song cdba41af4d dygraph Embedding layer use lookuptable v2 (#21209)
5 years ago
wangchaochaohu 4c9b3dafa7
fill_constant_batch_size_like OP precious problem fix (#21337)
5 years ago
WangXi 768f9242e9 Fix dgc clip & rampup step, test=develop (#21491)
5 years ago
Zeng Jinle 3662fb71a7
remove eval() calls in Eigen, test=develop (#21498)
5 years ago
Jacek Czaja 18a5d30754 [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
5 years ago
Tao Luo 70eb397677
remove unused snappy/snappystream depends in distributed codes (#21484)
5 years ago
lilong12 0bc8bdf724
set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402)
5 years ago
tangwei12 0bddb951c2
fix async mode, test=develop (#21367)
5 years ago
Leo Chen b3090ad406
fix synchronization problem in softmax_with_cross_entropy_op, test=develop (#21480)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
5 years ago
hutuxian 7e68bc896b
refactor AUC OP and add its CUDA Kernel (#21336)
5 years ago
wawltor dbbe6e9cb6
fix the device supported of the op unique and unique_with_counts. (#21395)
5 years ago
wangguanzhong 379e3febf2
fix shape check in density_prior_box, test=develop (#21414)
5 years ago
Adam 76b55da15a Fix bug in UpdatePadding for int64_t type (#21465)
5 years ago
Pei Yang 7b28d938bf
show shape diff in wrong trt input shape errmsg, test=develop (#21451)
5 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
5 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
5 years ago
Chen Weihang 664f958a02
Fix optimizer op infershape failed in dygraph multi-cards mode (#21374)
5 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
5 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
5 years ago
Leo Chen add62acfd1
remove kDepXOut for abs_grad op, test=develop (#21407)
5 years ago
Adam 9107bf209f Add template version of UpdatePadding (#21426)
5 years ago
zhaoyuchen2018 b16274556a
Add dscending for argsort (#21400)
5 years ago
hong ac8546701d
Add dygraph execution context (#20157)
5 years ago
hutuxian a6b089c614
add macro to ban windows (#21422)
5 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
5 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
5 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
5 years ago
ShenLiang e2c6f434ec Add Lod information for gather_nd & scatter_nd (#21404)
5 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
5 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
5 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
5 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
5 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
5 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
5 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
5 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
5 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
5 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
5 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
5 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
5 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
5 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
5 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
5 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
5 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
5 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
5 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
5 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
5 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
5 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
5 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
5 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
ceci3 f62a929151
fix instance norm (#21042)
5 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
5 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
5 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
LielinJiang 06063b7001 add op locality_aware_nms, test=develop (#20976)
5 years ago
wangchaochaohu fc385777e4
fix the compile cost long time test=develop (#21064)
5 years ago
Chen Weihang 2f27b10331
Add dependency for error_codes.proto (#21084)
5 years ago
wangchaochaohu 149a1e3124
Expand refine (#21063)
5 years ago
Wojciech Uss af3ff422cc Fix dst memory allocation in elementwise_add (#21059)
5 years ago
liym27 26a6e27afe fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
5 years ago
Huihuang Zheng 1957192f05
Add select_input_op and select_output_op (#21016)
5 years ago
Liufang Sang e5e699ecc0 set lod level for compile time test=develop (#21022)
5 years ago
liym27 f0e95a6049 Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017)
5 years ago
zhaoyuchen2018 0059404e77
Fix ce ocr_recognition test fails (#20987)
5 years ago
Chengmo bc8e600ce5
Fix rpc not wait in GEO communicator (#20967)
5 years ago
Tao Luo 25ffa8445d
refine murmurhash3_x64_128 for bloom_filter (#20996)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
wangchaochaohu bf379fef96
refine code for code reuse test=develop (#20988)
5 years ago
Zhang Ting de9bec607e lrn supports channel_last input, test=develop (#20954)
5 years ago
Liufang Sang 9b666cae67 fix diff in dequantize op between cpu and gpu test=develop (#20953)
5 years ago
Zhang Ting f4f85831d3 fix the bug of conv_transpose cudnn kernel, test=develop (#20958)
5 years ago
zhaoyuchen2018 7f3a445e9a
Fix gru as small frame_size has error. (#20922)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
Chengmo 16596f6498
Fix Paddle Cloud role maker (#20860)
5 years ago
liym27 59de8e1214 Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912)
5 years ago
Zhang Ting 8d1e9f0f7e maxout supports channel_last input (#20846)
5 years ago
Yihua Xu b6260f3866 Optimize the kernel implementation of layernorm with openmp (#20895)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Chen Weihang 768551b25d Add parameter init check add run_startup_progrom error message for fc(mul) (#20906)
5 years ago
Zhang Ting c18f1bd716 fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897)
5 years ago
Wilber b489760099
fix jit_matmul bug test=develop (#20886)
5 years ago
Yiqun Liu 03ba0fdae6
Move the codes of fused operators to operators/fused directory. (#20881)
5 years ago
zhang wenhui d428912503
fix select_rows mergeadd bug, test=develop (#20876)
5 years ago
liym27 6802539a2e support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)
5 years ago
wangchaochaohu 28ca2e5ffa
strided_slice perforamnce improvement test=develop (#20852)
5 years ago
Yiqun Liu 6fcfd32e6c
Check and correct the output's lod_level in DynamicRNN related operators (#19144)
5 years ago
liym27 84d221b667 improve unsqueeze op to support int, Tensor for argument axes (#20824)
5 years ago
silingtong123 03d7f3ddb2 Make shape tensor support int32 (#20757)
5 years ago
Huihuang Zheng 95ba4bd2ab
Add shape and type check at read_op (#20754)
5 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
5 years ago
Chen Weihang 8b59ac3ad0 delete paddle infershape enforce marco (#20832)
5 years ago
whs c8e49be2f1
Fix roi_perspective_transform op (#20764)
5 years ago
Chen Weihang 26cc1fe508
Replace risky GetInputType method with secure IndicateVarDataType interface (#20668)
5 years ago
Yamei-Lee cf717fd6dd fix bug in reshape: (#20781)
5 years ago
Zhang Ting 5a8d885d72 All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)
5 years ago
danleifeng 9171f73714 fix fp16 grid_size for size=1; test=develop (#20812)
5 years ago
Tao Luo efbdad0596
make search_compute support avx default (#20779)
5 years ago
WangXi 250e72d254 Fix DGC algorithm flow to make it the same as paper (#20758)
5 years ago
zhaoyuchen2018 6e6eab07e8
Fix multihead op bug. (#20783)
5 years ago
lvmengsi dfa0549f87
Revert "fix_depthwise_conv_cudnn, test=develop (#20712)" (#20782)
5 years ago
whs 4c7d196d83
Add norm_by_time for warpctc op in padding mode. (#17580)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago
石晓伟 37cd43545a
update the infer shape of matmul, test=develop (#20717)
5 years ago
Adam 67b59ddb38 Minor MKL-DNN conv int8 performance fixes (#20753)
5 years ago
wangchaochaohu 0687bcd64f
Refine getitem of Variable (#20729)
5 years ago
danleifeng 79e08ecebf add assertions on whether elementwise_div divison is zero (#20618)
5 years ago
123malin 95e90aa102
test=develop, add communicator_is_sgd_optimizer flag (#20677)
5 years ago
Aurelius84 74a28f5ea4
fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722)
5 years ago
lvmengsi dc229b4195
fix_depthwise_conv_cudnn, test=develop (#20712)
5 years ago
gongweibao c1710e91b2
Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690)
5 years ago
lidanqing 46e93f7c86 Revert "Refactor conv computeINT8" (#20640)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 10505faf4e
polish codes, test=develop (#20672)
5 years ago
Zeng Jinle 34e3adaece
Refine reduce codes to save compiling time and binary size (#20676)
5 years ago
whs a3e641e93c
Fix infer shape of warpctc op. (#20653)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
zhongpu efa10937bd fix elementwise_floordiv_op and elementwise_mod_op (#20534)
5 years ago
tangwei12 04384502a8
fix bug with heart beat , test=develop (#20654)
5 years ago
wangchaochaohu 7783d3bd43
Conv refine (#20644)
5 years ago
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
5 years ago
gongweibao f3f52fc1e2
Retry when failed to bind address. (#20642)
5 years ago
qingqing01 01eddc1a04
Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
zhaoyuchen2018 8314e64a8b
Fix sum op fails as no memory in tensor(#20602)
5 years ago
Yibing Liu ee2869cae9
Remove redundant infershape in linear chain crf grad, test=develop (#20629)
5 years ago
123malin b4a3b75002
bug fix: invalid learning rate decay in pserver async mode (#20325)
5 years ago
石晓伟 a4753f3a79 Optimize error message of mean_op and matmul_op (#20413)
5 years ago
Leo Chen d6c1d6ca56 update class name, test=develop (#20578)
5 years ago
Double_V 0b39218749 memory optimizer for reshape op,test=develop (#20569)
5 years ago
chengduo 36c85ef492
Add sub-scope check in RecurrentOp (#20468)
5 years ago
JesseyXujin 2ff18e537f
add expand_as op, test=develop (#20565)
5 years ago
Zeng Jinle 40effc61af
Refine py_reader exit (#20331)
5 years ago
Zhang Ting 78910480c1 fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589)
5 years ago
Yuan Shuai 172e91c008
Refine error message of transpose_op (#20437)
5 years ago
liym27 fc6ec3b9f6 fill_constant support Tensor; (#20521)
5 years ago
Zhang Ting 0130cc969c fixed group_norm's bug and modified unittest (#20506)
5 years ago
zhaoyuchen2018 8fb569e5b9
Fix api doc example bug and polish square doc (#20491)
5 years ago
Guo Sheng dfd1eee7f7 Add seq2seq api related code (#19820)
5 years ago
lvmengsi 2384589383
Fix conv_grad_grad (#20469)
5 years ago
Double_V 8299203370 Support reshape_op double gradient (#20304)
5 years ago
hong19860320 4d0d5e4cc7
refine eng doc for hard_sigmoid op (#20442)
5 years ago
Aurelius84 22823df2e2
enhance embedding error message test=develop (#20246)
5 years ago
zhupengyang 3997743a5b add input type and dtype check, enhance shape error message for concat_op (#20101)
5 years ago
zhupengyang 95524a4d30 fix APIs: relu, relu6, hash (#20416)
5 years ago
JesseyXujin 843bdbaae1
add input type and dtype check for accuracy_op (#20399)
5 years ago
lijianshe02 211f5b0319
enhance mul_op input error message test=develop (#20414)
5 years ago
GaoWei8 5ea2cc6733 fix API:cos, exp, ceil, elu, brelu English doc (#20032)
5 years ago
wopeizl 3044a62f2a
fix the precise roi poop op test=develop (#20126)
5 years ago
Wilber 2893cd1ae0
modify english api (#20159)
5 years ago
zhouwei25 b1218d056b fix English Doc of API:layers.py_func/sum (#20329)
5 years ago
qingqing01 63194d6e67
Enhance InferShape in deformable_conv and prior_box op (#20372)
5 years ago
tangwei12 a010d883b4
doc fix, test=develop, test=document_fix (#20239)
5 years ago
huzhiqiang 6a8e54047f
fix reorder_lod_tensor_by_rank doc en (#20256)
5 years ago
Yibing Liu 899ab30df0
Fix several api docs (#20282)
5 years ago
wangchaochaohu 1288ac2983
fix expand bug (#20340)
5 years ago
SunGaofeng a73e1f68b4
fix document of 11 APIs (#20278)
5 years ago
Pei Yang 057d782d51
fix en api doc of [round, sin, sqrt], test=develop, test=document_fix (#20296)
5 years ago
Kaipeng Deng 3833b511a6
refine en API doc (#20206)
5 years ago
wangchaochaohu bc6126dd07
fix the reduce bug test=develop (#20102)
5 years ago
FDInSky e2c7b6821a test=develop enhance uniform_random op python api (#20295)
5 years ago
danleifeng 3a0f93b3f9 fix error message for elementwise_add/mul (#20283)
5 years ago
liym27 670937e11d add input type and dtype check for reshape op. (#20099)
5 years ago