Commit Graph

8864 Commits (6e6eab07e80d287fb10f6033a01f15650b36fcdb)

Author SHA1 Message Date
zhaoyuchen2018 6e6eab07e8
Fix multihead op bug. (#20783)
5 years ago
lvmengsi dfa0549f87
Revert "fix_depthwise_conv_cudnn, test=develop (#20712)" (#20782)
5 years ago
whs 4c7d196d83
Add norm_by_time for warpctc op in padding mode. (#17580)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago
石晓伟 37cd43545a
update the infer shape of matmul, test=develop (#20717)
5 years ago
石晓伟 e742760f8e
optimize version error, test=develop (#20715)
5 years ago
Adam 67b59ddb38
Minor MKL-DNN conv int8 performance fixes (#20753)
5 years ago
wangchaochaohu 0687bcd64f
Refine getitem of Variable (#20729)
5 years ago
zhongpu 72d1d72c09
fix ExecutionContext::HasInput and ExecutionContext::HasOutput depend on the scope structure, test=develop (#20721)
5 years ago
danleifeng 79e08ecebf
add assertions on whether elementwise_div divison is zero (#20618)
5 years ago
bingyanghuang fd49ebcbd8
update int8 benchmark with 6271 data, test=develop test=document_fix (#20736)
5 years ago
123malin 95e90aa102
test=develop, add communicator_is_sgd_optimizer flag (#20677)
5 years ago
Aurelius84 74a28f5ea4
fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722)
5 years ago
石晓伟 48a774c713
fix ts_sort's bug, test=develop (#20720)
5 years ago
lvmengsi dc229b4195
fix_depthwise_conv_cudnn, test=develop (#20712)
5 years ago
石晓伟 d8f4f4239d
Ensure backward compatibility with the anakin interface, test=develop (#20691)
5 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
5 years ago
xujiaqi01 5223b0dd9d
add check nan / inf in downpour worker (#20694)
5 years ago
WangXi 507afa8a8a
Fix dgc nan by stripping nccl from sparseReduce. (#20630)
5 years ago
gongweibao c1710e91b2
Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690)
5 years ago
lidanqing 46e93f7c86
Revert "Refactor conv computeINT8" (#20640)
5 years ago
liu zhengxi d39777fefa
alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697)
5 years ago
Zeng Jinle 4eeda9d676
fix tensor_util, test=develop (#20699)
5 years ago
Zeng Jinle ab575de725
Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
Jacek Czaja a1cd27f13f
[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 10505faf4e
polish codes, test=develop (#20672)
5 years ago
liuwei1031 569951c418
improve the efficiency of BuddyAllocator (#19888)
5 years ago
Zeng Jinle 34e3adaece
Refine reduce codes to save compiling time and binary size (#20676)
5 years ago
liu zhengxi dbc2bb3376
improve the performance of capi in PD_PredictorRun (#20665)
5 years ago
whs a3e641e93c
Fix infer shape of warpctc op. (#20653)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
zhongpu efa10937bd
fix elementwise_floordiv_op and elementwise_mod_op (#20534)
5 years ago
tangwei12 04384502a8
fix bug with heart beat , test=develop (#20654)
5 years ago
wangchaochaohu 7783d3bd43
Conv refine (#20644)
5 years ago
lidanqing 57b656f956
Add document for int8 object detection quantization (#19356)
5 years ago
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
5 years ago
gongweibao f3f52fc1e2
Retry when failed to bind address. (#20642)
5 years ago
qingqing01 01eddc1a04
Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
zhaoyuchen2018 8314e64a8b
Fix sum op fails as no memory in tensor(#20602)
5 years ago
Yibing Liu ee2869cae9
Remove redundant infershape in linear chain crf grad, test=develop (#20629)
5 years ago
123malin b4a3b75002
bug fix: invalid learning rate decay in pserver async mode (#20325)
5 years ago
WangXi cadc6a9704
fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
5 years ago
Zeng Jinle 40c258a77b
Refine API.spec mechanism (#20574)
5 years ago
石晓伟 a4753f3a79
Optimize error message of mean_op and matmul_op (#20413)
5 years ago
Leo Chen d6c1d6ca56
update class name, test=develop (#20578)
5 years ago
gongweibao 1d82025e89
Add interface so user can get scaled loss when they use customized loss. (#20571)
5 years ago
liu zhengxi 922d432477
fix the PD_ZeroCopyPredictorRun output problem (#20612)
5 years ago
翟飞跃 36acfaeeda
Add fused_embedding_seq layer into fluid.contrib (#19771)
5 years ago
Double_V 0b39218749
memory optimizer for reshape op,test=develop (#20569)
5 years ago