Commit Graph

16382 Commits (642b33564e9d4d132f87bd94cf87361fefff66d4)

Author SHA1 Message Date
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
5 years ago
xujiaqi01 23876de55b
fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)
5 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
5 years ago
GaoWei8 a9d4eed3a8 fix cmake fails on inference_download_and_uncompress (#21185)
5 years ago
xujiaqi01 9e045170c0
add copy table (#21086)
5 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
5 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
5 years ago
Zeng Jinle 5fdfbe3413
Add friendly dygraph trace API (#21091)
5 years ago
Chen Weihang 4bd9463630
fix detail error message error, test=develop (#21170)
5 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
5 years ago
Adam d74ea0855f Add relative error measure when (value > 1) (#21144)
5 years ago
Chen Weihang b3a3e6f60c change cuda enforce & add example (#21142)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
5 years ago
Chen Weihang 27fa9c100b
add examples for resource exhausted error, test=develop (#21140)
5 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
ceci3 f62a929151
fix instance norm (#21042)
5 years ago
Zeng Jinle d625aaf0c1
remove so many logs of parallel executor, test=develop (#21105)
5 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
5 years ago
Chen Weihang edd6680a71
Further simplify the C++ error info stack (#21093)
5 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
5 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Yiqun Liu 9091f8cdf9
Support generating code for grad_op (#21066)
5 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
LielinJiang 06063b7001 add op locality_aware_nms, test=develop (#20976)
5 years ago
wangchaochaohu fc385777e4
fix the compile cost long time test=develop (#21064)
5 years ago
Chen Weihang 2f27b10331
Add dependency for error_codes.proto (#21084)
5 years ago
wangchaochaohu 149a1e3124
Expand refine (#21063)
5 years ago
Wojciech Uss af3ff422cc Fix dst memory allocation in elementwise_add (#21059)
5 years ago
liym27 26a6e27afe fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)
5 years ago
GaoWei8 829bf871d7 Add ernie c++ inference test (#21015)
5 years ago
mapingshuo b592deec90 add dlpack to imdb demo, test=develop (#21069)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
5 years ago
Huihuang Zheng 1957192f05
Add select_input_op and select_output_op (#21016)
5 years ago
Liufang Sang e5e699ecc0 set lod level for compile time test=develop (#21022)
5 years ago
liym27 f0e95a6049 Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017)
5 years ago
ddokupil c98712d56e add ending message in paddle_build.sh (#20334)
5 years ago
Zeng Jinle a710ccc0cb
refine error message of allocator again, test=develop (#21023)
5 years ago
tianshuo78520a d89ca2ffb5 split api_spec document (#20999)
5 years ago
zhaoyuchen2018 0059404e77
Fix ce ocr_recognition test fails (#20987)
5 years ago
Zeng Jinle f56967c483
refine error message of gpu allocator, test=develop (#21008)
5 years ago
Chengmo bc8e600ce5
Fix rpc not wait in GEO communicator (#20967)
5 years ago
Leo Chen 008ed65fd5 Add c++ global current tracer for dygraph (#20882)
5 years ago
Zeng Jinle 5aae595902
fix no_need_buffer_vars_dep, test=develop, test=document_fix (#21007)
5 years ago
xujiaqi01 1d1a07937a
simplify master+patch,remove ins when size != merge_size or has conflict slot (#20913)
5 years ago
Tao Luo 25ffa8445d
refine murmurhash3_x64_128 for bloom_filter (#20996)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
wangchaochaohu bf379fef96
refine code for code reuse test=develop (#20988)
5 years ago
Yucheng 98f1cebd38 add sample code test under python3 and enabled multi-thread (#20950)
5 years ago
Zhang Ting de9bec607e lrn supports channel_last input, test=develop (#20954)
5 years ago
Liufang Sang 9b666cae67 fix diff in dequantize op between cpu and gpu test=develop (#20953)
5 years ago
zhongpu 065804d39e fix bug in grad_op compute for dygraph, test=develop (#20975)
5 years ago
Wilber c534149642
fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960)
5 years ago
Zhang Ting f4f85831d3 fix the bug of conv_transpose cudnn kernel, test=develop (#20958)
5 years ago
wangchaochaohu 7695b713e1
gpu info query refine test=develop (#20904)
5 years ago
Leo Chen 2c3c579b9b tensor.set() supports array list and remove unused code, test=develop (#20959)
5 years ago
WangXi eec4fa9099 And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935)
5 years ago
Leo Chen 9974e40787 Update Tensor.set() to support float16 (#19964)
5 years ago
zhaoyuchen2018 7f3a445e9a
Fix gru as small frame_size has error. (#20922)
5 years ago
Zeng Jinle b0c0ffb9ae
refine pe when exception raises, test=develop (#20894)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
Chengmo 16596f6498
Fix Paddle Cloud role maker (#20860)
5 years ago
liym27 59de8e1214 Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912)
5 years ago
Zhang Ting 8d1e9f0f7e maxout supports channel_last input (#20846)
5 years ago
Yihua Xu b6260f3866 Optimize the kernel implementation of layernorm with openmp (#20895)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Thunderbrook 59bcdc8a19
support dump param of model into afs (#20302)
5 years ago
Chen Weihang 768551b25d Add parameter init check add run_startup_progrom error message for fc(mul) (#20906)
5 years ago
Zhang Ting c18f1bd716 fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897)
5 years ago
Chen Weihang 3358455c86
Polish and arrange code in enforce.h (#20901)
5 years ago
Yiqun Liu 16e4d02675
Refine the cache of program, context and scope in executor. (#18483)
5 years ago
Wilber b489760099
fix jit_matmul bug test=develop (#20886)
5 years ago
Yiqun Liu 03ba0fdae6
Move the codes of fused operators to operators/fused directory. (#20881)
5 years ago
Leo Chen a9bc92c314 add c++ unique_name_generator, test=develop (#20871)
5 years ago
zhang wenhui d428912503
fix select_rows mergeadd bug, test=develop (#20876)
5 years ago
Zeng Jinle c51722c820
refine err msg of allocator, test=develop (#20879)
5 years ago
hong ff0886a92a
save load problem fix and new feature add (#20823)
5 years ago
liym27 6802539a2e support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)
5 years ago
wangchaochaohu 28ca2e5ffa
strided_slice perforamnce improvement test=develop (#20852)
5 years ago
Yiqun Liu 6fcfd32e6c
Check and correct the output's lod_level in DynamicRNN related operators (#19144)
5 years ago
Yiqun Liu b5f3be8330
Implement a pass detect fusion group of elementwise op (#19884)
5 years ago
liym27 84d221b667 improve unsqueeze op to support int, Tensor for argument axes (#20824)
5 years ago
silingtong123 03d7f3ddb2 Make shape tensor support int32 (#20757)
5 years ago
Huihuang Zheng 95ba4bd2ab
Add shape and type check at read_op (#20754)
5 years ago
Zeng Jinle bb8d778358
lazy init of allocators, test=develop (#20854)
5 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
5 years ago
Zeng Jinle 98103d3003
remove some unnecessary logs in pe, test=develop (#20848)
5 years ago
Chen Weihang 8b59ac3ad0 delete paddle infershape enforce marco (#20832)
5 years ago
whs c8e49be2f1
Fix roi_perspective_transform op (#20764)
5 years ago
Chen Weihang 26cc1fe508
Replace risky GetInputType method with secure IndicateVarDataType interface (#20668)
5 years ago
xujiaqi01 48669aa8f0
fix several sparse table issuses (#20686)
5 years ago
Yamei-Lee cf717fd6dd fix bug in reshape: (#20781)
5 years ago
Chen Weihang 1d1552d106
Make formatted ENFORCE stack adapt to more situations (#20826)
5 years ago
Zeng Jinle 378fc4fb1c
add some docs to jit.trace, test=develop (#20811)
5 years ago
Zhang Ting 5a8d885d72 All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)
5 years ago
danleifeng 9171f73714 fix fp16 grid_size for size=1; test=develop (#20812)
5 years ago
Zeng Jinle cd1c404353
refine err msg of allocator, test=develop (#20804)
5 years ago
Zeng Jinle ac813bbaf4
Add more error debug message to Operator::Run (#20793)
5 years ago
Tao Luo efbdad0596
make search_compute support avx default (#20779)
5 years ago
zhongpu 3556514e97 add PADDLE_ENFORCE for dygraph to optimize error throw (#19783)
5 years ago
WangXi 250e72d254 Fix DGC algorithm flow to make it the same as paper (#20758)
5 years ago
wangchaochaohu ba45dce35d
fix codetest for windows make test=develop (#20796)
5 years ago
Zeng Jinle 8ff6b289bd
[Dygraph to static graph]JIT/Trace (#20775)
5 years ago
zhaoyuchen2018 6e6eab07e8
Fix multihead op bug. (#20783)
5 years ago
lvmengsi dfa0549f87
Revert "fix_depthwise_conv_cudnn, test=develop (#20712)" (#20782)
5 years ago
whs 4c7d196d83
Add norm_by_time for warpctc op in padding mode. (#17580)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago
tianshuo78520a 1105b93288 del uninstall protobuf (#20769)
5 years ago
Tao Luo 2f5f19dfb5
mv sampcd_processor.py to tools/ (#20761)
5 years ago
石晓伟 37cd43545a
update the infer shape of matmul, test=develop (#20717)
5 years ago
石晓伟 e742760f8e
optimize version error, test=develop (#20715)
5 years ago
Adam 67b59ddb38 Minor MKL-DNN conv int8 performance fixes (#20753)
5 years ago
wangchaochaohu 0687bcd64f
Refine getitem of Variable (#20729)
5 years ago
zhongpu 72d1d72c09 fix ExecutionContext::HasInput and ExecutionContext::HasOutput depend on the scope structure, test=develop (#20721)
5 years ago
danleifeng 79e08ecebf add assertions on whether elementwise_div divison is zero (#20618)
5 years ago
bingyanghuang fd49ebcbd8 update int8 benchmark with 6271 data, test=develop test=document_fix (#20736)
5 years ago
123malin 95e90aa102
test=develop, add communicator_is_sgd_optimizer flag (#20677)
5 years ago
Aurelius84 74a28f5ea4
fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722)
5 years ago
石晓伟 48a774c713
fix ts_sort's bug, test=develop (#20720)
5 years ago
lvmengsi dc229b4195
fix_depthwise_conv_cudnn, test=develop (#20712)
5 years ago
石晓伟 d8f4f4239d
Ensure backward compatibility with the anakin interface, test=develop (#20691)
5 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
5 years ago
xujiaqi01 5223b0dd9d
add check nan / inf in downpour worker (#20694)
5 years ago
WangXi 507afa8a8a Fix dgc nan by stripping nccl from sparseReduce. (#20630)
5 years ago
gongweibao c1710e91b2
Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690)
5 years ago
lidanqing 46e93f7c86 Revert "Refactor conv computeINT8" (#20640)
5 years ago
liu zhengxi d39777fefa
alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697)
5 years ago
Zeng Jinle 4eeda9d676
fix tensor_util, test=develop (#20699)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 10505faf4e
polish codes, test=develop (#20672)
5 years ago
Tao Luo dd3d8997cf
remove deprecated contrib/float16 directory (#20685)
5 years ago
liuwei1031 569951c418
improve the efficiency of BuddyAllocator (#19888)
5 years ago
tianshuo78520a eafc7023c1 test=develop test=document_fix (#20682)
5 years ago
Zeng Jinle 34e3adaece
Refine reduce codes to save compiling time and binary size (#20676)
5 years ago
liu zhengxi dbc2bb3376
improve the performance of capi in PD_PredictorRun (#20665)
5 years ago
whs a3e641e93c
Fix infer shape of warpctc op. (#20653)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
zhongpu efa10937bd fix elementwise_floordiv_op and elementwise_mod_op (#20534)
5 years ago
tangwei12 04384502a8
fix bug with heart beat , test=develop (#20654)
5 years ago
wangchaochaohu 7783d3bd43
Conv refine (#20644)
5 years ago
lidanqing 57b656f956 Add document for int8 object detection quantization (#19356)
5 years ago
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
5 years ago
Tao Luo dfa239253c
reduce make install time in CI (#20643)
5 years ago
gongweibao f3f52fc1e2
Retry when failed to bind address. (#20642)
5 years ago
qingqing01 01eddc1a04
Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
gongweibao 69e0b98fea Cleanup the debug lines in paddle_build.sh (#20631)
5 years ago
zhaoyuchen2018 8314e64a8b
Fix sum op fails as no memory in tensor(#20602)
5 years ago
Yibing Liu ee2869cae9
Remove redundant infershape in linear chain crf grad, test=develop (#20629)
5 years ago
123malin b4a3b75002
bug fix: invalid learning rate decay in pserver async mode (#20325)
5 years ago
WangXi cadc6a9704 fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
5 years ago
Zeng Jinle 40c258a77b
Refine API.spec mechanism (#20574)
5 years ago
石晓伟 a4753f3a79 Optimize error message of mean_op and matmul_op (#20413)
5 years ago
Leo Chen d6c1d6ca56 update class name, test=develop (#20578)
5 years ago
gongweibao 1d82025e89
Add interface so user can get scaled loss when they use customized loss. (#20571)
5 years ago
liu zhengxi 922d432477
fix the PD_ZeroCopyPredictorRun output problem (#20612)
5 years ago
翟飞跃 36acfaeeda Add fused_embedding_seq layer into fluid.contrib (#19771)
5 years ago
Double_V 0b39218749 memory optimizer for reshape op,test=develop (#20569)
5 years ago
bingyanghuang 85e1f2150b Modify the helper information in full_pascalvoc_test_preprocess.py (#20475)
5 years ago
Thunderbrook f76a32df4a
dump fix dov vec file num (#20539)
5 years ago
chengduo 36c85ef492
Add sub-scope check in RecurrentOp (#20468)
5 years ago
633WHU 12e4be0382 Dlpack support (#20039)
5 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
5 years ago
JesseyXujin 2ff18e537f
add expand_as op, test=develop (#20565)
5 years ago
Zeng Jinle 40effc61af
Refine py_reader exit (#20331)
5 years ago
Zeng Jinle a9c8bdad7b
refine pe codes, test=develop (#20479)
5 years ago
Zeng Jinle 76b321872a
fix cuda dev_ctx by event, test=develop (#20553)
5 years ago
Guo Sheng bd99df715a
Fix basic_gru and docs of gru_unit and dynamic_gru (#19393)
5 years ago
Zhang Ting 78910480c1 fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589)
5 years ago
Yuan Shuai 172e91c008
Refine error message of transpose_op (#20437)
5 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
5 years ago
liym27 fc6ec3b9f6 fill_constant support Tensor; (#20521)
5 years ago
Zhang Ting 0130cc969c fixed group_norm's bug and modified unittest (#20506)
5 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
5 years ago
Diego Zhang 27d1ef6081
Refine seq enum expand mask pad (#20344)
5 years ago
Youwei Song 9a09ff14a5 fix en docs of Layer and guard (#20512)
5 years ago
guofei 9b85f40140 Modify English documents (#20452)
5 years ago
liuwei1031 9dc83dda7a
update data feeder API sample, change fluid.layers.data to fluid.data (#20568)
5 years ago
liuwei1031 9d6ee5eb1f
fix doc of default_main_program, multiprocess_reader (#20536)
5 years ago
zhongpu ece611b028 update paddle_build.sh, test=develop (#20443)
5 years ago
Yiqun Liu ce1b25cc8b
Polish the English documentation of sums (#20495)
5 years ago
lanxianghit 22ecaef03b
Add API 'fluid.requird_version(min_version, max_version=None)' to check if the installed PaddlePaddle version is satisfied, test=develop (#20263)
5 years ago
zhaoyuchen2018 8fb569e5b9
Fix api doc example bug and polish square doc (#20491)
5 years ago
hong19860320 512c0bb04d
refine the en api doc of ones, zeros, reverse, assign, increment, hsigmoid and create_py_reader_by_data ops (#20343)
5 years ago
Guo Sheng dfd1eee7f7 Add seq2seq api related code (#19820)
5 years ago
silingtong123 e87cabb7f2 updates document of sequence_softmax, sequence_scatter, sequence_unpad (#20269)
5 years ago
silingtong123 d5aa2dd818 fix doc, updates API documents of uniform_random and uniform_random_batch_size_like (#20316)
5 years ago
Wilber 751812a674
enable cpu machine to run paddle in gpu lib
5 years ago
lvmengsi 2384589383
Fix conv_grad_grad (#20469)
5 years ago
Double_V 8299203370 Support reshape_op double gradient (#20304)
5 years ago
LielinJiang faa8e30a14 Set batch norm and data norm argument 'do_model_average_for_mean_and_var' default as True (#20421)
5 years ago
hong19860320 4d0d5e4cc7
refine eng doc for hard_sigmoid op (#20442)
5 years ago
Aurelius84 22823df2e2
enhance embedding error message test=develop (#20246)
5 years ago