Commit Graph

2771 Commits (063c51c7487d83b6618ba7a8b6f218526119b3e0)

Author SHA1 Message Date
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
5 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
5 years ago
Chen Weihang 952508527a
Polish some PE code details (#21274)
5 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
5 years ago
Thunderbrook 349e82d669
support general embedding params (#21217)
5 years ago
Yiqun Liu 6b1e1f0dda
Enable generating code for a given subgraph. (#21126)
5 years ago
Zeng Jinle a152315be7
refine Tensor method, test=develop (#21031)
5 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
5 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
5 years ago
xujiaqi01 23876de55b
fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)
5 years ago
xujiaqi01 9e045170c0
add copy table (#21086)
5 years ago
Chen Weihang 4bd9463630
fix detail error message error, test=develop (#21170)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
Zeng Jinle d625aaf0c1
remove so many logs of parallel executor, test=develop (#21105)
5 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Yiqun Liu 9091f8cdf9
Support generating code for grad_op (#21066)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Zeng Jinle 5aae595902
fix no_need_buffer_vars_dep, test=develop, test=document_fix (#21007)
5 years ago
xujiaqi01 1d1a07937a
simplify master+patch,remove ins when size != merge_size or has conflict slot (#20913)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
Wilber c534149642
fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960)
5 years ago
WangXi eec4fa9099 And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935)
5 years ago
Zeng Jinle b0c0ffb9ae
refine pe when exception raises, test=develop (#20894)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Thunderbrook 59bcdc8a19
support dump param of model into afs (#20302)
5 years ago
Yiqun Liu 16e4d02675
Refine the cache of program, context and scope in executor. (#18483)
5 years ago
hong ff0886a92a
save load problem fix and new feature add (#20823)
5 years ago
Yiqun Liu 6fcfd32e6c
Check and correct the output's lod_level in DynamicRNN related operators (#19144)
5 years ago
Yiqun Liu b5f3be8330
Implement a pass detect fusion group of elementwise op (#19884)
5 years ago
Huihuang Zheng 95ba4bd2ab
Add shape and type check at read_op (#20754)
5 years ago
Zeng Jinle 98103d3003
remove some unnecessary logs in pe, test=develop (#20848)
5 years ago
Chen Weihang 26cc1fe508
Replace risky GetInputType method with secure IndicateVarDataType interface (#20668)
5 years ago
xujiaqi01 48669aa8f0
fix several sparse table issuses (#20686)
5 years ago
Chen Weihang 1d1552d106
Make formatted ENFORCE stack adapt to more situations (#20826)
5 years ago
Zeng Jinle ac813bbaf4
Add more error debug message to Operator::Run (#20793)
5 years ago
wangchaochaohu ba45dce35d
fix codetest for windows make test=develop (#20796)
5 years ago
zhongpu 72d1d72c09 fix ExecutionContext::HasInput and ExecutionContext::HasOutput depend on the scope structure, test=develop (#20721)
5 years ago
石晓伟 48a774c713
fix ts_sort's bug, test=develop (#20720)
5 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
5 years ago
xujiaqi01 5223b0dd9d
add check nan / inf in downpour worker (#20694)
5 years ago
WangXi 507afa8a8a Fix dgc nan by stripping nccl from sparseReduce. (#20630)
5 years ago
Zeng Jinle 4eeda9d676
fix tensor_util, test=develop (#20699)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 10505faf4e
polish codes, test=develop (#20672)
5 years ago
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
WangXi cadc6a9704 fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
5 years ago
Thunderbrook f76a32df4a
dump fix dov vec file num (#20539)
5 years ago
633WHU 12e4be0382 Dlpack support (#20039)
5 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
5 years ago
Zeng Jinle a9c8bdad7b
refine pe codes, test=develop (#20479)
5 years ago
Zeng Jinle 76b321872a
fix cuda dev_ctx by event, test=develop (#20553)
5 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
5 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
5 years ago
xujiaqi01 22b80e1246
fix parse content in CreatePreLoadReaders (#20258)
5 years ago
hong fa43e80e19 New save load interface (#20148)
5 years ago
Zeng Jinle c20b11ba11
simplify op_info.h, test=develop (#20195)
5 years ago
hong 0ec2c081d9
update op compatible list; test=develop (#20175)
5 years ago
tangwei12 c9139c3db3
trainer from dataset fetch targets (#19760)
5 years ago
chengduo bfa55c9ddb Add place deps for fused_all_reduce_op_handle (#20077)
5 years ago
Zeng Jinle 5fef859c65
remove map type from var_type_traits.h, test=develop (#20090)
5 years ago
Zeng Jinle 4ad66c779c
fix op_compatiable_compile_error, test=develop (#20076)
5 years ago
qingqing01 1a3eef026c
Enable users to create custom cpp op outside framework. (#19256)
5 years ago
bingyanghuang 9de6772510 Follow comment of Merged QAT PR 18970 (#19979)
5 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
5 years ago
joanna.wozna.intel f5221ac19f Disable conv requant squash (#20041)
5 years ago
wangchaochaohu c9ea317b36
codegen code for reconstruction (#19728)
5 years ago
tangwei12 8f0b3c0516
the integrated communicator (#19849)
5 years ago
Chen Weihang b916335025 Paddle error message stack shaping and optimization (#19895)
5 years ago
chengduo 2450d15b78
disable fuse_all_optimizer_ops (#19966)
5 years ago
chengduo 101a2b610a Add dtype for coalesce_tensor_op (#20016)
5 years ago
Huihuang Zheng 88af4ab650
Add new data layer (#19916)
5 years ago
xujiaqi01 f50e701b3b
fix memory leak in HogwildWorker (#19956)
5 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
5 years ago
Zeng Jinle cc157d5990
add inplace to assign op, test=develop (#19927)
5 years ago
chengduo 55ce696986
clean tensor array (#19930)
5 years ago
chengduo d7251a8e1e
Delete local execution scopes (#19749)
5 years ago
wopeizl 5452b6a152
remove the useless warning for user to avoid confuse test=develop (#19871)
5 years ago
hong 85b398f171
Add op compatible information (#19910)
5 years ago
Huihuang Zheng e117114289
Set states of recurrent op as dependent vars in prune (#19865)
6 years ago
Zeng Jinle b754700fb5
fix reduce and broadcast to avoid multi-stream, test=develop (#19889)
6 years ago
joanna.wozna.intel 3f1d0234ae Fix conv2d+dequantize squash for residual fusion (#19545)
6 years ago
Huihuang Zheng a35557d8f4
Fix deps of prune (#19876)
6 years ago
Leo Chen 578a2f5da3 fix SplitLodTensor when batch_size = 0, test=develop (#19866)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
Zeng Jinle 3f87464e9c
refine executor_gc_helper codes, test=develop (#19814)
6 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
6 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
6 years ago
Thunderbrook 40c66f8df9
rm return in vfork (#19734)
6 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
6 years ago
Jiabin Yang cc311bdf95
Feature/add transform data dygraph (#19707)
6 years ago
Zeng Jinle 754fd57ed7
disable memory optimization passes when FLAGS_use_ngraph=True, test=develop (#19778)
6 years ago
chengduo 8281497030
Fix warning info of build_strategy (#19805)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
6 years ago
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
6 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
chengduo e506c99c20
Open fuse broadcast option (#18833)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
chengduo 5866a7a5fe
Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Zeng Jinle bb4f8dee83
add logs to left var memory size, test=develop (#19722)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
Zeng Jinle 713c05dd60
refine tensor.mutable_data, test=develop (#19680)
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
6 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
6 years ago
mapingshuo dca9b6c5b0 add feed_var_names to Prune interface (#19589)
6 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
tensor-tang e3e98ed678
fix scope lock bug on infer (#19624)
6 years ago
Tao Luo 0a46d34538
refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607)
6 years ago
baojun a3a4b6e570 Enable ngraph through build_strategy (#19266)
6 years ago
Adam 8d6d95cc2b paddle::framework::vectorize() templatization (#19611)
6 years ago
Tao Luo 75d1571995
refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
Adam e94b26daf5 using MKLDNNMemoryFormat = mkldnn::memory::format changes (#19568)
6 years ago
gongweibao abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
6 years ago
Zeng Jinle 19474019c2
fix fast pe to run highest priority ops first, test=develop (#19575)
6 years ago
Zeng Jinle 0af8549750 fix seg fault of share lod, test=develop (#19573)
6 years ago
hutuxian c756b5d231
Paddlebox Framework (#18982)
6 years ago
Jacek Czaja ecd9f330c9 [MKL-DNN] Fix to face model on AVX512 platforms (#19282)
6 years ago
yaoxuefeng 10ca3f9609
add thread scope stat accurate metrics test=develop (#19480)
6 years ago
Tao Luo 02270b3eb1
remove unused assert.h (#19529)
6 years ago
chengduo e340df013e
Support feed single persistable variable to PE (#19417)
6 years ago
Yiqun Liu fcec365d29
Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
Thunderbrook 1fe468d319
support debug each output of each ins (#19004)
6 years ago
Zeng Jinle 5c8f210ce3
refine inplace inference registry, test=develop (#19032)
6 years ago
chengduo b6d1d8901f
Increase num_iteration_per_drop_scope (#19075)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
joanna.wozna.intel 2e3ec66be0 Add conv dequant squash for int8 (#18905)
6 years ago
Tao Luo c82280e445
remove unused conv_elementwise_add2_act_fuse.cc (#19344)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Tao Luo e3c68bde78
stronger the error message of tensor's mutable_data (#19303)
6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
liuwei1031 50582071dc
fix compilation issue in windows vs2017 (#19183)
6 years ago
juncaipeng 5368b36512 remove the warning for reminding user to avoid using the OriginProgram method, test=develop (#19244)
6 years ago
chengduo 8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago