3196 Commits (5165bc854ac35c22c3d1d8c04629f5420972a23a)

Author SHA1 Message Date
WangXi 2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer (#25738)
5 years ago
WangXi a6c87fd091
Add amp to fleet meta optimizer, test=develop (#25770)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
JZ-LIANG 8ebffc78c9
add lars to fleet meta optimizer (#25884)
5 years ago
Dong Daxiang 8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies (#25844)
5 years ago
Chen Weihang 48b9a56f1c
Polish framework error message - part 4 (#25807)
5 years ago
lilong12 5f30e57cdd
fix test_pipeline, test=develop (#25808)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
Dong Daxiang a96d54ac19
Generate final strategy (#25782)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Dong Daxiang 920d998f1e
add more settings for distributed strategy (#25685)
5 years ago
Chen Weihang c34c80d302
Polish framework error message part3 (#25701)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago
Chen Weihang 364cc53618
Polish paddle fluid framework error message - part2 (#25667)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
wanghuancoder 1917b38099
fix some errmsg report,in framework/ir/, about 21 files (#25525)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Chen Weihang dfb3ae1b9b
Polish some error message in framework holder - Part 1 (#25509)
5 years ago
wanghuancoder 9b46fe0440
fix some errmsg report,in framework/ir/, about 5 files (#25539)
5 years ago
Dong Daxiang e657d7062d
fleet base initial implementation and the API (#25442)
5 years ago
wanghuancoder e65c5b8e83
fix some errmsg report, in framework/ir/ (#25471)
5 years ago
wanghuancoder 6c0982b942
fix some errmsg report, in framework/ir/mkldnn (#25467)
5 years ago
wanghuancoder fce6466217
fix some errmsg report, in framework/ir/ subdir(memory,optimizer,multi_device) (#25460)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
Chen Weihang 4061aa6488
Polish ParallelExecutor exception process logic (#25449)
5 years ago
lilong12 e39aa70ec7
add the support for pipeline (#24560)
5 years ago
hong 70d7d07fea
catch bad alloc exception (#25140)
5 years ago
gongweibao 80f1c50738
Fix typo in interface. (#24779)
5 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
5 years ago
Dong Daxiang d5e40d1ba9
Paddle fleet distributed strategy (#25379)
5 years ago
Wojciech Uss d0a921ba98
Quant2 updates and fixes (#25313)
5 years ago
Jacek Czaja 17c751bec6
[oneDNN] Fix to #25078 (#25256)
5 years ago
MRXLT 3b8f0a64c2
Encryption infer (#25119)
5 years ago
Sylwester Fraczek 36abeff44f
adding elementwiseadd quantization (#25178)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Leo Chen 25a4dac4c2
Use allow list instead of white list (#25002)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
Sylwester Fraczek 53d563a0fe
Reshape transpose matmul coverage (#24970)
5 years ago
Sylwester Fraczek a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free (#24929)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
lilong12 29de0d97a5
add the support to specify device index for device_guard (#24555)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
Chen Weihang 6aae034f8d
add dep for fs.cc, test=develop, test=document_fix (#24881)
5 years ago
Jacek Czaja 40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor (#24756)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
123malin 9d2bd0ac38
Add try_catch mechanism to downpour_worker and print all program parameters (#24700)
5 years ago
Michał Gallus b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail (#24853)
5 years ago
Wojciech Uss 78d4f0cc91
add option to exclude ops by id from quantization (#24689)
5 years ago
Yanghello 62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed (#24816)
5 years ago
Yanghello 5a7a517cde
Add crypto api (#24694)
5 years ago
Chen Weihang 19e5f7879c
Append error op hint for GradOpMaker (#24750)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Wilber ba2f8f0ce4
fix embedding_eltwise_layernorm_fuse_pass. test=develop (#24592)
5 years ago
hutuxian 0ec3a42e97
Random Dump (#24477)
5 years ago
Yiqun Liu 6b464f969a
Add an operator node in unittest to make the fusing result unique. (#24617)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Wojciech Uss db052009c7
Enabled quantize all and skip missing in QAT (#24281)
5 years ago
Huihuang Zheng 8a1a2af82e
Add Assert Op (#24280)
5 years ago
joanna.wozna.intel 356f5ee220
[Refactoring] Unify op-dequant squashes (#24277)
5 years ago
liym27 ac9a7eeea4
[Dy2Stat]Support list pop (#24250)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
wawltor d1e1d85881
add the graph batch reader for pslib mode (#24178)
5 years ago
joanna.wozna.intel b43b46e619
[INT8] Add requant-op squash (#24143)
5 years ago
hutuxian 3e2bc8715f
Try to fix UT Random Fail (#24223)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
Chen Weihang 9b851ba216
[dy2static] Add print transformer and unify print format (#24068)
5 years ago
wangchaochaohu fa43d74a3a
fix the intermediate node of graph for fusion group test=develop (#24184)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
liuwei1031 9a93f6aae0
improve efficiency of runtime InferVarType (#22778)
5 years ago
wangchaochaohu 2270864019
Fusion group optimize for cuda codegen(#23940)
5 years ago
ShenLiang 94dfb7d770
opt the postprocess, test=develop (#24155)
5 years ago
Jacek Czaja eb411613e9
[DNNL] refine activations Inplace support (#24145)
5 years ago
Jacek Czaja 461e6a01ec
[DNNL] activations Inplace support (#24123)
5 years ago
Zhang Ting fb0729ee7f
avoid warnings in MAC compile (#24124)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
Zeng Jinle a67eea9f00
polish code by adding final, test=develop, test=develop (#24114)
5 years ago
Zeng Jinle acef55df04
fix isolated var fetch bug, test=develop (#24070)
5 years ago
Jacek Czaja c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace (#23477)
5 years ago
hutuxian 9ff558a46f
Optimize DataFeed (#23957)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
ShenLiang 7f0b2c7407
fix memory leaking problem of dataset, test=develop (#23955)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
Yiqun Liu 071a702060
Fix the error misjudgment when there are control nodes in graph. (#23943)
5 years ago
hutuxian df64a96686
support set_test_mode and set comlog level(#23905)
5 years ago
Zhang Ting b88662254b
use 32 bit index to improve expand op (#23899)
5 years ago
yiicy a1e7387919
Variable error message enhancement, test=develop (#23548)
5 years ago
yaoxuefeng 5b69242fab
modify datanorm op test=develop (#23030)
5 years ago
Zeng Jinle c49791362f
Correct reader device index (#23802)
5 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
5 years ago
Chen Weihang 532079a222
API (CompiledProgram) error message enhancement (#23559)
5 years ago
wawltor f3d7db98f1
Add the support of bool list for assign_value op (#23774)
5 years ago
zhongpu b4b6763ab2
fix bug for exhaustive_search in conv_fusion_op, test=develop (#23727)
5 years ago
Yiqun Liu 9e85d02373
Avoid crash when calling ctx->HasInputs and add the check of shape in fill_copnstant op. (#23698)
5 years ago
Huihuang Zheng 1d3b0134ca
Error Message Enhancement (#23483)
5 years ago
wangchaochaohu fb34bdb40c
API/OP(fill_constant) error message enhancement (#23584)
5 years ago
liuwei1031 2fd728a978
add new dot op(#23418)
5 years ago
chenhaoze 9b06dd8628
Add three passes and api reference of paddle_pass_builder. test=develop (#23741)
5 years ago
xujiaqi01 d98084e7ec
add save with prefix (#23449)
5 years ago
joanna.wozna.intel 5ee099ca57
Op-requant squash (#23665)
5 years ago
hutuxian 94a3789fd0
Add AfsAPI in PaddleBox (#23419)
5 years ago
liym27 06d4aa4e73
API (BuildStrategy) error message enhancement. (#23462)
5 years ago
Zhen Wang 84cd45f674
Solve the conflict of ops with the same name, test for CI. (#23573)
5 years ago
Zeng Jinle 7f3e0eaad1
refine error msg, test=develop (#23589)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Zhang Ting 1b8fe70e48
fix VLOG, test=develop (#23327)
5 years ago
Chen Weihang 45880f604b
API(Program) error message enhancement (#23519)
5 years ago
joanna.wozna.intel 3cb5623dad
Add matmul dequant squash (#23505)
5 years ago
wangchaochaohu c1187cd6f4
Fp16 refine for fusion group (#23472)
5 years ago
joanna.wozna.intel ce08fdcf2b
Add support for INT8 matmul in C-API quantization (#23463)
5 years ago
Aurelius84 8674a82c03
Op (Scope) error message enhancement (#23458)
5 years ago
wangchaochaohu d085f79228
fix untime fail for output var stop_gradient=True for fusion group (#23317)
5 years ago
qingqing01 6162cf2f2e
Make optimizer consistent in dygraph and static-graph and remove some LOG-INFO. (#23426)
5 years ago
ShenLiang 5223e2bbc4
Add a new DataFeed named PaddleBoxDataFeed (#23321)
5 years ago
Chen Weihang 75bd350710
Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171)
5 years ago
Kaipeng Deng d223a24904
Fix inplace_abn compile error on Windows (#23464)
5 years ago
Tao Luo 0b583235f5
Revert "Solve the conflict of ops with the same name. (#23199)" (#23494)
5 years ago
wawltor 6577f91b74
Add the sum op to API 2.0, add some parameters for new api
5 years ago
Zhen Wang abe3e6906d
Solve the conflict of ops with the same name. (#23199)
5 years ago
tianshuo78520a d8a21ef6f3
test=develop;fix error (#23467)
5 years ago
zhongpu dbfbd7eac4
support Exhaustive search in dygraph (#23415)
5 years ago
gongweibao 24a063f6ac
Add fleet checkpoint on local fs and remote fs(such as hdfs) for EDL (#22586)
5 years ago
wangchaochaohu 5c60778731
polish the code of fusion group test=develop (#23370)
5 years ago
Leo Chen a62599a888
[feature] prune program by feed and fetch_list automatically (#22474)
5 years ago
Yiqun Liu bc2981e998
Disable test_code_generator and test_post_training_quantization_mobilenetv1 (#23440)
5 years ago
Zeng Jinle 29337f4e17
fix conflict of inferne partial feed with gpu parallel ssa graph executor, test=develop (#23400)
5 years ago
zhongpu bfb07aafe8
Revert "Exhaustive search (#22821)", test=develop (#23401)
5 years ago
xujiaqi01 93ea9dd27a
fix stat var in hogwild worker (#23367)
5 years ago
joanna.wozna.intel 8c463700e1
Add default pass attributes (#23042)
5 years ago
zhongpu 48144e4099
Exhaustive search (#22821)
5 years ago
Kaipeng Deng 21d95be0db
Add inplace abn op (#22806)
5 years ago
Yi Liu 821534efd3
add paralell_executor dependancy to collective_helper (#23380)
5 years ago
Zeng Jinle 3a21980b78
add reader dependency pass, test=develop (#23301)
5 years ago
wangchaochaohu d280106007
Add support for attr type Op and add fill_constant Op and scale Op (#23163)
5 years ago
xujiaqi01 3a45767d49
add fleet pslib pull and push sparse op and push dense op (#23139)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
Wojciech Uss f836c8aa8f
add check for scales and a message (#23119)
5 years ago
Tao Luo c00d427d52
simplify the cmake log of ir/CMakeLists.txt (#23262)
5 years ago
xujiaqi01 68ea1ad55b
add clear one table (#23089)
5 years ago
danleifeng ae3bb16d06
add MaskAucCalculator in paddlebox (#23157)
5 years ago
Zeng Jinle 53e6f8e1da
rename macro, test=develop (#23161)
5 years ago
Zeng Jinle 7ca77a90ac
add Tensor::IsSharedBufferWith method, test=develop (#23175)
5 years ago
Zeng Jinle b8886bf122
rename no_need_buffer_vars_macro, test=develop (#23159)
5 years ago
Zeng Jinle bae5930ba1
fix graph attr copy issues, test=develop (#23191)
5 years ago
Zeng Jinle acfc9b8a70
Reader sequential and inference partial feed (#22699)
5 years ago
Wilber 95b356a069
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114)
5 years ago
Zeng Jinle a31d7328b7
Add dygraph double grad implementation (#22939)
5 years ago
Yiqun Liu 3af4771122
Add the detection and code-generation of sqrt and square in fusion_group (#23095)
5 years ago
hutuxian 0c30098f8b
Add need_save_delta parameter to solve OOM (#23097)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Zhang Ting 880eb04d93
skip PrepareData when it is unnecessary (#22839)
5 years ago
Adam 5842ae6785
Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)" (#22985)
5 years ago
yaoxuefeng 660ff18488
fix datsset test=develop (#23043)
5 years ago
wangchaochaohu 3757e0687c
Add Unittest for backward of fusion group (#22932)
5 years ago
wangchaochaohu f0d193a23c
Cast fusion for fusion group (#22876)
5 years ago
Wilber ff3ddbb502
add skip_layernorm pass. test=develop (#22895)
5 years ago
Adam 056edf3929
Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
Zeng Jinle d33c4343e1
Imperative tracer refactoring (#22457)
5 years ago
liu zhengxi 61fef9754b
Fix fc padding bug during inference fusion (#22860)
5 years ago
wangchaochaohu dbb0b9b3b6
refine the profiler print (#22823)
5 years ago
hong 5191e54494
reduce default attrs for dynamic graph (#22850)
5 years ago
Zhang Ting 72ff5a09c3
fix print bug of profile, test=develop (#22804)
5 years ago
Zhang Ting 4e8bc02461
add fluid.device_guard to specify the device type for Op (#22254)
5 years ago
Zhen Wang 89cfa49156
Unmerged fetch list (#22635)
5 years ago
hutuxian 53a2b68f4e
support customized download command in dataset (#22782)
5 years ago
wangchaochaohu ca9e77a8d4
add sum op support for fusion group (#22771)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
Leo Chen b2c1be851a
support cond in clone, test=develop (#22657)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
wangchaochaohu c65c6ae534
add flag to control profile level in python API (#22319)
5 years ago
123malin 00594c1c88
support dumping params/grads in transpiler mode (#22490)
5 years ago
flame f7eafca828
remove python inference warning (#22602)
5 years ago
Wilber 9a8203aa25
fix fc_lstm_fuse when multi sub-graph use same fc_bias. test=develop (#22551)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago
Yiqun Liu 96770f519e
Disable fusion_group for windows and mac in build_strategy. (#22549)
5 years ago
tangwei12 b0675c8193
fix bug with compiledProgram (#22495)
5 years ago
hutuxian 1a7962be97
Paddlebox about box_wrapper (#22497)
5 years ago
yaoxuefeng 2235ee1a5e
multi-loss optimization by adding a DownpourOpt worker (#22025)
5 years ago
zhaoyuchen2018 54970444ce
Improve transpose performance with tile sm copy, test=develop (#22311)
5 years ago
Wilber a90fa54092
Compile without nccl deps. [1/2] (#22509)
5 years ago
guofei 3a59a7a11f
Make assign op support LoDTensorArray and modify while_loop API (#22309)
5 years ago
Yiqun Liu dcfb603897
Enable the detection of subgraph composed of grad ops (#21223)
5 years ago
joanna.wozna.intel 17f2c0899f
Add dequant-scale squash (#22409)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
xujiaqi01 d51ffe860a
fix copy table bug (#22432)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
xujiaqi01 371f377bea
add GeneralRoleMaker (#22295)
5 years ago
Michał Gallus 269db0d1d1
[DNNL] Fix accuracy in INT8 FC (#22404)
5 years ago
joanna.wozna.intel 3099d9d47c
Restore requantize squash (#22399)
5 years ago
Adam e7a9f6bbb7
[Bugfix] Preserve shape in inpalce operators (#22360)
5 years ago
Yiqun Liu b7cac50b64
Implement a common python unittest to test the ir passes. (#22209)
5 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
5 years ago
Leo Chen 3e5744aa65
Remove unused inputs for some operators (#22284)
5 years ago
lidanqing 895f8da7d6
change std::cout to log(INFO), vlog (#22316)
5 years ago
Zhen Wang e40cfb1010
fix the bug of assert_is_op_output. test=develop (#22262)
5 years ago
Wojciech Uss d3a6647372
improve placement pass tests code coverage (#22197)
5 years ago
zhouwei25 549e6de7ac
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
xujiaqi01 e3a457d34b
add collective communication library in fleet (#22211)
5 years ago
Zhen Wang f2522e91c4
fix the type error caused by setting bool attr in OpDesc. test=develop (#22257)
5 years ago
Chen Weihang fc0b21e17b
Polish fetch error message of parallel executor (#22206)
5 years ago
wangchaochaohu 621d3e0b66
fix the bug of profile update (#22207)
6 years ago
Zhen Wang 46189b166d
Add bn and relu fuse pass (#22048)
6 years ago
zhongpu d0f0a2520c
test Optimizer in dygraph (#21949)
6 years ago
joanna.wozna.intel 5b2e98aa17
Add multiple quantize operators fuse (#22062)
6 years ago
Yiqun Liu 96980c2244
Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144)
6 years ago
wangchaochaohu c3876cf82d
add support for nested profiling event and printing in different level (#22061)
6 years ago
liu zhengxi 724b13e459
fix xception precision problem, test=develop (#22124)
6 years ago
Yiqun Liu b1401fb74d
Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094)
6 years ago
bingyanghuang 4b4a9cc88f
fix format in operator.cc (#22101)
6 years ago
silingtong123 6c20e7c4e6
test=develop, remove unused parameter from class RuntimeInferShapeContext constructors (#22046)
6 years ago
Jacek Czaja b0b27ff699
[MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088)
6 years ago
Huihuang Zheng dd4361568e
Add ParallelExecutor Test for Cond API and Fix PE Checks Shape Bug (#22029)
6 years ago
Jacek Czaja ad8a9cb82c
[MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747)
6 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
6 years ago
Michał Gallus 6192108408
[DNNL] 3D Fully-Connected (#21746)
6 years ago
liu zhengxi 196e20dfbb
Fix multi-threads memory out of bounds error for passes (#21920)
6 years ago
石晓伟 03479469a7
fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
6 years ago
Pei Yang 3e5008ad01
fix trt calib not working bug, test=develop (#21934)
6 years ago
qingqing01 2066745847
Pack imperative/layer into paddle_framework.so (#21921)
6 years ago
Aurelius84 51a86d2b6b
Optimize adam speed (#21777)
6 years ago
Thunderbrook c3cf42d0f7
add table id in cache shuffle (#21585)
6 years ago
WangXi 17299b8d21
fix batch_norm_grad infer shape=0 & add allreduce enforce shape, test=develop (#21801)
6 years ago
Huihuang Zheng 557bce77da
Fix Backward Bugs in Conditional Block (#21809)
6 years ago
xujiaqi01 0eb4d990c4
fix compiled error when with_pslib=on (#21769)
6 years ago
lidanqing d3a96632fa
Add fc-dequantize squash in cpu_quantize_squash_pass for ernie model (#21714)
6 years ago
WangXi 8754cbd1f2
fix std::min type in nan_inf, test=develop (#21725)
6 years ago
joanna.wozna.intel d419b859c0
Add reshape int8 mkldnn op (#21428)
6 years ago
WangXi 8a0f611b64
Rewrite check nan inf tools (#21076)
6 years ago
tangwei12 9ad940fdfe
memory leak for cpu (#21174)
6 years ago
Zeng Jinle 73461a7ae6
Make OperatorWithKernel::InferShape abstract (#21633)
6 years ago
Zeng Jinle 6828f3684b
fix op_registry, add ignore op_function_impl.h, test=develop (#21654)
6 years ago
Adam e81f0228df
MKL-DNN 1.0 Update (#20162)
6 years ago
xujiaqi01 f404157205
fix master patch when slot is dense (#21580)
6 years ago
xujiaqi01 c05706fe73
fix code style of fleet_wrapper (#21639)
6 years ago
xujiaqi01 88960684aa
rm optimize_for in framework.proto (#21571)
6 years ago
Zeng Jinle 0f8888360e
Polish op registry codes (#21561)
6 years ago
hutuxian c5aec2fe68
Paddlebox Related to Framework (#21586)
6 years ago
liym27 9da7e6b4d4
add file check_op_desc.py and add interface to get default value. (#21530)
6 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
6 years ago
Jacek Czaja 18a5d30754
[MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
6 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
6 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
6 years ago
wangchaochaohu d4776ec027
fix the correctness of memcpy profiling result test=develop (#21458)
6 years ago
Jie Fang 5e813b53c5
nhwc optimization for batchnorm (#21090)
6 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
6 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
6 years ago
Jacek Czaja cd43c4440e
[MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
6 years ago
Zeng Jinle 6b09b73e17
add explicit conversion to NoNeedBufferVarsFunctor, test=develop (#21430)
6 years ago
hong ac8546701d
Add dygraph execution context (#20157)
6 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
6 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
6 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
6 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
6 years ago
Youwei Song d5ff79e55e
Support numpy bridge (enabled by default in dygraph mode) (#20983)
6 years ago
GaoWei8 8493f20ebc
Polish the codes of fc when needs padding (#21378)
6 years ago
Michał Gallus 5d7d548275
INT8 Fully-connected (#17641)
6 years ago
GaoWei8 234060f88f
Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
zhouwei25 345b67b5e2
remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
6 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
6 years ago
Yiqun Liu c918788ba9
Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
6 years ago
Chen Weihang 952508527a
Polish some PE code details (#21274)
6 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
6 years ago
Thunderbrook 349e82d669
support general embedding params (#21217)
6 years ago
Yiqun Liu 6b1e1f0dda
Enable generating code for a given subgraph. (#21126)
6 years ago
Zeng Jinle a152315be7
refine Tensor method, test=develop (#21031)
6 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
6 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
xujiaqi01 23876de55b
fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)
6 years ago
xujiaqi01 9e045170c0
add copy table (#21086)
6 years ago
Chen Weihang 4bd9463630
fix detail error message error, test=develop (#21170)
6 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
6 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
6 years ago
WangXi de5d3ff688
Fix dgc buffer illegal & reuse velocity (#21012)
6 years ago
Zeng Jinle d625aaf0c1
remove so many logs of parallel executor, test=develop (#21105)
6 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
6 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
6 years ago
Yiqun Liu 9091f8cdf9
Support generating code for grad_op (#21066)
6 years ago
joanna.wozna.intel 77c2083586
Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
6 years ago
Zeng Jinle 5aae595902
fix no_need_buffer_vars_dep, test=develop, test=document_fix (#21007)
6 years ago
xujiaqi01 1d1a07937a
simplify master+patch,remove ins when size != merge_size or has conflict slot (#20913)
6 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
6 years ago