Commit Graph

3196 Commits (5165bc854ac35c22c3d1d8c04629f5420972a23a)

Author SHA1 Message Date
joanna.wozna.intel 7821759d48
Add bfloat16 softmax and gelu (#28394)
5 years ago
石晓伟 c41fd033e5
check op_version_registry in CI test, test=develop (#28402)
5 years ago
Jacek Czaja ca41541472
[oneDNN]Sum bf16 kernel (#28382)
5 years ago
lidanqing 12b9587be5
Add conv_bias pass version python test (#28278)
5 years ago
石晓伟 21a63f6f90
enhance the op_version_registry, test=develop (#28347)
5 years ago
joanna.wozna.intel 571a63e7ec
Add bf16 transpose2, reshape2, concat ops (#28195)
5 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
5 years ago
Chen Weihang 813b2ade34
Enrich the python error types of paddle & polish format (#28124)
5 years ago
Adam Osewski 7db747d9e8
oneDNN BatchNorm + Act fusion pass. (#27912)
5 years ago
mapingshuo 81244fbfab
add sharding strategy in fleet(#27900)
5 years ago
Chen Weihang 2babd6ff67
Add compile limit for PADDLE_ENFORCE without error message (#28221)
5 years ago
Leo Chen 1f3be85914
Fix bug of fetch_async_op_handle when fetching the feed variable (#28194)
5 years ago
lidanqing 7cb4a8b8f2
[oneDNN] Conv dilation support (#27914)
5 years ago
Zhou Wei 2ac6c6c3af
fix bug of tensor copy of CUDAPinnedPlace (#27966)
5 years ago
guofei 6bbb6e7f45
Implement the function of OutScaleForTraining/OutScaleForInference in dygraph (#26601)
5 years ago
Thunderbrook 3ee6ad6ec5
solve bug in pull_dense_worker (#27918)
5 years ago
zhang wenhui 5a83496c8d
Multi task (#26002)
5 years ago
wanghuancoder 41aad9bfcd
revert 4 files, from clear include by iwyu, test=develop (#27895)
5 years ago
Leo Chen 049696bf67
Refine the format of printing tensor (#27673)
5 years ago
Chengmo c5f2802d56
【paddle.fleet】Update fleetrun & ps-heter (#27472)
5 years ago
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
5 years ago
Jacek Czaja 631c1f3018
- Fix to 27398 (#27770)
5 years ago
Jacek Czaja 606611d351
[oneDNN] GRU BF16 kernel (#27731)
5 years ago
Jacek Czaja b9fda2ff09
Fix to issue #25537 (#27546)
5 years ago
Wojciech Uss 966447e338
Added support for quantization of fusion_gru (#27518)
5 years ago
Pei Yang 8a4f85feb9
Add unittests and OP version registry for quant_conv2d_dequant_fuse_pass (#27689)
5 years ago
AshburnLee c3a3df6466
Add cuda support for unique op (#27646)
5 years ago
Leo Chen 35074963e3
Refine error msg in paddle/fluid/framework/details [part 2] (#27429)
5 years ago
Chengmo 0e101c4f6f
Fix test dist fleet heter ctr (#27513)
5 years ago
joanna.wozna.intel b0ee1405f7
Add conv2d bfloat16 support (#27325)
5 years ago
Thunderbrook 6f69a4cb05
add xpu in heter mode (#27000)
5 years ago
WangXi e550fc02ae
fleet2.0 add fp16 grad compression (#27480)
5 years ago
cc c5c13473c6
Add compatibility check for four mkldnn pass (#27364)
5 years ago
Wilber 3d5522146e
register seq_concat_fc_fuse pass. (#27479)
5 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
Pei Yang 8182337096
clear pass logs (#27434)
5 years ago
Shang Zhizhou d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass (#27404)
5 years ago
Leo Chen aba759ba16
[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112)
5 years ago
Wilber 39546aa2f3
Add pass compatible and unit test. (#27377)
5 years ago
Leo Chen bbc84e0fe0
Refine error msg in paddle/fluid/framework/details [part 1] (#25631)
5 years ago
tangwei12 99626502f7
【paddle.fleet】gloo and util (#27213)
5 years ago
yaoxuefeng d726fd5e86
enhance dataset err msg (#27363)
5 years ago
Pei Yang fd7ab4e63c
register pass compatibility (#27357)
5 years ago
haozech 7e6dfcf9b2
Add 3 pass version check (#27283)
5 years ago
Shang Zhizhou 3c11717988
add op version checker to ir passes (#27329)
5 years ago
lilong12 9f9d15e285
fix the bug of non-exit, test=develop (#27350)
5 years ago
ShenLiang 54b81fa32c
add adaptivelsgd in meta_optimizer (#27289)
5 years ago
Chen Weihang 4f9d6529fe
Polish framework error message part 7 (#27266)
5 years ago
Wilber f827665ae6
[Pass Compatible] Bind python compatible. (#27262)
5 years ago
Chen Weihang dafb0e3bb7
Polish framework error message part 6 (#27257)
5 years ago
joanna.wozna.intel 1483ea2304
Add bfloat16 passes (#26999)
5 years ago
Chen Weihang 79149c8ee6
polish framework error message part 8 (#27269)
5 years ago
ShenLiang 2b6a5793fe
remove auto mode from localsgd optimizer (#27237)
5 years ago
JZ-LIANG 5d039f4086
modified the implement of Lars optimizer (#26733)
5 years ago
WeiXin 13804ed80c
Error msg/polish tensor error msg (#26976)
5 years ago
Pei Yang 5fb8c92054
fix multihead matmul shared params (#27121)
5 years ago
yaoxuefeng 7f3e6ca596
add cuda generator (#26786)
5 years ago
Feiyu Chan c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatability (#26985)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
Shang Zhizhou 61fc7a3e45
Pass version check (#26887)
5 years ago
wanghuancoder 2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643)
5 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
Adam 8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779)
5 years ago
Leo Chen 844583c8fd
Refine paddle.manual_seed (#26496)
5 years ago
Pei Yang e3f8e5cf5c
trt int8 support conv2d_transpose (#26636)
5 years ago
zhangchunle 623a4c2e56
fix ci coverage build error (#26761)
5 years ago
joanna.wozna.intel eb097d64f6
Fix int8 performace drop cpu_quantize_placement_pass (#26715)
5 years ago
Wilber 1c898b66d6
add bug fix enum. (#26736)
5 years ago
Zhou Wei 8071d23073
fix bug that can't print int8_t (#26712)
5 years ago
Adam Osewski c2c689582e
Update Paddle-Lite commit hash. (#26413)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
joanna.wozna.intel 559e43eee4
Small change in conv2d and quantize pass (#26671)
5 years ago
石晓伟 32ceacf317
update op_version_registry, test=develop (#26644)
5 years ago
Dong Daxiang 08d736ad78
【paddle.fleet】add cudnn related strategies to DistributedStrategy (#26598)
5 years ago
wanghuancoder c1f5df5269
optimized transformation form tensor to numpy (#26447)
5 years ago
石晓伟 fa08a834be
update op_version_registry, test=develop (#26592)
5 years ago
石晓伟 656e60b18f
new class: op_version_registry, test=develop (#26542)
5 years ago
Jack Zhou 199b0c7c1b
Add isfinite v2 op (#26344)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
Chen Weihang 9108282883
Polish framework error message part 5 (#26204)
5 years ago
Pei Yang b757466b0d
fix trt dynamic ernie serialization unit test (#26228)
5 years ago
Wilber 3ec0bcbbb8
[Bug] Fix prune for save_inference_model about transformer (#25347)
5 years ago
cc 3f816bc8b4
[Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639)
5 years ago
Thunderbrook a83e0f264c
fix heter proto (#26093)
5 years ago
yaoxuefeng 23261ff44b
add cpu random Generator (#26013)
5 years ago
Zhou Wei 6de463d3d1
expose and unify the Tensor concepts to the user (#25978)
5 years ago
Dong Daxiang 50a5bcfc9d
【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. (#26186)
5 years ago
Leo Chen ffe52b4452
[OpDevOptimize] Add common infershape functions (#26096)
5 years ago
Chen Weihang 838e36e9ed
Fix loaded variable suffix repeat error (#26169)
5 years ago
JZ-LIANG 54003b873e
【paddle.fleet】add lamb to fleet meta optimizer (#26025)
5 years ago
Yiqun Liu 1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121)
5 years ago
MRXLT 6559229b7e
fix encryption infer (#25979)
5 years ago
tangwei12 c14ec8782b
【paddle.fleet】Feature/fleet ps api 2.0 (#25857)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
tangwei12 3755564ae1
Fix/large scale fix (#25999)
5 years ago
Thunderbrook fd2947babf
fix compile error with mkl (#26030)
5 years ago
Leo Chen 0a47387bd8
Use static local variable instead of global variable for safty (#26018)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
WangXi 2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer (#25738)
5 years ago
WangXi a6c87fd091
Add amp to fleet meta optimizer, test=develop (#25770)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
JZ-LIANG 8ebffc78c9
add lars to fleet meta optimizer (#25884)
5 years ago
Dong Daxiang 8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies (#25844)
5 years ago
Chen Weihang 48b9a56f1c
Polish framework error message - part 4 (#25807)
5 years ago
lilong12 5f30e57cdd
fix test_pipeline, test=develop (#25808)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
Dong Daxiang a96d54ac19
Generate final strategy (#25782)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Dong Daxiang 920d998f1e
add more settings for distributed strategy (#25685)
5 years ago
Chen Weihang c34c80d302
Polish framework error message part3 (#25701)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago
Chen Weihang 364cc53618
Polish paddle fluid framework error message - part2 (#25667)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
wanghuancoder 1917b38099
fix some errmsg report,in framework/ir/, about 21 files (#25525)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Chen Weihang dfb3ae1b9b
Polish some error message in framework holder - Part 1 (#25509)
5 years ago
wanghuancoder 9b46fe0440
fix some errmsg report,in framework/ir/, about 5 files (#25539)
5 years ago
Dong Daxiang e657d7062d
fleet base initial implementation and the API (#25442)
5 years ago
wanghuancoder e65c5b8e83
fix some errmsg report, in framework/ir/ (#25471)
5 years ago
wanghuancoder 6c0982b942
fix some errmsg report, in framework/ir/mkldnn (#25467)
5 years ago
wanghuancoder fce6466217
fix some errmsg report, in framework/ir/ subdir(memory,optimizer,multi_device) (#25460)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
Chen Weihang 4061aa6488
Polish ParallelExecutor exception process logic (#25449)
5 years ago
lilong12 e39aa70ec7
add the support for pipeline (#24560)
5 years ago
hong 70d7d07fea
catch bad alloc exception (#25140)
5 years ago
gongweibao 80f1c50738
Fix typo in interface. (#24779)
5 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
5 years ago
Dong Daxiang d5e40d1ba9
Paddle fleet distributed strategy (#25379)
5 years ago
Wojciech Uss d0a921ba98
Quant2 updates and fixes (#25313)
5 years ago
Jacek Czaja 17c751bec6
[oneDNN] Fix to #25078 (#25256)
5 years ago
MRXLT 3b8f0a64c2
Encryption infer (#25119)
5 years ago
Sylwester Fraczek 36abeff44f
adding elementwiseadd quantization (#25178)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Leo Chen 25a4dac4c2
Use allow list instead of white list (#25002)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
Sylwester Fraczek 53d563a0fe
Reshape transpose matmul coverage (#24970)
5 years ago
Sylwester Fraczek a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free (#24929)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
lilong12 29de0d97a5
add the support to specify device index for device_guard (#24555)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
Chen Weihang 6aae034f8d
add dep for fs.cc, test=develop, test=document_fix (#24881)
5 years ago
Jacek Czaja 40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor (#24756)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
123malin 9d2bd0ac38
downpour_worker增加try_catch机制,打印program所有参数 (#24700)
5 years ago
Michał Gallus b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail (#24853)
5 years ago
Wojciech Uss 78d4f0cc91
add option to exclude ops by id from quantization (#24689)
5 years ago
Yanghello 62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed (#24816)
5 years ago
Yanghello 5a7a517cde
Add crypto api (#24694)
5 years ago
Chen Weihang 19e5f7879c
Append error op hint for GradOpMaker (#24750)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Wilber ba2f8f0ce4
fix embedding_eltwise_layernorm_fuse_pass. test=develop (#24592)
5 years ago
hutuxian 0ec3a42e97
Random Dump (#24477)
5 years ago
Yiqun Liu 6b464f969a
Add an operator node in unittest to make the fusing result unique. (#24617)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
Wojciech Uss db052009c7
Enabled quantize all and skip missing in QAT (#24281)
5 years ago
Huihuang Zheng 8a1a2af82e
Add Assert Op (#24280)
5 years ago
joanna.wozna.intel 356f5ee220
[Refactoring] Unify op-dequant squashes (#24277)
5 years ago
liym27 ac9a7eeea4
[Dy2Stat]Support list pop (#24250)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
wawltor d1e1d85881
add the graph batch reader for pslib mode (#24178)
5 years ago
joanna.wozna.intel b43b46e619
[INT8] Add requant-op squash (#24143)
5 years ago
hutuxian 3e2bc8715f
Try to fix UT Random Fail (#24223)
5 years ago
Sylwester Fraczek e1a7a88057
added reshape transpose matmul fuse pass (#23754)
5 years ago
Chen Weihang 9b851ba216
[dy2static] Add print transformer and unify print format (#24068)
5 years ago
wangchaochaohu fa43d74a3a
fix the intermediate node of graph for fusion group test=develop (#24184)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
liuwei1031 9a93f6aae0
improve efficiency of runtime InferVarType (#22778)
5 years ago
wangchaochaohu 2270864019
Fusion group optimize for cuda codegen(#23940)
5 years ago
ShenLiang 94dfb7d770
opt the postprocess, test=develop (#24155)
5 years ago
Jacek Czaja eb411613e9
[DNNL] refine activations Inplace support (#24145)
5 years ago
Jacek Czaja 461e6a01ec
[DNNL] activations Inplace support (#24123)
5 years ago
Zhang Ting fb0729ee7f
avoid warnings in MAC compile (#24124)
5 years ago
arlesniak d31a174f51
added fusing matmul-transpose-reshape pass (#23866)
5 years ago
Zeng Jinle a67eea9f00
polish code by adding final, test=develop, test=develop (#24114)
5 years ago
Zeng Jinle acef55df04
fix isolated var fetch bug, test=develop (#24070)
5 years ago
Jacek Czaja c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace (#23477)
5 years ago
hutuxian 9ff558a46f
Optimize DataFeed (#23957)
5 years ago
Zhou Wei 7817003795
Optimize the error messages of paddle CUDA API (#23816)
5 years ago
ShenLiang 7f0b2c7407
fix memory leaking problem of dataset, test=develop (#23955)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
Yiqun Liu 071a702060
Fix the error misjudgment when there are control nodes in graph. (#23943)
5 years ago
hutuxian df64a96686
support set_test_mode and set comlog level(#23905)
5 years ago
Zhang Ting b88662254b
use 32 bit index to improve expand op (#23899)
5 years ago
yiicy a1e7387919
Variable error message enhancement, test=develop (#23548)
5 years ago
yaoxuefeng 5b69242fab
modify datanorm op test=develop (#23030)
5 years ago
Zeng Jinle c49791362f
Correct reader device index (#23802)
5 years ago