Commit Graph

10569 Commits (e2d01eb650dba6267046c1cfd6e64cf8cfd74267)

Author SHA1 Message Date
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
5 years ago
lilong12 7e5e9934fe
update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020)
5 years ago
Zhou Wei e668cb07fb
fix CUDA 11 error on windows (#29101)
5 years ago
Jack Zhou 085260f3de
Add eigen gru and fix the dropout bug in the rnn
5 years ago
yaoxuefeng 545df287fc
add user_define_dump (#28596)
5 years ago
arlesniak bc902044a4
Fixes mkldnn dygraph learning rate scheduler crashes (#28988)
5 years ago
Shang Zhizhou b9e76a0103
detect tensorRT plugin fp16 in runtime (#27933)
5 years ago
Leo Chen fd3fcb051a
fix typo of flag name (#29154)
5 years ago
Noel da71173bc9
Fix ops doc for some ops
5 years ago
Leo Chen 770395cb93
Split train_mode and has_grad for tracer (#29064)
5 years ago
Aurelius84 7ae3cb554a
Polish CUDA Information stdout (#29109)
5 years ago
WangXi 173c22aec2
optimize fast graph executor (#28962)
5 years ago
Shang Zhizhou 562ded1041
fix unittest trt_dynamic_shape_transformer_prune_test error (#29122)
5 years ago
Shibo Tao db41258501
add API serialize_program, serialize_persistables, save_to_file, deserialize_program, deserialize_persistables, load_from_file. (#29034)
5 years ago
joanna.wozna.intel b0d1ac161e
Add bf16 pool2d and unify bf16 unit tests (#29039)
5 years ago
joanna.wozna.intel fddea67445
Fix cpu_bfloat16_pass (#28730)
5 years ago
Qi Li 2fd16cf6fc
fix win ci failure, test=develop (#29089)
5 years ago
Chen Weihang fea0e294ee
Hide the C++ stack by default and add hints (#29042)
5 years ago
joejiong 582c0a0468
add uint8 for reshape op (#28996)
5 years ago
Zhou Wei 8ca0a8a859
fix tensor detach to zero copy (#27921)
5 years ago
taixiurong a5aa4dc7a9
add xpu elementwise ops (#29031)
5 years ago
joejiong b04c78ef5e
Update pow (#29000)
5 years ago
wawltor b2c8a00745
remove eigen threadpool for the speed up
5 years ago
Wojciech Uss 7b5a8e46de
Add multi_gru_fuse_pass and tests (#28601)
5 years ago
lilong12 767d0ba267
update, test=develop (#28700)
5 years ago
Wojciech Uss 991345b368
Add multi_gru_seq_fuse_pass and tests (#28604)
5 years ago
123malin fbf9564f6b
【paddle.distributed.fleet】Optimize ParameterServer's Async Mode (#28442)
5 years ago
lilong12 f77a78cdee
enable pipeline to run with Executor.run() (#28373)
5 years ago
Thunderbrook 0073f9bdb0
support ps-gpu (#28752)
5 years ago
Chen Weihang 768dab441e
polish two api doc detail, test=document_fix (#28971)
5 years ago
furnace 8ff3550658
refactor momentum op to combine weight (#27414)
5 years ago
Jacek Czaja bd1d6d3b30
extends oneDNN caching keys so caching objects are unique to executor/predictor (#28758)
5 years ago
Pei Yang 994673bf4f
change avg pooling and global pooling to trt layer in dynamic shape mode (#28702)
5 years ago
yaoxuefeng 71c1cd1408
fix truncated_gaussian seed (#28777)
5 years ago
HappyAngel de528981e5
fix paddlepredictor build error. test=develop (#28792)
5 years ago
Wilber a22ea652cf
fix trt delete_pass bug. (#28763)
5 years ago
gongweibao 1dad8ceaab
Fix gpu memory allocation bug. (#28703)
5 years ago
Chen Weihang b969c32ab1
fix occupied 0 device memory bug (#28771)
5 years ago
joejiong 1a532d5133
add uint8 support for squeeze operator (#28734)
5 years ago
wangchaochaohu 8b853b3030
fix the number of perf algo for conv cudnn in exhaustive mode (#28694)
5 years ago
joanna.wozna.intel 8c0ea4bffe
Add bf16 matmul, fc, elementwise add and mul (#28729)
5 years ago
Wojciech Uss efc3b182f0
a fix for the fc_lstm_fuse_pass (#28709)
5 years ago
Zhou Wei 3b0dd5f620
fix bug that to_tensor not support paddle.Place (#28717)
5 years ago
yaoxuefeng 08b62f4902
fix shuffle batch op shuffle (#28533)
5 years ago
taixiurong d3d1a6b6e0
add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun (#28542)
5 years ago
Jack Zhou 9362d85e0e
Add LSTM, Simple RNN and GRU CPU kernel (#28577)
5 years ago
QingshuChen 30ef3815b3
adjust kunlun header file (#28536)
5 years ago
Zhang Ting dab4920568
improve performance of cast op (#28727)
5 years ago
yaoxuefeng 03f46e3526
fix truncated_gaussian op cuda seed setting (#28678)
5 years ago
Wilber 04cefeacc5
Disable windows gpu static lib. (#28741)
5 years ago