Commit Graph

25186 Commits (63dd3183bcb89dbf1bf0501c70f3ded69a6feb09)
Author SHA1 Message Date
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
5 years ago
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
5 years ago
tangwei12 6a1db2044c
fix sync_with_distributed_lookup_table, test=develop (#19737)
5 years ago
tianshuo78520a 38f1c2fe28
change approve site (#19791)
5 years ago
Adam d4413a54bc
Add common CreateKey for mkldnn handlers (#19767)
5 years ago
Yihua Xu 0d6ea52958
Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774)
5 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
5 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
5 years ago
JesseyXujin e352467c1c
modify activation op API, delete use_cudnn args, test=develop, (#19758)
5 years ago
Jacek Czaja 9e4c958552
Refactoring activation mkldnn op (#19748)
5 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
5 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
5 years ago
chengduo c308c88d71
Fix test_parallel_executor_test_while_train (#19723)
5 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
5 years ago
chengduo e506c99c20
Open fuse broadcast option (#18833)
5 years ago
Jacek Czaja 47f670d58c
- Softmax mkl-dnn refactoring (#19615)
5 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
5 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
5 years ago
chengduo 5866a7a5fe
Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
5 years ago
Youwei Song 3e5fb6361b
fix api-doc error for dygraph and backward (#19721)
5 years ago
Thunderbrook bda92434db
remove trainer desc test in windows temporarily (#19753)
5 years ago
Youwei Song 810a33c8bd
fix dygraph partitial backward problem (#19625)
5 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
5 years ago
Bai Yifan aa63d5ac6d
Make the distillation process not save teacher variables in PaddleSlim (#19633)
5 years ago
Zeng Jinle bb4f8dee83
add logs to left var memory size, test=develop (#19722)
5 years ago
Adam 428b2b9e17
MKLDNN handler cleanup (#19713)
5 years ago
chengduo 2c30e64b2f
increase timelimit test_pe_serexnext (#19702)
5 years ago
XiaoguangHu 27235cf222
Add document annotations for FLAGS that need to be open to external developers test=develop (#19692)
5 years ago
Zeng Jinle 1c25c88aba
refine memory usage of some operators, test=develop (#19700)
5 years ago
baojun 87f13f7569
upgrade ngraph to support mkldnn v1.0 (#19689)
5 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
5 years ago
yaoxuefeng c6756ed225
fix instag op (#19591)
5 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
5 years ago
Zeng Jinle 713c05dd60
refine tensor.mutable_data, test=develop (#19680)
5 years ago
Chen Weihang c78a4781bf
Fix train error when test_program.clone is executed after optimizer.minimize (#19397)
5 years ago
zhongpu 5f627488db
add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
5 years ago
Zeng Jinle a7691603a5
add gpu_allocator_try_time config, test=develop (#19675)
5 years ago
JesseyXujin 0b06db9413
delete transmission args in linear_chain_crf op (#19619)
5 years ago
tianshuo78520a fbc63a1f1f
Change Dianhai GithubID to fit new code merge rules(#19674)
5 years ago
Tao Luo f05d2c519d
paddle::framework::vectorize() templatization [PART3] (#19643)
5 years ago
XiaoguangHu 103d09169d
Update README_cn.md to latest version v1.5.2
5 years ago
XiaoguangHu c50b0d6b79
Update README.md to latest version v1.5.2
5 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
5 years ago
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
5 years ago
Zeng Jinle 2db40d9f60
reduce thread num of retry_allocator_test,test=develop (#19638)
5 years ago
wangchaochaohu 4440d7ced0
test=develop cuda realization of label smooth op (#19175)
5 years ago
chengduo 31c5a5ee26
Remove linear_chain_crf_op.cu (#19645)
5 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
5 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
5 years ago
Aurelius84 25c0eb28cc
fix unittest random failed test=develop (#19654)
5 years ago