Commit Graph

25032 Commits (d4413a54bc95e80d54403fd5c48261ca7313d125)
 

Author SHA1 Message Date
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
5 years ago
Yihua Xu 0d6ea52958 Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774)
5 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
5 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
5 years ago
JesseyXujin e352467c1c
modify activation op API, delete use_cudnn args, test=develop, (#19758)
5 years ago
Jacek Czaja 9e4c958552 Refactoring activation mkldnn op (#19748)
5 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
5 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
5 years ago
chengduo c308c88d71
Fix test_parallel_executor_test_while_train (#19723)
5 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
5 years ago
chengduo e506c99c20
Open fuse broadcast option (#18833)
5 years ago
Jacek Czaja 47f670d58c - Softmax mkl-dnn refactoring (#19615)
5 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
chengduo 5866a7a5fe
Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
6 years ago
Youwei Song 3e5fb6361b fix api-doc error for dygraph and backward (#19721)
6 years ago
Thunderbrook bda92434db remove trainer desc test in windows temporarily (#19753)
6 years ago
Youwei Song 810a33c8bd fix dygraph partitial backward problem (#19625)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Bai Yifan aa63d5ac6d Make the distillation process not save teacher variables in PaddleSlim (#19633)
6 years ago
Zeng Jinle bb4f8dee83
add logs to left var memory size, test=develop (#19722)
6 years ago
Adam 428b2b9e17 MKLDNN handler cleanup (#19713)
6 years ago
chengduo 2c30e64b2f
increase timelimit test_pe_serexnext (#19702)
6 years ago
XiaoguangHu 27235cf222
Add document annotations for FLAGS that need to be open to external developers test=develop (#19692)
6 years ago
Zeng Jinle 1c25c88aba
refine memory usage of some operators, test=develop (#19700)
6 years ago
baojun 87f13f7569 upgrade ngraph to support mkldnn v1.0 (#19689)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
yaoxuefeng c6756ed225 fix instag op (#19591)
6 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
6 years ago
Zeng Jinle 713c05dd60
refine tensor.mutable_data, test=develop (#19680)
6 years ago
Chen Weihang c78a4781bf
Fix train error when test_program.clone is executed after optimizer.minimize (#19397)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Zeng Jinle a7691603a5
add gpu_allocator_try_time config, test=develop (#19675)
6 years ago
JesseyXujin 0b06db9413
delete transmission args in linear_chain_crf op (#19619)
6 years ago
tianshuo78520a fbc63a1f1f Change Dianhai GithubID to fit new code merge rules(#19674)
6 years ago
Tao Luo f05d2c519d paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
XiaoguangHu 103d09169d
Update README_cn.md to latest version v1.5.2
6 years ago
XiaoguangHu c50b0d6b79
Update README.md to latest version v1.5.2
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
6 years ago
Zeng Jinle 2db40d9f60
reduce thread num of retry_allocator_test,test=develop (#19638)
6 years ago
wangchaochaohu 4440d7ced0
test=develop cuda realization of label smooth op (#19175)
6 years ago
chengduo 31c5a5ee26 Remove linear_chain_crf_op.cu (#19645)
6 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
6 years ago
Aurelius84 25c0eb28cc
fix unittest random failed test=develop (#19654)
6 years ago
chengduo 5c4eb394e9 Make test_pe_seresnext serial (#19634)
6 years ago
chengduo ac34aac33f Disable the test_parallel_executor_test_while_train temporarily (#19641)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
Jie Fang c6a598a276 init new amp, optimize inserting cast op for batchnorm (#18596)
6 years ago