Commit Graph

24188 Commits (266444b8afcf653b45cfdbeb8cb446561ba16ce6)
 

Author SHA1 Message Date
Leo Zhao bf4b21fa3d fix assertion failure issue when test_analyzer_bert uses ngraph (#17148)
6 years ago
Lfc1993 626922d3cd fix run_time_error in uniform_random. test=develop (#17152)
6 years ago
tangwei12 deb510d451
cvm op feature (#17081)
6 years ago
Jiancheng Li 554d3a71d2 test=develop fix bug: fix selected_indices in nms (#17140)
6 years ago
wopeizl 3acb3635c2
1. move the API check into CPU process (#17110)
6 years ago
tianshuo78520a 92ce445227 Supplementary monitoring file reason explanation (#17131)
6 years ago
Zeng Jinle 28d69d710a
Refine dropout gpu memory (#17095)
6 years ago
Huihuang Zheng b9494058b3
Use CudnnWorkspaceHandle in exhaustive search (#17082)
6 years ago
tianshuo78520a 2192e7bb61 Path flag (#17105)
6 years ago
chengduo 9ccce576d6
fix test_weight_decay (#17109)
6 years ago
xiaoting 7da7881c0e Detailed coordinate description for yolov3 loss (#17007)
6 years ago
tangwei12 7330cd639c
truncated_gaussian_random supported in distributed training, test=develop (#17091)
6 years ago
chengduo 794a195881
fix fuse optimizer ops (#17102)
6 years ago
ceci3 258e000be6
test=develop, double backward leaky_relu (#17067)
6 years ago
Kaipeng Deng 10c487eb21
fix interpolate cu. test=develop (#17101)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago
whs 55ce36e981
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
6 years ago
Zeng Jinle 842ded14b0
fix reference_count_pass,test=develop (#17060)
6 years ago
Yibing Liu beda78258f
Init mixed precision training interface (#16856)
6 years ago
Yan Xu 0b07eef118
ParallelDyGraph with GPU collective mode (#16827)
6 years ago
tangwei12 1a4a51db2b
Fleet unify distributed training (#16791)
6 years ago
tangwei12 e707119a89
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)
6 years ago
Zhen Wang b8c166f632
Merge pull request #17029 from wzzju/add_graph_checkpoint
6 years ago
guomingz 2deac4e447 Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing (#17058)
6 years ago
Tao Luo d9cd989825
Merge pull request #17048 from luotao1/fix_runtime_cache_bug
6 years ago
wopeizl f5d6937fe1
specify the cuda arch name and bin to decrease the compile time for i… (#17020)
6 years ago
XiaoguangHu f7caf7d497
Merge pull request #17063 from PaddlePaddle/shanyi15-patch-1-1
6 years ago
Cheerego fd6a1b5db3
update pip version in Readme to 1.4.1
6 years ago
chengduo cc31681687
use fast executor as default (#17044)
6 years ago
XiaoguangHu 30f2f4576e
Merge pull request #17042 from shanyi15/update_release_1.4
6 years ago
chengduo a2be4b4d91
Add fuse momenutum ops (#16745)
6 years ago
guru4elephant 03d469ad98
Merge pull request #17005 from wopeizl/fix_ncclwrapper_win1
6 years ago
tangwei12 13295d90d9
load persistables with selected rows, test=develop (#17047)
6 years ago
luotao1 490e746269 fix runtime_context_cache bug when gpu model has an op runs only on cpu
6 years ago
Zeng Jinle 0c335dcd2c
Make conv cudnn workspace size configurable (#17036)
6 years ago
jerrywgz ea3504c7ec
Merge pull request #17017 from jerrywgz/fix_potential_hung
6 years ago
Kaipeng Deng 52de7fd83e
Merge pull request #17043 from tink2123/fix_split
6 years ago
Tao Luo 620b0541d3
Merge pull request #16990 from baojun-nervana/ng_cmake
6 years ago
tink2123 5e216fcf1a fix split
6 years ago
shanyi15 b612c46516 update_release_1.4
6 years ago
chengduo e296e0fead
fix test_parallel_executor_seresnet random fail (#17030)
6 years ago
Tao Luo b3a11943c1
Merge pull request #17031 from luotao1/reduce_test_time
6 years ago
qingqing01 c1c2633a63
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)
6 years ago
lujun 63d9fe3362
Merge pull request #17034 from seiriosPlus/fix/save_for_selected_rows
6 years ago
Zeng Jinle f188b3708e
Move gc test to each test of op (#16999)
6 years ago
tangwei12 45136b1b41 fix bug in save, test=develop
6 years ago
jerrywgz 47013af0a6
Merge pull request #17011 from jerrywgz/enhance_generate_proposal_labels
6 years ago
chengduo 7c370e42f9
Fix test_recurrent_op (#17001)
6 years ago
tianshuo78520a 73a360b504 Cmakelists fix (#17018)
6 years ago
Tao Luo 9466e956a7 reduce unittest time by rename testcuda to has_cuda
6 years ago