Commit Graph

8735 Commits (43c9561e9a383d08534a84d01808ff61d1e261e2)

Author SHA1 Message Date
Zeng Jinle 4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
6 years ago
Tao Luo 8b62f53719
fix nn.py lack of `import warnings` (#17249)
6 years ago
baojun e782b54b9c update sofmax with axis arg test=develop (#17190)
6 years ago
tensor-tang 71f0c6d5bd
fix api doc of hash, relu, concat, argmin, argmax, argsoft and all activations (#17235)
6 years ago
Tao Luo ff1661f12a
remove unused FLAGS_warpctc_dir (#17162)
6 years ago
Kaipeng Deng a71d8fdb87
Softmax_cross_entropy op add axis (#16806)
6 years ago
Zhen Wang a40121e4c8
fix the initialization process error. test=develop (#17213)
6 years ago
Zhen Wang a914d9b116
Quant output scale (#17215)
6 years ago
Tao Luo 16922e0093
fix api_example of tree_conv (#17239)
6 years ago
jerrywgz ef66baedc0
Refine api doc (#17230)
6 years ago
Yibing Liu 6b0f27e802
Fix some APIs' example (#17214)
6 years ago
ruri 5817077c99
Fix unexecutable API examples (#17218)
6 years ago
jerrywgz cc95a7516c
fix distribute fpn proposals, test=develop (#16152)
6 years ago
Tao Luo 9ec4615deb
fix profiler and name_scope API examples (#17212)
6 years ago
wopeizl 06325c4de1
disable the test_distillation_strategy temporarily (#17227)
6 years ago
Zeng Jinle ee2028a110
Add use_cuda to inplace pass (#17205)
6 years ago
chengduo f2db475a3e
update ParallelExecutor (#17204)
6 years ago
jerrywgz a72907bbf4
Enhance concat op to support empty input. (#17015)
6 years ago
wopeizl 83c4f7721f
use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
tianshuo78520a 8092c40560 Modify test timeout (#17181)
6 years ago
guru4elephant f938ccec62
remove async executor python api to fix document (#17174)
6 years ago
Zeng Jinle 5dfe2ab9e8
Fix mem leak when converting Tensor to numpy array (#17182)
6 years ago
Zeng Jinle 4e1bc6e805
Rewrite inplace pass and fix gc bug (#17126)
6 years ago
Lfc1993 aa5307ce5b fix python3 run_time_error in ops. test=develop (#17170)
6 years ago
guomingz e4a52e082f resolve #17159 (#17172)
6 years ago
xiaoting bc48453b73 polish the label_smooth (#17138)
6 years ago
Lfc1993 626922d3cd fix run_time_error in uniform_random. test=develop (#17152)
6 years ago
tangwei12 deb510d451
cvm op feature (#17081)
6 years ago
Jiancheng Li 554d3a71d2 test=develop fix bug: fix selected_indices in nms (#17140)
6 years ago
Zeng Jinle 28d69d710a
Refine dropout gpu memory (#17095)
6 years ago
chengduo 9ccce576d6
fix test_weight_decay (#17109)
6 years ago
xiaoting 7da7881c0e Detailed coordinate description for yolov3 loss (#17007)
6 years ago
tangwei12 7330cd639c
truncated_gaussian_random supported in distributed training, test=develop (#17091)
6 years ago
ceci3 258e000be6
test=develop, double backward leaky_relu (#17067)
6 years ago
Kaipeng Deng 10c487eb21
fix interpolate cu. test=develop (#17101)
6 years ago
whs 55ce36e981
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
6 years ago
Yibing Liu beda78258f
Init mixed precision training interface (#16856)
6 years ago
Yan Xu 0b07eef118
ParallelDyGraph with GPU collective mode (#16827)
6 years ago
tangwei12 1a4a51db2b
Fleet unify distributed training (#16791)
6 years ago
tangwei12 e707119a89
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)
6 years ago
Zhen Wang b8c166f632
Merge pull request #17029 from wzzju/add_graph_checkpoint
6 years ago
guomingz 2deac4e447 Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing (#17058)
6 years ago
chengduo a2be4b4d91
Add fuse momenutum ops (#16745)
6 years ago
tink2123 5e216fcf1a fix split
6 years ago
chengduo e296e0fead
fix test_parallel_executor_seresnet random fail (#17030)
6 years ago
Tao Luo b3a11943c1
Merge pull request #17031 from luotao1/reduce_test_time
6 years ago
qingqing01 c1c2633a63
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)
6 years ago
Zeng Jinle f188b3708e
Move gc test to each test of op (#16999)
6 years ago
chengduo 7c370e42f9
Fix test_recurrent_op (#17001)
6 years ago
Tao Luo 9466e956a7 reduce unittest time by rename testcuda to has_cuda
6 years ago
Zhen Wang bf6b9d6d37 add checkpoint functions for graph. test=develop
6 years ago
wopeizl d9991dccdd
add parallel build script to ci … (#16901)
6 years ago
Zeng Jinle 24923f7604
fix py_reader demo (#16997)
6 years ago
qingqing01 ea42e431f8
Speed unit testing. (#16978)
6 years ago
guomingz ae7a2cb8e3 resolve #16988 (#16995)
6 years ago
liuwei1031 765c70a1b0
Unittest improve, test=develop (#16941)
6 years ago
guomingz 23df084b32 resolve #16987 (#16994)
6 years ago
Zeng Jinle 1202d3fc74
Refine model gpu memory (#16993)
6 years ago
Zeng Jinle af8a041bb6
reduce py_reader unittest time (#16996)
6 years ago
Wang, Chuanqi 45ebafc33e Split test_calibration test to two tests (#16977)
6 years ago
Yibing Liu 3c375751f8
Support seq len equal to 0 in sequence ops (#16935)
6 years ago
lujun 9d2f7d762c fix dy-load bug, test=develop
6 years ago
lujun 94c4cd1a98 fix dy-load bug, test=develop
6 years ago
lujun a3f17280a3 fix dy-load bug, test=develop
6 years ago
Yibing Liu 36c05d36ab
Check some shapes only in runtime (#16919)
6 years ago
whs 486f7d8ed6
Restore quantization and distillation stategy before loading persistables. (#16958)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
lujun dbf66dd034
Merge pull request #16954 from junjun315/fix-dygraph-checkpoint
6 years ago
Tao Luo aa9caa1691
Merge pull request #16951 from luotao1/reduce_ci_time
6 years ago
Tao Luo aed702cea3
Merge pull request #16920 from qingqing01/test_profile
6 years ago
Tao Luo b596eed73a
Merge pull request #16824 from LeoZhao-Intel/mkldnn_mul
6 years ago
lujun 3beed54cdd
Merge pull request #16917 from velconia/dygraph_untrack_op
6 years ago
lujun a7c11979ba fix dygraph save/load checkpoint error, test=develop
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
tangwei12 2b61db07d1
fix sampling id op bug (#16909)
6 years ago
gongweibao b7f20ed6af
Fix unittest dataset error (#16925)
6 years ago
Hongyu Liu d5a7c09856
Merge pull request #16798 from phlrain/softmax_cross_support_high_rank
6 years ago
Dang Qingqing b73a71d11e Fix test_profiler when the machine has many cores
6 years ago
guru4elephant 03a7ec55cf
Merge pull request #16746 from xjqbest/dataset_merge_develop
6 years ago
Kaipeng Deng 5d45eb06f9
Merge pull request #16858 from heavengate/fix_yolo_param
6 years ago
minqiyang 97aa1838bc Fix dygraph train mode
6 years ago
Qiyang Min 102fc8596e
Merge pull request #16777 from velconia/dygraph_untrack_op
6 years ago
Leo Zhao 1edcd73115 remove unnecessary new line
6 years ago
Leo Zhao 61cc842a53 disable test_elementwise_mul_mkldnn_op case
6 years ago
xiaoting ccc3bd70c1 polish doc for uniform_random and multi_box_head (#16864)
6 years ago
Hongyu Liu 0701c2db47
Merge pull request #16518 from zhoukunsheng/rsqrt
6 years ago
Hongyu Liu bbcfa8ffb2
Merge pull request #16493 from zhoukunsheng/zeros_like
6 years ago
phlrain d722841622 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
zhoukunsheng f9223c5fa9 Logical compare (#16513)
6 years ago
wopeizl 0114f1d65a
give the output value to keep consistant with document test=develop (#16740)
6 years ago
phlrain 766c868199 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into softmax_cross_support_high_rank
6 years ago
Tao Luo 485bc6a055
Merge pull request #16868 from chengduoZH/speedup_test_parallel_executor_transformer
6 years ago
Tao Luo d4b5510c00
Merge pull request #16860 from junjun315/fix-utest-vgg
6 years ago
liuwei1031 072db0938b
optimize lstmp and sample_logits op, test=develop (#16845)
6 years ago
Jiabin Yang 84b7a7291e test=develop, fix hsigmoid dereference nullptr (#16769)
6 years ago
minqiyang 9a3848a2ea Fix attrs
6 years ago
Hongyu Liu 2de7f3cfc3
Merge pull request #16799 from phlrain/sigmoid_corss_entropy_support_high_rank
6 years ago
chengduozh 3349094fe2 reduce the layer number of transfromer
6 years ago
Qiyang Min 590af88c79
Merge pull request #16742 from velconia/imperative_fix_growing_dict
6 years ago
minqiyang 73cbdc2998 Add train mode
6 years ago