Commit Graph

5381 Commits (8d88c5a87d2e2485b7a7f8714e874f9c69c0620a)

Author SHA1 Message Date
Qiao Longfei 1870262ba9 pserver should crash early whe has problem
6 years ago
dengkaipeng a81fabd327 fix doc errors. test=develop
6 years ago
dengkaipeng cf06e50f1d add doc for adaptive pool. test=develop
6 years ago
dengkaipeng 266c6856c9 add adaptive pool 2d & 3d. test=develop
6 years ago
dengkaipeng eab4745965 add adaptive mode for pool.
6 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
6 years ago
Qiao Longfei 1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
6 years ago
nhzlx 644c13a387 fix compile error
6 years ago
sneaxiy 7923042365 merge develop
6 years ago
Qiao Longfei 0a7c7e97af test zero output of split_selected_rows_op
6 years ago
Tao Luo 3243b45bb2
Merge pull request #14651 from luotao1/has_attr
6 years ago
Xin Pan 1735022a1b fix clang
6 years ago
heqiaozhi 016a068775 stop server
7 years ago
zhang wenhui c4c5f0b8ca
Merge pull request #14771 from frankwhzhang/bpr
7 years ago
Tao Luo 067ed70f2d add HasProtoAttr function in op_desc.h, clean node.h
7 years ago
heqiaozhi 8e3fe2d735 add skip op
7 years ago
Yu Yang b22d638d8f Speed up SizeOfType
7 years ago
Xin Pan 4a716a70c6
Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative
7 years ago
nhzlx a5bfed3776 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
7 years ago
nhzlx afc51e6f82 add benchmark for trt
7 years ago
Zhaolong Xing bc6d0a3427
Merge pull request #14762 from NHZlX/fix_bug_of_trt_pool
7 years ago
superjomn edd1f5a92b fix visualizer
7 years ago
Brian Liu 9623b45f40 Remove unnecessary MKLDNN reorder (#14799)
7 years ago
frankwhzhang 90c7f9870e fix 'name', test=develop
7 years ago
Qiao Longfei abf140289f split selected rows op should always init output selected rows
7 years ago
nhzlx 019e8bbed2 fix comments test=develop
7 years ago
frankwhzhang 271c480822 update API, test=develop
7 years ago
frankwhzhang c9a653820b fix label_pos ,add test_layers.py, test=develop
7 years ago
Tao Luo e99597d35c Merge branch 'develop' into luotao1-has_attr
7 years ago
sneaxiy 66182abda6 add cuda cudnn version check
7 years ago
Yu Yang f0c0bf328d Add gperftools supports for PE
7 years ago
frankwhzhang a672b291e5 fix code style, test=develop
7 years ago
frankwhzhang ea95f9c335 fix style bug, test=develop
7 years ago
frankwhzhang 68c2025844 fix nn.py&API.spec, test=develop
7 years ago
Xin Pan 748549b2e3 Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
7 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
7 years ago
baojun-nervana fddbd87c0a Rename argument
7 years ago
baojun-nervana 22ac2133e4 Rename class
7 years ago
baojun-nervana bfde5e10ce Move ngraph compile control to cmake
7 years ago
sneaxiy 2c6159a151 fix unittest
7 years ago
Xin Pan c049fa7cf7
Revert "Revert "Revert "Imperative"""
7 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
7 years ago
sneaxiy eb8252466b polish code
7 years ago
Tao Luo c83d5b7a16
Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
7 years ago
Zeng Jinle add98c9e7d
Merge pull request #14745 from sneaxiy/fix_eigen_deallocate
7 years ago
frankwhzhang f4cc5881b0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
frankwhzhang 97de98cd0a update bpr_loss op code, test=develop
7 years ago
Xin Pan 6c80bb3ce9
Merge pull request #14786 from PaddlePaddle/revert-14782-revert-14398-imperative
7 years ago
heqiaozhi 575ae7c6c3 refine pslib inferface & fix some bugs
7 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
7 years ago
Yihua Xu 240d974ac5 Clean Code
7 years ago
Tao Luo 54fcafb5f6
Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
Xin Pan 2538ef64f1
Revert "Revert "Imperative""
7 years ago
guru4elephant b82a44ea85
Merge pull request #14778 from wangguibao/async_executor_bugfix
7 years ago
sneaxiy 8095fb5e68 fix code bug in CPU compilation
7 years ago
sneaxiy 387bac46b5 refine code
7 years ago
Tao Luo cf66133857
Merge pull request #14734 from luotao1/memory_load
7 years ago
Yihua Xu 155328a488 Clean Code
7 years ago
Xin Pan 6217f42ab7
Revert "Imperative"
7 years ago
Tao Luo 743cb840f1 update with comments
7 years ago
wangguibao 5a2cd4505b AsyncExecutor bugfix: Tensor to LoDTensor
7 years ago
wangguibao 5f98d80039 AsyncExecutor bugfix: Tensor change to LoDTensor
7 years ago
flame f6a877bc57 add tool to visualize inference model (#14621)
7 years ago
frankwhzhang 93551a3440 update API.spec
7 years ago
Tao Luo 42359e88a4 clean code
7 years ago
Tao Luo 923b18877e Merge branch 'develop' into memory_load
7 years ago
Tao Luo 405b2486db support loading from memory
7 years ago
Xin Pan b52f5d2870
Merge pull request #14398 from panyx0718/imperative
7 years ago
frankwhzhang 272f3d3111 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
frankwhzhang 570d89ec84 add bpr_loss operator , test=develop
7 years ago
qingqing01 549f165b59
Speed conv_fusion_op for identity activation. (#14744)
7 years ago
Houjiang Chen c6b39a0099
Merge pull request #14714 from NHZlX/add_prelu_gpu
7 years ago
tensor-tang dbe451976b
Merge pull request #14753 from tensor-tang/refine/namespace
7 years ago
sneaxiy 0f96c2e80f fix thread-safety bug
7 years ago
nhzlx 722b0a805f fix bug of trt pool
7 years ago
Jiabin Yang d9bb55a1f9
Merge pull request #14756 from JiabinYang/fix_hs_op
7 years ago
Yihua Xu 65dbc7cca4
Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
JiabinYang e05e1d7d88 fix bug in dist train on hs, test=develop
7 years ago
tensor-tang 4a93db9288 remove jit namespace
7 years ago
Hongyu Liu 8cda28f345
Merge pull request #14733 from phlrain/add_cudnn_5_support
7 years ago
heqiaozhi d3ca359e44 config init & adapt to interface
7 years ago
Xin Pan 73b4d1aa72
Merge pull request #14742 from panyx0718/infer2
7 years ago
Jiabin Yang 21c0f8749e
Merge pull request #14728 from JiabinYang/optimize_hs_op
7 years ago
Xin Pan 82d68281c0 follow comments
7 years ago
sneaxiy 900765224c fix deallocate bug
7 years ago
liuhongyu b408fc4dac Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 8b2898e201 fix bug of formate; test=develop
7 years ago
Xin Pan 41c28d54c6 allow customize kernel selection
7 years ago
Xin Pan 439af8d50a
Merge pull request #14717 from panyx0718/infer
7 years ago
liuhongyu 773dc73fbf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 8daf67f90f fix bugs; test=develop
7 years ago
chengduo 04539d4c5d
Fix clip.py (#14718)
7 years ago
sneaxiy d0c8b9b9b3 remove timeout unittest
7 years ago
heqiaozhi 419506f510 refine for compile pslib.so
7 years ago
Xin Pan 052cc5f538
Merge pull request #14725 from ZongwuYang/my-cool-stuff
7 years ago
Michal Gallus 6fdbb365ce Include MKL-DNN header to concat op only when flag is set
7 years ago
Michal Gallus f2a880421e Fix style @ concat integration and tests
7 years ago
Michal Gallus 738069e491 Refactor MKL-DNN Concat
7 years ago
Michal Gallus 208f912512 Implement MKL-DNN Concat
7 years ago
Wu Yi 29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
7 years ago
liuhongyu e80402fd0e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 968dd3c078 add cudnn 5 support; test=develop
7 years ago
sneaxiy e694d0c2e4 fix while_op eager deletion bug
7 years ago
Xin Pan 461ca35be1
Merge pull request #14590 from panyx0718/fix4
7 years ago
gongweibao 50a698525d
Fix log level (#14692)
7 years ago
JiabinYang 8c75705984 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
7 years ago
Xin Pan dc458b1482
Merge pull request #14713 from panyx0718/api
7 years ago
JiabinYang b387a19410 optimize op with blas
7 years ago
Zeng Jinle ff4237309a
Merge pull request #14720 from sneaxiy/fix_seq_mask_op_infershape
7 years ago
heqiaozhi 2301abc481 cc libaray add pslib
7 years ago
ZongwuYang 1560eb4a6d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my-cool-stuff
7 years ago
ZongwuYang deb04809bd test=develop
7 years ago
Xin Pan 7c5289f68e
Merge pull request #14719 from PaddlePaddle/revert-14666-feature/estiminate_flops
7 years ago
Kaipeng Deng 934f13a70a
Merge pull request #14371 from heavengate/yolo_loss
7 years ago
sneaxiy 35a2578426 fix bug
7 years ago
sneaxiy 65867d8989 test=develop
7 years ago
zhang wenhui abbe382e1e
Revert "Add EstiminateFlops"
7 years ago
Xin Pan 0591ba96ec fix hack
7 years ago
sneaxiy 64ad051b9a merge develop
7 years ago
sneaxiy c47c451a00 fix bug
7 years ago
heqiaozhi a77fa67bbd async_thread_trainer & libmct & pslib.cmake
7 years ago
Tao Luo 3437e17713 Merge branch 'has_attr' of https://github.com/luotao1/Paddle into luotao1-has_attr
7 years ago
nhzlx e7abe6b654 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
7 years ago
nhzlx f75815b78c add prelu gpu inference
7 years ago
Xin Pan bcf36d8401 add more files to protected file list
7 years ago
Tao Luo 0e3048db43
Merge pull request #14659 from luotao1/update_pass
7 years ago
Xin Pan 7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
7 years ago
Yihua Xu 82eefceabe Add the profile_mkldnn flag for profile function(test=develop)
7 years ago
Xin Pan 35e6b5e16a polish
7 years ago
Yihua Xu ea00270fe8 Remove the dims checking when the dim is 3 (test=develop)
7 years ago
Xin Pan b80fe8264a polish
7 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
7 years ago
Tao Luo 8d6984eb9b change OpHasAttr to RuntimeHasAttr, add some comments
7 years ago
jerrywgz 96dc3d8326
Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
7 years ago
Tao Luo a6ac42669c Merge branch 'develop' into update_pass
7 years ago
Yihua Xu 669191c9cc Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu 4f71a6ee2c
Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
7 years ago
Yibing Liu c7382df80f
Print assert failure id in lookup_table_op (#14698)
7 years ago
Yu Yang 0f0e197914
Merge pull request #14666 from reyoung/feature/estiminate_flops
7 years ago
Xin Pan 93c16d9628 polish the autograd (need to verify correctness)
7 years ago
Xin Pan c3236f82d6 polish
7 years ago
Xin Pan e5d64fd4d1 initial imperative
7 years ago
Xin Pan 4d0df1fea7 add fields for autograd
7 years ago
Xin Pan 8138391631 add OpBase and unify with VarBase
7 years ago
Xin Pan f6f0692451 clean up
7 years ago
Xin Pan 0318c95149 rebase develop
7 years ago
Xin Pan aeb74af54c allow operator to run imperatively
7 years ago
Xin Pan b1f6fda5e5 run forward
7 years ago
Xin Pan a6d23083f0 some tracing
7 years ago
Xin Pan dac92e560c initial commit
7 years ago