Commit Graph

12693 Commits (01c00b07dd5739d6bc9f3a33eebe27d2d32e6d24)

Author SHA1 Message Date
peizhilin 23dec78772 fix script issue
6 years ago
heqiaozhi bd1c1724aa add ps_instance doc
6 years ago
minqiyang da796dfe05 Remove BinarySearch from Adam Op
6 years ago
dengkaipeng f16aa394f6 remove use_cudnn in python API. test=develop
6 years ago
Yu Yang b17444c84c Fix merge bug
6 years ago
Yu Yang 62b5772ffb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Qiao Longfei c624417c6f change sparse mode to lazy mode
6 years ago
Qiao Longfei 4035e4bab2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
6 years ago
tensor-tang 00d3afbcc9 add gru refer functions, test and benchmark
6 years ago
Qiao Longfei 3dc29b3905 change sparse_update to adam_update
6 years ago
tensor-tang 6eec461725 add lstm peephole benchmark
6 years ago
Wang Guibao e213050223 Fix multi-threading bug with WItH_MKL=ON (#14882)
6 years ago
tensor-tang bf9302f950 add lstm, peephole refer and test
6 years ago
dongdaxiang e52bb816e3 add copyright to __init__.py in distributed folder
6 years ago
dongdaxiang d839bd0dd4 simple commit
6 years ago
sneaxiy f0df62f136 add more unittest case
6 years ago
Qiao Longfei fc6ec6bd14 add sparse mode adam
6 years ago
Yu Yang 4ecdb6f486 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Xin Pan 47ea2534fb clean parallel do
6 years ago
sneaxiy f6741df462 merge develop
6 years ago
Yu Yang 7b10bf0e60 Use mkl
6 years ago
Zeng Jinle 1b564bc49a Merge pull request #14670 from sneaxiy/refactor_eager_deletion
6 years ago
SunGaofeng e3c4b0dace this is for psroi_pool op, test=develop (#14796)
6 years ago
dongdaxiang c9b799896e fix tag in async_executor
6 years ago
heqiaozhi 95b887c4f2 remove commit
6 years ago
Yu Yang b2b5241e2e Merge branch 'feature/tensor_type' into feature/refine_w2v
6 years ago
Yu Yang aa38fc4ce5 Fix compile
6 years ago
heqiaozhi f81957a753 refine cmake for pslib & pre_define
6 years ago
tensor-tang bf951fa737 add refer vrelu, videntity, vexp, vsigmoid, vtanh and test and benchmark
6 years ago
Yu Yang 15550a2753 Polish code
6 years ago
sneaxiy 2328bee1cc fix Windows compile bug
6 years ago
sneaxiy deb0d41cea fix cmake
6 years ago
heqiaozhi 2912d5311b fix code style bug & change pslib.cmake & change Cmakelist adapt pslib
6 years ago
dongdaxiang c59cdf3a24 refine executor_thread_worker.h and executor_thread_worker.cc code style
6 years ago
Yu Yang 9e0b33d7ad Merge branch 'feature/tensor_type' into feature/refine_w2v
6 years ago
heqiaozhi 3c01cdeff0 refine executor_thread_worker.cc & executor_thread_worker.h code style
6 years ago
dongdaxiang c71279bc69 refine code style for async_executor.h and async_executor.cc
6 years ago
Yu Yang 162637b64a Fix ngraph compile
6 years ago
dongdaxiang 33ee5cad61 format code style of executor_thread_worker.cc
6 years ago
Yu Yang 194e66f785 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
tensor-tang e9216e82f9 add refer vscal, vaddbias and test and benchmark
6 years ago
heqiaozhi 7bd16e3afa fix some bug & add log
6 years ago
Xin Pan 30aad88449 Merge pull request #14839 from panyx0718/infer3
6 years ago
sneaxiy e7c5c9d2de remove unnecesary code
6 years ago
sneaxiy db2daefe50 merge develop
6 years ago
sneaxiy 8b9d33fa1e add unittest and fix bug
6 years ago
tensor-tang a37038880e fix unit test with double type
6 years ago
tensor-tang 417d031f90 add refer vadd, vaddrelu, vsub and tests and benchmark
6 years ago
Yu Yang be11375661 Refine code
6 years ago
Yu Yang 8d9401152e Refine w2v
6 years ago
Tao Luo 66b6e473d0 Merge pull request #14732 from Sand3r-/mgallus/mkldnn-concat
6 years ago
sneaxiy e82772f425 fix cmake conflict
6 years ago
sneaxiy 0c554a59fa merge develop
6 years ago
tensor-tang f3250097bc fix bug and mac compile
6 years ago
tensor-tang bc0df6a948 make typename tuples
6 years ago
tensor-tang 194ce2e92c add benchmark
6 years ago
Yibing Liu 6951ef9a55 Fix the gelu backward to avoid nan (#14857)
6 years ago
Yu Yang c00e07cda0 Fix distribute compile
6 years ago
Tao Luo 322bb8d5c5 Merge pull request #14825 from NHZlX/add_benchmark_for_trt
6 years ago
Qiao Longfei 3668f07965 Merge pull request #14844 from jacquesqiao/pserver-should-crash
6 years ago
flame dd3aca3b96 Merge pull request #14824 from Superjomn/fix/visualizer
6 years ago
tensor-tang d846fb03a8 Merge pull request #14807 from baojun-nervana/ngraph_follow_up
6 years ago
sneaxiy ca84c2ca8f merge develop
6 years ago
sneaxiy e240ba2918 implement backward
6 years ago
sneaxiy 06f8aa5b97 remove while_op support temporarily
6 years ago
Yu Yang 81520a24cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
Yu Yang 8175983ef9 Merge pull request #14814 from reyoung/feature/gprof
6 years ago
Yu Yang eeca721a99 Merge pull request #14842 from reyoung/feature/refine_eigen_tensor
6 years ago
dengkaipeng 7ec3264b51 fix API spec. test=develop
6 years ago
tensor-tang d538513fce fix the compile error on mac
6 years ago
tensor-tang 28eb7d840c test all impls and all inplace cases
6 years ago
Yihua Xu acc6ae49b1 Fix the issue to run on AVX2 and AVX512F machines (#14851)
6 years ago
heqiaozhi 10ed9e0a6e download & run & instance
6 years ago
nhzlx 8c8019e388 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
6 years ago
Michal Gallus 92daace55c MKL-DNN Concat: Fix segfault related to referencing deleter memory primitive
6 years ago
Yu Yang 9f8d86858e Revert data_type
6 years ago
tensor-tang d4cab7d948 use jitkernel in one file
6 years ago
tensor-tang adc7ba2edd Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang 900c789a35 use jitcode and use vmul
6 years ago
Yu Yang 5e60906996 Fix compile error
6 years ago
Qiao Longfei 1870262ba9 pserver should crash early whe has problem
6 years ago
dengkaipeng a81fabd327 fix doc errors. test=develop
6 years ago
dengkaipeng cf06e50f1d add doc for adaptive pool. test=develop
6 years ago
dengkaipeng 266c6856c9 add adaptive pool 2d & 3d. test=develop
6 years ago
dengkaipeng eab4745965 add adaptive mode for pool.
6 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
6 years ago
Qiao Longfei 1213e2838f Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
6 years ago
nhzlx 644c13a387 fix compile error
6 years ago
sneaxiy 7923042365 merge develop
6 years ago
Qiao Longfei 0a7c7e97af test zero output of split_selected_rows_op
6 years ago
Tao Luo 3243b45bb2 Merge pull request #14651 from luotao1/has_attr
6 years ago
Xin Pan 1735022a1b fix clang
6 years ago
heqiaozhi 016a068775 stop server
6 years ago
sneaxiy 8760d23c7d featue/py_func
6 years ago
zhang wenhui c4c5f0b8ca Merge pull request #14771 from frankwhzhang/bpr
6 years ago
Tao Luo 067ed70f2d add HasProtoAttr function in op_desc.h, clean node.h
6 years ago
heqiaozhi 8e3fe2d735 add skip op
6 years ago
Yu Yang b22d638d8f Speed up SizeOfType
6 years ago
Xin Pan 4a716a70c6 Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative
6 years ago
nhzlx a5bfed3776 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
6 years ago
nhzlx afc51e6f82 add benchmark for trt
6 years ago
Zhaolong Xing bc6d0a3427 Merge pull request #14762 from NHZlX/fix_bug_of_trt_pool
6 years ago
tensor-tang 53709e7e61 refine names
6 years ago
superjomn edd1f5a92b fix visualizer
6 years ago
Brian Liu 9623b45f40 Remove unnecessary MKLDNN reorder (#14799)
6 years ago
frankwhzhang 90c7f9870e fix 'name', test=develop
6 years ago
Qiao Longfei abf140289f split selected rows op should always init output selected rows
6 years ago
nhzlx 019e8bbed2 fix comments test=develop
6 years ago
frankwhzhang 271c480822 update API, test=develop
6 years ago
frankwhzhang c9a653820b fix label_pos ,add test_layers.py, test=develop
6 years ago
Tao Luo e99597d35c Merge branch 'develop' into luotao1-has_attr
6 years ago
sneaxiy 66182abda6 add cuda cudnn version check
6 years ago
Yu Yang f0c0bf328d Add gperftools supports for PE
6 years ago
frankwhzhang a672b291e5 fix code style, test=develop
6 years ago
frankwhzhang ea95f9c335 fix style bug, test=develop
6 years ago
frankwhzhang 68c2025844 fix nn.py&API.spec, test=develop
6 years ago
Xin Pan 748549b2e3 Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
baojun-nervana fddbd87c0a Rename argument
6 years ago
baojun-nervana 22ac2133e4 Rename class
6 years ago
baojun-nervana bfde5e10ce Move ngraph compile control to cmake
6 years ago
sneaxiy 2c6159a151 fix unittest
6 years ago
Xin Pan c049fa7cf7 Revert "Revert "Revert "Imperative"""
6 years ago
gongweibao f1fb64b17f Add reduce sparse tensor feature. (#14757)
6 years ago
sneaxiy eb8252466b polish code
6 years ago
Tao Luo c83d5b7a16 Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Zeng Jinle add98c9e7d Merge pull request #14745 from sneaxiy/fix_eigen_deallocate
6 years ago
frankwhzhang f4cc5881b0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang 97de98cd0a update bpr_loss op code, test=develop
6 years ago
Xin Pan 6c80bb3ce9 Merge pull request #14786 from PaddlePaddle/revert-14782-revert-14398-imperative
6 years ago
heqiaozhi 575ae7c6c3 refine pslib inferface & fix some bugs
6 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Yihua Xu 240d974ac5 Clean Code
6 years ago
Tao Luo 54fcafb5f6 Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
6 years ago
Xin Pan 2538ef64f1 Revert "Revert "Imperative""
6 years ago
guru4elephant b82a44ea85 Merge pull request #14778 from wangguibao/async_executor_bugfix
6 years ago
sneaxiy 8095fb5e68 fix code bug in CPU compilation
6 years ago
sneaxiy 387bac46b5 refine code
6 years ago
Tao Luo cf66133857 Merge pull request #14734 from luotao1/memory_load
6 years ago
Yihua Xu 155328a488 Clean Code
6 years ago
Xin Pan 6217f42ab7 Revert "Imperative"
6 years ago
Tao Luo 743cb840f1 update with comments
7 years ago
tensor-tang ce674b685f add readme doc and complete TODOs
7 years ago
wangguibao 5a2cd4505b AsyncExecutor bugfix: Tensor to LoDTensor
7 years ago
wangguibao 5f98d80039 AsyncExecutor bugfix: Tensor change to LoDTensor
7 years ago
flame f6a877bc57 add tool to visualize inference model (#14621)
7 years ago
frankwhzhang 93551a3440 update API.spec
7 years ago
Tao Luo 42359e88a4 clean code
7 years ago
Tao Luo 923b18877e Merge branch 'develop' into memory_load
7 years ago
Tao Luo 405b2486db support loading from memory
7 years ago
Xin Pan b52f5d2870 Merge pull request #14398 from panyx0718/imperative
7 years ago
frankwhzhang 272f3d3111 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
frankwhzhang 570d89ec84 add bpr_loss operator , test=develop
7 years ago
qingqing01 549f165b59 Speed conv_fusion_op for identity activation. (#14744)
7 years ago
tensor-tang fab0ee8757 Merge remote-tracking branch 'ups/develop' into refine/jitkernel
7 years ago
Houjiang Chen c6b39a0099 Merge pull request #14714 from NHZlX/add_prelu_gpu
7 years ago
Jiabin Yang 8a111ac64d Merge pull request #14763 from junjun315/fix-mac-build-check
7 years ago
tensor-tang dbe451976b Merge pull request #14753 from tensor-tang/refine/namespace
7 years ago
sneaxiy 0f96c2e80f fix thread-safety bug
7 years ago
lujun 5026741b82 fix the bug for mac build. python -c error. test=develop
7 years ago
nhzlx 722b0a805f fix bug of trt pool
7 years ago
Jiabin Yang d9bb55a1f9 Merge pull request #14756 from JiabinYang/fix_hs_op
7 years ago
Yihua Xu 65dbc7cca4 Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
JiabinYang e05e1d7d88 fix bug in dist train on hs, test=develop
7 years ago
tensor-tang a1eb21e704 refine names
7 years ago
tensor-tang b523787f9f remove jit namespace
7 years ago
tensor-tang 191948c933 enable jitcode
7 years ago
tensor-tang 4a93db9288 remove jit namespace
7 years ago
Hongyu Liu 8cda28f345 Merge pull request #14733 from phlrain/add_cudnn_5_support
7 years ago
heqiaozhi d3ca359e44 config init & adapt to interface
7 years ago
Xin Pan 73b4d1aa72 Merge pull request #14742 from panyx0718/infer2
7 years ago
Jiabin Yang 21c0f8749e Merge pull request #14728 from JiabinYang/optimize_hs_op
7 years ago
tensor-tang 45bfa70cb8 complete vmul jit kernel
7 years ago
tensor-tang 77236e33fc init jitkernel
7 years ago
Xin Pan 82d68281c0 follow comments
7 years ago
sneaxiy 900765224c fix deallocate bug
7 years ago
liuhongyu b408fc4dac Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 8b2898e201 fix bug of formate; test=develop
7 years ago
Xin Pan 41c28d54c6 allow customize kernel selection
7 years ago
Xin Pan 439af8d50a Merge pull request #14717 from panyx0718/infer
7 years ago
lujun 104a332a28 Merge pull request #14722 from junjun315/up-12-python-install
7 years ago
liuhongyu 773dc73fbf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 8daf67f90f fix bugs; test=develop
7 years ago
chengduo 04539d4c5d Fix clip.py (#14718)
7 years ago
sneaxiy d0c8b9b9b3 remove timeout unittest
7 years ago
heqiaozhi 419506f510 refine for compile pslib.so
7 years ago
Xin Pan 052cc5f538 Merge pull request #14725 from ZongwuYang/my-cool-stuff
7 years ago
Michal Gallus 6fdbb365ce Include MKL-DNN header to concat op only when flag is set
7 years ago
Michal Gallus f2a880421e Fix style @ concat integration and tests
7 years ago
Michal Gallus 738069e491 Refactor MKL-DNN Concat
7 years ago
Michal Gallus 208f912512 Implement MKL-DNN Concat
7 years ago
Wu Yi 29d9fb53fc [Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661)
7 years ago
liuhongyu e80402fd0e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 968dd3c078 add cudnn 5 support; test=develop
7 years ago
sneaxiy e694d0c2e4 fix while_op eager deletion bug
7 years ago
Xin Pan 461ca35be1 Merge pull request #14590 from panyx0718/fix4
7 years ago
gongweibao 50a698525d Fix log level (#14692)
7 years ago
JiabinYang 8c75705984 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
7 years ago
Xin Pan dc458b1482 Merge pull request #14713 from panyx0718/api
7 years ago
JiabinYang b387a19410 optimize op with blas
7 years ago
Zeng Jinle ff4237309a Merge pull request #14720 from sneaxiy/fix_seq_mask_op_infershape
7 years ago
heqiaozhi 2301abc481 cc libaray add pslib
7 years ago
ZongwuYang 1560eb4a6d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my-cool-stuff
7 years ago
ZongwuYang deb04809bd test=develop
7 years ago
Xin Pan da4e0bf1a1 add 2 more files
7 years ago
Xin Pan 7c5289f68e Merge pull request #14719 from PaddlePaddle/revert-14666-feature/estiminate_flops
7 years ago
lujun 9da5954a21 fix mac ci test step, test=develop
7 years ago
Kaipeng Deng 934f13a70a Merge pull request #14371 from heavengate/yolo_loss
7 years ago
sneaxiy 35a2578426 fix bug
7 years ago
sneaxiy 65867d8989 test=develop
7 years ago
Jiabin Yang 6dcc6378b7 Merge pull request #14665 from JiabinYang/ci/add_import_check
7 years ago
zhang wenhui abbe382e1e Revert "Add EstiminateFlops"
7 years ago
Xin Pan 0591ba96ec fix hack
7 years ago
sneaxiy 64ad051b9a merge develop
7 years ago
sneaxiy c47c451a00 fix bug
7 years ago
heqiaozhi a77fa67bbd async_thread_trainer & libmct & pslib.cmake
7 years ago
Tao Luo 3437e17713 Merge branch 'has_attr' of https://github.com/luotao1/Paddle into luotao1-has_attr
7 years ago
nhzlx e7abe6b654 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
7 years ago
nhzlx f75815b78c add prelu gpu inference
7 years ago
Xin Pan bcf36d8401 add more files to protected file list
7 years ago
Tao Luo 0e3048db43 Merge pull request #14659 from luotao1/update_pass
7 years ago
Xin Pan 7e0801d4ed Merge pull request #14441 from baojun-nervana/intel/ngraph_op
7 years ago
Yihua Xu 82eefceabe Add the profile_mkldnn flag for profile function(test=develop)
7 years ago
Xin Pan 35e6b5e16a polish
7 years ago
Yihua Xu ea00270fe8 Remove the dims checking when the dim is 3 (test=develop)
7 years ago
Xin Pan b80fe8264a polish
7 years ago
Yihua Xu 64e261c6cd Implement the fusion of convolution and bias for mkldnn
7 years ago
Tao Luo 8d6984eb9b change OpHasAttr to RuntimeHasAttr, add some comments
7 years ago
jerrywgz 96dc3d8326 Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
7 years ago
Tao Luo a6ac42669c Merge branch 'develop' into update_pass
7 years ago
Yihua Xu 669191c9cc Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu 4f71a6ee2c Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
7 years ago
Yibing Liu c7382df80f Print assert failure id in lookup_table_op (#14698)
7 years ago
Yu Yang 0f0e197914 Merge pull request #14666 from reyoung/feature/estiminate_flops
7 years ago
Xin Pan 93c16d9628 polish the autograd (need to verify correctness)
7 years ago
Xin Pan c3236f82d6 polish
7 years ago
Xin Pan e5d64fd4d1 initial imperative
7 years ago
Xin Pan 4d0df1fea7 add fields for autograd
7 years ago
Xin Pan 8138391631 add OpBase and unify with VarBase
7 years ago
Xin Pan f6f0692451 clean up
7 years ago
Xin Pan 0318c95149 rebase develop
7 years ago
Xin Pan aeb74af54c allow operator to run imperatively
7 years ago
Xin Pan b1f6fda5e5 run forward
7 years ago
Xin Pan a6d23083f0 some tracing
7 years ago
Xin Pan dac92e560c initial commit
7 years ago
barrierye 08233beed7 add the comment for CheckFile function. test=develop
7 years ago
barrierye d62a3dd72d add the comment for CheckFile function. test=develop
7 years ago
barrierye d89108766c update CheckFile function in data_feed for ignore the space at the end of each line of data(for example, it may be added '\t' character to the end of the reduce task output when processes data by hadoop, which does not affect the correctness of the data). test=develop
7 years ago
phlrain 9f7eae861d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago