Commit Graph

12787 Commits (6a5f604607e06a0dffaf16ffe88d7033ecc42b30)

Author SHA1 Message Date
Zeng Jinle 16c244bc3f
Merge pull request #14928 from sneaxiy/fix_gc
6 years ago
Xin Pan f897bd16c0 clean
6 years ago
peizhilin fa135bbf52 Fix the mkl build script on windows
6 years ago
Xin Pan 70981f5d79 clean
6 years ago
colourful-tree 44ad2f4479
Merge pull request #14873 from colourful-tree/develop
6 years ago
minqiyang a3fa3f85d7 Polish code
6 years ago
Yu Yang 2803cf5776
Merge pull request #14868 from reyoung/feature/refine_w2v
6 years ago
Tao Luo 6aa6b8cfa0
Merge pull request #14918 from luotao1/mobilenet_test
6 years ago
Zhaolong Xing a9fb34fad8
Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
6 years ago
peizhilin b601f2de8d include the mkl fix only
6 years ago
dzhwinter 7cd24b1318
add ir memory optimize. (#14530)
6 years ago
Xin Pan fb8ae30331 fix
6 years ago
Qiyang Min fd1d2c897e
Merge pull request #14894 from velconia/add_huber_regression_loss_op
6 years ago
Tao Luo 2f55a04ec6 add refer result comparasion
6 years ago
guru4elephant a79a3ea2f0
Merge branch 'develop' into develop
6 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
6 years ago
wopeizl 0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
6 years ago
sneaxiy c631412eab fix gc bug
6 years ago
Zeng Jinle 36a1d021a4
Merge pull request #14927 from sneaxiy/fix_cuda_stream_callback_in_cuda10
6 years ago
Xin Pan eaf8ba35b5 change input
6 years ago
Xin Pan 840e6729e2 inject context
6 years ago
Xin Pan bbff0df320 try cache variables
6 years ago
Xin Pan 52bc4ee75a delay infer scope
6 years ago
wopeizl fa78fc60be
Merge pull request #14907 from wopeizl/windows/avx
6 years ago
Qiao Longfei fe3995d335 refine code test=develop
6 years ago
tensor-tang 74292f414c enable eltwise nchw16c mul nc
6 years ago
Qiao Longfei 56686d0f34 clean code test=develop
6 years ago
Qiao Longfei fd152289fa clean for range in test=develop
6 years ago
sneaxiy 2373aeb5e8 fix bug
6 years ago
nhzlx 050a68dde3 fix comments
6 years ago
tensor-tang 720b55cbcf enable crf decoding and layer norm refer code
6 years ago
tensor-tang 64a90b2f1c use vadd, vaddrelu, lstm and gru jitkernel
6 years ago
Qiao Longfei 1141db8114 update test_adam_op
6 years ago
gongweibao addded48e1
test=develop (#14898)
6 years ago
Tao Luo 1a6d2cfe39 add test_analyzer_mobilenet
6 years ago
minqiyang aa41ee75a1 Accelerate PADDLE_ENFORCE
6 years ago
Qiao Longfei 96604fda10 fix gpu data
6 years ago
nhzlx fcc93d96d5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
6 years ago
dongdaxiang bc4f16ca6f remove some comments
6 years ago
peizhilin 41456e1723 Remove the useless definition
6 years ago
minqiyang 24eb8f038c Fix bug
6 years ago
Yu Yang 740e1626ce Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_w2v
6 years ago
minqiyang 728e7e88fb Use xxHash as scope's hash algorithm
6 years ago
minqiyang 81651fca45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
minqiyang bd0067b26c Polish code
6 years ago
peizhilin d519fd6944 test=develop
6 years ago
Yu Yang bacf1d2399 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Yu Yang e439257ef7 Fix include style
6 years ago
Qiao Longfei 238b24bfa2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
6 years ago
Qiao Longfei 763e8fdf02 fix compile error
6 years ago
peizhilin 01dd9061a0 add avx support for windows
6 years ago
Qiao Longfei fcde2b2725 add ForRangeIn
6 years ago
dongdaxiang 921b7f452a add API.spec
6 years ago
tensor-tang 3713d08d40 enable jitcode gru
6 years ago
tensor-tang 7c1f3ad6eb enable jitcode lstm
6 years ago
nhzlx c0c9fcd9c7 add source file
6 years ago
dongdaxiang 4c0a769d1d avoid clock time in WIN32 mode
6 years ago
dongdaxiang 66522046ad remove clock time in WIN32 mode
6 years ago
Xin Pan 363bf8a4d8
Merge pull request #14897 from panyx0718/clean2
6 years ago
dongdaxiang f2b92d77b5 remove clock time in WIN32 mode
6 years ago
nhzlx 388953027e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
6 years ago
nhzlx 4e4a777243 add conv+elementwiseadd pass
6 years ago
Xin Pan 2ebf12f340 fix
6 years ago
nhzlx 514648665a fix trt_op test test=develop
6 years ago
Kaipeng Deng dc76e4b0f1
Merge pull request #14701 from heavengate/adaptive_pool
6 years ago
tensor-tang 80766bcb82 enable act jitcode vexp, vrelu, vsigmoid and vtanh
6 years ago
nhzlx 050e118f3c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_thread_bug
6 years ago
nhzlx 96216052d5 1. fix trt multi thread bug
6 years ago
gongweibao 0b1c7d838c
Add brpc serialization support. (#11430)
6 years ago
tensor-tang fd0a954fbf enable blas jitcode vmul, vadd, vaddrelu, vscal and vaddbias
6 years ago
Xin Pan 787d837f50 fix
6 years ago
Yan Chunwei a985949be9
Fea/fuse conv elementwise add fuse (#14669)
6 years ago
tensor-tang 5e97be7ba7 enable jitkernel mkl vexp, vsigmoid and vtanh
6 years ago
minqiyang 5fea8cd478 Add sorted_result parameter to SelectedRows Functor
6 years ago
minqiyang 5fff20c21a Change name to huber loss
6 years ago
tensor-tang ae17926987 enable jitkernel mkl vmul, vadd and vscal
6 years ago
tensor-tang 77907a3502 refine benchmark template
6 years ago
Qiao Longfei cf5264629f update API.spec
6 years ago
Xin Pan e90b2f104c In most times, const_cast is bad and break interface contract and
6 years ago
tensor-tang 8e785fec8d clean code and refine tests template
6 years ago
minqiyang ab98101c2e Update API.spec
6 years ago
Yu Yang 04a570b463 Fix ut
6 years ago
heqiaozhi 09d669ba40 fix static_cast to const_cast
6 years ago
minqiyang 65d355a72c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_huber_regression_loss_op
6 years ago
minqiyang c550e0ce06 Add python interface for huber regression loss
6 years ago
peizhilin 23dec78772 fix script issue
6 years ago
heqiaozhi bd1c1724aa add ps_instance doc
6 years ago
minqiyang da796dfe05 Remove BinarySearch from Adam Op
6 years ago
dengkaipeng f16aa394f6 remove use_cudnn in python API. test=develop
6 years ago
Yu Yang b17444c84c Fix merge bug
6 years ago
Yu Yang 62b5772ffb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Qiao Longfei c624417c6f change sparse mode to lazy mode
6 years ago
Qiao Longfei 4035e4bab2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
6 years ago
tensor-tang 00d3afbcc9 add gru refer functions, test and benchmark
6 years ago
Qiao Longfei 3dc29b3905 change sparse_update to adam_update
6 years ago
tensor-tang 6eec461725 add lstm peephole benchmark
6 years ago
Wang Guibao e213050223
Fix multi-threading bug with WItH_MKL=ON (#14882)
6 years ago
tensor-tang bf9302f950 add lstm, peephole refer and test
6 years ago
dongdaxiang e52bb816e3 add copyright to __init__.py in distributed folder
6 years ago
dongdaxiang d839bd0dd4 simple commit
6 years ago
minqiyang 3bb9b0cac2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
minqiyang a81495d6f4 Fix code
6 years ago
minqiyang ad6ae0b071 1. Add SpinLock
6 years ago
sneaxiy f0df62f136 add more unittest case
6 years ago
Qiao Longfei fc6ec6bd14 add sparse mode adam
6 years ago
Yu Yang 4ecdb6f486 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Xin Pan 47ea2534fb clean parallel do
6 years ago
sneaxiy f6741df462 merge develop
6 years ago
Yu Yang 7b10bf0e60 Use mkl
6 years ago
Zeng Jinle 1b564bc49a
Merge pull request #14670 from sneaxiy/refactor_eager_deletion
6 years ago
SunGaofeng e3c4b0dace this is for psroi_pool op, test=develop (#14796)
6 years ago
dongdaxiang c9b799896e fix tag in async_executor
6 years ago
heqiaozhi 95b887c4f2 remove commit
6 years ago
Yu Yang b2b5241e2e Merge branch 'feature/tensor_type' into feature/refine_w2v
6 years ago
Yu Yang aa38fc4ce5 Fix compile
6 years ago
heqiaozhi f81957a753 refine cmake for pslib & pre_define
6 years ago
tensor-tang bf951fa737 add refer vrelu, videntity, vexp, vsigmoid, vtanh and test and benchmark
6 years ago
Yu Yang 15550a2753 Polish code
6 years ago
sneaxiy 2328bee1cc fix Windows compile bug
6 years ago
sneaxiy deb0d41cea fix cmake
6 years ago
heqiaozhi 2912d5311b fix code style bug & change pslib.cmake & change Cmakelist adapt pslib
6 years ago
dongdaxiang c59cdf3a24 refine executor_thread_worker.h and executor_thread_worker.cc code style
6 years ago
Yu Yang 9e0b33d7ad Merge branch 'feature/tensor_type' into feature/refine_w2v
6 years ago
heqiaozhi 3c01cdeff0 refine executor_thread_worker.cc & executor_thread_worker.h code style
6 years ago
dongdaxiang c71279bc69 refine code style for async_executor.h and async_executor.cc
6 years ago
Yu Yang 162637b64a Fix ngraph compile
6 years ago
dongdaxiang 33ee5cad61 format code style of executor_thread_worker.cc
6 years ago
Yu Yang 194e66f785 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
tensor-tang e9216e82f9 add refer vscal, vaddbias and test and benchmark
6 years ago
heqiaozhi 7bd16e3afa fix some bug & add log
6 years ago
Xin Pan 30aad88449
Merge pull request #14839 from panyx0718/infer3
6 years ago
sneaxiy e7c5c9d2de remove unnecesary code
6 years ago
sneaxiy db2daefe50 merge develop
6 years ago
sneaxiy 8b9d33fa1e add unittest and fix bug
6 years ago
tensor-tang a37038880e fix unit test with double type
6 years ago
tensor-tang 417d031f90 add refer vadd, vaddrelu, vsub and tests and benchmark
6 years ago
minqiyang a61eb543f5 Add RWLock to Scope
6 years ago
Yu Yang be11375661 Refine code
6 years ago
Yu Yang 8d9401152e Refine w2v
6 years ago
Tao Luo 66b6e473d0
Merge pull request #14732 from Sand3r-/mgallus/mkldnn-concat
6 years ago
sneaxiy e82772f425 fix cmake conflict
6 years ago
sneaxiy 0c554a59fa merge develop
6 years ago
minqiyang 1b61021cb3 Polish code
6 years ago
minqiyang b75bd29c3a Remove debug info
6 years ago
tensor-tang f3250097bc fix bug and mac compile
6 years ago
minqiyang 7a43e51703 Add gperf tools
6 years ago
tensor-tang bc0df6a948 make typename tuples
6 years ago
tensor-tang 194ce2e92c add benchmark
6 years ago
Yibing Liu 6951ef9a55
Fix the gelu backward to avoid nan (#14857)
6 years ago
Yu Yang c00e07cda0 Fix distribute compile
6 years ago
Tao Luo 322bb8d5c5
Merge pull request #14825 from NHZlX/add_benchmark_for_trt
6 years ago
Qiao Longfei 3668f07965
Merge pull request #14844 from jacquesqiao/pserver-should-crash
6 years ago
flame dd3aca3b96
Merge pull request #14824 from Superjomn/fix/visualizer
6 years ago
tensor-tang d846fb03a8
Merge pull request #14807 from baojun-nervana/ngraph_follow_up
6 years ago
sneaxiy ca84c2ca8f merge develop
6 years ago
sneaxiy e240ba2918 implement backward
6 years ago
sneaxiy 06f8aa5b97 remove while_op support temporarily
6 years ago
Yu Yang 81520a24cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
6 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
6 years ago
Yu Yang 8175983ef9
Merge pull request #14814 from reyoung/feature/gprof
6 years ago
Yu Yang eeca721a99
Merge pull request #14842 from reyoung/feature/refine_eigen_tensor
6 years ago
dengkaipeng 7ec3264b51 fix API spec. test=develop
6 years ago
tensor-tang d538513fce fix the compile error on mac
6 years ago
tensor-tang 28eb7d840c test all impls and all inplace cases
6 years ago
Yihua Xu acc6ae49b1 Fix the issue to run on AVX2 and AVX512F machines (#14851)
6 years ago
heqiaozhi 10ed9e0a6e download & run & instance
6 years ago
nhzlx 8c8019e388 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
6 years ago
minqiyang 570338699b Add debug info
6 years ago
Michal Gallus 92daace55c MKL-DNN Concat: Fix segfault related to referencing deleter memory primitive
6 years ago
Yu Yang 9f8d86858e Revert data_type
6 years ago
tensor-tang d4cab7d948 use jitkernel in one file
6 years ago
tensor-tang adc7ba2edd Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang 900c789a35 use jitcode and use vmul
6 years ago
Yu Yang 5e60906996 Fix compile error
6 years ago
Qiao Longfei 1870262ba9 pserver should crash early whe has problem
6 years ago
dengkaipeng a81fabd327 fix doc errors. test=develop
6 years ago
dengkaipeng cf06e50f1d add doc for adaptive pool. test=develop
6 years ago
dengkaipeng 266c6856c9 add adaptive pool 2d & 3d. test=develop
6 years ago
dengkaipeng eab4745965 add adaptive mode for pool.
6 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
6 years ago
Qiao Longfei 1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
6 years ago
nhzlx 644c13a387 fix compile error
6 years ago
sneaxiy 7923042365 merge develop
6 years ago
Qiao Longfei 0a7c7e97af test zero output of split_selected_rows_op
6 years ago
Tao Luo 3243b45bb2
Merge pull request #14651 from luotao1/has_attr
6 years ago
Xin Pan 1735022a1b fix clang
6 years ago
heqiaozhi 016a068775 stop server
6 years ago
sneaxiy 8760d23c7d featue/py_func
6 years ago
zhang wenhui c4c5f0b8ca
Merge pull request #14771 from frankwhzhang/bpr
6 years ago
Tao Luo 067ed70f2d add HasProtoAttr function in op_desc.h, clean node.h
6 years ago
heqiaozhi 8e3fe2d735 add skip op
6 years ago
minqiyang 202b2f1fa7 Move the beta pow scale calculation into Adam Op
6 years ago
Yu Yang b22d638d8f Speed up SizeOfType
6 years ago
Xin Pan 4a716a70c6
Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative
6 years ago
nhzlx a5bfed3776 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
6 years ago
nhzlx afc51e6f82 add benchmark for trt
6 years ago
Zhaolong Xing bc6d0a3427
Merge pull request #14762 from NHZlX/fix_bug_of_trt_pool
6 years ago
tensor-tang 53709e7e61 refine names
6 years ago
superjomn edd1f5a92b fix visualizer
6 years ago
Brian Liu 9623b45f40 Remove unnecessary MKLDNN reorder (#14799)
6 years ago
frankwhzhang 90c7f9870e fix 'name', test=develop
6 years ago
Qiao Longfei abf140289f split selected rows op should always init output selected rows
6 years ago
nhzlx 019e8bbed2 fix comments test=develop
6 years ago
frankwhzhang 271c480822 update API, test=develop
6 years ago
frankwhzhang c9a653820b fix label_pos ,add test_layers.py, test=develop
6 years ago
Tao Luo e99597d35c Merge branch 'develop' into luotao1-has_attr
6 years ago
sneaxiy 66182abda6 add cuda cudnn version check
6 years ago
Yu Yang f0c0bf328d Add gperftools supports for PE
6 years ago
frankwhzhang a672b291e5 fix code style, test=develop
6 years ago
frankwhzhang ea95f9c335 fix style bug, test=develop
6 years ago
frankwhzhang 68c2025844 fix nn.py&API.spec, test=develop
6 years ago
Xin Pan 748549b2e3 Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
6 years ago
bingyanghuang 943ad4781f One possible solution to add flexibility for mkldnn placement pass (#14768)
6 years ago
baojun-nervana fddbd87c0a Rename argument
6 years ago
baojun-nervana 22ac2133e4 Rename class
6 years ago
baojun-nervana bfde5e10ce Move ngraph compile control to cmake
6 years ago
sneaxiy 2c6159a151 fix unittest
6 years ago
Xin Pan c049fa7cf7
Revert "Revert "Revert "Imperative"""
6 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
6 years ago
sneaxiy eb8252466b polish code
6 years ago
Tao Luo c83d5b7a16
Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Zeng Jinle add98c9e7d
Merge pull request #14745 from sneaxiy/fix_eigen_deallocate
6 years ago
frankwhzhang f4cc5881b0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang 97de98cd0a update bpr_loss op code, test=develop
6 years ago
Xin Pan 6c80bb3ce9
Merge pull request #14786 from PaddlePaddle/revert-14782-revert-14398-imperative
6 years ago
heqiaozhi 575ae7c6c3 refine pslib inferface & fix some bugs
6 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
6 years ago
Yihua Xu 240d974ac5 Clean Code
6 years ago
Tao Luo 54fcafb5f6
Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
6 years ago
Xin Pan 2538ef64f1
Revert "Revert "Imperative""
6 years ago
guru4elephant b82a44ea85
Merge pull request #14778 from wangguibao/async_executor_bugfix
6 years ago
sneaxiy 8095fb5e68 fix code bug in CPU compilation
6 years ago
sneaxiy 387bac46b5 refine code
6 years ago
Tao Luo cf66133857
Merge pull request #14734 from luotao1/memory_load
6 years ago
Yihua Xu 155328a488 Clean Code
6 years ago
Xin Pan 6217f42ab7
Revert "Imperative"
6 years ago
Tao Luo 743cb840f1 update with comments
6 years ago
tensor-tang ce674b685f add readme doc and complete TODOs
6 years ago
wangguibao 5a2cd4505b AsyncExecutor bugfix: Tensor to LoDTensor
6 years ago
wangguibao 5f98d80039 AsyncExecutor bugfix: Tensor change to LoDTensor
6 years ago
flame f6a877bc57 add tool to visualize inference model (#14621)
6 years ago
frankwhzhang 93551a3440 update API.spec
6 years ago
Tao Luo 42359e88a4 clean code
6 years ago
Tao Luo 923b18877e Merge branch 'develop' into memory_load
6 years ago
Tao Luo 405b2486db support loading from memory
6 years ago
Xin Pan b52f5d2870
Merge pull request #14398 from panyx0718/imperative
6 years ago
frankwhzhang 272f3d3111 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang 570d89ec84 add bpr_loss operator , test=develop
6 years ago
qingqing01 549f165b59
Speed conv_fusion_op for identity activation. (#14744)
6 years ago
tensor-tang fab0ee8757 Merge remote-tracking branch 'ups/develop' into refine/jitkernel
6 years ago