Commit Graph

11959 Commits (5d5e0656b28b1ec9f27c73aff8bb4edcac719c17)

Author SHA1 Message Date
peizhilin bb3f6bd31c Merge branch 'windows/build' into windows/online
6 years ago
peizhilin 1b75fd2236 revert
6 years ago
peizhilin 61fa5218b9 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yibing Liu bd2943788b Fix gather & stack op (#14355)
6 years ago
Tao Luo 9d4425dd1c Merge pull request #14227 from baojun-nervana/intel/ngraph_cmake
6 years ago
Yu Yang 8f9bfad246 perf(compile): speed up reduce_op compile by splitting files (#14294)
6 years ago
nhzlx 397de907ed merge develops
6 years ago
nhzlx d6ff006903 add serial to trt test and do not print log for unused trt logs
6 years ago
peizhilin 13bfee1f85 Merge branch 'windows/build' into windows/online
6 years ago
peizhilin 7840d181c9 fix style issue
6 years ago
peizhilin dc339b78d7 fix code style
6 years ago
sneaxiy d231e55065 merge develop
6 years ago
sneaxiy cf8d2e67e3 clean buffered_allocator
6 years ago
peizhilin ef8a7db81e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
baojun-nervana 5d20c42219 Set ngraph off as default
6 years ago
Jacek Czaja 03299ed46c - Fix to linking for GPU builds of softmax inference
6 years ago
Jacek Czaja 0756343767 - Fix GPU compilation
6 years ago
Jacek Czaja d332326847 - Added unit tests for softmax is_test=True op
6 years ago
Jacek Czaja c1fccc29c1 - Noise adding removed for Test phase of softmax
6 years ago
Tao Luo 573e68eb40 Merge pull request #14348 from luotao1/speedup_analysis
6 years ago
peizhilin 9b558a8035 Merge branch 'windows/build' into windows/online
6 years ago
peizhilin 7638f0afb3 simplify the logic
6 years ago
peizhilin d01a26280e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Xin Pan ff28b1ffc0 Merge pull request #14071 from barrierye/add_similarity_focus_op
6 years ago
li099 688ed60116 Add lod tensor array to tensor op (#13990)
6 years ago
peizhilin 6c2b891d87 Merge branch 'windows/build' into windows/online
6 years ago
peizhilin e23061e0dc Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
chengduo 6c6e638550 Add InferVarType for some op (#14201)
6 years ago
peizhilin 664a4e010c Merge branch 'windows/build' into windows/online
6 years ago
peizhilin 1eec5a428f Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Kaipeng Deng 0b38822624 Merge pull request #14345 from heavengate/fix_grid_sampler
6 years ago
peizhilin 6f9c70acb7 Merge branch 'windows/build' into windows/online
6 years ago
peizhilin ca60e1d34d Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Tao Luo 433fc7c1d4 skip mkldnn related pass when use_mkldnn=false
6 years ago
Qiyang Min 200c41026a Merge pull request #14324 from velconia/fix_vlog
6 years ago
peizhilin 4bd0c4c5ee test=develop
6 years ago
peizhilin 63febbcb3e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin 350f1f3971 remove duplicate function definition
6 years ago
Qiyang Min 0cceede5a2 Merge pull request #14332 from velconia/add_py3_to_gen_dockerfile
6 years ago
peizhilin 4b1f1a8787 fix merge issue
6 years ago
peizhilin d08334011a fix merge issue
6 years ago
Yu Yang 6ae0b91b39 Clean LockGuardPtr
6 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yu Yang 1420c3b155 Add enum AllocatorStrategy
6 years ago
Qiyang Min 698698f2fa Merge branch 'develop' into fix_vlog
6 years ago
qingqing01 abe209234f Exhaustive search for cuDNN conv. (#14286)
6 years ago
Yu Yang b59a9bfb7c Clean buffered_allocator
6 years ago
Kaipeng Deng f215534ecf Merge pull request #14205 from heavengate/nearest_interp
6 years ago
dengkaipeng 72108d8dbe fix win compile error: EigenTenor * float unsupport. test=develop
6 years ago
Yu Yang 26fb34c365 Merge develop tiny fix
6 years ago
Yu Yang fdc689142c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang 7ffc9fd839 Merge branch 'rewrite_allocation' of https://github.com/sneaxiy/Paddle into rewrite_allocation
6 years ago
tensor-tang 22125ebaef Merge pull request #14321 from tensor-tang/fea/jit/vscal
6 years ago
Tao Luo f1046d7e37 Merge pull request #14335 from wojtuss/wojtuss/add-graph-viz
6 years ago
Tao Luo 34e9e59f4a Merge pull request #14333 from kbinias/change-hardcoded-format-and-bump-mkldnn-version
6 years ago
Qiao Longfei 3f91e0f001 update API.spec
6 years ago
Sylwester Fraczek b5f617fa9b make mobilenet test reuse resnet50 test
6 years ago
Sylwester Fraczek 1987d45e75 add comment for depthwise pass
6 years ago
minqiyang 87450b9ad4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
peizhilin 41b423d41b remove duplicate
6 years ago
peizhilin dcfab11193 merge from develop
6 years ago
peizhilin 4ffa92d4f0 Merge branch 'develop' into windows/build
6 years ago
chengduo c5b6573a5a Fix input<tensor> (#14208)
6 years ago
Krzysztof Binias f1c1acf1ac Changed hardcoded format to any in convolution and bumped MKL-DNN version to 0.17-rc
6 years ago
Tao Luo 813e54efbd Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
6 years ago
Qiyang Min 0804bf333b Merge pull request #14234 from velconia/fix_ut_test_decayed_adagrad_op
6 years ago
minqiyang 3db9fad764 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
minqiyang 3da43dcae2 Because anakin do NOT use glog, so we revert anakin related change
6 years ago
Tao Luo 387610aae1 Merge pull request #14325 from luotao1/fix_test_analysis_predictor
6 years ago
minqiyang 37ee36510e Change production mode Dockerfile to support python3
6 years ago
peizhilin 45125ba538 fix share library issue
6 years ago
Xin Pan b03a44e062 Merge pull request #14026 from JiabinYang/add_reorg_op
6 years ago
Xin Pan ff6c809bfc Merge pull request #14251 from panyx0718/fix
6 years ago
Zhaolong Xing ba8b5619a3 Revert "cherry picked windows patches."
6 years ago
minqiyang 49710960ef Revert tensor_util.cu
6 years ago
minqiyang fcc0452c8b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
6 years ago
Tao Luo 381bea0a16 fix test_analysis_predictor
6 years ago
minqiyang 0c3227a523 Change the origin VLOG level to 10 times
6 years ago
tensor-tang 5e64244f25 add vaddbias jitcode
6 years ago
tensor-tang 5f7956ae59 Merge remote-tracking branch 'ups/develop' into fea/jit/vscal
6 years ago
Xin Pan 59c66532e7 add more logs and comments
6 years ago
dzhwinter 1f4a434302 Merge pull request #14046 from dzhwinter/windows/debug
6 years ago
peizhilin 869487a2b7 Merge remote-tracking branch 'origin/develop' into windows/build
6 years ago
Wojciech Uss 7fd640b882 added additional call to graph_viz_pass
6 years ago
tensor-tang 3d950a812d combine jitcode of vscal
6 years ago
tensor-tang 03e11f3fc9 add vscal jitcode
6 years ago
Qiao Longfei 5b7a9dd7ac Merge pull request #13815 from jacquesqiao/optimize-pyreader
6 years ago
dzhwinter 234a1d9248 Merge remote-tracking branch 'origin/develop' into windows/debug
6 years ago
chengduo a270fdf2db Fix SelectedRowsAdd bug (#14309)
6 years ago
Qiao Longfei ce994190ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
6 years ago
tensor-tang 2f0a379af7 Merge pull request #14307 from tensor-tang/fix/mac
6 years ago
Zeng Jinle b2af213009 Merge pull request #14292 from sneaxiy/delete_buggy_selected_rows_functor
6 years ago
tensor-tang 161ba9c9d1 fix mac
6 years ago
Sylwester Fraczek f395075efc rebased and stuff broke
6 years ago
tensor-tang e8642c3c1f Merge pull request #14265 from tensor-tang/fea/jit/vadd
6 years ago
Sylwester Fraczek a60957f386 addd test_analyzer_mobilenet
6 years ago
dengkaipeng 8b47d90f5d add 'actual_shape' attribute. test=develop
6 years ago
tensor-tang 382307b943 refine code
6 years ago
tensor-tang 3319072858 fix jit kernel test on mac
6 years ago
tensor-tang 44cb70c088 Merge remote-tracking branch 'ups/develop' into fix/mac
6 years ago
Yu Yang c774bcbd2d Merge device_context
6 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang c28beb8a3c test(Pe): add dry run tests for pe (#14254)
6 years ago
tensor-tang c9730d33d9 fix run error on mac
6 years ago
Xin Pan 80132933b7 Merge pull request #14281 from luotao1/face
6 years ago
Qiao Longfei e0c8397426 Merge pull request #14257 from jacquesqiao/optimize-pserver-profiler-thread-pool
6 years ago
chengduo ffc866159f hot fix log (#14293)
6 years ago
Zhaolong Xing 65b61db10a Merge pull request #13927 from NHZlX/fix_googlenet_bug_with_rule
6 years ago
tensor-tang 25e070ecc7 Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
barrierye ef8218be22 update docs test=develop
6 years ago
Tao Luo eea36739cc refine test_helper.h
6 years ago
Qiao Longfei 6449faec37 Merge pull request #14259 from jacquesqiao/optimize-thread-pool
6 years ago
sneaxiy 9518bc8d0a delete buggy selected_rows functor
6 years ago
chengduo a9b5d42dd4 Add fp16 backward support (#14202)
6 years ago
Qiao Longfei 3b8dd9ebbd optimize code test=develop
6 years ago
Tao Luo 2b791f1f63 unify analyzer_face_tester to analyzer_resnet50_tester
6 years ago
Qiao Longfei 2921f8a79c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Tao Luo 1ead9318d5 remove unused code in test_helper.h to pass ci
6 years ago
Qiao Longfei 4062f00f2a optimize thread pool code
6 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
6 years ago
dzhwinter deb4af70ef add test
6 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
6 years ago
qingqing01 ce7d9b0799 Exhaustive search for cuDNN conv. (#14043)
6 years ago
Sang Ik Lee f30c1ddb45 Include nGraph build.
6 years ago
tensor-tang cb4083b9fa fix compile error
6 years ago
tensor-tang dd343a4971 Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
Zeng Jinle fcbe84cb50 Merge pull request #14270 from sneaxiy/fix_rmsprop_enforce_bug
6 years ago
Tao Luo 7a2887d212 add analyzer_face_tester
6 years ago
Tao Luo 2ec65ae0db download face_model in CMakeLists.txt
6 years ago
Tao Luo 2f9a5a2e0a add analyzer_face_tester
6 years ago
Xin Pan cb2d33a851 resolve conflict
6 years ago
nhzlx 5700fafd0f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
6 years ago
nhzlx 86b99ac953 fix comments and fix bug
6 years ago
tensor-tang e6cfdf6c74 Merge pull request #14274 from tensor-tang/fix/jit
6 years ago
peizhilin a37918c31f fix python package issue
6 years ago
Xin Pan 25123a3b7e add tests
6 years ago
Xin Pan 8c11d3fed6 clean up
6 years ago
Xin Pan 0a89650507 fix more tests
6 years ago
Xin Pan a3b27e3237 fix
6 years ago
Xin Pan f25eb9a71d fix some tests.
6 years ago
Xin Pan adf5615e54 clean kGraphOp
6 years ago
Xin Pan fb576cb5cb allow to compare type
6 years ago
Xin Pan ead94bfc6c fix destructor
6 years ago
Xin Pan 2e14999942 clean1
6 years ago
Xin Pan 34b401fc6c clean up a global graph attr.
6 years ago
Zeng Jinle 8ac2242b6e Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
6 years ago
tensor-tang b81e1b655e fix jit on mac
6 years ago
sneaxiy 11f032a82e fix rmsprop_op enforce bug
6 years ago
tensor-tang b68ececb73 add vaddrelu jitcode
6 years ago
sneaxiy 8684553633 stream callback support in cuda 10
6 years ago
peizhilin 1f12ba6192 gpu support, fix build issue:
6 years ago
Wu Yi 8fc05e0373 fix cpu build test=develop (#14260)
6 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
6 years ago
tensor-tang bb09e31020 add vadd jitcode
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
Yu Yang ff9e531bd9 style(platform): disable warning when cuda cc not matched (#14029)
6 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Qiao Longfei fe4cd50286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
6 years ago
whs d6a6a13039 Fix build error of affine grid op in mac os. (#14237)
6 years ago
Qiao Longfei ac415c0094 change lock_guard to unique_lock
6 years ago
Qiao Longfei f4a76078d0 optimize thread pool
6 years ago
tensor-tang d55481cfeb Merge pull request #14241 from tensor-tang/refine/jit/vmulcode
6 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
6 years ago
Zeng Jinle 8d930195d9 Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
6 years ago
Wu Yi 306236c2c0 feature/DC asgd (#12722)
6 years ago
dengkaipeng fef2faa709 limit CUDA kernel parallel threads max number to 4096. test=develop
6 years ago
tensor-tang c3cbf0b8ef Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
6 years ago
peizhilin 71d7980f69 fix build issue 1
6 years ago
tensor-tang 6b49ee42c3 Merge pull request #14239 from tensor-tang/fix/avx
6 years ago
tensor-tang ef9c10927d Merge pull request #14233 from tensor-tang/fix/guide
6 years ago
dengkaipeng 34bfae243a Add Interpolate operation. test=develop
6 years ago
sneaxiy 46d4829dd1 fix lod_level share bug in read_op
6 years ago
tensor-tang 8465e7876f auto grow the size and fix test
6 years ago
tensor-tang 9255119fd9 refine jit vmul with all size
6 years ago
tensor-tang a9c1824131 refine jit vmul code supporting multiple of 2
6 years ago
tensor-tang 61fdc38e51 Merge pull request #14206 from tensor-tang/fea/jit/gen
6 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
6 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
6 years ago
peizhilin 9d67c1fb69 cpu build support
6 years ago
barrierye 5e7bb6a9bd update docs test=develop
6 years ago
Xin Pan c2d70fca30 fix to only check block 0
6 years ago
minqiyang e46f03e19d Add TESTING_DEBUG_MODE to support debug info in daily CI test
6 years ago
dzhwinter baf0ff4510 Merge pull request #14020 from dzhwinter/fix/sign_op
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
sneaxiy 7ff320f8cc merge develop
6 years ago
Xin Pan d0459ac8d0 Merge pull request #14223 from panyx0718/fix5
6 years ago
dongzhihong 00cf66964f Merge remote-tracking branch 'origin/develop' into fix/sign_op
6 years ago
Kaipeng Deng daed473d4a Merge pull request #14089 from heavengate/pool_exclude
6 years ago
Kaipeng Deng 64f3e3ed8f Merge pull request #14069 from heavengate/grid_sampler
6 years ago
Xin Pan aaeedd0ff3 make it warn
6 years ago
Zeng Jinle b316437a50 Merge pull request #14087 from sneaxiy/add_use_cudnn_in_softmax_with_xe
6 years ago
Xin Pan ddd2225b56 add more debug info.
6 years ago
sneaxiy bbc818a5a1 test=develop
6 years ago
sneaxiy 366ebb93f7 test=develop
6 years ago
sneaxiy 203027ca86 test=develop
6 years ago
Tao Luo d2a56f7909 Merge pull request #14159 from sfraczek/sfraczek/depthwise-conv-mkldnn-pass
6 years ago
dzhwinter cc02353d10 test=develop
6 years ago
dzhwinter eb2f7ed21b refine tests. test=develop
6 years ago
Jiabin Yang 9f65b616b2 Merge branch 'develop' into add_reorg_op
6 years ago
Xin Pan 08d22cf7e1 Merge pull request #14091 from panyx0718/fix2
6 years ago
Wu Yi 91b2851cdc enable pyreader use pin memory (#14066)
6 years ago
Kaipeng Deng 0b29078201 Merge branch 'develop' into grid_sampler
6 years ago
whs 0c319e0b35 Add affine grid generator op (#12238)
6 years ago
sneaxiy cf1944af2a test=develop
6 years ago
tangwei12 d325e668b8 [1.1] Load vars on PSERVER (#14037)
6 years ago
dengkaipeng e99da0b583 api change: create_variable_for_type_inference. test=develop
6 years ago
Tao Luo 2eaa291e91 Merge pull request #14197 from luotao1/remove_with_fast_bundle_test
6 years ago
Yan Chunwei f76fee644c fix graph pattern detector (#14186)
6 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
6 years ago
chengduo e1742050ea fix merge lod_tensor bug (#14199)
6 years ago
dzhwinter 0a180584e6 clean cmake. test=develop
6 years ago
tensor-tang 85bcb286f5 refine vmul jitcode
6 years ago
tensor-tang a764e900a5 Merge remote-tracking branch 'ups/develop' into fea/jit/gen
6 years ago
tensor-tang a3377f7b0a refine jitcode and add vmul jitcode implementation
6 years ago
dzhwinter 1ace55c8ee merge develop branch
6 years ago
dzhwinter 9da7b33515 details
6 years ago
dengkaipeng df4a3544aa nearest neighbor interp add cuda kernel. test=develop
6 years ago
Xin Pan 913b569903 Merge pull request #14151 from panyx0718/fix
6 years ago
sneaxiy c7305fbe2f buffered_allocator: add unittest and fix bug
6 years ago
dengkaipeng da8ee1fbaa fix API.spec not add defaults. test=develop
6 years ago
chengduo 2ccf77d1c1 Refine GetTensorFromVar (#14160)
6 years ago
Tao Luo 5ac575cf62 remove unused WITH_FAST_BUNDLE_TEST option
6 years ago
dengkaipeng 9755611938 add unittest for nearest_neighbor_interp_op
6 years ago
dengkaipeng a24691a2a9 add nearest neighbor interpolation operator cpu kernel
6 years ago
sneaxiy e3fc544cf7 merge develop
6 years ago
sneaxiy 2bef0ca346 add buffered_allocator
6 years ago
JiabinYang 8d3c3e048b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Yan Xu d10b8efcc0 Merge pull request #14152 from Yancey1989/add_fused_broadcast_unittest
6 years ago
Yu Yang c21597cf07 fix(PE): use shared_ptr<BlockingQueue> for cross thread communication (#14136)
6 years ago
tensor-tang f3badacd97 Merge remote-tracking branch 'ups/develop' into fea/jit/gen
6 years ago
tensor-tang a53b1b0b1b refine and init jitkernel vmul
6 years ago
tensor-tang 2139b9f677 add jit gencode
6 years ago
Yan Chunwei 06e508ab58 fix simple_on_word2vec random fail (#14171)
6 years ago
Tomasz Patejko 8899d42265 MKLDNN conv residual data: primitive reuse interface used. Reorder done when formats are different
6 years ago
chengduo b73708d20b add int and int64 dtype for gather_op (#14175)
6 years ago
Tomasz Patejko f11934cbe6 MKLDNN conv residual data: residual data is reorder when formats are incorrect
6 years ago
Yan Chunwei 62a0fe0860 fix tensor array bug (#14166)
6 years ago
chengduo ed087f8232 refine op_handle (#14178)
6 years ago
Tao Luo cdf2579d08 Merge pull request #14053 from jczaja/prv-seqpool-max
6 years ago
Kaipeng Deng a3b26e8528 Merge branch 'develop' into grid_sampler
6 years ago
dengkaipeng 7333fe8e55 add math formula for exclusive/inclusive mode in avg pool. test=develop
6 years ago
Xin Pan 35915fc543 Merge pull request #14147 from luotao1/remove_with_inference
6 years ago
Yu Yang 90d9e5aee8 feat(platform): lazy initialization of devicecontext in pool (#14067)
6 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
6 years ago
Xin Pan eb7ed1b720 Merge pull request #13897 from gmcather/develop
6 years ago
Sylwester Fraczek 4e2aaf01bc add depthwise conv mkldnn pass
6 years ago
barrierye fc23cc9d30 update paddle/fluid/API.spec
6 years ago
Yancey1989 6bfa6a0a33 add fused broadcast op unit test, test=develop
6 years ago
Xin Pan e2db0b9bf3 add a small test to verify tensor type
6 years ago
dzhwinter bf2e4cb188 cleard. staged
6 years ago