Commit Graph

11824 Commits (ee0fd78c811bb2cf726d206a7433d3804f8570c1)

Author SHA1 Message Date
tensor-tang 3319072858 fix jit kernel test on mac
6 years ago
tensor-tang 44cb70c088 Merge remote-tracking branch 'ups/develop' into fix/mac
6 years ago
Yu Yang c28beb8a3c
test(Pe): add dry run tests for pe (#14254)
6 years ago
tensor-tang c9730d33d9 fix run error on mac
6 years ago
Xin Pan 80132933b7
Merge pull request #14281 from luotao1/face
6 years ago
Qiao Longfei e0c8397426
Merge pull request #14257 from jacquesqiao/optimize-pserver-profiler-thread-pool
6 years ago
chengduo ffc866159f
hot fix log (#14293)
6 years ago
Zhaolong Xing 65b61db10a
Merge pull request #13927 from NHZlX/fix_googlenet_bug_with_rule
6 years ago
tensor-tang 25e070ecc7 Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
barrierye ef8218be22 update docs test=develop
6 years ago
Tao Luo eea36739cc refine test_helper.h
6 years ago
Qiao Longfei 6449faec37
Merge pull request #14259 from jacquesqiao/optimize-thread-pool
6 years ago
sneaxiy 9518bc8d0a delete buggy selected_rows functor
6 years ago
chengduo a9b5d42dd4
Add fp16 backward support (#14202)
6 years ago
Qiao Longfei 3b8dd9ebbd optimize code test=develop
6 years ago
Tao Luo 2b791f1f63 unify analyzer_face_tester to analyzer_resnet50_tester
6 years ago
Qiao Longfei 2921f8a79c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Tao Luo 1ead9318d5 remove unused code in test_helper.h to pass ci
6 years ago
Qiao Longfei 4062f00f2a optimize thread pool code
6 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
6 years ago
dzhwinter deb4af70ef add test
6 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
6 years ago
qingqing01 ce7d9b0799
Exhaustive search for cuDNN conv. (#14043)
6 years ago
Sang Ik Lee f30c1ddb45 Include nGraph build.
6 years ago
tensor-tang cb4083b9fa fix compile error
6 years ago
tensor-tang dd343a4971 Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
Zeng Jinle fcbe84cb50
Merge pull request #14270 from sneaxiy/fix_rmsprop_enforce_bug
6 years ago
Tao Luo 7a2887d212 add analyzer_face_tester
6 years ago
Tao Luo 2ec65ae0db download face_model in CMakeLists.txt
6 years ago
Tao Luo 2f9a5a2e0a add analyzer_face_tester
6 years ago
Xin Pan cb2d33a851 resolve conflict
6 years ago
nhzlx 5700fafd0f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
6 years ago
nhzlx 86b99ac953 fix comments and fix bug
6 years ago
tensor-tang e6cfdf6c74
Merge pull request #14274 from tensor-tang/fix/jit
6 years ago
peizhilin a37918c31f fix python package issue
6 years ago
Xin Pan 25123a3b7e add tests
6 years ago
Xin Pan 8c11d3fed6 clean up
6 years ago
Xin Pan 0a89650507 fix more tests
6 years ago
Xin Pan a3b27e3237 fix
6 years ago
Xin Pan f25eb9a71d fix some tests.
6 years ago
Xin Pan adf5615e54 clean kGraphOp
6 years ago
Xin Pan fb576cb5cb allow to compare type
6 years ago
Xin Pan ead94bfc6c fix destructor
6 years ago
Xin Pan 2e14999942 clean1
6 years ago
Xin Pan 34b401fc6c clean up a global graph attr.
6 years ago
Zeng Jinle 8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
6 years ago
tensor-tang b81e1b655e fix jit on mac
6 years ago
sneaxiy 11f032a82e fix rmsprop_op enforce bug
6 years ago
tensor-tang b68ececb73 add vaddrelu jitcode
6 years ago
sneaxiy 8684553633 stream callback support in cuda 10
6 years ago
peizhilin 1f12ba6192 gpu support, fix build issue:
6 years ago
Wu Yi 8fc05e0373
fix cpu build test=develop (#14260)
6 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
6 years ago
tensor-tang bb09e31020 add vadd jitcode
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
Yu Yang ff9e531bd9
style(platform): disable warning when cuda cc not matched (#14029)
6 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Qiao Longfei fe4cd50286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
6 years ago
whs d6a6a13039
Fix build error of affine grid op in mac os. (#14237)
6 years ago
Qiao Longfei ac415c0094 change lock_guard to unique_lock
6 years ago
Qiao Longfei f4a76078d0 optimize thread pool
6 years ago
tensor-tang d55481cfeb
Merge pull request #14241 from tensor-tang/refine/jit/vmulcode
6 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
6 years ago
Zeng Jinle 8d930195d9
Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
6 years ago
Wu Yi 306236c2c0
feature/DC asgd (#12722)
6 years ago
dengkaipeng fef2faa709 limit CUDA kernel parallel threads max number to 4096. test=develop
6 years ago
tensor-tang c3cbf0b8ef
Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
6 years ago
peizhilin 71d7980f69 fix build issue 1
6 years ago
tensor-tang 6b49ee42c3
Merge pull request #14239 from tensor-tang/fix/avx
6 years ago
tensor-tang ef9c10927d
Merge pull request #14233 from tensor-tang/fix/guide
6 years ago
dengkaipeng 34bfae243a Add Interpolate operation. test=develop
6 years ago
sneaxiy 46d4829dd1 fix lod_level share bug in read_op
6 years ago
tensor-tang 8465e7876f auto grow the size and fix test
6 years ago
tensor-tang 9255119fd9 refine jit vmul with all size
6 years ago
tensor-tang a9c1824131 refine jit vmul code supporting multiple of 2
6 years ago
tensor-tang 61fdc38e51
Merge pull request #14206 from tensor-tang/fea/jit/gen
6 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
6 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
6 years ago
peizhilin 9d67c1fb69 cpu build support
6 years ago
barrierye 5e7bb6a9bd update docs test=develop
6 years ago
Xin Pan c2d70fca30 fix to only check block 0
6 years ago
minqiyang e46f03e19d Add TESTING_DEBUG_MODE to support debug info in daily CI test
6 years ago
dzhwinter baf0ff4510
Merge pull request #14020 from dzhwinter/fix/sign_op
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
sneaxiy 7ff320f8cc merge develop
6 years ago
Xin Pan d0459ac8d0
Merge pull request #14223 from panyx0718/fix5
6 years ago
dongzhihong 00cf66964f Merge remote-tracking branch 'origin/develop' into fix/sign_op
6 years ago
Kaipeng Deng daed473d4a
Merge pull request #14089 from heavengate/pool_exclude
6 years ago
Kaipeng Deng 64f3e3ed8f
Merge pull request #14069 from heavengate/grid_sampler
6 years ago
Xin Pan aaeedd0ff3 make it warn
6 years ago
Zeng Jinle b316437a50
Merge pull request #14087 from sneaxiy/add_use_cudnn_in_softmax_with_xe
6 years ago
Xin Pan ddd2225b56 add more debug info.
6 years ago
sneaxiy bbc818a5a1 test=develop
6 years ago
sneaxiy 366ebb93f7 test=develop
6 years ago
sneaxiy 203027ca86 test=develop
6 years ago
Tao Luo d2a56f7909
Merge pull request #14159 from sfraczek/sfraczek/depthwise-conv-mkldnn-pass
6 years ago
dzhwinter cc02353d10 test=develop
6 years ago
dzhwinter eb2f7ed21b refine tests. test=develop
6 years ago
Jiabin Yang 9f65b616b2
Merge branch 'develop' into add_reorg_op
6 years ago
Xin Pan 08d22cf7e1
Merge pull request #14091 from panyx0718/fix2
6 years ago
Wu Yi 91b2851cdc
enable pyreader use pin memory (#14066)
6 years ago
Kaipeng Deng 0b29078201
Merge branch 'develop' into grid_sampler
6 years ago
whs 0c319e0b35
Add affine grid generator op (#12238)
6 years ago
sneaxiy cf1944af2a test=develop
6 years ago
tangwei12 d325e668b8
[1.1] Load vars on PSERVER (#14037)
6 years ago
dengkaipeng e99da0b583 api change: create_variable_for_type_inference. test=develop
6 years ago
Tao Luo 2eaa291e91
Merge pull request #14197 from luotao1/remove_with_fast_bundle_test
6 years ago
Yan Chunwei f76fee644c
fix graph pattern detector (#14186)
6 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
6 years ago
chengduo e1742050ea fix merge lod_tensor bug (#14199)
6 years ago
dzhwinter 0a180584e6 clean cmake. test=develop
6 years ago
tensor-tang 85bcb286f5 refine vmul jitcode
6 years ago
tensor-tang a764e900a5 Merge remote-tracking branch 'ups/develop' into fea/jit/gen
6 years ago
tensor-tang a3377f7b0a refine jitcode and add vmul jitcode implementation
6 years ago
dzhwinter 1ace55c8ee merge develop branch
6 years ago
dzhwinter 9da7b33515 details
6 years ago
dengkaipeng df4a3544aa nearest neighbor interp add cuda kernel. test=develop
6 years ago
Xin Pan 913b569903
Merge pull request #14151 from panyx0718/fix
6 years ago
dengkaipeng da8ee1fbaa fix API.spec not add defaults. test=develop
6 years ago
chengduo 2ccf77d1c1
Refine GetTensorFromVar (#14160)
6 years ago
Tao Luo 5ac575cf62 remove unused WITH_FAST_BUNDLE_TEST option
6 years ago
dengkaipeng 9755611938 add unittest for nearest_neighbor_interp_op
6 years ago
dengkaipeng a24691a2a9 add nearest neighbor interpolation operator cpu kernel
6 years ago
JiabinYang 8d3c3e048b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Yan Xu d10b8efcc0
Merge pull request #14152 from Yancey1989/add_fused_broadcast_unittest
6 years ago
Yu Yang c21597cf07
fix(PE): use shared_ptr<BlockingQueue> for cross thread communication (#14136)
6 years ago
tensor-tang f3badacd97 Merge remote-tracking branch 'ups/develop' into fea/jit/gen
6 years ago
tensor-tang a53b1b0b1b refine and init jitkernel vmul
6 years ago
tensor-tang 2139b9f677 add jit gencode
6 years ago
Yan Chunwei 06e508ab58
fix simple_on_word2vec random fail (#14171)
6 years ago
Tomasz Patejko 8899d42265 MKLDNN conv residual data: primitive reuse interface used. Reorder done when formats are different
6 years ago
chengduo b73708d20b
add int and int64 dtype for gather_op (#14175)
6 years ago
Tomasz Patejko f11934cbe6 MKLDNN conv residual data: residual data is reorder when formats are incorrect
6 years ago
Yan Chunwei 62a0fe0860
fix tensor array bug (#14166)
6 years ago
chengduo ed087f8232
refine op_handle (#14178)
6 years ago
Tao Luo cdf2579d08
Merge pull request #14053 from jczaja/prv-seqpool-max
6 years ago
Kaipeng Deng a3b26e8528
Merge branch 'develop' into grid_sampler
6 years ago
dengkaipeng 7333fe8e55 add math formula for exclusive/inclusive mode in avg pool. test=develop
6 years ago
Xin Pan 35915fc543
Merge pull request #14147 from luotao1/remove_with_inference
6 years ago
Yu Yang 90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool (#14067)
6 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
6 years ago
Xin Pan eb7ed1b720
Merge pull request #13897 from gmcather/develop
6 years ago
Sylwester Fraczek 4e2aaf01bc add depthwise conv mkldnn pass
6 years ago
barrierye fc23cc9d30 update paddle/fluid/API.spec
6 years ago
Yancey1989 6bfa6a0a33 add fused broadcast op unit test, test=develop
6 years ago
Xin Pan e2db0b9bf3 add a small test to verify tensor type
6 years ago
dzhwinter bf2e4cb188 cleard. staged
6 years ago
Yan Chunwei 70ce6dcd67
fix api_impl ci error (#14140)
6 years ago
Xin Pan eb37ed4c16
Merge pull request #14141 from JiabinYang/fix_inference_model_latest
6 years ago
Xin Pan a943134a97 fix a few more tests
6 years ago
chengduo 2f639113ee
Fix sum_op's GetExpectedKernelType (#14112)
6 years ago
Xin Pan 5839e3236b add program check
6 years ago
gmcather ba22624d7e position encoding && log loss
6 years ago
Tao Luo 3a96d41d72 remove with_inference option
6 years ago
sneaxiy 2494ca83ab test=develop
6 years ago
dzhwinter ebfe5a02b3 merge develop branch
6 years ago
JiabinYang 7c45e77c41 test=develop
6 years ago
barrierye b5f78ce42d update paddle/fluid/API.spec
6 years ago
qingqing01 cb27a9219d
Merge pull request #13971 from sefira/FasterOpDoc
6 years ago
sneaxiy 5e5d2223a1 test=develop
6 years ago
tensor-tang 3c957af139
Merge pull request #14080 from tensor-tang/refine/jit/crf2
6 years ago
Xin Pan aa87a989ec
Merge pull request #14119 from Superjomn/fix/api-impl-tester
6 years ago
barrierye 5f3acac9b3 update paddle/fluid/API.spec
6 years ago
Xin Pan 9ef19d4919
Merge pull request #14106 from luotao1/fix_cmake_warning
6 years ago
sneaxiy f2eed667c0 test=develop
6 years ago
Xin Pan 16dfedb8b8
Merge pull request #14103 from jacquesqiao/cpu-for-1.1-merge-with-shape
6 years ago
sneaxiy cef8cc81db merge develop
6 years ago
Jacek Czaja 458b16f42a Rebase of seqpool-max optimization
6 years ago
superjomn 5f7fda0b07 disable some tests
6 years ago
dengkaipeng ff6329bd5f fix some inappropriate expressions in api doc for grid_sampler. test=develop
6 years ago
Tao Luo d3534d2b14 refine warning message
6 years ago
Xin Pan 177720a737
Merge pull request #14116 from chengduoZH/release/1.1.0
6 years ago
chengduozh acec4cb8ca [1.1]fix op_role value
6 years ago
barrierye 73671379cd update paddle/fluid/API.spec
6 years ago
dengkaipeng 8f1e398824 move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop
6 years ago
dengkaipeng 593e1b18d7 fix some bugs and add some doc for GridSampleOp
6 years ago
dengkaipeng 0bb0e0c10f add Grid Sampler Operator for STN.
6 years ago
Qiao Longfei 3d4e050802 fix compile, optimize code test=develop
6 years ago
Qiao Longfei d26ff8cb2d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
JiabinYang e0a89503f8 test=develop
6 years ago
JiabinYang 0e3038680b test=develop
6 years ago
Tao Luo 79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
6 years ago
Wu Yi 26200f2e42
[1.1] [project] train imagenet using large batch size (#13766)
6 years ago
sneaxiy 2414f92f54 test=develop
6 years ago
barrierye 8c1e304307 merge nn.py
6 years ago
sneaxiy 45559d042c move to pass
6 years ago
dengkaipeng c93e044ae0 add inclusive/exclusive mode in PoolOp avg pool type
6 years ago
JiabinYang 9a74c4489f test=develop
6 years ago
barrierye 9dc28179a4 add similarity_focus op
6 years ago
Qiao Longfei 7cd2417fe2 Merge branch 'develop' into cpu-for-1.1-merge-with-shape
6 years ago
Xin Pan 0a80f06ec4
Merge pull request #14086 from panyx0718/fix6
6 years ago
sneaxiy a314a80cdb merge develop
6 years ago
Tao Luo 4928ff32a9 fix cmake warning when ON_INFER=false
6 years ago
dzhwinter c8adc2c6fe cudnn version. staged.
6 years ago
Qiao Longfei 06ffbc4f28 Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
seiriosPlus 06de824ba8 fix shape in floats
6 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
6 years ago
Qiyang Min 33b4920d2d
Merge pull request #14057 from velconia/continue_hash_op
6 years ago
Qiyang Min 209f24a241
Merge pull request #14051 from velconia/accelerate_embedding_grad
6 years ago
minqiyang 2fec8c5d9a Polish code
6 years ago
minqiyang 7f7af5d412 Add xxhash deps to inference demo and trainer demo
6 years ago
Qiao Longfei 7cfc3c4415 Merge branch 'optimize-sum-seq-pooling-op' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei 72aef6b168 sum selected rows check empty
6 years ago
minqiyang fe18adfbaa Add fluid inference support
6 years ago
seiriosPlus c34610f86d Fix lookup table at CPU Reduce strategy, test=develop
6 years ago
Qiao Longfei 641369f92b Merge branch 'dist-table-do-not-init-on-trainer' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei d69c820707 Merge branch 'add-flag-to-control-rpc-thread-num' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei 1ed9ef6d70 Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei f1a3fb041b Merge branch 'fix_lookuptable_in_reduce' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei da61a5b672 Merge branch 'optimizer-prefetch' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
6 years ago
tangwei12 5ce3a32e06
Merge branch 'develop' into optimizer-prefetch
6 years ago
seiriosPlus b6590b05fb submit by tangwei12, test=develop
6 years ago
Wu Yi 9da9b1926b
[1.1] fix graph num hang (#14072)
6 years ago
tangwei12 cb1ccc710b fix shape type in uniform_random_op.cu
6 years ago
Qiao Longfei 575f22711d optimize code
6 years ago
Qiao Longfei 96d5500934 optimize code
6 years ago
Qiao Longfei 748ee35c89 sum op handle empty input update selected_rows_functor.cu
6 years ago
Qiao Longfei dd78b5df93 sum op handle empty input
6 years ago
Qiao Longfei cbe128bbae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
Qiao Longfei f4df0cb1a2 update the type of shape to int64, format code
6 years ago
Qiao Longfei fad42fe7cc broadcast handle not inited parameter
6 years ago
Qiao Longfei 7dcb0dc8c6 update year
6 years ago
Qiao Longfei 68aeb4e7e9 add fake init test in test_dist_transpiler
6 years ago
Tao Luo 5ed3e6f3f6
Merge pull request #14042 from luotao1/remove_unused_code
6 years ago
Qiao Longfei a13c788a04 fix a bug
6 years ago
Zeng Jinle 97d47a7d08
Merge pull request #13913 from sneaxiy/seq_reverse
6 years ago
JiabinYang 6e3615422f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Jiabin Yang a3efba176c
Merge pull request #14085 from jerrywgz/fix_generate_proposals_op
6 years ago
dzhwinter 7141debe38 add cudnn back. staged.
6 years ago
Guo Sheng b9ae1c49f8
Merge pull request #13994 from guoshengCS/add-reshape-reuse-input
6 years ago
Zeng Jinle 60058180cb
Merge pull request #13945 from sneaxiy/unify_mixed_vector_api
6 years ago
Qiao Longfei 0328ffd3ab add fake init op
6 years ago
Xin Pan bcc9126e7b
Merge pull request #14056 from panyx0718/fix
6 years ago
Sylwester Fraczek 2098b42584 review fixes (Teamcity fails)
6 years ago
Tao Luo 961baea16c
Merge pull request #14063 from wojtuss/wojtuss/remove-unused-EnableMKLDNN
6 years ago
Hongyu Liu 379d933ae5
Merge pull request #14036 from phlrain/add_dropout_att_new
6 years ago
tangwei12 d8b697357f update height_sections to int64_t
6 years ago
minqiyang a2820b9899 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
6 years ago
Xin Pan bba0c4a9f2 delete unused codes.
6 years ago
jerrywgz de2f965c9b test=develop
6 years ago
guosheng cc0e23973d Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
6 years ago
tangwei12 318ba99124 revert changes in protobuf.cc and type_defs
6 years ago
tangwei12 aa6dc82f4b revert changes in protobuf.cc and type_defs
6 years ago
dzhwinter 09409bad4d staged. test speed=49ms in 1080.
6 years ago
tensor-tang 64d5b4385e fix crf decode avx512
6 years ago
tensor-tang 21487d78bf add crf decode jit kernel
6 years ago
sneaxiy b1fd62f39e test=develop
6 years ago
guosheng 3cfaeac288 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
6 years ago
sneaxiy 1af3fe8c35 test=develop
6 years ago
Xin Pan d5d09672c8 better fix
6 years ago