Commit Graph

11683 Commits (44ecf9a4816222df9fb673aa4ab9d4f74cb4acd3)

Author SHA1 Message Date
Xin Pan a3b27e3237 fix
6 years ago
Xin Pan f25eb9a71d fix some tests.
6 years ago
Xin Pan adf5615e54 clean kGraphOp
6 years ago
Xin Pan fb576cb5cb allow to compare type
6 years ago
Xin Pan ead94bfc6c fix destructor
6 years ago
Xin Pan 2e14999942 clean1
6 years ago
Xin Pan 34b401fc6c clean up a global graph attr.
6 years ago
Zeng Jinle 8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
6 years ago
tensor-tang b81e1b655e fix jit on mac
6 years ago
sneaxiy 11f032a82e fix rmsprop_op enforce bug
6 years ago
tensor-tang b68ececb73 add vaddrelu jitcode
6 years ago
sneaxiy 8684553633 stream callback support in cuda 10
6 years ago
Wu Yi 8fc05e0373
fix cpu build test=develop (#14260)
6 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
6 years ago
tensor-tang bb09e31020 add vadd jitcode
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
Yu Yang ff9e531bd9
style(platform): disable warning when cuda cc not matched (#14029)
6 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Qiao Longfei fe4cd50286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
6 years ago
whs d6a6a13039
Fix build error of affine grid op in mac os. (#14237)
6 years ago
Qiao Longfei ac415c0094 change lock_guard to unique_lock
6 years ago
Qiao Longfei f4a76078d0 optimize thread pool
6 years ago
tensor-tang d55481cfeb
Merge pull request #14241 from tensor-tang/refine/jit/vmulcode
6 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
6 years ago
Zeng Jinle 8d930195d9
Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
6 years ago
Wu Yi 306236c2c0
feature/DC asgd (#12722)
6 years ago
dengkaipeng fef2faa709 limit CUDA kernel parallel threads max number to 4096. test=develop
6 years ago
tensor-tang c3cbf0b8ef
Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
6 years ago
tensor-tang 6b49ee42c3
Merge pull request #14239 from tensor-tang/fix/avx
6 years ago
tensor-tang ef9c10927d
Merge pull request #14233 from tensor-tang/fix/guide
6 years ago
dengkaipeng 34bfae243a Add Interpolate operation. test=develop
6 years ago
sneaxiy 46d4829dd1 fix lod_level share bug in read_op
6 years ago
tensor-tang 8465e7876f auto grow the size and fix test
6 years ago
tensor-tang 9255119fd9 refine jit vmul with all size
6 years ago
tensor-tang a9c1824131 refine jit vmul code supporting multiple of 2
6 years ago
tensor-tang 61fdc38e51
Merge pull request #14206 from tensor-tang/fea/jit/gen
6 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
6 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
6 years ago
barrierye 5e7bb6a9bd update docs test=develop
6 years ago
Xin Pan c2d70fca30 fix to only check block 0
6 years ago
minqiyang e46f03e19d Add TESTING_DEBUG_MODE to support debug info in daily CI test
6 years ago
dzhwinter baf0ff4510
Merge pull request #14020 from dzhwinter/fix/sign_op
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
sneaxiy 7ff320f8cc merge develop
6 years ago
Xin Pan d0459ac8d0
Merge pull request #14223 from panyx0718/fix5
6 years ago
dongzhihong 00cf66964f Merge remote-tracking branch 'origin/develop' into fix/sign_op
6 years ago
Kaipeng Deng daed473d4a
Merge pull request #14089 from heavengate/pool_exclude
6 years ago
Kaipeng Deng 64f3e3ed8f
Merge pull request #14069 from heavengate/grid_sampler
6 years ago
Xin Pan aaeedd0ff3 make it warn
6 years ago
Zeng Jinle b316437a50
Merge pull request #14087 from sneaxiy/add_use_cudnn_in_softmax_with_xe
6 years ago
Xin Pan ddd2225b56 add more debug info.
6 years ago
sneaxiy bbc818a5a1 test=develop
6 years ago
sneaxiy 366ebb93f7 test=develop
6 years ago
sneaxiy 203027ca86 test=develop
6 years ago
Tao Luo d2a56f7909
Merge pull request #14159 from sfraczek/sfraczek/depthwise-conv-mkldnn-pass
6 years ago
dzhwinter cc02353d10 test=develop
6 years ago
dzhwinter eb2f7ed21b refine tests. test=develop
6 years ago
Jiabin Yang 9f65b616b2
Merge branch 'develop' into add_reorg_op
6 years ago
Xin Pan 08d22cf7e1
Merge pull request #14091 from panyx0718/fix2
6 years ago
Wu Yi 91b2851cdc
enable pyreader use pin memory (#14066)
6 years ago
Kaipeng Deng 0b29078201
Merge branch 'develop' into grid_sampler
6 years ago
whs 0c319e0b35
Add affine grid generator op (#12238)
6 years ago
sneaxiy cf1944af2a test=develop
6 years ago
tangwei12 d325e668b8
[1.1] Load vars on PSERVER (#14037)
6 years ago
dengkaipeng e99da0b583 api change: create_variable_for_type_inference. test=develop
6 years ago
Tao Luo 2eaa291e91
Merge pull request #14197 from luotao1/remove_with_fast_bundle_test
6 years ago
Yan Chunwei f76fee644c
fix graph pattern detector (#14186)
6 years ago
Tao Luo fe8f178582 fix word2vec related inference unit-tests (#14203)
6 years ago
chengduo e1742050ea fix merge lod_tensor bug (#14199)
6 years ago
dzhwinter 0a180584e6 clean cmake. test=develop
6 years ago
tensor-tang 85bcb286f5 refine vmul jitcode
6 years ago
tensor-tang a764e900a5 Merge remote-tracking branch 'ups/develop' into fea/jit/gen
6 years ago
tensor-tang a3377f7b0a refine jitcode and add vmul jitcode implementation
6 years ago
dzhwinter 1ace55c8ee merge develop branch
6 years ago
dzhwinter 9da7b33515 details
6 years ago
dengkaipeng df4a3544aa nearest neighbor interp add cuda kernel. test=develop
6 years ago
Xin Pan 913b569903
Merge pull request #14151 from panyx0718/fix
6 years ago
dengkaipeng da8ee1fbaa fix API.spec not add defaults. test=develop
6 years ago
chengduo 2ccf77d1c1
Refine GetTensorFromVar (#14160)
6 years ago
Tao Luo 5ac575cf62 remove unused WITH_FAST_BUNDLE_TEST option
6 years ago
dengkaipeng 9755611938 add unittest for nearest_neighbor_interp_op
6 years ago
dengkaipeng a24691a2a9 add nearest neighbor interpolation operator cpu kernel
6 years ago
JiabinYang 8d3c3e048b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Yan Xu d10b8efcc0
Merge pull request #14152 from Yancey1989/add_fused_broadcast_unittest
6 years ago
Yu Yang c21597cf07
fix(PE): use shared_ptr<BlockingQueue> for cross thread communication (#14136)
6 years ago
tensor-tang f3badacd97 Merge remote-tracking branch 'ups/develop' into fea/jit/gen
6 years ago
tensor-tang a53b1b0b1b refine and init jitkernel vmul
6 years ago
tensor-tang 2139b9f677 add jit gencode
6 years ago
Yan Chunwei 06e508ab58
fix simple_on_word2vec random fail (#14171)
6 years ago
Tomasz Patejko 8899d42265 MKLDNN conv residual data: primitive reuse interface used. Reorder done when formats are different
6 years ago
chengduo b73708d20b
add int and int64 dtype for gather_op (#14175)
6 years ago
Tomasz Patejko f11934cbe6 MKLDNN conv residual data: residual data is reorder when formats are incorrect
6 years ago
Yan Chunwei 62a0fe0860
fix tensor array bug (#14166)
6 years ago
chengduo ed087f8232
refine op_handle (#14178)
6 years ago
Tao Luo cdf2579d08
Merge pull request #14053 from jczaja/prv-seqpool-max
6 years ago
Kaipeng Deng a3b26e8528
Merge branch 'develop' into grid_sampler
6 years ago
dengkaipeng 7333fe8e55 add math formula for exclusive/inclusive mode in avg pool. test=develop
6 years ago
Xin Pan 35915fc543
Merge pull request #14147 from luotao1/remove_with_inference
6 years ago
Yu Yang 90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool (#14067)
6 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
6 years ago
Xin Pan eb7ed1b720
Merge pull request #13897 from gmcather/develop
6 years ago
Sylwester Fraczek 4e2aaf01bc add depthwise conv mkldnn pass
6 years ago
barrierye fc23cc9d30 update paddle/fluid/API.spec
6 years ago
Yancey1989 6bfa6a0a33 add fused broadcast op unit test, test=develop
6 years ago
Xin Pan e2db0b9bf3 add a small test to verify tensor type
6 years ago
dzhwinter bf2e4cb188 cleard. staged
6 years ago
Yan Chunwei 70ce6dcd67
fix api_impl ci error (#14140)
6 years ago
Xin Pan eb37ed4c16
Merge pull request #14141 from JiabinYang/fix_inference_model_latest
6 years ago
Xin Pan a943134a97 fix a few more tests
6 years ago
chengduo 2f639113ee
Fix sum_op's GetExpectedKernelType (#14112)
6 years ago
Xin Pan 5839e3236b add program check
6 years ago
gmcather ba22624d7e position encoding && log loss
6 years ago
Tao Luo 3a96d41d72 remove with_inference option
6 years ago
sneaxiy 2494ca83ab test=develop
6 years ago
dzhwinter ebfe5a02b3 merge develop branch
6 years ago
JiabinYang 7c45e77c41 test=develop
6 years ago
barrierye b5f78ce42d update paddle/fluid/API.spec
6 years ago
qingqing01 cb27a9219d
Merge pull request #13971 from sefira/FasterOpDoc
6 years ago
sneaxiy 5e5d2223a1 test=develop
6 years ago
tensor-tang 3c957af139
Merge pull request #14080 from tensor-tang/refine/jit/crf2
6 years ago
Xin Pan aa87a989ec
Merge pull request #14119 from Superjomn/fix/api-impl-tester
6 years ago
barrierye 5f3acac9b3 update paddle/fluid/API.spec
6 years ago
Xin Pan 9ef19d4919
Merge pull request #14106 from luotao1/fix_cmake_warning
6 years ago
sneaxiy f2eed667c0 test=develop
6 years ago
Xin Pan 16dfedb8b8
Merge pull request #14103 from jacquesqiao/cpu-for-1.1-merge-with-shape
6 years ago
sneaxiy cef8cc81db merge develop
6 years ago
Jacek Czaja 458b16f42a Rebase of seqpool-max optimization
6 years ago
superjomn 5f7fda0b07 disable some tests
6 years ago
dengkaipeng ff6329bd5f fix some inappropriate expressions in api doc for grid_sampler. test=develop
6 years ago
Tao Luo d3534d2b14 refine warning message
6 years ago
Xin Pan 177720a737
Merge pull request #14116 from chengduoZH/release/1.1.0
6 years ago
chengduozh acec4cb8ca [1.1]fix op_role value
6 years ago
barrierye 73671379cd update paddle/fluid/API.spec
6 years ago
dengkaipeng 8f1e398824 move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop
6 years ago
dengkaipeng 593e1b18d7 fix some bugs and add some doc for GridSampleOp
6 years ago
dengkaipeng 0bb0e0c10f add Grid Sampler Operator for STN.
6 years ago
Qiao Longfei 3d4e050802 fix compile, optimize code test=develop
6 years ago
Qiao Longfei d26ff8cb2d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
JiabinYang e0a89503f8 test=develop
6 years ago
JiabinYang 0e3038680b test=develop
6 years ago
Tao Luo 79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
6 years ago
Wu Yi 26200f2e42
[1.1] [project] train imagenet using large batch size (#13766)
6 years ago
sneaxiy 2414f92f54 test=develop
6 years ago
barrierye 8c1e304307 merge nn.py
6 years ago
sneaxiy 45559d042c move to pass
6 years ago
dengkaipeng c93e044ae0 add inclusive/exclusive mode in PoolOp avg pool type
6 years ago
JiabinYang 9a74c4489f test=develop
6 years ago
barrierye 9dc28179a4 add similarity_focus op
6 years ago
Qiao Longfei 7cd2417fe2 Merge branch 'develop' into cpu-for-1.1-merge-with-shape
6 years ago
Xin Pan 0a80f06ec4
Merge pull request #14086 from panyx0718/fix6
6 years ago