Commit Graph

19635 Commits (6c7b64cc200f2254d4275b3e360f8562a3387b2c)
 

Author SHA1 Message Date
tensor-tang b81e1b655e fix jit on mac
7 years ago
sneaxiy 11f032a82e fix rmsprop_op enforce bug
7 years ago
tensor-tang b68ececb73 add vaddrelu jitcode
7 years ago
sneaxiy 8684553633 stream callback support in cuda 10
7 years ago
Wu Yi 8fc05e0373
fix cpu build test=develop (#14260)
7 years ago
Shan Yi 1fb1a0bc6b
fix_recordio_internal_link
7 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
7 years ago
tensor-tang bb09e31020 add vadd jitcode
7 years ago
typhoonzero 93c689aa96 run dist tests in serial test=develop
7 years ago
sneaxiy faac8a76ce remove unnecessary codes
7 years ago
Yu Yang ff9e531bd9
style(platform): disable warning when cuda cc not matched (#14029)
7 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
7 years ago
Qiao Longfei fe4cd50286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
7 years ago
whs d6a6a13039
Fix build error of affine grid op in mac os. (#14237)
7 years ago
Qiao Longfei ac415c0094 change lock_guard to unique_lock
7 years ago
Qiao Longfei f4a76078d0 optimize thread pool
7 years ago
tensor-tang d55481cfeb
Merge pull request #14241 from tensor-tang/refine/jit/vmulcode
7 years ago
tensor-tang d277a2e6ef
fix avx512f flag (#14041)
7 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
7 years ago
Zeng Jinle 8d930195d9
Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
7 years ago
Wu Yi 306236c2c0
feature/DC asgd (#12722)
7 years ago
dengkaipeng fef2faa709 limit CUDA kernel parallel threads max number to 4096. test=develop
7 years ago
tensor-tang c3cbf0b8ef
Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
7 years ago
tensor-tang 6b49ee42c3
Merge pull request #14239 from tensor-tang/fix/avx
7 years ago
barrierye 19b68de79f submit again test=develop
7 years ago
tensor-tang ef9c10927d
Merge pull request #14233 from tensor-tang/fix/guide
7 years ago
dengkaipeng 34bfae243a Add Interpolate operation. test=develop
7 years ago
sneaxiy 46d4829dd1 fix lod_level share bug in read_op
7 years ago
tensor-tang 8465e7876f auto grow the size and fix test
7 years ago
tensor-tang 9255119fd9 refine jit vmul with all size
7 years ago
tensor-tang a9c1824131 refine jit vmul code supporting multiple of 2
7 years ago
tensor-tang 61fdc38e51
Merge pull request #14206 from tensor-tang/fea/jit/gen
7 years ago
Xin Pan d4c771c6bd
Merge pull request #14235 from panyx0718/fix5
7 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
7 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
7 years ago
tensor-tang 94ab65d591 disable avx2 and avx512 flag
7 years ago
barrierye 5e7bb6a9bd update docs test=develop
7 years ago
Xin Pan c2d70fca30 fix to only check block 0
7 years ago
minqiyang e46f03e19d Add TESTING_DEBUG_MODE to support debug info in daily CI test
7 years ago
dongzhihong d4e8d7077f Merge remote-tracking branch 'origin/develop' into fix/sequence_pad
7 years ago
dzhwinter baf0ff4510
Merge pull request #14020 from dzhwinter/fix/sign_op
7 years ago
barrierye 316e020a11 Submit PR again test=develop
7 years ago
barrierye 162cf75c88 Submit PR again test=develop
7 years ago
dzhwinter 60f70b174d test=develop
7 years ago
sneaxiy 7ff320f8cc merge develop
7 years ago
Zeng Jinle df038b6149
Merge pull request #14222 from sneaxiy/fix_selected_rows_clip_bug
7 years ago
Xin Pan d0459ac8d0
Merge pull request #14223 from panyx0718/fix5
7 years ago
dongzhihong 00cf66964f Merge remote-tracking branch 'origin/develop' into fix/sign_op
7 years ago
Kaipeng Deng daed473d4a
Merge pull request #14089 from heavengate/pool_exclude
7 years ago
Kaipeng Deng 64f3e3ed8f
Merge pull request #14069 from heavengate/grid_sampler
7 years ago