Commit Graph

19574 Commits (433fc7c1d44b4f7c9b2ac9cf856b12b06d756b25)
 

Author SHA1 Message Date
Zeng Jinle 02d757dafc
Merge pull request #14268 from sneaxiy/stream_callback_support_in_cuda10
6 years ago
Wu Yi 97f7b181d4
Merge pull request #14264 from typhoonzero/dist_test_serial
6 years ago
tensor-tang b81e1b655e fix jit on mac
6 years ago
sneaxiy 11f032a82e fix rmsprop_op enforce bug
6 years ago
tensor-tang b68ececb73 add vaddrelu jitcode
6 years ago
sneaxiy 8684553633 stream callback support in cuda 10
6 years ago
Wu Yi 8fc05e0373
fix cpu build test=develop (#14260)
6 years ago
Shan Yi 1fb1a0bc6b
fix_recordio_internal_link
6 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
6 years ago
tensor-tang bb09e31020 add vadd jitcode
6 years ago
typhoonzero 93c689aa96 run dist tests in serial test=develop
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
Yu Yang ff9e531bd9
style(platform): disable warning when cuda cc not matched (#14029)
6 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Qiao Longfei fe4cd50286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
6 years ago
whs d6a6a13039
Fix build error of affine grid op in mac os. (#14237)
6 years ago
Qiao Longfei ac415c0094 change lock_guard to unique_lock
6 years ago
Qiao Longfei f4a76078d0 optimize thread pool
6 years ago
tensor-tang d55481cfeb
Merge pull request #14241 from tensor-tang/refine/jit/vmulcode
6 years ago
tensor-tang d277a2e6ef
fix avx512f flag (#14041)
6 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
6 years ago
Zeng Jinle 8d930195d9
Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
6 years ago
Wu Yi 306236c2c0
feature/DC asgd (#12722)
6 years ago
dengkaipeng fef2faa709 limit CUDA kernel parallel threads max number to 4096. test=develop
6 years ago
tensor-tang c3cbf0b8ef
Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
6 years ago
tensor-tang 6b49ee42c3
Merge pull request #14239 from tensor-tang/fix/avx
6 years ago
tensor-tang ef9c10927d
Merge pull request #14233 from tensor-tang/fix/guide
6 years ago
dengkaipeng 34bfae243a Add Interpolate operation. test=develop
6 years ago
sneaxiy 46d4829dd1 fix lod_level share bug in read_op
6 years ago
tensor-tang 8465e7876f auto grow the size and fix test
6 years ago
tensor-tang 9255119fd9 refine jit vmul with all size
6 years ago
tensor-tang a9c1824131 refine jit vmul code supporting multiple of 2
6 years ago
tensor-tang 61fdc38e51
Merge pull request #14206 from tensor-tang/fea/jit/gen
6 years ago
Xin Pan d4c771c6bd
Merge pull request #14235 from panyx0718/fix5
6 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
6 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
6 years ago
tensor-tang 94ab65d591 disable avx2 and avx512 flag
6 years ago
Xin Pan c2d70fca30 fix to only check block 0
6 years ago
minqiyang e46f03e19d Add TESTING_DEBUG_MODE to support debug info in daily CI test
6 years ago
dongzhihong d4e8d7077f Merge remote-tracking branch 'origin/develop' into fix/sequence_pad
6 years ago
dzhwinter baf0ff4510
Merge pull request #14020 from dzhwinter/fix/sign_op
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
sneaxiy 7ff320f8cc merge develop
6 years ago
Zeng Jinle df038b6149
Merge pull request #14222 from sneaxiy/fix_selected_rows_clip_bug
6 years ago
Xin Pan d0459ac8d0
Merge pull request #14223 from panyx0718/fix5
6 years ago
dongzhihong 00cf66964f Merge remote-tracking branch 'origin/develop' into fix/sign_op
6 years ago
Kaipeng Deng daed473d4a
Merge pull request #14089 from heavengate/pool_exclude
6 years ago
Kaipeng Deng 64f3e3ed8f
Merge pull request #14069 from heavengate/grid_sampler
6 years ago
Qiyang Min 8690deb0e4
Merge pull request #14101 from velconia/polish_op_comment
6 years ago
Qiyang Min 67b86d4bb7
Merge pull request #14196 from velconia/fix_version
6 years ago