Commit Graph

19381 Commits (382307b94345916dd4094623e06c5ade7a87e32e)
 

| Author | SHA1 | Message | Date |
|---|---|---|---|
| tensor-tang | 382307b943 | refine code | 7 years ago |
| tensor-tang | 25e070ecc7 | Merge remote-tracking branch 'ups/develop' into fea/jit/vadd | 7 years ago |
| Shan Yi | ea8984c924 | Merge pull request #14267 from PaddlePaddle/fix_recordio_link | 7 years ago |
| tangwei12 | e564eb341f | Fix mkdir conflict in save_inference_model (#14285) | 7 years ago |
| Qiao Longfei | 6449faec37 | Merge pull request #14259 from jacquesqiao/optimize-thread-pool | 7 years ago |
| chengduo | a9b5d42dd4 | Add fp16 backward support (#14202) | 7 years ago |
| Qiao Longfei | 4062f00f2a | optimize thread pool code | 7 years ago |
| Xin Pan | 0953cd3e16 | Merge pull request #14284 from PaddlePaddle/revert-14043-conv_cudnn_cache | 7 years ago |
| qingqing01 | db8c52da5e | Revert " Exhaustive search for cuDNN conv. (#14043)" | 7 years ago |
| qingqing01 | ce7d9b0799 | Exhaustive search for cuDNN conv. (#14043) | 7 years ago |
| tensor-tang | cb4083b9fa | fix compile error | 7 years ago |
| tensor-tang | dd343a4971 | Merge remote-tracking branch 'ups/develop' into fea/jit/vadd | 7 years ago |
| Zeng Jinle | fcbe84cb50 | Merge pull request #14270 from sneaxiy/fix_rmsprop_enforce_bug | 7 years ago |
| chengduo | 45bad7626a | open test_parallel_executor_crf (#14255) | 7 years ago |
| tensor-tang | e6cfdf6c74 | Merge pull request #14274 from tensor-tang/fix/jit | 7 years ago |
| Zeng Jinle | 8ac2242b6e | Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe | 7 years ago |
| Zeng Jinle | 02d757dafc | Merge pull request #14268 from sneaxiy/stream_callback_support_in_cuda10 | 7 years ago |
| Wu Yi | 97f7b181d4 | Merge pull request #14264 from typhoonzero/dist_test_serial | 7 years ago |
| tensor-tang | b81e1b655e | fix jit on mac | 7 years ago |
| sneaxiy | 11f032a82e | fix rmsprop_op enforce bug | 7 years ago |
| tensor-tang | b68ececb73 | add vaddrelu jitcode | 7 years ago |
| sneaxiy | 8684553633 | stream callback support in cuda 10 | 7 years ago |
| Wu Yi | 8fc05e0373 | fix cpu build test=develop (#14260) | 7 years ago |
| Shan Yi | 1fb1a0bc6b | fix_recordio_internal_link | 7 years ago |
| Zhen Wang | 4dbc01841d | Nlp dam (#14248) | 7 years ago |
| tensor-tang | bb09e31020 | add vadd jitcode | 7 years ago |
| typhoonzero | 93c689aa96 | run dist tests in serial test=develop | 7 years ago |
| sneaxiy | faac8a76ce | remove unnecessary codes | 7 years ago |
| Yu Yang | ff9e531bd9 | style(platform): disable warning when cuda cc not matched (#14029) | 7 years ago |
| Qiao Longfei | fe4cd50286 | Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool | 7 years ago |
| whs | d6a6a13039 | Fix build error of affine grid op in mac os. (#14237) | 7 years ago |
| Qiao Longfei | ac415c0094 | change lock_guard to unique_lock | 7 years ago |
| Qiao Longfei | f4a76078d0 | optimize thread pool | 7 years ago |
| tensor-tang | d55481cfeb | Merge pull request #14241 from tensor-tang/refine/jit/vmulcode | 7 years ago |
| tensor-tang | d277a2e6ef | fix avx512f flag (#14041) | 7 years ago |
| Zeng Jinle | 8d930195d9 | Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug | 7 years ago |
| Wu Yi | 306236c2c0 | feature/DC asgd (#12722) | 7 years ago |
| tensor-tang | c3cbf0b8ef | Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder | 7 years ago |
| tensor-tang | 6b49ee42c3 | Merge pull request #14239 from tensor-tang/fix/avx | 7 years ago |
| tensor-tang | ef9c10927d | Merge pull request #14233 from tensor-tang/fix/guide | 7 years ago |
| sneaxiy | 46d4829dd1 | fix lod_level share bug in read_op | 7 years ago |
| tensor-tang | 8465e7876f | auto grow the size and fix test | 7 years ago |
| tensor-tang | 9255119fd9 | refine jit vmul with all size | 7 years ago |
| tensor-tang | a9c1824131 | refine jit vmul code supporting multiple of 2 | 7 years ago |
| tensor-tang | 61fdc38e51 | Merge pull request #14206 from tensor-tang/fea/jit/gen | 7 years ago |
| Xin Pan | d4c771c6bd | Merge pull request #14235 from panyx0718/fix5 | 7 years ago |
| tensor-tang | e09a7c793d | remove the warning log since do not have avx2, avx512 flags | 7 years ago |
| tensor-tang | f524c1b62b | throw error when mismatch cpu version | 7 years ago |
| tensor-tang | 94ab65d591 | disable avx2 and avx512 flag | 7 years ago |
| Xin Pan | c2d70fca30 | fix to only check block 0 | 7 years ago |