Commit Graph

11683 Commits (44ecf9a4816222df9fb673aa4ab9d4f74cb4acd3)

Author SHA1 Message Date
Wojciech Uss 7fd640b882 added additional call to graph_viz_pass
6 years ago
tensor-tang 3d950a812d combine jitcode of vscal
6 years ago
tensor-tang 03e11f3fc9 add vscal jitcode
6 years ago
Qiao Longfei 5b7a9dd7ac Merge pull request #13815 from jacquesqiao/optimize-pyreader
6 years ago
dzhwinter 234a1d9248 Merge remote-tracking branch 'origin/develop' into windows/debug
6 years ago
chengduo a270fdf2db Fix SelectedRowsAdd bug (#14309)
6 years ago
Qiao Longfei ce994190ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
6 years ago
tensor-tang 2f0a379af7 Merge pull request #14307 from tensor-tang/fix/mac
6 years ago
Zeng Jinle b2af213009 Merge pull request #14292 from sneaxiy/delete_buggy_selected_rows_functor
6 years ago
tensor-tang 161ba9c9d1 fix mac
6 years ago
tensor-tang e8642c3c1f Merge pull request #14265 from tensor-tang/fea/jit/vadd
6 years ago
dengkaipeng 8b47d90f5d add 'actual_shape' attribute. test=develop
6 years ago
tensor-tang 382307b943 refine code
6 years ago
tensor-tang 3319072858 fix jit kernel test on mac
6 years ago
tensor-tang 44cb70c088 Merge remote-tracking branch 'ups/develop' into fix/mac
6 years ago
Yu Yang c28beb8a3c test(Pe): add dry run tests for pe (#14254)
6 years ago
tensor-tang c9730d33d9 fix run error on mac
6 years ago
Xin Pan 80132933b7 Merge pull request #14281 from luotao1/face
6 years ago
Qiao Longfei e0c8397426 Merge pull request #14257 from jacquesqiao/optimize-pserver-profiler-thread-pool
6 years ago
chengduo ffc866159f hot fix log (#14293)
6 years ago
Zhaolong Xing 65b61db10a Merge pull request #13927 from NHZlX/fix_googlenet_bug_with_rule
6 years ago
tensor-tang 25e070ecc7 Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
barrierye ef8218be22 update docs test=develop
6 years ago
Tao Luo eea36739cc refine test_helper.h
6 years ago
Qiao Longfei 6449faec37 Merge pull request #14259 from jacquesqiao/optimize-thread-pool
6 years ago
sneaxiy 9518bc8d0a delete buggy selected_rows functor
6 years ago
chengduo a9b5d42dd4 Add fp16 backward support (#14202)
6 years ago
Qiao Longfei 3b8dd9ebbd optimize code test=develop
6 years ago
Tao Luo 2b791f1f63 unify analyzer_face_tester to analyzer_resnet50_tester
6 years ago
Qiao Longfei 2921f8a79c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Tao Luo 1ead9318d5 remove unused code in test_helper.h to pass ci
6 years ago
Qiao Longfei 4062f00f2a optimize thread pool code
6 years ago
dzhwinter 2835e04409 merge develop branch. test=develop
6 years ago
dzhwinter deb4af70ef add test
6 years ago
qingqing01 db8c52da5e Revert " Exhaustive search for cuDNN conv. (#14043)"
6 years ago
qingqing01 ce7d9b0799 Exhaustive search for cuDNN conv. (#14043)
6 years ago
Sang Ik Lee f30c1ddb45 Include nGraph build.
6 years ago
tensor-tang cb4083b9fa fix compile error
6 years ago
tensor-tang dd343a4971 Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
Zeng Jinle fcbe84cb50 Merge pull request #14270 from sneaxiy/fix_rmsprop_enforce_bug
6 years ago
Tao Luo 7a2887d212 add analyzer_face_tester
6 years ago
Tao Luo 2ec65ae0db download face_model in CMakeLists.txt
6 years ago
Tao Luo 2f9a5a2e0a add analyzer_face_tester
6 years ago
Xin Pan cb2d33a851 resolve conflict
6 years ago
nhzlx 5700fafd0f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
6 years ago
nhzlx 86b99ac953 fix comments and fix bug
6 years ago
tensor-tang e6cfdf6c74 Merge pull request #14274 from tensor-tang/fix/jit
6 years ago
Xin Pan 25123a3b7e add tests
6 years ago
Xin Pan 8c11d3fed6 clean up
6 years ago
Xin Pan 0a89650507 fix more tests
6 years ago
Xin Pan a3b27e3237 fix
6 years ago
Xin Pan f25eb9a71d fix some tests.
6 years ago
Xin Pan adf5615e54 clean kGraphOp
6 years ago
Xin Pan fb576cb5cb allow to compare type
6 years ago
Xin Pan ead94bfc6c fix destructor
6 years ago
Xin Pan 2e14999942 clean1
6 years ago
Xin Pan 34b401fc6c clean up a global graph attr.
6 years ago
Zeng Jinle 8ac2242b6e Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
6 years ago
tensor-tang b81e1b655e fix jit on mac
6 years ago
sneaxiy 11f032a82e fix rmsprop_op enforce bug
6 years ago
tensor-tang b68ececb73 add vaddrelu jitcode
6 years ago
sneaxiy 8684553633 stream callback support in cuda 10
6 years ago
Wu Yi 8fc05e0373 fix cpu build test=develop (#14260)
6 years ago
Zhen Wang 4dbc01841d Nlp dam (#14248)
6 years ago
tensor-tang bb09e31020 add vadd jitcode
6 years ago
sneaxiy faac8a76ce remove unnecessary codes
6 years ago
Yu Yang ff9e531bd9 style(platform): disable warning when cuda cc not matched (#14029)
6 years ago
Qiao Longfei 59fbfbfbf7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Qiao Longfei fe4cd50286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
6 years ago
whs d6a6a13039 Fix build error of affine grid op in mac os. (#14237)
6 years ago
Qiao Longfei ac415c0094 change lock_guard to unique_lock
6 years ago
Qiao Longfei f4a76078d0 optimize thread pool
6 years ago
tensor-tang d55481cfeb Merge pull request #14241 from tensor-tang/refine/jit/vmulcode
6 years ago
Qiao Longfei 9e4e9e9b6e clean rpc server profiler
6 years ago
Zeng Jinle 8d930195d9 Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
6 years ago
Wu Yi 306236c2c0 feature/DC asgd (#12722)
6 years ago
dengkaipeng fef2faa709 limit CUDA kernel parallel threads max number to 4096. test=develop
6 years ago
tensor-tang c3cbf0b8ef Merge pull request #14185 from tpatejko/tpatejko/mkldnn-conv-residual-data-reorder
6 years ago
tensor-tang 6b49ee42c3 Merge pull request #14239 from tensor-tang/fix/avx
6 years ago
tensor-tang ef9c10927d Merge pull request #14233 from tensor-tang/fix/guide
6 years ago
dengkaipeng 34bfae243a Add Interpolate operation. test=develop
6 years ago
sneaxiy 46d4829dd1 fix lod_level share bug in read_op
6 years ago
tensor-tang 8465e7876f auto grow the size and fix test
6 years ago
tensor-tang 9255119fd9 refine jit vmul with all size
6 years ago
tensor-tang a9c1824131 refine jit vmul code supporting multiple of 2
6 years ago
tensor-tang 61fdc38e51 Merge pull request #14206 from tensor-tang/fea/jit/gen
6 years ago
tensor-tang e09a7c793d remove the warning log since do not have avx2, avx512 flags
6 years ago
tensor-tang f524c1b62b throw error when mismatch cpu version
6 years ago
barrierye 5e7bb6a9bd update docs test=develop
6 years ago
Xin Pan c2d70fca30 fix to only check block 0
6 years ago
minqiyang e46f03e19d Add TESTING_DEBUG_MODE to support debug info in daily CI test
6 years ago
dzhwinter baf0ff4510 Merge pull request #14020 from dzhwinter/fix/sign_op
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
sneaxiy 7ff320f8cc merge develop
6 years ago
Xin Pan d0459ac8d0 Merge pull request #14223 from panyx0718/fix5
6 years ago
dongzhihong 00cf66964f Merge remote-tracking branch 'origin/develop' into fix/sign_op
6 years ago
Kaipeng Deng daed473d4a Merge pull request #14089 from heavengate/pool_exclude
6 years ago
Kaipeng Deng 64f3e3ed8f Merge pull request #14069 from heavengate/grid_sampler
6 years ago
Xin Pan aaeedd0ff3 make it warn
6 years ago
Zeng Jinle b316437a50 Merge pull request #14087 from sneaxiy/add_use_cudnn_in_softmax_with_xe
6 years ago