Commit Graph

230 Commits (24283d9fc2ffd4238e6a382fdc5d1be88283c036)

Author SHA1 Message Date
Qiao Longfei e8fcb71bed
Merge pull request #12620 from jacquesqiao/timeline-support-pure-cpu
7 years ago
qiaolongfei 5a6c3cd9e0 fix profiler dead lock
7 years ago
tensor-tang a50889f523 introduce xbyak
7 years ago
qiaolongfei 3f2aa91970 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into timeline-support-pure-cpu
7 years ago
qiaolongfei e008600b08 optimize code
7 years ago
qiaolongfei 7c649e06c3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into timeline-support-pure-cpu
7 years ago
Sylwester Fraczek d74bb6ab9c fix ut for mkldnn 0.15 - added forcing layout NCHW in mkldnn conv tests
7 years ago
qiaolongfei 954d680b40 fix test_parallel_do.py
7 years ago
qiaolongfei 1623f1ba4f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
qiaolongfei 4c5bcd7859 add guard to profiler
7 years ago
Xin Pan caf10b474f make profiler use thread_id from g_thread_id
7 years ago
dzhwinter 6d3da458a7
Fix/float16 style (#12446)
7 years ago
dzhwinter 39ac9e39c2
float16 type support enhance (#12181)
7 years ago
tensor-tang 4f0383f52e fix unknown flag
7 years ago
tensor-tang 9788e5ab87 add flags to control num_threads
7 years ago
tensor-tang 10a1c2bb86 control omp num_threads
7 years ago
typhoonzero 54e9fd3f61 fix cudnn enforce
7 years ago
qiaolongfei a6d30a8607 profiler support cpu
7 years ago
Xin Pan 7781297c70 variants
7 years ago
Tao Luo e568acbee2
Merge pull request #12092 from velconia/add_deps_to_device_ctx
7 years ago
minqiyang 2cc6ca43a0 Add framework_proto to device context deps
7 years ago
Jacek Czaja fbe25ef510 MKLDNN: Extending Conv MKLDNN op to reuse MKLDNN primitives (#11750)
7 years ago
tensor-tang 2e418a5227 fix conflicts
7 years ago
tensor-tang 3df99e72ab Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
7 years ago
dzhwinter 4ed0b62476
Move fluid::framework::InitDevices into fluid::platform (#11757)
7 years ago
dzhwinter 99a99ec7e3
"remove lapack" (#11966)
7 years ago
fengjiayi ce16b40b04
Merge pull request #11891 from JiayiFeng/dev_eof_exp
7 years ago
Yu Yang 037ce12ee4
Merge pull request #11907 from reyoung/feature/use_dev_ctx_for_op
7 years ago
yuyang18 2d0e5592b5
Use std::map for Place <--> DeviceContext
7 years ago
Xin Pan 94cb59ad09 hide utils to legacy
7 years ago
fengjiayi ed4b2475f5 add an unittest
7 years ago
fengjiayi 8553ac6a95 fix unittests
7 years ago
fengjiayi 3fab4f65a4 Add EOFException to represent EOF in C++ reader
7 years ago
Yan Chunwei 28172bbb8e
add debug to replacing enforce with GLOG for debug (#11244)
7 years ago
gongweibao e2b1c5d925
fix code style (#11862)
7 years ago
mozga-intel b8a04c2fa1 Duplicated code was moved to common function
7 years ago
tensor-tang e3a96300bb move SetNumThreads to platform
7 years ago
Tao Luo 2dae8a4631
Merge pull request #11596 from tensor-tang/refine/mklml/dyload
7 years ago
Yi Wang 2625178add
No NCCL on macOS (#11652)
7 years ago
Tao Luo 60647c9aa4
Merge pull request #11519 from jczaja/prv-softmax-mkldnn-grad-operator
7 years ago
chengduo da556ed6d4
enhance ParallelExecutor stable (#11637)
7 years ago
Jacek Czaja 98f3ad3ba1 - MKLDNN Softmax Grad Op
7 years ago
tensor-tang d5fb8fa778 Revert "Merge pull request #11628 from PaddlePaddle/revert-11102-mozga-intel/Sum_mkldnn_layout"
7 years ago
Yu Yang 9b3f48d7e6
Merge pull request #11616 from chengduoZH/fix_parallel_exe
7 years ago
tensor-tang 28a0ef9522 remove usr local lib when dynamic load lib
7 years ago
tensor-tang 90780e22ce
Revert "MKLDNN layout: Support for sum operator"
7 years ago
chengduoZH c99fca5f90 Add No Mutex
7 years ago
tensor-tang 3e73a7a924 add usr local lib to dynamic search path
7 years ago
tensor-tang f503f12925 enable dynamic load mklml lib on fluid
7 years ago
mozga-intel 6512be59ec MKLDNN layout: the code-review changes
7 years ago