Commit Graph

7345 Commits (54bd17fe7b537a20b88e09a39d0e16416d446b41)

Author SHA1 Message Date
Kexin Zhao 4eaa789730 resolve conflict
7 years ago
Tomasz Patejko 72cc64e40e Device blobs are created only in training. Added testing attribute
7 years ago
tensor-tang 7260e3a443
Merge pull request #9214 from jczaja/prv-softmax-mkldnn-operator-PR
7 years ago
Yu Yang 79989c9025 Add SSA builder
7 years ago
Yancey1989 2a4221ac07 split send op to send_vars and send_barrier
7 years ago
Yu Yang 0760aaf440 Shrink batch_norm_grad's inputs
7 years ago
Jacek Czaja 3b95b55f07 - Softmax MKLDNN primitive integration
7 years ago
Yu Yang 64d7a30271 Extract SSAGraph
7 years ago
Yu Yang 8dec4ad7a1 Use int not Place for vars
7 years ago
Yu Yang 3181501013 Rerange code
7 years ago
Yu Yang f28ae6e4b1 Reorganize Code
7 years ago
Yu Yang 5c333e4143 Add dctor for dev_ctx
7 years ago
Yu Yang 15f5f10ed5 AddInput/AddOutput for OpHandle
7 years ago
Yu Yang 5368e50d84 Reorganize code
7 years ago
typhoonzero 1eec926124 updates
7 years ago
Yu Yang fe7ed285d1 Extract NCCLCtxMap
7 years ago
typhoonzero e9d815e32b prepare and create op before run
7 years ago
Kexin Zhao ed2bc194c5
Merge pull request #9176 from kexinzhao/batch_norm_fp16
7 years ago
fengjiayi cd07c0f021
Merge pull request #9259 from JiayiFeng/dev_MultiEpochReader
7 years ago
Yu Yang 6ebc6bf533 ReorganizeCode
7 years ago
Yiqun Liu 7bb4ea9c13
Add an argument in Executor.Run to allow users to choose whether to create and destroy variables every time. (#9242)
7 years ago
Yu Yang a478a11e0b NCCL Guard for bcast
7 years ago
Yu Yang f2685bed81 Clean code
7 years ago
Yu Yang 41ad632341 Add NCCL Group Guard
7 years ago
Yu Yang 99fe83a020 Move nccl helper
7 years ago
Yu Yang 90f980167d Do not wait computation stream
7 years ago
Yu Yang 7ac969b88c Debug
7 years ago
fengjiayi 809530f418 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_MultiEpochReader
7 years ago
fengjiayi 7c041e48f4
Merge pull request #9182 from JiayiFeng/dev_MultipleReader
7 years ago
fengjiayi e4bd63d0e1
Merge pull request #9240 from JiayiFeng/fix_bug_in_recordio
7 years ago
typhoonzero 18461d0935 wip
7 years ago
wanghaoshuang edb4e29ab7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
Kexin Zhao b7801b9fcb small fix
7 years ago
Kexin Zhao 70e7122785 initial commit
7 years ago
Kexin Zhao d60180af39 inital commit
7 years ago
Kexin Zhao c1e9b1e37e
Merge pull request #9231 from kexinzhao/elementwise_add_fp16
7 years ago
Qiao Longfei 37a272e670
add executor.prepare (#9022)
7 years ago
fengjiayi 4286ea6197 Merge branch 'fix_bug_in_recordio' into dev_MultiEpochReader
7 years ago
fengjiayi 0b2f1b3f45 clear stream during Scanner::Reset()
7 years ago
fengjiayi 91b6d60003 Merge branch 'fix_bug_in_recordio' into dev_MultiEpochReader
7 years ago
Yu Yang 599f7a87ba Refine code
7 years ago
Yu Yang 43e54079a8 Debug code
7 years ago
fengjiayi 2532b922dc Add more unittests and fix bugs
7 years ago
Yu Yang e335f01826 Add more logs
7 years ago
Yu Yang 82693e7227 Wait nccl all reduce
7 years ago
Yu Yang eb0a580e78 Add enforce
7 years ago
wanghaoshuang ad63722ed9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
Yu Yang 65bc7d17d5 Add mtx to ncclAllReduce
7 years ago
Yu Yang d42117e742 Set NumThreads
7 years ago
Yu Yang ba227df941 Expose num_threads
7 years ago
Yu Yang 1533bf12df Use event and single thread
7 years ago
Yu Yang 176277b824 Add log
7 years ago
Yu Yang ed7727e8f0 Fix bug in system allocator
7 years ago
Yu Yang 95a0d7c7c1 Illegal memory access
7 years ago
Yu Yang 798e6907b4 Change mem order
7 years ago
fengjiayi f863866471 Add an unitest
7 years ago
Yu Yang 1c2b6100b0 Add
7 years ago
武毅 5008020d19
Merge pull request #9154 from typhoonzero/pserver_parallel
7 years ago
Yu Yang a0494f8e55 Mutex lock wait
7 years ago
Yu Yang 4e43b71377 Add wait log
7 years ago
Yu Yang dbed123382 Debug
7 years ago
Yu Yang e53b6aba63 Use no thread
7 years ago
Yu Yang a8bd7b9809 Add log
7 years ago
fengjiayi 02b7d8bea5 Merge branch 'fix_bug_in_recordio' into dev_MultipleReader
7 years ago
Yu Yang 3c9cea597e Add more log
7 years ago
Yu Yang f8f1a963d9 Add debug code
7 years ago
Yu Yang fbbcedda01 Fix bug
7 years ago
Yu Yang 7643c2cbab Add flag for use event
7 years ago
Yu Yang ca4b3d2532 Use 12 threads
7 years ago
fengjiayi c346a345e0 fix a bug
7 years ago
Yu Yang f251a58e85 Use base class manage events
7 years ago
typhoonzero 3666d7c02f fix num_blocks==2
7 years ago
Yu Yang 1dd216dc3b Wait bcast param
7 years ago
Yu Yang 4185dd48e4 Disable multi-thread
7 years ago
Yu Yang 631aa3d10a Wait all inputs ready
7 years ago
Yu Yang 9b1f4d5d62 After nccl add event
7 years ago
sabreshao e50205e744 CMake refine for HIP support.
7 years ago
fengjiayi a2981f5c50 fix a bug
7 years ago
Yu Yang feb569f8ea Add log
7 years ago
Yang yaming 381c6a026d
Merge pull request #9100 from pkuyym/fix-9049
7 years ago
Kexin Zhao d307b5e4a6 Merge remote-tracking branch 'upstream/develop' into elementwise_add_fp16
7 years ago
typhoonzero 139ae08fdf workable
7 years ago
Kexin Zhao 5271c32d24
Merge pull request #9223 from kexinzhao/dropout_fp16
7 years ago
Yu Yang 260cfe3b86 Stop Wait NCCL Stream
7 years ago
Yu Yang e025e284c6 Exchange wait op
7 years ago
Yu Yang 3238ce0672 Add wait
7 years ago
Yu Yang 8a9de67e17 Remove wait
7 years ago
Yu Yang d2cb3790e9 Wait all evernts
7 years ago
Kexin Zhao 3da094fd7b rearrange test
7 years ago
Yu Yang 4137bb4eda Add wait
7 years ago
fengjiayi 832deee448
Merge pull request #9178 from JiayiFeng/fix_bugs_in_reader
7 years ago
Yu Yang 3da4159f88 Add run iter
7 years ago
Yu Yang d3c82c356e Wait multiple stream
7 years ago
Yu Yang c18c2f6ab0 Sync all computation streams at the end of run
7 years ago
wanghaoshuang e01c770c05 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into average_model
7 years ago
wanghaoshuang d22f4de794 Refine sum_accumulates_op.
7 years ago
yangyaming 2c22552542 Fix some comments and adapt test_machine_translation.py.
7 years ago
fengjiayi 6f7e812bb3 fix bugs
7 years ago
yangyaming 2f2c5f5e60 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-9049
7 years ago
Kexin Zhao 4bf168b274 add fp16 kernel for elementwise add
7 years ago