qingqing01
3d567864bb
Merge pull request #5247 from qingqing01/memory_alignment
...
Allocate aligned memory by posix_memalign.
7 years ago
Qiao Longfei
2e91c7da2b
memory log level change from 3 to 10 ( #5231 )
7 years ago
dangqingqing
a328ae3b9b
Use posix_memalign to allocate aligned memory, since some SIMD instructions require the alignment of memory accesses.
7 years ago
Yu Yang
2a5edec03e
Add debug logs in scope, meta_cache and memory ( #5170 )
...
* Add debug logs in scope, meta_cache and memory
* Add missing deps
7 years ago
Yu Yang
efc2464f6c
Feature/save op ( #5090 )
...
* Init
* Stash
* Polish SaveLoadOp
* Fix CI
* Polish code
* Save GPU Tensor
* Stash
* Fix CI
7 years ago
Yu Yang
064c3695ec
Make global scope not thread-safe ( #4850 )
...
* Make global scope not thread-safe
1. It is no need to make global scope thread-safe, since it will be
invoked in Python main thread.
2. Do not free the global scope when C++ exit. Let the OS free memories,
otherwise, we need to handle the destroy dependencies.
See
https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
* Revert "FIX: Release CPU/GPU memory via deleter"
This reverts commit 8f80f5bc79
.
8 years ago
Luo Tao
871a3f6e76
remove unused PADDLE_ONLY_CPU comment
8 years ago
Yi Wang
880b874b47
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into paddle_only_cpu
8 years ago
Yi Wang
2b204f048b
Rename platform::GetDeviceCount into platform::GetCUDADeviceCount
8 years ago
Yi Wang
4558807c48
Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
8 years ago
Yu Yang
84500f9487
Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
...
By shell command
```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
8 years ago
Yancey
d7db15f3e5
Use StridedMemCpy in Concat/Split Kernel ( #4188 )
...
User StridedMemCpy in Concat/Split Op
8 years ago
Yu Yang
60857f4909
Unify clang-format and add some missing clang-format
8 years ago
Yu Yang
c5fa417c62
Host and device transform API
...
* with unit-tests
* Also complete `memcpy`
8 years ago
liaogang
4de2b8e196
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into environ
8 years ago
liaogang
55437b58b9
Add ENVIRONMENT interface interface
8 years ago
liaogang
017a3818de
Add memory.h for unique_ptr
8 years ago
liaogang
94b58a29d6
Follow comments
8 years ago
liaogang
3f9fe62487
Fix typo error
8 years ago
liaogang
ff5bfc1c89
Google style for header file includes
8 years ago
liaogang
33228cacdd
Fix typo error
8 years ago
liaogang
2f7489fbd5
change use_pinned_memory to true for cpu
8 years ago
liaogang
8f80f5bc79
FIX: Release CPU/GPU memory via deleter
8 years ago
qijun
881c5d00d0
remove device_context deps in memcpy
8 years ago
qijun
f168843e47
fix gpu build error
8 years ago
liaogang
051fe1724f
remove duplicate cpplint
8 years ago
liaogang
b58725bd51
Add cpplint for *.h and cuda *.cu
8 years ago
Helin Wang
cffd1ae4c9
Fix build
8 years ago
qijun
bd8d2d8ba3
fix bug in memory GetGPUBuddyAllocator
8 years ago
gangliao
c54f14d89d
Merge pull request #3095 from gangliao/comment_mem
...
ENH: Add comments for memory and memcpy
8 years ago
liaogang
201e715787
ENH: Add comments for memory and memcpy
8 years ago
liaogang
b4ff2e4394
Remove GPUPlaceGuard
8 years ago
gangliao
13ec0dd8df
Merge pull request #3057 from gangliao/mem_vlog
...
ENH: for clarity, change DLOG to VLOG
8 years ago
liaogang
de8c344bda
ENH: for clarity, change DLOG to VLOG
8 years ago
liaogang
4e94cd757c
FIX: restricting c++ template usage to POD types
8 years ago
Yi Wang
de6f9c4879
Add dependency memory->device_context, because we now use platform::GPUPlaceGuard
8 years ago
Yi Wang
f81caa4e11
Add dependency memory->device_context, because we now use platform::GPUPlaceGuard
8 years ago
Yi Wang
616a236581
Add target memcpy into paddle_memory
8 years ago
Yi Wang
51152adf0b
Fix end of namespace
8 years ago
Yi Wang
858dea8834
Move memory::Copy out from memory.h into memcpy.h
8 years ago
Yi Wang
6cae35b5b3
Fix conflicts
8 years ago
fengjiayi
bf3b8f045e
Retrigger CI
8 years ago
fengjiayi
da07ec1886
Update Tensor and PODDeleter's template parameter
...
1. Change PODDeleter's template parameter 'PlaceType' to 'Place'.
2. Limit PODDeleter and Tensor::mutable_data()'s `T` to POD type.
8 years ago
fengjiayi
6cd94cc73f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_refactor_tensor
8 years ago
Yi Wang
75b53179a2
Merge pull request #2882 from gangliao/cpu_mem
...
Update Memory ReadMe Doc
8 years ago
liaogang
b3115fb01c
Add SetDeviceId in memcpy
8 years ago
liaogang
0897d18a0a
Fix string.h for memcpy
8 years ago
liaogang
00500eeb7d
Add stdlib.h for memcpy
8 years ago
liaogang
ca89bfada3
Fix paddle enforce special cases
8 years ago
liaogang
527c85970b
Fix H2D and D2H order
8 years ago