Yu Yang
c14f3e8ff2
Merge branch 'develop' into feature/middle_level_net_api
8 years ago
fengjiayi
f85ccdd351
Renew CMakeList dependence
8 years ago
Yu Yang
c2543f5b29
Remove ScopePtr and OperatorPtr
...
* ScopePtr means pointer of scope, but it can be shared or uniqued.
Change it to std::shared_ptr<Scope> to make code better to read.
8 years ago
Yu Yang
1294b3c53e
Expose Net to Python ( #2967 )
...
* Expose Net to Python
* Expose PlainNet to Python, make python can add_op, complete_add_op
* Provide a low level api to manipulate Net
* Unittest for Net::DebugString
8 years ago
Superjom
45072ed2cd
add more details
8 years ago
Yi Wang
de6f9c4879
Add dependency memory->device_context, because we now use platform::GPUPlaceGuard
8 years ago
Yi Wang
f81caa4e11
Add dependency memory->device_context, because we now use platform::GPUPlaceGuard
8 years ago
Yi Wang
a16cd515b4
Merge pull request #3016 from wangkuiyi/memcpy
...
Move Copy out from memory.h into memcpy.h
8 years ago
Yi Wang
616a236581
Add target memcpy into paddle_memory
8 years ago
QI JUN
85dff17985
Merge pull request #3014 from wangkuiyi/formalize_eigen
...
Slight simplification of eigen.h in hope to make it more readable.
8 years ago
Yi Wang
51152adf0b
Fix end of namespace
8 years ago
Yi Wang
858dea8834
Move memory::Copy out from memory.h into memcpy.h
8 years ago
Yi Wang
6cae35b5b3
Fix conflicts
8 years ago
Yi Wang
045470cf00
Slight simplification of eigen.h in hope to make it more readable.
8 years ago
Yi Wang
6129ab42f9
Merge pull request #2964 from Canpio/dev_refactor_tensor
...
Simplify Tensor implementation
8 years ago
Tao Luo
ef28f66f86
Merge pull request #2940 from tensor-tang/develop
...
enable Intel MKLDNN library and Intel MKL small package
8 years ago
Zhaolong Xing
91d2a57a2e
Merge pull request #2776 from NHZlX/mobilenet_gpu
...
Mobilenet gpu implementation
8 years ago
Superjom
4c33d55a32
add rnn_design.md
8 years ago
qijun
06acd6d0cb
add unittest for some basic OpKernels
8 years ago
fengjiayi
06358129a2
Fix compile error
8 years ago
fengjiayi
bf3b8f045e
Retrigger CI
8 years ago
fengjiayi
da07ec1886
Update Tensor and PODDeleter's template parameter
...
1. Change PODDeleter's template parameter 'PlaceType' to 'Place'.
2. Limit PODDeleter and Tensor::mutable_data()'s `T` to POD type.
8 years ago
qijun
ba79050923
Merge remote-tracking branch 'baidu/develop' into implement_basic_OpKernel
8 years ago
fengjiayi
6cd94cc73f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_refactor_tensor
8 years ago
xzl
6c528cbc2a
add check: CHECK_EQ(outputs[0].getArgType(), ADD_TO)
8 years ago
fengjiayi
4736b239d9
Add a simple test for grad_op_creator
8 years ago
Qiao Longfei
2685765905
add cross-entropy-op ( #2965 )
...
* add cross-entropy-op
* add infershape and compute
* implement Infershape and compute of onehotcrossentropy op
8 years ago
xzl
e8d171bbd2
add check for groups and inputChannels
8 years ago
xzl
cfd4c05e2e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
Yi Wang
75b53179a2
Merge pull request #2882 from gangliao/cpu_mem
...
Update Memory ReadMe Doc
8 years ago
xzl
d5b0c57d7e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
248149f448
add depthwiseconv test and fix the little bug of the convOpTest
8 years ago
QI JUN
155f727c21
Merge pull request #2978 from QiJune/fix_bug_EigenVector
...
fix bug in EigenVector::Flatten method
8 years ago
fengjiayi
9418717f80
Fix compile errors
8 years ago
fengjiayi
f41fcd43d0
Merge branch 'backward' of https://github.com/dzhwinter/Paddle into backward
8 years ago
fengjiayi
99a5904c75
Merge branch 'backward_dev' into backward
8 years ago
dongzhihong
088e2200c4
"remove unused fake fc op"
8 years ago
xzl
5b07d4e009
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
tensor-tang
577bb4e346
rename mkllite to mklml
8 years ago
qijun
586365e0ad
using alias to simplify code
8 years ago
liaogang
b3115fb01c
Add SetDeviceId in memcpy
8 years ago
qijun
407b0efc79
fix bug in EigenVector::Flatten method
8 years ago
fengjiayi
b635af71d8
Fix some compile error
8 years ago
fengjiayi
8a5ee462a5
Fix some compile errors
8 years ago
wanghaoshuang
7c09999d57
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_croplayer
8 years ago
wanghaoshuang
4409255c27
fix code format
8 years ago
dongzhihong
6f0539246c
Merge remote-tracking branch 'origin/develop' into backward2
8 years ago
dongzhihong
81a352aff7
"test fc without gradient"
8 years ago
xzl
4d6be9741c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
liaogang
0897d18a0a
Fix string.h for memcpy
8 years ago
dongzhihong
14424f314c
"use built-in operator"
8 years ago
liaogang
00500eeb7d
Add stdlib.h for memcpy
8 years ago
liaogang
ca89bfada3
Fix paddle enforce special cases
8 years ago
qijun
f46961e223
Merge remote-tracking branch 'baidu/develop' into implement_basic_OpKernel
8 years ago
liaogang
98f0171c94
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into error
8 years ago
fengjiayi
e192d0fd01
Refactor the implementation of gradient Op creating
8 years ago
dongzhihong
4876f35858
"make plainNet shared"
8 years ago
xzl
bd54eb9834
tiny modify the test
8 years ago
qiaolongfei
e4984f13e9
fix tensor usage in sgd-op
8 years ago
liaogang
527c85970b
Fix H2D and D2H order
8 years ago
qijun
a6347daa05
fix clang compile warning
8 years ago
liaogang
5e8a4f16c7
Fix conflcts
8 years ago
liaogang
ee112915e3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into error
8 years ago
qijun
3014662d4a
Merge branch 'implement_basic_OpKernel' of https://github.com/QiJune/Paddle into implement_basic_OpKernel
8 years ago
Qiao Longfei
e3b27d1998
Add sgd op ( #2950 )
...
* a simplest SGD op
8 years ago
qijun
736d078cbf
replace Tensor::tensor to EigenTensor::From
8 years ago
liaogang
aae53d836b
Merge conflicts
8 years ago
liaogang
b05886414b
Add cuda memcpy in gpu_info
8 years ago
liaogang
e53a48b46a
Add memcpy
8 years ago
xzl
1f516fa0ef
modify format, and modify the layer grad test, op test
8 years ago
qijun
8ad9006dd5
Merge remote-tracking branch 'baidu/develop' into implement_basic_OpKernel
8 years ago
QI JUN
d81084939b
Merge pull request #2953 from wangkuiyi/tensor_type_to_eigen
...
Refactorize Tensor to Eigen convesion
8 years ago
liaogang
2d2ee47bda
FIX: fix string
8 years ago
qijun
d6d057b4e8
change EQ to NEAR for float value
8 years ago
xzl
81998868f0
fuse interface of depthwise to expandconv
8 years ago
whs
2e2a674892
Merge pull request #2490 from wanghaoshuang/crop_layer
...
add crop layer
8 years ago
dongzhihong
73f4779808
Merge remote-tracking branch 'origin/develop' into backward2
8 years ago
Yu Yang
0380bfb3cf
Expose Net to Python
...
* Expose PlainNet to Python, make python can add_op, complete_add_op
* Provide a low level api to manipulate Net
* Unittest for Net::DebugString
8 years ago
qijun
57c27b4e00
make EigenTensor default unaligned and follow comments
8 years ago
liaogang
97c2a9a969
Fix: compiler error under gpu
8 years ago
fengjiayi
55d301722f
Simplify Tensor implimentation
...
ATTENTION: some interfaces changed:
1. void Tensor::set_dims(const DDim& dims) ==> void Tensor::Resize(const DDim& dims).
2. void Tensor::ShareDataFrom(const Tensor& src) ==> void Tensor::ShareDataWith(const Tensor& src)
3. DDim Tensor::dims() const ==> const DDim& Tensor::dims() const
8 years ago
qijun
14cfb8c262
fix gpu build error
8 years ago
liaogang
fab896c5a0
Remove using namespace
8 years ago
fengjiayi
3dc70ff282
Merge branch 'backward' of https://github.com/dzhwinter/Paddle into backward
8 years ago
Yu Yang
e8304bd92a
Merge pull request #2945 from reyoung/feature/grouped_ops
...
Skeleton Of fully connected operator
8 years ago
dongzhihong
8bc4892106
"fix comment "
8 years ago
qijun
cff8762f26
add more uinttest for EigenTensor
8 years ago
liaogang
941eccac83
Follow comments
8 years ago
fengjiayi
94a6b1fa43
rename a macro
8 years ago
dongzhihong
cb95587feb
"ignore some gradient of specific op"
8 years ago
xzl
21ab0eb8e1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
f7390d115c
delete useless .h header in DepthwiseConvOpGpu.cu
8 years ago
wanghaoshuang
60a788933b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_layer
8 years ago
qijun
d9fa6159b7
add Flatten method to EigenVector
8 years ago
wanghaoshuang
3e7819c276
1. Reading image shape from input data instead of image_config
...
2. Add crop layer unitest
3. Fix bugs
8 years ago
fengjiayi
bf4da3d97a
Refactor Rigistry::CreateGradOp()
...
We put forward Op's inputs, outputs and output gradients into Grad
Op's inputs, and put forward Op's input gradients into Grad Op's output.
So Grad Op's `in_out_idx`, `input_format` and 'output format' need to be
rebuilt during Op creating.
8 years ago
liaogang
028f3dc4e5
Add memcpy
8 years ago
Yu Yang
0a0b4caaa7
Change Operator::create use py::bytes not std::string
8 years ago
Yu Yang
b90780c3a8
Merge pull request #2939 from reyoung/feature/refine_tensor_hpp
...
Tensor should use PADDLE_ONLY_CPU macro
8 years ago
xzl
d50c71f3e2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
66520af9ca
accelerate inputbackward(delete 'if' in this func) of depthwise conv
8 years ago
Yi Wang
1981eaf922
Fix Tensor::data interface
8 years ago
Yi Wang
2538e20787
Fix wrong inclusion path
8 years ago
Yi Wang
00ed564307
Update
8 years ago
Yi Wang
cb1d1f167c
Add unit test
8 years ago
Yi Wang
642d3c4687
Refactorize Tensor to Eigen convesion
8 years ago
emailweixu
84a0d6d898
Merge pull request #2572 from skylian/error_clipping
...
Enable error clipping for fc layer
8 years ago
helinwang
a818afcc6c
Merge pull request #2887 from helinwang/fix_pre_commit
...
fix travis check style for Go
8 years ago
xzl
dbb658805e
modity the format
8 years ago
fengjiayi
e786746f6f
Merge branch 'backward' of https://github.com/dzhwinter/Paddle into backward
8 years ago
dongzhihong
855cae603c
move unused file
8 years ago
fengjiayi
7f1533f251
test collaborating
8 years ago
xzl
44927bf70a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
11588b3670
support inputchannels != outputchannels of depthwiseconv
8 years ago
xzl
02e04b4441
fuse the conv and depthwise conv together
8 years ago
dongzhihong
8b80cf8989
"add net op testing"
8 years ago
dongzhihong
9890b23c82
fix confilct
8 years ago
dongzhihong
816b4c8ab0
"add backward Op"
8 years ago
qijun
b6c075527c
implement some basic OpKernel
8 years ago
Yu Yang
d3373c5b85
Skeleton Of fully connected operator
...
* Fc operator is a grouped operator, which combined by may internal
operators.
* InferShape & Run a FC operator in Python.
8 years ago
Yu Yang
ad728419d6
Merge pull request #2912 from reyoung/feature/better_network_debug_str
...
Override Net::DebugString to invoke inner ops_
8 years ago
Yu Yang
107b3395b4
Merge branch 'develop' into feature/better_network_debug_str
8 years ago
QI JUN
3208914b56
Merge pull request #2805 from QiJune/tensor_to_EigenTensor
...
Add method converting Tensor to Eigen TensorMap
8 years ago
Yu Yang
fb48cb120d
Merge pull request #2936 from reyoung/feature/create_op_in_cpp_params
...
Make CreateOp in Plain C++ params
8 years ago
Yu Yang
9b964b535b
Tensor should use PADDLE_ONLY_CPU macro
...
* Not use __CUDACC__ macro, because malloc in GPU memory do not have to
used in .cu file.
* Also, `PADDLE_ENFORCE` logic is wrong.
8 years ago
tensor-tang
89a4158038
enable MKLDNN library and MKL small package
8 years ago
Yu Yang
8539222a1a
Merge branch 'develop' into feature/is_in_gpu
8 years ago
Yu Yang
1dc53a289f
Use friend not to expose tensor's `type/place`
8 years ago
Yu Yang
f6a51d9b4e
Make CreateOp in Plain C++ params
8 years ago
QI JUN
d344f6702c
Merge branch 'develop' into tensor_to_EigenTensor
8 years ago
Yu Yang
875946fff4
Merge pull request #2935 from reyoung/feature/create_op_use_cpp_params
...
Change `in_out_idxs_` to shared_ptr
8 years ago
Helin Wang
22be2b2bb8
fix travis check style
8 years ago
helinwang
2db1b68d7c
Merge pull request #2885 from helinwang/handle_err
...
Add gometalinter errcheck to pre-commit-hook, fix all Go unchecked errors.
8 years ago
Yu Yang
c1219a530c
Change `in_out_idxs_` to shared_ptr
...
* `in_out_idxs_` shares between all operator instance in same type
of operator.
8 years ago
Yu Yang
1ac0bffaf7
Override Net::DebugString to invoke inner ops_
...
Net::DebugString() print what it is and invokes inner operators
DebugString() with a prefix '\t'.
8 years ago
Yu Yang
e00aae5386
Fix Merge Bugs
...
* There is a merge conflict when merge PR #2914
* Develop and PR #2914 both add `DDim::size` method, but did not
triger git merge conflict while merge.
8 years ago
Qiao Longfei
051676a7e4
support multiple template parameter in KernelType for REGISTER_OP_XPU_KERNEL ( #2932 )
8 years ago
Yu Yang
861b66d4e3
Merge pull request #2914 from reyoung/feature/add_some_skeletons_of_ops
...
Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops
8 years ago
Yu Yang
a89c7ffa94
Make Tensor <--> Numpy interactive in tensor.h
...
* Follow review comments to seperate Tensor Numpy interactive methods in
tensor.h.
* Simplify logic for `CastToPyBufferImpl`, make it as one struct and in
details namespace.
* Remove `Scope` expose in Python, since it currently is useless.
* Remove some debug functions.
8 years ago
qijun
4d49f1d892
merge baidu/develop
8 years ago
hedaoyuan
f146b03b44
Merge pull request #2824 from hedaoyuan/fix_nnpack
...
Fix some link errors about NNPACK.
8 years ago
Helin Wang
37624b30ff
Fix Go pre-commit
8 years ago
Yu Yang
bde90be71b
Read/Write a Tensor Python
...
Basically following
http://pybind11.readthedocs.io/en/stable/advanced/pycpp/numpy.html
* Use buffer protocol to return a view of Tensor. It can be cast to
numpy array in Python.
* Set a numpy array to a tensor.
8 years ago
Yu Yang
122e83e36c
Fix unittest
8 years ago
fengjiayi
78fa5e307d
Add DDim::size()
8 years ago
fengjiayi
78bd815e85
refine conditional compilation and remove `numel_`
8 years ago
Yu Yang
73a9f0f25d
Merge develop
8 years ago
fengjiayi
1cd14f661b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_enable_tensor_test
8 years ago
Yu Yang
ad8fa77cce
Merge branch 'develop' into feature/add_some_skeletons_of_ops
8 years ago
Yu Yang
5847b96a61
Follow comments, correct implement `DDim::size()`
...
* Also fix unit test
8 years ago
qijun
87189665d9
merge baidu/develop
8 years ago
qijun
2a03e3808d
set correct place for output tensor
8 years ago
Yu Yang
0e77b31a02
Merge branch 'develop' into feature/op_creation_methods
8 years ago
Yan Chunwei
a0caf23430
Op varient inputs ( #2901 )
...
* add inputs
* add ut for multiple inputs
* fix AddToLayer
* op_desc -> op_proto
* CreateArgumentOffsetMap -> CreateInOutOffsetMap
* move CreateInOutOffsetMap from OperatorBase to op registry
* arg_idxs_ -> in_out_idxs_
8 years ago
Yu Yang
1ed237c11e
Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops
...
* Implement InferShape and register them, give a stub Kernel method
by LOG(INFO)
8 years ago
qijun
6dc567a52e
merge baidu/develop
8 years ago
qijun
5017b15468
refactor tensor mutable_data
8 years ago
Yu Yang
a4eaf2d3ba
Merge pull request #2897 from reyoung/feature/op_library
...
CMake `op_library` function
8 years ago
Yiqun Liu
0ddbdbb14c
Merge pull request #2874 from Xreki/update_android_dockerfile
...
Update the dockerfile to build cross-compiling image for Android.
8 years ago
fengjiayi
a33e9da9c9
Merge pull request #2900 from Canpio/dev_ddim_update
...
Refactor DDim's product() and add slice_ddim()
8 years ago
Yu Yang
38310f9349
Refine CMake dependencies graph
8 years ago
Yu Yang
acc8025e48
Merge branch 'develop' into feature/op_library
8 years ago
Yu Yang
3bb1f98b65
Merge branch 'develop' into feature/op_library
8 years ago
Qiao Longfei
80a26a6308
check duplicate of ProtoAndCheckerMaker ( #2903 )
8 years ago
Liu Yiqun
8a3e735307
Delete the blank line at the end of script file build_android.sh.
8 years ago
Liu Yiqun
ac8c8bd5fb
Merge branch 'develop' into update_android_dockerfile
8 years ago
Yan Chunwei
cdec563449
Add enforce switch for convient develop ( #2850 )
...
* add NDEBUG switch to PADDLE_ENFORCE
8 years ago
Yu Yang
c5bc126762
Follow comment, rename to `GenerateTempVariableName`
8 years ago
Yu Yang
68f6ac011c
Merge branch 'develop' into feature/op_creation_methods
8 years ago
fengjiayi
778a1a9f87
add a comment for
8 years ago
qijun
65dbeb6a24
fix gpu build error
8 years ago
qingqing01
1032ef1893
Merge pull request #2907 from juliecbd/to_meet_cpplint
...
Change to meet cpplint style
8 years ago
qijun
9e3a9eb233
merge baidu/develop
8 years ago
武毅
23b8346072
Fault tolerant distributed training, just work version, with etcd ( #2849 )
...
* using etcd as fault tolerant training
* update
* workable version, ft not tested
* small fix
* update
* remove TODO
8 years ago
qijun
d649dbf442
implement add_op kernel
8 years ago
jc
a8c427e0d4
Change to meet cpplint style
8 years ago
Qiao Longfei
45ce1649a1
change net to operator ( #2846 )
...
* OperatorBase should not store OpDesc because not All op contains an
OpDesc and not all ops create from OpDesc.
* Networks do not contain OpDesc and are not created by OpDesc
* Do not register Network to OpRegistry.
* The network is directly created by the user in Python. Not from
registry.
* Correctly handle the `inputs` and `outputs` of a Network.
* Add CompleteAddOp() methods
* Remove `AddOp(OpDesc&)` in net-op. All op are added by OperatorPtr.
* Rewrite unit test for truly tested what networks do.
* optimise operator_test
8 years ago
fengjiayi
7de10ee7e2
formate code
8 years ago
fengjiayi
85c4f488f3
Refactor DDim's product() and add slice_ddim()
...
1. Refactor DDim's product() to make it more efficiently.
2. Add slice_ddim().
8 years ago
Qiao Longfei
afa99d9ae6
add ADD_OP_CPU to enable add op with only cpu kernel ( #2896 )
...
* add ADD_OP_CPU to enable add op with only cpu kernel
8 years ago
Yu Yang
d3a749a5bf
CMake `op_library` function
...
* It is used to create an operator library. It handles to split CPU and
GPU sources and links operator common libraries.
* It also give a reasonable warning and error when operator developer
not correctly implement an operator.
* Warning for lack of GPU kernel.
* Same interface as `cc_library` to make code style consistent.
8 years ago
Yu Yang
9e0c6800c5
Python Generate OpCreation Methods by OpProto
...
All OpCreation method are generated by
`create_op_creation_methods::__bootstrap__` method, and stores in
`op_creations` object and its methods.
There are three parts to implement this feature.
1. Get all registered `OpProto` from C++ side. It is implemented in
`get_all_op_protos` method.
1. Create a function to convert `kwargs` to `OpDesc` base on each op's
`OpProto`. The `OpDescCreationMethod` class.
1. Convert `OpProto` to `docstring` by `get_docstring_from_op_proto`
method.
All three methods are unit tested. The `__bootstrap__` just combines
them together and create a method in runtime.
For details, please reference the doc string in
`create_op_creation_methods.py` and the unit test
`test_op_creation_methods.py`.
8 years ago
Yu Yang
1faf5e0313
Merge pull request #2864 from reyoung/feature/op_creation_methods
...
Get OpProtos in Python
8 years ago
fengjiayi
afa2a88d78
add conditional compilation for tensor
8 years ago
fengjiayi
66cf21c880
fix compile error
8 years ago
fengjiayi
68adb9541d
enbale tensor memory test
8 years ago
fengjiayi
ee90c2d22b
add slice_dim draft
8 years ago
liaogang
ab8c7f11f3
FIX: virtual OptimizerTest
8 years ago
liaogang
7010a5da1a
FIX: throw_on_error on cuda
8 years ago
liaogang
569f7e83a6
FIX: cppint code style
8 years ago
liaogang
620575b6eb
FIX: cpplint code style
8 years ago
liaogang
90c4cd8323
FIX: header file
8 years ago
liaogang
2680dca9c8
ENH: add cuda enforce to PADDLE_ENFORCE
8 years ago
liaogang
f812de2cce
ENH: unify PADDLE_ENFORCE
8 years ago
liaogang
340b8bad2c
Update Memory ReadMe Doc
8 years ago
gangliao
48cf64e833
Merge pull request #2674 from gangliao/cpu_mem
...
[Done] Memory Management: Buddy Allocator
8 years ago
fengjiayi
c48fc4d826
Merge pull request #2825 from Canpio/dev_add_tensor_copy
...
Add Tensor::CopyFrom and Tensor::mutable_data(Place place)
8 years ago
liaogang
03b3d0d8a8
Follow comments
8 years ago
fengjiayi
57a22db3fa
update PADDLE_ENFORCE message
8 years ago
fengjiayi
34beec0f7a
update tensor.h
8 years ago
fengjiayi
8594d5c314
change int numel_ to size_t numel
8 years ago
fengjiayi
8bcd1faffc
refactor product(DDim ddim)
8 years ago
Liu Yiqun
450cf18b53
Add Go compiler to Dockfile.android and rename the build directory to build-android.
...
The newest developing image was push to dockerhub, named xreki/paddle-android:dev.
8 years ago
hedaoyuan
3f5e650d2d
Merge pull request #2862 from hedaoyuan/fix_14
...
Remove useless empty pointer check.
8 years ago
fengjiayi
1f97388a34
fix several compile error
8 years ago
fengjiayi
dcfcf6872a
Refactor Tensor::CopyFrom()
...
1. Add template T which indicates data type to `CopyFrom()`, `Slice()`
and `ShareData()` functions. This makes `CopyData()` code much clearer.
2. Add `set_dim()`.
3. `product(DDim)` transforms `DDim` to `vector<int>` first and then calculate
its product. That might be quite slow. For `product(dims_)` is frequently
used in Tensor, we add a mumber variable `numel_` as a cache of the
product result.
TODO: refactor `product()` to make it more efficient.
4. Unable Tensor::operator=
5. Remove the limit of POD type, because `float16` and `int8` are not POD type.
8 years ago
fengjiayi
a1dc4311a3
Refactor `Tensor::CopyFrom()`
8 years ago
qijun
bac1426d47
add_op kernel implementation
8 years ago
Yu Yang
a76f7ed2eb
Get OpProtos in Python
...
* PyBind and SWIG of paddle cannot be load in a single Python process,
lazy import all SWIG library of Paddle. Otherwise, the glog, gflags
are imported twice in a same Python process.
* Note that all PyBind11 return C++ std::string as an unicode. For
protobuf, it is need be cast to `str` before use them.
* Add unit test for Get `OpProtos`
8 years ago
Qiao Longfei
58f3de95cf
Optimize ptr ( #2851 )
...
* use OperatorPtr = std::shared_ptr<OperatorBase>;
* use ScopePtr = std::share_ptr<Scope>;
8 years ago
Yu Yang
2462d0c5fe
Let OpProto support multiple and temporary ( #2860 )
...
* Let OpProto support multiple and temporary
* Each input/output of Paddle's Op could be a list. Add multiple mark to
OpProto. Also add a `input_format`/`output_format` attribute if that
Op has multiple input or output. The format of that attribute please
reference the comments in `op_proto.proto`
* Add temporary mark, because some output of an Op is not used by user
but used by other op for faster computation. Explicitly mark which
output is temporary could let future memory/computation optimization.
* Add generated field to AttrProto.
* Add `AddInputs`/`AddOutputs` function
* It is more readable to invoke `AddInputs` not
`AddInput(multiple=true)`.
8 years ago
hedaoyuan
010adb99b4
Remove useless empty pointer check.
8 years ago
liaogang
033523ea9d
update
8 years ago
qijun
6f2eba3e7e
Merge remote-tracking branch 'baidu/develop' into tensor_to_EigenTensor
8 years ago
liaogang
ea916c8487
Fix: alignment metric
8 years ago
Yu Yang
cd5113c197
Init commit
8 years ago
liaogang
21b7915d91
Fix condition compile
8 years ago
liaogang
365b457a04
Merge conflicts
8 years ago
liaogang
ab5fe1e907
ENH: memory test: check alignment and memory size
8 years ago
helinwang
f49fda5e73
Merge pull request #2855 from helinwang/issue/2854
...
Fix optimizer parameter buffer allocation size.
8 years ago
xzl
6267312aa9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
c43f6936c0
modify the format and delete useless comment
8 years ago
Yiqun Liu
bb8ac28b20
Merge pull request #2845 from Xreki/travis_build_android
...
Add build_android task on Travis CI.
8 years ago
qijun
0b0b3ba16a
Merge remote-tracking branch 'baidu/develop' into tensor_to_EigenTensor
8 years ago
fengjiayi
dfa4650920
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_tensor_copy
8 years ago
qiaolongfei
e588730147
change op to operators
8 years ago
Helin Wang
11660eab0e
Fix optimizer parameter buffer allocation size.
...
The buffer allocation size should be number of bytes, not number of
floats.
8 years ago
Liu Yiqun
62908dcc24
Move the download of ndk to build_android.sh script file.
8 years ago
Yu Yang
79b70c2d69
Follow comments
...
* Convert `op` --> `operators`
* Remove AddType in OpProtoMaker, because type is part of registry.
* Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.
8 years ago
liaogang
00572aa451
Add memory alignment test
8 years ago
Yu Yang
a0aaafe9de
Add a sample op, `add_op`
...
* Refine register methods, make Op can get rid of whole-archieve
* `USE_OP` before a op is used.
* Add unittest for add_op.
8 years ago
Liu Yiqun
95897fd1e1
Add build_android task on Travis CI.
8 years ago
qijun
f6fe37155f
Merge remote-tracking branch 'baidu/develop' into fix_bug_dynload
8 years ago
Qiao Longfei
728665d709
Add Init to OperatorBase ( #2838 )
8 years ago
liaogang
ff98e3c1ec
ENH: Remove comments
8 years ago
qijun
71e2a94310
Merge remote-tracking branch 'baidu/develop' into tensor_to_EigenTensor
8 years ago
qijun
4e918377d0
fix bug in dynload
8 years ago
helinwang
f5f7d6bd4f
Merge pull request #2811 from helinwang/go_test_1
...
Add go testing into cmake
8 years ago
Helin Wang
e4be077ffa
Add go testing into cmake and fix libpaddle_go_optimizer.a link path
8 years ago
xzl
fc8aedb1de
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
Qiao Longfei
be441f7d16
test OpKernel ( #2820 )
...
Add unit test for OpKernel
8 years ago
hedaoyuan
891e5dcc48
Modify the default value of nnpack_allocate_outside.
8 years ago
Yu Yang
555b0a72d6
Merge pull request #2814 from QiJune/feature/refine_device_context
...
Add GetEigenDevice To DeviceContext.
8 years ago
qijun
70d937c595
add memory header file
8 years ago
hedaoyuan
06748210d4
Fix some link errors about NNPACK.
8 years ago
fengjiayi
69d99d481d
Add Tensor::CopyFrom and Tensor::mutable_data(Place place)
...
1. Add `Tensor::CopyFrom`. Current version can only support CPU memory
copy. The support of GPU will be provided later by `paddle::memory`.
The current implementation of `Tensor::CopyFrom` is a little inefficient:
Every time `CopyFrom` is called, tensor will re-allocate its memory. However, if
we try to check and reuse `placeholder_`, we have to provide a template
parameter for `CopyFrom` to indicate the data type. It seems strange for
a simple copy function.
2. Add `Tensor::mutable_data(Place place)`, which directly use member
variable `dims_` as its dim parameter. This interface is required by
`Op::InferShape`.
8 years ago
qijun
be2c1a3b99
follow comments
8 years ago
qijun
a07deac9ef
follow comments
8 years ago
qijun
85806e7585
follow comments
8 years ago
xzl
ccd46d1bf6
modify format accored with clang-format 3.8
8 years ago
qijun
8ee50a35d4
fix gpu build error
8 years ago
Luo Tao
e0ea87c99d
fix pybind compile question
8 years ago
xzl
433935aa51
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
fd4b1136a9
move DepthwiseConvOpTest.cpp to ConvOpTest.cpp
8 years ago
qijun
ca23d86160
merge baidu/develop
8 years ago
qijun
4d336d9063
follow comments
8 years ago
Yu Yang
0a32008126
Merge pull request #2815 from reyoung/feature/op_kernel
...
OperatorWithKernel
8 years ago
Yu Yang
0ff8192072
Add OperatorWithKernel class
...
* User can register OpKernel to its Ops. The OpKernelMap saved in
OperatorWithKernel. Each Op which inherits OperatorWithKernel will
use `OpKernel::Compute` instead of Run.
8 years ago
qijun
ef5f9debc6
refine device_context
8 years ago
qijun
6bbc2944ae
fix code style
8 years ago
qijun
37bcc0860a
merge baidu/develop
8 years ago
fengjiayi
8681c5800b
Merge pull request #2813 from Canpio/dev_update_dim
...
Remove Dim::contiguous and Dim::contiguous_strides
8 years ago
fengjiayi
76b4dd6226
Merge pull request #2809 from Canpio/dev_add_tensor_interface
...
add tensor interfaces
8 years ago
qijun
b5a8d5b4b4
remove unused deps
8 years ago
qijun
8f5a9fd9a7
fix gpu build error
8 years ago
fengjiayi
2dbe60e489
Remove Dim::contiguous and Dim::contiguous_strides
...
Paddle's data block is row-major order, while Dim::contiguous and
Dim::contiguous_strides are based on column-order. So remove them to
prevent misuse.
8 years ago
fengjiayi
2dccab87d4
fix some syntax problems
8 years ago
qijun
14d2c3990f
split device_context
8 years ago
QI JUN
2749b71f27
Merge pull request #2806 from reyoung/feature/refine_cuda_h
...
Refine CUDA Related libraries
8 years ago
武毅
bcf9f421c3
Merge pull request #2774 from typhoonzero/fix_newupdater
...
Fix new remote updater for go pserver
8 years ago
Qiao Longfei
a2e5f652d3
add operator base ( #2725 )
...
Add OperatorBase.
issue: https://github.com/PaddlePaddle/Paddle/issues/2790
Paddle design the Operator with Kernel. OperatorBase has no type and device information when create, One operator can have multiple kernels, Operator will choose a kernel to run according to context. The kernel should be bind to Operator before or during Operator running.
8 years ago
fengjiayi
0665dc9755
add more test
8 years ago
fengjiayi
ca39515e24
Add several interfaces for Tensor class
...
1. Add member variable 'DDim dims_' and a getter function 'dims()'.
'dims' is supposed to hold tensor's shape during Op::InferShape.
2. Remove 'mutable_data' which use default Place. User must specify a
explicit Place when call 'mutable_data'.
3. A PlaceHolder may be shared by more than one tensor, and some of them may be the others' slices. So we add a new member variable 'offset_' for Tensor, which is used to show the byte offset between PlaceHolder::ptr_ and where tensor's data really begins.
4. Add functions 'ShareDataFrom' and 'Slice' for Tensor.
TODO: Tensor needs a 'CopyFrom' function.
8 years ago
qijun
69d76812ae
fix cublas dynload bug
8 years ago
Yu Yang
267f9a2cdf
Move static variable defined in .cc ( #2782 )
...
* Move static variable defined in .cc
We cannot define static variable in .h, because it will be multi-defined
errors.
Also fix some cpp syntax, like:
* Prefer to use algorithm not manually for-loop, to make code more
readable.
* Remove unused `()`.
* Enforce take a bool. It is no need `xxx==true`.
* Use range-based for-loop iterator from op_desc.attrs
* Fix a protential static variable init order error
8 years ago
Yu Yang
27b196ba6d
Expose paddle.framework by pybind11 ( #2793 )
...
* Expose paddle.framework by pybind11
* Export paddle.framework.{Scope, Variable} to paddle.v2.framework.core.
* See python/paddle/v2/framework/tests/test_scope.py for Python usage
* See paddle/pybind/pybind.cc for C++ bind code.
* add copyright
8 years ago
liaogang
383b96f32c
FIX: merge conflicts
8 years ago
qijun
a5eb1d8fab
fix build error
8 years ago
Yu Yang
a046605307
Refine CUDA Related libraries
8 years ago
liaogang
f404282dc2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu_mem
8 years ago
liaogang
6a3b8416df
FIX: clang-format
8 years ago
liaogang
d4017cadcd
ENH: Add auto-free if allocate too much
8 years ago
qijun
d607f0b703
use cached rank
8 years ago
Yu Yang
abff52abcb
Simplelize framework/CMakeLists.txt ( #2803 )
...
* generic.cmake can propogate dependencies through libraries. It is no
need to specific all dependencies.
8 years ago
qijun
958511160b
add simple add_op_functor
8 years ago
liaogang
49fd49f76b
Fix conflicts
8 years ago
qijun
d6f7c3535d
move unaligned tensor types
8 years ago
dongzhihong
b871641a53
"switch to shared_ptr"
8 years ago
dongzhihong
18e65b0c08
"support net_proto header"
8 years ago
wanghaoshuang
de5ded6bbd
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_layer
8 years ago
yi.wu
5a4f33df7e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_newupdater
8 years ago
dongzhihong
bc021d775e
"move opContext to DeviceContext"
8 years ago
xzl
e92f002178
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
a3ce6aa8ca
add depthwise conv test
8 years ago
xzl
198164adef
use the expandconvlayer forward and backward, add the explain for class
8 years ago
qijun
1a968b4f64
init
8 years ago
xzl
064dc888ef
add the comments for .h file and code tiny modify
8 years ago
dongzhihong
052d1d16ee
Merge remote-tracking branch 'origin/develop' into net_ycw
8 years ago
QI JUN
1038bc461a
implement DeviceContext ( #2709 )
...
* add device_context
* add unittest for device_context
* transfer to use function paddle::platform::throw_on_error
* fix cuda build error
* using dynload functions
* follow comments
8 years ago
liaogang
199b5fcb45
ENH: refine code comments
8 years ago
caoying03
82801f24e5
save log probabilty for every generated words.
8 years ago
liaogang
1ce2fca484
Merge conflicts
8 years ago
dongzhihong
45adbfc4e0
Merge remote-tracking branch 'origin/develop' into lr_state
8 years ago
xzl
36e7800aa1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into mobilenet_gpu
8 years ago
xzl
f4e7ae5540
add mobilenet gpu grad test, the test is ok
8 years ago
xzl
efae51ce24
add the mobilenet gpu acceleration, cpu is in the process
8 years ago
wuyi05
26d95a6bbf
fix new remote updater for go pserver
8 years ago
fengjiayi
1d2ef1db82
[draft] add registry for Op, OpProto and OpAttrChecker ( #2739 )
...
* init op_registry.h
* dev op_registry.h
* add 'attr_checker.h', which is a draft of op attribute checker.
* rename some macro parameters
* 1. Use `Attribute` and `AttributeMap` instead of `OpDesc`. `AttributeMap` is a unordered_map of <string, Attribute>, and `Attribute` is a boost::variant object to hold multiple types of attribute value.
2. Use `PADDLE_ENFORCE` to print checkers' fail message.
3. Abstract default value operations to a new function: `DefaultChecker`.
* rename DefaultChecker to DefaultValueSetter
ZZ
* Finish op_registry
1. Complete the development of interfaces between OpRegistry and
Protobuf.
2. Add unit test for op_registry.h
* Add demo and test of custome checker
* fix merge conflict
8 years ago
liaogang
ddfa6cf0d1
FIX: remove boost from memory folder
8 years ago
武毅
77604bca27
Merge pull request #2748 from typhoonzero/add_golang_precommit
...
Add golang precommit
8 years ago
wanghaoshuang
0b788ef75b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_layer
8 years ago
Yu Yang
e2ea1f42e9
Generate python protobufs for paddle.v2.framework
...
Python should be able to manipulate Protobuf message because:
1. Python's `create_op_creation_methods` take the `OpProto` array to
generate all `op_creation_methods` in RunTime.
2. All `op_creation_methods` will create an `OpDesc` and pass it to
Paddle C++ method `CreateOp` and return the Op handle.
Here is the list of what is added in this commit:
* Add `protobuf_generate_python` if it is not defined.
* Before cmake 3.4, `protobuf_generate_python` is not defined. Just
copy the implementation of that function in `protobuf.cmake`
* Add `py_proto_compile` function in `cmake/generic.cmake`.
* It follows bazel's API interface.
* https://github.com/pubref/rules_protobuf#rules
* Add an empty package named `paddle.v2.framework`, all python code of
`paddle::framework` will be in that package.
* Generate protobuf's python module `__init__.py` by `touch` while
compiling.
* Change setup.py.in, make `paddle.v2.framework.proto` uses the
generated protobuf pythons.
8 years ago
wuyi05
024a6537da
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_golang_precommit
8 years ago
Yu Yang
1b366dc2ff
Fix CI error on test_LayerGrad.LSTM
...
* We should not EXPECT_EQ between a float value and a int value.
Use ASSERT_NEAR instead.
8 years ago
liaogang
a669bf48d9
FIX: explicit construct pool element
8 years ago
Superjom
1264480b04
fix ci
8 years ago
liaogang
5d2e8edb3e
FIX: dynamic loader deps
8 years ago
liaogang
936cd1e118
FIX: code format
8 years ago
liaogang
74691789e9
ENH: add memory unit test
8 years ago
yi.wu
9f64332258
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_golang_precommit
8 years ago
Superjom
0b6430a363
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
wanghaoshuang
d378e0a0cd
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into crop_layer
8 years ago
Yang yaming
f21e3f731f
Merge pull request #2247 from pkuyym/fix-2240
...
fix bugs for CrossChannelNormLayer
8 years ago
liaogang
ada1c20bbc
FIX: Buddy Allocator Free with Merge feature
8 years ago
gangliao
1cc8fe7229
Merge pull request #2745 from gongweibao/fixcmake
...
fix cmake errors
8 years ago
yangyaming
c37da0bd3b
Remove hasParamInitialValue flag.
8 years ago
Yu Yang
5d2a08775a
Merge pull request #2747 from reyoung/feature/correct_glog_check
...
Correct GLOG CHECK in Paddle
8 years ago
gongweibao
204869c2da
fix bugs
8 years ago
gongweibao
2e302085d7
fix bugs
8 years ago
Tao Luo
7f380c1be3
Merge pull request #2701 from luotao1/stride
...
stride pooling for max and average layer
8 years ago
Yu Yang
5eb8bf0324
Correct GLOG CHECK in Paddle
...
Use CHECK instead of PCHECK, because PCHECK is used for errno.
8 years ago
liaogang
d0ad0314bb
FIX: glog dependency
8 years ago
wuyi05
7c6aa04f61
add go pre-commit and travis build
8 years ago
yangyaming
2bf4f1bbc1
make adding eps more clear
8 years ago
yangyaming
7ed6463ee9
fix bugs for CrossChannelNormLayer
8 years ago
Luo Tao
e7b071f33a
update annotation with comments
8 years ago
Superjom
309b37e52e
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
wanghaoshuang
acfd2fc6df
fix cpp format
8 years ago
Yu Yang
9837896827
Merge pull request #2740 from reyoung/feature/remove_buggy_barrier_stats
...
Remove buggy BarrierStat
8 years ago
wanghaoshuang
cf86891801
fix unittest of crop layer
8 years ago
wanghaoshuang
86bdb2f33f
fix crop function test
8 years ago
wanghaoshuang
5e6e1f636a
Add grad test and python wrapper for crop layer
8 years ago
wanghaoshuang
d1d70ec831
Refine configure option of crop layer
...
1. change configure content to 'axis, offset, shape'
2. add an optional input to crop layer as cropping reference
8 years ago
wanghaoshuang
e10040ca8a
add crop layer
8 years ago
Superjom
5cf9a23f9b
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
Superjom
568c03ba1d
add virtual implementation
8 years ago
fengjiayi
830877f59b
Merge pull request #2706 from Canpio/dev_tensor_test
...
fix compile errors in tensor.
`PlaceholderImpl::Deleter` maybe used in other modules. We can consider moving it to be `paddle::memory::Deleter`.
8 years ago
Superjom
5c10a5ad55
remove virtual
8 years ago
Superjom
9f2357561d
fix ci error
8 years ago
Superjom
d1945e8d56
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
helinwang
fa21616532
Merge pull request #2735 from gongweibao/fixcmake
...
fix cmake error
8 years ago
liaogang
4dc3c9e0cc
ENH: Add paddle_memory for external usage
8 years ago
Yang yaming
312ce8b419
Merge pull request #2497 from pkuyym/ssd_outloss
...
Add DetectionOutputLayer and MultiBoxLossLayer.
8 years ago
liaogang
0ba6347565
ENH: Add buddy allocator Free
8 years ago
Yu Yang
1ecddd8174
Remove buggy BarrierStat
...
The implementation of BarrierStat is buggy, and it is not necessary
for Paddle to diagnose which node in cluster is slow.
8 years ago
Qiao Longfei
571714159a
add op_desc.proto ( #2736 )
...
* add op_desc.proto
In Operator design, we need a proto message to describe an Operator.
Third-party language such as python can build this proto message and use
AddOp(const OpDesc& op_desc) of Paddle core to construct an Op in the
Network.
8 years ago
qijun
e6fcdd47e5
fix wrong including header-file in files in paddle/platform/dynload dir
8 years ago
Superjom
91c6a7926e
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
liaogang
379434b243
Delete cmake in dynload
8 years ago
liaogang
fb413508e2
Merge conflict
8 years ago
zlx
eeb17c26fd
add depthwise operation and depthwise conv layer
8 years ago
yangyaming
86543f7f6a
Follow comments.
8 years ago
Superjom
e95299b583
fix ci error
8 years ago
gongweibao
d8941e67ec
fix bugs
8 years ago
QI JUN
6fd41f7b5c
Merge pull request #2722 from QiJune/feature/port_dyload_to_platform
...
port dyload to platform
8 years ago
liaogang
ff36389452
ENH: code style
8 years ago
Superjom
109937b8d5
fix ci error
8 years ago
qijun
3567ea6d7c
move to dynload directory
8 years ago
Superjom (Chunwei Yan)
58b7d9a048
Merge branch 'develop' into network
8 years ago
qijun
9eeabe986d
follow comments
8 years ago
Superjom
16a1637ffe
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
Yu Yang
b7397031e9
Add target_link_libraries for cc_library
...
It will fix #2728 .
Maybe it is silly to `target_link_libraries` for static library,
because a static library do not need to link other libraries. But
it will tell cmake how to propagate dependencies.
The solution comes from
[here](http://floooh.github.io/2016/01/12/cmake-dependency-juggling.html ).
* Also change op_proto_test DEPS for testing this fix works.
8 years ago
Superjom
04e20034df
replace Error with void
8 years ago
Superjom
c602e04613
add fake interfaces to make compilable
8 years ago
Yu Yang
9558770f26
Merge pull request #2731 from reyoung/feature/add_deps_for_generic_protobuf
...
Add DEPS to `proto_library`
8 years ago
Superjom
90f55b1b18
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into network
8 years ago
liaogang
4e1617d059
ENH: add buddy alloctor Free
8 years ago
Yu Yang
3de3894b82
Add DEPS to `proto_library`
...
Missing DEPS will cause compile error when parallel is large.
8 years ago
Superjom (Chunwei Yan)
06156daa28
net design with NetBuilder ( #2598 )
...
* move net_design to framework
* change CreateNet result to unique_ptr
* rename "ScratchNet" -> "PlainNet"
* add three methods to NetBase
* add NetBuilder
* add InferShape to NetBuilder.Run
* rename ApplyGradient, ApplyOptimizer -> AddGradientOps, AddOptimiz
* rename PlainNet::CreateNet -> BuildNet
* add Error and other rename actions
8 years ago
Luo Tao
3b073fdc2b
fix error in test_LayerGrad
8 years ago
dongzhihong
9f365d3636
"add net proto"
8 years ago
qijun
76b7be46da
add deps for dyload cc_library
8 years ago
qijun
ed18647e37
finish test
8 years ago
Luo Tao
b02c453d02
Merge branch 'develop' into stride
8 years ago
dongzhihong
aadbd49814
Merge remote-tracking branch 'origin/develop' into network
8 years ago
dongzhihong
dec65aca7d
"fix parameter accumulate size"
8 years ago
dongzhihong
7edabe74d4
"polish name convention"
8 years ago
dongzhihong
e1acd73fab
"fix typo deleted part"
8 years ago
dongzhihong
6935dd7bc9
"lr state serialization"
8 years ago
Yu Yang
80f8e24209
Merge pull request #2703 from reyoung/feature/op_proto
...
Add OpProto implementation
8 years ago
Yu Yang
3f63d96abe
Fix link error in op_proto_test.
8 years ago
qijun
a77fcef3f9
fix cuda compile error
8 years ago
qijun
a30754b05e
test device_context
8 years ago
qijun
3ba7a738f3
add dynamic_load
8 years ago
fengjiayi
e48e21da2b
remove unnecessary include
8 years ago
liaogang
bbd3eab7ee
ENH: Add Alloc for buddy Allocator
...
* Free will be added soon
8 years ago
liaogang
929f9cbdff
ENH: Add Metadata for memory block
8 years ago
liaogang
89110fd266
ENH: Add useGpu in system allocator
8 years ago
fengjiayi
2d1f95de87
fix a compile error
8 years ago
fengjiayi
d054a5eef8
re-submit
8 years ago