chengduoZH
ff09b21cd0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_concat_rows
7 years ago
Yu Yang
e13aec601a
Merge pull request #8830 from reyoung/feature/recordio_file_reader
...
Feature/recordio file reader
7 years ago
Yu Yang
f9974a4a12
Make double_buffer reader async
7 years ago
Yu Yang
a8c076e577
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/shuffle_reader
7 years ago
Luo Tao
de13f0eb4e
Merge branch 'develop' into infer_mkl
7 years ago
chengduoZH
b9397b2668
remove concat_rows
7 years ago
QI JUN
7287630e83
Repair nccl op test ( #8575 )
...
* fix nccl op unit test
* fix build error
* format code
* refine nccl related unit test
* fix build error
* add setGPUData
* clean up
* follow comments
* rm test_nccl.cu
* follow comment
* rm wait
7 years ago
caoying03
cf08185145
fix bugs and complete codes.
7 years ago
Yu Yang
b52ad9de92
Merge pull request #9000 from reyoung/feature/extract_prepare_from_executor_run
...
Extract Prepare from Executor
7 years ago
Tao Luo
b62874429d
Merge pull request #8910 from Xreki/core_inference_profile
...
Refine the profile codes for inference.
7 years ago
Yu Yang
43d09a1c5f
Extract Prepare from Executor
7 years ago
Yu Yang
225efa671f
Remove dims in base class
7 years ago
QI JUN
f7e9fe57d3
[Memory]More memory optimization policy ( #8690 )
...
* add memopt level
* add opt level for image classification demo
* clean code
* add delete op
* clean code
* test machine translation demo
* clean code
* clean code
* skip fill constant with force cpu
* clean code
* clean code
* refine code
* clean code
* fix bug
7 years ago
Yu Yang
2ea4a5d96c
Polish double buffer reader
7 years ago
kexinzhao
607eec30a8
Merge pull request #8946 from kexinzhao/fix_cuda_arch_fp16
...
Add GPU compute capability check for float16 math function test
7 years ago
Yancey
b5ef315cf1
Fix dist compile error ( #8987 )
7 years ago
qingqing01
b3d26cd3ad
Fix bug in detection_output and mAP calculation in SSD. ( #8985 )
...
* Clipping bbox in the mAP evaluator calculation.
* Fix bug in detection_output and mAP calculation in SSD.
* Fix bug in detection.py.
* Fix bug in test_detection_map_op.py.
7 years ago
Yu Yang
46ae4075ee
Polish ShuffleReader and test
7 years ago
Kexin Zhao
c88f58dbd8
add comment
7 years ago
chengduoZH
f1c3ecb2b2
add concat rows
7 years ago
chengduo
685f03762e
Merge pull request #8890 from chengduoZH/feature/fix_bug_of_elementwise
...
Add ElementwiseOpInferVarType for Elementwise_op
7 years ago
Kexin Zhao
3b44b849d3
address comments
7 years ago
fengjiayi
dd1244f3c9
Merge pull request #8943 from JiayiFeng/fix_bugs_in_readers
...
Fix a potential bug in the c++ reader
7 years ago
Yu Yang
7eedced82a
Polish RecordIO
7 years ago
caoying03
a8cdd97ef5
Merge branch 'develop' into enhance_reshape
7 years ago
caoying03
1d4dfc0966
fix bugs.
7 years ago
Yu Yang
cfca8a3a26
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Yu Yang
fea43077f6
Refine
7 years ago
pzelazko-intel
4730a4be24
MKLDNN pool2d OP kernel added ( #8879 )
...
* MKLDNN pool2d OP kernel added
* conv2d and pool2d MKLDNN kernels renamed
* MKLDNN conv2d kernel refactoring
7 years ago
Kexin Zhao
95de7617eb
fix bug
7 years ago
Kexin Zhao
1998d5afa2
add gpu info func to get compute cap
7 years ago
Kexin Zhao
d400b4192d
fix math function arch mismatch for older GPU
7 years ago
fengjiayi
614c33fb3a
fix a potential bug in the c++ reader
7 years ago
chengduoZH
1509ce6638
enhancement look_up_table
7 years ago
fengjiayi
aa3f5058d3
Merge pull request #8841 from JiayiFeng/dev_double_buffer_for_cpp_reader
...
Basic double buffer for cpp reader
7 years ago
QI JUN
b341bac7e1
Refine cast op ( #8923 )
...
* fix mac build error
* override GetExpectedKernelType for cast op
* fix typo
* add cuda unittest
7 years ago
Yancey
8468037918
Fix sparse update memory error for distributed training ( #8837 )
...
Fix sparse update memory error for distributed training
7 years ago
fengjiayi
35e1e0d521
uses channel to replace the traditional buffer
7 years ago
fengjiayi
b3a11fdf3a
Merge branch 'rm_reader_HasNext' into dev_double_buffer_for_cpp_reader
7 years ago
fengjiayi
6e5736e270
fix a compile error
7 years ago
fengjiayi
4e517881f7
remove HasNext
7 years ago
Liu Yiqun
a8e8507767
Refine the profile codes for inference.
7 years ago
武毅
9dd34e4169
update unpushed commits for zerocopy grpc ( #8900 )
7 years ago
zhouhanqing
9d78971d8b
Some comments have been modified.
7 years ago
Xin Pan
b825c79261
Merge pull request #8897 from panyx0718/message
...
Print exception message from threads
7 years ago
zhouhanqing
3ca968441d
Merge branch 'develop' into Paddle-ReduceProd
7 years ago
kexinzhao
90215b7844
Add float16 GEMM math function on GPU ( #8695 )
...
* test cpu float16 data transform
* add isnan etc
* small fix
* fix containsNAN test error
* add data_type transform GPU test
* add float16 GPU example
* fix error
* fix GPU test error
* initial commit
* fix error
* small fix
* add more gemm fp16 tests
* fix error
* add utility function
7 years ago
武毅
45af8c1e99
Performance/zero copy variable serialization ( #8839 )
7 years ago
Xin Pan
9a27d3af23
Print exception message from threads
7 years ago
chengduoZH
53d19f5b1e
Add ElementwiseOpInferVarType
7 years ago
ying
d3d16f76f5
enhance reshape operator.
7 years ago
Luo Tao
bc0cfb2283
remove PADDLE_USE_ATLAS
7 years ago
qingqing01
ffda2c414d
Clipping bbox in the mAP evaluator calculation. ( #8872 )
7 years ago
Yiqun Liu
fecc9a38c6
Add test for nested RecordEvent. ( #8773 )
...
* Add test for nested RecordEvent.
* Remove the debug information.
* Add log information for the 3 usages and reduce the loop counts of nested case.
7 years ago
Xin Pan
a9b9ec45ab
Merge pull request #8775 from panyx0718/test2
...
Improve the timeline profiler
7 years ago
Yu Yang
9d4c93a0a7
Fix CI
7 years ago
chengduo
abb10556e8
Merge pull request #8859 from chengduoZH/feature/refine_exe_log
...
Add log before op Run
7 years ago
Yu Yang
b536799af0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Xin Pan
30e556d675
Use vlog instead.
7 years ago
Yu Yang
db46778bdd
Polish codes and comments
7 years ago
Yu Yang
5cb79524d2
Fix CI
7 years ago
QI JUN
47ca1814f3
fix mac build error ( #8856 )
7 years ago
chengduoZH
f7c7135673
Add log before op Run
7 years ago
chengduo
f3cdeb9a29
Merge pull request #8820 from chengduoZH/feature/refine_elementwise_
...
[Speed] Refine elementwise sub,div,min,max gradient functor
7 years ago
Xin Pan
eb46845313
Add warning
7 years ago
Yiqun Liu
a032f56f7c
Add profiling information for inference example ( #8748 )
...
* Add profiling information for inference example, recognize digits.
* Refine the profiling method.
* Correct the use of RecordEvent and simplify recognize_digits.
7 years ago
qingqing01
ded34b2c0f
Fix detection_map_op for multi-device. ( #8845 )
7 years ago
kexinzhao
7f00716c87
Add context wait in type_transform ( #8850 )
7 years ago
Tao Luo
6f50dee4d5
compile and install the static library of fluid inference ( #7827 )
...
* compile and install the static library of fluid inference
* fix dynload_cuda not in CPU mode
* update shared library and adjust the deploy of openblas
* adjust the deploy of openblas
* * auto add all fluid modules for static library
* use libprotobuf.a instead of libprotobuf-lite.a for profiler
* use set_property to set the global variable instead of ENV
* add gpu depends of fluid modules, auto add inference_lib_dist depends
* change the condition of openblas_lib, and fix a typo
7 years ago
Yu Yang
72be7a6151
Complete RecordIO reader op
7 years ago
fengjiayi
b1f647fd6d
fix errors
7 years ago
zhouhanqing
732eebb286
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Paddle-ReduceProd
7 years ago
zhouhanqing
15306ffdc3
add product reduction for reduce_op
7 years ago
fengjiayi
e8d21b6349
fix an error
7 years ago
fengjiayi
4fb7b96756
Add basic double buffer reader
7 years ago
Luo Tao
49f3f1db07
add back framework_proto depends
7 years ago
Luo Tao
3ddc997182
rename concat_functor to concat, refine CMakeLists based on comments
7 years ago
Luo Tao
1ef97fa7b1
Merge branch 'develop' into math_function
7 years ago
Yu Yang
bcb80756af
Add Writer/Scanner
...
Make vec<Tensor> can be serialized to RecordIO
7 years ago
chengduo
84aea8a8a1
Merge pull request #8669 from chengduoZH/feature/concat_op
...
Refine concat_op
7 years ago
pzelazko-intel
8c71adaa8c
MKLDNN conv2d kernel added ( #8451 )
...
* MKLDNN conv2 OP kernel added
* TODOs added
* mkldnn conv2d OP refactor
* CanCUDNNBeUsed and CanMKLDNNBeUsed moved
7 years ago
Yan Chunwei
049383c615
add inplace to reshape ( #8747 )
7 years ago
fengjiayi
77200a7057
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_double_buffer_for_cpp_reader
7 years ago
fengjiayi
af64f39bfd
fix compile errors
7 years ago
Yu Yang
42e65a20c6
Merge pull request #8791 from reyoung/feature/extract_reader_ops
...
Extract create_reader_op to three files
7 years ago
Yu Yang
10343123e3
Merge branch 'feature/extract_reader_ops' into feature/recordio_file_reader
7 years ago
Yu Yang
f1d61e675c
Add magic number in recordio
7 years ago
chengduoZH
8b30fadac3
refine elementwise sub,div,min,max
7 years ago
Yu Yang
4690b9c987
FIX CI
7 years ago
chengduo
c43995ed36
Merge pull request #8810 from chengduoZH/feature/refine_elementwise_mul
...
[Speed]Refine elementwise_mul_op
7 years ago
kexinzhao
266ccaa843
Integrate float16 into data_type_transform ( #8619 )
...
* test cpu float16 data transform
* add isnan etc
* small fix
* fix containsNAN test error
* add data_type transform GPU test
* add float16 GPU example
* fix error
* fix GPU test error
* add context wait
7 years ago
Abhinav Arora
78c884d7a7
Redesign channel implementation for Select Op ( #8814 )
...
* Redesign channel implementation for Select Op
* Remove unnecessary header
* Remove unnecessary comments
7 years ago
chengduoZH
a1331f9877
refine elementwise_mul_op
7 years ago
fengjiayi
3fcd16ede3
init double buffer
7 years ago
qingqing01
0e1f82fd2a
Fix bug in detection mAP evaluator. ( #8778 )
...
* Fix mAP evaluator bug.
* Fix bug in detection mAP evaluator.
* Fix unit testing.
* Support to set background label index in detection mAP op.
7 years ago
Yu Yang
4d8345e3ac
Extract create_reader_op to three files
7 years ago
Yu Yang
82b149cd29
Merge pull request #8780 from reyoung/feature/recordio
...
Feature/recordio
7 years ago
whs
f608bb2756
Merge pull request #8651 from wanghaoshuang/fix_pool
...
Add ceil_mode option for pool2d and pool3d
7 years ago
Xin Pan
f10152df78
Fix nullptr when doing nested profiling
7 years ago
Xin Pan
cf6244c1b8
Improve profiler
...
smaller binary proto
avoid untrackable kernel
7 years ago
Yu Yang
9dc69582de
Make recordio simple
7 years ago
Xin Pan
c7b7291b9f
Merge pull request #8758 from panyx0718/nccl
...
[Speed]Avoid init_nccl for every steps.
7 years ago
Xin Pan
a4d68ed3d8
Add lock
7 years ago
kexinzhao
6720681cc2
Enable is_test attr of batch norm and drop out op for test program ( #8642 )
...
* fix is_test issue
* add paddle enforce
* fix bug
* add new func
* small fix
* address comments
7 years ago
qingqing01
608feea204
Implement detection mAP evaluator wrapper and unify label format between SSD loss and mAP evaluator ( #8736 )
...
* Implement mAP evaluator Python interface.
* Fix unit testing and unify label format between SSD loss and mAP evaluator.
* Update doc.
7 years ago
chengduoZH
c3864eab99
if axis == 0; directly copy D->D
7 years ago
dongzhihong
fe18341585
"seperate internal library and exported library"
7 years ago
dongzhihong
7364348d04
"move from recordio repo to paddle"
7 years ago
chengduoZH
131ec276ed
fix bug for big number; float->double and code refine
7 years ago
Xin Pan
d054cfeae6
Avoid init_nccl for every steps.
7 years ago
dongzhihong
7016979cd5
"add crc32 encoder"
7 years ago
dongzhihong
69c7991108
"add snappy library"
7 years ago
chengduoZH
82bd82c186
follow comments and refine code
7 years ago
Tao Luo
b02e12e8e4
Merge pull request #8740 from luotao1/warning
...
fix warning: statement is unreachable
7 years ago
Luo Tao
ac149bc948
fix warning: statement is unreachable
7 years ago
qingqing01
457d012a8e
Enable device automatically switching in mine_hard_examples_op. ( #8706 )
7 years ago
dongzhihong
6540cda109
Merge remote-tracking branch 'origin/develop' into feature/recordio
7 years ago
dongzhihong
af5dcda481
"add testing"
7 years ago
chengduoZH
00e596edbe
get max threads of GPU
7 years ago
Luo Tao
8c691c8de0
fix only shared variables could be declared as static in the device code
7 years ago
typhoonzero
f94a758c16
fix fluid distribute build
7 years ago
Luo Tao
f67275a920
refine operator/math/CMakeLists.txt, separate im2col from math_function
7 years ago
Xin Pan
92974d4186
Merge pull request #8679 from panyx0718/test2
...
Add CPU time and MemCopy to the timeline.
7 years ago
Xin Pan
f3cbfc021c
Add MEMCPY information
7 years ago
chengduoZH
60e7ee0611
refine concat_op
7 years ago
qingqing01
9e1ec8c919
Enable device switching automatically for several operators ( #8684 )
7 years ago
Xin Pan
55b2d3d032
Add CPU time to the timeline.
7 years ago
Xin Pan
6720198731
Merge pull request #8663 from panyx0718/test2
...
First timeline version
7 years ago
Abhinav Arora
01536e8dc5
Adding more unit tests for ChannelHolder class ( #8668 )
7 years ago
chengduo
12a3cea087
Add tuple type ( #8519 )
...
* add the type of tuple
* add lod_tensor to tuple
7 years ago
Xin Pan
12843a3a53
First timeline version
7 years ago
Yu Yang
db77006923
Merge pull request #8657 from reyoung/feature/fix_compile
...
Fix the compilation on CUDA 9.1/GCC 5.3
7 years ago
emailweixu
eac2c3cf36
Merge pull request #8505 from emailweixu/math_op
...
Correctly handling variable with batch dimension for math ops.
7 years ago
Abhinav Arora
0d878e4c09
Add Go_op, Channel_create, channel_close, channel_send and channel_receive ops ( #8593 )
...
* Adding Python boilerplate code for Go op
* Add very basic test case
* Adding the python logic for go routine
* Fix syntax
* Changing test to notest
* Rename Routine to Go
* Combining GoGuard and Go in one class
* Modify test
* Adding fluid close channel
* Fixing __init__.py for calling fluid.go()
* Adding stubs for channel methods and updating test case
* Removing import *
* Adding imports from concurrency
* Initial commit of GO_OP (for varun)
* Creating local scopes and go through them
* Updated go op inputs persistability enforcement
* Add thread execution; compile failing though
* Fix go op
* Cleaned up Go op
* Fix yapf format issue
* Readd warp ctc dir for unit tests
* Updated make_channel, channel_send, channel_recv and channel_close
* Moved thread function to another method, update unit tests
* remove output var
* Add stubs for channel operators
* Updating concurrency with signatures
* Updated the signature with return status
* Fixed dtype in variables
* Updating stub of ChannelSend + add infershape
* Updating stub of ChannelRecv + add infershape
* Updated signature
* Adding the channel_create operator
* Merge channel send+receive ops
* Update concurrency tests using all operators
* Updating the create op with ChannelHolder
* Fix issues with channel_create_op
* Add the implementation for channel_close op
* Add channel close operator, fix channel close op
* Adding the channel_send op
* Comment channels C++ and Python code
* Concurrency python api comment fix
* Update unit test to add Status variable
* Adding channel receive operator
* Update concurrency test to demonstrate a complete CSP flow
* Fix clang-format issues
* Fixed "Out" parameter name
* Fixing merge conflict in framework.py
* Add channel ops to framework.py no_kernel_op_set
* Separating channel_send and channel_recv operators
* Documenting capacity type
* Update concurrency test to create go block as child block of main program
* Changing set status implementation
7 years ago
Yancey
718642e93f
Merge pull request #8659 from Yancey1989/fix_dist_bug
...
Registry var type infer in split_selected_rows op
7 years ago
chengduo
e9f2033175
Merge pull request #8539 from chengduoZH/feature/refine_elementwise_op_function.h
...
Refine Sum in elementwise_op_function
7 years ago
Yancey1989
7bd16fe13b
registry var type infer
7 years ago
Yuan Gao
e84615bae6
Fix box coder op ( #8647 )
...
* fix ssd problems
* fix box decoder op
* fix dimension problem in detection tests
* update detection doc
* Update detection doc
* Update detection doc
* update detection doc
* update detection doc
7 years ago
wanghaoshuang
0ffec514cd
Fix comments
7 years ago
Yu Yang
22b5c07a7d
Fix the compilation on CUDA 9.1/GCC 5.3
...
* Make CUPTI_LIB_PATH not passing by macro.
* Add missing header
7 years ago
Yibing Liu
ace512a3ed
Merge pull request #8596 from kuke/profiler_multi_gpu
...
Fix the profiler's bug in multi-gpu mode
7 years ago
wanghaoshuang
1e510d9914
Add ceil_mode option for pool2d and pool3d
7 years ago
Yibing Liu
c0876cf686
update due to upstream's change
7 years ago
dzhwinter
313454dfae
"init"
7 years ago
chengduoZH
90dc33b5ff
Add todo for reduceSum
7 years ago
chengduo
cf883d9caf
Merge pull request #8613 from chengduoZH/feature/fix_bug_conv_op
...
Fix conv_op bug
7 years ago
qingqing01
f449180b1c
Register more data type for reshape operator. ( #8617 )
7 years ago
chengduoZH
a779b42450
follow comments
7 years ago
xuwei06
e9b8ebf42c
Correctly handling variable with batch dimension for math ops.
...
When the second argument contains batch dimension, the axis should be 0.
Also makes elementwise ops more tolerant at handling tensors with trailing
singular dimensions.
7 years ago
xuwei06
7d56c6d078
Adding Dim<0>
...
Dim<0> is for scalar (rank-0 tensor). Adding Dim<0> can simplify
a lot of code.
7 years ago
chengduoZH
b5c920920c
fix conv_op bug
7 years ago
Luo Tao
6dd3a61b2f
combine batch_size_like.cc into batch_size_like.h
7 years ago
Xin Pan
decaad5ccc
Merge pull request #8542 from panyx0718/test
...
Extend current profiler for timeline and more features.
7 years ago
Tao Luo
1ac31d3d08
Merge pull request #8591 from chengduoZH/feature/refine_cmake_for_cudnn
...
Refine cmake for cudnn op
7 years ago
chengduoZH
62fe2f28a1
follow comments
7 years ago
qingqing01
4948f7b3fe
Enhance bipartite_match_op to support argmax matching after bipartite matching. ( #8580 )
...
* Enhance bipartite_match_op to support argmax matching after bipartite matching.
* Fix typo error.
7 years ago
Yibing Liu
ee88855ddf
Fix the profiler's bug in multi-gpu mode
7 years ago
Tao Luo
dce0383f89
Merge pull request #8404 from Xreki/core_refine_inference
...
Refine the inference API and unittests
7 years ago
chengduoZH
16fc5e3893
refine cmake for cudnn
7 years ago
武毅
0250e8083d
Merge pull request #8586 from Yancey1989/fix_dist_unittest
...
Fix send_recv unit test
7 years ago
Xin Pan
9bbce49353
Fix version date.
7 years ago
Xin Pan
b9ec24c6e9
Extend current profiler for timeline and more features.
7 years ago
dzhwinter
34605d2641
accelerate the cuda concat op, avoid many times copy ( #8585 )
...
* "try enhance concat op"
* "enhance the concat operator"
7 years ago
Yancey1989
7a1d6ae5f6
Fix send_recv unit test
7 years ago
typhoonzero
6a686794ac
fix comment typo
7 years ago
Liu Yiqun
efb6ba3531
Merge branch 'develop' into core_refine_inference
7 years ago
typhoonzero
a50230add6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_raw_var_type
7 years ago
chengduoZH
b8938b448c
refine Sum
7 years ago
Luo Tao
7453312dee
fix error python path when building framework_py_proto
7 years ago
Luo Tao
bb36084949
fix error directory of fluid inference unittest
7 years ago
chengduoZH
a82883922e
follow comments
7 years ago
QI JUN
44e3015412
fix nccl version ( #8540 )
...
* fix nccl version
* enable nccl test
7 years ago
Qiao Longfei
b2f530e3cc
Merge pull request #8532 from jacquesqiao/fix-compare-op
...
Fix compare op
7 years ago
chengduoZH
22b9ab052d
refine Sum
7 years ago
typhoonzero
9b5be6ef43
fix short connection again
7 years ago
Yu Yang
eebfb71d3f
Merge pull request #8524 from reyoung/feature/unique_name_guard
...
Moving unique_name to python
7 years ago
qiaolongfei
28d07e3cb8
add python part of compare op
7 years ago
qiaolongfei
d4e3495cf5
add larger_than and larger_equal op and kernel
7 years ago
qiaolongfei
bad01596f9
rename register macro
7 years ago
Yu Yang
e8cb97b8a2
Moving unique_name to python
...
* Add reset and guard to unique_name
7 years ago
Yu Yang
7a9098a60e
Add block.fwd_block_id ( #8489 )
...
* Add block.fwd_block_id
* fix bug in memory optimization transpiler
* Change DFS to BFS
* Add comments
7 years ago
Abhinav Arora
6e7fee0ee1
Add unit tests for ChannelHolder ( #8510 )
7 years ago
chengduo
0e187bc93e
fix get_mid_dims annotation ( #8490 )
7 years ago
kavyasrinet
77ee8fb240
Exposing Channel to be used as a Variable and integrating with Fluid ( #8486 )
...
* Adding set_capacity method support
* Adding Python for make_channel
* Updating notest_concurrency
* Write python for make_channel method
* Write python for make_channel method
* Fix make_channel and test
* Placeholder ops for channel send, recv and close
* Adding ToTypeIndex method to var_type.h
* Add var_type.h to channel:
* Added POD_Type to the method
* Add CHANNEL to executor
* Updated get and set DataType to accommodate Channels
* Updating get and set to incorporate channels
* Adding CHANNEL as supported VarType in protobuf
* Removing unnecessary import
* Fixing VarDesc to adapt to Channel as VarType
* Add channel.h to executor
* Remove include from channel
* Updated var_type to support Channel as var type
* Adding get_channel to pybind
* Added ChannelHolder
* Adding make_channel as an op
* Adding ChannelHolder in channel
* Fixing typo
* Commenting out operators in concurrency
* Removing totypeid right now since we don't need it.
* Reverting python changes
* Fixing typo in framework.py
* Modify comments for ReaderHolder
7 years ago
Yu Yang
88c22e9d1a
Speed up elemwise grad ( #8402 )
...
* Speed up elemwise grad
* Fix bug
* Add macro for MAX_BLOCK_DIM
7 years ago
Yu Yang
d50016b2a7
Remove build warnings in float16.h ( #8481 )
7 years ago
武毅
c490f1b38b
Merge pull request #8049 from typhoonzero/no_counter_on_pserver
...
Enhancement/transpiler rename grad vars to add trainer id, so RPC call can be retried.
7 years ago
dzhwinter
46e4f6ffab
small fix
7 years ago
Yang Yang(Tony)
3c2cafbe1d
fix parallel do hard coded empty var name ( #8469 )
7 years ago
helinwang
633756ad13
Merge pull request #8361 from tonyyang-svail/backward_on_parallel_do
...
Backward on parallel do using nccl
7 years ago
Siddharth Goyal
a040239d3a
Add conv test case for inference-recognize digits ( #8466 )
7 years ago
Yang Yang
ae69f0b04f
merge develop
7 years ago
kavyasrinet
bd58bf3e03
[WIP] Expose Channel in Python and add to VarType ( #8465 )
...
Add Channel as a VarType
7 years ago
Yang Yang
ec01f635f5
merge develop
7 years ago
helinwang
6566d7c740
Merge pull request #8458 from tonyyang-svail/rename_output
...
change outputsize func name
7 years ago
Abhinav Arora
c7ad26d6a4
[WIP] Move DataType enum inside VarType ( #8447 )
...
* Move Pod Types from DataType enum to Type enum
* Fixed data_type.h
* Fix type in TensorDesc
* Add comment to framework.proto
* Fixed type in data_type.h
* Updated format of type in data_type.h
* Fix var_desc.h
* Fix op_kernel_type.h
* Fixed data_type_transform_test.cc
* Fix operator.h
* Fixed data_type_transform.cc
* Fixed op_kernel_type_test.cc
* Fix operator.cc
* Fixed data_layout_transform_test.cc
* Fix var_desc.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* fixed protobuf.cc
* Fix data_layout_transform_test.cc and op_kernel_type_test.cc
* Fixed rnn_memory_helper_op.cc
* Fix program_desc_test.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fix operator_test.cc
* Fixed fill_constant_op.cc
* Fixed gaussian_random_op.cc
* Fixed uniform_random_op.cc
* Fixed edit_distance_op.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fixed rnn_memory_helper_op.cc
* Fixed chunk_eval_op.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* Fixed cast_op.h
* Fixed cast_op.h
* Fix fill constant op
* Fixed clang for assign_value_op.cc
* Fix one_hot_op.h
* Fix one_hot_op.cc
* Fix fill_op.cc
* Fixed sum_op.cc
* Fixed sum_op clang
* Fix uniform_random_op.cc
* Fix gaussian_random_op.cc
* Fix backward.cc
* Fix protobuf.cc
* Fixed prune_test.cc
* Fixed op_registry_test.cc
* Fix data_device_transform_test.cu
* Fix travis error
* Fixed one_hot_op.cu
* Fixed op_registry_test.cc
* Fixed nccl_op.cc
* Fixing python tests
* Revert "Fixing python tests"
This reverts commit fccaa4c5818ed9f379ea1ce4315066cc78076c64.
* Fixing Pybind to remove data type
* Fixing tensor.py
* Updated the new files:
* Resolve error in merge conflict of fill_constant_batch_size_like_op.cc
7 years ago
kexinzhao
74e0eb7267
make float16 a pod type ( #8456 )
7 years ago
Yang Yang
cb06337f9e
change outputsize func name
7 years ago
Yang Yang
5229ccbdc7
merge develop
7 years ago
Yang Yang
1d9fd1c006
pass test_recognize_digits
7 years ago
Yu Yang
89ead8d151
Feature/understand sentiment parallel do ( #7994 )
...
* Support parallel test for understand_sentiment
* Full test on understand_sentiment
* Skip normal tests
* Debug CI
* Enable benchmark
* Revert init.cc
* Make CI pass
7 years ago
emailweixu
4f122c0760
Remove incorrect statement in compare_op.h ( #8416 )
...
The type of tensor z should be bool. And there's no need to call mutable_data because ElementwiseComputeEx will do it.
7 years ago
emailweixu
6752b06f8c
Generating random numbers with given batch size ( #8337 )
...
* Generating random numbers with given batch size
uniform_random_batch_size_like_op
gaussian_random_batch_size_like_op
* More comments about random seed.
* Move test_*_random_batch_size_like_op to unittests
7 years ago
Yang Yang
9d26f1a3df
callback to list of callbacks
7 years ago
Yi Wang
cfffb1a362
Update tensor_util.h ( #8422 )
...
* Update tensor_util.h
* Update with moved TensorDesc
* Fix tensur_utils.cu
* Update
* Update
* Update
* Update
* Make tensor_util.cu a symbolic link
7 years ago
Yang Yang
3f09620ef2
pass compile
7 years ago
Yang Yang
e021ad674e
Merge remote-tracking branch 'upstream/develop' into backward_on_parallel_do
7 years ago
Yang Yang(Tony)
87f4311a88
compile with nccl2 ( #8411 )
...
* compile with nccl2
* add ncclGroup; it is necessary in nccl2
* add back libnccl-dev
7 years ago
typhoonzero
ebb2bcfe0a
remove comments
7 years ago
typhoonzero
1494673dcb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
typhoonzero
8acad27e8d
refine code
7 years ago
QI JUN
8b01546fe4
fix compile warning ( #8430 )
7 years ago
fengjiayi
ed5dc3d4ae
remove 'friend lod_tensor in tensor'
7 years ago
Yang Yang
7129fa3c90
merge develop
7 years ago
emailweixu
3beafff282
Merge pull request #8415 from emailweixu/print_op
...
Make print_op able to show the value of bool tensor
7 years ago
Abhinav Arora
fcadb45251
Separate VarType from VarDesc in framework.proto and fix all related compiler errors ( #8414 )
...
* Refine Type system
* Fixing type inference
* Fixed create_reader_op.cc
* Fix var_desc.h
* Fixed executor.cc
* Fix shape_inference.h
* Fixed create_reader_op.cc
* Fix tensor_util.h
* Fixed var_type_inference_test.cc
* Fix shape_inference.cc
* Fixed sum_op.c
* Fixed read_op.cc
* Fix var_type.h
* Fixed beam_search_decode_op.cc
* sendrecvop_utils.cc
* Fix operator.cc
* Fixed lookup_table_op.cc
* Fixed op_desc.cc
* Fixed get_places_op.cc
* Fixed lod_rank_table_op.cc
* Fixed beam_search_op.cc
* Fix var_desc.cc
* Fixed lod_tensor_to_array_op.cc
* Fixed while_op.cc
* Fix program_desc_test.cc
* tensor_array_read_write_op.cc
* Fix assign_op.cc
* Fix executor.cc
* Fix protobuf.cc
* Fix protobuf.cc
7 years ago
Yang Yang
a259ad41b0
remove duplicated cbegin and cend in mixed vector
7 years ago
Yang Yang
5f343e3eb0
Merge remote-tracking branch 'pr/8411' into backward_on_parallel_do
7 years ago
Yang Yang
da97d9dfd6
merge develop
7 years ago
xuwei06
004df46f28
Make print_op able to show the value of bool tensor
...
And some minor fixes on comments.
7 years ago
kexinzhao
f82fa64a06
Move float16 into fluid folder ( #8394 )
...
* move float16 into fluid
* fix include
* move to platform folder
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
武毅
c0fc50d5f3
Merge pull request #8409 from typhoonzero/fix_grpc_short_conn
...
Fix grpc short connection
7 years ago
dzhwinter
07923ba006
Memory/dropout4 ( #8407 )
...
* "merge random generator kernel and mul"
* "fix dropout"
7 years ago
Liu Yiqun
2d74b5f9ba
Refine the Python API load/save_inference_model.
7 years ago
typhoonzero
0d4d9c4e13
fix grpc short connection
7 years ago
Liu Yiqun
b44917d09b
Implement IsPersistable() in c++.
7 years ago
qingqing01
057efd1709
Implement multibox loss wrapper for SSD in Python API. ( #8385 )
...
* Implement multibox loss wrapper in Python API.
* Add some wrappers for SSD detection.
* Fix conflicts.
* Add unit testing for SSD loss wrapper.
* Update doc in Python API.
* Refine unit testing.
* Add more unit testing and update some interface arguments.
7 years ago
typhoonzero
4b91cb5289
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
武毅
9942565fe7
Merge pull request #8386 from typhoonzero/fix_dist_transpiler_develop
...
Fix dist transpiler develop
7 years ago
Liu Yiqun
f95e05a388
Refine the inference unittests.
7 years ago
typhoonzero
dca9941e4b
pass size when copy
7 years ago
Liu Yiqun
899ba0d05a
Merge branch 'develop' into core_refine_inference
7 years ago
Liu Yiqun
c796e013c6
Refine the inference unittests.
7 years ago
Yu Yang
5046869e84
Merge pull request #8287 from tonyyang-svail/operator_set_device
...
Correctly handle cuda place for operators
7 years ago
chengduo
7757a8ad7c
Merge pull request #8265 from chengduoZH/feature/add_prior_box_py
...
Add Python interface of prior_boxes
7 years ago
Yu Yang
37792e546b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into HEAD
7 years ago
qingqing01
8a0dd2409e
Expose softmax_with_cross_entropy and smooth_l1 into Python API. ( #8375 )
...
* Add softmax_with_cross_entropy and smooth_l1 in Python API.
* Fix doc format.
7 years ago
Wang Hao
a824da91e6
Merge pull request #6588 from wanghaox/detection_map
...
detection map evaluator for SSD
7 years ago
chengduoZH
dff1bf33c9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prior_box_py
7 years ago
Qiao Longfei
e9d3099145
Merge pull request #8357 from jacquesqiao/override-compare-op-in-python
...
override comparison operators in Python for Variable
7 years ago
wanghaox
91a2188301
update detection_map
7 years ago
typhoonzero
67d6f3a8ce
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_transpiler_develop
7 years ago
typhoonzero
9a05c90750
fix StridedNumelCopyWithAxis
7 years ago
chengduoZH
5ccab2dc65
remove conflict
7 years ago
Yu Yang
6f625f9c2f
Disable unstable unittest
7 years ago
Yu Yang
892cc28c7b
Fix bug
7 years ago
Yu Yang
07bb413977
Revert changes
7 years ago
Yu Yang
da757696de
Merge branch 'develop' into feature/fix_get_empty_lod
7 years ago
Yu Yang
18efe5aa1d
Fix CI
7 years ago
Yancey
caf9a09d7b
Merge selected rows with dynamic variable count ( #8023 )
...
* dynamic send/recv selected rows
* update by comment
* fix by comment
7 years ago
typhoonzero
11bcb43a44
fix merge issue
7 years ago
typhoonzero
21071f7106
no create trainer var on listen_and_serv
7 years ago
Yu Yang
a43fac3567
Fix empty Vector foreach
...
Fix #8368
7 years ago
Yancey
525a4fdae8
Merge pull request #8270 from Yancey1989/improve_concat_split_op
...
Improve split and concat op
7 years ago
Yang Yang
f35401c4da
disable debug string due to vector bug
7 years ago
Yang Yang
0c45eab7ff
no getmutable nccl_com
7 years ago
Tao Luo
b56f4a4ee2
move code from /paddle/string to /paddle/fluid/string ( #8363 )
7 years ago
Yu Yang
9029a9d9c6
Fix constructor bug in mixed_vector ( #8364 )
...
* Fix constructor bug in mixed_vector
* Fix warnings
* Clean code
* Extract for-loop init. Make nvcc happy
7 years ago
Yang Yang
0e2deaa5fd
Merge remote-tracking branch 'pr/8364' into backward_on_parallel_do
7 years ago
whs
274f4e94af
Merge pull request #8334 from wanghaoshuang/fix_seq_expand
...
Fix output dims of sequence expand op
7 years ago
Yancey
279aa626ab
Move learning rate and related op to pserver ( #8209 )
...
* dist train support lr decay
* update by comment
* revert elementwise method creator
* delete comment
7 years ago
Yu Yang
190119bb98
Extract for-loop init. Make nvcc happy
7 years ago
wanghaox
006ef1fd7a
migrate detection_map code directory
7 years ago
Yu Yang
ae2296e806
Clean code
7 years ago
qingqing01
72bcf72c66
Refine target_assign_op to unify the classification and regression targets assigning. ( #8326 )
...
* Refine target_assign_op to unify the classification and regression targets assignment.
* Fix the unit testing.
* Fix conflicts.
7 years ago
Yu Yang
816fa8f32e
Fix warnings
7 years ago
Yu Yang
82c33c61d9
Fix constructor bug in mixed_vector
7 years ago
typhoonzero
b0096361c7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into no_counter_on_pserver
7 years ago
qiaolongfei
6f78cb9969
add not_equal
7 years ago
Yancey1989
235332544c
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into improve_concat_split_op
7 years ago
qingqing01
d38b869438
Fix the input dimension for multiclass_nms_op. ( #8232 )
7 years ago
chengduoZH
282dadb0c6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prior_box_py
7 years ago
Yang Yang
3067114f3a
clean up
7 years ago
Yang Yang
cd9e660d4b
merge develop
7 years ago
Yi Wang
23bbaada95
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into nccl2
7 years ago
kexinzhao
e800597bcf
Fix include path in inference test codes ( #8349 )
...
* fix absolute include path
* Remove test_helper.h in old location
* update include path
7 years ago
Yi Wang
40c7972d46
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into operator_set_device
7 years ago
Yi Wang
a51c8ccc16
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into fix_seq_expand
7 years ago
Luo Tao
77f04fd97a
move paddle/pybind/pybind.h to paddle/fluid/pybind/pybind.h, and cancel the test_parallel_op temporary
7 years ago
Yi Wang
a6c1bffa47
Merge with upstream
7 years ago
Yi Wang
bc7be8320e
Update pre-commit
7 years ago
Yi Wang
35e61b3e7e
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into move_to_fluid
7 years ago
Yi Wang
154368b1a0
Re-add the lack pybind.h
7 years ago
Yi Wang
fc374821dd
Correct #include path
7 years ago
Yi Wang
90648f336d
Move file to fluid/; Edit CMakeLists.txt
7 years ago