Yibing Liu
54c160aa72
Merge branch 'develop' of upstream into add_lstmp_doc
7 years ago
Yang Yu
8d6f827a0d
Merge branch 'develop' of github.com:baidu/Paddle into feature/fast_ci
7 years ago
chengduo
f20f0eb7cf
Merge pull request #7924 from chengduoZH/enhancement/dtype2str
...
Add DataTypeToString
7 years ago
Qiao Longfei
59357f4fb9
fix floor_op ( #7926 )
7 years ago
Yi Wang
4fb3c676a8
Polish threadpool ( #7918 )
...
* Polish threadpool
* Add #include <vector>
* Rename variables
* Rename variables
* clang-format
7 years ago
chengduoZH
e3952b9fae
fix unit test
7 years ago
chengduoZH
0f47703dd5
add begin_norm_axis
7 years ago
chengduoZH
a0669e387b
dtype2str
7 years ago
Yibing Liu
634faab1c0
Format doc & add unit test for dynamic_lstmp api
7 years ago
gaoyuan
72eccb238e
add box coder op
7 years ago
Yancey
0f0ce4e5ec
Fix cpplint ( #7914 )
7 years ago
chengduo
a026f52b73
refine channel ( #7910 )
7 years ago
chengduoZH
4ce397964b
fix unit test and c++ code
7 years ago
武毅
8a6a339eb4
Merge pull request #7889 from helinwang/send_err
...
Fix send op data race
7 years ago
kexinzhao
9b6387e7ae
address comments ( #7900 )
7 years ago
kavyasrinet
06e226378f
Fix Latex ( #7901 )
7 years ago
Kexin Zhao
2f9cf7ccae
address comments
7 years ago
chengduo
9beec1212b
Add Channel ( #7442 )
...
* add Channel
* refine Channel
7 years ago
Liu Yiqun
eca58a6200
Add unittest for GPU.
7 years ago
Yang yaming
84c12c6edc
Add one_hot operator. ( #7819 )
...
* Add one_hot operator.
* Add more unit tests.
7 years ago
Liu Yiqun
438aad24a5
Update the inference unittest using the new Executor.Run().
7 years ago
Liu Yiqun
2cf56367b8
Merge branch 'develop' into core_add_inference_unittest
7 years ago
kexinzhao
788f5c6d43
New Run() method for framework::Executor ( #7807 )
...
* initial commit
* add new executor run function
* fix bug
* fix multiple definition of feed_fetch_method issue
* fix cmake
* fix tensor copy error
* refine executor code
* add comments
* temporary modification
* address comments
* fix bug
7 years ago
Kexin Zhao
a671879759
remove unreferenced vars
7 years ago
Kexin Zhao
6592ea1568
initial commit
7 years ago
Helin Wang
0e1109cdf3
Fix send op data race
...
std::vector<bool> is not safe for concurrent writes, even to different indices.
More discussion:
https://stackoverflow.com/questions/48452611/is-stdfuturewait-a-memory-barrier-i-can-not-explain-this-data-race
7 years ago
chengduoZH
ae0ea54159
fix unit test
7 years ago
xzl
b5ea048395
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_depthwiseConv_op_gpu
7 years ago
xzl
aece290599
add depthwise gpu forward, backward, test, interface
7 years ago
QI JUN
7333df8510
fix pool_op bug ( #7879 )
7 years ago
Liu Yiqun
20d3af62d0
Enable the dependency.
7 years ago
Yang yaming
7278aa7b1c
Merge pull request #7739 from pkuyym/fix-7717
...
Add python wrapper for multiplex operator.
7 years ago
Yang yaming
b66f1ada97
Merge pull request #7792 from kuke/add_lstmp
...
Add lstm with recurrent projection layer operator
7 years ago
Yang Yu
5594456a4b
Add build_type to build.sh
7 years ago
Liu Yiqun
f6f7102b1c
Merge branch 'develop' into core_add_inference_unittest
7 years ago
Liu Yiqun
250206d1cf
Change the example of inference to a unittest.
7 years ago
Yu Yang
b456517214
Merge pull request #7821 from reyoung/feature/add_demo_for_parallel.do
...
Feature/add demo for parallel.do
7 years ago
yangyaming
9d826e1c65
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-7717
7 years ago
Yang Yang(Tony)
1d2c99c336
fix nccl root number ( #7842 )
...
* fix call once logic
* clean up
* further clean up
* fix root number
7 years ago
Yang Yang(Tony)
25cb906403
Fix call once logic ( #7839 )
...
* fix call once logic
* clean up
* further clean up
7 years ago
Yang Yu
cf1254df23
Merge branch 'develop' of github.com:baidu/Paddle into feature/add_demo_for_parallel.do
7 years ago
Darcy
f9fe48e00c
Merge pull request #7835 from putcn/fix_build_cblas_issue
...
fix openblas build
7 years ago
kexinzhao
5009f7c12f
Fix save load inference model and remove pickle ( #7712 )
...
* remove pickle dependency
* fix bug
* small fix
* modify executor.py for save and load
* clean code
* Add usage example
* refine executor run function
* fix bug
* refine executor code
* fix block bug
* fix comments
* fix bug
* fix pass num
7 years ago
Xi Chen
8360ff0b32
fix openblas build
7 years ago
Yibing Liu
9ecc54a11b
Remove redundant code in unit test
7 years ago
yangyaming
8314412b0a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-7717
7 years ago
helinwang
1ab1181460
async send/recv, serialize/deserialize using threadpool. ( #7705 )
...
* async send/recv, serialize/deserialize using threadpool
* implement parallel deserialization correctly
7 years ago
Yang Yu
f044b23fac
Merge branch 'develop' of github.com:baidu/Paddle into feature/add_demo_for_parallel.do
7 years ago
Yu Yang
0c5cbc4e55
Merge pull request #7817 from reyoung/feature/rename_is_compile_gpu_to_is_compiled_with_cuda
...
Rename is_compile_gpu to is_compiled_with_cuda
7 years ago
Yibing Liu
76beff86a0
Make the projection activation configurable
7 years ago
Cao Ying
32a5dfd345
Merge pull request #7791 from lcy-seso/multihead_attention
...
Add the wrapper for multi-head scaled dot product attention.
7 years ago
typhoonzero
35b4d42ab6
merge doc fixes
7 years ago
typhoonzero
e163fd305f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into recv_op_python_with_guard
7 years ago
Yu Yang
e6a4facfd7
Merge pull request #7782 from reyoung/feature/disable_unstable_unittest
...
Disable the unstable unittest
7 years ago
Yang Yu
ef55a8f608
Polish english
7 years ago
Yang Yu
a06205f569
Add demo for parallel.do
...
Unify the recognize_digits
7 years ago
Yang Yu
d0a0475736
Rename is_compile_gpu to is_compiled_with_cuda
...
The English of the previous API is bad.
7 years ago
typhoonzero
b9d9b11c80
remove recv_op input
7 years ago
qingqing01
750299f577
Merge pull request #7576 from qingqing01/profiling_py
...
Refine profiler and expose to Python.
7 years ago
yangyaming
f3fe41078a
Fix conflicts and add more supported dtype.
7 years ago
chengduoZH
ca0177190f
add layer_norm
7 years ago
whs
42549fb792
Merge pull request #7811 from wanghaoshuang/fix_im2seq
...
Fix LoD calculate of im2seq
7 years ago
Yibing Liu
db1f6a591a
Update doc in lstmp_op
7 years ago
Yu Yang
9609c17a01
Merge pull request #7754 from tonyyang-svail/7724
...
Make Parallel.Do support sparse gradients
7 years ago
Yang Yu
9abd3b1e11
Add issue link
7 years ago
qingqing01
114d0c153e
Merge pull request #7707 from qingqing01/boost_cmake
...
Use ExternalProject_Add to download and install boost.
7 years ago
wanghaoshuang
f2a32dddbc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_im2seq
7 years ago
Tao Luo
95853fc1dd
Merge pull request #7806 from abhinavarora/fix_warnings
...
Fix warnings in priorbox, sequence_reshape and ctc_align_op
7 years ago
wanghaoshuang
1234b8b489
Fix setting lod
7 years ago
dangqingqing
f18016b958
Resolve conflicts.
7 years ago
ying
d163592a3f
Merge branch 'develop' into multihead_attention
7 years ago
Yang Yang
4c3dd8dc99
fix string name
7 years ago
Yang Yang
ed14528188
fix style
7 years ago
Yang Yang
b64dde2a03
fix share tensor
7 years ago
Abhinav Arora
f5517de7bc
Fix warnings in priorbox, sequence_reshape and ctc_align_op
7 years ago
Yibing Liu
3f3459d3d8
Merge branch 'develop' of upstream into add_lstmp
7 years ago
Yibing Liu
7a5b8ffacb
Pass grad checking for projection weight
7 years ago
Qiao Longfei
e7d44a2034
Nmt model ( #7340 )
...
neural machine translation model support beam search with while op
7 years ago
dangqingqing
a2b560d2cd
Fix the cmake dependence.
7 years ago
Yibing Liu
552c901204
Enable backward computation in lstmp_op
7 years ago
Cao Ying
9ae1523ee6
Merge pull request #7719 from guoshengCS/enhance-lookup_table_op-padidx
...
Enhance lookup_table_op to support padding_idx.
7 years ago
dangqingqing
920db1a2ef
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into boost_cmake
7 years ago
dangqingqing
608ebece36
Fix the dependence.
7 years ago
typhoonzero
bcc6740111
WIP python binding of send recv
7 years ago
Wang Hao
d43932c846
Merge pull request #7566 from wanghaox/iou_sim
...
add iou similarity operator
7 years ago
dangqingqing
0358fd0197
Refine profiler code.
7 years ago
ying
3be6c736f6
add multihead_attention.
7 years ago
QI JUN
d76fcb6fad
Memory optimization on Dynamic RNN ( #7599 )
...
* limit variable type to lod tensor in memory optimization transpiler
* refine policy
* support while operator
* fix random seed and training data order
* refine get_cfgs method to support multi while operators
* refine codes
7 years ago
wanghaox
fa10f03f92
update code
7 years ago
Wang Hao
81be9cef47
Merge pull request #6150 from wanghaox/prior_box
...
prior box operator for ssd
7 years ago
wanghaox
25a0193215
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into iou_sim
7 years ago
Yang Yu
97856d89ac
Disable the unstable unittest
7 years ago
Yang Yu
6f7b346f26
Merge branch 'develop' of github.com:baidu/Paddle into 7724
7 years ago
Yu Yang
b455502844
Merge pull request #7723 from reyoung/feature/wrap_nce_loss
...
Wrap NCE to python
7 years ago
wanghaox
ca2e96f270
update code
7 years ago
wanghaox
534cf74119
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prior_box
7 years ago
xzl
06db703843
../../../../../paddle/api
7 years ago
dangqingqing
e5fc79d88a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into boost_cmake
7 years ago
dangqingqing
e7700640e6
Remove Boost finding in CMake.
7 years ago
wanghaox
a05d25cf41
update code and doc, change input x to LoDTensor
7 years ago
Tao Luo
d57bca16f2
fix license format of variable_test.cc ( #7763 )
7 years ago
Yang Yu
deb7509820
Merge branch 'develop' of github.com:baidu/Paddle into feature/wrap_nce_loss
7 years ago
Yang Yu
df4133d0b3
Fix compile error in bipartite_match_op
7 years ago
Yang Yu
69074eadbb
Merge branch 'develop' of github.com:baidu/Paddle into feature/wrap_nce_loss
7 years ago
qingqing01
575289209f
Fix bug and unit test in bipartite_match_op.
7 years ago
dangqingqing
eaabf2aa9f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into profiling_py
7 years ago
wanghaox
d662e85f12
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prior_box
7 years ago
Yang Yang
f816e116b5
clean up
7 years ago
Yang Yang
e443d034d6
bring back share data
7 years ago
wanghaox
d4587959f9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into iou_sim
7 years ago
chengduo
9b1a17a835
Refine conv2d_transpose layer doc ( #6920 )
...
* refine conv2d_transpose layer doc
* fix conv2d_transpose doc
* fix doc
7 years ago
Yang Yang
0ed3f35919
pass parallel_do cbow
7 years ago
Guo Sheng
cd25adbe27
Merge pull request #7659 from guoshengCS/enhance-reduceOp-datatype
...
Register reduce_op_kernel with multiple data types
7 years ago
qingqing01
2b19a68cc9
Merge pull request #7695 from qingqing01/bipartite_match_op
...
Add bipartite matching operator and unit testing.
7 years ago
Cao Ying
479c861b57
Merge pull request #7726 from lcy-seso/fix_rendering_error_of_transpose_op
...
fix rendering error of transpose operator and add wrapper.
7 years ago
wanghaoshuang
09544bc1e6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into blockexpand
7 years ago
kavyasrinet
b7eeef2489
Updating the comments for send_op and recv_op. ( #7747 )
...
* Updating the cluster training doc
* Fixed comments
* Updating few comments in recv_op
7 years ago
guosheng
d51204425a
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-lookup_table_op-padidx
7 years ago
dangqingqing
e44dedf90f
Fix the warning and unit test.
7 years ago
Tao Luo
1575c2c825
Merge pull request #7572 from luotao1/fluid_infer
...
compile and install the shared library of fluid inference
7 years ago
ying
64606eafff
Merge branch 'develop' into fix_rendering_error_of_transpose_op
7 years ago
whs
44561a24c9
Merge pull request #7655 from wanghaoshuang/ctc_evaluator_py
...
Add python wrapper for CTC greedy decoder and edit distance evaluator
7 years ago
dangqingqing
74af23b6d1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bipartite_match_op
7 years ago
dangqingqing
530df1b289
Fix the naming.
7 years ago
zlx
3772d27dfb
add depthwise conv forward
7 years ago
wanghaoshuang
c9e208c845
Fix white space in comments.
7 years ago
dangqingqing
07908686d5
Update some comments and add more check.
7 years ago
wanghaox
0b178b8a59
update code
7 years ago
guosheng
b03c8b90ae
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-lookup_table_op-padidx
7 years ago
wanghaox
0e165032a8
update code
7 years ago
dangqingqing
a89e1b85fe
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into boost_cmake
7 years ago
dangqingqing
609ede25b9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into profiling_py
7 years ago
dangqingqing
579449b050
Update comments and revert pybind11.
7 years ago
wanghaoshuang
1bc8de3209
1. Add sequence_num as edit distance op's output
...
2. Fix evaluator using 'reduce_sum' op instead of 'mean' op
7 years ago
ying
c6b78e56b1
Merge branch 'develop' into fix_rendering_error_of_transpose_op
7 years ago
Luo Tao
acb13e7fac
Merge branch 'develop' into fluid_infer
7 years ago
wanghaox
3c255ad39d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into iou_sim
7 years ago
guosheng
048bc12d60
Fix GPU kernel of lookup_table_op enhancement
7 years ago
wanghaox
f7c0ad9d35
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prior_box
7 years ago
wanghaox
528bcac52c
update iou_sim code
7 years ago
wanghaoshuang
648ca7a3b2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into blockexpand
7 years ago
Yu Yang
54a4696f2a
Merge pull request #7660 from reyoung/feature/compare_op_use_elemwise
...
Make compare_op reuse elemwise_op_funcs
7 years ago
Yang Yu
af8cb820a7
Fix bug of nce_op
...
* also div num_samples when return cost of nce_op
7 years ago
Luo Tao
5c056535ed
Merge branch 'develop' into fluid_infer
7 years ago
wanghaox
142f632886
update code
7 years ago
wanghaoshuang
3a48282e61
Fix unittest
7 years ago
ying
f6cea35743
fix rendering error of transpose operator.
7 years ago
QI JUN
a6da470b11
add memory optimization transpiler demo ( #7443 )
...
* add memory optimization transpiler demo
* add memory benchmark compile option
* add gflags instead of macro
* refine code
7 years ago
wanghaox
f020f4b51e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prior_box
7 years ago
Yang Yu
f035f327b1
Wrap NCE to python
...
And write an unittest for it
7 years ago
guosheng
e54eaee58a
Fix print_operators_doc to support AttrType int_64
7 years ago
wanghaoshuang
500e29a4a4
1. Reduce attributes
...
2. Rename 'get_output_size' to 'OutputSize'
3. Remove redundant whitespace char.
7 years ago
guosheng
fd5c1c8a69
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-lookup_table_op-padidx
7 years ago
Yang Yu
2024489bb8
Fix CI
7 years ago
Yang Yu
2b9b6c3d32
Merge branch 'develop' of github.com:baidu/Paddle into feature/compare_op_use_elemwise
7 years ago
Wang Meng
cb17dd20f7
Merge pull request #7698 from will-am/fix_priorbox
...
Fix priorbox layer when multiple values given in min_size
7 years ago
whs
32cc11e358
Merge pull request #5945 from wanghaoshuang/sampler
...
Add math function for sampling integers
7 years ago
guosheng
4c7cb771aa
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-lookup_table_op-padidx
7 years ago
guosheng
ae676a609f
Enhance lookup_table_op to support padding_idx
7 years ago
武毅
18615626a8
Merge pull request #7715 from Yancey1989/split_selected_rows_to_multi_pserver
...
[WIP] Split SelectedRows to multiple pservers
7 years ago
dangqingqing
dd5e8d6c23
Fix the cmake dependences.
7 years ago
Yibing Liu
f2c4bb679b
Add lstm with recurrent projection operator
7 years ago
Qiao Longfei
85671b8acb
Data type transform ( #7653 )
...
* init complete data layout transform
* can compile
* test passed
* optimize code
* fix while_grad_op first step loss lod problem
* optimize in out ptr for transform
* add check
* update copyright
* clean code
* add NeedTransformLayout
* add comment
* change the interface of data_type_transform
* init data_type_transform_test
* complete data_type_transform_test
* add TransDataType to data_transform
7 years ago
Abhinav Arora
cd775a130a
Fixing typo in Send op ( #7625 )
7 years ago
dzhwinter
e983cc90fc
"fix decode bug" ( #7711 )
...
* "fix decode bug"
* "follow comment"
* "fix error"
* "fix hook bug"
* fix based comment
* fix copyright
* fix based on comment
7 years ago
Cao Ying
04806ffe83
Merge pull request #7656 from chengduoZH/feature/enhance_matmul_op
...
Enhance matmul_op to support 4-D inputs.
7 years ago
Yancey1989
d0a9393668
Split SelectedRows to multiple pservers
7 years ago
chengduoZH
782ddc5f59
follow comments
7 years ago
dangqingqing
564c6abda3
Add cmake for extern project of boost.
7 years ago
Xi Chen
84568d38e8
remove unused vars which block the build
7 years ago
helinwang
d85a9dc4de
Merge pull request #7621 from helinwang/remote_optimize
...
Recv OP: use BlockDesc* instead of ProgramDesc proto as Attribute
7 years ago
helinwang
c8e7d1085e
Merge pull request #7686 from peterzhang2029/scale_fix
...
Fix the description of default scale attribute for scale op.
7 years ago
helinwang
e92fe9e64f
Merge branch 'develop' into remote_optimize
7 years ago
wangmeng28
871b0e6854
Simplify priorbox layer
7 years ago
wangmeng28
e22039f82b
Fix priorbox layer when multiple values given in min_size
7 years ago
Luo Tao
f1c4c80025
Merge branch 'develop' into fluid_infer
7 years ago
chengduoZH
cd38e2d142
remove conflict
7 years ago
chengduoZH
0468422d06
follow comments
7 years ago
wanghaoshuang
a123c169ea
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into sampler
7 years ago
dangqingqing
c2edcde1b1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bipartite_match_op
7 years ago
dangqingqing
bf33b191d0
Add bipartite matching operator and unit testing.
7 years ago
Yang yaming
4f933312db
Merge pull request #7662 from pkuyym/fix-6678
...
Add sequence reshape operator
7 years ago
yangyaming
b07ca1de1f
resize before computing LoD.
7 years ago
Qiao Longfei
50ac67fc63
Bugfix/check if kernel for type exist ( #7657 )
...
* check if kernel is found for kernel type
* do kernel check before data transform
7 years ago
whs
55ec0e2a92
Merge pull request #7649 from wanghaoshuang/fix_edit
...
Change input data type to int64_t
7 years ago
peterzhang2029
b48fedc30b
init
7 years ago
Qiao Longfei
0071b5f797
complete data layout transform ( #7440 )
...
* add data layout transform and optimize the implementation of data_transform
7 years ago
whs
47753a9667
Merge pull request #7527 from wanghaoshuang/ctc_greedy_decode
...
Add CTC align op
7 years ago
Cao Ying
6720e67fbd
Merge pull request #7675 from lcy-seso/remove_copy_from_crf
...
delete memory copy from linear_chain_crf_op.
7 years ago
yangyaming
08cb472ab9
Simplify the implementation.
7 years ago
Yibing Liu
a1c281f07e
Merge pull request #7603 from kuke/simplify_erase
...
Enhance GPU kernel of sequence erase op
7 years ago
wanghaox
c5a14ed4cd
add mine_hard_examples operator
7 years ago
Yang Yu
9c0b29014c
Make compare_op reuse elemwise_op_funcs
7 years ago
Yibing Liu
41b8388492
Merge pull request #7640 from kuke/add_lstm_doc
...
Add python doc for dynamic_lstm
7 years ago
Yibing Liu
8809d43ad7
Remove unnecessary dtype conversion & register int64 kernels
7 years ago
Yu Yang
47622d7f5c
Merge pull request #7624 from tonyyang-svail/7450
...
Make merge and split support lodtensor
7 years ago
Yu Yang
259fcc3026
Merge pull request #7676 from tonyyang-svail/7673
...
bring back test_parallel_do
7 years ago
wanghaoshuang
6089b50c4b
Register int64_t kernels
7 years ago
武毅
58be41fa91
Merge pull request #7608 from typhoonzero/distributed_split_selectedrows
...
Enhance distributed train performance
7 years ago
Yang Yang
07cf302e6e
first commit
7 years ago
caoying03
4020451a7b
delete memory copy from linear_chain_crf_op.
7 years ago
Yancey
c79d530ad3
Add split selected rows op ( #7604 )
...
* add split selected rows op
* update comment
* add grad check
* register cuda kernel
* fix ci failed
7 years ago
Tao Luo
161bd4a4c3
Merge pull request #7663 from chengduoZH/fix_warning/seq_pad_test
...
Fix compiling warnings
7 years ago
typhoonzero
0aff136395
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into distributed_split_selectedrows
7 years ago
kexinzhao
d77e6a6760
Merge pull request #7636 from kexinzhao/save_inference_model
...
Add feed and fetch op to ProgramDesc before saving for inference
7 years ago
chengduoZH
8e02870ce2
modify doc
7 years ago
chengduoZH
259858b41b
modify doc
7 years ago
yangyaming
fc581bc5f2
Change the CopyRight.
7 years ago
yangyaming
bea41444d7
Refine the implementation and add unit test.
7 years ago
typhoonzero
5f4d9130f0
merge codes
7 years ago
wanghaoshuang
6e04e580d2
Change input data type to int64_t in unittest and GPU kernel
7 years ago
typhoonzero
ae19d2ea1e
fix comm issues
7 years ago
chengduoZH
578d60bf4a
code refine
7 years ago
guosheng
b83ff4514b
Register reduce_op_kernel with multiple data types
7 years ago
chengduoZH
2edc136c5f
add 4-d for matmul_op
7 years ago
guosheng
9247aee7e4
Enhance lookup_table_op to support padding_idx
7 years ago
Tao Luo
c73f00fe35
Merge pull request #7634 from Yancey1989/fix_compile_warning
...
Fix sequence_padding compile warning
7 years ago
wanghaoshuang
77cf21e53e
Change input data type to int64_t
7 years ago
Yancey1989
2ce5c9d45e
fix gpu implement
7 years ago
Yan Chunwei
3388e52da1
Bugfix/beamsearch op ( #7611 )
7 years ago
yangyaming
f20617be46
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-6678
7 years ago
Luo Tao
a12db45c20
rename libpaddle_fluid_shared.so to libpaddle_fluid.so
7 years ago
yangyaming
9bd9d8b5ca
Add sequence_reshape_op.
7 years ago
Yibing Liu
f46257fa4a
Merge branch 'develop' of upstream into add_lstm_doc
7 years ago
Luo Tao
5f202db861
Merge branch 'develop' into fluid_infer
7 years ago
chengduoZH
edd2132679
remove conflict
7 years ago
Yancey1989
07e87ff185
Fix sequence_padding compile warning
7 years ago
Guo Sheng
4b3e22b865
Merge pull request #7574 from lcy-seso/wraper_for_l2_normalize
...
add python wrapper for l2 normalize layer.
7 years ago
fengjiayi
1d89866995
Merge pull request #7593 from JiayiFeng/dev_elementwise_scalar
...
Make elementwise_op supporting scalar input `Y`
7 years ago
fengjiayi
37a9437073
Merge pull request #7538 from JiayiFeng/dev_elementwise_max_min
...
elementwise max min
7 years ago
Tao Luo
388aa51ccf
Merge pull request #7622 from abhinavarora/reduce_op_doc
...
Fixing the rendering for Reduce operators in operator documentation
7 years ago
Kexin Zhao
6edbf1387c
remove ptools
7 years ago
Kexin Zhao
ed3e5717ba
fix bug
7 years ago
Yang Yang
d002f60af9
merge develop
7 years ago
Yang Yang
e0e45c052e
merge develop
7 years ago
Yang Yang
28b240bbcf
delete todo in MergeLoDTensor
7 years ago
Kexin Zhao
dc168ed00a
modify programDesc based on feed and fetch names
7 years ago
Abhinav Arora
484af6c7e0
Fixing the rendering for Reduce operators in operator documentation
7 years ago
Helin Wang
151f579c23
Recv OP: use BlockDesc* instead of ProgramDesc proto as Attribute
7 years ago
Yu Yang
b8a17987ec
Feature/parallel for bug fix ( #7474 )
...
* Fix ParallelDo not support empty input gradient
* Polish ParallelDo and fix several bugs
* Fix CI
* Fix CI
7 years ago
Helin Wang
d4d143796f
Fix build error
7 years ago
Qiao Longfei
b870e04256
clean lod_tensor_test ( #7618 )
7 years ago
fengjiayi
193096194c
Merge branch 'dev_elementwise_max_min' into dev_elementwise_scalar
7 years ago
fengjiayi
a37f6ad379
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_elementwise_max_min
7 years ago
ying
e043c2ce45
Merge branch 'develop' into wraper_for_l2_normalize
7 years ago
Cao Ying
a9899dbe2b
Merge pull request #7609 from lcy-seso/fix_copyright
...
fix copyright.
7 years ago
Yan Chunwei
ac73900942
enhance/add lod check ( #7439 )
7 years ago
Tao Luo
939e1b1af9
Merge pull request #7592 from tensor-tang/fixds2
...
fix mkldnn concat dimension issue for rnn
7 years ago
ying
8f9480cc2c
fix copyright.
7 years ago
Qiao Longfei
6f71f89ded
change DEVICE_TYPE in op_registry to LIBRARY_TYPE ( #7588 )
7 years ago
typhoonzero
f233b936c7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into distributed_split_selectedrows
7 years ago
fengjiayi
3f5c77cc4b
fix copyright information
7 years ago
Luo Tao
fc75da7902
Merge branch 'develop' into fluid_infer
7 years ago
Luo Tao
c96b7e8047
add missing framework.pb.h and fix string install typo
7 years ago
wanghaoshuang
fe45f2115f
1. Rename 'block_expand' to im2sequence
...
2. Refine code and doc
7 years ago
fengjiayi
f1a889720a
fix copyright
7 years ago
Yibing Liu
7a2aa486cc
Unify data type in sequence_erase_op
7 years ago
typhoonzero
02ea349101
enhance dist train performance
7 years ago
Yibing Liu
d1d614b9f8
Refine the GPU kernel for sequence_erase_op
7 years ago
wanghaoshuang
7150289b5c
Refine CPU kernel
...
1. Allocate memory for output before compute.
2. Rename 'ctc_decode' to 'ctc_align'
7 years ago
Yibing Liu
7d3b2e4b03
Fix a bug in sequence_erase_op
7 years ago
chengduoZH
c9641a03dc
refine code
7 years ago
fengjiayi
14f6fa346b
make elementwise op support scalar as input Y
7 years ago
chengduoZH
ed7e74abf2
follow comments and refine python doc
7 years ago
tensor-tang
5d226743eb
fix mkldnn concat dimension issue for rnn
7 years ago
ying
87a59d65d6
Merge branch 'develop' into wraper_for_l2_normalize
7 years ago
wanghaoshuang
09adb76903
Fix code style
7 years ago
chengduoZH
24f528a1a5
follow comments
7 years ago
wanghaoshuang
0c10b5ff73
Merge branch 'develop' of https://github.com/wanghaoshuang/Paddle into ctc_greedy_decode
7 years ago
fengjiayi
3177c9a691
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_elementwise_max_min
7 years ago
wanghaoshuang
bfe7e24243
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into blockexpand
7 years ago
wanghaoshuang
92baa885a0
Fix code style
7 years ago
fengjiayi
d15bfabbd0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_elementwise_max_min
7 years ago
wanghaoshuang
e82f1008a8
Finish block expand op
...
1. Add lod to output
2. Fix im2col arguments list
3. Refine code and doc
4. Fix output shape
7 years ago
Yibing Liu
aab4cfeb65
Add doc for dynamic_lstm python api
7 years ago
dangqingqing
d2a70243f1
Refine profiler and expose to Python.
7 years ago
wanghaox
3b63815629
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into iou_sim
7 years ago
wanghaox
2ad5a6f0d1
add iou similarity operator
7 years ago
caoying03
6497bff901
add python wrapper for l2 normalize.
7 years ago
Luo Tao
2be7cf909a
add paddle INSTALL for fluid api
7 years ago
wanghaoshuang
10dd632659
Rename 'ctc_greedy_decode' to 'ctc_decode'
7 years ago
Luo Tao
984d09c8de
Merge branch 'develop' into fluid_infer
7 years ago
wanghaoshuang
281e93bcbb
Remove 'top 1' from CPU and GPU kernel
...
1. Remove 'top 1'(or argmax) from CPU and GPU kernel
2. Add a new test case
3. Refine doc
7 years ago
fengjiayi
f59599a3dd
refine elementwise_add_op
7 years ago
fengjiayi
ead7059bf9
Refine code
7 years ago
sidgoyal78
ba2f6f71ad
Modify directory structure to show plots
7 years ago
helinwang
052c05be34
Merge pull request #7537 from Yancey1989/distributed_gpu
...
Fluid distributed supports CUDA place
7 years ago
fengjiayi
5f4481350e
Merge pull request #7529 from JiayiFeng/remove_functor1
...
remove `functor1` of ElementwiseGradCompute
7 years ago
gongweibao
535fefb7e8
Fix grpc bugs ( #7435 )
...
Fix grpc bugs
7 years ago
whs
448fee3db4
Merge pull request #7414 from wanghaoshuang/warpctc
...
Adapt warpctc grad op for gradient checking
7 years ago
dzhwinter
b9b75377a2
Feature/hooks ( #7513 )
...
* add copyright hook
* add copyright hook
* refine copyright hook
* "test copyright hook"
* fix check style
* fix ci
7 years ago
chengduoZH
251c6032fb
set use_cudnn as default
7 years ago
Yancey1989
329f1e0f79
add some comment
7 years ago
Yancey1989
9de1809590
fluid distributed on CUDA place
7 years ago
chengduoZH
79aa51229a
fix conv, pool, conv_trans to decide use cudnn or not
7 years ago
fengjiayi
f5cd961900
complete elementwise_min_op
7 years ago
fengjiayi
acf37ad675
Complete elementwise_max_op
7 years ago
fengjiayi
76a74f1f77
Merge branch 'remove_functor1' into dev_elementwise_max_min
7 years ago
fengjiayi
6ee8a2e1db
remove unnecessary functor1
7 years ago
fengjiayi
736842e447
wip
7 years ago
wanghaoshuang
8f37c3c2a7
Fix sequence scale functor cuda kernel
...
1. Fix kernel
2. Add more test case
7 years ago
wanghaoshuang
579f684661
Add ctc_greedy_decode_op
7 years ago
Luo Tao
816e556b98
Merge branch 'develop' into fluid_infer
7 years ago
guosheng
ef129718ea
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-python-glu
7 years ago
guosheng
c083a60d7a
Add python split and glu
7 years ago
yangyaming
a091d1a31c
Enhance print_op.
7 years ago
Qiao Longfei
9deb1756a2
fix while_grad_op first step loss lod problem ( #7490 )
...
* fix while_grad_op first step loss lod problem
* optimize code
7 years ago
dzhwinter
5ad1aef051
"cudnn operators change to cudnn kernel" ( #6660 )
...
* "unified operators"
* "add CUDNN register"
* "add use cudnn attribute"
* "add attribute"
* "test conv transpose op"
* "remove duplicated attr"
* "fix op test"
* "add attribute to set cudnn"
* "add more log"
* "need layout op register support"
* "add more log"
* "change GetExpectedKernelType "
* "fix Get attr in conv_op"
* "fix CI"
* "fix tests"
* "removed kernel priority fallback"
* "fix CI"
* "fix stack pointer bug"
* "refine buggy interface"
* "add const cast to save life"
* "fix get_output_with_grad"
* "fix op test with dataformat"
* "fix pooling"
* "fix pooling test"
* "fix CI"
* "fix with_gpu error"
* "add transform needed functional check"
* "fix unpack list error"
* "comment out parallel.do temporary"
* "fix CI"
* "fix compile doc error"
* "make threshold larger"
7 years ago
wanghaoshuang
45cf234127
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into warpctc
7 years ago
wanghaoshuang
137f0dfc21
1. Fix warpctc grad tensor initial bug.
...
2. Remove num_seq arguments.
3. Refine CUDA kernel of ScaleLoDTensorFunctor.
4. Change max_relative_error of gradient unittest to 0.007
7 years ago
wanghaox
8ab611d559
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into prior_box
7 years ago
wanghaox
1ba3d29789
update code
7 years ago
Cao Ying
db65f497ff
Update comments for two operators. ( #7457 )
...
* update code comments.
* update the comments.
* follow comments.
7 years ago
Luo Tao
3cf23bece3
create paddle_fluid_shared.so library
7 years ago
Tao Luo
04bfed660b
Merge pull request #7413 from tensor-tang/v1mac
...
enable auto set env of V1 on Mac
7 years ago
Qiao Longfei
23df6c4478
Add get lod for debug ( #7375 )
...
* add GetLoD for debug
* add LoDToString
* optimize if
* typo
* add lod_tensor to operator's dependency
7 years ago
Yan Chunwei
3423022e84
feature/add print op ( #6799 )
7 years ago
Tao Luo
03c858a9be
Merge pull request #6596 from lcy-seso/update_capi_doc
...
add doc for how to use C-API.
7 years ago
emailweixu
d954becb1c
Merge pull request #7371 from emailweixu/assign_value_op
...
assign_value operator
7 years ago
Yang Yang
208f950ccc
delete todo
7 years ago
Tao Luo
29b2693afb
Merge pull request #7445 from abhinavarora/api_doc_bug
...
Fix the documentation for elementwise op in fluid layers
7 years ago
xuwei06
25ecd2061a
Use CopyFromVector for assign_value_op
7 years ago
xuwei06
7306aab61d
GetActualKernelType => GetExpectedKernelType
7 years ago
xuwei06
ce233796ea
assign_value operator
...
We need this operator to assign values to a tensor; the values are stored in the program so that they can be used independently of Python.
7 years ago
Yang Yang
8c1025d66f
first commit
7 years ago
Yu Yang
a9dbdab5e2
Merge pull request #7396 from reyoung/feature/parallel_for_unittest
...
Feature/parallel for unittest
7 years ago
ranqiu92
95c0c12641
Merge pull request #7384 from dzhwinter/feature/sync_wait
...
Feature/sync wait
7 years ago
hedaoyuan
784e59406c
Bug fix of Im2ColMobileFunctor.
7 years ago
hedaoyuan
ed0a564c90
Optimize GemmConvMobileFunction.
7 years ago
Abhinav Arora
f784dae3da
Fix the documentation for elementwise op in fluid layers
7 years ago
tensor-tang
74c5e7c52d
enable auto set env of V1 on Mac
7 years ago
Yang Yu
dc488c17d1
Merge branch 'develop' of github.com:baidu/Paddle into feature/parallel_for_unittest
7 years ago
whs
59bf85d914
Merge pull request #7325 from kuke/sequence_erase_op
...
Add sequence erase op
7 years ago
wanghaoshuang
89de5d5e66
Fix cuda kernel of sequence scale functor
7 years ago
gongweibao
da3087ada1
Async GRPC sendrecv ( #7133 )
...
Async GRPC sendrecv
7 years ago
wanghaoshuang
9eb3fb2946
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into warpctc
7 years ago
wanghaoshuang
b1af5e435f
1. Fix warpctc grad op
2. Add check grad test
7 years ago
Tao Luo
020630b7a3
Merge pull request #7395 from tensor-tang/fixds2
...
fix ds2 issue
7 years ago
hedaoyuan
2b202f754b
Optimize maxPoolForward.
7 years ago
Tao Luo
af96cd5877
Merge pull request #7397 from luotao1/openblas_env
...
auto set openblas env
7 years ago
Yibing Liu
861b84f557
Merge pull request #5300 from kuke/ctc_edit_distance_dev
...
Add edit distance operator
7 years ago
tensor-tang
a7e847b648
fix ds2 issue
7 years ago
Yibing Liu
fe0ef91a3f
fix ci error in edit_distance_op
7 years ago
Qiao Longfei
377424bf21
reorganize data transform related code ( #7391 )
...
* init data_type_transform
* split data_layout_transform
* tmp rm data_transform_test
* change device_data_transform to data_device_transform
* clean code
* clean code
7 years ago
Yibing Liu
f3dcd00603
Merge branch 'develop' of upstream into ctc_edit_distance_dev
7 years ago
Yu Yang
6cff3c96db
Merge pull request #7381 from reyoung/feature/refine_get_places_op
...
Polish GetPlacesOp
7 years ago
Yibing Liu
f594ca4369
Reuse the usable variable in edit_distance_op
7 years ago
Yang yaming
a32027604f
Merge pull request #7176 from pkuyym/fix-7173
...
Enhance shrink_rnn_memory_op.
7 years ago
Luo Tao
929d22c622
auto set openblas env
7 years ago
Yang Yu
52ed103899
Merge branch 'feature/fix_ini_glog' into feature/parallel_for_unittest
7 years ago
Yang Yu
2f56995f7c
Fix InitGLOG
...
glog does not keep its own copy of ARGV[0].
7 years ago
Yibing Liu
10779460c5
Simplify calc in test_sequence_erase_op
7 years ago
Yang Yu
e5e206e2b6
Merge branch 'develop' of github.com:baidu/Paddle into feature/refine_get_places_op
7 years ago
Yang Yang(Tony)
4bcc0b64cb
[WIP] feature/parallel_gpu ( #7293 )
...
feature/parallel_gpu
7 years ago
dzhwinter
a6edc0389e
"fix CI"
7 years ago
dzhwinter
f0316bdbbd
"add flags"
7 years ago
Yang Yu
4059c9ca7f
Polish GetPlacesOp
7 years ago
fengjiayi
df92776850
Merge pull request #7269 from emailweixu/calc_gradient
...
Calculating gradients for partial graph
7 years ago
dzhwinter
5f98500009
Make init device on all gpu by default ( #7345 )
...
* "init use all default devices"
* "fix init test"
7 years ago
Qiao Longfei
efe06caa3d
change data type of beam_search op ( #7374 )
7 years ago
Qiao Longfei
91f80f792d
Topk share lod ( #7373 )
...
* add lod tensor ToAbsOffset test
* add share lod to topk op and softmax op
7 years ago
fengjiayi
cedd9805f5
Merge pull request #7361 from JiayiFeng/refine_and_enhence_WhileGradOp
...
Refine while grad op
7 years ago
xuwei06
585dec3dc2
Calculating gradients for partial graph
...
Added backward.calc_gradient to backpropagate gradient from given targets to inputs.
7 years ago
xuwei06
0ef9dc6122
Fix comment for norm_op
7 years ago
yangyaming
106d063c5f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-7346
7 years ago
yangyaming
427e47459b
Add grad_op_maker for sequence_pool.
7 years ago
QI JUN
45e77154cf
add general memory usage interface for both CPU/CUDA ( #7352 )
7 years ago
wanghaox
99a6c5d40e
change output shape to [2, layer_height, layer_width, num_priors, 4]
7 years ago
fengjiayi
fbc30215d4
refine WhileGradOp code
7 years ago
Yancey
e249ad1211
Test dist word2vec ( #7334 )
...
* test dist word2vec
* multiple trainers work
7 years ago
Yiqun Liu
b5fda2723f
Port WarpCTC Operator ( #5107 )
...
* Add Seq2BatchFunctor, which will be used in WarpCTCOp.
* Implement WrapCTCFunctor and WrapCTCKernel.
* Add unittest of warpctc_op.
* Modify the check_output interface in the Python unittest framework to allow checking a subset of outputs.
* Use absolute offset lod in warpctc_op and related functors.
* Refine the comments of warpctc_op.
* The new python unittest supports checking a subset of the outputs, so revoke the previous change.
* Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.
* Update to the newest codes.
* Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and move the computation of dimensions out of the functors.
7 years ago
fengjiayi
8f962f7433
Update
7 years ago
Yu Yang
ce6dad3b35
Rename CopyFrom to Copy for tensors ( #7292 )
...
* Rename Tensor::CopyFrom to Tensor::Copy
* Fix CI
* Fix compile
7 years ago
Yu Yang
1dad4bb2e9
Remove unused LoDTensor methods ( #7247 )
...
* Remove unused LoDTensor methods
* Update
7 years ago
Qiao Longfei
d762e07ecc
Merge pull request #7294 from jacquesqiao/add-back-priority
...
add back priority
7 years ago
Yibing Liu
7b9d5b325c
Add document for sequence_erase_op
7 years ago
qiaolongfei
8b1a81a9bf
fix GetDims bug
7 years ago
Yibing Liu
37f933b8ad
Add gpu kernel for sequence_erase_op
7 years ago
qiaolongfei
5b94948b32
disable UseAll when init
7 years ago
qiaolongfei
0b52cc886f
fix priority
7 years ago
Yu Yang
6e3cc0c993
Merge pull request #7240 from reyoung/feature/make_lod_a_share_ptr
...
Add COWPtr and its unittest
7 years ago
hedaoyuan
e8a96a8e89
Merge pull request #7159 from hedaoyuan/inference
...
Release the middle layer's output memory
7 years ago
Yancey
aa75f1e2c5
Create tensor in recv op ( #7286 )
...
* create tensor in recv op
* static global function to global function
7 years ago
Tao Luo
b9ce55faad
Merge pull request #7290 from luotao1/profiler
...
fix compile error in profiler.cc
7 years ago
qiaolongfei
ca90356b0e
add back priority
7 years ago
yangyaming
12ed53c1f2
Inherit LoD from x to x_grad and enhance the unit test.
7 years ago
Yu Yang
219fbd51bc
Merge pull request #6732 from QiJune/get_places_op
...
add GetPlaces operator
7 years ago
Luo Tao
01ee42b121
fix compile error in profiler.cc
7 years ago
Yibing Liu
2a54ddd267
Merge branch 'develop' of upstream into sequence_erase_op
7 years ago
Guo Sheng
26d451116b
Merge pull request #7251 from guoshengCS/enhance-reorderTensor
...
Enhance reorder_lod_tensor_by_rank_op to support Tensor
7 years ago
Yang Yu
3b0afae3d1
Add more comments
7 years ago
Yang Yu
bdc82956d6
Merge branch 'develop' of github.com:baidu/Paddle into feature/make_lod_a_share_ptr
7 years ago
dzhwinter
e94db381ba
Feature/add shared layout ( #7233 )
...
* "reuse ShareLoD with no regret"
* "removed base class shareLayout"
* "fix CI"
7 years ago
Yang Yu
63ff0b4ba9
Refine get_places
7 years ago
Yibing Liu
5e90f5e187
Merge pull request #7043 from kuke/profiler_tool
...
Add the parsing part for the profiling tool
7 years ago
Tao Luo
c62383e951
Merge pull request #7255 from tensor-tang/capi
...
fix capi package
7 years ago
Qiao Longfei
0f353ab46e
cpu gpu transform function ( #7191 )
...
* add rename guard
* add device_data_transform
* add device_data_transform_test
* modify GetExpectedKernelType
* update operator.run
* support test test_label_semantic_roles
* optimize code
* optimize code
* rename GetActualKernelType to GetExpectedKernelType
* fix chunk_eval_op and device_data_transform_test
* add is_same_place to place
* optimize code, refine rename_guard
* refine rename guard, add GetKernelTypeForVar
* optimize code
* add some log
* rename guard
* use sub scope to create var
* fix compile
* add IsInitialized for Tensor
* add VarIsTensor
* fix op_registry_test
* test
* tmp disable priority
* restore switch_kernel.md
* code clean
7 years ago
Yibing Liu
6416eee780
Merge branch 'develop' of upstream into profiling_tool
7 years ago
Yibing Liu
ea0280b4a1
Remove unused included header gflags
7 years ago
Yibing Liu
d09503b2be
Remove the redundant switch case statement
7 years ago