sneaxiy
5d698589ce
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_elementwise_add
7 years ago
sneaxiy
19ff254d05
Merge branch 'refine_elementwise_add' of https://github.com/sneaxiy/Paddle into refine_elementwise_add
7 years ago
Sylwester Fraczek
d74bb6ab9c
fix ut for mkldnn 0.15 - added forcing layout NCHW in mkldnn conv tests
7 years ago
fengjiayi
855c9e3311
clean softmax_op code
7 years ago
fengjiayi
24d51de022
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
fengjiayi
27df3a9f2b
make cross_entropy_op supporting tensors
7 years ago
fengjiayi
66be53264e
Merge pull request #12592 from JiayiFeng/fix_mac_compile_error
...
fix mac compile error
7 years ago
fengjiayi
8e604a10aa
fix mac compile error
7 years ago
nhzlx
551c802cdc
merge develop
7 years ago
sneaxiy
ad45d39222
refine layer_norm
7 years ago
chengduo
7c8b69c700
Feature/op fusion ( #12240 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* follow comments
* follow comments
7 years ago
sneaxiy
1b4515f6db
refine softmax_with_cross_entropy
7 years ago
nhzlx
3a0caf801f
modify trt engine op test
7 years ago
nhzlx
e51d045a6d
modify trt engine op test
7 years ago
nhzlx
e8954a36f5
merge develop
7 years ago
nhzlx
32a9e050bc
mapping the variable name inside the subgraph
7 years ago
Wu Yi
2d036c47cd
polish dist unit test code ( #12512 )
...
* polish dist se resnext ut
* update
* update
* update
* avoid cpu initializer differ
* change to use executor for now
* update by comment
* remove lr decay use para exe, should fix para exe bug later
* update by comment
7 years ago
fengjiayi
7834b4a470
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
tangwei12
5bfdefae91
Merge branch 'Pdv' into samplingIdOp
7 years ago
tangwei12
b30bdde15a
random optimize
7 years ago
tangwei12
9c63fef63c
random optimize
7 years ago
Qiao Longfei
88a607c342
Merge pull request #12541 from jacquesqiao/optimize-profiler
...
optimize profiler
7 years ago
tangwei12
5b9716d1f6
add dims check
7 years ago
tangwei12
4cd504d3b4
bug fix
7 years ago
sneaxiy
e57bc4d745
Merge branch 'refine_elementwise_add' of https://github.com/sneaxiy/Paddle into refine_elementwise_add
7 years ago
sneaxiy
222fbbedfb
Merge branch 'develop' into refine_elementwise_add
7 years ago
sneaxiy
4b83afff6e
Merge branch 'develop' into refine_elementwise_add
7 years ago
sneaxiy
b2d0ee5159
refine elementwise_add op
7 years ago
tangwei12
da2cc99f67
sampling op optimize
7 years ago
fengjiayi
7c55e08c93
stash
7 years ago
tangwei12
4973e07be3
sampling op optimize
7 years ago
tensor-tang
836068569f
Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
tensor-tang
18c322c2a1
separate cpu and gpu implementations for gru kernel compute
7 years ago
tensor-tang
54c95e49f0
fix blas
7 years ago
fengjiayi
b656d97e86
Merge pull request #12485 from JiayiFeng/dev_ops_tensor_support
...
Make lookup_table_op and softmax_op supporting high rank tensor
7 years ago
qiaolongfei
1623f1ba4f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
tangwei12
3206970b77
sampling op rename
7 years ago
Xin Pan
99a77cfc62
Merge pull request #12468 from panyx0718/improve_profiler2
...
Improve profiler
7 years ago
qiaolongfei
a3f9d6a38c
optimize profiler
7 years ago
tangwei12
e0ab2f7158
new sampling op
7 years ago
tensor-tang
8c23f7c4f0
fix blas and use packed weight
7 years ago
tensor-tang
d9cc6b1866
replace gru compute with details
7 years ago
tensor-tang
43cee33a23
add mkl packed gemm
7 years ago
tangwei12
766ac488ac
sum_op selectedRows dim bug fix
7 years ago
dzhwinter
595a2c83ae
explicit gradient of elementwise_add/elementwise_sub ( #11970 )
...
* "add gradient register"
* "make some enhance"
* "better format"
* "fix typo"
* "fix reuse"
* "fix get expected kernel"
* "change the mkldnn code"
* "fix mkldnn"
* "fix mkldnn failed test"
* "add comment"
7 years ago
fengjiayi
e7d8e16a66
update softmax_mkldnn_op
7 years ago
Yu Yang
2567afa35d
Merge pull request #12462 from reyoung/feature/fix_cudnn_deterministic
...
Fix bug in cudnn_determistic
7 years ago
fengjiayi
dc111d3476
update softmax_cudnn_op
7 years ago
fengjiayi
f7bd0b227b
Add unittests for softmax_op
7 years ago
gongweibao
819ac3df0a
Modify style ( #12465 )
7 years ago
fengjiayi
b314a69523
make softmax supporting tensors
7 years ago
fengjiayi
b1af7e5d9b
Add unittests for lookup_table_op
7 years ago
tangwei12
c4c8f60bec
sum_op selectedRows dim bug fix
7 years ago
Xin Pan
486345551d
clean
7 years ago
Xin Pan
caf10b474f
make profiler use thread_id from g_thread_id
...
Add a few more RecordEvent.
Cleanup
7 years ago
Yu Yang
040fc1c39b
Fix bug in cudnn_determistic
...
* Introduced by #11205
7 years ago
fengjiayi
7efdf05ac2
make look_up_op supporting tensor ids
7 years ago
Qiao Longfei
690625fe15
Merge pull request #12456 from jacquesqiao/add-profiler-to-pserver
...
Add profiler to pserver
7 years ago
qiaolongfei
7e46a8d172
fix logical bug, optimize code
7 years ago
qiaolongfei
0b62f61d29
add init flag in __init__.py for listen_and_serv_profile_period
7 years ago
dzhwinter
91fb0156ca
Memory/reshape op ( #12414 )
...
* "remove inplace in single op"
* "fix ci"
* "add transpiler case"
* fix conflict
* "fix reshape"
* "delete reshape inplace attr"
* "follo the comments"
* "rerun ci"
7 years ago
qiaolongfei
0b861bbca9
add profiler for listen_and_serv op
7 years ago
tensor-tang
059b27840c
Merge pull request #12408 from tensor-tang/refine/im2col
...
Refine CPU im2col padding with 1
7 years ago
qiaolongfei
147bf00ffe
clear mutable rows for the output of split_ids_op
7 years ago
qiaolongfei
91b114a787
change map to unordered_map
7 years ago
tensor-tang
d8d2dbcfac
further optimize im2col using variables
7 years ago
qiaolongfei
91f63cd401
fix split_ids_op and add unit test
7 years ago
tensor-tang
5373fe29c2
Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
Qiyang Min
7da453630e
Merge pull request #12403 from velconia/fix_hang_up
...
Fix grpc destroy bug
7 years ago
Tao Luo
5a634786af
Merge pull request #12312 from luotao1/unify
...
unify libpaddle_inference_api and libpaddle_fluid
7 years ago
Bai Yifan
e12b1d1792
Add flatten op ( #12341 )
...
* add flatten op
7 years ago
Luo Tao
062556f938
Merge branch 'develop' into unify
7 years ago
chengduo
2409d0f710
Refine regularization for selected_rows ( #12369 )
...
* refine regularization for selected_rows
* clean lookup_table
* refine rpc_server_test
* temporally disable rpc_server_test
* fix rpc_server_test
* add unit test
7 years ago
tensor-tang
687a322267
Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
tensor-tang
65d418f060
complete im2col with padding==1 and speedup filter width==1
7 years ago
minqiyang
053540e199
Add volatile to stopped_ member
7 years ago
minqiyang
b78ffde6d5
Add stopped sign for grpc client
7 years ago
tensor-tang
52eb86e30f
refine im2col benchmark
7 years ago
tensor-tang
3017f46076
add more test cases
7 years ago
tensor-tang
8d6be4fb5f
refine im2col test and add benchmark
7 years ago
tensor-tang
507c143047
im2col cfo cpu code clean
7 years ago
tensor-tang
4eeed0b5e4
refine width padding and enable core copy
7 years ago
Wu Yi
73fcfc06ec
refine conv cudnn enforce ( #12353 )
...
* refine conv cudnn enforce
* update
* update all cudnn ops
* fix
7 years ago
tensor-tang
e3131e2d73
enable width padding
7 years ago
Xin Pan
d7e08c53c2
Merge pull request #12169 from panyx0718/ir_graph_sort
...
construct a SSAGraph at the beginning.
7 years ago
tensor-tang
92518c519f
reuse sizes saving time
7 years ago
tensor-tang
660df122ce
enable padding!=0 and fill height padding with 0
7 years ago
tensor-tang
d8e00facf7
reuse im_size
7 years ago
tensor-tang
179dd0cb8a
Merge pull request #12337 from tensor-tang/refine/im2col
...
refine cpu im2col no padding
7 years ago
Luo Tao
5ba4337698
unify libpaddle_inference_api into libpaddle_fluid
7 years ago
tensor-tang
b72befc5cc
reuse copy size
7 years ago
Yancey
6133efd9ed
Merge pull request #12218 from Yancey1989/rpc_complete_interface
...
Add rpc complete interface
7 years ago
Zhaolong Xing
6169d724b9
Merge pull request #12324 from NHZlX/enhance_for_tensorrt_infer
...
Enhance for tensorrt infer
7 years ago
nhzlx
4d49e61ab8
fix comments
7 years ago
tensor-tang
6788af4bf1
refine test cases
7 years ago
tensor-tang
b163e601b6
add gtest
7 years ago
nhzlx
bcd67bdd71
add assert for GetOutput
7 years ago
tensor-tang
aae994fd26
refine im2col no padding
7 years ago
Yancey1989
fb06ed7bdc
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
Yu Yang
21387e3c2a
Tiny refines for lod_tensor_blocking_queue and reshape_op
7 years ago
nhzlx
f42ea48996
deal with conflict
7 years ago
nhzlx
940f5dbcac
modify the tensorrt engine op to adapt to change
7 years ago
Yan Chunwei
02cf54d331
bugfix lod cpu performance ( #12297 )
7 years ago
Qiao Longfei
b41f8b9d42
Merge pull request #12295 from jacquesqiao/speedup-reduce-sum-grad-op
...
Speedup reduce sum grad op
7 years ago
fengjiayi
eec412b230
Merge pull request #12273 from JiayiFeng/update_py_reader
...
Some enhancement on readers
7 years ago
Xin Pan
21a45420f0
polish and test
7 years ago
Qiao Longfei
95a2b5f56a
fix mac build of sendrecvop_utils ( #12272 )
7 years ago
qiaolongfei
273f737517
optimize code
7 years ago
Xin Pan
93355cc0d2
fix control deps
7 years ago
fengjiayi
ea8a375fa4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_py_reader
7 years ago
qiaolongfei
5d718a5886
optimize reduce_sum_grad op
7 years ago
Yancey1989
d4f51218ef
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
qiaolongfei
b643473d31
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-mac-build
7 years ago
fengjiayi
060f421797
Some enhancement on readers
...
1. Make the feeding thread of py_reader a daemon thread.
2. Update buffer_reader's destructor, fixing a bug.
3. Make pyreader demo script supporting CPU environment.
7 years ago
qingqing01
873a50ce35
Fix serious bug in nesterov momentum optimizer. ( #12231 )
...
* Fix serious bug in nesterov momentum optimizer.
7 years ago
Yan Chunwei
b42ced8eda
bugfix/tensorrt analysis fix subgraph trigger ( #12266 )
7 years ago
qiaolongfei
938390b38d
fix mac build of sendrecvop_utils
7 years ago
gongweibao
3a6213f493
Change grpc interface to compatible with brpc. ( #12164 )
7 years ago
Yu Yang
b06309381b
Merge pull request #12149 from reyoung/feature/combine_open_files_and_double_buffer
...
Change and polish readers
7 years ago
tensor-tang
be04fbff42
Merge pull request #12233 from tensor-tang/refine/mkl/gemm
...
add option split mkl gemm
7 years ago
Qiao Longfei
2b58c62aa0
Update auc op ( #12199 )
...
fix AUC op
optimize it's test
7 years ago
Yancey1989
efd5a84986
update executor interface
7 years ago
tensor-tang
fc2b578842
add gemm_warp test
7 years ago
tensor-tang
a916c52579
refine gemm
7 years ago
tensor-tang
961e754c9f
mkl split gemm for better perf
7 years ago
Yancey1989
ade6675490
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
yuyang18
e9c8d930a5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
Yancey1989
d0771cf912
update
7 years ago
Yancey1989
7570d8e77c
add rpc complete interface
7 years ago
yuyang18
8c70183ba6
Polish function names
7 years ago
yuyang18
b789a3a484
Change code
7 years ago
whs
8284947b82
Fix infershape of im2sequence. ( #12183 )
7 years ago
yuyang18
401e92f6e3
Change attr comment
7 years ago
yuyang18
be528f9815
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
Tomasz Patejko
b2b8b15bfe
MKLDNN sum fix: remove in_place condition in loop creating memory primitives for sum
7 years ago
yuyang18
72b78154b2
Polish reader speed
7 years ago
Wu Yi
866fcb0c15
Merge pull request #12171 from typhoonzero/fix_pserver_with_condition_block
...
fix pserver with condition block
7 years ago
typhoonzero
32d81909dc
fix pserver with condition block
7 years ago
tensor-tang
d24fd2c6b1
Merge pull request #12099 from jczaja/prv-conv-grad-mkldnn-upstream2
...
MKLDNN: Extending Conv grad MKLDNN op with reusing MKLDNN primitives
7 years ago
yuyang18
e576345f5b
Try to speed up buffered reader
7 years ago
Wu Yi
c5619bbcde
fix auc op ( #12087 )
...
* fix auc
* update
* update
* fix compile
* fix param name
* add doc string
* fix test
7 years ago
Yancey
0042ba93c8
Merge pull request #12127 from Yancey1989/enforce_rpc_timeout
...
Enforce rpc timeout
7 years ago
yuyang18
61b3a5977f
Refine Python Reader
7 years ago
yuyang18
b048ddf0bd
Merge error
7 years ago
yuyang18
b8975d6842
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18
d36e13efd8
Merge branch 'feature/add_pyreader_demo' into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18
1478a5fc0b
Make open_files use buffer
7 years ago
yuyang18
dc34effd35
Extract buffered reader
7 years ago
yuyang18
392318045f
Merge branch 'feature/dctor_all_readers' into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18
fecbe52200
Rewrite open_files
7 years ago
Yu Yang
ba997b8ccd
Merge pull request #12097 from reyoung/feature/hide_api_cont
...
Hide internal API of LoDTensors, Clipping, etc.
7 years ago
yuyang18
c680bc1d7f
Rewrite DoubleBuffer
7 years ago
yuyang18
c9cf2bdb9c
Dctor cache
7 years ago
yuyang18
ee7d8b4d66
Refine Shutdown Impl
7 years ago
Jacek Czaja
8e20d36bc8
- comment update
7 years ago
Jacek Czaja
c981222b3b
- Conv MKLDNN grad op reuse of mkldnn primitives
7 years ago
tensor-tang
f0cd493c0d
Merge pull request #11989 from tensor-tang/feature/libxsmm
...
introduce libxsmm
7 years ago
Sylwester Fraczek
4d55aca40e
reserve vector space before loop in top-k
7 years ago
Yu Yang
ebe3b5e78a
Merge pull request #11853 from sneaxiy/complete_py_reader_python
...
Add Python Reader Op (Python side and unittests)
7 years ago
Yancey1989
4a91a14549
enforce rpc client timeout
7 years ago
Guo Sheng
da3f766821
Merge pull request #12088 from guoshengCS/complete-hsigmoid
...
Complete hsigmoid_op
7 years ago
sneaxiy
31c7f6b968
Merge branch 'develop' into complete_py_reader_python
7 years ago
fengjiayi
6ff7f2380c
Merge pull request #12063 from reyoung/feature/exception_safe_pe
...
Make scope_buffered_ssa_graph_executor Exception safe
7 years ago
tensor-tang
2f7b09319a
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
guosheng
4ee069fdba
Fix the HierarchicalSigmoidGradOpKernel and refine the codes. Now hsigmoid_op is same with V2 implementation and can pass gradient check.
7 years ago
yuyang18
c87e08c28d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exception_safe_pe
7 years ago
chenweihang
938319bbd2
Merge branch 'develop' into unsqueeze_op
7 years ago
Yibing Liu
092d620187
Merge pull request #11812 from chenwhql/squeeze_op
...
Add squeeze operator and unit testing
7 years ago
tensor-tang
1c5d6c5692
disable xsmm with float16
7 years ago
tensor-tang
c9ba51ead8
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
tensor-tang
64a8e6d20e
refine the threshold functions
7 years ago
Tao Luo
c620c522d7
Merge pull request #12093 from Noplz/fix_warning
...
fix warning
7 years ago
lemon34
29145e1e31
change im2sequence for ctc batch inference ( #11696 )
...
* change im2sequence for ctc batch inference
* Update im2sequence_op.cc
* change im2sequence for ctc batch inference
* update
* change PR by comment
* fix ocr test error
* fix test_im2sequence
* modify the old name to standard name
* fix test_layers failed
7 years ago
Noplz
cfa4479b06
fix warning
7 years ago
tensor-tang
32822b2a59
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
chenweihang
b8ea7a081a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
Jacek Czaja
fbe25ef510
MKLDNN: Extending Conv MKLDNN op to reuse MKLDNN primitives ( #11750 )
...
* - Rebase of conv reuse
- clag formatter fixes
- Fix to conv reuse
- Yet another fix
- Fix
- Fix
- clagn format
* - comment update
7 years ago
baiyf
be2d9dc2b8
Add prior_box output order control ( #12032 )
...
* Add flag to set prior_box output order.
7 years ago
guosheng
e7f7ba97fe
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into complete-hsigmoid
7 years ago
guosheng
e7a4cfc0ff
complete the hsigmoid_op
7 years ago
chenweihang
84a525a38a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
sneaxiy
f85e16f1de
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
chenweihang
0ea468225b
docs: fix some errors of description
7 years ago
chenweihang
fbef49e772
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
3d15968958
docs: fix some errors of description
7 years ago
achao2013
8e4b225fe4
Add fake_quantize_op. ( #11359 )
...
* Add a fake_quantize_op, which quantize an input tensor to a tensor with lower bits.
7 years ago
Yuan Gao
50aa6ba6f5
add rpn target assign op ( #11449 )
...
* Add region proposal network (RPN) target assign operator and Python API for Faster-RCNN.
7 years ago
chenweihang
2bd65dbf71
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
chenweihang
fd01a43a3c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
tensor-tang
7bb67b6788
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
chenweihang
cef8dbc1f7
refine some messages and adjust data type
7 years ago
chenweihang
05eafcca73
refine some messages and adjust data type
7 years ago
minqiyang
fceaabdd81
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
guosheng
d695381677
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into complete-hsigmoid
7 years ago
yuyang18
3aaf798182
Refine size_t and int
7 years ago
fengjiayi
26ae6111d1
Merge pull request #12051 from JiayiFeng/dev_reader_ResetAll
...
[WIP] Dev reader reset all
7 years ago
qingqing01
10fbb831ed
Skip BatchNorm when feature only has 1 element. ( #11578 )
...
* Fix batch norm when there is only 1 element in the normalized dimension during training.
7 years ago
chenweihang
8f2486ca16
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
tensor-tang
6bc1aaaac7
refine the ColMajor replacement
7 years ago
tensor-tang
c3862a7519
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
chenweihang
d552b900f0
change the copyright year from 2016 to 2018
7 years ago
qingqing01
ef4895df3b
Make IfElse operator works and fix unit testing. ( #11972 )
...
1. Fix bug when only true or false branch works.
2. Fix bug in unit testing.
7 years ago
tensor-tang
de856da9a6
fix ColMajor and RowMajor replacement
7 years ago
tensor-tang
00ee6c3c17
Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
fengjiayi
6d6f49cd56
Merge remote-tracking branch 'yuyang/feature/decorated_reader_chain' into dev_reader_ResetAll
7 years ago
chenweihang
7526eaaf13
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
4453473f71
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
chenweihang
1721613f1e
simplify construct function
7 years ago
fengjiayi
611716e9bc
Merge branch 'dev_reader_shutdown_start' of https://github.com/JiayiFeng/Paddle into dev_reader_shutdown_start
7 years ago
fengjiayi
0e9f1e2790
Make ReaderBase thread safe and remove ThreadedReader
7 years ago
yuyang18
e8ee9dc7f8
Several Polish
7 years ago
chenweihang
5f89272c89
change the bit insert to array insert for understandability
7 years ago
fengjiayi
b4f0e57956
fix errors
7 years ago
Tao Luo
436bb4500b
Merge pull request #11699 from pzelazko-intel/pzelazko/workaround-for-missing-mklnn-kernels
...
workaround for no MKLDNN kernel
7 years ago
fengjiayi
6fc6cc2f4c
Some updates on readers
...
1. Shrink DoubleBufferReader's buffer size to 3.
2. Add BatchReader an option to discard leftover instances.
3. Fix a MultiPassReader bug on pass count.
7 years ago
fengjiayi
5528f59900
Split ReInit() to Shutdown() and Start()
7 years ago
fengjiayi
de9a411f1c
adjust readers' inheritance relationships
...
1. Make PyReader and RandomDataGenerator inherited from FileReader.
2. Remove the member variable 'dims_' and related checks in FileReader.
7 years ago
yuyang18
c48c586aca
Use weak_ptr to implement DecoratedReaderChain
7 years ago
minqiyang
1377b332bc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
chenweihang
fccdc1abea
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
62a17f5053
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
80126a7496
small fix based reviewer's advice
7 years ago
yuyang18
8e86721fe7
Fix data balance on single GPU
7 years ago
tensor-tang
21516e5cbe
add unit test of smm
7 years ago
tensor-tang
c3941745b3
add libxsmm_gemm
7 years ago
minqiyang
2c4fb585db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
minqiyang
0d04545e9c
Remove debug info
7 years ago
chenweihang
9ca8db237a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
tensor-tang
7782a4ab53
fix blas build issue
7 years ago
tensor-tang
17987eb3fc
link libxsmm
7 years ago
minqiyang
207d1b81fe
Add fixed grpc
7 years ago
tensor-tang
3df99e72ab
Merge remote-tracking branch 'ups/develop' into refine/set_num_threads
...
fix conflicts
7 years ago
dzhwinter
4ed0b62476
Move fluid::framework::InitDevices into fluid::platform ( #11757 )
...
* move to platform
* "move init from framework to platform"
* "remove used init"
* "fix ci"
* "fix ci"
* "fix generic"
* "fix ci"
* "fix ci"
* "fix ci"
* "disable fragile test"
7 years ago
dzhwinter
99a99ec7e3
"remove lapack" ( #11966 )
7 years ago
chenweihang
a6d94e8dc6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
49b2cf5fee
adjust some code based reviewer's advice
7 years ago
sneaxiy
9b28260029
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
sneaxiy
739c330914
fix merge conflict
7 years ago
fengjiayi
ce16b40b04
Merge pull request #11891 from JiayiFeng/dev_eof_exp
...
Add EOFException to represent EOF in C++ reader
7 years ago
chenweihang
79333fa7b8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
ca15779394
rewrite, use reshape op in unsqueeze op, test passed
7 years ago
Xin Pan
71b1c397d7
Merge pull request #11874 from panyx0718/move_trainer
...
Move trainer and utils api
7 years ago
Xin Pan
d70a38d8ec
fix
7 years ago
yuyang18
c31519036b
Merge branch 'squeeze_op' of https://github.com/chenwhql/Paddle into pr/11812
7 years ago
yuyang18
1854814d49
Use reshape_op inside squeeze_op
...
* also convert tab to space
7 years ago
Xin Pan
94cb59ad09
hide utils to legacy
7 years ago
chenweihang
ee760d1c2d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
chenweihang
0cef33a468
adjust the dims range to [1,6] and fix some problem
7 years ago
Yancey
f7fd711e3f
Merge pull request #11868 from Yancey1989/dist_pass_barrier
...
add dist pass barrier
7 years ago
yuyang18
3777f10286
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into pr/11812
7 years ago
Yu Yang
9401b64d61
Merge pull request #11877 from reyoung/feature/fix_reshape_op_size
...
User can register a standard C++ functor as Kernel
7 years ago
chenweihang
996c157f61
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
e402496238
complete unsqueeze op and related unittest.
7 years ago
fengjiayi
3fab4f65a4
Add EOFException to represent EOF in C++ reader
7 years ago
minqiyang
1d6ecd3c4e
Change grpc version to 1.13.x
7 years ago
yuyang18
550ab8d723
Use single file than multiple files
7 years ago
Paweł Żelazko
ac323343a0
typos fix
7 years ago
yuyang18
6038a63120
Fix fc mkldnn op
7 years ago
yuyang18
82866d4a18
Add register kernel functor and shrink reshape op
...
* Shrink reshape_op library size
* User can register a standard C++ functor as a op kernel
7 years ago
fengjiayi
58560622bc
Merge pull request #11854 from JiayiFeng/dev_data_balance
...
Data balance for the ParallelExecutor
7 years ago
yuyang18
1ce478f100
Polish reshape op
7 years ago
Yancey1989
37410a0c75
update by comment
7 years ago
chenweihang
9ca88fa8a5
Adjust squeeze op and code the unittest, test passed
7 years ago
sneaxiy
3f9292c6e6
fix merge conflict
7 years ago
sneaxiy
dd70fb4393
fix type comparison bugs
7 years ago
Xin Pan
982dabe293
Merge pull request #11866 from panyx0718/move_func
...
Move some v2 codes to a legacy directory.
7 years ago
Xingyuan Bu
5056d3ec56
FasterRCNN Anchor Generator Op ( #11218 )
...
* Add anchor generator operator for Faster-RCNN.
* Add unittest testing.
* Add Python API.
7 years ago
Yibing Liu
5f79c7fbb6
Merge pull request #11174 from kuke/argsort_dev
...
Add the argsort operator
7 years ago
Yancey1989
029425a5f4
update
7 years ago
Yancey1989
c1ab215e26
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dist_pass_barrier
7 years ago
Yancey1989
1366832a41
add dist pass barrier
7 years ago
Xin Pan
a9086bf320
also move a few other dir to legacy/
7 years ago
gongweibao
66c91911cf
Improve brpccmake ( #11842 )
7 years ago
Yibing Liu
9386ac0a40
Enhance cuda code & unittest for argsort_op
7 years ago
guochaorong
c318aa5ffa
Merge pull request #11850 from guochaorong/revert_11496
...
Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray …
7 years ago
fengjiayi
49a04d75ee
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi
4b950951d3
Add unittests and fix a few bugs
7 years ago
chenweihang
a1e7f2d520
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang
70729ad641
Add Unsqueeze Operator Framework, not finished
7 years ago
guochaorong
6a35899131
Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )"
...
This reverts commit bc28cf613f.
7 years ago
chenweihang
298e74da1e
add squeeze op c++ part, compile success
7 years ago
fengjiayi
5b4f283069
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_data_balance
7 years ago
fengjiayi
b6dc3a59f1
Add DataBalanceOpHandle to MultiDeviceSSAGragh
7 years ago
mozga-intel
b8a04c2fa1
Duplicated code was moved to common function
7 years ago
mozga-intel
3b128337a1
The mkldnn batch norm supports other data format
7 years ago
Xin Pan
2ecc56226d
small AverageOptimizer enhance. ( #11761 )
...
* small AverageOptimizer enhance.
* clean
* clean
7 years ago
Yan Chunwei
5082642bdb
feature/analysis to support sub-graph for TRT engine ( #11538 )
7 years ago
Haichao Zhang
bc28cf613f
Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )
...
* Add fill_zeros_array op. This op is used for zero-filling an LoDTensorArray.
* merge fill_zeros_array_op with fill_zeros_like_op
* add unit_test for fill_zeros_like for array
7 years ago
Qiao Longfei
593bbfe392
Merge pull request #11765 from jacquesqiao/fix-adam-op-for-selectedrows
...
fix adam op for selected rows
7 years ago
qiaolongfei
20fae68136
adam op handle grad.rows().size == 0 condition
7 years ago
pzelazko-intel
9a15c92317
bnorm+relu fuse for mkldnn (inference) ( #11434 )
...
* bnorm+relu fuse for mkldnn
* separate fuse_relu function
* bug fix
* proper while range in inference_transpiler
* description fix
* review fix
* review fix
* unit test for fwd batch norm+relu MKLDNN fuse
7 years ago
baiyf
778b71fc93
Optimize bipartite_match_op in large scale input ( #11730 )
...
* optimize bipartite_match_op in large scale input
7 years ago
qiaolongfei
df7a266ae2
fix adam op for selected rows
7 years ago
tensor-tang
e3a96300bb
move SetNumThreads to platform
7 years ago
qingqing01
b756063ce7
Speed depthwise transposed conv2d. ( #11740 )
...
* Speed depthwise transposed conv2d.
7 years ago
Qingsheng Li
8630ba2eb1
Fix sequence expand op ( #11618 )
...
* Set zero outside functor
7 years ago
sneaxiy
01fbcb0bbb
Merge pull request #11695 from sneaxiy/complete_py_reader_cpp
...
Add Python Reader Op (CPP side)
7 years ago
Guo Sheng
8df303c09b
Merge pull request #11238 from guoshengCS/fix-beam_search
...
Fix and enhance beam_search_op and beam_searc_decode_op
7 years ago
guosheng
d15b2e02c8
Fix copying empty tensor in beam_search_decode_op
7 years ago
sneaxiy
d4d946db5a
update blocking queue
7 years ago
tangwei12
a1f1a5ed8a
rename grpc to rpc ( #11717 )
7 years ago
Tao Luo
2dae8a4631
Merge pull request #11596 from tensor-tang/refine/mklml/dyload
...
enable dynamic load mklml lib on fluid
7 years ago
tangwei12
b20fa022ed
Merge pull request #11490 from seiriosPlus/ckpt_m2
...
Checkpoint M2: lookup table checkpoint
7 years ago
gongweibao
ab0c2e1dab
Fix rpc_deadline ( #11709 )
7 years ago
tangwei12
f57978e6b5
renae
7 years ago
tangwei12
b6e63554e6
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into ckpt_m2
7 years ago
Yancey
991cedb4c3
Merge pull request #11702 from Yancey1989/fix_async_update_failed
...
Fix async update failed
7 years ago
Tomasz Patejko
e26f51ce74
MKLDNN elementwis_add with default broadcast operations ( #11544 )
...
* elementwise_add with bcast: Brian's implementation by Brian added, with default bcasts
* elementwise_add with bcast: GetExpectedKernelType added to elementwise_op
* elementwise_add with bcast: use_mkldnn attribute added
* elementwise_add with bcast: changes after review and some formatting
* elementwise_add with bcast: changes after style check
* elementwise_add with bcast: changes after style check cont.
* elementwise_add with bcast: MKLDNN unittests added
* elementwise_add with bcast: original unittests with use_mkldnn flag
* elementwise_add with bcast: handling of MKLDNN format corrected
* elementwise_add with bcast: setting MKLDNN format turned into lambda
* elementwise_add with bcast: MKDNN format setting turned into separate function
* elementwise_add with bcast: condition for choosing MKLDNN simplified
* elementwise_add with bcast: fix for MKLDNN format set incorrectly in bcasts
* elementwise_add with bcast: changes in unittests for broadcasts
* elementwise_add with bcast: fixes in unittests regarding dimensions
* elementwise_add with bcast: bring back correct format setting in mklml grad path
* elementwise_add with bcast: fixed compilation error
7 years ago
Yancey1989
bf0c90f241
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_async_update_failed
7 years ago
Yancey1989
86e09b34e3
fix async update error on pserver
7 years ago
tangwei12
dc847f129e
bug fix and code optimize
7 years ago
sneaxiy
67556e4aa4
update blocking queue
7 years ago
sneaxiy
502faf62a9
complete_py_reader_cpp
7 years ago
sneaxiy
7b2339d7c5
delete create_py_array_reader_op.cc
7 years ago
sneaxiy
2dcf0e4e66
delete py_array_feed_queue.h
7 years ago
sneaxiy
748e204eff
Revert "refine ZeroGradFunctor in activation_op.h"
...
This reverts commit 1eeb11ef61.
7 years ago
fengjiayi
b4f28ccc19
Merge pull request #11632 from JiayiFeng/some_small_fixes
...
Some small fixes
7 years ago
guosheng
983566d946
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search-dev
7 years ago
guosheng
4e9d7ddbc6
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search-dev
7 years ago
Wu Yi
acfd177d0c
Retry rpc calls ( #11651 )
...
* make deadline configurable
* configurable deadline
* update
* fix grpc deadline exceeded
7 years ago
Yibing Liu
9c69fdf5c6
Merge branch 'develop' of upstream into argsort_dev
7 years ago
fengjiayi
e1a46bba59
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into some_small_fixes
7 years ago
tangwei12
fb7e479188
merge paddle develop
7 years ago
tangwei12
b519bf05d0
log level optimize
7 years ago
sneaxiy
9b63fef32d
delete some redundant comments
7 years ago
sneaxiy
697ba4b13d
Add Python array reader op
7 years ago
sneaxiy
2c12af76c5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
Yancey
0d4b376fc0
Merge pull request #11585 from Yancey1989/fix_pserver_sub_blocks
...
fix pserver sub-blocks
7 years ago
Tao Luo
60647c9aa4
Merge pull request #11519 from jczaja/prv-softmax-mkldnn-grad-operator
...
MKLDNN: SoftmaxGrad Op
7 years ago
Yancey1989
8cb494f79c
add blocks attr type in proto
7 years ago
Yancey1989
aa84b21e3b
fix unit tests
7 years ago
gongweibao
dbca7f166d
tune logs ( #11649 )
7 years ago
tangwei12
4388ce112e
checkpoint notify op optimized
7 years ago
tangwei12
7fae9e0a7b
checkpoint feature optimized
7 years ago
tangwei12
e684575f66
checkpoint feature optimized
7 years ago
Yancey1989
d723022e1b
fix compile error
7 years ago
Yancey1989
56a903d3ac
use optimize block list instead of first optimize block
7 years ago
Jacek Czaja
98f3ad3ba1
- MKLDNN Softmax Grad Op
...
- Added hash function inside of MKLDNN softmax op to be used as handle for storing primitives in a context
- Style fixes to softmax mkldnn op
- Fixes after review
- Coding style
- Fix to style
- style fixes
- style fix
- style fixes
- Fix to code style check
- Rephrasing a comment
fix to broken merge
Fixes to rebase
Conflicts:
benchmark/fluid/models/machine_translation.py
cmake/external/mkldnn.cmake
paddle/fluid/operators/softmax_mkldnn_op.cc
- Bumped revision of MKL-DNN up to have softmax backward primitive
- Added choosing MKLDNN softmax grad operator
- First reuse of softmax backward
- Reinvented reusing for softmax
- Fix to crash in reinvented reuse
- Clang format fixes
- Clang format fixes
- Improved softmax mkldnn reuse mechanism
- clang format fixes
- Fix to broken merge
- Fix
7 years ago
tensor-tang
d5fb8fa778
Revert "Merge pull request #11628 from PaddlePaddle/revert-11102-mozga-intel/Sum_mkldnn_layout"
...
This reverts commit 4d8e8ee226, reversing
changes made to d6a9f005c8.
7 years ago
fengjiayi
bc9d19c7d6
fix a bug
7 years ago
fengjiayi
32478fe0ea
Make buffers of DoubleBufferReader and open_files bigger
7 years ago
Yancey1989
3a37e14285
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_pserver_sub_blocks
7 years ago
Yancey1989
0970bd9edc
use optimize blocks attr to record optimize block id
7 years ago
fengjiayi
9dccca963b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into some_small_fixes
7 years ago
fengjiayi
e71948f167
Refine random crop
...
1. Add a new attribute named 'startup_seed' to RandomCropOp. If the input
'Seed' is not initialized, the 'startup_seed' will be used to replace
it.
2. Refine CustomReader. Add a member variable 'scope_' to it. The
'scope_' will act as the global scope of preprocessing, making it
possible to save something across batches.
7 years ago
tangwei12
620999c917
save checkpoint bug fix
7 years ago
tensor-tang
1f09ddf806
Merge remote-tracking branch 'ups/develop' into refine/mklml/dyload
7 years ago
tensor-tang
90780e22ce
Revert "MKLDNN layout: Support for sum operator"
7 years ago
tangwei12
97648442cd
merge develop
7 years ago
fengjiayi
732eef57f5
Register assign_value_op an empty grad_op
7 years ago
tangwei12
e589005229
merge
7 years ago
sneaxiy
7ecb7b7dba
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into python_data_feeding
7 years ago
Yancey
c413c46915
Merge pull request #11603 from Yancey1989/rename_details_folder
...
move dist codes from operators/detail to operators/distributed
7 years ago
Tao Luo
bfe5dc6312
Merge pull request #11607 from chengduoZH/fix_concat_warning
...
Fix concat compile warning
7 years ago
Yancey1989
712adc786f
polish dist cmake
7 years ago
chengduoZH
804c767107
fix concat warning
7 years ago
Yancey1989
e02cbf3538
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_pserver_sub_blocks
7 years ago
Yancey1989
1ef6cdb60e
move dist codes from operators/detail to operators/distributed
7 years ago
fengjiayi
6e1c48d1fd
Merge pull request #11576 from JiayiFeng/dev_refine_bilinear_interp
...
Add bilinear interp support for uint8
7 years ago
Tao Luo
8d5ab1f9cd
Merge pull request #11595 from luotao1/refine_code
...
add url of cuda9.0_cudnn7_avx_mkl library
7 years ago
Yu Yang
5f0c780af6
Merge pull request #11534 from typhoonzero/try_fix_logical_doc
...
try fix logical op doc
7 years ago
fengjiayi
47c02b5c32
Add unit tests
7 years ago
Luo Tao
5aac910b89
add url of cuda9.0_cudnn7_avx_mkl library
7 years ago
tangwei12
298588f8c7
Merge branch 'ckpt_m2' of github.com:seiriosPlus/Paddle into ckpt_m2
7 years ago
tangwei12
91eae9cc91
code style
7 years ago
gongweibao
19958eeb71
fix ( #11590 )
7 years ago
tensor-tang
f503f12925
enable dynamic load mklml lib on fluid
7 years ago
sneaxiy
28ff4bdd1e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into python_data_feeding
7 years ago
tangwei12
5a4a24cce1
Merge branch 'develop' into ckpt_m2
7 years ago
fengjiayi
12619fcf90
fix a compile error
7 years ago
fengjiayi
8567d0424d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_refine_bilinear_interp
7 years ago
tangwei12
db6126ca99
code style
7 years ago
Yancey1989
b54d1ba968
fix pserver sub-blocks
7 years ago
Qiao Longfei
4116b55163
Merge pull request #11579 from jacquesqiao/fix-concat-grad-kernel
...
fix concat grad kernel
7 years ago
Yan Chunwei
d020d7fd29
add beam search doc ( #11469 )
7 years ago
qiaolongfei
762160bd8c
fix concat grad kernel
7 years ago
tangwei12
8af4d4c7a0
code style
7 years ago
Tao Luo
5972990295
Merge pull request #11523 from mozga-intel/mozga-intel/Gausian_random_mkldnn_layout
...
MKLDNN layout: Gaussian random layout
7 years ago
tensor-tang
64045c29c2
Merge pull request #11102 from mozga-intel/mozga-intel/Sum_mkldnn_layout
...
MKLDNN layout: Support for sum operator
7 years ago
tangwei12
32fa832b4b
code style
7 years ago
qingqing01
9c90dc9728
Make the CUDA kernel of concat correct and fix unit tests. ( #11541 )
...
* Make the CUDA kernel of concat correct and fix unit tests.
7 years ago
fengjiayi
5d33481c37
Add bilinear interp support for uint8
7 years ago
Yu Yang
dad15b6e9a
Merge pull request #11563 from panyx0718/doc
...
Refine detection_map doc.
7 years ago
tangwei12
06f6c21303
bug fix
7 years ago
tangwei12
28482f81a8
bug fix
7 years ago
mozga-intel
7b9aa60198
MKLDNN gaussian_random tests
7 years ago
mozga-intel
b88cda84f4
MKLDNN sum unit-test
7 years ago
tangwei12
6abf07693a
checkpoint_notify_id rename
7 years ago
tangwei12
16ecead837
load op optimize
7 years ago
mozga-intel
701102283c
MKLDNN layouts: Gaussian random layout
7 years ago
tangwei12
49c2d0c5fb
bug fix
7 years ago
guosheng
f4bdfbc697
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search-dev
7 years ago
sneaxiy
5c9dcc4d7d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into argmin_argmax
7 years ago
tangwei12
8c0e1d5cba
unittest case fix
7 years ago
mozga-intel
6512be59ec
MKLDNN layout: the code-review changes
7 years ago
mozga-intel
96b4904d2f
MKLDNN layout: Support for sum operator
7 years ago
tangwei12
d93dc81c4e
add handle when checkpoint_notify_id = -1
7 years ago
Xin Pan
efcbe27263
Refine detection_map doc.
7 years ago
tangwei12
7efd73ac53
code clean
7 years ago
Qiao Longfei
69827f305b
Merge pull request #11527 from jacquesqiao/concat-grad-support-data-input
...
concat support data as input
7 years ago
tangwei12
5250ca8c87
bug fix
7 years ago
tangwei12
459690ae3b
bug fix
7 years ago
tangwei12
620698e7e6
bug fix
7 years ago
tangwei12
ee64f577d4
load op add SelectedRows
7 years ago
tangwei12
ca27f78e29
load op add SelectedRows
7 years ago
tangwei12
a501766ab1
load op add SelectedRows
7 years ago
tangwei12
549f0aa0d3
load op add SelectedRows
7 years ago
tangwei12
af0a6a149f
checkpoint notify
7 years ago
tangwei12
ae12281d9b
checkpoint notify
7 years ago
tangwei12
30880844bb
merge develop
7 years ago
tangwei12
5553adf85d
move checkpoint message to variable message
7 years ago
tangwei12
8af8da4fe4
move checkpoint message to variable message
7 years ago
tangwei12
85215df087
move checkpoint message to variable message
7 years ago
mozga-intel
792d3b2406
MKLDNN layout: Support for activation operator
7 years ago
Yibing Liu
e710d2c6d8
Merge branch 'develop' of upstream into argsort_dev
7 years ago
tangwei12
527b86b7d0
bug fix
7 years ago
Yibing Liu
a523b6f49f
Add python api for argsort_op
7 years ago
Yan Chunwei
d734595978
Feature/pass manager ( #11440 )
7 years ago
tangwei12
bbb349fbf0
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
15532c74b1
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
54013a93b1
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
050b66e27c
add RequestCheckpointNotify in grpc
7 years ago
Yibing Liu
7ca511e0c6
Merge branch 'develop' of upstream into argsort_dev
7 years ago
tangwei12
74384b750e
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
36d17d11a4
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
a9ac2007f2
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
925e2324b3
add RequestCheckpointNotify in grpc
7 years ago
yi.wu
acc7451853
update
7 years ago
yi.wu
f297e9ecbf
try fix logical op doc
7 years ago
Yibing Liu
92cfa2be3a
Avoid using dynamic array in cuda kernel
7 years ago
Yu Yang
16a0f746f8
Merge pull request #11383 from jacquesqiao/update-api-reference-1
...
update split_lod_tensor, create_array and array_length doc
7 years ago
Yu Yang
ce60bbf506
Merge pull request #11314 from typhoonzero/fix_api_reference_docs
...
Fix api reference docs
7 years ago
yuyang18
7a56705e4a
polish doc
7 years ago
gongweibao
962711dc3f
Add some paddleenforce. ( #11516 )
7 years ago
yuyang18
fda1a78844
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api_reference_docs
7 years ago
qiaolongfei
b77c886ed4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
82a4cf1960
update image_resize_short and shape doc
7 years ago
Yu Yang
c2e8f40dcb
Merge pull request #11492 from dzhwinter/doc/api1
...
[API Reference] fix some typo in layers
7 years ago
Yu Yang
1171c2c57d
Merge pull request #11457 from JiayiFeng/dev_add_doc
...
Add API reference
7 years ago
qiaolongfei
d1a8498b22
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
ad1ad738d8
add gpu support for concat
7 years ago
chengduo
ea03a2287e
Merge pull request #11513 from chengduoZH/refine_conv3d_doc
...
Fix conv3d/conv3d_trans/slice/mean_iou Doc
7 years ago
qiaolongfei
9c128fe656
concat support data as input
7 years ago
qiaolongfei
bf3ff5b091
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
gongweibao
dd55cc1647
fix warning ( #11518 )
7 years ago
Yibing Liu
3438dfecce
Merge pull request #11491 from kuke/fix_docs
...
Fix some problems in api reference
7 years ago
Tao Luo
3a4b6cdaa0
Merge pull request #11488 from luotao1/softmax_doc
...
add doc of sequence_softmax and parallelDo
7 years ago
Yibing Liu
f3a777d8e2
Fix the display of reciprocal's formula
7 years ago
tangwei12
985026ce42
add checkpoint_notify in python
7 years ago
tangwei12
1c2e9bdd49
fix cmakelist
7 years ago
Yibing Liu
bf5ce626c0
Merge branch 'develop' of upstream into fix_docs
7 years ago
fengjiayi
bcae8729ac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
tangwei12
860360d96d
bug fix
7 years ago
dzhwinter
4970414b1c
Merge remote-tracking branch 'origin/develop' into doc/api1
7 years ago
tangwei12
8d46d1ddf2
bug fix
7 years ago
tangwei12
f224948f31
bug fix
7 years ago
qiaolongfei
8f59d79d75
update doc for sigmoid_cross_entropy_with_logits
7 years ago
Luo Tao
8415e18b3f
Merge branch 'develop' into softmax_doc
7 years ago
qingqing01
cc1239ffc9
Update some doc about API reference. ( #11495 )
...
* Update some doc about layers' API.
* Fix format.
* Fix example bug in random_data_generator.
* Fix example bug in dropout.
* Follow comments and some small fix for some examples.
7 years ago
fengjiayi
29ddf6c5d8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
qiaolongfei
5b50307bf8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
6ace04f655
update
7 years ago
Tao Luo
5ea039b3c1
Merge pull request #11470 from typhoonzero/fix_unitests
...
Fix dist ut
7 years ago
chengduoZH
7b82353010
fix conv3d/conv3d_trans/slice/mean_iou doc
7 years ago
tangwei12
98c30c7cbe
bug fix
7 years ago
Yibing Liu
67dc5c7f8a
Polish the doc of nce layer
7 years ago
dzhwinter
6ac8383f28
"fix based comments"
7 years ago
tangwei12
fe76244f0e
bug fix
7 years ago
weixing02
1021089cda
fix
7 years ago
weixing02
ee13b396f2
fix some errors
7 years ago
Yibing Liu
279ebdd0b2
Fix reciprocal op's doc
7 years ago
tangwei12
fb27c9a5a3
bug fix
7 years ago
tangwei12
1cb0ab36f0
bug fix
7 years ago
tangwei12
bb17604b36
bug fix
7 years ago
Luo Tao
1958654d6f
refine \odot in elementwise_mul
7 years ago
tensor-tang
0329ee7496
Merge pull request #11497 from tensor-tang/doc
...
Doc update
7 years ago
Yan Chunwei
5fd142c3fd
bugfix/trt engine op ( #11487 )
7 years ago
qiaolongfei
3a25ceebf2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
Yu Yang
745ea4dcf0
Merge pull request #11354 from reyoung/feature/polish_doc
...
Polish documentation
7 years ago
dzhwinter
1f38cbf79b
"fix some typo"
7 years ago
yi.wu
3380737cb7
update by comment
7 years ago
fengjiayi
d684b49cb0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
fengjiayi
d91060d300
fix errors
7 years ago
weixing02
8bd148dc00
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into hsigmoid_op
7 years ago
dzhwinter
45f18b0062
Merge remote-tracking branch 'origin/develop' into doc/api1
7 years ago
chengduo
8bc74f10e9
Merge pull request #11476 from chengduoZH/enable_more_type_for_SplitOpKernel
...
Enable more type for SplitOp and ConcatOp
7 years ago
Xin Pan
261d5f0232
Merge pull request #11409 from panyx0718/doc
...
Add API docs.
7 years ago
tensor-tang
98ab2b403e
polish doc: softshrink, assign, shuffle
7 years ago
tensor-tang
24fea628cc
polish doc: mean
7 years ago
dzhwinter
16a3d88a20
fix typo
7 years ago
tangwei12
b089b80988
update rpc to add checkpoint notify
7 years ago
tangwei12
12de20f5f7
add checkpoint_notify_op for trainer to notify pserver, update listen_and_serv_op
7 years ago
chengduoZH
ca743de2e0
enable more type for splitOp and ConcatOp
7 years ago
yi.wu
40c631e554
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_unitests
7 years ago
yi.wu
44925eb4c2
fix dist ut
7 years ago
yuyang18
055df47035
Polish code
7 years ago
yuyang18
cbc1b7f1ce
Polish documentation
7 years ago
fengjiayi
8a136d142f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
fengjiayi
980499faf1
fix errors
7 years ago
sneaxiy
fa002cd67c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into argmin_argmax
7 years ago
tensor-tang
9169b3b802
Merge pull request #10789 from Xreki/core_fix_openblas_threads
...
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
Qiao Longfei
cbaa24f597
Merge pull request #11450 from jacquesqiao/fix-prefetch-bug
...
out var must be created in local scope for prefetch
7 years ago
whs
6fcdb240fa
Add mean IOU op. ( #10519 )
...
* Add mean_iou op.
* Add unit test for mean iou op.
* Add optional collections of confusion matrix and mean_iou.
* Fix cuda kernel.
* Refine code.
1. Merge the GPU computation into two kernels.
2. Use wrong array and correct array instead of confusion matrix.
* Add python api and fix cuda kernel.
* Fix comments.
* Small fix.
* Small fix.
7 years ago
qiaolongfei
490a07f5b6
add comment that out var of prefetch must be created in local scope
7 years ago
guosheng
49ca424d6e
Fix src_idx out of range in beam_search_op
7 years ago
qiaolongfei
a49ee22e31
fix a bug in prefetch
7 years ago
tangwei12
41701969a9
[wip] ckpt m2 develop
7 years ago
yi.wu
a83b792ada
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api_reference_docs
7 years ago
yi.wu
c58ba827bb
update
7 years ago
qiaolongfei
fd9b650d7f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
yuyang18
674327a4b1
Polish several API
7 years ago
yuyang18
ce6394ed73
Polish example
7 years ago
Qiao Longfei
7c426be98c
Merge pull request #11342 from jacquesqiao/add-merge-splited-ids
...
Add merge_ids_op
7 years ago
qiaolongfei
e6f54d5aa2
update comment
7 years ago
qiaolongfei
2e48ab623e
add more detailed comment
7 years ago
tensor-tang
647c0eb7ec
Merge pull request #11099 from mozga-intel/mozga-intel/Conv_mkldnn_layout
...
MKLDNN layout: Support for convolution operator
7 years ago
Wu Yi
688e18508f
Merge pull request #11423 from typhoonzero/fix_develop_bugs
...
Fix nccl dist train bug
7 years ago
yuyang18
b9843abb61
Polish cumsum, DynamicRNN
7 years ago
qiaolongfei
cdd55dbcaa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
guosheng
35e32a8e15
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search
7 years ago
yi.wu
d76ebd7853
fix nccl dist train bug
7 years ago
yuyang18
54002c3bab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_doc
7 years ago
tensor-tang
489f581d49
Merge pull request #11098 from mozga-intel/mozga-intel/Batch_norm_mkldnn_layout
...
MKLDNN layout: Support for batch norm operator
7 years ago
Yibing Liu
28a0ac535a
Merge branch 'develop' of upstream into argsort_dev
7 years ago
Yibing Liu
98460c009e
Simplify the computation in cpu
7 years ago
guosheng
92a7766ecb
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search
7 years ago
guosheng
592f84a4af
Complete the docs of beam_search_op, beam_search_decode_op and the python wrapper
7 years ago
qiaolongfei
d6c8d2675c
optimize code and comment
7 years ago
Yibing Liu
94e72ea6e7
Support more negative axes in argsort_op
7 years ago
Xin Pan
5d0bf8bc8f
Add API docs.
7 years ago
Yibing Liu
42645ff779
Compute target index on gpu
7 years ago
qiaolongfei
e72eb0edec
small update
7 years ago
fengjiayi
88fa9c2ec7
Merge pull request #11267 from JiayiFeng/fix_reader_bug
...
Fix a multi-thread bug in readers
7 years ago
yuyang18
ff55d4c593
Polish documents
...
* less_than
* cumsum
* multiplex
* open_recordio_file
7 years ago
Yibing Liu
6ee22c4f71
Add gpu kernel for argsort op
7 years ago
yi.wu
7b54b30be5
follow comments
7 years ago
yi.wu
1457be3e7b
fix_api_reference_docs
7 years ago
qiaolongfei
f031555cfb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei
431491a2a8
Merge pull request #11366 from jacquesqiao/refine-prefetch
...
Refine prefetch
7 years ago
Wu Yi
34865f2de3
Trainer send term signal ( #11220 )
...
* wip
* use executor.complete to end trainer
* fix build
* fix build with distribute off
* fix typo
* fix cmake typo
* fix build
7 years ago
Tao Luo
ca4d528b35
Merge pull request #11369 from luotao1/refine_doc
...
refine docs of elementwise_op etc.
7 years ago
Luo Tao
c4c787337a
update with comments
7 years ago
sneaxiy
1eeb11ef61
refine ZeroGradFunctor in activation_op.h
7 years ago
qiaolongfei
2b9ff39f5f
fix the default value prefetch_var_name_to_block_id
7 years ago
qingqing01
19fd071785
Make the normalization operator more general and fix bug in l2_normalize. ( #11348 )
...
* Add normalization operator.
1. Refine the raw norm_op and make it more general, so that it can normalize a Tensor along any axis.
2. Fix a bug in the l2_normalize API, which lacked a sqrt after `reduce_sum`.
3. Use norm_op to refine the l2_normalize API.
4. Fix bug in test_normalization_wrapper.py.
7 years ago
whs
adc09087c1
Add slice op. ( #11052 )
...
* Add slice op.
* Remove using from header file and fix doc.
* Fix doc
* Small fix.
7 years ago
qiaolongfei
6dd3f3cf27
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei
16658f7b59
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
Xin Pan
1d198494d7
Merge pull request #11370 from panyx0718/dist
...
Make status update thread-safe
7 years ago
qiaolongfei
fe65064827
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
Luo Tao
7bdb573d79
update with comments
7 years ago
qiaolongfei
506fc8d9e8
optimize code
7 years ago
gongweibao
d9de6b8621
Add brpc support. ( #11263 )
7 years ago
Xin Pan
1509ae3a53
Make status update thread-safe
...
The status is updated in the Process() thread
and can be checked in another HandleRequest() thread.
7 years ago
qiaolongfei
ea106c91e0
optimize comment and code
7 years ago
Luo Tao
7694199050
refine docs of elementwise_op etc.
7 years ago
qiaolongfei
7f4b9656a4
set status before Finish in prefetch process
7 years ago
dzhwinter
bfa3fd6f15
add inplace attribute to op_proto_maker ( #10665 )
...
* "add inplace attribute"
* "register inplace attribute"
* "change se-next model for memory-reuse"
* "fix typo"
* repick
* fix merge conflict
* "fix stupid error"
7 years ago
qiaolongfei
5aba10b585
set the thread pool of prefetch to 1 to fix a bug
7 years ago
qiaolongfei
8fb78f6c07
fix grpc_server_test
7 years ago
tensor-tang
b3fd9da60e
Merge pull request #11101 from mozga-intel/mozga-intel/Pool_mkldnn_layout
...
MKLDNN layout: Support for pool operator
7 years ago
qiaolongfei
4e36c0ecab
update prefetch logic in grpc_server
7 years ago
gongweibao
627d7a64f8
Clean `sendop` `recv` operator. ( #11309 )
7 years ago
yuyang18
2955ff5887
Polish documentation
...
* row_conv
* uniform_random
* layer_norm
* create_parameter
* hard_shrink
* ssd_loss
7 years ago
qiaolongfei
0d3d4ae775
refine prefetch logic
7 years ago
yi.wu
ff4efb84d9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api_reference_docs
7 years ago
sneaxiy
831909ce69
Merge pull request #11313 from sneaxiy/argmin_argmax
...
Add argmin and argmax op
7 years ago
yuyang18
9b43edeae0
Polish arg_min_max_op
...
* Remove unused arg_max/min_op.h
* Remove reference parameter. Use a pointer instead.
* undef macro
* Always set OutT as int64_t.
7 years ago
Yu Yang
9328c3cf7b
Merge pull request #11308 from reyoung/feature/polish_api_ref
...
Simplify API Reference Documentation
7 years ago
mozga-intel
7d5643562f
MKLDNN layout: Support for batch norm operator
7 years ago
mozga-intel
9908d3cfbc
MKLDNN layout: Support for convolution operator
7 years ago
mozga-intel
36031cb50f
MKLDNN layout: Support for pool operator
7 years ago
qiaolongfei
509cb0bc76
add unit test, pass the unit test
7 years ago
qiaolongfei
7cebec4b7e
init merge_ids_op
7 years ago
Wu Yi
7bcc98089a
Merge pull request #11321 from Yancey1989/polish_sparse_update
...
polish sparse update logic
7 years ago
sneaxiy
6d32e96096
remove redundant comments
7 years ago
Yancey1989
56964946d4
polish sparse update logic
7 years ago
guochaorong
04b8d3d03c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into paddle_fix
7 years ago
guochaorong
0fec9469f9
fix some bugs introduced by unfreed memory
7 years ago
yi.wu
efcff3d9e5
polish api ref docs
7 years ago
yuyang18
8c9041f486
Refine LinearCRF
7 years ago
sneaxiy
568c4e5ec4
recommit using account sneaxiy
7 years ago
yi.wu
5be454bf33
polish docs
7 years ago
yuyang18
0d29e65924
Add resize_bilinear
7 years ago
guosheng
5e20a8ef93
Make python unit test of beam_search_op and beam_search_decode_op run correctly
7 years ago
yuyang18
b000e0de5d
Simplify API Reference Documentation
7 years ago
fengjiayi
b587a7f66e
Merge pull request #11293 from JiayiFeng/update_crop_op
...
Update crop op
7 years ago
Xin Pan
259e63d4a1
Merge pull request #11248 from panyx0718/dist
...
Fix sparse vars usage for dist train
7 years ago
Xin Pan
2d7c836d32
Merge pull request #11295 from panyx0718/doc
...
Refine API doc string
7 years ago
Yu Yang
8deff48db0
Merge pull request #11081 from reyoung/feature/python_doc
...
Add document to random crop operator
7 years ago
fengjiayi
c7bbfb33ad
Fix a GPU bug
7 years ago
Yancey1989
1239fce771
polish sparse update code
7 years ago
Xin Pan
e80c6b3c24
Refine API doc string
7 years ago
fengjiayi
24649a780d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_crop_op
7 years ago
dzhwinter
d48172f22a
split reduce op into multiple libraries, accelerate the compiling ( #11029 )
...
* "split into multiple .ccl"
* "refine file structure"
* "refine files"
* "remove the cmakelist"
* "fix typo"
* "fix typo"
* fix ci
7 years ago
fengjiayi
9c61409a18
Make crop op support taking offsets as one of its inputs
7 years ago
guosheng
a281e1016e
Make cc_test of beam_search_op and beam_search_decode_op run correctly
7 years ago
fengjiayi
4f46a98fa9
stash
7 years ago
mozga-intel
3ff9ba0e6b
Mkldnn layout ( #11040 )
...
* Add MKLDNN layout support in Paddle
Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded in all MKLDNN op kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitive, which
gave worse performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.
* Add MKLDNN layout support in activation OP
* Don't populate layout from input to output when kMKLDNN in
* Refine pool mkldnn op kernel
* MKLDNN layout
* Remove the inheritance from tensor file
* MKLDNN layout: refactoring
* Remove additional #define to register new operator
* Prepare mkldnn tests to work with layout
7 years ago
fengjiayi
499dbe0536
fix a multi-thread bug in readers
7 years ago