Yancey1989
3a37e14285
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_pserver_sub_blocks
7 years ago
Yancey1989
0970bd9edc
use optimize blocks attr to record optimize block id
7 years ago
fengjiayi
9dccca963b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into some_small_fixes
7 years ago
fengjiayi
e71948f167
Refine random crop
...
1. Add a new attribute named 'startup_seed' to RandomCropOp. If the input
'Seed' is not initialized, the 'startup_seed' will be used to replace
it.
2. Refine CustomReader. Add a member variable 'scope_' to it. The
'scope_' will act as the global scope of preprocessing, making it
possible to save something across batches.
7 years ago
Yan Chunwei
bcea248b60
doc/inference api ( #11332 )
7 years ago
tensor-tang
28a0ef9522
remove usr local lib when dynamic load lib
7 years ago
tangwei12
620999c917
save checkpoint bug fix
7 years ago
tensor-tang
1f09ddf806
Merge remote-tracking branch 'ups/develop' into refine/mklml/dyload
7 years ago
tensor-tang
90780e22ce
Revert "MKLDNN layout: Support for sum operator"
7 years ago
chengduoZH
c99fca5f90
Add No Mutex
7 years ago
Xin Pan
32bfebfe38
disable the LODTensor warning for now
7 years ago
tangwei12
97648442cd
merge develop
7 years ago
chengduoZH
13de72388d
Fix broadcast
7 years ago
fengjiayi
732eef57f5
Register assign_value_op an empty grad_op
7 years ago
sneaxiy
d2c0e2f065
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into python_data_feeding
7 years ago
tangwei12
e589005229
merge
7 years ago
Xin Pan
0151e4eb3d
Merge pull request #11608 from panyx0718/doc
...
small thread-safety fix and doc improvements.
7 years ago
chengduoZH
28a86aebc3
Fix Parallel Exe (VarHandle's version)
7 years ago
sneaxiy
7ecb7b7dba
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into python_data_feeding
7 years ago
Yancey
c413c46915
Merge pull request #11603 from Yancey1989/rename_details_folder
...
move dist codes from operators/detail to operators/distributed
7 years ago
Yancey
9cc1eb434e
Merge pull request #11221 from Yancey1989/overlap_memcpy_with_dist
...
overlap rpc op memcpy in distributed training
7 years ago
Tao Luo
bfe5dc6312
Merge pull request #11607 from chengduoZH/fix_concat_warning
...
Fix concat compile warning
7 years ago
Yancey1989
712adc786f
polish dist cmake
7 years ago
Xin Pan
df31926fcf
small thread-safety fix and doc improvements.
7 years ago
chengduoZH
804c767107
fix concat warning
7 years ago
Yancey1989
e02cbf3538
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fix_pserver_sub_blocks
7 years ago
Yancey1989
1ef6cdb60e
move dist codes from operators/detail to operators/distributed
7 years ago
fengjiayi
6e1c48d1fd
Merge pull request #11576 from JiayiFeng/dev_refine_bilinear_interp
...
Add bilinear interp support for uint8
7 years ago
gongweibao
80f63642e6
Add comments to `set_lod`. ( #11588 )
7 years ago
tensor-tang
3e73a7a924
add usr local lib to dynamic search path
7 years ago
Tao Luo
8d5ab1f9cd
Merge pull request #11595 from luotao1/refine_code
...
add url of cuda9.0_cudnn7_avx_mkl library
7 years ago
Yan Chunwei
a009272ec7
inference/unify output buffer management ( #11569 )
7 years ago
Yu Yang
5f0c780af6
Merge pull request #11534 from typhoonzero/try_fix_logical_doc
...
try fix logical op doc
7 years ago
fengjiayi
47c02b5c32
Add unit tests
7 years ago
Qiao Longfei
05a92779f4
Merge pull request #11524 from jacquesqiao/add-keep_kids-for-executor
...
add keep_kids flag for executor
7 years ago
Luo Tao
5aac910b89
add url of cuda9.0_cudnn7_avx_mkl library
7 years ago
Yancey1989
7e6518e8ca
fix compile warning
7 years ago
Yancey1989
7d1b146939
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tangwei12
298588f8c7
Merge branch 'ckpt_m2' of github.com:seiriosPlus/Paddle into ckpt_m2
7 years ago
tangwei12
91eae9cc91
code style
7 years ago
gongweibao
19958eeb71
fix ( #11590 )
7 years ago
tensor-tang
f503f12925
enable dynamic load mklml lib on fluid
7 years ago
sneaxiy
28ff4bdd1e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into python_data_feeding
7 years ago
tangwei12
5a4a24cce1
Merge branch 'develop' into ckpt_m2
7 years ago
fengjiayi
12619fcf90
fix a compile error
7 years ago
fengjiayi
8567d0424d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_refine_bilinear_interp
7 years ago
tangwei12
db6126ca99
code style
7 years ago
Yancey1989
b54d1ba968
fix pserver sub-blocks
7 years ago
Qiao Longfei
4116b55163
Merge pull request #11579 from jacquesqiao/fix-concat-grad-kernel
...
fix concat grad kernel
7 years ago
Yan Chunwei
d020d7fd29
add beam search doc ( #11469 )
7 years ago
qiaolongfei
762160bd8c
fix concat grad kernel
7 years ago
tangwei12
8af4d4c7a0
code style
7 years ago
Tao Luo
5972990295
Merge pull request #11523 from mozga-intel/mozga-intel/Gausian_random_mkldnn_layout
...
MKLDNN layout: Gaussian random layout
7 years ago
tensor-tang
64045c29c2
Merge pull request #11102 from mozga-intel/mozga-intel/Sum_mkldnn_layout
...
MKLDNN layout: Support for sum operator
7 years ago
tangwei12
32fa832b4b
code style
7 years ago
qingqing01
9c90dc9728
Make the CUDA kernel of concat correct and fix unit tests. ( #11541 )
...
* Make the CUDA kernel of concat correct and fix unit tests.
7 years ago
fengjiayi
5d33481c37
Add bilinear interp support for uint8
7 years ago
Yu Yang
dad15b6e9a
Merge pull request #11563 from panyx0718/doc
...
Refine detection_map doc.
7 years ago
tangwei12
06f6c21303
bug fix
7 years ago
tangwei12
28482f81a8
bug fix
7 years ago
mozga-intel
7b9aa60198
MKLDNN gaussian_random tests
7 years ago
mozga-intel
b88cda84f4
MKLDNN sum unit-test
7 years ago
tangwei12
6abf07693a
checkpoint_notify_id rename
7 years ago
tangwei12
16ecead837
load op optimize
7 years ago
mozga-intel
701102283c
MKLDNN layouts: Gaussian random layout
7 years ago
tangwei12
49c2d0c5fb
bug fix
7 years ago
guosheng
f4bdfbc697
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search-dev
7 years ago
sneaxiy
5c9dcc4d7d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into argmin_argmax
7 years ago
tangwei12
8c0e1d5cba
unittest case fix
7 years ago
mozga-intel
6512be59ec
MKLDNN layout: the code-review changes
7 years ago
mozga-intel
96b4904d2f
MKLDNN layout: Support for sum operator
7 years ago
tangwei12
d93dc81c4e
add handle when checkpoint_notify_id = -1
7 years ago
Qiyang Min
a29cb4be2a
Fix decay bug ( #11520 )
...
* Add sub_blocks of lr_decay_op to pserver_prog after distribute_transpiler
* Remove unused logs and logics
* 1. Add ops to new block (considering the nested block condition)
2. Follow the original hierarchy of blocks
3. Change the function's name and remove debug lines
7 years ago
Xin Pan
efcbe27263
Refine detection_map doc.
7 years ago
tensor-tang
e8f5757d66
Merge pull request #11525 from tensor-tang/refine/mem
...
refine the initial cpu memory flag for mkldnn
7 years ago
tangwei12
7efd73ac53
code clean
7 years ago
tensor-tang
9a25f2895c
update the default cpu memory with MKLDNN
7 years ago
Qiao Longfei
69827f305b
Merge pull request #11527 from jacquesqiao/concat-grad-support-data-input
...
concat support data as input
7 years ago
tensor-tang
f1f8327c31
Merge remote-tracking branch 'ups/develop' into refine/mem
7 years ago
Tao Luo
49f23e63a9
Merge pull request #11124 from mozga-intel/mozga-intel/Activation_mkldnn_layout
...
MKLDNN layout: Support for activation operator
7 years ago
tangwei12
5250ca8c87
bug fix
7 years ago
tangwei12
459690ae3b
bug fix
7 years ago
Yang Yang(Tony)
d00a0436b1
Remove tape ( #11548 )
...
* Remove tape
* remove tape in cmake
* fix CI
7 years ago
tangwei12
620698e7e6
bug fix
7 years ago
gongweibao
4dda54aa5a
Fix unlikely ( #11537 )
7 years ago
tangwei12
ee64f577d4
load op add SelectedRows
7 years ago
tangwei12
ca27f78e29
load op add SelectedRows
7 years ago
tangwei12
a501766ab1
load op add SelectedRows
7 years ago
tangwei12
549f0aa0d3
load op add SelectedRows
7 years ago
tangwei12
af0a6a149f
checkpoint notify
7 years ago
tangwei12
ae12281d9b
checkpoint notify
7 years ago
tangwei12
30880844bb
merge develop
7 years ago
tangwei12
5553adf85d
move checkpoint message to variable message
7 years ago
tangwei12
8af8da4fe4
move checkpoint message to variable message
7 years ago
tangwei12
85215df087
move checkpoint message to variable message
7 years ago
mozga-intel
792d3b2406
MKLDNN layout: Support for activation operator
7 years ago
Yibing Liu
e710d2c6d8
Merge branch 'develop' of upstream into argsort_dev
7 years ago
tangwei12
527b86b7d0
bug fix
7 years ago
Yibing Liu
a523b6f49f
Add python api for argsort_op
7 years ago
Yan Chunwei
d734595978
Feature/pass manager ( #11440 )
7 years ago
tangwei12
bbb349fbf0
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
15532c74b1
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
54013a93b1
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
050b66e27c
add RequestCheckpointNotify in grpc
7 years ago
Yibing Liu
7ca511e0c6
Merge branch 'develop' of upstream into argsort_dev
7 years ago
tangwei12
74384b750e
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
36d17d11a4
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
a9ac2007f2
add RequestCheckpointNotify in grpc
7 years ago
tangwei12
925e2324b3
add RequestCheckpointNotify in grpc
7 years ago
yi.wu
acc7451853
update
7 years ago
yi.wu
f297e9ecbf
try fix logical op doc
7 years ago
Yibing Liu
92cfa2be3a
Avoid using dynamic array in cuda kernel
7 years ago
Yu Yang
16a0f746f8
Merge pull request #11383 from jacquesqiao/update-api-reference-1
...
update split_lod_tensor, create_array and array_length doc
7 years ago
Yu Yang
ce60bbf506
Merge pull request #11314 from typhoonzero/fix_api_reference_docs
...
Fix api reference docs
7 years ago
yuyang18
7a56705e4a
polish doc
7 years ago
gongweibao
962711dc3f
Add some paddleenforce. ( #11516 )
7 years ago
yuyang18
fda1a78844
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api_reference_docs
7 years ago
qiaolongfei
b77c886ed4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
82a4cf1960
update image_resize_short and shape doc
7 years ago
Yu Yang
c2e8f40dcb
Merge pull request #11492 from dzhwinter/doc/api1
...
[API Reference] fix some typo in layers
7 years ago
Yu Yang
1171c2c57d
Merge pull request #11457 from JiayiFeng/dev_add_doc
...
Add API reference
7 years ago
qiaolongfei
d1a8498b22
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
ad1ad738d8
add gpu support for concat
7 years ago
tensor-tang
a0c5fd83b2
enable setting initial memory from env
7 years ago
chengduo
ea03a2287e
Merge pull request #11513 from chengduoZH/refine_conv3d_doc
...
Fix conv3d/conv3d_trans/slice/mean_iou Doc
7 years ago
qiaolongfei
9c128fe656
concat support data as input
7 years ago
tensor-tang
a8c2ff316f
refine the initial cpu memory flag for mkldnn
7 years ago
qiaolongfei
bf3ff5b091
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
2b1ecdf55a
update comment
7 years ago
qiaolongfei
daa0fbd5f4
add keep_kids flag for executor
7 years ago
tensor-tang
609dccfb55
Merge pull request #11395 from tensor-tang/fix
...
remove mkldnn flag from gtest strdup for cpu
7 years ago
Kexin Zhao
417fcf4f43
Modify Pybind LoDTensor API according to length-based LoD ( #11106 )
...
* add lod_tensor util and modify pybind
* refine pybind LoDTensor API and modify LoDTensor and DataFeeder test
* fix test error
* fix detection map op test
* fix reorder_lod_tensor test
* fix seq_concat_op
* fix chunk eval op test
* fix target assign op
* fix warp ctc op
* address comments step 1: reverse reset_lod op
* step 2: modify op test
* add warning message
* remove has_valid_lod
* add back has_valid_lod
* address comments
* add exception catching trial
7 years ago
gongweibao
dd55cc1647
fix warning ( #11518 )
7 years ago
Yibing Liu
3438dfecce
Merge pull request #11491 from kuke/fix_docs
...
Fix some problems in api reference
7 years ago
Tao Luo
3a4b6cdaa0
Merge pull request #11488 from luotao1/softmax_doc
...
add doc of sequence_softmax and parallelDo
7 years ago
dongzhihong
f4a49cb07e
Merge remote-tracking branch 'origin/develop' into doc/api1
7 years ago
Yibing Liu
f3a777d8e2
Fix the display of reciprocal's formula
7 years ago
tangwei12
985026ce42
add checkpoint_notify in python
7 years ago
tangwei12
1c2e9bdd49
fix cmakelist
7 years ago
tensor-tang
0ddc5d8631
Merge pull request #11258 from tensor-tang/refine
...
Refine test and scope lock
7 years ago
Yibing Liu
bf5ce626c0
Merge branch 'develop' of upstream into fix_docs
7 years ago
fengjiayi
bcae8729ac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
tangwei12
860360d96d
bug fix
7 years ago
dzhwinter
4970414b1c
Merge remote-tracking branch 'origin/develop' into doc/api1
7 years ago
tangwei12
8d46d1ddf2
bug fix
7 years ago
tensor-tang
98b14a3a2e
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
tensor-tang
a427e76958
skip use_mkldnn if do not use it
7 years ago
tangwei12
f224948f31
bug fix
7 years ago
qiaolongfei
8f59d79d75
update doc for sigmoid_cross_entropy_with_logits
7 years ago
Luo Tao
8415e18b3f
Merge branch 'develop' into softmax_doc
7 years ago
qingqing01
cc1239ffc9
Update some doc about API reference. ( #11495 )
...
* Update some doc about layers' API.
* Fix format.
* Fix example bug in random_data_generator.
* Fix example bug in dropout.
* Follow comments and some small fix for some examples.
7 years ago
fengjiayi
29ddf6c5d8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
qiaolongfei
5b50307bf8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
qiaolongfei
6ace04f655
update
7 years ago
Tao Luo
5ea039b3c1
Merge pull request #11470 from typhoonzero/fix_unitests
...
Fix dist ut
7 years ago
chengduoZH
7b82353010
fix conv3d/conv3d_trans/slice/mean_iou doc
7 years ago
tangwei12
98c30c7cbe
bug fix
7 years ago
Yibing Liu
67dc5c7f8a
Polish the doc of nce layer
7 years ago
dzhwinter
6ac8383f28
"fix based comments"
7 years ago
tangwei12
fe76244f0e
bug fix
7 years ago
tensor-tang
6c1cf60950
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
weixing02
1021089cda
fix
7 years ago
weixing02
ee13b396f2
fix some errors
7 years ago
Yibing Liu
279ebdd0b2
Fix reciprocal op's doc
7 years ago
tangwei12
fb27c9a5a3
bug fix
7 years ago
tangwei12
1cb0ab36f0
bug fix
7 years ago
tangwei12
bb17604b36
bug fix
7 years ago
Luo Tao
1958654d6f
refine \odot in elementwise_mul
7 years ago
tensor-tang
0329ee7496
Merge pull request #11497 from tensor-tang/doc
...
Doc update
7 years ago
Yan Chunwei
5fd142c3fd
bugfix/trt engine op ( #11487 )
7 years ago
qiaolongfei
3a25ceebf2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
Yu Yang
745ea4dcf0
Merge pull request #11354 from reyoung/feature/polish_doc
...
Polish documentation
7 years ago
dzhwinter
1f38cbf79b
"fix some typo"
7 years ago
yi.wu
3380737cb7
update by comment
7 years ago
fengjiayi
d684b49cb0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
fengjiayi
d91060d300
fix errors
7 years ago
weixing02
8bd148dc00
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into hsigmoid_op
7 years ago
dzhwinter
45f18b0062
Merge remote-tracking branch 'origin/develop' into doc/api1
7 years ago
chengduo
8bc74f10e9
Merge pull request #11476 from chengduoZH/enable_more_type_for_SplitOpKernel
...
Enable more type for SplitOp and ConcatOp
7 years ago
Xin Pan
261d5f0232
Merge pull request #11409 from panyx0718/doc
...
Add API docs.
7 years ago
tensor-tang
98ab2b403e
polish doc: softshrink, assign, shuffle
7 years ago
tensor-tang
24fea628cc
polish doc: mean
7 years ago
dzhwinter
16a3d88a20
fix typo
7 years ago
tangwei12
b089b80988
update rpc to add checkpoint notify
7 years ago
tangwei12
12de20f5f7
add checkpoint_notify_op for trainer to notify pserver, update listen_and_serv_op
7 years ago
gongweibao
cdc06b011f
Fix dependency of `tape/variable.cc` ( #11472 )
7 years ago
Yan Chunwei
212651a5b8
bugfix/anakin-ci ( #11473 )
7 years ago
chengduoZH
ca743de2e0
enable more type for splitOp and ConcatOp
7 years ago
yi.wu
40c631e554
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_unitests
7 years ago
yi.wu
44925eb4c2
fix dist ut
7 years ago
yuyang18
055df47035
Polish code
7 years ago
tensor-tang
d0fb00d797
Merge remote-tracking branch 'ups/develop' into set_nthreads
7 years ago
yuyang18
cbc1b7f1ce
Polish documentation
7 years ago
tensor-tang
c453573286
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
fengjiayi
8a136d142f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_doc
7 years ago
Luo Tao
4c0bf774b9
fix compiler error in contrib/tape
7 years ago
tensor-tang
3e58df20df
initial with only 1 mkl/openblas threads for each pthreads
7 years ago
fengjiayi
980499faf1
fix errors
7 years ago
sneaxiy
fa002cd67c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into argmin_argmax
7 years ago
tensor-tang
9169b3b802
Merge pull request #10789 from Xreki/core_fix_openblas_threads
...
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
7 years ago
Qiyang Min
046bb5c8cb
Fix NCCLBcast hang up bug in Parallel Executor ( #11377 )
...
* 1. Create a buddy allocator in each place before NcclBcast of the variables
2. Check the memory usage of ALL gpus rather than the first one
* 1. Make NCCLGroupGuard guard only the ncclBcast part, which avoids ncclGroupEnd blocking exception throwing
2. NOTE the usage of NCCLGroupGuard
* Remove the memory usage check of gpus
* Fix code style
7 years ago
Qiao Longfei
cbaa24f597
Merge pull request #11450 from jacquesqiao/fix-prefetch-bug
...
outvar must be created in local scope for prefetch
7 years ago
tensor-tang
85c3bfc1f9
Merge remote-tracking branch 'ups/develop' into fix
7 years ago
Yancey1989
3d875b693f
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Xin Pan
d07d9535ca
Merge pull request #11454 from panyx0718/profile
...
Remove cuptiFinalize.
7 years ago
Yan Chunwei
bd2a537b05
feature/anakin ci ( #11330 )
7 years ago
whs
6fcdb240fa
Add mean IOU op. ( #10519 )
...
* Add mean_iou op.
* Add unit test for mean iou op.
* Add optional collections of confusion matrix and mean_iou.
* Fix cuda kernel.
* Refine code.
1. Merge computing in GPU to two kernels.
2. Use wrong array and correct array instead of confusion matrix.
* Add python api and fix cuda kernel.
* Fix comments.
* Small fix.
* Small fix.
7 years ago
qiaolongfei
490a07f5b6
add comment that out var of prefetch must be created in local scope
7 years ago
Xin Pan
d2afd21021
Remove cuptiFinalize.
...
In cupti samples, only cuptiFlush is used.
I can't find any places calling cuptiFinalize and
this API can error out as not_implemented in some
cuda installations.
7 years ago
Yang Yang(Tony)
f790b96d6f
make variable->Grad() a weak_ptr ( #11453 )
...
* fix #11416
* make sgd check tape has been backwarded_
* add error message
7 years ago
Yang Yang(Tony)
a59c3b73bd
change dynamic graph folder ( #11451 )
...
* change dynamic to tape
* update readme link
7 years ago
Yang Yang(Tony)
d827c6e87a
Dynamic Graph first prototype ( #11415 )
7 years ago
guosheng
49ca424d6e
Fix src_idx out of range in beam_search_op
7 years ago
qiaolongfei
a49ee22e31
fix a bug in prefetch
7 years ago
tangwei12
41701969a9
[wip] ckpt m2 develop
7 years ago
yi.wu
a83b792ada
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api_reference_docs
7 years ago
yi.wu
c58ba827bb
update
7 years ago
qiaolongfei
fd9b650d7f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-api-reference-1
7 years ago
yuyang18
674327a4b1
Polish several API
7 years ago
yuyang18
ce6394ed73
Polish example
7 years ago
Qiao Longfei
7c426be98c
Merge pull request #11342 from jacquesqiao/add-merge-splited-ids
...
Add merge_ids_op
7 years ago
qiaolongfei
e6f54d5aa2
update comment
7 years ago
qiaolongfei
2e48ab623e
add more detailed comment
7 years ago
tensor-tang
647c0eb7ec
Merge pull request #11099 from mozga-intel/mozga-intel/Conv_mkldnn_layout
...
MKLDNN layout: Support for convolution operator
7 years ago
qiaolongfei
7ebef493d5
add row_size for selected rows in DebugStringEx
7 years ago
Wu Yi
688e18508f
Merge pull request #11423 from typhoonzero/fix_develop_bugs
...
Fix nccl dist train bug
7 years ago
yuyang18
b9843abb61
Polish cumsum, DynamicRNN
7 years ago
qiaolongfei
cdd55dbcaa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
guosheng
35e32a8e15
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search
7 years ago
yi.wu
d76ebd7853
fix nccl dist train bug
7 years ago
yuyang18
54002c3bab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_doc
7 years ago
Qiao Longfei
94a741d743
Merge pull request #11413 from jacquesqiao/fix-build
...
fix build on mac
7 years ago
tensor-tang
489f581d49
Merge pull request #11098 from mozga-intel/mozga-intel/Batch_norm_mkldnn_layout
...
MKLDNN layout: Support for batch norm operator
7 years ago
qiaolongfei
82416f1844
fix concurrency_test build error on mac
7 years ago
Yibing Liu
28a0ac535a
Merge branch 'develop' of upstream into argsort_dev
7 years ago
Yibing Liu
98460c009e
Simplify the computation in cpu
7 years ago
qiaolongfei
9ebbfa6bbc
fix build on mac
7 years ago
guosheng
92a7766ecb
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into fix-beam_search
7 years ago
guosheng
592f84a4af
Complete the docs of beam_search_op, beam_search_decode_op and the python wrapper
7 years ago
qiaolongfei
d6c8d2675c
optimize code and comment
7 years ago
Yibing Liu
94e72ea6e7
Support more negative axes in argsort_op
7 years ago
Xin Pan
5d0bf8bc8f
Add API docs.
7 years ago
Yibing Liu
42645ff779
Compute target index on gpu
7 years ago
Yancey1989
f52d78d189
update by comment
7 years ago
qiaolongfei
e72eb0edec
small update
7 years ago
fengjiayi
88fa9c2ec7
Merge pull request #11267 from JiayiFeng/fix_reader_bug
...
Fix a multi-thread bug in readers
7 years ago
yuyang18
ff55d4c593
Polish documents
...
* less_than
* cumsum
* multiplex
* open_recordio_file
7 years ago
tensor-tang
6602db5b3e
throw warning if try to use mkldnn while not compiled
7 years ago
Yibing Liu
6ee22c4f71
Add gpu kernel for argsort op
7 years ago
tensor-tang
6a32f19865
fix unknown use_mkldnn
7 years ago
tensor-tang
6a494380e8
remove mkldnn flag from gtest strdup
7 years ago
yi.wu
7b54b30be5
follow comments
7 years ago
tensor-tang
056dd40475
add initial memory flag in MB for infer
7 years ago
yi.wu
1457be3e7b
fix_api_reference_docs
7 years ago
Yancey1989
6d752bafd8
use get_appropriate_dev to schedule rpc op
7 years ago
qiaolongfei
f031555cfb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
Qiao Longfei
431491a2a8
Merge pull request #11366 from jacquesqiao/refine-prefetch
...
Refine prefetch
7 years ago
Wu Yi
34865f2de3
Trainer send term signal ( #11220 )
...
* wip
* use executor.complete to end trainer
* fix build
* fix build with distribute off
* fix typo
* fix cmake typo
* fix build
7 years ago
Tao Luo
ca4d528b35
Merge pull request #11369 from luotao1/refine_doc
...
refine docs of elementwise_op etc.
7 years ago
Luo Tao
c4c787337a
update with comments
7 years ago
sneaxiy
1eeb11ef61
refine ZeroGradFunctor in activation_op.h
7 years ago
Yancey1989
4444e79e46
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
qiaolongfei
2b9ff39f5f
fix the default value prefetch_var_name_to_block_id
7 years ago
qingqing01
19fd071785
Make the normalization operator more general and fix bug in l2_normalize. ( #11348 )
...
* Add normalization operator.
1. Refine the raw norm_op and make it more general, so that it can normalize a Tensor along any axis.
2. There is a bug in l2_normalize API, which lacks sqrt after `reduce_sum`.
3. Use norm_op to refine the l2_normalize API.
4. Fix bug in test_normalization_wrapper.py.
7 years ago
Lei Wang
24391c76de
Build: add make before make install to catch up Makefile change.
7 years ago
whs
adc09087c1
Add slice op. ( #11052 )
...
* Add slice op.
* Remove using from header file and fix doc.
* Fix doc
* Small fix.
7 years ago
qiaolongfei
6dd3f3cf27
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei
16658f7b59
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
Xin Pan
1d198494d7
Merge pull request #11370 from panyx0718/dist
...
Make status update thread-safe
7 years ago
chengduo
183377f410
Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
...
Enable CPU on Parallel executor
7 years ago
qiaolongfei
83a577e8ce
fix build problem
7 years ago
qiaolongfei
fe65064827
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
Luo Tao
7bdb573d79
update with comments
7 years ago
qiaolongfei
506fc8d9e8
optimize code
7 years ago
gongweibao
d9de6b8621
Add brpc support. ( #11263 )
7 years ago
Xin Pan
1509ae3a53
Make status update thread-safe
...
The status is updated in the Process() thread
and can be checked in another HandleRequest() thread.
7 years ago
qiaolongfei
ea106c91e0
optimize comment and code
7 years ago
Luo Tao
7694199050
refine docs of elementwise_op etc.
7 years ago
qiaolongfei
7f4b9656a4
set status before Finish in prefetch process
7 years ago
dzhwinter
bfa3fd6f15
add inplace attribute to op_proto_maker ( #10665 )
...
* "add inplace attribute"
* "register inplace attribute"
* "change se-next model for memory-reuse"
* "fix typo"
* repick
* fix merge conflict
* "fix stupid error"
7 years ago
qiaolongfei
5aba10b585
set the thread pool of prefetch to 1 to fix a bug
7 years ago
gongweibao
9087c6687f
polish ( #11363 )
7 years ago
qiaolongfei
8fb78f6c07
fix grpc_server_test
7 years ago
chengduoZH
173d72b481
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
tensor-tang
b3fd9da60e
Merge pull request #11101 from mozga-intel/mozga-intel/Pool_mkldnn_layout
...
MKLDNN layout: Support for pool operator
7 years ago
chengduoZH
aadaadf735
replace use_event with use_cuda: use_event means the program is running with CUDA, so use_cuda may be more intuitive.
7 years ago
qiaolongfei
4e36c0ecab
update prefetch logic in grpc_server
7 years ago
gongweibao
627d7a64f8
Clean `sendop` `recv` operator. ( #11309 )
7 years ago
chengduo
fa29ef0b0d
Merge pull request #11277 from chengduoZH/check_ssa_graph
...
Check SSA Graph
7 years ago
chengduoZH
961fbce8e2
follow comments
7 years ago
Yu Yang
3fd3e500cc
Merge pull request #11346 from reyoung/feature/add_lock_to_device_ctx
...
Add lock to record_event.
7 years ago
yuyang18
2955ff5887
Polish documentation
...
* row_conv
* uniform_random
* layer_norm
* create_parameter
* hard_shrink
* ssd_loss
7 years ago
qiaolongfei
0d3d4ae775
refine prefetch logic
7 years ago
chengduoZH
7b723839ef
Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
yi.wu
ff4efb84d9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_api_reference_docs
7 years ago
sneaxiy
831909ce69
Merge pull request #11313 from sneaxiy/argmin_argmax
...
Add argmin and argmax op
7 years ago
chengduoZH
d24e046c1e
fix allReduce bug
7 years ago
yuyang18
a1254a86ba
Add lock to record_event.
7 years ago
Tao Luo
69b5a62c65
Merge pull request #11319 from luotao1/mkldnn
...
add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
yuyang18
9b43edeae0
Polish arg_min_max_op
...
* Remove unused arg_max/min_op.h
* Remove reference parameter. Use pointer instead.
* undef macro
* Always set OutT as int64_t.
7 years ago
chengduoZH
a57e8a4338
add cpu test
7 years ago
Yu Yang
9328c3cf7b
Merge pull request #11308 from reyoung/feature/polish_api_ref
...
Simplify API Reference Documentation
7 years ago
qiaolongfei
0485405b3d
add more debug string
7 years ago
Luo Tao
045589fae4
fix compiler error in high-level api
7 years ago
Luo Tao
79d555b9f2
Merge branch 'develop' into mkldnn
7 years ago
gongweibao
062d5a56b4
Add comments to a singleton. ( #11333 )
7 years ago
mozga-intel
7d5643562f
MKLDNN layout: Support for batch norm operator
7 years ago
mozga-intel
9908d3cfbc
MKLDNN layout: Support for convolution operator
7 years ago
mozga-intel
36031cb50f
MKLDNN layout: Support for pool operator
7 years ago
qiaolongfei
509cb0bc76
add unit test, pass the unit test
7 years ago
qiaolongfei
7cebec4b7e
init merge_ids_op
7 years ago
chengduoZH
1e731f5964
small fix
7 years ago
chengduoZH
495368c243
ADD CPU_NUM
7 years ago
chengduoZH
27073c284d
nccl_all_reduce_op_handle => all_reduce_op_handle
7 years ago
chengduoZH
2d94697a82
code refine
7 years ago
chengduoZH
5a3c8bf813
fix in c++ side
7 years ago
Wu Yi
7bcc98089a
Merge pull request #11321 from Yancey1989/polish_sparse_update
...
polish sparse update logic
7 years ago
guochaorong
eced973091
Merge pull request #11317 from guochaorong/fix_bad_code
...
Fix bad code in c plus and python
7 years ago
guochaorong
310598f99b
Update device_tracer.cc
7 years ago
fengjiayi
fae3d8d2dc
Merge pull request #11311 from JiayiFeng/a_small_fix
...
fix a small compile error on Mac
7 years ago
sneaxiy
6d32e96096
remove redundant comments
7 years ago
Yancey1989
56964946d4
polish sparse update logic
7 years ago
Luo Tao
c6d230e03e
add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
guochaorong
04b8d3d03c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into paddle_fix
7 years ago
guochaorong
0fec9469f9
fix some bugs introduced by unfreed memory
7 years ago
yi.wu
efcff3d9e5
polish api ref docs
7 years ago
yuyang18
8c9041f486
Refine LinearCRF
7 years ago
sneaxiy
568c4e5ec4
recommit using account sneaxiy
7 years ago
Yan Chunwei
145aaa4b49
loose threshold of TRT for CI in different model ( #11305 )
7 years ago
fengjiayi
d745840a6e
fix a small compile error on Mac
7 years ago
yi.wu
5be454bf33
polish docs
7 years ago
yuyang18
0d29e65924
Add resize_bilinear
7 years ago
guosheng
5e20a8ef93
Make python unit test of beam_search_op and beam_search_decode_op run correctly
7 years ago
yuyang18
b000e0de5d
Simplify API Reference Documentation
7 years ago
chengduoZH
0c851cab22
add SSA graph checker
7 years ago
fengjiayi
b587a7f66e
Merge pull request #11293 from JiayiFeng/update_crop_op
...
Update crop op
7 years ago
Xin Pan
259e63d4a1
Merge pull request #11248 from panyx0718/dist
...
Fix sparse vars usage for dist train
7 years ago
Yancey1989
d5a88b9340
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
Xin Pan
2d7c836d32
Merge pull request #11295 from panyx0718/doc
...
Refine API doc string
7 years ago
Yu Yang
8deff48db0
Merge pull request #11081 from reyoung/feature/python_doc
...
Add document to random crop operator
7 years ago
fengjiayi
c7bbfb33ad
Fix a GPU bug
7 years ago
Yancey1989
1239fce771
polish sparse update code
7 years ago
chengduoZH
1076e85135
refine logic
7 years ago
Yancey
0aa9546eed
fix dist train error ( #11281 )
...
* fix dist train error
* update by comment
7 years ago
Xin Pan
e80c6b3c24
Refine API doc string
7 years ago
tensor-tang
b8d315fb69
make scope thread safe
7 years ago
tensor-tang
80e882a398
Merge pull request #11247 from tensor-tang/infer_api
...
Infer multi-threads API Demo and UT
7 years ago
cuichaowen
9141bee1e7
add Anakin api for paddle ( #11228 )
7 years ago
fengjiayi
24649a780d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_crop_op
7 years ago
dzhwinter
d48172f22a
split reduce op into multiple libraries, accelerate the compiling ( #11029 )
...
* "split into multiple .ccl"
* "refine file structure"
* "refine files"
* "remove the cmakelist"
* "fix typo"
* "fix typo"
* fix ci
7 years ago
fengjiayi
5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
...
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
tensor-tang
bfd42683ca
Merge remote-tracking branch 'ups/develop' into refine
7 years ago
fengjiayi
9c61409a18
Make crop op support taking offsets as one of its inputs
7 years ago
dzhwinter
f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. ( #10646 )
...
* "init benchmark ops"
* "untrack outputs"
* "delete some unused code"
* "benchmark"
* "fix ci"
* "fix op test"
* "fix uint16 missing"
* "fix ci"
* "follow comments"
* "fix ci"
* "follow comments"
* "conflicts. merge develop branch"
* repick
* "merge develop branch"
7 years ago
fengjiayi
9ce0885067
Merge branch 'fix_bug_of_ExecutionContext' into update_crop_op
7 years ago
fengjiayi
dc8e0b494d
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
guosheng
a281e1016e
Make cc_test of beam_search_op and beam_search_decode_op run correctly
7 years ago
tensor-tang
e030741df9
fix gpu fraction
7 years ago
fengjiayi
4f46a98fa9
stash
7 years ago
tensor-tang
746a62ebe6
add gpu tests
7 years ago
tensor-tang
35e820dc2b
Merge remote-tracking branch 'ups/develop' into infer_api
7 years ago
mozga-intel
3ff9ba0e6b
Mkldnn layout ( #11040 )
...
* Add MKLDNN layout support in Paddle
Add an MKLDNN layout to Paddle so that MKLDNN-friendly memory layouts
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded in all MKLDNN op kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitive,
which hurt performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.
* Add MKLDNN layout support in activation OP
* Don't populate layout from input to output when kMKLDNN in
* Refine pool mkldnn op kernel
* MKLDNN layout
* Remove the inheritance from tensor file
* MKLDNN layout: refactoring
* Remove additional #define to register new operator
* Prepare mkldnn tests to work with layout
7 years ago
chengduoZH
8291b916d6
replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH
9ac785be39
check graph's validation
7 years ago
fengjiayi
a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
...
fix a compile error on Mac
7 years ago
Yu Yang
03073df182
Merge pull request #11237 from chengduoZH/add_fuse_var_op_handle
...
[Feature] Add fuse vars op handle
7 years ago
Tao Luo
6d80dd5a50
Merge pull request #11222 from luotao1/trt
...
rewrite unittest of trt_activation_op
7 years ago
fengjiayi
499dbe0536
fix a multi-thread bug in readers
7 years ago
fengjiayi
7344210070
Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi
2f5e310167
fix a compile error
7 years ago
yuyang18
8149b0a9aa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fuse_var_op_handle
7 years ago
Yancey1989
23433def4b
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
fengjiayi
5d45793936
hot fix
7 years ago
tensor-tang
aad8f4d1a7
enable image_classification multi-threads ut
7 years ago
tensor-tang
abd9ccf3f9
Merge remote-tracking branch 'ups/develop' into infer_api
7 years ago
yuyang18
59d75bda74
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/python_doc
7 years ago
Luo Tao
f6fb51a164
add test_mode in trt/activation_op
7 years ago
tensor-tang
db1747a500
enable word2vec multi-threads ut
7 years ago
tensor-tang
0693b4148d
Merge remote-tracking branch 'ups/develop' into refine
7 years ago
fengjiayi
65a94be1a1
Merge pull request #11223 from JiayiFeng/dev_reverse_op
...
Add reverse op
7 years ago
Luo Tao
c73977af03
Merge branch 'develop' into trt
7 years ago
Tao Luo
f40fc24974
Merge pull request #11260 from luotao1/gtk
...
install libgtk2.0-dev in latest images
7 years ago
Yu Yang
ff9b1a0f95
Merge pull request #11234 from reyoung/feature/refine_code
...
SSA Graph Builder Factory
7 years ago
Yu Yang
08823146ec
Merge pull request #11232 from reyoung/feature/extract_tensor
...
Extract method from tensor_impl.h to tensor.cc
7 years ago
Luo Tao
08220d39e7
install libgtk2.0-dev in latest images
7 years ago
tensor-tang
f326b0117e
refine scope lock
7 years ago
Yancey1989
15913d92c5
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_memcpy_with_dist
7 years ago
tensor-tang
9cf1f351d2
refine nlp test
7 years ago
gongweibao
2028a8ef6d
Add rpc_client interface. ( #11154 )
7 years ago
Xin Pan
ca2d6d3c66
Merge pull request #11224 from dzhwinter/fix/cudnn
...
fix cudnn version issue
7 years ago
tensor-tang
3a294042c8
Merge pull request #11233 from tensor-tang/multithreads
...
Fix abort issue in cpu multi-threads
7 years ago
Yan Chunwei
4f95bc9463
feature/trt engine op test ( #11182 )
7 years ago
Tao Luo
fdf2d6fd9d
Merge pull request #11242 from luotao1/opencv
...
add python-opencv in paddlepaddle/paddle:lastest images
7 years ago
qingqing01
e0a32074bd
Fix PADDLE_ASSERT. ( #10981 )
...
* Enable assertions in CUDA.
* Fix PADDLE_ASSERT.
7 years ago
tensor-tang
9dd993950a
fix free
7 years ago
Xin Pan
e0895e49dc
remove some seemingly unused code.
7 years ago
tensor-tang
4b7b17a84f
fix conflicts
...
Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
yuyang18
d9af153232
SSA Graph Builder Factory
...
* Use Builder Chain to decorate new builders. It is easy to extend
builders.
* Make graphviz path as a build strategy, not a FLAGS.
7 years ago
yuyang18
b6c8701e45
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/extract_tensor
7 years ago
chengduoZH
a584bc86dd
add fuse var op handle
7 years ago