sneaxiy
ca84c2ca8f
merge develop
...
test=develop
6 years ago
sneaxiy
e240ba2918
implement backward
...
test=develop
6 years ago
sneaxiy
06f8aa5b97
remove while_op support temporarily
...
test=develop
6 years ago
Yu Yang
81520a24cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
6 years ago
Yu Yang
9bd70a1e04
Change tensor uses proto::VarType::type
...
test=develop
6 years ago
Yu Yang
8175983ef9
Merge pull request #14814 from reyoung/feature/gprof
...
Add gperftools supports for PE
6 years ago
Yu Yang
eeca721a99
Merge pull request #14842 from reyoung/feature/refine_eigen_tensor
...
Fix Eigen macro when using GPU
6 years ago
dengkaipeng
7ec3264b51
fix API spec. test=develop
6 years ago
tensor-tang
d538513fce
fix the compile error on mac
6 years ago
tensor-tang
28eb7d840c
test all impls and all inplace cases
6 years ago
Yihua Xu
acc6ae49b1
Fix the issue to run on AVX2 and AVX512F machines ( #14851 )
...
test=develop
6 years ago
heqiaozhi
10ed9e0a6e
download & run & instance
6 years ago
nhzlx
8c8019e388
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
...
test=develop
6 years ago
Michal Gallus
92daace55c
MKL-DNN Concat: Fix segfault related to referencing deleter memory primitive
...
test=develop
6 years ago
Yu Yang
9f8d86858e
Revert data_type
...
test=develop
6 years ago
tensor-tang
d4cab7d948
use jitkernel in one file
6 years ago
tensor-tang
adc7ba2edd
Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang
900c789a35
use jitcode and use vmul
6 years ago
Yu Yang
5e60906996
Fix compile error
...
test=develop
6 years ago
Qiao Longfei
1870262ba9
pserver should crash early whe has problem
...
test=develop
6 years ago
dengkaipeng
a81fabd327
fix doc errors. test=develop
6 years ago
dengkaipeng
cf06e50f1d
add doc for adaptive pool. test=develop
6 years ago
dengkaipeng
266c6856c9
add adaptive pool 2d & 3d. test=develop
6 years ago
dengkaipeng
eab4745965
add adaptive mode for pool.
6 years ago
Yu Yang
7604b1ad51
Fix Eigen macro when using GPU
...
The macro should be defined by compiler rather than by source.
test=develop
6 years ago
Qiao Longfei
1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
...
split selected rows op should always init output selected rows
6 years ago
nhzlx
644c13a387
fix compile error
6 years ago
sneaxiy
7923042365
merge develop
...
test=develop
6 years ago
Qiao Longfei
0a7c7e97af
test zero output of split_selected_rows_op
...
test=develop
6 years ago
Tao Luo
3243b45bb2
Merge pull request #14651 from luotao1/has_attr
...
add Opdesc's HasProtoAttr
6 years ago
Xin Pan
1735022a1b
fix clang
...
test=develop
6 years ago
heqiaozhi
016a068775
stop server
6 years ago
sneaxiy
8760d23c7d
featue/py_func
6 years ago
zhang wenhui
c4c5f0b8ca
Merge pull request #14771 from frankwhzhang/bpr
...
add bpr_loss operator
6 years ago
Tao Luo
067ed70f2d
add HasProtoAttr function in op_desc.h, clean node.h
...
test=develop
6 years ago
heqiaozhi
8e3fe2d735
add skip op
6 years ago
Yu Yang
b22d638d8f
Speed up SizeOfType
...
test=develop
6 years ago
Xin Pan
4a716a70c6
Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative
...
Imperative
6 years ago
nhzlx
a5bfed3776
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_benchmark_for_trt
...
test=develop
6 years ago
nhzlx
afc51e6f82
add benchmark for trt
6 years ago
Zhaolong Xing
bc6d0a3427
Merge pull request #14762 from NHZlX/fix_bug_of_trt_pool
...
fix bug of trt pool2d converter
6 years ago
tensor-tang
53709e7e61
refine names
6 years ago
superjomn
edd1f5a92b
fix visualizer
...
test=develop
6 years ago
Brian Liu
9623b45f40
Remove unnecessary MKLDNN reorder ( #14799 )
...
When data flow from a MKLDNN OP kernel to a non-MKLDNN OP kernel,
data layout transform (via MKLDNN reorder) will occur even when
those two OP kernels share same layout. Add code to remove this
unnecessary reorder.
test=develop
6 years ago
frankwhzhang
90c7f9870e
fix 'name', test=develop
6 years ago
Qiao Longfei
abf140289f
split selected rows op should always init output selected rows
...
test=develop
6 years ago
nhzlx
019e8bbed2
fix comments test=develop
6 years ago
frankwhzhang
271c480822
update API, test=develop
6 years ago
frankwhzhang
c9a653820b
fix label_pos ,add test_layers.py, test=develop
6 years ago
Tao Luo
e99597d35c
Merge branch 'develop' into luotao1-has_attr
6 years ago
sneaxiy
66182abda6
add cuda cudnn version check
...
test=develop
6 years ago
Yu Yang
f0c0bf328d
Add gperftools supports for PE
6 years ago
frankwhzhang
a672b291e5
fix code style, test=develop
6 years ago
frankwhzhang
ea95f9c335
fix style bug, test=develop
6 years ago
frankwhzhang
68c2025844
fix nn.py&API.spec, test=develop
6 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
6 years ago
bingyanghuang
943ad4781f
One possible solution to add flexibility for mkldnn placement pass ( #14768 )
...
* Choose to turn on use_mkldnn attribute v1
* Fix mkldnn_op empty bug
* format change test=develop
* fix ci test=develop
* fix ci test and add test in dam test=develop
* add example to dam compare test test=develop
* review changes test=develop
6 years ago
baojun-nervana
fddbd87c0a
Rename argument
...
test=develop
6 years ago
baojun-nervana
22ac2133e4
Rename class
...
test=develop
6 years ago
baojun-nervana
bfde5e10ce
Move ngraph compile control to cmake
...
test=develop
6 years ago
sneaxiy
2c6159a151
fix unittest
...
fix cmake
test=develop
6 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
6 years ago
gongweibao
f1fb64b17f
Add reduce sparse tensor feature. ( #14757 )
6 years ago
sneaxiy
eb8252466b
polish code
...
add unittest model containing while_op
remove unnecessary codes
test=develop
6 years ago
Tao Luo
c83d5b7a16
Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
Implement the fusion of convolution 3D and bias for mkldnn
6 years ago
Zeng Jinle
add98c9e7d
Merge pull request #14745 from sneaxiy/fix_eigen_deallocate
...
Fix eigen deallocate bug
6 years ago
frankwhzhang
f4cc5881b0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang
97de98cd0a
update bpr_loss op code, test=develop
6 years ago
Xin Pan
6c80bb3ce9
Merge pull request #14786 from PaddlePaddle/revert-14782-revert-14398-imperative
...
Revert "Revert "Imperative""
6 years ago
heqiaozhi
575ae7c6c3
refine pslib inferface & fix some bugs
6 years ago
Yihua Xu
3821fc3950
Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
test=develop
6 years ago
Yihua Xu
240d974ac5
Clean Code
...
test=develop
6 years ago
Tao Luo
54fcafb5f6
Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
...
Implement conv3d with mkldnn library
6 years ago
Xin Pan
2538ef64f1
Revert "Revert "Imperative""
6 years ago
guru4elephant
b82a44ea85
Merge pull request #14778 from wangguibao/async_executor_bugfix
...
Async executor bugfix: Tensor changed to LoDTensor
6 years ago
sneaxiy
8095fb5e68
fix code bug in CPU compilation
...
test=develop
6 years ago
sneaxiy
387bac46b5
refine code
...
test=develop
6 years ago
Tao Luo
cf66133857
Merge pull request #14734 from luotao1/memory_load
...
support loading from memory
6 years ago
Yihua Xu
155328a488
Clean Code
...
test=develop
6 years ago
Xin Pan
6217f42ab7
Revert "Imperative"
6 years ago
Tao Luo
743cb840f1
update with comments
...
test=develop
6 years ago
tensor-tang
ce674b685f
add readme doc and complete TODOs
6 years ago
wangguibao
5a2cd4505b
AsyncExecutor bugfix: Tensor to LoDTensor
...
test=develop
6 years ago
wangguibao
5f98d80039
AsyncExecutor bugfix: Tensor change to LoDTensor
6 years ago
flame
f6a877bc57
add tool to visualize inference model ( #14621 )
6 years ago
frankwhzhang
93551a3440
update API.spec
6 years ago
Tao Luo
42359e88a4
clean code
...
test=develop
6 years ago
Tao Luo
923b18877e
Merge branch 'develop' into memory_load
...
test=develop
6 years ago
Tao Luo
405b2486db
support loading from memory
...
test=develop
6 years ago
Xin Pan
b52f5d2870
Merge pull request #14398 from panyx0718/imperative
...
Imperative
6 years ago
frankwhzhang
272f3d3111
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang
570d89ec84
add bpr_loss operator , test=develop
6 years ago
qingqing01
549f165b59
Speed conv_fusion_op for identity activation. ( #14744 )
...
* Refine conv_fusion_op for identity activation.
* Fix unit testing.
test=develop
6 years ago
tensor-tang
fab0ee8757
Merge remote-tracking branch 'ups/develop' into refine/jitkernel
6 years ago
Houjiang Chen
c6b39a0099
Merge pull request #14714 from NHZlX/add_prelu_gpu
...
add prelu cuda kernel for inference.
6 years ago
Jiabin Yang
8a111ac64d
Merge pull request #14763 from junjun315/fix-mac-build-check
...
fix the bug for mac build. python -c error. test=develop
6 years ago
tensor-tang
dbe451976b
Merge pull request #14753 from tensor-tang/refine/namespace
...
remove jit namespace
6 years ago
sneaxiy
0f96c2e80f
fix thread-safety bug
...
test=develop
6 years ago
lujun
5026741b82
fix the bug for mac build. python -c error. test=develop
6 years ago
nhzlx
722b0a805f
fix bug of trt pool
...
test=develop
6 years ago
Jiabin Yang
d9bb55a1f9
Merge pull request #14756 from JiabinYang/fix_hs_op
...
fix bug in dist train on hs, test=develop
6 years ago
Yihua Xu
65dbc7cca4
Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
6 years ago
JiabinYang
e05e1d7d88
fix bug in dist train on hs, test=develop
6 years ago
tensor-tang
a1eb21e704
refine names
6 years ago
tensor-tang
b523787f9f
remove jit namespace
...
test=develop
6 years ago
tensor-tang
191948c933
enable jitcode
6 years ago
tensor-tang
4a93db9288
remove jit namespace
...
test=develop
6 years ago
Hongyu Liu
8cda28f345
Merge pull request #14733 from phlrain/add_cudnn_5_support
...
Add cudnn 5 support
6 years ago
heqiaozhi
d3ca359e44
config init & adapt to interface
6 years ago
Xin Pan
73b4d1aa72
Merge pull request #14742 from panyx0718/infer2
...
support customized kernel selection
6 years ago
Jiabin Yang
21c0f8749e
Merge pull request #14728 from JiabinYang/optimize_hs_op
...
Optimize hs op
6 years ago
tensor-tang
45bfa70cb8
complete vmul jit kernel
6 years ago
tensor-tang
77236e33fc
init jitkernel
6 years ago
Xin Pan
82d68281c0
follow comments
...
test=develop
6 years ago
sneaxiy
900765224c
fix deallocate bug
...
test=develop
6 years ago
liuhongyu
b408fc4dac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
6 years ago
liuhongyu
8b2898e201
fix bug of formate; test=develop
6 years ago
Xin Pan
41c28d54c6
allow customize kernel selection
...
test=develop
6 years ago
Xin Pan
439af8d50a
Merge pull request #14717 from panyx0718/infer
...
fix a const_cast and avoid using stale program.
6 years ago
lujun
104a332a28
Merge pull request #14722 from junjun315/up-12-python-install
...
fix mac ci test step, test=develop
7 years ago
liuhongyu
773dc73fbf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu
8daf67f90f
fix bugs; test=develop
7 years ago
chengduo
04539d4c5d
Fix clip.py ( #14718 )
...
* expose square
test=develop
* fix activation
test=develop
* Add square API
test=develop
* add necessary op
* code refine
* fix API.spec
test=develop
* fix unit test
test=develop
* add unit test sparse_grad_clip
test=develop
* fix API.spec
test=develop
* remove mac test for test_gradient_clip
test=develop
* remove selectedrows_mul_tensor
test=develop
7 years ago
sneaxiy
d0c8b9b9b3
remove timeout unittest
...
test=develop
7 years ago
heqiaozhi
419506f510
refine for compile pslib.so
7 years ago
Xin Pan
052cc5f538
Merge pull request #14725 from ZongwuYang/my-cool-stuff
...
My cool stuff
7 years ago
Michal Gallus
6fdbb365ce
Include MKL-DNN header to concat op only when flag is set
...
test=develop
7 years ago
Michal Gallus
f2a880421e
Fix style @ concat integration and tests
...
test=develop
7 years ago
Michal Gallus
738069e491
Refactor MKL-DNN Concat
...
test=develop
7 years ago
Michal Gallus
208f912512
Implement MKL-DNN Concat
...
test=develop
7 years ago
Wu Yi
29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% ( #14661 )
...
* wip multi process multi gpu dist training
* workable for p2p
* update test=develop
* change back env name test=develop
* fix alloc init
* fix cpu build test=devlop
* fix mac tests test=develop
* refine code
* refine test=develop
7 years ago
liuhongyu
e80402fd0e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu
968dd3c078
add cudnn 5 support; test=develop
7 years ago
sneaxiy
e694d0c2e4
fix while_op eager deletion bug
...
add unittest
test=develop
7 years ago
Xin Pan
461ca35be1
Merge pull request #14590 from panyx0718/fix4
...
enable API check for readers
7 years ago
gongweibao
50a698525d
Fix log level ( #14692 )
7 years ago
JiabinYang
8c75705984
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
...
, test=develop
7 years ago
Xin Pan
dc458b1482
Merge pull request #14713 from panyx0718/api
...
add more files to protected file list
7 years ago
JiabinYang
b387a19410
optimize op with blas
7 years ago
Zeng Jinle
ff4237309a
Merge pull request #14720 from sneaxiy/fix_seq_mask_op_infershape
...
Fix sequence_mask_op InferShape
7 years ago
heqiaozhi
2301abc481
cc libaray add pslib
7 years ago
ZongwuYang
1560eb4a6d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my-cool-stuff
7 years ago
ZongwuYang
deb04809bd
test=develop
...
Fix the bug that profiler cannot trace the nccl allreduce operator
7 years ago
Xin Pan
da4e0bf1a1
add 2 more files
...
test=develop
7 years ago
Xin Pan
7c5289f68e
Merge pull request #14719 from PaddlePaddle/revert-14666-feature/estiminate_flops
...
Revert "Add EstiminateFlops"
7 years ago
lujun
9da5954a21
fix mac ci test step, test=develop
7 years ago
Kaipeng Deng
934f13a70a
Merge pull request #14371 from heavengate/yolo_loss
...
Add YOLOv3 loss operator for YOLOv3 model
7 years ago
sneaxiy
35a2578426
fix bug
...
test=develop
7 years ago
sneaxiy
65867d8989
test=develop
7 years ago
Jiabin Yang
6dcc6378b7
Merge pull request #14665 from JiabinYang/ci/add_import_check
...
add mac ci check on import
7 years ago
zhang wenhui
abbe382e1e
Revert "Add EstiminateFlops"
7 years ago
Xin Pan
0591ba96ec
fix hack
...
test=develop
7 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
7 years ago
sneaxiy
c47c451a00
fix bug
7 years ago
heqiaozhi
a77fa67bbd
async_thread_trainer & libmct & pslib.cmake
7 years ago
Tao Luo
3437e17713
Merge branch 'has_attr' of https://github.com/luotao1/Paddle into luotao1-has_attr
7 years ago
nhzlx
e7abe6b654
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
...
test=develop
7 years ago
nhzlx
f75815b78c
add prelu gpu inference
7 years ago
Xin Pan
bcf36d8401
add more files to protected file list
...
test=develop
7 years ago
Tao Luo
0e3048db43
Merge pull request #14659 from luotao1/update_pass
...
update is_test_pass and mkldnn_placement_pass
7 years ago
Xin Pan
7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
...
Implementing ngraph engine
7 years ago
Yihua Xu
82eefceabe
Add the profile_mkldnn flag for profile function(test=develop)
7 years ago
Xin Pan
35e6b5e16a
polish
...
test=develop
7 years ago
Yihua Xu
ea00270fe8
Remove the dims checking when the dim is 3 (test=develop)
7 years ago
Xin Pan
b80fe8264a
polish
...
test=develop
7 years ago
Yihua Xu
64e261c6cd
Implement the fusion of convolution and bias for mkldnn
...
(test=develop)
7 years ago
Tao Luo
8d6984eb9b
change OpHasAttr to RuntimeHasAttr, add some comments
...
test=develop
7 years ago
jerrywgz
96dc3d8326
Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
...
add ignore index for sigmoid cross entropy with logits op, test=develop
7 years ago
Tao Luo
a6ac42669c
Merge branch 'develop' into update_pass
7 years ago
Yihua Xu
669191c9cc
Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu
4f71a6ee2c
Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
...
Add cudnn lstm
7 years ago
Yibing Liu
c7382df80f
Print assert failure id in lookup_table_op ( #14698 )
7 years ago
Yu Yang
0f0e197914
Merge pull request #14666 from reyoung/feature/estiminate_flops
...
Add EstiminateFlops
7 years ago
Xin Pan
93c16d9628
polish the autograd (need to verify correctness)
...
test=develop
7 years ago
Xin Pan
c3236f82d6
polish
7 years ago
Xin Pan
e5d64fd4d1
initial imperative
...
test=develop
7 years ago
Xin Pan
4d0df1fea7
add fields for autograd
...
test=develop
7 years ago
Xin Pan
8138391631
add OpBase and unify with VarBase
...
test=develop
7 years ago
Xin Pan
f6f0692451
clean up
...
test=develop
7 years ago
Xin Pan
0318c95149
rebase develop
7 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
7 years ago
Xin Pan
b1f6fda5e5
run forward
7 years ago
Xin Pan
a6d23083f0
some tracing
...
test=develop
7 years ago
Xin Pan
dac92e560c
initial commit
7 years ago
barrierye
08233beed7
add the comment for CheckFile function. test=develop
7 years ago
barrierye
d62a3dd72d
add the comment for CheckFile function. test=develop
7 years ago
barrierye
d89108766c
update CheckFile function in data_feed for ignore the space at the end of each line of data(for example, it may be added '\t' character to the end of the reduce task output when processes data by hadoop, which does not affect the correctness of the data). test=develop
7 years ago
phlrain
9f7eae861d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
Tao Luo
61ae88b760
Revert "Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled"
7 years ago
dongdaxiang
52a0be7bb4
add mct into CMakeLists.txt
7 years ago
phlrain
25df78eaf3
fix api spec; test=develop
7 years ago
phlrain
4c256ca6be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
phlrain
b65722d3cf
fix uni test; test=develop
7 years ago
Tao Luo
99177b424b
Merge pull request #14693 from kbinias/fix-for-accuracy-problem-for-inlplace-operators
...
Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled
7 years ago
heqiaozhi
3c239cd640
pslib
7 years ago
tangwei12
618f7620e2
add enforce for auc ( #14687 )
...
* add enforce for AUC, test=develop
7 years ago
Krzysztof Binias
bc7db6cec9
Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled
...
test=develop
7 years ago
phlrain
2770ea1a73
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
chengduozh
3f4aca618f
code refine
...
test=develop
7 years ago
chengduozh
af8c2cec13
fix operator.cmake
...
test=develop
7 years ago
chengduozh
679d8fc6fe
rename op name
...
test=develop
7 years ago
chengduozh
1013d6d05d
Merge branch 'add_cudnn_lstm' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
jerrywgz
3df0538940
replace -100 to kIgnoreIndex
7 years ago
Wang Guibao
41e19eb431
AsyncExecutor ( #14627 )
...
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deaded lock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side
Google naming conventions
Rename MultiExecutor to AsyncExecutor
pybind with async_executor
Naming convention
remove some flags and unused code
add refactored file of async_executor and data_feed
clear async executor interface and add data feed factory
split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
refine async_executor_refactor.cc
add some files about datafeed
Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
add MultiSlotDataFeed
Interface rework
Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
update datafeed and async_executor to run bow_net demo
update async_executor.py for support set_use_slots
Fix bug
update async_executor.py for support set_use_slots and set set_dense_slots
fix bug that when the number of files is less than the number of threads, it will fetch nan
remove redundant code, and make executor exit when set a illegal queue size
add MultiSlotDesc
Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
add some checkpoint in DataFeedDesc
Fix fetch variable
fix code style in async_executor
Fix fetch var
add utest for data_feed
Add comments for python
update utest for data_feed for multithread
fix bug in utest of data_feed
Add comments for python code
Fix pybind.cc with new pybind11 version
add note for DataFeedDesc's set_use_slots function
update data_feed_test for multi-type
Add save_model
update data_feed_test for generate test data file
removed unnecessary interfaces and add comments
add comment for executor_thread_worker
Remove unused code
update data_feed.cc
c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods
test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix code style
test=develop
* Fix code style
test=develop
* update datafeed
* Fix code style
test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style
test=develop
* Fix windows build failure
test=develop
* Fix code style and windows build failure
test=develop
* Fix PYTHON3.5 build failure
test=develop
* AsyncExecutor API
test=develop
7 years ago
JiabinYang
a770d5c9db
fix error don't interupt shell
...
, test=develop
7 years ago
whs
1b9753d109
Make pad2d support for variable paddings. ( #14667 )
...
* Make pad2d support for variable paddings.
test=develop
* Rename get_paddings and add inline modifier.
test=develop
* Fix comments.
7 years ago
Tao Luo
2af5762cf8
Merge pull request #14668 from wzzju/use_small_dam
...
support the small dam model. test=develop
7 years ago
Tao Luo
ff16c47898
Merge pull request #14671 from luotao1/box_coder
...
speedup box_coder_op for multi-threads
7 years ago
baojun-nervana
fc61bf1b16
Renamed methods
...
test=develope
7 years ago
sneaxiy
096673f675
refactor eager deletion
...
test=develop
7 years ago
ZhenWang
6e48e47406
test=develop
7 years ago
ZhenWang
e1da6cd754
add the normal dam and the small dam
7 years ago
luotao1
bcc90123f0
speedup box_coder_op for multi-threads
...
test=develop
7 years ago
ZhenWang
d5947b0ed7
test=develop
7 years ago
ZhenWang
33b4963505
unify the normal and small dam model.
7 years ago
Yan Chunwei
4b7617740e
fix container not cleared ( #14231 )
7 years ago
Tao Luo
c856ac8721
add OpHasAttr in node.h, update is_test_pass and mkldnn_placement_pass
...
test=develop
7 years ago
ZhenWang
8f2e556e65
support the small dam model. test=develop
7 years ago
phlrain
6ce4250172
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
Qiao Longfei
44debca844
Merge pull request #14589 from jacquesqiao/refactor-prefetch
...
Refactor prefetch
7 years ago
phlrain
bd94ab0ef3
rename op; test=develop
7 years ago
phlrain
92f5be1d82
remove inputvarname in operator; test=develop
7 years ago
Xin Pan
40f1c4a6f0
fix
...
test=develop
7 years ago
phlrain
cf1fe61004
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
luotao1
5db273d874
enhance HasAttr to fix ci
...
test=develop
7 years ago
Yu Yang
589b863b98
Add EstiminateFlops
...
test=develop
7 years ago
phlrain
4b9689379f
fix cudnn lstm; test=develop
7 years ago
phlrain
d1a17cadd4
fix cudnn rnn; test=develop
7 years ago
JiabinYang
4124253796
add mac ci check on import, test=develop
7 years ago
Qiao Longfei
9450048acb
add PADDLE_ENABLE_REMOTE_PREFETCH to enable remote prefetch
...
test=develop
7 years ago
Xin Pan
75939c2059
fix
...
test=develop
7 years ago
Tao Luo
20120d9c97
Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
...
[MKL-DNN]conv2d transpose
7 years ago
Qiao Longfei
3e45a5a5ec
lookup_table gpu kernel support prefetch
...
test=develop
7 years ago
Zhaolong Xing
d215293c92
Merge pull request #14649 from NHZlX/add_params_sync_pass
...
Add params sync pass
7 years ago
Qiyang Min
055da6e00d
Merge pull request #14656 from velconia/disable_dist_transpiler_ut_in_mac
...
Change pip to correct version when install wheel package
7 years ago
qingqing01
731d45a39a
Enable BatchNorm to use global mean and variane during training ( #14630 )
...
* Enable BatchNorm to use global mean and variane during training
* Update doc and follow comments.
7 years ago
nhzlx
49c28b8c52
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
...
test=develop
7 years ago
nhzlx
3c83a2f720
fix comments
7 years ago
Xin Pan
ad6ed5b745
fix py3
...
test=develop
7 years ago
Xin Pan
0cc9ab3dc2
enable API check for readers
...
test=develop
7 years ago
luotao1
4a4daa8ab4
Merge branch 'develop' into has_attr
7 years ago
Qiao Longfei
75eba6108d
Add scope doc ( #14582 )
...
* add doc for scope
* update doc for force_init_on_cpu
test=develop
* follow comment test=develop
* update format test=develop
7 years ago
Tao Luo
ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
...
Softmax for inference MKL further changes
7 years ago
Qiao Longfei
3a3cfc2d8d
prefetch support gpu
...
test=develop
7 years ago
minqiyang
fe0dee88d8
Change pip version to correct version when install wheel package
...
test=develop
7 years ago
baojun-nervana
d5ee05e6c3
Replaced VarIsTensor
...
test=develop
7 years ago
baojun-nervana
e6bd53be60
Named to RuntimeInferShape
...
test=develop
7 years ago
Sang Ik Lee
24e70920db
Refactor some build settings.
...
test=develop
7 years ago
baojun-nervana
a29696146c
Added annotation
...
test=develop
7 years ago
Sang Ik Lee
d6125a5eec
Include ngraph in inference demo build.
...
test=develop
7 years ago
baojun-nervana
caf4b937b3
Added RunInferShape
...
test=develop
7 years ago
baojun-nervana
1d19eb2bd4
Implemented ngraph engine
...
test=develop
7 years ago
Qiao Longfei
4b9082a4cd
follow comment
7 years ago
Tao Luo
b4de023ee1
Merge pull request #14636 from Superjomn/fix/word2vec
...
fix word2vec bug
7 years ago
luotao1
fe915901cd
update Opdesc's HasAttr
...
test=develop
7 years ago
chengduo
6776e92846
refine tensor_array_write_read ( #14643 )
...
test=develop
7 years ago
nhzlx
d3e140a572
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
...
test=develop
7 years ago
nhzlx
d666c8eb1d
fix benchmark
7 years ago
nhzlx
900fbb83f9
add params sync pass
7 years ago
superjomn
9c665c81ae
update
...
test=develop
7 years ago
Jacek Czaja
48e1b97e8e
- Coding style fixes
...
test=develop
7 years ago
Qiao Longfei
d32de7e6e1
fix code format test=develop
7 years ago
Qiao Longfei
5a660aee7d
update log level in parameter prefetch test=develop
7 years ago
Qiao Longfei
8ebde595c9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
7 years ago
Qiao Longfei
b9d3d75fc4
fix prefetch dependency test=develop
7 years ago
Qiao Longfei
145c535750
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
7 years ago
minqiyang
9d7c3b18c0
Polish code
...
test=develop
7 years ago
minqiyang
2b430adaee
Polish code
...
test=develop
7 years ago
minqiyang
a02ce58f2c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
7 years ago
Jiabin Yang
12e1719f96
Merge pull request #14352 from JiabinYang/enhance_hierachical_sigmod_op
...
Enhance hierarchical sigmoid op
7 years ago
Qiao Longfei
40f68b1349
unit test ready
7 years ago
Qiao Longfei
36e26a53b0
Optimize bilinear tensor product op ( #14485 )
...
* optimize bilinear_tensor_product
* add set zero to set grad to 0.
7 years ago
Tao Luo
4ec9de0122
Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul
...
EltwiseMul: Changes from previous PR
7 years ago
Qiao Longfei
35b79ab865
Merge pull request #13983 from jacquesqiao/add-ctr-reader
...
Add ctr reader
7 years ago
wopeizl
b1dbbb7f88
Merge pull request #14629 from wopeizl/windows/port
...
fix the build issue on manylinux1
7 years ago
Qiao Longfei
da387720d7
fix infer compile test=develop
7 years ago
Jacek Czaja
cf40daee58
- Building fix to softmax for inference
7 years ago
Clementine
6c71c1f8f9
Add activation gelu ( #14569 )
7 years ago
Michal Gallus
9455be0ba5
EltwiseMul: Extract StringToFormat to MKLDNN helper
...
test=develop
7 years ago
peizhilin
351dc78e1c
code style fix
...
test=develop
7 years ago
Jacek Czaja
1540df51cf
- Fix to test_conv2d_transpose_mkldnn for GPU
...
test=develop
7 years ago
JiabinYang
eda069068d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
7 years ago
JiabinYang
a08dc83eb0
remove arg 'non_leaf_num', test=develop
7 years ago
chengduo
6648f5ed6f
add ShareLoD for dropout_grad ( #14616 )
...
test=develop
7 years ago
peizhilin
b6b8626e9c
fix the build issue on manylinux1
7 years ago
Qiao Longfei
18fd2d01b7
update embedding api
7 years ago
JiabinYang
7594787deb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
7 years ago
JiabinYang
c469334cfb
polish python code and comment, test=develop
7 years ago
Xin Pan
3c77ce3751
Merge pull request #14593 from panyx0718/fix5
...
Protect important header files.
7 years ago
Qiao Longfei
92afbb923c
fix compile problem test=develop
7 years ago
Tao Luo
e8ef14d2a7
Merge pull request #14610 from Superjomn/revert/cache_fix
...
Revert "fix transfer cache thread_local bug (#14581 )"
7 years ago
Qiao Longfei
97cbec9b74
clean code
7 years ago
Qiao Longfei
1edd435da6
fix ci problem test=develop
7 years ago
JiabinYang
87648f8edf
merge develop, test=develop
7 years ago
Yiqun Liu
726f2cefe3
Fix bug of referencing a temporary variable. ( #14614 )
...
test=develop
7 years ago
wopeizl
db9284ecde
Merge pull request #14617 from wopeizl/windows/online
...
Windows/online
7 years ago
JiabinYang
c3c3c0b33c
polish code, test=develop
7 years ago
gongweibao
867c312bc4
Fix allreduce dependency order. ( #14586 )
7 years ago
Jacek Czaja
8bfa1fa9bb
- ASUM MKL integration
7 years ago
phlrain
487ee36aec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago