Yancey1989
ca8c77d966
selecte execution according to strategy test=develop
6 years ago
qingqing01
6f0a1d7b47
Inception fusion operator. ( #14968 )
...
* Inception fusion operator.
* Support horizontal layer fusion in conv_fusion_op.
* Search conv algo strategy for variable-length input.
search N times and cache the searched algos. For other input, choose the algo of input whose area is closest to this input.
6 years ago
Zeng Jinle
77e11ce843
Merge pull request #15101 from sneaxiy/disable_data_balance_test
...
Disable data balance unittest
6 years ago
Qiao Longfei
877289c4ca
fix dist_ctr getenv, test=develop
6 years ago
Qiao Longfei
1e04222890
add test_dist_ctr_with_l2_decay.py
6 years ago
Kaipeng Deng
eebb82a2f6
Merge pull request #15097 from heavengate/adaptive_pool_ksize
...
adaptive_pool support pool_size as int. test=develop
6 years ago
Qiao Longfei
e77f54734b
add unit test for dist sparse l2 decay
6 years ago
sneaxiy
8bd0b028e2
disable data balance unittest
...
test=develop
6 years ago
Qiao Longfei
49cce3fd0e
fix dist sparse l2 decay
...
test=develop
6 years ago
dengkaipeng
ccc83bb4e5
adaptive_pool support pool_size as int. test=develop
6 years ago
Zeng Jinle
f129471bc1
Merge pull request #15083 from sneaxiy/fix_py_reader_unittest
...
Try to fix test_py_reader_use_executor.py
6 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
6bb84490af
Fix imperative unit test
...
test=develop
6 years ago
Xin Pan
3e8408429d
Merge pull request #15053 from panyx0718/imperative_hold
...
refactor to avoid scope.
6 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
6 years ago
Yancey1989
4743c9cd5d
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
sneaxiy
26695e0bb2
add thread join
...
test=develop
6 years ago
Yu Yang
efa630eadb
Refine Dockerfile ( #14908 )
...
* Refine Dockerfile
* Add tasks, cmake gen
* Fix code error
* Disable compile after paddle_build.sh
* Refine
* Skip on PY35 CI
* Change env
* Refine paddle_build.sh
* Expose gen_fluid_lib
* Refine mkldnn.cmake
* Refine mkldnn.cmake
* Refine mkldnnlib
* Skip unstable tests
6 years ago
Qiao Longfei
39a400345e
add unit test for test_adam_op_multi_thread
...
test=develop
6 years ago
chengduo
fe8495a758
[WIP] Refine MultiDevSSAGraph ( #15040 )
...
* refine parallel_exe
test=develop
* rename shared_var_device
* code refine
* add test_weight_decay
* remove Sort
test=develop
* Add SortForReduce
test=develop
* code refine
test=develop
* follow comment
test=develop
6 years ago
Qiao Longfei
d161215332
optimize adam multi thread
6 years ago
sneaxiy
6dd623b1e3
test=develop
6 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
Xin Pan
f52b514dcd
call kernel
6 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
6 years ago
Yancey1989
8cad371a60
fix nccl unittest acc test=develop
6 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
6 years ago
minqiyang
68e9b841ab
Add support for optimizer
6 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
6 years ago
Qiao Longfei
0384f3309a
enable unit test for test_nce
...
test=develop
6 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
6 years ago
Qiao Longfei
b53eb7dcda
add init once for assign layer
6 years ago
Yancey1989
1a4f79a7de
fix unittest test=develop
6 years ago
Yancey1989
a8612adb04
fix lr scale test=develop
6 years ago
Yancey1989
86bb583881
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989
845bfd5807
cleanup code
6 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb
.
6 years ago
Yancey1989
41a64f6a2a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
JiabinYang
2aa1dc67ce
test=develop, fix test_dist_transpiler failed
6 years ago
whs
2314f2ebb3
Make topk op support variable k. ( #15044 )
...
* Make topk op support variable k.
test=develop
* Fix tensor type.
test=develop
6 years ago
shippingwang
5631fc0817
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
shippingwang
3757c1ee47
Modify test layers, test=develop
6 years ago
wopeizl
09bd8fa67a
Merge pull request #15035 from wopeizl/debug/improvement1
...
cherry-pick the #12759
6 years ago
Wu Yi
856f0da0fe
Fp16 training ( #14992 )
...
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
* make fp16 lr schedule simple test=develop
* fix ut test=develop
* fix tests test=develop
* remove fp16 learning rate cast test=develop
6 years ago
shippingwang
942d7cf7ef
Modify python interface, test=develop
6 years ago
JiabinYang
cb478f7a94
just for test
6 years ago
shippingwang
0a0b6f4a22
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
peizhilin
7f6d8acecb
cherry-pick the #12759
...
test=develop
6 years ago
tensor-tang
05d1121b22
Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
...
Enable fill_constant operator for a ngraph engine
6 years ago
tensor-tang
9d4f1d468a
Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
...
Enable top_k operator for a ngraph engine
6 years ago
tensor-tang
8a6ac4dba7
Merge pull request #14973 from xiaolil1/dequantize
...
Add Dequantize OP
6 years ago
tensor-tang
f0e02a65ed
Merge pull request #14974 from xiaolil1/quantize
...
Add Quantize OP
6 years ago
shippingwang
76c6f11584
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
shippingwang
16d4e13711
Add ShuffleChannelOP
6 years ago
Tao Luo
d4f384767a
Merge pull request #14987 from jczaja/prv-transpose-mkldnn-grad
...
[MKL-DNN] Added transpose/transpose2 MKLDNN grad ops
6 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Featue/py_func op
6 years ago
mozga-intel
7048caf9a0
Enable top_k operator for a ngraph
...
test=develop
6 years ago
mozga-intel
ecfa68ecaa
Enable fill_constant operator for a ngraph
...
test=develop
6 years ago
minqiyang
9e24fa3aeb
Polish code
...
test=develop
6 years ago
minqiyang
1c116462cc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
29697c2e25
Add stop_gradient to VarBase to support loss function
...
test=develop
6 years ago
Jacek Czaja
d05094eec3
- Added transpose/transpose2 MKLDNN grad ops
...
test=develop
6 years ago
Xin Pan
2537ac512e
Merge pull request #14983 from typhoonzero/revert_14850
...
Revert "[Feature] Fp16 training for resnet50 (#14850 )"
6 years ago
chengduo
a015a8a39d
Refine merge_selected_rows Doc ( #14748 )
...
* add doc for MergeSelectedRows
test=develop
* checkout selected_rows
test=develop
6 years ago
typhoonzero
da87f7a698
Revert "[Feature] Fp16 training for resnet50 ( #14850 )"
...
This reverts commit 3d750f9c5a
.
6 years ago
Yihua Xu
3babc80160
Fix the regression issue and add the group unitest for conv2d ( #14932 )
...
* Add test items for mkldnn conv2d
* Fix the regression issue and pass the unit test for conv2d and conv3d
test=develop
6 years ago
minqiyang
fba3712a7b
Add multi-input to forward function in Layer
6 years ago
Wu Yi
3d750f9c5a
[Feature] Fp16 training for resnet50 ( #14850 )
...
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
6 years ago
heqiaozhi
f18e8a7a5e
remove some comments & refine doc & put template class in .h
...
test=develop
6 years ago
xiaoli.liu@intel.com
3146629485
Add Dequantize OP
...
test=develop
6 years ago
xiaoli.liu@intel.com
019dbf7f5f
Add Quantize OP
...
test=develop
6 years ago
minqiyang
3cd10a7c4f
Add Conv2D forward
...
test=develop
6 years ago
JiabinYang
bfcb5e5235
test=develop, fix gpu compile error on prefetch, and fix hs/nce ut failed on gpu
6 years ago
Xin Pan
cf3a07e8f8
Merge pull request #14878 from panyx0718/imperative
...
MLP forward backward
6 years ago
minqiyang
8d88c5a87d
Shameless copy
6 years ago
JiabinYang
8515ee3a29
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
mozga-intel
9035bb81fe
Enable mul operator for a ngraph engine ( #14801 )
...
* Enable mul operator for a ngraph
test=develop
* Enable activation ops test
test=develop
* Remove unused line
test=develop
6 years ago
heqiaozhi
5c7a8aee07
merge upstream to my develop
...
test=develop
Merge remote-tracking branch 'upstream/develop' into develop
6 years ago
Jacek Czaja
aa6e9c30be
[MKL-DNN ]Added transpose/transpose2 Op ( #14872 )
...
* - Added transpose MKLDNN Op
- Few basic UT works
- Added 1D transpose
- implementing generic mem desc for MKLDNN transpose
- Modified trnaspose op to support more dimensional data eg. 5,6..10
- Added is_test attribute to transpose op
test=develop
* - Added support for MKLDNN::memory::format::any for Transpose MKLDNN op
test=develop
* - Additional transpose mkldnn op correction to mkldnn layout
test=develop
* Cosmetic fixes
test=develop
* - Removed const_cast to obey coding standard
test=develop
6 years ago
heqiaozhi
4f6e9e3ac3
teacher student sigmoid loss
6 years ago
tangwei12
f7fb937bfe
fix in cmake, test=develop
6 years ago
JiabinYang
19a8d96585
fix nce in test_dist_transpiler, test=develop
6 years ago
JiabinYang
52ca40fcfe
Merge branch 'feature/add_prefech_hs' of https://github.com/JiabinYang/Paddle into feature/add_prefech_hs
6 years ago
JiabinYang
3b7b2e1ded
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
JiabinYang
b2f789c66d
add test transpiler dist test, test=develop
6 years ago
tangwei12
547c1bd11d
Merge branch 'feature/add_prefech_hs' of https://github.com/JiabinYang/Paddle into add_prefetch_nce_hs
6 years ago
tangwei12
e0c3c56b06
add nce remote ut, test=develop
6 years ago
Xin Pan
6324032602
MLP forward backward
...
test=develop
6 years ago
Xin Pan
c89a1fb287
Merge pull request #14879 from panyx0718/clean
...
clean parallel do
6 years ago
Qiao Longfei
e2d56561e7
Merge pull request #14889 from jacquesqiao/optimize-adam
...
adam optimizer support lazy mode
6 years ago
tangwei12
aed3872c1c
add int cast, test=develop
6 years ago
Zeng Jinle
16c244bc3f
Merge pull request #14928 from sneaxiy/fix_gc
...
Fix gc bug
6 years ago
tangwei12
41790f1366
add ut about nce
6 years ago
dzhwinter
7cd24b1318
add ir memory optimize. ( #14530 )
...
* follow comments. test=develop
* Fix typo
* fix compile error. test=develop
* merge develop branch. test=develop
* Remove set_equal
* Polish code
* Delete unused functions
test=develop
* polish code. test=develop
* follow comment
* polish code.
* fix windows compile error. test=develop
* fix op handle.
* rerun ci. test=develop
* rerun ci. test=develop
* rerun macci. test=develop
* polish code. test=develop
* rewrite sort code. test=develop
* remove unused code. test=develop
* fix tests. test=develop
* fix conflict. test=develop
* follow comment. test=develop
* merge develop branch. test=develop
* fix tests. test=develop
* remove ToTypeIndex. test=develop
* rerun ci. test=develop
6 years ago
JiabinYang
656040c726
merge reyoung optimization
6 years ago
sneaxiy
c631412eab
fix gc bug
...
test=develop
6 years ago
shippingwang
7f73c16e42
Add
6 years ago
shippingwang
2dd55b873f
Add shuffle_channel_op
6 years ago
Qiao Longfei
1141db8114
update test_adam_op
...
test=develop
6 years ago
Yancey1989
a7d6b1f921
code cleanup test=develop
6 years ago
Yancey1989
a760a550b0
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989
fd144954ed
redefine api test=develop
6 years ago
Qiao Longfei
238b24bfa2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
6 years ago
tangwei12
e196fa367b
update ut, test=develop
6 years ago
Kaipeng Deng
dc76e4b0f1
Merge pull request #14701 from heavengate/adaptive_pool
...
add adaptive pool2d and pool3d
6 years ago
mozga-intel
67b555d3d3
Enable ngraph tests for a ngraph engine ( #14800 )
...
* Enable ngraph tests for a ngraph engine
test=develop
* Move the test structure to other place
test=develop
* Add USE_NGRAPH flag, simple structure
test=develop
6 years ago
tangwei12
723f68727d
add ut about nce in transpiler
6 years ago
Yancey1989
4a4ccac1d0
update by comment test=develop
6 years ago
JiabinYang
f702ab74b9
add dist transpiler test
6 years ago
Qiao Longfei
c624417c6f
change sparse mode to lazy mode
6 years ago
Qiao Longfei
4035e4bab2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
6 years ago
Yancey1989
4f304eaa6f
fix unittest test=develop
6 years ago
sneaxiy
f0df62f136
add more unittest case
...
test=develop
6 years ago
Qiao Longfei
fc6ec6bd14
add sparse mode adam
6 years ago
Yancey1989
c722b1dcb6
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
6 years ago
Xin Pan
47ea2534fb
clean parallel do
...
test=develop
6 years ago
Yan Xu
5384206aec
Merge pull request #14869 from Yancey1989/fix_dist_unittest
...
fix dist unit test
6 years ago
sneaxiy
f6741df462
merge develop
...
fix bug
test=develop
6 years ago
Zeng Jinle
1b564bc49a
Merge pull request #14670 from sneaxiy/refactor_eager_deletion
...
Rewrite eager deletion
6 years ago
SunGaofeng
e3c4b0dace
this is for psroi_pool op, test=develop ( #14796 )
...
* Add psroi_pool operator.
6 years ago
Yancey1989
fa1f77e20c
enable ci test=develop
6 years ago
Yancey1989
23eb8c4299
fix ci test=develop
6 years ago
JiabinYang
50fce87905
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
chengduo
582011ba76
Add L2 unit test ( #14792 )
...
* add l2 unit test
test=develop
* code refine
test=develop
6 years ago
sneaxiy
e7c5c9d2de
remove unnecesary code
...
test=develop
6 years ago
sneaxiy
db2daefe50
merge develop
...
test=develop
6 years ago
sneaxiy
8b9d33fa1e
add unittest and fix bug
...
add API.spec
test=develop
6 years ago
Wu Yi
f95ee9c09f
fix nccl dist test acc ( #14867 )
...
* fix nccl dist test acc test=develop
* fix test=develop
6 years ago
Yancey1989
106e285236
add unittest for parllelgraph mode test=develop
6 years ago
JiabinYang
c2e851f7b2
test=develop, remove sparse bias and add prefetch and related tests
6 years ago
Tao Luo
66b6e473d0
Merge pull request #14732 from Sand3r-/mgallus/mkldnn-concat
...
[MKL-DNN] Concat Layer
6 years ago
sneaxiy
ca84c2ca8f
merge develop
...
test=develop
6 years ago
dengkaipeng
a81fabd327
fix doc errors. test=develop
6 years ago
dengkaipeng
266c6856c9
add adaptive pool 2d & 3d. test=develop
6 years ago
Qiao Longfei
1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
...
split selected rows op should always init output selected rows
6 years ago
JiabinYang
c35fdf1581
Merge branch 'add_prefetch_in_nce' of https://github.com/seiriosPlus/Paddle into feature/add_prefech_hs
6 years ago
sneaxiy
7923042365
merge develop
...
test=develop
6 years ago
Qiao Longfei
0a7c7e97af
test zero output of split_selected_rows_op
...
test=develop
6 years ago
dzhwinter
00776b167a
fix memory opt skip set by name ( #14774 )
...
* random failed. rerun ci. test=develop
* windows failed. rerun ci. test=develop
6 years ago
zhang wenhui
c4c5f0b8ca
Merge pull request #14771 from frankwhzhang/bpr
...
add bpr_loss operator
6 years ago
Xin Pan
4a716a70c6
Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative
...
Imperative
6 years ago
Wu Yi
554bcdbdfc
add more log for dist test for ci test=develop ( #14813 )
...
* add more log for dist test for ci test=develop
* increase deadline test=develop
6 years ago
frankwhzhang
c9a653820b
fix label_pos ,add test_layers.py, test=develop
6 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
6 years ago
sneaxiy
2c6159a151
fix unittest
...
fix cmake
test=develop
6 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
6 years ago
sneaxiy
eb8252466b
polish code
...
add unittest model containing while_op
remove unnecessary codes
test=develop
6 years ago
chengduo
c80c693a0f
Enable test_gradient_clip in mac ( #14765 )
...
* enable test_gradient_clip in mac
test=develop
* refine test_gradient_clip
test=develop
6 years ago
frankwhzhang
a3381dc740
update question for api ,test=develop
6 years ago
frankwhzhang
f4cc5881b0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang
97de98cd0a
update bpr_loss op code, test=develop
6 years ago
Xin Pan
6c80bb3ce9
Merge pull request #14786 from PaddlePaddle/revert-14782-revert-14398-imperative
...
Revert "Revert "Imperative""
6 years ago
Tao Luo
54fcafb5f6
Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
...
Implement conv3d with mkldnn library
6 years ago
tangwei12
b653ed0516
add prefetch and remvoe selectedrows of bias
6 years ago
Xin Pan
2538ef64f1
Revert "Revert "Imperative""
6 years ago
Wu Yi
aebc175cd4
add nccl2 dist tests ( #14755 )
...
* add nccl2 dist tests test=develop
* fix dist_base test=develop
* fix tests test=develop
* fix test on mac test=develop
6 years ago
Xin Pan
6217f42ab7
Revert "Imperative"
6 years ago
frankwhzhang
b51df39874
update , test=develop
6 years ago
Xin Pan
b52f5d2870
Merge pull request #14398 from panyx0718/imperative
...
Imperative
6 years ago
frankwhzhang
272f3d3111
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
6 years ago
frankwhzhang
570d89ec84
add bpr_loss operator , test=develop
6 years ago
qingqing01
549f165b59
Speed conv_fusion_op for identity activation. ( #14744 )
...
* Refine conv_fusion_op for identity activation.
* Fix unit testing.
test=develop
6 years ago
Xin Pan
0492158da5
polish
...
test=develop
6 years ago
Yihua Xu
65dbc7cca4
Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
6 years ago
Wu Yi
e2011f1353
test dist ut fixes test=develop ( #14706 )
...
* test dist ut fixes test=develop
* fix cmake
* for test
6 years ago
chengduo
04539d4c5d
Fix clip.py ( #14718 )
...
* expose square
test=develop
* fix activation
test=develop
* Add square API
test=develop
* add necessary op
* code refine
* fix API.spec
test=develop
* fix unit test
test=develop
* add unit test sparse_grad_clip
test=develop
* fix API.spec
test=develop
* remove mac test for test_gradient_clip
test=develop
* remove selectedrows_mul_tensor
test=develop
6 years ago
sneaxiy
d0c8b9b9b3
remove timeout unittest
...
test=develop
6 years ago
Michal Gallus
87eb8b0e28
Set cpu only for MKL-DNN concat UTs
...
test=develop
6 years ago
Michal Gallus
f2a880421e
Fix style @ concat integration and tests
...
test=develop
6 years ago
Michal Gallus
208f912512
Implement MKL-DNN Concat
...
test=develop
6 years ago
sneaxiy
e694d0c2e4
fix while_op eager deletion bug
...
add unittest
test=develop
6 years ago
Kaipeng Deng
934f13a70a
Merge pull request #14371 from heavengate/yolo_loss
...
Add YOLOv3 loss operator for YOLOv3 model
6 years ago
Yihua Xu
ac803fed18
Fix the compile issue for cuda device (test=develop)
6 years ago
Xin Pan
b80fe8264a
polish
...
test=develop
6 years ago
jerrywgz
96dc3d8326
Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
...
add ignore index for sigmoid cross entropy with logits op, test=develop
6 years ago
Yihua Xu
669191c9cc
Implement conv3d with mkldnn library (test=develop)
6 years ago
Xin Pan
e5d64fd4d1
initial imperative
...
test=develop
6 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
6 years ago
Xin Pan
b1f6fda5e5
run forward
6 years ago
Xin Pan
a6d23083f0
some tracing
...
test=develop
6 years ago
Xin Pan
dac92e560c
initial commit
6 years ago
phlrain
1d63b06bd7
add grad test unit; test=develop
6 years ago
phlrain
24fa1f4b8c
fix test uni; test=develop
6 years ago
phlrain
0b8a377d9c
fix cpu test; test=develop
6 years ago
phlrain
b65722d3cf
fix uni test; test=develop
6 years ago
phlrain
2770ea1a73
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
phlrain
a6aa782ee5
add unitest
6 years ago
Wang Guibao
41e19eb431
AsyncExecutor ( #14627 )
...
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deaded lock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side
Google naming conventions
Rename MultiExecutor to AsyncExecutor
pybind with async_executor
Naming convention
remove some flags and unused code
add refactored file of async_executor and data_feed
clear async executor interface and add data feed factory
split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
refine async_executor_refactor.cc
add some files about datafeed
Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
add MultiSlotDataFeed
Interface rework
Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
update datafeed and async_executor to run bow_net demo
update async_executor.py for support set_use_slots
Fix bug
update async_executor.py for support set_use_slots and set set_dense_slots
fix bug that when the number of files is less than the number of threads, it will fetch nan
remove redundant code, and make executor exit when set a illegal queue size
add MultiSlotDesc
Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
add some checkpoint in DataFeedDesc
Fix fetch variable
fix code style in async_executor
Fix fetch var
add utest for data_feed
Add comments for python
update utest for data_feed for multithread
fix bug in utest of data_feed
Add comments for python code
Fix pybind.cc with new pybind11 version
add note for DataFeedDesc's set_use_slots function
update data_feed_test for multi-type
Add save_model
update data_feed_test for generate test data file
removed unnecessary interfaces and add comments
add comment for executor_thread_worker
Remove unused code
update data_feed.cc
c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods
test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix code style
test=develop
* Fix code style
test=develop
* update datafeed
* Fix code style
test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style
test=develop
* Fix windows build failure
test=develop
* Fix code style and windows build failure
test=develop
* Fix PYTHON3.5 build failure
test=develop
* AsyncExecutor API
test=develop
6 years ago
whs
1b9753d109
Make pad2d support for variable paddings. ( #14667 )
...
* Make pad2d support for variable paddings.
test=develop
* Rename get_paddings and add inline modifier.
test=develop
* Fix comments.
6 years ago
Qiao Longfei
44debca844
Merge pull request #14589 from jacquesqiao/refactor-prefetch
...
Refactor prefetch
6 years ago
Qiao Longfei
839193fd1f
fix unit test test=develop
6 years ago
Qiao Longfei
9450048acb
add PADDLE_ENABLE_REMOTE_PREFETCH to enable remote prefetch
...
test=develop
6 years ago
Tao Luo
20120d9c97
Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
...
[MKL-DNN]conv2d transpose
6 years ago
qingqing01
731d45a39a
Enable BatchNorm to use global mean and variane during training ( #14630 )
...
* Enable BatchNorm to use global mean and variane during training
* Update doc and follow comments.
6 years ago
Qiao Longfei
3a3cfc2d8d
prefetch support gpu
...
test=develop
6 years ago
chengduo
6776e92846
refine tensor_array_write_read ( #14643 )
...
test=develop
6 years ago
Qiao Longfei
8ebde595c9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
6 years ago
Yu Yang
4ffc376497
Merge pull request #14635 from reyoung/feature/loose_atol
...
Loose atol in test_pe_test_wwhile_train
6 years ago
Qiao Longfei
145c535750
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
6 years ago
Qiao Longfei
b2c9efef2b
add more unit test for lookup_remote_table
...
test=develop
6 years ago
Jiabin Yang
12e1719f96
Merge pull request #14352 from JiabinYang/enhance_hierachical_sigmod_op
...
Enhance hierarchical sigmoid op
6 years ago
Yu Yang
d40881cb82
Loose atol in test_pe_test_wwhile_train
...
test=develop
6 years ago
Qiao Longfei
40f68b1349
unit test ready
6 years ago
Clementine
6c71c1f8f9
Add activation gelu ( #14569 )
6 years ago
Jacek Czaja
1540df51cf
- Fix to test_conv2d_transpose_mkldnn for GPU
...
test=develop
6 years ago
JiabinYang
a08dc83eb0
remove arg 'non_leaf_num', test=develop
6 years ago
Qiao Longfei
953c79caec
use dist_ctr to test remote_prefetch
6 years ago
JiabinYang
c469334cfb
polish python code and comment, test=develop
6 years ago
JiabinYang
87648f8edf
merge develop, test=develop
6 years ago
wopeizl
db9284ecde
Merge pull request #14617 from wopeizl/windows/online
...
Windows/online
6 years ago
JiabinYang
c3c3c0b33c
polish code, test=develop
6 years ago
tangwei12
56a4912b76
Make NCE_OP more efficient and support SelectedRows ( #14469 )
...
* Fix truncated normal.
* Fix.
* Make nce support more distribution.
* Fix API.spec.
* Fix python API.
* Fix.
test=develop
* Fix API.spec
test=develop
* Fix sampler.
* Fix order of arguments in python API.
test=develop
* NCE add selectedrows support
* NCE update weighted sampling
* fix bugs in nce_op, and assign_value_op optimized
* fix bugs in nce_op, revert assign_value_op
* nce_op optimize
* nce_op optimize
* nce_op optimize
* add selectedRows test later
test=develop
* add selectedRows supported
* add selectedRows supported
test=develop
* add selectedRows supported
* add nce selectedRows supported, test=develop
* add nce selectedRows supported
* add nce selectedRows supported, test=develop
* fix height in nce, test=develop
* add ut
* add ut, test=develop
* make AutoGrownIndex inline
test=develop
* fix tinny error, test=develop
6 years ago
Qiao Longfei
9589babe12
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
6 years ago
peizhilin
6a85dd3278
Merge remote-tracking branch 'upstream/develop' into windows/build
...
test=develop
6 years ago
peizhilin
38715e6fd0
minor fix
6 years ago
JiabinYang
7e4bd695e6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
JiabinYang
b10df8bcfa
refine code and add none bias ut, test=develop
6 years ago
Kaipeng Deng
251a1bb0f4
Merge pull request #14588 from heavengate/revert_interpolate
...
fix interpolate_op incompatible. test=develop
6 years ago
Wu Yi
6861fb66a1
disable dist_se_resnext test for later fixes test=develop ( #14605 )
6 years ago
dengkaipeng
ae84b9f0a4
set unittest serial. test=develop
6 years ago
dengkaipeng
78f563917c
revert interpolate_op to bilinear_interp_op & nearest_interp_op. test=develop
6 years ago
Jacek Czaja
06d8777ec2
Coding style fixes
...
test=develop
6 years ago
Jacek Czaja
fb24690a58
- conv2d transpose MKL-DNN
...
test=develop
- Added new header for MKLDNN reuse functionality
- Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt
- Buildable conv transpose mkldnn and conv mkldnn using conv template
- Conv2d transpose roughlt implemented and buildable
- Added modifications conv2d transpose MKLDNN unit tests
- Fix to UT of conv2d transpose mkldnn op
- Wrong type of MKLDNN primitive was chosen for conv2d transpose
- HAcks for conv2d transpose
- UT enalbed
- Replaced copying loop with memcpy
- Draft of passing lambda into AcquireMemory
- Made reorder (IOHW->OIHW) to be called only once
6 years ago
Qiao Longfei
5d5e0656b2
clean code
6 years ago
qingqing01
6224e61fd9
Transpose-Flatten-Concat fusion operator. ( #14568 )
...
* Transpose-Flatten-Concat fusion operator.
* Add unit testing and fix bug.
6 years ago
JiabinYang
02d68051db
add sparsed bias grad, test=develop
6 years ago
Qiao Longfei
2b6c0c09d6
add unit test
6 years ago
minqiyang
3d0f6df313
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_nms_op_ut_in_py36
...
test=develop
6 years ago
minqiyang
5431d5c471
Polish code
...
test=develop
6 years ago
minqiyang
c35bf3d34b
Fix multiclass_nms_op unit test fail in python3.6
...
test=develop
6 years ago
Jiabin Yang
3fe2def1ca
Merge pull request #14540 from JiabinYang/fix_pool2d_doc
...
Fix pool2d doc and add pool2d test in test_layers
6 years ago
qingqing01
36f08eef3b
CUDA kernel for density_prior_box_op. ( #14513 )
...
* CUDA kernel for density_prior_box_op.
* Support flatten to 2D.
6 years ago
JiabinYang
3c6102a367
test=develop
6 years ago
JiabinYang
510601b279
test=develop
6 years ago
Dun
ae7d22862b
Group Norm ( #13843 )
...
Add group normalization operator.
6 years ago
JiabinYang
982e489220
test=develop
6 years ago
Yu Yang
533c5d5803
fix(Cpu): fix cpu compile and unittest
...
test=develop
6 years ago
Tao Luo
dc75cc9136
Merge pull request #14507 from heavengate/fix_interpolate_unittest
...
change interpolate unittest to serial
6 years ago
Tao Luo
5d4d117edc
Merge pull request #14502 from qingqing01/cudnn5_fix
...
Fix compling with cuDNN v5
6 years ago
Jiabin Yang
f7b55de9e5
Merge branch 'develop' into enhance_hierachical_sigmod_op
6 years ago
JiabinYang
af9a3301da
test=develop
6 years ago
dengkaipeng
6c0e09cb1d
change interpolate unittest to serial. test=develop
6 years ago
JiabinYang
014e50c284
test=develop
6 years ago
Dang Qingqing
cda60311f9
Fix compling with cuDNN v5
...
test=develop
6 years ago
tensor-tang
10fb4ceefc
Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul
...
[MKLDNN][JIT][AVX512] Elementwise Mul
6 years ago
jerrywgz
13e254faed
refine code, test=develop
6 years ago
jerrywgz
79cec53111
add ignore index for sigmoid cross entropy with logits op, test=develop
6 years ago
Tao Luo
09ee266f8e
disable two openblas test temporary
...
test=develop
6 years ago
Yu Yang
f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
...
Rewrite allocation
6 years ago
qingqing01
fd7e643153
Convolution fusion operator. ( #14449 )
...
* Convolution fusion operator.
* Clean code
test=develop
6 years ago