JiabinYang
ff2a9786f3
test=develop
6 years ago
ZhenWang
43b9202d9b
test=develop
6 years ago
ZhenWang
1f0fa67571
add some activation api examples.
6 years ago
chengduo
00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH ( #13929 )
...
* refine cublase
test=develop
* code refine
* refine cublas
* add GEMME_EX
* add enable_cublas_tensor_op_math doc and add cublasCall
test=develop
* fix CublasCall for cuda version
test=develop
* fix error
test=develop
* fix GEMM_EX to be compatible with gcc 4.8
test=develop
* add GEMM_EX
test=develop
* to compatiable with gcc4.8
test=develop
6 years ago
peizhilin
dfbac60398
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
7c8c9dc9bf
fix unit test cases
6 years ago
JiabinYang
3c6102a367
test=develop
6 years ago
JiabinYang
510601b279
test=develop
6 years ago
tangwei12
dd6fd4c747
Utils for download and upload files with HDFS ( #14473 )
...
* add hdfs utils
* add hdfs utils
* test=develop
* update hdfs utils and add demo
* fix multi_download return local files
* test=develop
* add sync multi upload, test=develop
6 years ago
Dun
ae7d22862b
Group Norm ( #13843 )
...
Add group normalization operator.
6 years ago
Jiabin Yang
de2db11735
Merge pull request #14537 from reyoung/feature/fix_macos_ut
...
fix(Cpu): fix cpu compile and unittest
6 years ago
JiabinYang
982e489220
test=develop
6 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
6 years ago
Yu Yang
533c5d5803
fix(Cpu): fix cpu compile and unittest
...
test=develop
6 years ago
Tao Luo
dc75cc9136
Merge pull request #14507 from heavengate/fix_interpolate_unittest
...
change interpolate unittest to serial
6 years ago
peizhilin
bef475c92b
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Tao Luo
5d4d117edc
Merge pull request #14502 from qingqing01/cudnn5_fix
...
Fix compling with cuDNN v5
6 years ago
Jiabin Yang
f7b55de9e5
Merge branch 'develop' into enhance_hierachical_sigmod_op
6 years ago
JiabinYang
af9a3301da
test=develop
6 years ago
dengkaipeng
6c0e09cb1d
change interpolate unittest to serial. test=develop
6 years ago
Qiyang Min
b8f36bd074
Merge pull request #14519 from velconia/add_api_example
...
Add API example for logical ops and clip ops
6 years ago
jerrywgz
94de2290f4
fix format in api doc, test=develop
6 years ago
JiabinYang
014e50c284
test=develop
6 years ago
minqiyang
1b372d9b9d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into change_requirements
...
test=develop
6 years ago
minqiyang
53760bb111
Change requirements to support python 3.7
...
test=develop
6 years ago
Dang Qingqing
cda60311f9
Fix compling with cuDNN v5
...
test=develop
6 years ago
minqiyang
175b847f6d
Add API example for logical ops and clip ops
...
test=develop
6 years ago
peizhilin
67562a6fcd
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
703b26e697
add profiler, parallel_executor back
6 years ago
tensor-tang
10fb4ceefc
Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul
...
[MKLDNN][JIT][AVX512] Elementwise Mul
6 years ago
jerrywgz
13e254faed
refine code, test=develop
6 years ago
jerrywgz
79cec53111
add ignore index for sigmoid cross entropy with logits op, test=develop
6 years ago
Tao Luo
5cc7946313
Merge pull request #14499 from luotao1/disable_openblas_test
...
disable two openblas test temporary
6 years ago
jerrywgz
0e1b426c83
refine prelu api doc, test=develop
6 years ago
peizhilin
935387f3fc
code style
6 years ago
peizhilin
25adf970b2
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
301ed15323
remove unsupported flag on windows
6 years ago
Kaipeng Deng
f17b05d4a1
Merge pull request #14492 from heavengate/resize_nearest_example
...
Add python example for resize_nearest Api
6 years ago
Tao Luo
09ee266f8e
disable two openblas test temporary
...
test=develop
6 years ago
dengkaipeng
bb2b35c85e
Add python example for resize_nearest. test=develop
6 years ago
chengduo
a94a7355f0
Refine the GraphNum check ( #14144 )
...
* refine GraphCheck
test=develop
* fix ci fail
test=develop
6 years ago
peizhilin
3a72a634cf
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
tensor-tang
48be9dc3e1
Merge pull request #14489 from tensor-tang/api/example
...
add api example of brelu, leaky_relu and soft_relu
6 years ago
qingqing01
9eefd2c766
Modify some infer-shape about detection operators in compile-time. ( #14483 )
...
* Modify some infer-shape in compile-time.
6 years ago
tensor-tang
e3645c2708
add api example of brelu, leaky_relu and soft_relu
...
test=develop
6 years ago
peizhilin
ee0fd78c81
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yu Yang
f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
...
Rewrite allocation
6 years ago
qingqing01
fd7e643153
Convolution fusion operator. ( #14449 )
...
* Convolution fusion operator.
* Clean code
test=develop
6 years ago
Yu Yang
98bbfc17be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
dengkaipeng
8ef6280c03
Add operator double support. test=develop
6 years ago
peizhilin
2ceb4ae941
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
a3e952f41d
add the jit back
...
fix compile error on windows
6 years ago
Qiao Longfei
05c15a0867
Merge pull request #14467 from jacquesqiao/update-trainer-retry
...
optimize distribute checkport
6 years ago
Qiao Longfei
fbc529db91
update test=develop
6 years ago
Qiao Longfei
98a0437d70
optimize distribute checkport
...
test=develop
6 years ago
Michal Gallus
def272cf42
MKLDNN elementwise_mul: Revert changes to eltwise_add tests
6 years ago
Michal Gallus
99e3e36a57
MKLDNN elementwise_mul: Disable UT for CUDA
...
test=develop
6 years ago
Michal Gallus
08f63c4d12
MKLDNN elementwise_mul: Lint changes to UT & integration
...
test=develop
6 years ago
Michal Gallus
49b09327f6
MKLDNN elementwise_mul: Reorder on non-nchw input, fallback on non-16 divisable fm
...
test=develop
6 years ago
Michal Gallus
f820573b9c
MKLDNN elementwise_mul: Add UTs
6 years ago
Michal Gallus
ed31936ba1
MKLDNN elementwise_mul: Support NCHW, update UT
6 years ago
Tomasz Patejko
ad09facafe
MKLDNN elementwise_mul: CPU tests initially refactored. MKLDNN mul test for broadcast added
6 years ago
peizhilin
764f97deac
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
8580b7a130
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Qiao Longfei
142b5ed474
Merge pull request #14450 from jacquesqiao/fix-test-label-sentiment
...
change the target cost of test_label_semantic_roles to speed up test
6 years ago
tink2123
853878cbf2
fix the wrong format
...
test=develop
6 years ago
whs
1722678258
Make nce support more distribution. ( #13549 )
...
* Fix truncated normal.
* Fix.
* Make nce support more distribution.
* Fix API.spec.
* Fix python API.
* Fix.
test=develop
* Fix API.spec
test=develop
* Fix sampler.
* Fix order of arguments in python API.
test=develop
6 years ago
Qiao Longfei
09bca67395
add check if the model does not save model
...
test=develop
6 years ago
Qiao Longfei
e4d8f47fcb
change the target cost of test_label_semantic_roles to speed up test
6 years ago
peizhilin
fcbd5a12b8
add create_recordio_file_reader back
6 years ago
Qiyang Min
d971d5b875
Merge pull request #14431 from velconia/fix_expand_op_dim_in_compile_time
...
Fix expand op incorrect infer shape
6 years ago
Wu Yi
b32c13dc20
Add cudnn ctc loss ( #12366 )
...
* add cudnn ctc loss
* wip add test test=develop
* wip
* wip
* done test=develop
* move include cudnn test=develop
* test test=develop
* fix build test=develop
* fix build test=develop
* fix build on cudnn5 test=develop
* fix cudnn5 build test=develop
* fix cudnn5 build test=develop
* merge develop softmax functor change test=develop
6 years ago
peizhilin
d1a1fafc4c
code style
6 years ago
peizhilin
6d0d5a76eb
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Xin Pan
b984c70997
Merge pull request #14439 from wopeizl/windows/online
...
fix typo test=develop
6 years ago
peizhilin
162f2d4109
disable the openblas multi-thread on windows since no support
...
adjust the python script
6 years ago
dengkaipeng
f115eb0d1e
enhance api. test=develop
6 years ago
Yu Yang
c8f6e70ab4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
peizhilin
336c123032
fix typo
...
test=develop
6 years ago
Sylwester Fraczek
8a1eeec579
add mkldnn prop_kind phase for inference-only case to pooling and activations ( #14278 )
...
* add is_test to pooling and activations
add prop_kind support for layers activation. conv and pooling
add a pass that sets is_test to true
add transpiler version of is_test pass
test=develop
* patch test and pass
test=develop
* add pass to analyzer.h
test=develop
* add is_test attr description & pass only on mkldnn
in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc
* fix is_test handling for activation pool and conv
* change description of is_test for all layers again
* remove GetAttr(use_mkldnn) from pass
* rename correct_mkldnn_test_phase to is_test
and remove dependency on MKLDNN
test=develop
* review fix magic number
* two if(..)s into one
* Check is_test once and pass mkldnn forward prop kind
* dereference shared_ptr with * (without get())
test=develop
* add is_test_pass back
test=develop
6 years ago
peizhilin
d1429ac4a5
add recordio support
6 years ago
chengduo
82773477ae
Add selu ( #14415 )
...
* add selu
* use for range
test=develop
* add API
test=develop
* follow comment
test=develop
* update API.spec
test=develop
6 years ago
dengkaipeng
95d5060ddd
fix abs -> fabs error. test=develop
6 years ago
minqiyang
30147d7f58
Fix expand op incorrect infer shape
...
test=develop
6 years ago
dengkaipeng
2faa2b4048
remove cu file. test=develop
6 years ago
Tao Luo
91c6e7a0f4
fix compiler error
...
test=develop
6 years ago
tangwei12
7a64d48f55
fix test_save_load with pickle ( #14410 )
...
* fix test_save_load with pickle
test=develop
* fix test_save_load with pickle
test=develop
* fix test_save_load with pickle
test=develop
6 years ago
dzhwinter
d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
...
Windows/online
6 years ago
peizhilin
2202f22429
Merge branch 'windows/build' into windows/online
...
test=develop
6 years ago
peizhilin
be332a13bc
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Xin Pan
8cfda7ee0c
Merge pull request #14382 from panyx0718/fix4
...
Refine the pass builder and buildstrategy
6 years ago
Tao Luo
c7b3bfcdf1
Merge pull request #14376 from baojun-nervana/intel/ngraph_fusedop
...
Adding fused operator for ngraph
6 years ago
peizhilin
228e1934b8
code style fix
...
test=develop
6 years ago
peizhilin
e2a1cd19f1
code style fix
...
test=develop
6 years ago
Xin Pan
bae3659714
more test
...
test=develop
6 years ago
Qiao Longfei
f8d5775564
Merge pull request #14400 from jacquesqiao/fix-prelu
...
fix prelu
6 years ago
JiabinYang
99d1446a8b
test=develop
6 years ago
peizhilin
30ddc07a7e
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Qiao Longfei
e65cbd3b06
Merge pull request #14387 from jacquesqiao/lookup_sparse_table_add_test_mode
...
Lookup sparse table add test mode
6 years ago
Tao Luo
5b9c62faee
Revert "Softmax op optimization for inference "
6 years ago
Qiao Longfei
8e0616ebee
fix prelu
...
test=develop
6 years ago
Tao Luo
6490bb2765
Merge pull request #14337 from jczaja/prv-dam-softmax
...
Softmax op optimization for inference
6 years ago
dengkaipeng
a0284f6fbc
Add backward CPU kernel. test=develop
6 years ago
Xin Pan
44ecf9a481
fix
...
test=develop
6 years ago
Qiao Longfei
bad0c27e6e
add test_lookup_sparse_table_op
6 years ago
Shan Yi
38f499df7d
Merge pull request #14341 from tink2123/fix_example_edit_distance
...
fix the nn.py example
6 years ago
JiabinYang
30332ad91d
test=develop
6 years ago
JiabinYang
f4be1d99d0
polish code and test
6 years ago
Yibing Liu
6c7b64cc20
Support softmax return in softmax_with_cross_entropy ( #14367 )
...
* Support softmax return in softmax_with_cross_entropy
* Add test for return_softmax=False
test=develop
6 years ago
baojun-nervana
ea3538d8dd
Added fused operator
...
test=develop
6 years ago
Xin Pan
99dffb91d6
allow to repeatedly share and update BuildStrategy
...
test=develop
6 years ago
li099
df826de76e
revise tensor array to tensor op ( #14368 )
...
test=develop
6 years ago
ruri
4a55fb5f5b
Add density_prior_box_op ( #14226 )
...
Density prior box operator for image detection model.
6 years ago
dengkaipeng
36c46152e1
Add unittest for yolov3_loss. test=develop
6 years ago
JiabinYang
b8ff0972b6
test=develop
6 years ago
JiabinYang
32e05b01f2
test=develop
6 years ago
Yu Yang
02631965c8
Refine
6 years ago
Qiao Longfei
c27554ac33
Merge pull request #14336 from jacquesqiao/add_bilinear_tensor_product_layer
...
add bilinear_tensor_product layer
6 years ago
Yu Yang
4d546f6083
fix(pe): fix ut of dry run parallel executor ( #14359 )
...
test=develop
6 years ago
peizhilin
61fa5218b9
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Tao Luo
9d4425dd1c
Merge pull request #14227 from baojun-nervana/intel/ngraph_cmake
...
Adding Intel nGraph build
6 years ago
peizhilin
7840d181c9
fix style issue
6 years ago
sneaxiy
d231e55065
merge develop
...
test=develop
6 years ago
peizhilin
ef8a7db81e
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Qiao Longfei
4d6f75152e
optimize comment test=develop
6 years ago
Qiao Longfei
080740b340
Merge pull request #14300 from jacquesqiao/dist-table-support-optimizer-regular
...
dist table support other optimize and regular config
6 years ago
Qiao Longfei
2d9859984d
Merge pull request #14353 from jacquesqiao/reduce-pass-num-of-test_label_semantic_roles
...
reduce pass num of test_label_semantic_roles to avoid test timeout
6 years ago
Qiao Longfei
04da1dcfb8
optimize import
...
test=develop
6 years ago
Qiao Longfei
45eebf69e8
reduce pass num of test_label_semantic_roles to avoid test timeout
...
test=develop
6 years ago
Qiao Longfei
d03cbd1b8c
follow comment test=develop
6 years ago
JiabinYang
c8801e100f
grad diff problem to be fixed and need api spec change to be done
6 years ago
Jacek Czaja
d332326847
- Added unit tests for softmax is_test=True op
...
test=develop
6 years ago
peizhilin
d01a26280e
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Xin Pan
ff28b1ffc0
Merge pull request #14071 from barrierye/add_similarity_focus_op
...
Add similarity focus op
6 years ago
li099
688ed60116
Add lod tensor array to tensor op ( #13990 )
...
* add lod tensor array concat
* add lod tensor array concat
* test=develop
* add lod tensor array concat
test=develop
* Fix API.spec
test=develop
* add lod tensor array concat
test=develop
* revise some bug of lod tensor array concat
test=develop
* add unittest for tensor array concat
test=develop
* change to tensor array to tensor
test=develop
* revise bug
test=develop
* revise a bug
test=develop
* revise a bug
test=develop
* revise a bug of python3
test=develop
6 years ago
peizhilin
81476ff3cf
fix api check
6 years ago
peizhilin
e768c370e8
fix api check
6 years ago
peizhilin
1b0ce151df
fix API check issue
6 years ago
peizhilin
52f7644f53
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01
abe209234f
Exhaustive search for cuDNN conv. ( #14286 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
* Fix compiling test=develop
6 years ago
Yu Yang
b59a9bfb7c
Clean buffered_allocator
...
test=develop
6 years ago
Kaipeng Deng
f215534ecf
Merge pull request #14205 from heavengate/nearest_interp
...
Add interpolate operator replace bilinear_interp_op and add nearest neighbor interp mode
6 years ago
Qiao Longfei
e33bf70a23
update comment test=develop
6 years ago
tink2123
1fca1a395b
fix the nn.py example
...
test=develop
6 years ago
Yu Yang
fdc689142c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
peizhilin
e8519a6e89
use the ext_name instead of specific extension name
6 years ago
Qiao Longfei
319618e980
optimize comment, add unit test test=develop
6 years ago
Qiao Longfei
53781fc000
fix some bug
6 years ago
Qiao Longfei
db27c5612d
add comment
6 years ago
Qiao Longfei
6097b8b365
add bilinear_tensor_product layer
6 years ago
Xin Pan
9735e3016a
fix test
...
the build strategy is finalized after create_passes. So future
change of build strategy has no effects.
test=develop
6 years ago
peizhilin
4ffa92d4f0
Merge branch 'develop' into windows/build
6 years ago
chengduo
f8b2680c53
fix test_conv2d ( #14330 )
...
test=develop
6 years ago
chengduo
c5b6573a5a
Fix input<tensor> ( #14208 )
...
* fix input<tensor>
test=develop
* fix split_ids
test=develop
* ElementwiseMul should not support SelectedRows
* fix scale op
test=develop
* change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
* fix operator
* refine MultiOutput
* fix MultiOutput
test=develop
* disable test_dist_save_load
test=develop
* fix elementwise_op
test=develop
* add get_sparse_as_op
test=develop
* add info for check
test=develop
* rename get_sparse_as_op with extract_rows_as_op.
test=develop
* elementwise doesn't support selected_rows
* fix regularizer
* remove extract_rows_as
test=develop
* fix ci
test=develop
* add test for sum_op
* fix regularizer
test=develop
* test=develop
* fix pserver weight decay multi inputs test=develop
6 years ago
Wu Yi
0627ee83cc
Merge pull request #14314 from typhoonzero/fix_pserver_weight_decay_multi_inputs
...
fix pserver weight decay multi inputs
6 years ago
Xin Pan
b03a44e062
Merge pull request #14026 from JiabinYang/add_reorg_op
...
Add reorg op
6 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
6 years ago
dzhwinter
d8425e0fa6
Merge pull request #14215 from dzhwinter/fix/sequence_pad
...
"fix comment"
6 years ago
Qiao Longfei
373f64986d
add comment and unit test
...
test=develop
6 years ago
typhoonzero
f3eafec19d
fix pserver weight decay multi inputs test=develop
6 years ago
Qiao Longfei
67050468e1
optimize code
...
test=develop
6 years ago
Qiao Longfei
ffd5a832d8
fix code style
6 years ago
Qiao Longfei
55edfca2b8
revert unused change
6 years ago
Qiao Longfei
fec0b192a2
fix unit test
...
test=develop
6 years ago
Qiao Longfei
a20f8e2966
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dist-table-support-optimizer-regular
6 years ago
Qiao Longfei
ce994190ab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
...
test=develop
6 years ago
chengduo
1001f8e1db
Add is_compiled_with_cuda for parallel_exe_crf ( #14304 )
...
test=develop
6 years ago
Qiao Longfei
d9dc81a6c6
fix dist transpiler test
...
test=develop
6 years ago
Qiao Longfei
b07ebb379b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dist-table-support-optimizer-regular
6 years ago
dengkaipeng
8b47d90f5d
add 'actual_shape' attribute. test=develop
6 years ago
Qiao Longfei
3d8077e9fb
update optimizer
6 years ago
Yu Yang
2466ca13ec
test(Pe): remove unittests for recordio in test_pe_mnist ( #14262 )
...
recordio is not the official API in Fluid 1.0. Remove unittests for it.
test=develop
6 years ago
Yu Yang
03992630b5
fix(py): set `cwd` when get commit sha in setup.py ( #14299 )
...
`cwd` was not set before when get commit SHA. The default `cwd` is the current build directory. However, the build directory might not be the subdirectory of source. The `git` command will fail when that happened.
test=develop
6 years ago
Qiao Longfei
fbcdb29d8c
fix import issue
6 years ago
Qiao Longfei
866d6bfe59
dist table support other optimize and regular config
6 years ago
Yu Yang
057a682ee9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Yu Yang
c28beb8a3c
test(Pe): add dry run tests for pe ( #14254 )
...
Dry run tests will skip `Op.Run` and just perform job scheduling. It helps to analysis dead lock in PE.
test=develop
6 years ago
Qiao Longfei
e0c8397426
Merge pull request #14257 from jacquesqiao/optimize-pserver-profiler-thread-pool
...
clean rpc server profiler
6 years ago
barrierye
ef8218be22
update docs test=develop
6 years ago
Shan Yi
ea8984c924
Merge pull request #14267 from PaddlePaddle/fix_recordio_link
...
fix_recordio_internal_link
6 years ago
tangwei12
e564eb341f
Fix mkdir conflict in save_inference_model ( #14285 )
...
* fix mkdir conflict
test=develop
6 years ago
chengduo
a9b5d42dd4
Add fp16 backward support ( #14202 )
...
* add fp16 backward support
test=develop
* add sum_op fp16 test
* disable test_dist_save_load
test=develop
* add check_grad for sum
* add unit test for softmax_grad fp16
test=develop
* add scale_op unit test
* add mul_grad_op unit test for fp16
* add cross_entropy_grad and eman_grad unit test for fp16
test=develop
* fix cross_entropy unit test
* add pool2d fp16 unit test
* refine conv2d fp16 unit test
test=develop
* refine activation unit test
test=develop
* fix ci
test=develop
* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop
6 years ago
Qiao Longfei
2921f8a79c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
qingqing01
db8c52da5e
Revert " Exhaustive search for cuDNN conv. ( #14043 )"
...
This reverts commit ce7d9b0799
.
6 years ago
qingqing01
ce7d9b0799
Exhaustive search for cuDNN conv. ( #14043 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
6 years ago
Sang Ik Lee
f30c1ddb45
Include nGraph build.
...
test=develop
6 years ago
chengduo
45bad7626a
open test_parallel_executor_crf ( #14255 )
...
test=develop
6 years ago
peizhilin
a37918c31f
fix python package issue
6 years ago
Zeng Jinle
8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
...
Remove some locks in ParallelExecutor
6 years ago
Wu Yi
97f7b181d4
Merge pull request #14264 from typhoonzero/dist_test_serial
...
run dist tests in serial
6 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
6 years ago
Wu Yi
8fc05e0373
fix cpu build test=develop ( #14260 )
6 years ago
Shan Yi
1fb1a0bc6b
fix_recordio_internal_link
...
test=develop
6 years ago
typhoonzero
93c689aa96
run dist tests in serial test=develop
6 years ago
sneaxiy
faac8a76ce
remove unnecessary codes
...
test=develop
6 years ago
Qiao Longfei
59fbfbfbf7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
...
test=develop
6 years ago
Qiao Longfei
9e4e9e9b6e
clean rpc server profiler
6 years ago
Zeng Jinle
8d930195d9
Merge pull request #14238 from sneaxiy/fix_read_lod_level_bug
...
Fix lod_level share bug in read_op
6 years ago
Wu Yi
306236c2c0
feature/DC asgd ( #12722 )
...
* wip
* add ref_by_trainer_id op
* ready to test
* fix ref inputs
* refine rpc_op_handle
* fix merge bug
6 years ago
dengkaipeng
fef2faa709
limit CUDA kernel parallel threads max number to 4096. test=develop
6 years ago
peizhilin
71d7980f69
fix build issue 1
6 years ago
barrierye
19b68de79f
submit again test=develop
6 years ago
dengkaipeng
34bfae243a
Add Interpolate operation. test=develop
6 years ago
sneaxiy
46d4829dd1
fix lod_level share bug in read_op
...
test=develop
6 years ago
peizhilin
9d67c1fb69
cpu build support
6 years ago
barrierye
5e7bb6a9bd
update docs test=develop
6 years ago
dongzhihong
d4e8d7077f
Merge remote-tracking branch 'origin/develop' into fix/sequence_pad
...
test=develop
6 years ago
barrierye
316e020a11
Submit PR again test=develop
6 years ago
barrierye
162cf75c88
Submit PR again test=develop
6 years ago
Zeng Jinle
df038b6149
Merge pull request #14222 from sneaxiy/fix_selected_rows_clip_bug
...
Fix selected_rows clip bug
6 years ago
Kaipeng Deng
daed473d4a
Merge pull request #14089 from heavengate/pool_exclude
...
add inclusive/exclusive mode in avg pool
6 years ago
Kaipeng Deng
64f3e3ed8f
Merge pull request #14069 from heavengate/grid_sampler
...
Grid sampler operator for spatial transformer network.
6 years ago
Qiyang Min
8690deb0e4
Merge pull request #14101 from velconia/polish_op_comment
...
Polish hash op comment
6 years ago
minqiyang
4d06d1d7ac
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into polish_op_comment
...
test=develop
6 years ago
minqiyang
db5a7e0d9b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_version
...
test=develop
6 years ago
Zeng Jinle
b316437a50
Merge pull request #14087 from sneaxiy/add_use_cudnn_in_softmax_with_xe
...
Add numeric_stable_mode parameters to softmax_with_xe op
6 years ago
Zeng Jinle
fcf370e621
Merge pull request #13773 from sneaxiy/seq_executor
...
Enable sequential execution mode in parallel executor
6 years ago
chengduo
61b4812f2f
Remove unnecessary var_and_op of DynamicRnn ( #14134 )
...
* remove unnecessary var_and_op
test=develop
* fix _init_zero_idx_
test=develop
6 years ago
sneaxiy
55befbaa2a
fix selected_rows clip bug
...
test=develop
6 years ago
sneaxiy
bbc818a5a1
test=develop
6 years ago
sneaxiy
366ebb93f7
test=develop
6 years ago
chengduo
57c90e95ae
disable test_dist_save_load ( #14220 )
...
test=develop
6 years ago
sneaxiy
203027ca86
test=develop
6 years ago
dengkaipeng
decaeb1c6d
fix style check after conflicts check. test=develop
6 years ago
Jiabin Yang
9f65b616b2
Merge branch 'develop' into add_reorg_op
6 years ago
dongzhihong
7f3c6ea411
"fix comment"
6 years ago
Wu Yi
91b2851cdc
enable pyreader use pin memory ( #14066 )
...
* enable pyreader use pin memory
* add py reader pin memory test test=develop
6 years ago
Kaipeng Deng
0b29078201
Merge branch 'develop' into grid_sampler
6 years ago
whs
0c319e0b35
Add affine grid generator op ( #12238 )
...
* Add affine grid generator.
* fix ffine grid.
* Add unitest.
* Add CPU kernel and fix unitest.
* Fix CPU kernel.
* Refine code.
test=develop
* Fix python api.
test=develop
* Update python api.
test=develop
* Fix comment.
test=develop
* Rename affine_grid_generator to affine_grid and enhence unitest.
test=develop
* Fix unitest.
test=develop
6 years ago
tangwei12
d325e668b8
[1.1] Load vars on PSERVER ( #14037 )
...
* fix dim0 in _load_slice_up_vars
* fix dim0 in _load_slice_up_vars, fix innershape in delete_var_op
* Revert "fix lookuptable in reduce strategy"
This reverts commit 0e722c5
* add unit test for dist
* add unit test for dist, test=develop
* cancel revert, test=develop
6 years ago
Tao Luo
a8532f1a67
Merge pull request #13918 from guoshengCS/fix-transpose-doc
...
Fix the example in the doc of transpose_op
6 years ago
Tao Luo
a2bee7b41b
Merge pull request #13964 from guoshengCS/refine-gru-doc
...
Refine the doc of dynamic_gru and gru_unit.
6 years ago
dengkaipeng
e99da0b583
api change: create_variable_for_type_inference. test=develop
6 years ago
Tao Luo
a7a3544f72
Merge pull request #14181 from sfraczek/sfraczek/transpiler-depthwise_conv-mkldnn-pass
...
added transpiler pass for mkldnn depthwise_conv
6 years ago
dengkaipeng
df4a3544aa
nearest neighbor interp add cuda kernel. test=develop
6 years ago
Wu Yi
d51daede93
add ftrl support for dist train test=develop ( #14176 )
6 years ago
Qiao Longfei
f37bd03529
Merge pull request #14153 from jacquesqiao/fix-pserver-crash-when-no-parameter
...
set en empty optimize block if pserver has no optimize block
6 years ago
Qiao Longfei
7825ae9c60
Merge pull request #14190 from jacquesqiao/dist-table-support-multi-table
...
Dist table support multi table
6 years ago
JiabinYang
45565784bf
test=develop
6 years ago
Qiao Longfei
f3bbd3b43a
code style format
...
test=develop
6 years ago
dengkaipeng
9755611938
add unittest for nearest_neighbor_interp_op
6 years ago
minqiyang
d638d1cd80
Fix paddle version
...
test=develop
6 years ago
Qiao Longfei
2d461cb080
code style format
6 years ago
Qiao Longfei
add4b466d8
dist table only handle is_distributed table
6 years ago
JiabinYang
8d3c3e048b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Qiao Longfei
d78e8f23a6
code format
...
test=develop
6 years ago
Xin Pan
5cc99c476c
Merge pull request #14154 from gmcather/seq_pad_example
...
fix sequence_pad example error
6 years ago
Wu Yi
d186e7434e
Refine dist ut ( #14118 )
...
* fix use_reader_alloc uts
* dist ut fixes test=develop
* update test=develop
* fix test for py3 test=develop
6 years ago
Zeng Jinle
da73bc39d3
Merge pull request #14143 from sneaxiy/fix_dynamic_gru_bug
...
Fix dynamic_gru h_0 bug
6 years ago
Qiyang Min
bfd2e3beca
Merge pull request #14158 from velconia/fix_mac_py3
...
Fix Mac Python3 CI job
6 years ago
Sylwester Fraczek
ebd1d753ed
added transpiler pass for mkldnn depthwise_conv
...
test=develop
6 years ago
Tao Luo
cdf2579d08
Merge pull request #14053 from jczaja/prv-seqpool-max
...
Max Sequence pool optimization
6 years ago
Kaipeng Deng
a3b26e8528
Merge branch 'develop' into grid_sampler
6 years ago
Qiao Longfei
ba8bbe159b
add test pserver run empty block into test_listen_and_serv_op
6 years ago
Qiao Longfei
f2a205c2f5
add test_pserver_run_empty_optimize_block
6 years ago
Qiao Longfei
bf9764898d
add TestEmptyPserverOptimizeBlocks
6 years ago
minqiyang
5038f623b7
Polish code
...
test=develop
6 years ago
minqiyang
8230b74281
Polish code
...
test=develop
6 years ago
Qiao Longfei
11b5c448dc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-pserver-crash-when-no-parameter
6 years ago
minqiyang
159b0eb7e3
Remove random fail ut
...
test=develop
6 years ago
minqiyang
cc752f1af4
Remove dist_test from CMakeFiles
...
test=develop
6 years ago
Qiao Longfei
a11d4f300e
use len instead of size for python list
6 years ago
Xin Pan
b2ab293c47
increase test timeout coverage.
6 years ago
barrierye
0e60bb3c4f
Submit PR again test=develop
6 years ago
Xin Pan
eb7ed1b720
Merge pull request #13897 from gmcather/develop
...
1.add position encoding 2.logloss in nn.py
6 years ago
minqiyang
59420d5bd2
Polish code
...
test=develop
6 years ago
minqiyang
2cc939bbfa
Fix Mac Python3 CI job
...
test=develop
6 years ago
barrierye
fc23cc9d30
update paddle/fluid/API.spec
...
test=develop
6 years ago
gmcather
1a98e0a44f
fix sequence_pad example error
...
test=develop
6 years ago
Qiao Longfei
fa84ba2350
set en empty optimize block if pserver has no optimize block
6 years ago
dzhwinter
e74267ae19
"fix comp bug. test=develop" ( #14104 )
6 years ago
Xin Pan
eb37ed4c16
Merge pull request #14141 from JiabinYang/fix_inference_model_latest
...
Fix inference model not found on Mac CI
6 years ago
gmcather
ba22624d7e
position encoding && log loss
...
test=develop
6 years ago
sneaxiy
7bb1178ea6
test=develop
6 years ago
JiabinYang
7c45e77c41
test=develop
6 years ago
qingqing01
cb27a9219d
Merge pull request #13971 from sefira/FasterOpDoc
...
generate proposal labels doc
6 years ago
sneaxiy
5e5d2223a1
test=develop
6 years ago
Xin Pan
b3b329255f
Merge pull request #14123 from shippingwang/fix_import_plot_py3
...
Fix import plot in py3
6 years ago
barrierye
5f3acac9b3
update paddle/fluid/API.spec
...
test=develop
6 years ago
Xin Pan
eabb75f6b3
Merge pull request #14055 from dzhwinter/fix/mem_opt
...
OrderSet default is mutableSet, not set.
6 years ago
root
99302a7075
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_import_plot_py3
6 years ago
root
5a220dc218
Fix python3 utils plot
6 years ago
Jacek Czaja
458b16f42a
Rebase of seqpool-max optimization
...
test=develop
- Added rough profiling
- Profiled maxpool itself
- First draft of max seqpool optimization (is_test added)
- Added unit tests to seqpool
- Cosmetic fixes
- Fix to UT of Seq pool
Disabled grad checking for sequence max pool when is_test is set to True
-Cosmetic fix to comment
test=develop
- Fix to GPU build
test=develop
- yet another GPU fix for sequence max pool
- Fix to comment
test=develop
- Change to API of sequence_pool
test=develop
- Yet another API spec change
test=develop
6 years ago
dengkaipeng
ff6329bd5f
fix some inappropriate expressions in api doc for grid_sampler. test=develop
6 years ago
dengkaipeng
8f1e398824
move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop
6 years ago
dengkaipeng
593e1b18d7
fix some bugs and add some doc for GridSampleOp
6 years ago
dengkaipeng
0bb0e0c10f
add Grid Sampler Operator for STN.
6 years ago
Yu Yang
c01696f8c2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
6 years ago
barrierye
99707b281d
change / to // to fit py3
6 years ago
Qiao Longfei
d26ff8cb2d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
JiabinYang
e0a89503f8
test=develop
6 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
...
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
6 years ago
barrierye
8c1e304307
merge nn.py
6 years ago
dengkaipeng
c93e044ae0
add inclusive/exclusive mode in PoolOp avg pool type
6 years ago
JiabinYang
9a74c4489f
test=develop
6 years ago
barrierye
9dc28179a4
add similarity_focus op
6 years ago
Qiao Longfei
7cd2417fe2
Merge branch 'develop' into cpu-for-1.1-merge-with-shape
...
test=develop
6 years ago
dongzhihong
755c04df6e
rerun ci. test=develop
6 years ago
dongzhihong
c2bc700157
Merge remote-tracking branch 'origin/develop' into fix/mem_opt
6 years ago
Zeng Jinle
aa6b2bda2c
Merge pull request #14094 from sneaxiy/fix_metrics_api_bug
...
Fix EditDistanceMetrics API bug
6 years ago