Qiao Longfei
6449faec37
Merge pull request #14259 from jacquesqiao/optimize-thread-pool
...
Optimize thread pool
6 years ago
sneaxiy
9518bc8d0a
delete buggy selected_rows functor
...
test=develop
6 years ago
chengduo
a9b5d42dd4
Add fp16 backward support ( #14202 )
...
* add fp16 backward support
test=develop
* add sum_op fp16 test
* disable test_dist_save_load
test=develop
* add check_grad for sum
* add unit test for softmax_grad fp16
test=develop
* add scale_op unit test
* add mul_grad_op unit test for fp16
* add cross_entropy_grad and eman_grad unit test for fp16
test=develop
* fix cross_entropy unit test
* add pool2d fp16 unit test
* refine conv2d fp16 unit test
test=develop
* refine activation unit test
test=develop
* fix ci
test=develop
* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop
6 years ago
Qiao Longfei
3b8dd9ebbd
optimize code test=develop
6 years ago
Tao Luo
2b791f1f63
unify analyzer_face_tester to analyzer_resnet50_tester
...
test=develop
6 years ago
Qiao Longfei
2921f8a79c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pserver-profiler-thread-pool
6 years ago
Tao Luo
1ead9318d5
remove unused code in test_helper.h to pass ci
...
test=develop
6 years ago
Qiao Longfei
4062f00f2a
optimize thread pool code
...
test=develop
6 years ago
dzhwinter
2835e04409
merge develop branch. test=develop
6 years ago
dzhwinter
deb4af70ef
add test
6 years ago
Xin Pan
0953cd3e16
Merge pull request #14284 from PaddlePaddle/revert-14043-conv_cudnn_cache
...
Revert " Exhaustive search for cuDNN conv."
6 years ago
qingqing01
db8c52da5e
Revert " Exhaustive search for cuDNN conv. ( #14043 )"
...
This reverts commit ce7d9b0799
.
6 years ago
qingqing01
ce7d9b0799
Exhaustive search for cuDNN conv. ( #14043 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
6 years ago
tensor-tang
cb4083b9fa
fix compile error
...
test=develop
6 years ago
tensor-tang
dd343a4971
Merge remote-tracking branch 'ups/develop' into fea/jit/vadd
6 years ago
Zeng Jinle
fcbe84cb50
Merge pull request #14270 from sneaxiy/fix_rmsprop_enforce_bug
...
Fix rmsprop_op enforce bug
6 years ago
Tao Luo
7a2887d212
add analyzer_face_tester
...
test=develop
6 years ago
Tao Luo
2ec65ae0db
download face_model in CMakeLists.txt
...
test=develop
6 years ago
Tao Luo
2f9a5a2e0a
add analyzer_face_tester
6 years ago
chengduo
45bad7626a
open test_parallel_executor_crf ( #14255 )
...
test=develop
6 years ago
Xin Pan
cb2d33a851
resolve conflict
...
test=develop
6 years ago
nhzlx
5700fafd0f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
...
test=develop
6 years ago
nhzlx
86b99ac953
fix comments and fix bug
6 years ago
tensor-tang
e6cfdf6c74
Merge pull request #14274 from tensor-tang/fix/jit
...
fix jit on mac
6 years ago
peizhilin
a37918c31f
fix python package issue
6 years ago
Xin Pan
25123a3b7e
add tests
...
test=develop
6 years ago
Xin Pan
8c11d3fed6
clean up
6 years ago
Xin Pan
0a89650507
fix more tests
...
test=develop
6 years ago
Xin Pan
a3b27e3237
fix
...
test=develop
6 years ago
Xin Pan
f25eb9a71d
fix some tests.
...
test=develop
6 years ago
Xin Pan
adf5615e54
clean kGraphOp
...
test=develop
6 years ago
Xin Pan
fb576cb5cb
allow to compare type
...
test=develop
6 years ago
Xin Pan
ead94bfc6c
fix destructor
...
test=develop
6 years ago
Xin Pan
2e14999942
clean1
...
test=develop
6 years ago
Xin Pan
34b401fc6c
clean up a global graph attr.
6 years ago
Zeng Jinle
8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
...
Remove some locks in ParallelExecutor
6 years ago
Zeng Jinle
02d757dafc
Merge pull request #14268 from sneaxiy/stream_callback_support_in_cuda10
...
Stream Callback Support in CUDA 10
6 years ago
Wu Yi
97f7b181d4
Merge pull request #14264 from typhoonzero/dist_test_serial
...
run dist tests in serial
6 years ago
tensor-tang
b81e1b655e
fix jit on mac
...
test=develop
6 years ago
sneaxiy
11f032a82e
fix rmsprop_op enforce bug
...
test=develop
6 years ago
tensor-tang
b68ececb73
add vaddrelu jitcode
...
test=develop
6 years ago
sneaxiy
8684553633
stream callback support in cuda 10
...
test=develop
6 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
6 years ago
Wu Yi
8fc05e0373
fix cpu build test=develop ( #14260 )
6 years ago
Shan Yi
1fb1a0bc6b
fix_recordio_internal_link
...
test=develop
6 years ago
Zhen Wang
4dbc01841d
Nlp dam ( #14248 )
...
* add dam test
* update fuse_statis
* use separated dam model.
* Revert "use separated dam model."
This reverts commit 13e775c86f909b164b7cc1d35a8a24b964ec622e.
* test=develop
* modify the cmake file about infer test, test=develop.
* remove one comment, test=develop.
6 years ago
tensor-tang
bb09e31020
add vadd jitcode
...
test=develop
6 years ago
typhoonzero
93c689aa96
run dist tests in serial test=develop
6 years ago
sneaxiy
faac8a76ce
remove unnecessary codes
...
test=develop
6 years ago
Yu Yang
ff9e531bd9
style(platform): disable warning when cuda cc not matched ( #14029 )
...
Warning only at first when CUDA CC not matched.
test=develop
6 years ago