Tao Luo
e3dd6970fc
disable dam temporarily ( #15860 )
...
test=develop
6 years ago
Xin Pan
26e32e095a
allow compiler to use graph
...
test=develop
6 years ago
Sylwester Fraczek
1943119fc5
fix typo memeroy->memory
...
test=develop
6 years ago
Sylwester Fraczek
0b926114c0
add override to ApplyImpl
...
and #pragma once in edited headers
add #include<string> in edited headers
test=develop
6 years ago
minqiyang
8fe0c0c52c
implement backward refs
6 years ago
Sylwester Fraczek
8bc604571f
fix typo seriazlized->serialized
6 years ago
Sylwester Fraczek
543e53db05
fix typo releated->related
6 years ago
Krzysztof Binias
1578c60bdd
Add new ut and remove unnecessary code
...
test=develop
6 years ago
Dun Liang
35a90e06bf
test=develop
6 years ago
Dun Liang
c9080f516b
test=develop
6 years ago
Dun Liang
1c7bb0e40c
test=develop
6 years ago
Xin Pan
5eb87506bc
add per kernel config and remove const_cast.
...
test=develop
6 years ago
Qiao Longfei
62f1248ff5
fix use gpu test=develop
6 years ago
xuezhong
a7e7d95203
Merge pull request #15630 from xuezhong/fix_bug_adagrad2
...
add initial_accumulator_value for adagrad
6 years ago
Xin Pan
6019054cdd
Merge pull request #15716 from Yancey1989/refine_pg
...
Refine ParallelGraph Execution
6 years ago
dzhwinter
98ec579d79
Merge pull request #15786 from liuwei1031/develop
...
fix comments of PR 15529, test=develop
6 years ago
Dun
a83e470405
Profiler refine and add CUDA runtime api tracer ( #15301 )
...
* refine profiler && add runtime tracer
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* fix bug && test=develop
* add thread id map && test=develop
* test=develop
* testing
* bug fix
* remove cuda event && refine code && test=develop
* test=develop
* test=develop
* test=develop
* fix windows temp file && test=develop
* test=develop
* fix windows bug && test=develop
* fix start up issue && test=develop
* code polish && test=develop
* remove unused code && test=develop
* add some cupti cbid && test=develop
* add FLAGS_multiple_of_cupti_buffer_size && test=develop
* fix compile error && test=develop
* add keyword && test=develop
* fix && test=develop
* code polish && test=develop
6 years ago
minqiyang
74551758cc
Polish code
...
test=develop
6 years ago
Zeng Jinle
c420375185
Merge pull request #15806 from sneaxiy/fix-compiler
...
Fix compiler.py place compare bug
6 years ago
minqiyang
f53e1d5c4b
implement ClearBlock
6 years ago
mozga-intel
13ec2d331b
Enable momentum operator for a ngraph engine ( #15673 )
...
* Enable momentum operator for a ngraph engine
test=develop
* Update tests
test=develop
* Unnecessary line of the code as intended was removed
test=develop
6 years ago
Yiqun Liu
e38dd91f04
Refine cmake's download function. ( #15512 )
...
* Refine cmake's download function.
test=develop
* Set DOWNLOAD_NO_EXTRACT to 1 pure download function.
test=develop
* Fix unpack problem in ExternalProject_Add, and it seem DOWNLOAD_NO_EXTRACT option is not support in cmake-3.5.
test=develop
6 years ago
xuezhong
eb7bc3e7ea
remove non-ascii charactor
...
test=develop
6 years ago
Tao Luo
50ffed27f6
Merge pull request #15813 from luotao1/legacy_any
...
remove legacy any.cmake
6 years ago
Tao Luo
b28afef73e
Merge pull request #15801 from luotao1/legacy_option
...
remove Legacy compiler option
6 years ago
sneaxiy
d331e97af8
fix compiler place compare
...
test=develop
6 years ago
Tao Luo
c797a1f050
remove legacy any.cmake
6 years ago
dzhwinter
f2e8409f5a
Merge pull request #15795 from dzhwinter/fix/block_desc
...
fix bug when op node has no block pointer
6 years ago
Tao Luo
bd2fa73620
Merge pull request #15794 from sneaxiy/fix-warnings
...
Fix compile warning
6 years ago
Tao Luo
b9d1bf2364
remove leacy WITH_FLUID_ONLY option
6 years ago
xuezhong
6b83845c41
update for backward compatibility
...
test=develop
6 years ago
tensor-tang
e1c707fe9c
fix warnings ( #15790 )
...
* fix warnings
test=develop
* fix enforce test
test=develop
6 years ago
xuezhong
d328660304
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
xuezhong
57294fa890
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_adagrad
...
test=develop
6 years ago
xuezhong
794b90c93f
for backward compatibility
6 years ago
xuezhong
f2262d7336
update comment
...
test=develop
6 years ago
Tao Luo
6402424f7a
Merge pull request #15773 from chengduoZH/fix_shape_api_doc
...
Fix shape api doc
6 years ago
xuezhong
ef44f1b81d
update api spec
...
test=develop
6 years ago
xuezhong
d12252e6a6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
...
test=develop
6 years ago
sneaxiy
9b8e0e2f17
fix enforce_test
...
test=develop
6 years ago
xuezhong
c5360a3f6b
refine code
6 years ago
Yancey1989
4b193db14c
polish code test=develop
6 years ago
dzhwinter
6deb17ed8c
fix default value. test=develop
6 years ago
dzhwinter
089d262c41
fix default value. test=develop
6 years ago
dzhwinter
9c92d0304f
fix default value. test=develop
6 years ago
sneaxiy
209b355762
fix many warning
...
test=develop
6 years ago
Yancey1989
d5090c892d
polish code test=develop
6 years ago
tensor-tang
5aea2cd2e0
Merge pull request #15652 from tensor-tang/refine/pyramiddnn
...
refine fused emb seq pool
6 years ago
liuwei1031
4c7b6e2e67
fix comment, test=develop
6 years ago
Zeng Jinle
adea672b0e
Merge pull request #15783 from sneaxiy/develop-fix-api-arg0
...
Fix pybind api arg0
6 years ago
mozga-intel
df23a6f894
Enable cross_entropy operator for a ngraph engine ( #15674 )
...
* Enable cross_entropy operator for a ngraph engine
test=develop
* Update tests
test=develop
* Added PADDLE_ENFORCE for the batch_norm operator
test=develop
* Update the message about which format are supported right now
test=develop
6 years ago
liuwei1031
b20a21e299
fix comments of PR 15529, test=develop
6 years ago
Yiqun Liu
56a5039e24
Correct the doc in Python API ( #15725 )
...
* Correct the comment in control_flow.py.
* Correct the argument list of ops.
test=develop
* Update API.spec.
test=develop
* Skip op_callstack attr for all op apis.
test=develop
* Remove use_mkldnn and is_test from python api.
test=develop
* Remove use_mkldnn and is_test from op_proto_maker and hard-coding them in python when generating doc string.
test=develop
6 years ago
sneaxiy
e6ff549849
small fix doc
...
test=release/1.3
6 years ago
sneaxiy
796e221efc
fix api arg0
...
test=release/1.3
6 years ago
baojun
72061b0ac0
Add ngraph op coverage ( #15721 )
6 years ago
chengduozh
d79d2f686c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_api_doc
...
test=develop
6 years ago
dzhwinter
28609b3435
Merge pull request #15696 from dzhwinter/cherry-pick/memory
...
cherry picked modifies.
6 years ago
xuezhong
4424021623
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
Yihua Xu
685a20ef56
Add JIT CRF_decoding and Layer_norm unit-test ( #15699 )
...
* Add the CRFDecoding and LayerNorm's test case
test=develop
* Fix the size checking issue
test=develop
* Remove the remnant code
test=develop
* Add TestAllImpls and double support
test=develop
* Clean Code
test=develop
* Add benchmark test for LayerNorm & CRFDecoding
test=develop
6 years ago
tensor-tang
75fc792d40
fix when table width larger than 64
...
test=develop
6 years ago
tensor-tang
40402d5e68
add emb seqpool jitcode
...
test=develop
6 years ago
tensor-tang
2ccbcb157d
Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
6 years ago
chengduozh
3ce12b1b8e
fix shape api doc
...
test=develop
6 years ago
minqiyang
52e5ee60bd
Add debug info
6 years ago
Dun
5e6834d891
inplace group_norm ( #15754 )
...
* inplace group
* test=develop
6 years ago
Yancey1989
0f8bd73cc9
cleanup code test=develop
6 years ago
Yancey1989
5677c9d4ee
update comment test=develop
6 years ago
Yancey1989
642fd68ce0
update by comment test=develop
6 years ago
dzhwinter
d94a314db5
add reference. test=develop
6 years ago
dzhwinter
591ad33e32
polish code for reading. test=develop
6 years ago
dzhwinter
18afb77e78
polish code for reading. test=develop
6 years ago
Yan Chunwei
077d12b939
fix scale cleaner ( #15742 )
6 years ago
dzhwinter
684b572307
polish code for reading. test=develop
6 years ago
Tao Luo
daab5ad77e
Merge pull request #15719 from wojtuss/wojtuss/remove-duplicated-code
...
Removed duplicated code
6 years ago
Hongyu Liu
8c0292dead
Merge pull request #15717 from phlrain/fix_leak
...
Fix lstm possible leak
6 years ago
Zeng Jinle
fc87ef741b
Merge pull request #15687 from sneaxiy/fix_enforce
...
fix enforce
6 years ago
dzhwinter
3787e61fca
polish code for reading. test=develop
6 years ago
dzhwinter
c1455e606d
Merge remote-tracking branch 'origin/develop' into cherry-pick/memory
...
test=develop
6 years ago
dzhwinter
d376cf71b7
polish code for reading. test=develop
6 years ago
Xin Pan
69f5c0eee2
Merge pull request #15557 from panyx0718/imperative
...
add sugar for fetching parameters and layers
6 years ago
Zhen Wang
832bd720d1
Merge pull request #15610 from wzzju/quantization_inference_passes
...
Quantization inference passes
6 years ago
Tao Luo
4da291c6a3
Merge pull request #15726 from qingqing01/fix_api_doc
...
Fix row_conv doc
6 years ago
Dun
e4b9fcdbd2
More restrict check load_combine_op. ( #15479 )
...
* fix && test=develop
* fix && test=develop
* test=develop
6 years ago
qingqing01
48a5cccbcd
Fix debug mode in prior_box_op ( #15702 )
...
* Fix debug mode in prior_box_op
* Refine code
6 years ago
Dang Qingqing
2868232556
Fix row_conv doc
...
test=develop
6 years ago
Xin Pan
989138378d
add sugar for fetching parameters
...
test=develop
6 years ago
tensor-tang
a3a3d3d861
add embseqpool jitkernel mkl impl and use it
...
test=develop
6 years ago
tensor-tang
15da2f9a0d
add embseqpool jitkernel refer code, test and benchmark
...
test=develop
6 years ago
tensor-tang
c2ccf14590
Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
6 years ago
qingqing01
abcefe7211
Fix debug mode in fake_quantize_op ( #15693 )
...
* Fix debug mode in fake_quantize_op
* Remove template specialization
6 years ago
Yancey1989
7cd6de37f5
fix cpu test=develop
6 years ago
Yancey1989
bd0d44af24
fix build failed test=develop
6 years ago
liuhongyu
029be5fda9
fix lstmp bug; test=develop
6 years ago
Yancey1989
ecdd1166b8
cleanup code test=develop
6 years ago
Yancey1989
73005ee00d
cleanup code test=develop
6 years ago
liuhongyu
393fa6021e
set lstm lstmp unsed pointer to nullptr; test=develop
6 years ago
Yancey1989
88d3dc949e
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
...
test=develop
6 years ago
liuhongyu
869f00ffc6
set lstm lstmp unsed pointer to null
6 years ago
Yancey1989
f3463ecb6e
refine pg execution
6 years ago
Yan Chunwei
3a5d6e5e64
move passes to src to avoid different behavior in deployment ( #15705 )
6 years ago
Wojciech Uss
daac6a05f5
Removed duplicated code
...
This also fixes linking to libpaddle_fluid.so built in debug mode
test=develop
6 years ago
jerrywgz
6f11f35abe
Merge pull request #15703 from jerrywgz/enhance_expand_op
...
support multiple var types for expand op
6 years ago
Yan Chunwei
c00ed19df2
add more comment ( #15603 )
6 years ago
sneaxiy
f0590947c3
fix enforce
...
test=develop
6 years ago
Tao Luo
3086502522
Merge pull request #15704 from Sand3r-/mgallus/old-fc-mkldnn-branch-fix-develop
...
Fix old FC backward weights descriptor creation
6 years ago
dzhwinter
283573c6aa
add details. test=develop
6 years ago
chengduo
5a03b515ae
fix potential bug in async_executor ( #15707 )
...
test=develop
6 years ago
乔龙飞 Qiao Longfei
45b19cbc9a
Revert "Revert "cpu reduce mode did not need to broadcast params test=develop""
6 years ago
baojun
c47e258ea4
Add ngraph sum, sigmoid, relu_grad and tanh_grad op ( #15642 )
...
* Added ngraph sum op test=develop
* Added sigmoid, relu_grad and tanh_grad test=develop
* remove duplicates test=develop
6 years ago
tensor-tang
33d0cebbff
Merge pull request #15695 from tensor-tang/fix/name
...
fix jitcode name, use after free
6 years ago
Michal Gallus
7a8eff36a6
Fix old FC backward weights descriptor creation
...
test=develop
6 years ago
chengduo
ad61e1b22c
fix potential bug ( #15688 )
...
test=develop
6 years ago
dzhwinter
6d6ddcfe15
add details. test=develop
6 years ago
Zhen Wang
bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
6 years ago
dzhwinter
f9ac88e1a0
Merge pull request #15694 from liuwei1031/fix_security_issue
...
Fix security issue
6 years ago
jerrywgz
8fc0fc314a
support multiple var types for expand op, test=develop
6 years ago
tensor-tang
fb2a7b2300
fix aligned-new error in jitkernel ( #15626 )
...
* fix aligned-new error in jitkernel
test=develop
* override genbase new to fix mis-align
test=develop
6 years ago
乔龙飞 Qiao Longfei
08ad72d0b9
Merge pull request #15679 from jacquesqiao/update-lookup_table_grad-padding-index
...
lookup_table_grad kernel should consider padding_idx test=develop
6 years ago
Tao Luo
d9270e34d1
Merge pull request #15691 from luotao1/activation_doc
...
fix generate doc error in activation ops
6 years ago
dzhwinter
11afbe0f53
add details. test=develop
6 years ago
tensor-tang
e49706c80e
Merge pull request #15659 from GBuella/add_to_string
...
Tests - add some missing to_string calls
6 years ago
tensor-tang
15d7220f94
fix jitcode name
...
test=develop
6 years ago
liuwei1031
14fe9219dc
reset unexpected changes, test=develop
6 years ago
tensor-tang
31fd8ce1e1
Merge pull request #15375 from mozga-intel/mozga-intel/batch_norm_ngraph_operator
...
Enable batch_norm operator for a ngraph engine
6 years ago
liuwei1031
b1f97a6fa9
fix security issue 27, 38 test=develop
6 years ago
liuwei1031
1413d83a0f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
6 years ago
Tao Luo
882e7ec480
fix generate doc error in activation ops
...
test=develop
6 years ago
Gabor Buella
da9c94da33
Clang build fixes ( #15628 )
...
* Remove some superfluous std::move calls
The std:move triggered a build error (with -Werror):
```
[ 9%] Building CXX object paddle/fluid/memory/allocation/CMakeFiles/allocator_facade.dir/allocator_facade.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: error: moving a temporary object prevents copy elision [-Werror,-Wpessimizing-move]
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^
/home/tej/code/gbuella_paddle/paddle/fluid/memory/allocation/allocator_facade.cc:86:29: note: remove std::move call here
[this] { return std::move(CreateAllocatorWithChunk()); }, capacity);
^~~~~~~~~~ ~
1 error generated.
```
See: https://reviews.llvm.org/D7633
* Remove a superfluous lambda capture from framework/operator.h
```
[ 10%] Building CXX object paddle/fluid/platform/CMakeFiles/device_context.dir/init.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/platform/init.cc:19:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.h:229:21: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture]
[this](Variable* var) { return var; });
^~~~
1 error generated.
```
Changing it to `return it->second;`, as is in the function below.
* Rethrow an exception (instead of copying it)
```
[ 11%] Building CXX object paddle/fluid/framework/CMakeFiles/operator.dir/operator.cc.o
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: error: local variable 'exception' will be copied despite being thrown by name [-Werror,-Wreturn-std-move]
throw exception;
^~~~~~~~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:191:13: note: call 'std::move' explicitly to avoid copying
throw exception;
^~~~~~~~~
std::move(exception)
```
See https://reviews.llvm.org/D43322 for an explanation of this diagnostic message.
* Remove an unused variable
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/operator.cc:884:16: error: private field 'scope_' is not used [-Werror,-Wunused-private-field]
const Scope& scope_;
^
```
* struct ComputationOpHandle -> class ComputationOpHandle
```
[ 13%] Building CXX object paddle/fluid/framework/details/CMakeFiles/memory_early_delete_pass.dir/memory_early_delete_pass.cc.o
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/framework/details/memory_early_delete_pass.cc:21:
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: error: class 'ComputationOpHandle' was previously declared as a struct; this is valid, but may result in linker errors under the Microsoft C++ ABI [-Werror,-Wmismatched-tags]
class ComputationOpHandle;
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/computation_op_handle.h:29:8: note: previous use is here
struct ComputationOpHandle : public OpHandleBase {
^
/home/tej/code/gbuella_paddle/paddle/fluid/framework/details/reference_count_pass_helper.h:30:1: note: did you mean struct here?
class ComputationOpHandle;
^~~~~
struct
1 error generated.
```
* Fix name() methods under fluid/operators
```
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.cc:15:
In file included from /home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/act.h:19:
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen/jitcode.h:71:23: error: 'name' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override]
virtual const char* name() const = 0;
^
/home/tej/code/gbuella_paddle/paddle/fluid/operators/jit/gen_base.h:31:23: note: overridden virtual function is here
virtual const char* name() const = 0;
^
```
test=develop
6 years ago
Zeng Jinle
fea7f0de25
Merge pull request #15667 from sneaxiy/fix_decorator_signature
...
Fix decorator signature error
6 years ago
kolinwei
acfe28d5eb
Merge pull request #15684 from PaddlePaddle/revert-15661-fix-cpu-broadcast
...
Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Xin Pan
d670d8ef1d
Merge pull request #15671 from cjld/fix_graph
...
fix bug CreateControlDepVar duplicate name
6 years ago
乔龙飞 Qiao Longfei
6e0e706198
Revert "cpu reduce mode did not need to broadcast params test=develop"
6 years ago
Qiao Longfei
76c1378a70
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update-lookup_table_grad-padding-index
...
test=develop
6 years ago
Qiao Longfei
97b143fb49
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-cpu-broadcast
...
test=develop
6 years ago
Qiao Longfei
29a4b21bc8
fix problem test=develop
6 years ago
Qiao Longfei
7b673bce6a
lookup_table_grad kernel should consider padding_idx test=develop
6 years ago
jerrywgz
5ce48220f1
change default option related to softmax, test=develop
6 years ago
dzhwinter
381f2015a5
Merge pull request #15665 from dzhwinter/experiment/refactor_memory
...
refactor optimize pass.
6 years ago
Qiao Longfei
ffd0d1d216
clean need_broadcast_var_ test=develop
6 years ago
Qiao Longfei
fbadd4b60c
follow comment test=develop
6 years ago
xuezhong
eeaa2066e5
add device info to tensor
...
test=develop
6 years ago
xuezhong
9b24ac34dd
remove debug print
...
test=develop
6 years ago
xuezhong
50b48400bb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
...
test=develop
6 years ago
xuezhong
c0b8fd7ca0
update lstmp op api spec
...
test=develop
6 years ago
dzhwinter
04e9776aef
add details. test=develop
6 years ago
dzhwinter
b80bcbb4fd
Merge pull request #15660 from dzhwinter/enhance/memory
...
add elementwise_xxx_grad for inplace optimize
6 years ago
mozga-intel
1198ccae6b
Enable batch_norm operator for a ngraph engine
...
test=develop
6 years ago
xuezhong
58101e6d4d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
...
test=develop
6 years ago
xuezhong
4921c2cd02
add api spec change
...
test=develop
6 years ago
baojun
f4a0e68481
Fix ngraph compile WITH_DISTRIBUTE=ON ( #15636 )
...
* fix compile issue with_distribute test=develop
* simplified logic test=develop
* use ngraph dependency test=develop
* set cpu only test=develop
* update test and eliminate fp16 test test=develop
6 years ago
Tao Luo
2fbfebc534
Merge pull request #15666 from Superjomn/fix/anakin-api-dependency
...
fix anakin compile dependency
6 years ago
xuezhong
fb261793b9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
...
test=develop
6 years ago
Dun Liang
1905f1a108
bug fix && test=develop
6 years ago
xuezhong
fb9a6a2bc6
pass test for lstm op
...
test=develop
6 years ago
xuezhong
1abb0d835e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
...
test=develop
6 years ago
sneaxiy
42f6d0f899
modify API.spec
...
test=develop
6 years ago
Chunwei
d85c2e4e5c
fix anakin compile dependency
...
test=develop
6 years ago
xuezhong
2ba256df40
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
dzhwinter
104d3b4e68
add details. test=develop
6 years ago
dzhwinter
3932cd6714
Merge remote-tracking branch 'origin/develop' into enhance/memory
6 years ago
Dun Liang
ceec13562c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_checkpoint
...
test=develop
6 years ago
Dun Liang
bc92192747
Fix Pr #15296
...
test=develop
6 years ago
Qiao Longfei
76072261f8
fix compiler
...
test=develop
6 years ago
Qiao Longfei
b99db0e2c2
cpu reduce mode did not need to broadcast test=develop
6 years ago
dzhwinter
94dd50c33f
add details. test=develop
6 years ago
Gabor Buella
4975a9050a
Tests - add some missing to_string calls
...
```
/home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: error: adding 'int' to a string does not append to the string [-Werror,-Wstring-plus-int]
std::string prefix = "seqpool_op_" + i;
~~~~~~~~~~~~~~^~~
/home/tej/code/gbuella_paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass_tester.cc:167:40: note: use array indexing to silence this warning
std::string prefix = "seqpool_op_" + i;
^
& [ ]
1 error generated.
```
test=develop
6 years ago
Tao Luo
bec68fa0b3
Merge pull request #15637 from jerrywgz/refine_box_coder
...
speed up box_coder in CPU
6 years ago
乔龙飞 Qiao Longfei
7ddf4e2c55
Merge pull request #15649 from jacquesqiao/fix-ctr-reader-svm
...
fix ctr reader read svm data
6 years ago
peizhilin
883d22093a
fix the lib_any dependency
...
test=develop
6 years ago
Qiao Longfei
fa77186fdc
fix ctr_reader_test test=develop
6 years ago
Qiao Longfei
2afe82fe83
fix ctr reader read svm data
...
test=develop
6 years ago
Kaipeng Deng
9df7bc2c5a
fix exlusive pool doc. test=develop ( #15632 )
6 years ago
wopeizl
3614dadf23
Merge pull request #15631 from wopeizl/windows/fixci
...
fix ci broken randomly and disable some warnings
6 years ago
tensor-tang
18bff5298d
extract fused_emb_seq_pool forward function
...
test=develop
6 years ago
wopeizl
c1e18b13aa
Merge pull request #15635 from wopeizl/fixbuildissue
...
fix the build issue on gpu mode for win
6 years ago
peizhilin
55510744b5
test=develop
6 years ago
peizhilin
061299be87
fix dependency
...
test=develop
6 years ago
baojun
ac4cde009d
Enable accuracy op for ngraph engine ( #15592 )
...
* Added accuracy ngraph op test=develop
* fixed name type test=develop
6 years ago
dzhwinter
488719bac7
Enhance/memory optimize ( #15634 )
...
* add skip send.recv test=develop
* enhanced print message. test=develop
* rerun ci. test=develop
6 years ago
Gabor Buella
2bf63f4c33
Fix std::abs usage in memory_optimize_pass.cc ( #15627 )
...
test=develop
size_t is an unsigned integer, with a conversion rank
larger than int, therefore in the following expression
the int value was promoted to size_t, making it a
subtraction of unsigned values. The result of such
a subtraction is also an unsigned value.
6 years ago
peizhilin
db563ec2cd
test=develop
6 years ago
dzhwinter
5d30b55de1
rerun ci. test=develop
6 years ago
kolinwei
6f0f8045f6
Revert "Async double buffered py reader"
6 years ago
jerrywgz
ceb412b0ae
speed up box coder in CPU, test=develop
6 years ago
dzhwinter
4ef34916a4
enhanced print message. test=develop
6 years ago
peizhilin
238ef94702
fix the build issue on gpu mode for win
...
test=develop
6 years ago
xuezhong
e261b60f97
change api spec for adagrad optimizer
...
test=develop
6 years ago
dzhwinter
ce0394bcd0
merge develop branch. test=develop
6 years ago
peizhilin
3a4110f960
fix ci broken randomly and disable some warnings
...
test=develop
6 years ago
Dun
db8fcf6b95
Merge pull request #15296 from cjld/async_double_buffered_py_reader
...
Async double buffered py reader
6 years ago
xuezhong
dff7461ea8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
...
test=develop
6 years ago
Zeng Jinle
1ccbc51218
Merge pull request #15619 from sneaxiy/debug
...
Fix unittest failure in debug mode
6 years ago
Xin Pan
74bc55c2a6
Merge pull request #14975 from dzhwinter/ir_inplace_pass
...
Ir inplace pass
6 years ago
dzhwinter
cca71532eb
add skip send.recv test=develop
6 years ago
dzhwinter
9f001c6525
skip dist. test=develop
6 years ago
xuezhong
2857dac260
add assert for clip and remove print
6 years ago
xuezhong
4028943125
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
Tao Luo
546eefae6f
Merge pull request #15591 from lidanqing-intel/lidanqing/density_prior_box
...
optimize density_prior_box_op.h in detect model for cpu
6 years ago
sneaxiy
f26a1c9077
test=develop
6 years ago
Yan Chunwei
dc5e25fc7f
remove dot marked node ( #15606 )
6 years ago
dzhwinter
2561a6fc59
follow comment. test=develop
6 years ago
dzhwinter
2a5ecb68b0
follow comment. test=develop
6 years ago
dzhwinter
9f693fcac4
rerun ci. test=develop
6 years ago
guoshengCS
b6c3b69af8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-beam-search-size
...
test=develop
6 years ago
dzhwinter
e537634d16
delete graph print pass. test=develop
6 years ago
dzhwinter
4f01de6378
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter
46a6cac91f
fix batch norm. test=develop ( #15597 )
6 years ago
qingqing01
943d972878
Fix analysis predictor when loading the persistable RAW type variable. ( #15613 )
6 years ago
liuwei1031
6e84eb131f
expose peak gpu memory API to python test=develop ( #15529 )
...
* expose peak gpu memory API to python test=develop
* add unittest for peak gpu memory monitoring test=develop
* add pybind change test=develop
* add mutex to gpu mem usage monitor test=develop
* update benchmark flag definition file test=develop
* tweak unittest for memory monitoring test=develop
6 years ago
dzhwinter
5cab99a686
fuck windows. rerun windows ci. test=develop
6 years ago
dzhwinter
9c9ad7d40b
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
...
test=develop
6 years ago
jerrywgz
1743d1a58f
Merge pull request #15356 from jerrywgz/add_clip_op
...
Add box clip op
6 years ago
dzhwinter
0a63234c85
follow comments. test=develop
6 years ago
mozga-intel
43a67a2662
Enable conv2d operator for a ngraph engine ( #15269 )
...
test=develop
6 years ago
tensor-tang
a6a1a92ef7
Merge pull request #15586 from tensor-tang/jit/cache
...
refine bert
6 years ago
jerrywgz
4f18a9b87b
test=develop
6 years ago
guoshengCS
5dfce93101
To make CUDA_LAUNCH_KERNEL_HELPER support large size.
...
test=develop
6 years ago
Yan Chunwei
e887d71958
fix ir debug config ( #15571 )
6 years ago
Yan Chunwei
897789b16e
fix save_inferece_model bug ( #15365 )
6 years ago
lidanqing-intel
4b3c6612a1
optimize density_prior_box_op.h for cpu
...
test=develop
6 years ago
xuezhong
4c98c2ccc3
remove debug print
6 years ago
WangZhen
2175292634
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
...
test=develop
6 years ago
xuezhong
58ad40cc15
add sample_logits op
6 years ago
tensor-tang
ba02ac4692
use mat attr and refine test ( #15448 )
...
* use mat attr and refine test
test=develop
* add matmul jitcode
test=develop
* fix mac compile
test=develop
6 years ago
xuezhong
b0c75f1763
remove debug print
6 years ago
xuezhong
880836329d
add cell clip and proj clip, fix bug for h0
6 years ago
dzhwinter
9e87fbebb7
rerun windows ci. test=develop
6 years ago
dzhwinter
6f9904e99a
rerun windows ci. test=develop
6 years ago
jerrywgz
4eb44380a6
Merge branch 'develop' into add_clip_op
6 years ago
Xin Pan
30cc8b7a92
Merge pull request #15554 from heavengate/yolo_loss_darknet
...
Yolo loss darknet
6 years ago
Tao Luo
1a252f4be6
Merge pull request #15587 from luotao1/bert
...
use embedding=128 bert model for test
6 years ago
Haihao Shen
1b8047b712
Add INT8 calibration support in Paddle package ( #15569 )
...
* Add INT8 calibration support in Paddle package; test=develop
6 years ago
mozga-intel
312500dcb5
Enable pool2d operator for a ngraph engine ( #15395 )
...
* Enable pool2d operator for a ngraph engine
test=develop
* Update
test=develop
6 years ago
Jiabin Yang
b4c24f3f7c
Merge pull request #15575 from JiabinYang/feature/imperative
...
test=develop, polish code and fix some wrong change
6 years ago
dzhwinter
a52be7c081
refine build strategy. test=develop
6 years ago
Tao Luo
ea92905be4
Merge pull request #15478 from kbinias/kbinias/seperate-folders-for-mkldnn
...
Make separate folders for mkldnn codes
6 years ago
dzhwinter
32a2014939
refine build strategy. test=develop
6 years ago
Yibing Liu
170842cbb4
Some improvements to support bert mixed precision training ( #15585 )
...
* Some improvements to support bert mixed precision training
test=develop
* Revert the cast in layer_norm
test=develop
6 years ago
Yiqun Liu
16d54f7f23
Return parent_idx in beam_search op ( #15520 )
...
* Refine beam_search_op to output an extra parent_idx tensor.
test=develop
* Fix the unittest test_beam_search_op.
test=develop
* Fix the merging mistake.
test=develop
6 years ago
jerrywgz
72ee3c6232
Merge pull request #15398 from jerrywgz/add_axis_for_boxcoder
...
Add axis for boxcoder
6 years ago
jerrywgz
e402c0ec7d
test=develop
6 years ago
Kaipeng Deng
d3eeb92bba
Merge pull request #15491 from tink2123/new_align_corners
...
add align_corners and align_mode for image_resize
6 years ago
jerrywgz
3046799ecd
Merge branch 'develop' into add_clip_op
6 years ago
dzhwinter
1a44b2fbe8
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
Jiabin Yang
2d0ffdc485
test=develop, fix debug mode unitest, hsigmoid ( #15574 )
6 years ago
Zhaolong Xing
90ffe74954
Merge pull request #15546 from NHZlX/fix_trt_utest_random_failed
...
fix trt models utest failed.
6 years ago
luotao1
8f0c2b07f2
use embedding=128 bert model for test
...
test=develop
6 years ago
tensor-tang
2b0811c3fb
refine vadd jitkernel choice
...
test=develop
6 years ago
tensor-tang
a18c0d4242
cache fc kernel
...
test=develop
6 years ago
tensor-tang
6e1ee7fb57
cache softmax kernel func
...
test=develop
6 years ago
Krzysztof Binias
69b7c595d6
Small fix
...
test=develop
6 years ago
Krzysztof Binias
b1bdcd4de8
Make separate folders for mkldnn codes
...
test=develop
6 years ago
dzhwinter
06f2448848
Merge remote-tracking branch 'origin/develop' into feature/ir_inplace_pass
6 years ago
dzhwinter
8156fedf56
merge develop branch. test=develop
6 years ago
dengkaipeng
23d34d1f7e
move yolov3_loss to detection. test=develop
6 years ago
JiabinYang
16f64b43d4
test=develop, Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/imperative
6 years ago
tensor-tang
c7449227e8
Merge pull request #15563 from tensor-tang/jit/softmax
...
refine softmax kernel
6 years ago
dengkaipeng
733bb82ec0
downsample -> downsample_ratio. test=develop
6 years ago
dengkaipeng
ae0b0d5f93
fix doc. test=develop
6 years ago
dengkaipeng
56e21c558e
add comments and docs. test=develop
6 years ago
dengkaipeng
577424e5ec
use darknet loss and trick
6 years ago
dengkaipeng
042fecefab
use L2Loss. test=develop
6 years ago
dengkaipeng
af124dcdf6
fix API error
6 years ago
dengkaipeng
c945ffa7f8
fix label_smooth and mixup score
6 years ago
tink2123
2b89f59055
add attr use_label_smooth test=develop
6 years ago
dengkaipeng
8218e30176
add gtscore. test=develop
6 years ago
dengkaipeng
3c08f620c2
add label smooth. test=develop
6 years ago
dengkaipeng
cc01db6029
calc valid gt before loss calc. test=develop
6 years ago
dengkaipeng
32d533c2cd
cache obj_mask and gt_match_mask. test=develop
6 years ago
dengkaipeng
6c5a5d0789
format code. test=develop
6 years ago
dengkaipeng
e7e4f084e5
ignore pred overlap gt > 0.7. test=develop
6 years ago
dengkaipeng
bd6deb1a8b
fix API.spec change. test=develop
6 years ago
dengkaipeng
db8ff57a61
remove useless code and update doc. test=develop
6 years ago
dengkaipeng
577a92d992
use typename DeviceContext. test=develop
6 years ago
dengkaipeng
0c4acc8305
imporve yolo loss implement. test=develop
6 years ago
dengkaipeng
2fbfef2ec9
fix no box expression. test=develop
6 years ago
dengkaipeng
c0fa8d2eec
use L1Loss for w, h. test=develop
6 years ago
dengkaipeng
3841983aa0
fix division error in mean process. test=develop
6 years ago
dengkaipeng
192d293854
use stable Sigmoid Cross Entropy implement. test=develop
6 years ago
Tao Luo
245b1f0579
Merge pull request #15570 from luotao1/bert
...
fix compiler error, use len20 dataset for bert
6 years ago
tink2123
909f864a9b
remove unnecessary flags
...
test=develop
6 years ago
JiabinYang
bb881199f2
test=develop, polish code and fix wrong change in /paddle/fluid/inference/utils/CMakeLists.txt
6 years ago
tink2123
6961a94e94
avoid out_size less than 1
...
test=develop
6 years ago
Jiabin Yang
075df09f86
Merge pull request #15470 from JiabinYang/feature/imperative
...
Add simple RNN in imperative
6 years ago
Qiyang Min
b69996c2d3
Merge pull request #15558 from velconia/imperative_resnet
...
Refine Batch Norm
6 years ago
luotao1
5504425eb3
fix compiler error, use len20 dataset for bert
...
test=develop
6 years ago
Yan Chunwei
655179089f
AnalysisConfig remove contrib namespace ( #15540 )
6 years ago
jerrywgz
7bc8481c62
Merge pull request #15418 from jerrywgz/refine_nms
...
Refine nms
6 years ago
tensor-tang
d59f733551
refine softmax and use with cache
...
test=develop
6 years ago
tensor-tang
7383eefd2d
add softmax mix and mkl code
...
test=develop
6 years ago
tensor-tang
50945685f2
add hmax, hsum jitcode
...
test=develop
6 years ago
tensor-tang
8117725852
add jit kernel hsum, hmax and softmax refer code
...
test=develop
6 years ago
Tao Luo
67e4450c34
Merge pull request #15485 from luotao1/fc500110-bert_test
...
add bert analyzer test
6 years ago
Qiyang Min
6000a6e76e
Merge pull request #15312 from velconia/add_pyramid_dnn_support
...
Use malloc and free in JeMalloc
6 years ago
Jiabin Yang
fd286f3596
Merge pull request #15534 from JiabinYang/fix/multi_output_support_imperative
...
test=develop, fix/multi_output_support_imperative
6 years ago
minqiyang
07822fef2c
Clear all parameters' gradient
...
test=develop
6 years ago
Zeng Jinle
bf7dedcbc7
Merge pull request #15545 from sneaxiy/fix_debug_nccl_error
...
Fix nccl unittest error in debug mode
6 years ago
minqiyang
49a7fba848
Polish code
...
test=develop
6 years ago
minqiyang
159c407328
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_resnet
...
test=develop
6 years ago