Jacek Czaja
71d883b8ef
[MKL-DNN] Reimplemented pool2d mkl-dnn to use Acquire API ( #18585 )
...
* - Added partial draft of pooling acquire
- Workspace support
- compilation fix
- Added draft of pooling backward reimplementation
- Segfault fix
- reverted 'any' for diff_dst creation in pooling
- Lint fixes
test=develop
- lint fixes
test=develop
- Further lint fixes
test=develop
* - Fixes after review
test=develop
* - Lint fixes
test=develop
* - Even more lint fixes
test=develop
6 years ago
chengduo
f4ec7d54c8
fix bug of scatter op ( #18640 )
...
test=develop
6 years ago
tianshuo78520a
112cf850b7
change pip install whl;test=develop ( #18635 )
6 years ago
guru4elephant
ab57d3893e
make auc op compatible with 1 dim ( #18551 )
...
* make auc op compatible with 1 dim
6 years ago
guru4elephant
b71b454328
increase timeout again ( #18628 )
...
test=develop
6 years ago
tianshuo78520a
de22215c8f
change const_cast error message ( #18620 )
6 years ago
Leo Zhao
ff77dea969
not use transferscope cache in cpu case ( #18578 )
...
* not use transferscope cache in cpu case
test=develop
* adjust variable name and add comments
test=develop
* use correct format for class member in operator.h
* use correct format for class member in operator.cc
test=develop
6 years ago
123malin
b414645a65
fix #17430 : training with int64-type attr gives unexpected results ( #18264 )
...
* fix int64_t
* update fill constant op unittest
* add empty line
6 years ago
tangwei12
db212bb932
delete AllocatorFacade destructor ( #18606 )
...
* delete m, test=develop
6 years ago
Kevin
995d7d8600
Modify embedding_op input dtype to int64 ( #18598 )
6 years ago
kh2se2013
9ad57f2dfd
1) change to parallel mode on python coverage run ( #18594 )
...
2) add pip install coverage in Dockerfile.tmp
test=develop
6 years ago
Tao Luo
076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy ( #18580 )
...
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy
test=develop
* enhance MkldnnPostReset
test=develop
* add comments for mkldnn_cache_capacity field
test=develop
6 years ago
Hongyu Liu
a20b2b43fc
fix cudnn lstm shape bug; test=develop ( #18492 )
6 years ago
gongweibao
c0a82748cf
Polish backwards optimizer dependency codes and use more default values. ( #18255 )
6 years ago
Zeng Jinle
d3003a1620
Feature/buffer_shared_inplace ( #17911 )
...
* feature/buffer_shared_inplace, test=develop
* refine code, test=develop
* fix elementwise_add op cpu inplace and sum inplace bug, test=develop
* add unittest and debug log, test=develop
* fix parallel_executor scope bug, polish code, test=develop
* fix sum op, activation op, single_in_place_inference bug, test=develop
* remove kLocalExecScopeName, test=develop
* fix unittest,test=develop
* fix out_var first version bug, test=develop
* follow comments,test=develop
6 years ago
tianshuo78520a
1c10dac4f2
Add code example in CI ( #18228 )
...
* test api example
* update python
* add sampcd_processor.py
* add if 0
* sort
* test paddle
* test paddle
* test paddle
* add whitelist
* change sampcd_processor.py
* change sampcd_processor.py
* change sampcd_processor.py
* add exit
* test=develop
* test=develop
6 years ago
Zeng Jinle
be24e5b391
Clean unused code of dim and place ( #18565 )
...
* clean code of dim and place, test=develop
* fix failed unittests, test=develop
6 years ago
Jacek Czaja
8869d7f735
Activations MKLDNN ops refactoring ( #18191 )
6 years ago
lujun
b6d5c74f69
update dygraph api doc for web ( #18550 )
...
remove dygraph.enable from __all__
hide dygraph.profiler
add doc to dygraph.no_grad
6 years ago
Yibing Liu
b86234fc0b
Register fp16 for concat_op ( #18563 )
6 years ago
guru4elephant
9c17a899d7
upgrade collective fleet api ( #18533 )
...
* upgrade collective fleet api
6 years ago
Physher
5e1220ef37
fix compile error caused by gcc4.8 related commit;test=develop ( #18567 )
6 years ago
bingyanghuang
a25be53cb5
QAT int8 MKL-DNN transformation pass with MUL ( #18322 )
6 years ago
Jiabin Yang
667f88f9a6
Fix/gcc 4.8 ubt link error ( #18558 )
...
* test=develop, fix docker with paddle nccl problem
* test=develop, fix/gcc_4.8_ubt_link_error
* test=develop, fix code format
6 years ago
Physher
0caa08ea40
Add mkldnn int8 mul-op kernel ( #17834 )
6 years ago
kh2se2013
ac81c81be1
unset CMAKE_BUILD_TYPE when WITH_COVERAGE = ON ( #18541 )
...
install coverage package in develop image
test = develop
6 years ago
LielinJiang
24d1c44a0c
Fix roi_perspective_transform_op bug ( #18522 )
...
* fix transform matrix bug, test=develop
* modify API.spec
6 years ago
Zhaolong Xing
88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. ( #18532 )
...
* Fix Mask rcnn predictor
1. refine memory optim algorithm to support the model with the block op.
2. output diff : modify the affine channel fuse
3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop
* add the missing files.
test=develop
6 years ago
石晓伟
1529154821
Support Bitmain Anakin ( #18542 )
...
* update anakin-engine interfaces for content-dnn
test=develop
* support only-gpu mode of Anakin
modify eltwise parse
test=develop
* modification for thread-safe
test=develop
* Integrated template instance
test=develop
* increase template parameters
test=develop
* support MLU predictor
test=develop
* update anakin cmake files
test=develop
* update TargetWrapper::set_device
* update the initialization of anakin subgraph
test=develop
* use the default constructor of base class
test=develop
* load model from buffer with length
test=develop
* modify the access level of class
test=develop
* support anakin for bitmain arch
test=develop
* remove files
* checkout cmakelists
test=develop
6 years ago
tianshuo78520a
9b3d3b8387
Cancel jacquesqiao approval authority ( #18538 )
6 years ago
Leo Zhao
ce38bb5341
use static variable to do cache instead of thread local in thread frequent switching case ( #18428 )
6 years ago
gongweibao
160ddc980c
Regroup fusion by data type. ( #18496 )
6 years ago
guru4elephant
6f6ecbec4e
remove benchmark folder, since there is a benchmark repo already, distributed benchmark will be maintained in fleet repo ( #18537 )
...
test=develop
6 years ago
guru4elephant
1f1cc2221f
add random port ( #18504 )
...
* add random port
6 years ago
Tao Luo
fe32879d2a
add mkldnn shapeblob cache clear strategy ( #18513 )
...
* add mkldnn shapeblob cache clear strategy
test=develop
* refine with comments
test=develop
* make cache clear strategy safer
test=develop
* add lock for GetShapeBlobSize
test=develop
6 years ago
chengduo
e576f2667b
update docker build ( #18523 )
...
test=develop
6 years ago
zhaoyuchen2018
832d8191ff
Fix topk cannot handle 1D vector bug ( #18466 )
...
* Fix topk cannot handle 1D vector bug
Add path to handle 1D vector
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
石晓伟
280a8784f7
Remove the obsolete cmake options ( #18493 )
...
* remove the obsolete cmake options, test=develop
* remove unittests, test=develop
* delete options in paddle/scripts/paddle_build.sh
6 years ago
Jiabin Yang
7586cdd545
Hide no support ( #18515 )
...
* test=develop, fix docker with paddle nccl problem
* test=develop, hide no_support api and add ut for it
6 years ago
LielinJiang
43e17c7951
Add distributions of normal and uniform ( #18023 )
...
* add_distributions_of_normal_and_uniform
* paddle/fluid/API.spec
* modify API.spec
* modified paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* fix some comment, test=develop
* modify API.spec, test=develop
* add comment for init function, modify hard code, test=develop
* modify API.spec, test=develop
* modify API.spec, test=develop
* make unit test function shorter, test=develop
* modify paddle/fluid/API.spec
6 years ago
bingyanghuang
3fe6bf5ee6
fix command line bug in int8v2 readme ( #18507 )
6 years ago
tensor-tang
4828a5e008
core remove pycpuinfo ( #18479 )
...
remove pycpuinfo deps in core
6 years ago
qingqing01
7ac4818a98
Refine Infershape in activation_op for double_grad. ( #18485 )
...
* Refine Infershape in activation_op for double_grad.
6 years ago
qingqing01
602cb6a5b4
Enhance linear_lr_warmup ( #18463 )
...
* make it support float/int learning as input.
6 years ago
chengduo
7453857324
Make fuse_all_reduce_op_pass support mix_precision ( #17652 )
6 years ago
chengduo
55baeceddb
Enhance execution error info ( #18482 )
...
* enhance execution error info
test=develop
6 years ago
石晓伟
047bba855b
Remove the obsolete cmake options ( #18481 )
...
* remove the obsolete cmake options, test=develop
* remove unittests, test=develop
6 years ago
pkpk
e9c7e218f2
Nan debugger init ( #18401 )
...
test=develop
6 years ago
Jiabin Yang
f72ced8814
test=develop, fix docker with paddle nccl problem ( #18451 )
6 years ago
Tao Luo
3f3112ceb0
add shape_blob for cache mkldnn primitive ( #18454 )
...
test=develop
6 years ago