baojun
256ba7cbb8
[NGraph] handle dim element 0 of ngraph op ( #18568 )
6 years ago
chengduo
a6d468a265
fix PE fetch bug ( #18644 )
...
test=develop
6 years ago
liuwei1031
759530966c
print out error code of cudaGetDeviceProperties if failed ( #18643 )
6 years ago
Jacek Czaja
71d883b8ef
[MKL-DNN] Reimplemented pool2d mkl-dnn to use Acquire API ( #18585 )
...
* - Added partial draft of pooling acquire
- Workspace support
- compilation fix
- Added draft of pooling backward reimplementation
- Segfault fix
- reverted 'any' for diff_dst creation in pooling
- Lint fixes
test=develop
- lint fixes
test=develop
- Further lint fixes
test=develop
* - Fixes after review
test=develop
* - Lint fixes
test=develop
* - Even more lint fixes
test=develop
6 years ago
chengduo
f4ec7d54c8
fix bug of scatter op ( #18640 )
...
test=develop
6 years ago
tianshuo78520a
112cf850b7
change pip install whl;test=develop ( #18635 )
6 years ago
guru4elephant
ab57d3893e
make auc op compatible with 1 dim ( #18551 )
...
* make auc op compatible with 1 dim
6 years ago
tianshuo78520a
de22215c8f
change const_cast error message ( #18620 )
6 years ago
Leo Zhao
ff77dea969
not use transferscope cache in cpu case ( #18578 )
...
* not use transferscope cache in cpu case
test=develop
* adjust variable name and add comments
test=develop
* use correct format for class member in operator.h
* use correct format for class member in operator.cc
test=develop
6 years ago
123malin
b414645a65
fix #17430 : unexpected training behavior with int64-type attr ( #18264 )
...
* fix int64_t
* update fill constant op unittest
* add empty line
6 years ago
tangwei12
db212bb932
delete AllocatorFacade destructor ( #18606 )
...
* delete m, test=develop
6 years ago
Kevin
995d7d8600
Modify embedding_op input dtype to int64 ( #18598 )
6 years ago
kh2se2013
9ad57f2dfd
1)change to parallel mode on python coverage run ( #18594 )
...
2)add pip install coverage in Dockerfile.tmp
test=develop
6 years ago
Tao Luo
076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy ( #18580 )
...
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy
test=develop
* enhance MkldnnPostReset
test=develop
* add comments for mkldnn_cache_capacity field
test=develop
6 years ago
Hongyu Liu
a20b2b43fc
fix cudnn lstm shape bug; test=develop ( #18492 )
6 years ago
gongweibao
c0a82748cf
Polish backwards optimizer dependency codes and use more default values. ( #18255 )
6 years ago
Zeng Jinle
d3003a1620
Feature/buffer_shared_inplace ( #17911 )
...
* feature/buffer_shared_inplace, test=develop
* refine code, test=develop
* fix elementwise_add op cpu inplace and sum inplace bug, test=develop
* add unittest and debug log, test=develop
* fix parallel_executor scope bug, polish code, test=develop
* fix sum op, activation op, single_in_place_inference bug, test=develop
* remove kLocalExecScopeName, test=develop
* fix unittest,test=develop
* fix out_var first version bug, test=develop
* follow comments,test=develop
6 years ago
tianshuo78520a
1c10dac4f2
Add code example in CI ( #18228 )
...
* test api example
* update python
* add sampcd_processor.py
* add if 0
* sort
* test paddle
* test paddle
* test paddle
* add whitelist
* change sampcd_processor.py
* change sampcd_processor.py
* change sampcd_processor.py
* add exit
* test=develop
* test=develop
6 years ago
Zeng Jinle
be24e5b391
Clean unused code of dim and place ( #18565 )
...
* clean code of dim and place, test=develop
* fix failed unittests, test=develop
6 years ago
Jacek Czaja
8869d7f735
Activations MKLDNN ops refactoring ( #18191 )
6 years ago
lujun
b6d5c74f69
update dygraph api doc for web ( #18550 )
...
remove dygraph.enable from __all__
hide dygraph.profiler
add doc to dygraph.no_grad
6 years ago
Yibing Liu
b86234fc0b
Register fp16 for concat_op ( #18563 )
6 years ago
Physher
5e1220ef37
fix compile error caused by gcc4.8 related commit;test=develop ( #18567 )
6 years ago
Jiabin Yang
667f88f9a6
Fix/gcc 4.8 ubt link error ( #18558 )
...
* test=develop, fix docker with paddle nccl problem
* test=develop, fix/gcc_4.8_ubt_link_error
* test=develop, fix code format
6 years ago
Physher
0caa08ea40
Add mkldnn int8 mul-op kernel ( #17834 )
6 years ago
LielinJiang
24d1c44a0c
Fix roi_perspective_transform_op bug ( #18522 )
...
* fix transform matrix bug, test=develop
* modify API.spec
6 years ago
Zhaolong Xing
88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. ( #18532 )
...
* Fix Mask rcnn predictor
1. refine memory optim algorithm to support the model with the block op.
2. output diff : modify the affine channel fuse
3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop
* add the missing files.
test=develop
6 years ago
石晓伟
1529154821
Support Bitmain Anakin ( #18542 )
...
* update anakin-engine interfaces for content-dnn
test=develop
* support only-gpu mode of Anakin
modify eltwise parse
test=develop
* modification for thread-safe
test=develop
* Integrated template instance
test=develop
* increase template parameters
test=develop
* support MLU predictor
test=develop
* update anakin cmake files
test=develop
* update TargetWrapper::set_device
* update the initialization of anakin subgraph
test=develop
* use the default constructor of base class
test=develop
* load model from buffer with length
test=develop
* modify the access level of class
test=develop
* support anakin for bitmain arch
test=develop
* remove files
* checkout cmakelists
test=develop
6 years ago
tianshuo78520a
9b3d3b8387
Cancel jacquesqiao approval authority ( #18538 )
6 years ago
Leo Zhao
ce38bb5341
use static variable to do cache instead of thread local in thread frequent switching case ( #18428 )
6 years ago
gongweibao
160ddc980c
Regroup fusion by data type. ( #18496 )
6 years ago
Tao Luo
fe32879d2a
add mkldnn shapeblob cache clear strategy ( #18513 )
...
* add mkldnn shapeblob cache clear strategy
test=develop
* refine with comments
test=develop
* make cache clear strategy safer
test=develop
* add lock for GetShapeBlobSize
test=develop
6 years ago
chengduo
e576f2667b
update docker build ( #18523 )
...
test=develop
6 years ago
zhaoyuchen2018
832d8191ff
Fix topk cannot handle 1D vector bug ( #18466 )
...
* Fix topk cannot handle 1D vector bug
Add path to handle 1D vector
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
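The 1D path described in the topk fix above can be illustrated with a minimal pure-Python sketch. This is not the Paddle kernel; the function name `top_k` and the nested-list representation are assumptions made for illustration. The idea shown is only that a 1D vector is treated as a single row and the result is unwrapped again.

```python
def top_k(values, k):
    """Return (top values, their indices) along the last dimension.

    Hypothetical helper: accepts either a flat list (1D) or a list of
    rows (2D). A 1D input is handled as a single-row matrix.
    """
    is_vector = not isinstance(values[0], list)
    rows = [values] if is_vector else values

    out_values, out_indices = [], []
    for row in rows:
        # Sort indices by descending value; ties keep their original order.
        order = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        out_indices.append(order)
        out_values.append([row[i] for i in order])

    if is_vector:
        # Unwrap the single row so the output rank matches the input rank.
        return out_values[0], out_indices[0]
    return out_values, out_indices
```

With this sketch, `top_k([1, 5, 3], 2)` yields the two largest entries of the vector and their positions, while a 2D input is processed row by row.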
石晓伟
280a8784f7
Remove the obsolete cmake options ( #18493 )
...
* remove the obsolete cmake options, test=develop
* remove unittests, test=develop
* delete options in paddle/scripts/paddle_build.sh
6 years ago
LielinJiang
43e17c7951
Add distributions of normal and uniform ( #18023 )
...
* add_distributions_of_normal_and_uniform
* paddle/fluid/API.spec
* modify API.spec
* modified paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* fix some comment, test=develop
* modify API.spec, test=develop
* add comment for init function, modify hard code, test=develop
* modify API.spec, test=develop
* modify API.spec, test=develop
* make unit test function shorter, test=develop
* modify paddle/fluid/API.spec
6 years ago
bingyanghuang
3fe6bf5ee6
fix command line bug in int8v2 readme ( #18507 )
6 years ago
tensor-tang
4828a5e008
core remove pycpuinfo ( #18479 )
...
remove pycpuinfo deps in core
6 years ago
qingqing01
7ac4818a98
Refine Infershape in activation_op for double_grad. ( #18485 )
...
* Refine Infershape in activation_op for double_grad.
6 years ago
qingqing01
602cb6a5b4
Enhance linear_lr_warmup ( #18463 )
...
* make it support float/int learning rate as input.
6 years ago
chengduo
7453857324
Make fuse_all_reduce_op_pass support mix_precision ( #17652 )
6 years ago
chengduo
55baeceddb
Enhance execution error info ( #18482 )
...
* enhance execution error info
test=develop
6 years ago
石晓伟
047bba855b
Remove the obsolete cmake options ( #18481 )
...
* remove the obsolete cmake options, test=develop
* remove unittests, test=develop
6 years ago
pkpk
e9c7e218f2
Nan debugger init ( #18401 )
...
test=develop
6 years ago
Jiabin Yang
f72ced8814
test=develop, fix docker with paddle nccl problem ( #18451 )
6 years ago
Tao Luo
3f3112ceb0
add shape_blob for cache mkldnn primitive ( #18454 )
...
test=develop
6 years ago
Tao Luo
d234aa02cd
add transfer_scope_cache unit-test ( #18467 )
...
test=develop
6 years ago
zhoukunsheng
7c6f2350b9
support Tensor input for edit_distance op ( #18162 )
6 years ago
zhoukunsheng
26318544d2
support Tensor input for chunk_eval op ( #18226 )
...
* test=develop
support Tensor input for chunk_eval op
* test=develop
fix testcase for chunk_eval op
* test=develop
fix typos in nn.py
6 years ago
zhoukunsheng
206c44e2a8
add unique kernel and op ( #17557 )
6 years ago
zhoukunsheng
71af72b1c2
upgrade hash op to support Tensor and LoDTensor input ( #17998 )
6 years ago
zhoukunsheng
d3b3443d10
add ones_like op ( #17388 )
6 years ago
zhoukunsheng
67b48d7fe7
add size op ( #17412 )
6 years ago
Leo Zhao
8f5fffca0a
rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id() ( #18453 )
...
* rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id()
test=develop
* update session id definition and adjust logic for default behavior
test=develop
* reset logic in mkldnn reuse as most of cases work in default.
test=develop
6 years ago
Tao Luo
3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() ( #18444 )
...
test=develop
6 years ago
Yi Liu
a873fa84ce
supports collective training with programs ( #18392 )
...
1. Since the allreduce op has 4 reduce types, we split these four reduce types into four ops
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and removed the device-specific DeviceContext template parameter, as the target DeviceContext is already known
3. We removed the newly added Collective op role to reduce the complexity of program and graph analysis
6 years ago
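The split of allreduce into one op per reduce type can be sketched as follows. The commit message does not name the four reduce types or the resulting ops; sum, max, min and prod, and the op names in the table below, are assumptions used only for illustration. The simulation runs in a single process and models each rank as a list of numbers.

```python
from functools import reduce
import operator

# Assumed mapping: one op name per reduce type (names are illustrative).
REDUCE_OPS = {
    "c_allreduce_sum": operator.add,
    "c_allreduce_max": max,
    "c_allreduce_min": min,
    "c_allreduce_prod": operator.mul,
}


def simulate_allreduce(op_type, per_rank_values):
    """Combine values element-wise across ranks and give every rank the result."""
    combine = REDUCE_OPS[op_type]
    # zip(*...) groups the i-th element of every rank into one column.
    reduced = [reduce(combine, column) for column in zip(*per_rank_values)]
    # Every rank receives its own copy of the reduced buffer.
    return [list(reduced) for _ in per_rank_values]
```

Having a dedicated op per reduce type means the reduce type is fixed by the op name rather than selected through an attribute at run time.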
tianshuo78520a
85b49d8473
fix the problem that the api.spec file does not get the class comment ( #18439 )
...
* fix the problem that the api.spec file does not get the class comment
* cat new.spec
* check api.spec
* test=develop
6 years ago
chengduo
e0d8c6ac68
Add find_no_grad_vars in backward.py ( #17942 )
...
* add not_been_used_vars to no_grad_set
test=develop
6 years ago
LielinJiang
449c7a9f98
Make roi_perspective_transform op return mask and transform matrix ( #18371 )
...
* modify roi_perspective_transform_op to output mask and transform matrix
* modify comment
* modify comment
* modify API.spec
* update API.spec
* remove no use header, test=develop
* resolve conflict
6 years ago
tensor-tang
a3bc804f5f
fix mac ci random fail ( #18430 )
...
* fix mac ci random fail
* use platform instead
6 years ago
Michał Gallus
7023a86c3a
Fix Pooling output scale ( #18186 )
...
* Int8: Fix Pooling output scale
test=develop
* Update scales quantization for certain operators
These include: concat, transpose, pool and reshape. test=develop
* Move concat minimum scale finding to quantizer
test=develop
6 years ago
Brian Liu
4bc2987d2f
Fix bug in quantize kernel which causes a crash in vgg16/19 model ( #17964 )
...
* Fix bug in quantize kernel which causes a crash in vgg16/19 model
test=develop
* refine the code to reduce verbose code; test=develop
* remove useless code; test=develop
6 years ago
xsrobin
47e2ef38e9
add "import paddle.fluid as fluid" to examples lacking it
6 years ago
tianshuo78520a
92ecb305c2
test=develop ( #18426 )
6 years ago
hutuxian
8a39e5c110
update api format ( #18413 )
...
* update api format
test=develop
* update API.spec
test=develop
6 years ago
jiaqi
93a2b317f7
fix data feed ptr error ( #18419 )
...
fix data feed ptr runtime error: the pipeline trainer can core dump in some cases, so set the pointer to nullptr as the default value.
6 years ago
tensor-tang
ce7a024c6d
fix py-cpuinfo mac random fail ( #18383 )
...
* fix py-cpuinfo mac random fail
* differentiate version on windows
6 years ago
Jie Fang
2b4ef509ea
init custom black white list ( #18377 )
...
test=develop
6 years ago
Leo Zhao
681d3553f1
Fix potential mkldnn concat/pool/conv kernel issues ( #18393 )
...
1. some key generation method is not aligned with PR#17965
2. enlarge ptr lifetime to avoid memory release if SetBlob fails
otherwise it will get core dump.
test=develop
6 years ago
tianshuo78520a
052b044873
Fix mac build nproc command not found ( #18362 )
...
* change nproc 8
6 years ago
Zeng Jinle
f5641000bb
Add a unittest to inplace elementwise_add ( #18385 )
...
* add_elementwise_add_inplace_test,test=develop
* rename file, test=develop
6 years ago
Jiabin Yang
43f64a177e
Fix/program doc ( #17908 )
...
* test=develop, add some comments for Program.clone
* test=develop, add API.spec
* test=develop, refine comments
* refine Program doc and clone doc
* test=develop, refine doc
6 years ago
Jiabin Yang
af874a1f1d
test=develop, fix multigpu hang on latest docker ( #18379 )
6 years ago
chengduo
871cc15e6a
Add is_compiled_with_cuda ( #18356 )
...
* add cuda_is_available
test=develop
* Fix api.spec
test=develop
* fix api doc
test=develop
6 years ago
lujun
fd6631ef2f
Fix dygraph show style ( #18297 )
...
Fix dygraph show style for FluidDoc.
6 years ago
HaoRen
9931bc64f5
add dependency of collective_helper ( #18365 )
...
* add dependency of collective_helper
* test=develop
fix dependency of collective_helper
6 years ago
翟飞跃
19da59ed3f
Remove all the code, API and doc of MKL-DNN INT8v1 ( #18347 )
6 years ago
chengduo
8ed33bf91f
Fix Bug-prone code of PE ( #18354 )
...
* update pe reduce config
test=develop
* drop the local_exe_scopes of the previous parallel_executor
test=develop
6 years ago
tangwei12
999d9a59a5
fix communicator with pyreader ( #18350 )
...
* add is_runnning in communicator, test=develop
6 years ago
tianshuo78520a
cff2c2d83f
add combine_avx_noavx build to dockerfile
...
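A Dockerfile needs to be generated during the avx_noavx build.
After generating the whl with the combine_avx_noavx parameter, the image could not be built because no Dockerfile had been generated. An option to generate the Dockerfile needs to be added.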
6 years ago
kh2se2013
27fb9cad65
add WITH_COVERAGE option, default OFF ( #17872 )
...
* add WITH_COVERAGE option, default OFF
test=develop
* add coverage for python sdk
test=develop
* fix code style
* fix COVERAGE_FILE path
test=develop
* remove coverage package
test=develop
* test = develop, run coverage as module
6 years ago
Michał Gallus
8409693272
Reset DeviceContext after quantization warmup ( #18182 )
...
test=develop
6 years ago
HaoRen
b7128bac5f
supports collective communicated training ( #18175 )
...
* fix prepare context redundant code problem, optimize executor by caching create_variables
test=develop
* supports collective training in executor
* make fetch_list runnable with variables, add more unittest for use_program_cache
test=develop
* fix comment
test=develop
* use unique name for nccl_id
* supports output to stream in program_to_code
* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
* set op role in collective training
* add collective op role
* remove orig file
* add build optimizer by strategy
* add collective strategy
* refine collective strategy
* add multi-process role maker
* refine strategy building factory so that we can easily plug in more strategies
* scale loss grad in collective sgd transpiler
* add support for distributed fc
* code format
* revert some features for dist fc
* add support for distributed fc training
* test=develop
add collective op unittest standard
* test=develop
remove the test_collective directory
* test=develop
remove the test_collective directory
* remove slicegather test
* code format for reducescatter
* update attr of shard_index_op
* Modify macro nccl_helper
* remove test without distribute
* macro collective_helper
* macro update
* test=develop
update support python3.5
* test=develop change gpu memory use to 0.1 when test
* test=develop
update ut equal func
* test=develop
set flags to 1.5
* test=develop fix pickle dump py35
* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream
* test=develop update unittest sync operator I/O
6 years ago
Sylwester Fraczek
9252e8fa08
add int8 mkldnn prior_box ( #17242 )
...
add prior_box quantization code
add scale algo rules for prior box
test=develop
6 years ago
lidanqing
5fd68ac154
some fixes for int8 mobilenet_ssd tester ( #18112 )
...
* some fixes for int8 mobilenet_ssd tester
test=develop
* change wrong data file name
test=develop
* change test images bin file from 200 images to 100 images
* change directory existence to file existence during downloading
test=develop
* reuse download_data
test=develop
* run full dataset when iterations=0
test=develop
6 years ago
Jacek Czaja
c2efdfd5bc
[MKL-DNN] Extending reusing to Elementwise_add_mkldnn op ( #18146 )
...
* - Reusing of reorder used in elementwise_add_mkldnn
- Added MKL-DNN sum prim reusing
test=develop
- Compilation fixes
test=develop
- Yet another compilation fix
test=develop
- Yet another compilation fix
test=develo
- Yet another linking fix
test=develop
- Final compilation fix
test=develop
- lint fixes
test=develop
- Lint fixes
test=develop
* - Fixes after review
test=develop
6 years ago
qingqing01
9047ac687e
Simplify multi_box_head API in detection.py and remove assign op. ( #18310 )
...
* Simplify multi_box_head API in detection.py and remove assign op.
6 years ago
Zeng Jinle
5826b72e06
Refine CUDAPlace error message. ( #18343 )
...
* refine cuda place error msg, test=develop
* use LOG(ERROR)+exit(-1), test=develop
6 years ago
Tao Luo
3c9755bbb9
remove unused jemalloc option ( #18314 )
...
test=develop
6 years ago
Yibing Liu
23941e43ec
Update lamb optimizer ( #18333 )
...
* Update lamb optimizer
test=develop, test=document_preview
* Regenerate api spec
test=develop, test=document_preview
6 years ago
chengduo
135a59ed45
update reduce config ( #18334 )
...
test=develop
6 years ago
tensor-tang
81ec538279
fix softrelu doc ( #18324 )
...
* fix softrelu doc
test=develop
* update API doc
test=develop
6 years ago
Hongyu Liu
df2eee71d8
Sequence mask support tensor ( #18249 )
...
* sequence mask supports max length tensor input; test=develop
* add rnn_impl.py; test=develop
* add basic gru lstm unittest; test=develop
* fix api spec; test=develop
* fix sequence_mask op bug;
test=develop
test=document_preview
* change +-*x to elementwise_op; test=develop
* add mkl flag; test=develop
* fix rnn impl bug; test=develop
* update api spec; test=develop
* fix doc bug; test=develop
* fix lstm bugs; test=develop
6 years ago
Qiao Longfei
0e08e91c18
optimize communicator merge sparse gradient test=develop ( #18159 )
...
* optimize communicator merge sparse gradient test=develop
* revert multithread selected rows merge add test=develop
* follow comment test=develop
6 years ago
chengduo
e06c69c788
Fix default value of fluid.memory_optimize ( #18295 )
...
* fix default value of fluid.memory_optimize
test=develop
* fix api.spec
test=develop
6 years ago
Zhaolong Xing
6978b2e48e
fix split and sampled softmax ( #18280 )
...
test=develop
6 years ago
Yibing Liu
f57ee3693b
Fix the bug of sequence_unpad op ( #18290 )
...
* Use TensorCopySync for sequence_unpad op
test=develop
* Fix the tensor memory alloc bug
test=develop
6 years ago
chengduo
5489216eba
Clean build strategy ( #18148 )
...
* clean build_strategy
test=develop
* DataBalanceOpHandle has been removed
test=develop
* debug
* update build_strategy.
test=develop
6 years ago
chengduo
14e1e165df
update alloc_continuous_space_for_grad_pass ( #18287 )
...
test=develop
6 years ago
lujun
7e61baaa94
add Dygraph api to api.spec ( #18235 )
...
add Dygraph api to api.spec
6 years ago
liuwei1031
a736c03b10
improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs ( #18261 )
...
* improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs, test=develop
* update API.spec, test=develop
6 years ago
flame
fdf798f95a
fix double buffer example ( #18169 )
...
test=develop
test=document_preview
6 years ago
Bai Yifan
23b8b18e56
fix api doc example, test=develop ( #18266 )
6 years ago
xiaoting
2f0d68261c
fix yolo_box example,test=develop ( #18247 )
6 years ago
songhao
6b3d96254d
fix some bugs when merging sparse embedding parameters, test=develop ( #18223 )
...
1. fix the bug that out_put_var in SaveSelectedRows would be empty string
2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference
3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.
6 years ago
jiaqi
3f8031e256
dataset ( #17973 )
...
(1) use channel instead of vector/BlockingQueue in Dataset, to stay consistent with the existing implementation and make the code more readable and flexible (dataset with a single output channel or multiple output channels). One previous out-of-memory problem was caused by not releasing memory after training.
(2) add Record because MultiSlotType costs too much memory (80B); this fixes the out-of-memory problem.
(3) add Channel, Archive in paddle/fluid/framework
(4) change dataset from shared_ptr to unique_ptr in pybind
(5) move create/destroy readers from trainer to dataset
(6) move shuffle from datafeed to dataset. dataset holds memory, datafeed is only for load data and feed data to network.
(7) fix thread num bug of Dataset when filelist size < thread num
(8) support set_queue_num in InMemoryDataset
6 years ago
liuwei1031
5d54ed4a84
improve the doc of DataFeeder and default_main_program ( #18241 )
...
* improve the doc of DataFeeder and default_main_program
* update API.spec, test=develop
6 years ago
xiaoting
b58bb80248
set src_idx > 0 for bilinear_interp_op ( #18238 )
...
* set src_idx > 0, test=develop
* add unittest and cu, test=develop
6 years ago
wopeizl
daa32d5383
fix package generation for inference test=develop ( #18220 )
6 years ago
Shuai Yuan
9a32dad811
[DOC] Fix comment code of API create_py_reader_by_data ( #18193 )
...
* [DOC] Fix comment code of API create_py_reader_by_data. test=develop, test=document_preview
* Fix code style of API comment. test=develop,test=document_preview
* update api spec of api create_py_reader_by_data
* remove default config code. test=develop
* remove useless code. test=develop
* update create_py_reader_by_data api. test=develop
6 years ago
Hongyu Liu
cefd0fb598
Fix slice op shape=-1 bug ( #18107 )
...
* fix slice op bug; test=develop
* fix variable test bug; test=develop
* remove slice while true; test=develop
6 years ago
lijianshe02
ff4279e3b2
fix paddle.fluid.layers.io.open_files api doc bug test=develop ( #18203 )
...
* fix paddle.fluid.layers.io.open_files api doc bug test=develop
6 years ago
chengduo
5588b923f3
Add multi process reader ( #18115 )
...
* add multi process reader
test=develop
6 years ago
wangchaochaohu
a9dc534f48
fix API example ( #18153 )
...
* API.spec test=develop
* update
* update test=develop
* update test=develop
* update
* update test=develop
* update test=develop
* update test=develop
* update test=develop
* update test=develop
* test=develop
* update
* update test=develop
* update test=develop
* fix test=develop
6 years ago
翟飞跃
de42fe8fd5
Change int8v2 CAPI unit test name and add log in the prediction stage ( #18200 )
...
* fix issue 18111;test=develop
* fix timer;test=develop
* refine code;test=develop
6 years ago
翟飞跃
802ea50956
fix spelling errors ( #17941 )
...
* fix spelling errors; test=develop
* Update API.spec
update md5
* Update API.spec
* change the order of api;test=develop
6 years ago
zhoukunsheng
0569ff78fa
Fix doc example for greater_equal, greater_than, less_equal, not_equal, rank, reduce_all, reduce_any, sign, where, diag ( #18167 )
...
* test=develop
fix greater_than, greater_equal, less_equal, not_equal, rank, reduce_all, reduce_any, sign, where, diag doc example
* test=develop
fix API.spec conflict
6 years ago
Huihuang Zheng
bbc292920c
Fix API example code ( #18176 )
...
The fixed APIs:
6 Methods in paddle.fluid.io.PyReader
paddle.fluid.layers.Preprocessor
paddle.fluid.layers.py_reader
paddle.fluid.io.save_params
paddle.fluid.io.save_persistables
test=develop
test=document_preview
6 years ago
翟飞跃
78441c5449
add mkldnn Int8v2 slim doc ( #17909 )
6 years ago
lvmengsi
d658f1133b
Fix doc for transpose, conv3d and batch_norm. ( #18035 )
...
* update some op doc, test=develop
6 years ago
FlyingQianMM
944c3165ec
fix type error of std::pow in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h ( #18152 )
...
* test=develop
fix type error of std::pow in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h
* test=develop
fix wrong code style in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h
6 years ago
chengduo
25f3cd6486
Update execution_strategy option default value ( #18183 )
...
* update execution_strategy option default value
test=develop
* fix doc error
test=develop
6 years ago
chengduo
4978db2c10
Remove nccl dep when the number of GPU is 1 ( #18158 )
...
* remove nccl dep when the number of GPU is 1
test=develop
6 years ago
Zeng Jinle
25ab23be28
Fix dygraph mem leak ( #18082 )
...
* fix dygraph mem leak, test=develop
* polish msg, test=develop
6 years ago
tensor-tang
1c6e560607
core replace x86cpu with py cpuinfo ( #18151 )
...
test=develop
6 years ago
Zeng Jinle
6eec66a1b1
Fix py_reader iterable bug ( #18108 )
...
* fix py_reader iterable bug, test=develop
* move data from buffered_reader,test=develop
6 years ago
qingqing01
80d2e66f9e
Update backward appending strategy to support double backward and fix some bugs. ( #18104 )
...
* Update backward.py:
- If there is no input grad var in all outputs of previous ops, do not append this op into graph.
- Only apply this strategy when double backward.
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
6 years ago
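The appending rule from the commit above (skip a grad op when none of its incoming gradient variables has been produced) can be sketched in plain Python. This is not the code in backward.py; the tuple layout, the function name `append_backward_ops`, and the `@GRAD` variable names are hypothetical and chosen only to make the rule concrete.

```python
def append_backward_ops(grad_ops, seed_grads):
    """Keep only grad ops that receive at least one available gradient.

    grad_ops:   list of (op_name, grad_inputs, grad_outputs), in the order
                the backward pass would visit them (hypothetical layout).
    seed_grads: names of gradient variables that exist before any grad op runs.
    """
    available = set(seed_grads)
    kept = []
    for op_name, grad_inputs, grad_outputs in grad_ops:
        # No incoming gradient was produced upstream: do not append this op.
        if not any(name in available for name in grad_inputs):
            continue
        kept.append(op_name)
        # Gradients produced here become available to later grad ops.
        available.update(grad_outputs)
    return kept
```

In this sketch a grad op that would only consume gradients nobody produced is dropped, and so is everything that depends solely on its outputs.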
Wojciech Uss
ca5642c850
unify FP32 vs. INT8 comparison tests output ( #18111 )
...
test=develop
6 years ago
Wojciech Uss
c26130f3a9
reuse C-API INT8 unit test application ( #18077 )
...
* reuse C-API INT8 unit test application
test=develop
* updates after review
test=develop
6 years ago
FlyingQianMM
ff83655f7e
add detection output operator for supporting retinanet ( #17896 )
...
* test=develop
add detection output for supporting retinanet
* test=develop
add test_layers.py
* test=develop
add API.spec
* test=develop
alter test_retinanet_detection_output.py
* test=develop
alter round 2
* test=develop
alter retinanet_detection_output
* test=develop
alter paddle/fluid/API.spec
* test=devlop
alter detection.py
* test=develop
alter retinanet_detection_output
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter detection.py
* test=develop
alter API.spec
* test=develop
alter retinanet_detection_output
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
* test=develop
fix grammar error
* test=develop
fix grammar error
* test=develop
fix grammar error
* test=develop
alter python/paddle/fluid/tests/unittests/test_layers.py
* test=develop
alter paddle/fluid/API.spec
6 years ago
FlyingQianMM
0aee1f0074
add sigmoid focal loss operator for supporting retinanet ( #17895 )
...
* test=develop
add sigmoid_focal_loss for supporting retinanet
* test=develop
add test_layers
* test=develop
add API.spec
* test=develop
alter sigmoid_focal_loss_op.cc
* test=develop
alter detection.py
* test=develop
alter API.spec
* test=develop
alter round 1
* test=develop
alter sigmoid_focal_loss
* test=develop
alter sigmoid_focal_loss_op.cc
* test=develop
alter test_layers.py
* test=develop
alter paddle/fluid/API.spec
* test=develop
alter sigmoid_focal_loss_op.cu
* test=develop
alter paddle/fluid/operators/detection/sigmoid_focal_loss_op.cc
6 years ago
FDInSky
9e4b9d9798
Update generate_proposal_labels_op to support CascadeRCNN. ( #17200 )
...
* Update generate_proposal_labels_op to support CascadeRCNN.
6 years ago
FlyingQianMM
9ed2f936f1
add target assign operator for supporting retinanet ( #17893 )
...
* test=develop add target assign for retinanet
* test=develop
run ci
* test=developp
add test_layers
* test=develop
add API.spec
* test=develop
alter round 1
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter test_rpn_target_assign_op.py
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter API.spec
* test=develop
alter paddle/fluid/operators/detection/rpn_target_assign_op.cc
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter python/paddle/fluid/layers/detection.py
* test=develop
alter paddle/fluid/API.spec
6 years ago
Huihuang Zheng
7faf095618
Sync Dockerfile change of PR#17889 ( #18072 )
...
Jian Tang made change on latest-dev Dockerfile, so sync the change
in the cuda9/10 Dockerfile
test=develop
6 years ago
Sylwester Fraczek
accb132f0f
fix slim int8 mkldnn multithreading issue ( #18009 )
6 years ago
tianshuo78520a
2e1d8cf7c8
add approval to requirements.txt
...
add luotao to approval requirements.txt
6 years ago
chengduo
24e988a471
Fix bug of scope_buffered_ssa_graph_executor ( #18100 )
...
* fix code bug
test=develop
6 years ago
Huihuang Zheng
3f55ab0f89
Modify format of GPU allocation failure log. ( #18034 )
...
As title
test=develop
6 years ago
gongweibao
f5caf3443c
Fix reinitialized ncclid error! ( #18025 )
6 years ago
whs
354643d8d9
Add warning for cudnn warpctc kernel in CUDA9/CUDA10. ( #18046 )
...
test=develop
6 years ago
qingqing01
e81756f1ba
Hide paddle.fluid.layers.detection_map. ( #18033 )
...
* Remove layers.detection_map API
* Users can use fluid.metrics.DetectionMAP to calculate both current-batch and cumulative-batch mAP, whereas layers.detection_map can only calculate current-batch mAP.
6 years ago
Yiqun Liu
660c1a65f3
Optimize fused_elewise_activation_grad op. ( #18041 )
...
test=develop
6 years ago
lidanqing
466254151a
add Mobilenet ssd int8 analyzer tester ( #18075 )
...
* add pascalvoc preprocess script and mobilenet-ssd analyzer_tester, wait 17737
* change converting local dataset to downloading and converting tarfile
test=develop
* change the test data_path
test=develop
* change copyright (c) 2016 to copyright (c) 2019
test=develop
6 years ago
石晓伟
42f12a4aca
fix ci test cmake test=develop ( #18060 )
6 years ago
chengduo
b5a1c1463d
Update CPU_NUM config ( #18059 )
...
* update CPU_NUM config
test=develop
6 years ago
lidanqing
f8ecc3de89
refactor the function ConvFwdPrimitiveDesc ( #17897 )
...
* refactor the function ConvFwdPrimitiveDesc
test=develop
* change according to review
test=develop
* use pointer way without boost::optional
test=develop
* pass vector to function by reference instead of raw vector
test=develop
* change pointer to shared_ptr
test=develop
6 years ago
Michał Gallus
8462e2b805
Disable MKLDNN FC in Resnet50 test ( #18030 )
6 years ago
Wojciech Uss
78e932862c
Added unit test for QAT FP32 & INT8 comparison ( #17814 )
...
* added unit test for QAT FP32 & INT8 comparison
test=develop
* enabled other models and updated filenames
test=develop
* added accuracy check and multiple batch handling
test=develop
* removed quantization_mkldnn_pass.py
test=develop
* cleanup
test=develop
* updated model paths
test=develop
* renamed tests without MKL-DNN
test=develop
* fix reusing mkldnn pool2d primitive
test=develop
* add performance measuring
test=develop
* fix accuracy statistics
test=develop
* removed non-mkldnn tests
test=develop
* added conv2d_depthwise->conv2d mkldnn transformation
test=develop
* format update
test=develop
* fixed creating key for pool2d grad
test=develop
* added pass
* Fix the accuracy issue while using float precision to get the scale.
test=develop
* Fix the format issue when 'X' is not nchw.
test=develop
* removed output comparing and changed number of images
test=develop
* cmake and comment fix
test=develop
* updated acc threshold for QAT comparison tests
test=develop
* added OMP_NUM_THREADS setting
test=develop
* enable all QAT INT8 tests
test=develop
* restored upstream version of a file
test=develop
* modified directory names
test=develop
6 years ago
tensor-tang
566bf2ec56
concat op support negative axis ( #18045 )
...
test=develop
6 years ago
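Supporting a negative axis usually comes down to normalizing it against the tensor rank before use. The following is a minimal sketch of that convention on nested Python lists, not the Paddle concat kernel; the helper names `normalize_axis`, `_rank` and `concat` are assumptions, and inputs are assumed to be non-empty and of equal rank.

```python
def normalize_axis(axis, rank):
    """Map a possibly negative axis onto the range [0, rank)."""
    if not -rank <= axis < rank:
        raise ValueError(f"axis {axis} is out of range for rank {rank}")
    return axis + rank if axis < 0 else axis


def _rank(tensor):
    """Nesting depth of a non-empty nested list."""
    depth = 0
    while isinstance(tensor, list):
        depth += 1
        tensor = tensor[0]
    return depth


def concat(tensors, axis=0):
    """Concatenate nested lists of equal rank along the given axis."""
    axis = normalize_axis(axis, _rank(tensors[0]))
    if axis == 0:
        # Axis 0: simply chain the outermost entries.
        return [item for tensor in tensors for item in tensor]
    # Otherwise pair up matching sub-tensors and recurse one level down.
    return [concat(list(parts), axis - 1) for parts in zip(*tensors)]
```

Under this convention `axis=-1` always refers to the innermost dimension, regardless of rank.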
Yiqun Liu
7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. ( #17979 )
...
test=develop
6 years ago