lidanqing
d7c5c2bd64
Add input format in Transpose GetHash ( #17737 )
...
* fix the bug of mobilenet-ssd INT8 inference without overloading GetHash
test=develop
* remove the out_grad->format() in TransposeMKLDNNGradOpKernel
test=develop
6 years ago
tangwei12
659b72a97c
fix document of python api get_startup_program() ( #17764 )
...
* add example to get_startup_program()
* fix example to get_startup_program()
6 years ago
AIFollowers
93de124cec
modify some initializer api ( #17301 )
...
* test=develop modify some initializer api
* test=develop modify API.spec
* test=develop modify API.spec
* test=develop modify API.spec
* test=develop modify API.spec
6 years ago
guru4elephant
d52391094d
fix prepare context redundant code problem, optimize executor by cach… ( #17743 )
...
* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop
* cache sub_scope, program, var when use_program_cache=True is set
* make fetch_list runable with variables, add more unittest for use_program_cache
6 years ago
baojun
2c58f1a83c
[NGraph] Added lookup table to ngraph engine test=develop ( #17647 )
6 years ago
pawelpiotrowicz
bacc822492
[NGraph] Enable transpose ngraph operator ( #17636 )
...
test=develop
6 years ago
lujun
ed9d603a8a
fix api doc: Optimizer.ModelAverage ( #17395 )
6 years ago
baojun
90eae0b39a
[NGraph] Addded slice op to ngraph test=develop ( #17648 )
6 years ago
baojun
2fbaa5c075
[NGraph] added matmul op to ngraph engine test=develop ( #17645 )
6 years ago
hong19860320
68dcb1bd7b
fix API examples of assign, reverse and array_write, etc. ( #17287 )
...
* fix API examples of assign, reverse and array_write
test=develop
* update API.spec
test=develop
* update API examples for array_length, array_read, array_write, assign, hard_sigmoid, hsigmoid, increment, ones, pow, reverse, uniform_random and zeros
* update API.spec for assign, reverse and array_write, etc.(#17287 )
* test=develop
6 years ago
tianshuo78520a
f144740b73
change ci ctest exit code ( #17745 )
6 years ago
chengduo
67c8dade58
Add Event in ScopeBuffer Executor ( #17667 )
...
* add event for fast executor and add threads for scopebuffer executor
test=develop
6 years ago
Bai Yifan
bba57cdd82
Add deformable conv v2 op,test=develop ( #17145 )
...
* unit commits, test=develop
* update API.spec, test=develop
6 years ago
wangchaochaohu
bd48950c7e
fix paddlepaddle API examples ( #17306 )
...
* API.spec test=develop
* update
* update test=develop
* update test=develop
* update
* update test=develop
* update test=develop
* update test=develop
* update test=develop
* update test=develop
* test=develop
* update
* update test=develop
* update test=develop
6 years ago
YishengCheng
bd15912d65
fix bug for ctr_reader for svm data ( #17575 )
...
* fix bug for ctr_reader test=develop
* fix svm data test=develop
fix svm data test=develop
6 years ago
Yiqun Liu
8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers ( #17236 )
...
* Enhance fused_elementwise_activation op.
test=develop
* Move the api fused_elementwise_activation to contrib.
test=develop
* Add including files.
test=develop
* Add the support of sigmoid in fused_elementwise_activetion op.
* Update API.spec.
test=develop
6 years ago
yaoxuefeng
ac92e4c066
fix distributed_transpiler.py api test=develop ( #17668 )
6 years ago
Yiqun Liu
2704479bb2
Optimize recurrent_op using Prepare and RunPreparedContext, avoiding create operators in every iter. ( #17689 )
...
test=develop
6 years ago
pawelpiotrowicz
9b99876442
Enable less_than ngraph operator ( #17642 )
...
* Enable less_than ngraph operator
test=develop
* Added compare unit-tests test=develop
* Update: date && removed import test=develop
6 years ago
Zhaolong Xing
a9a531fa5f
Refine python api code example note: ( #17369 )
...
* fix:
1. infernce multi card occupy
2. facebox model inference occupy too much
test=develop
* refine python api comments: shuffle, while, scale, sampled_softmax_with_cross_entropy, scatter,
round, sin, sqrt, shape, split, soft_relu, slice, selu, ifelse, switch.
test=develodp
* fix conflict error.
test=develop
6 years ago
Jiabin Yang
effc555955
test=develop, layz init Grad ( #17653 )
6 years ago
hutuxian
4ff87c049d
remove useless input 'Softmax@GRAD' from softmax_with_cross_entropy op ( #17612 )
6 years ago
Tao Luo
b4b169467b
add fc_mkldnn_pass in compare_mkldnn ( #17712 )
...
test=develop
6 years ago
pawelpiotrowicz
70a887af63
[NGraph] Add reduce_sum operator for Ngraph ( #17450 )
...
test=develop
6 years ago
baojun
29baca0dd8
add depthwise_conv2d op to ngraph engine ( #17454 )
...
* add depthwise_conv2d test=develop
* use cpu for ngraph test=develop
6 years ago
gongweibao
0d561ef442
fix 2dconn test=develop ( #17681 )
6 years ago
mozga-intel
ccf9e2327b
[Lite] Enable cast operator test=develop ( #17294 )
6 years ago
tangwei12
0d3c48e0a8
fix doc in transpiler, test=develop ( #17313 )
...
* fix doc in transpiler, test=develop
6 years ago
Hongyu Liu
9f85f21880
Add new gard clip [old gradient clip not support in dy graph] ( #17523 )
...
* add gradient clip in minimize; test=develop
* fix bug; test=develop
* fix format; test=develop
* move new grad clip to dygraph/grad_clip.py; test=develop
* fix lr decay and grad clip test; test=develop
* seperate dygraph grad clip; test=develop
* fix grad clip test; develop
* fix api spec bug; test=develop
* add blank line, test=develop,test=document_preview
to fix format problem
6 years ago
Zhaolong Xing
4337009b92
fix trt ci timeout error ( #17701 )
...
test=develop
6 years ago
mozga-intel
5eb81fe595
Capi for a ngraph engine ( #17037 )
6 years ago
Yiqun Liu
5782dddad0
Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 ( #17415 )
...
* Optimize the concat and split kernel for special cases that the number of inputs/outputs is 2.
test=develop
* Refine codes.
test=develop
* Correct the condition.
test=develop
* Move the define of tmp_data outside the if statement.
* Print the cudnn minor version.
test=develop
* Fix the case when in_num/o_num is 1 in concat/split op.
test=develop
* Remove const_cast.
test=develop
6 years ago
石晓伟
acbb4bf38d
update python API examples ( #17351 )
...
* update python APIs
test=document_preview test=develop
* update API.spec
test=document_preview
test=develop
* update merge_selected_rows
* update API.spec
test=document_preview
test=develop
* update API.spec
test=document_preview
test=develop
* fix the comment of less_than
test=develop
test=document_preview
* update API.spec
test=develop
test=document_preview
* update API.spec
test=develop
test=document_preview
* update API.spec
test=develop
test=document_preview
* update API.spec
test=develop
* update API
test=develop
6 years ago
Jiabin Yang
7a401da52f
test=develop, fix mac ci will not uninstall dependency files when error occurs ( #17688 )
6 years ago
lidanqing
04b6c29ee0
Improve mobilenetv2 INT8 performance by using INT8 relu as post-op ( #17570 )
...
* add INT8 conv+relu6 fuse and enbale mobilentv2 INT8 test
test=develop
* change fasle and 0.0 to fuse_brelu and brelu_threshold
test=develop
change the "fuse_relu||fuse_brelu" to "unsigned_output"
test=develop
* Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18
test=develop
* continuous-integration fix
test=develop
6 years ago
Jacek Czaja
6d8075ecef
[MKL-DNN] conv_transpose mkldnn bias pass ( #17644 )
...
* - changes to graph detector
- Changes to pass
- Added ut for new pass
- use_pass
- Added pass to mkldnn passes
- fix to registration
- improved verbose messaging for conv bias passes
- Lint fixes
test=develop
* - Lint fixes
test=develop
6 years ago
Shuai Yuan
41f1186c6b
[DOC][PYTHON] Fix api docs, test=develop, test=document_preview ( #17629 )
...
* [DOC] Fix api docs, test=develop, test=document_preview
* [DOC] Fix api annotation: fluid.layers.tensor_array_to_tensor. test=develop, test=document_preview
* test=develop, test=document_preview
update MD5 of tensor_array_to_tensor
6 years ago
wopeizl
058f1f1e1b
fix the api example for create_global_var, create_parameter, SGDOptim… ( #17371 )
...
* fix the api example for create_global_var, create_parameter, SGDOptimizer, RMSPropOptimizer, MomentumOptimizer, LarsMomentumOptimizer, FtrlOptimizer
test=develop
* add example for adamoptimizer
fix API.spec
test=develop
* test=develop
* test=develop
6 years ago
Yibing Liu
4f4f0993c1
Bias correction for exponential moving average ( #17677 )
...
* Bias correction for exponential moving average
test=develop, test=document_preview
* Fix docs
test=develop, test=document_preview
6 years ago
Tao Luo
962eed6f82
Revert "Enable SQRT operator for the nGraph Bridge ( #17549 )" ( #17680 )
...
This reverts commit f34830e2aa
.
6 years ago
Tao Luo
67a6297a9f
update unique_name notes and examples ( #17671 )
...
test=develop
6 years ago
Krzysztof Binias
f34830e2aa
Enable SQRT operator for the nGraph Bridge ( #17549 )
...
* Enable sqrt operator for the nGraph Bridge.
test=develop
* Update activation_op.h
6 years ago
Sylwester Fraczek
96845d2168
add Concat quantization ( #17448 )
...
* add Concat quantization
add unit test for quantizing concat
fix for wrong value when the input is not in map of calculated scales
add use_quantizer to concat_op.cc
add scale_algo rules for concat
test=develop
* missing fix for multiple inputs quantize-squash
* wojtuss review fix: adding comment
test=develop
6 years ago
Zeng Jinle
432ac70124
clean code of py_layer in dygraph mode,test=develop ( #17661 )
6 years ago
gongweibao
65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. ( #17263 )
6 years ago
Krzysztof Binias
b1bd483a7d
[NGraph] Enable gelu operator for the nGraph Bridge. ( #17547 )
...
test=develop
6 years ago
Zhen Wang
8bd651b7ed
Fix the bug in the AnalysisPredictor and add more directions about io APIs. ( #17639 )
...
* fix the bug that sub_scope_ may be null in AnalysisPredictor::Run.
* add more directions about io APIs' docs.
* update the API.spec. test=develop test=document_preview
6 years ago
chengduo
343017324e
Polish Print Op ( #17651 )
...
* enhance print
6 years ago
Zeng Jinle
4aa931dd85
Code clean of Allocator ( #17602 )
...
* Revert "Revert "Fix allocator bug""
This reverts commit 174d0d0b90
.
* Revert "fix travis ci"
This reverts commit 5656fa9f7c
.
test=develop
* add inlined_vector.h, test=develop
* add inlined_vector_test,test=develop
* clean code of allocator,test=develop
* delete zero_size_allocator.h,test=develop
* fix failed unittest,test=develop
6 years ago
Guo Sheng
430e25654b
Fix the usage of out_grad lod in sequence_slice_op. ( #17625 )
...
test=develop
6 years ago
Huihuang Zheng
afc3d85da2
Remove Docker build for CI tasks ( #17650 )
...
* Add Dockerfile for cuda9 and cuda10
6 years ago
Bai Yifan
bbd6e438fc
fix conflicts,test=develop ( #17186 )
6 years ago
bdzhuxiaoning
9f85afb7b6
test=develop ( #17643 )
6 years ago
chengduo
9322216170
Add data distributed_sampler ( #17573 )
...
* add data parallel batch
6 years ago
hutuxian
1670db5e86
Gather Op Index Support int64_t datatype ( #17610 )
...
* gather_op support int64_t index by adding a template typename
* add UT and rename typename
test=develop
6 years ago
Huihuang Zheng
febc07f047
Add Dockerfile for cuda9 and cuda10 ( #17600 )
...
* Add Dockerfile for cuda9 and cuda10
Add Dockerfile for building cuda9 cuda10 images.
6 years ago
mozga-intel
2b83d75bfa
Enable elementwise pow operator for ngraph ( #17526 )
6 years ago
Zhaolong Xing
61221ebc28
TRT: Support set dynamic range in int8 mode. ( #17524 )
...
* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter
* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.
* 3. add delete_quant_dequant_pass for trt
test=develop
* 4. add the missing file
test=develop
* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop
6 years ago
Michał Gallus
0c39b97b4e
[MKL-DNN] Add Fully Connected Op for inference only( #15226 )
...
* fuse mul and elementwise add to fc
* Reimplement the FC forward operator
* Fix FC MKLDNN integration by transposing weights
* Add FC MKLDNN Pass
test=develop
* FC MKLDNN Pass: change memcpy to std::copy
* Fix MKLDNN FC handling of mismatch input and weights dims
* Lower tolerance for MKL-DNN in resnet50 test
test=develop
* Adjust FC to support MKLDNN Op placement
test=develop
* Adjust Placement Op to set use_mkldnn attribute for graph
test=develop
* MKLDNN FC: fix weights format so that gemm version is called
test=develop
* FC MKLDNN: Remove tolerance decrease from tester_helper
* FC MKL-DNN: Refactor the code, change input reorder to weight reorder
* MKL-DNN FC: Introduce operator caching
test=develop
* FC MKL-DNN: Fix the tensor type in ExpectedKernelType
test=develop
* FC MKL-DNN: fix style changes
test=develop
* FC MKL-DNN: fallback to native on non-supported dim sizes
test=develop
* FC MKLDNN: fix CMake paths
test=develop
* FC MKLDNN: Refine placement pass graph mkldnn attribute
test=develop
* Fix Transpiler error for fuse_conv_eltwise
test=develop
* Fix missing STL includes in files
test=develop
* FC MKL-DNN: Enable new output size computation
Also, refine pass to comply with newest interface.
test=develop
* FC MKL-DNN: enable only when fc_mkldnn_pass is enabled
* FC MKL-DNN: Allow Weights to use oi or io format
* FC MKL-DNN: Adjust UT to work with correct dims
test=develop
* Enable MKL DEBUG for resnet50 analyzer
test=develop
* FC MKL-DNN: Improve Hashing function
test=develop
* FC MKL-DNN: Fix shape for fc weights in transpiler
* FC MKL-DNN: Update input pointer in re-used fc primitive
* Add log for not handling fc fuse for unsupported dims
test=develop
* FC MKL-DNN: Move transpose from pass to Op Kernel
test=develop
* FC MKL-DNN: Disable transpose in unit test
test=develop
* FC MKL-DNN: Remove fc_mkldnn_pass from default list
* Correct Flag for fake data analyzer tests
test=develop
* FC MKL-DNN: Add comment about fc mkldnn pass disablement
test=develop
* FC MKL-DNN: Disable fc in int8 tests
test=develop
6 years ago
wopeizl
6724a652f3
add __str__ method for tensor and lodtensor to support print test=dev… ( #17588 )
...
* add __str__ method for tensor and lodtensor to support print test=develop
6 years ago
Krzysztof Binias
e9216d0602
Enable logical operators for the nGraph Bridge. ( #17543 )
...
test=develop
6 years ago
Hongyu Liu
cbaf9e5344
Fix api example [ lstm, sequence_enumerate, sequence_expand,sequence_expand_as ] ( #17210 )
...
* fix example; test=develop
* fix api spec; test=develop
* fix api spec; test=develop
* add doc check
test=develop
test=document_preview
* test=develop,test=document_preview
add blank line to fix format, add one more "import"
* fix bug; test=develop
* fix bug; test=develop
6 years ago
guru4elephant
326bf8291a
add Run Prepared Ctx ( #17616 )
...
add Run Prepared Ctx, fix pybind problem
6 years ago
Yibing Liu
e8990e64f6
Fix trust ratio in lamb ( #17614 )
...
test=develop
6 years ago
Guo Sheng
2a7b321110
Fix the example code in some Python API. ( #17343 )
...
* Fix the example code in some Python API.
test=develop
* Fix the example code in some Python API by adding import.
test=develop
6 years ago
chengduo
b5f4d5ed0e
Add broadcast operators ( #17503 )
...
* This PR adds broadcast for multi-process. And it could be used in dynamic graph to broadcast parameters.
6 years ago
flame
2280f185d7
BuildStrategy api comment ( #17348 )
...
Python examples of fluid.layers.io.double_buffer and some BuildStrategy's methods.
6 years ago
Sylwester Fraczek
5b2a3c4b12
Conv concat relu quantization ( #17466 )
...
* add conv_concat_relu fuse
test=develop
* add test code
test=develop
* added missing include with unordered_map
test=develop
* review fixes for wojtuss
test=develop
* remove 'should (not) be fused' comment statements
one of them was invalid anyway
test=develop
6 years ago
Sylwester Fraczek
bccb0ba49a
fix quantize_squash_pass segfault when no tensor linked to Bias ( #17292 )
...
* fix quantize_squash_pass segfault when there is no tensor linked do Bias input
test=develop
* add googlenet test
test=develop
* fix concat CreateKey not using input format
test=develop
6 years ago
chengduo
2dc1c6f25c
Add profiler in tracer ( #17076 )
...
* add profiler in tracer.cc
* add profiler in layer.cc
test=develop
* add profiler in Layer.cc
test=develop
6 years ago
mozga-intel
0d4cbdad91
[NGraph] Enable elementwise mul operator ( #17552 )
6 years ago
tianshuo78520a
cee9dcc383
Delete LoDTensorset in API.spec ( #17577 )
...
* test=develop
* test=develop
* test=develop
* del #
6 years ago
mozga-intel
f2694e122d
[NGraph] Enable assign operator for a ngraph, test=develop ( #17437 )
...
* Enable assign operator for a ngraph, test=develop
* Cross_entropy operators needs to be updated
6 years ago
mozga-intel
cf02cb5e98
Enable elementwise sub operator for ngraph ( #17527 )
6 years ago
guru4elephant
7f8bc49d00
polish_executor_and_add_ctx_cache ( #17536 )
...
* polish_executor_and_add_ctx_cache
6 years ago
tensor-tang
7ae461eb13
[CPU] refine cpu softmax bwd ( #17534 )
...
* refine softmax fwd
test=develop
* refine cpu softmax bwd
test=develop
* fix batch size
test=develop
* fix compile issue with gpu
test=develop
* add value clip
6 years ago
Yibing Liu
6e11f97708
Add exponential moving average ( #17562 )
...
* Add exponential moving average
test=develop, test=document_preview
* Polish documents
test=develop, test=document_preview
* Update API spec
test=develop, test=document_preview
6 years ago
tensor-tang
0600b370ea
[CPU] refine softmax op fwd on CPU ( #17522 )
...
* refine softmax fwd
test=develop
* fix compile issue wih gpu
test=develop
* add value clip to avoid exp
6 years ago
Zeng Jinle
c6189637cd
Fix allocator bug ( #16712 )
...
* Revert "Revert "Fix allocator bug""
This reverts commit 174d0d0b90
.
* Revert "fix travis ci"
This reverts commit 5656fa9f7c
.
test=develop
* add inlined_vector.h, test=develop
* add inlined_vector_test,test=develop
6 years ago
mozga-intel
035771512d
Enable elementwise min operator for ngraph ( #17521 )
6 years ago
Kaipeng Deng
cf60e5a2db
fix API python example ( #17226 )
...
* fix api example. test=develop
* fix API.spec. test=develop
* fix spectral_norm format. test=develpp
* merge develop
* add import. test=develop
* fix indent. test=develop
* fix indent. test=develop
* add import fluid. test=develop
6 years ago
Qiao Longfei
92e7d5d7cc
fix distribute doc test=develop ( #17318 )
...
* fix distribute doc
6 years ago
jerrywgz
c1aae8b8d2
Fix GetExpectedKernelType in Concat op ( #17459 )
...
* fix concat op vartype check, test=develop
6 years ago
Qiao Longfei
58f7695ab2
Async exe support communicator ( #17386 )
...
Async exe support communicator
6 years ago
Zhaolong Xing
38da103034
fix trt ci bug temporary. ( #17565 )
...
ban all trt ut. will fix it later.
test=develop
6 years ago
mozga-intel
109b5aed5a
[NGraph] Enable reshape operator test=develop ( #17512 )
6 years ago
zhang wenhui
9bb6a421e3
fix bpr_loss data_norm teacher_student_sigmoid_loss api & fix continuous_value_model ( #17331 )
...
* fix bpr data_norm teacher_student_sigmoid , test=develop test=document_preview
修复了bpr_loss data_norm teacher_student_sigmoid_loss三个api, 同时修复了continuous_value_model文档英文拼写错误
6 years ago
lijianshe02
300bd7504d
fix api-doc related bugs test=develop test=document_preview ( #17360 )
...
* fix api doc according to the reviewer's comment test=develop
6 years ago
lijianshe02
daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… ( #17555 )
...
* modify saved model path in analyzer_save_model.cc test=develop
6 years ago
Krzysztof Binias
43d15b9d96
Enable square operator for the nGraph Bridge. ( #17551 )
...
test=develop
6 years ago
Sevin F. Varoglu
f86f49e779
[NGraph] add increment op to ngraph engine ( #16929 )
...
* add increment op to ngraph engine
test=develop
* fix style errors
test=develop
6 years ago
baojun
8923612b10
NGraph enable parse serialized graph test=develop ( #17453 )
6 years ago
Yiqun Liu
cf5d271c5a
Fix examples of fluid.layers.sums and fluid.layers.DynamicRNN ( #17308 )
...
* Fix examples of fluid.layers.sums.
test=document_preview
* Correct the example of DynamicRNN and its functions.
test=develop
* Add 'import paddle.fluid as fluid' to examples.
test=develop
* Update API.spec.
test=develop
* Add space lines.
test=develop
* Update the API.spec.
test=develop
6 years ago
guomingz
2281ebf0f3
Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. ( #17130 )
...
* Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.
Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280
test=develop
* Fix the format issue
test=develop
* Add the missing nolint comments.
test=develop
* Fix the typos.
test=develop
* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
test=develop
* Adjust the indentation.
test=develop
* Add the test_conv_brelu_mkldnn_fuse_pass case.
test=develop
* Slightly update the code per Baidu comments.
Let the parameter definition embedded into the code.
That's will make the code easy to understand.
test=develop
6 years ago
Yibing Liu
f9796b1249
Add LAMB Optimizer support ( #17489 )
...
* Add LAMB optimizer
* Expose LAMB Optimizer's APIs
test=develop, test=document_preview
* Cleanup code & doc
test=develop, test=document_preview
* Update lamb optimizer's formula
test=develop
6 years ago
mozga-intel
99ab57123c
Enabled ngraph elementwise max operator ( #17517 )
6 years ago
Tao Luo
3d19f44a89
remove unused SERIAL compiler option ( #17500 )
...
test=develop
6 years ago
zhaoyuchen2018
dfdcd91869
Add api doc code examples ( #17285 )
...
* Add api doc code examples
add or fix topk, squeeze, stack, StaticRNN,
StaticRNN memory in doc
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Add squeeze md5.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Add import package
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
mozga-intel
1eb151752e
Enable abs operator for a ngraph test=develop ( #17436 )
6 years ago
lidanqing
36757ed203
Enabling resnet101, vgg16, vgg19 INT8v2 model tests ( #17468 )
...
* Add 6 models tests support in CMake
* enabling resnet101, vgg16, vgg19 INT8v2 model tests
test=develop
* remove SERIAL
test=develop
6 years ago
liuwei1031
ba70cc499e
fix security bugs : ( #17464 )
...
http://newicafe.baidu.com:80/issue/PaddleSec-33/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-28/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-25/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-24/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-21/show?from=page
http://newicafe.baidu.com:80/issue/PaddleSec-20/show?from=page
test=develop
6 years ago
Zhaolong Xing
ff7f911b4d
add quant_dequant_moving_avg_max_abs op ( #17480 )
...
* add quant_dequant_moving_avg_max_abs op
test=develop
* add more note for quantdequant op
test=develop
6 years ago
Qiao Longfei
287de41c04
Optimize communicator flags ( #17494 )
...
* optimize communicator flag
* change flags in init py test=develop
6 years ago
liuwei1031
c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop ( #17491 )
6 years ago
liuwei1031
f82e4d75e7
improve the doc of paddle.fluid.memory_optimize, test=develop ( #17473 )
...
* improve the doc of paddle.fluid.memory_optimize, test=develop
* fix typo, test=develop
6 years ago
Tao Luo
32da5e9c3d
remove unused expected_kernel_cache_pass ( #17486 )
...
test=develop
6 years ago
wopeizl
ca3ba378c7
fix the random compilation failure on windows test=develop ( #17475 )
...
* fix the random compilation failure on windows
6 years ago
lvmengsi
10b23a72c1
Double backward elementwise div ( #17416 )
...
* double backward, elementwise_div
* fix dx empty. test=develop
* bug fix (#17392 )
fix secure bug
* Eanble stack operator for a Ngraph, test=develop (#17406 )
* fix sqrt_grad_grad unittest. test=develop (#17410 )
* fix sqrt_grad_grad unittest. test=develop
* disable sqrt_grad_grad unittest. test=develop
* test=develop, fix unittest
* test=develop, fix unittest
* test=develop, fix unittest
* test=develop, fix bug
* fix unittest. test=develop
* fix unittest dx. test=develop
* tmp fix! for test... test=develop
* reduce tmp, test=develop
* test=develop, reduce tmp
* fix broadcast unittest. test=develop
* fix format. test=develop
* refine code. test=develop
* refine code. test=develop
* refine GetDoubleGradSafeTensor. test=develop
* fix format. test=develop
6 years ago
qingqing01
97f0ec2357
Fix compiling error with cuDNN 5.1 ( #17458 )
...
test=develop
6 years ago
Zeng Jinle
3d4e8268c6
fix recurrent fwd bug when no backward and scope clear ( #17460 )
6 years ago
lvmengsi
977e9fcb27
support elementwise_sub double backward ( #17476 )
...
add elementwise_sub_grad_grad op for backward of backward calculation
6 years ago
jiaqi
75cda4d9df
fix data_feed_desc.py example run error ( #17452 )
...
* fix data_feed_desc.py example run error
test=develop
test=test=document_preview
* fix data_feed_desc.py example display error
test=develop
test=document_preview
* update API.spec for DataFeedDesc
test=develop
test=document_preview
6 years ago
chengduo
5a6ab38013
Add record event And remove CSP ( #17447 )
...
* add record_event
test=develop
* remove csp
test=develop
6 years ago
Yan Xu
0217555530
polish parallel dygraph code ( #17164 )
...
* add var grad hook test=develop
6 years ago
Jiabin Yang
d7df4e5e5b
Fix/Fix memory leak in dygraph ( #17394 )
...
* test=develop, add gradient sort backward strategy
* test=develop, fix test by add FLAGS_cudnn_deterministic on new tests
* test=develop, fix memory leak in dygraph mode
* test=develop, fix memory leak in dygraph mode
* test=develop, polish code
* test=develop, polish code
* test=develop, polish code
6 years ago
Qiao Longfei
728bbaa4e3
add cache_update_mutex_ for operator test=develop ( #17124 )
...
* add cache_update_mutex_ for operator
6 years ago
Bai Yifan
3a9ae28d32
fix assert,test=develop ( #17445 )
6 years ago
zhaoyuchen2018
b02f2aff04
Add conditional compile for gru opt ( #17368 )
...
* improve gru unit performance.
refine code
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Add conditional compile for gru opt
Not enable gru opt if compute ability < 700
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
liuwei1031
6a53fa95e7
improve the API Sample of DataFeeder, memory_optimize and release_memory ( #17374 )
...
* improve the API Sample of DataFeeder, memory_optimize and release_memory, test=develop
* update API.spec, test=develop, test=document_preview
* tweak the code format of feed API, test=develop
* update API.spec, test=develop
* improve doc for DataFeeder and default_main_program, test=develop
6 years ago
guru4elephant
43c9561e9a
add inductive shape index ( #17435 )
...
add inductive shape index
6 years ago
Zeng Jinle
712bfb17cb
fix recurrent_op,test=develop ( #17433 )
6 years ago
Tao Luo
5babcd02dd
Revert "remove unnecessary prepare_data ( #17080 )" ( #17432 )
...
This reverts commit aca60e9a20
.
6 years ago
chengduo
e336dc86bb
[Speed] Refine the Executor when the num_thread=1 ( #17405 )
...
Refine the Executor when the num_thread=1
6 years ago
Jie Fang
30e178fa2c
init auto loss scaling ( #17194 )
...
* init auto loss scaling
test=develop
* change API.spec
* change ifelse to switch and use reduce_sum to optimize checking isfinite
test=develop
* Remove redundant code
test=develop
6 years ago
Zhen Wang
4a1b7fec96
Add setting Scope function for the graph class ( #17417 )
...
* add set_not_owned function for graph
* add scope set. test=develop
* add scope_ptr enforce not null before setting.test=develop
6 years ago
mozga-intel
6ee6700fac
Eanble stack operator for a Ngraph, test=develop ( #17406 )
6 years ago
flame
e48dd92fc8
bug fix ( #17392 )
...
fix secure bug
6 years ago
jiaqi
66d51206b1
add save/load model, shrink table, cvm, config file & fix pull dense bug ( #17118 )
...
* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop
* fix global shuffle bug, fix pull dense bug, fix release memeory bug, fix shrink error
add client flush, add get data size
test=develop
* fix global shuffle bug
test=develop
* fix global shuffle bug
test=develop
* fix code style
test=develop
* fix code style & modify pslib cmake
test=develop
* fix error of _role_maker
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix code style
test=develop
* fix windows compile error of fleet
test=develop
* fix global shuffle bug
* add comment
test=develop
* update pslib.cmake
test=develop
* fix fill sparse bug
test=develop
* fix push sparse bug
test=develop
6 years ago
Krzysztof Binias
0823a7bc8b
Optimize the sequence padding op ( #17403 )
...
test=develop
6 years ago
baojun
1ce7b45b9e
NGraph Added fill_zeros_like op test=develop ( #17295 )
6 years ago
baojun
910196524d
NGraph Added dropout and dropout_grad to ngraph test=develop ( #17320 )
6 years ago
mozga-intel
b189480734
Ngraph Enable gather operator test=develop ( #17296 )
6 years ago
lvmengsi
4ef631013c
Double backward sqrt ( #17387 )
...
* double backward sqrt
* refine unittest. test=develop
* refine test. test=develop
* remove alpha in unittest. test=develop
6 years ago
JesseyXujin
829fcc98fb
Fix some APIs' example
...
* test=develop
* test=develop
* test=develop
6 years ago
Zeng Jinle
eab34b2df6
fix_dygraph_mem_leak, test=develop ( #17396 )
6 years ago
lvmengsi
5d1ac41b00
Double backward reduce mean ( #17372 )
...
* test=develop, double backward reduce_mean
* add comment. test=develop
* fix format. test=develop
* rename GradGrad -> DoubleGrad. test=develop
* fix op_use_default_grad_op_maker.spec. test=develop
6 years ago
jerrywgz
0cae5a36b6
enhance generate mask labels, test=develop ( #17380 )
6 years ago
Kaipeng Deng
bd9bef5a4e
add elementwise_add_grad_grad op ( #17366 )
...
* add elementwise_add_grad_grad op. test=develop
* use defined GradMaker. test=develop
6 years ago
jerrywgz
1c6d064627
add collect fpn proposals op,test=develop ( #16074 )
...
* add collect fpn proposals op,test=develop
6 years ago
Kaipeng Deng
60be66e2c0
support fc_op double grad ( #17317 )
...
* add double grad for mul_op. test=develop
* fix format. test=develop
* fix format. test=develop
* fix format. test=develop
* refine code. test=develop
* remove setzero. test=develop
* fix dx/dy init bug. test=develop
* fix format. test=develop
6 years ago
Zhen Wang
ad8bbe587e
Fix some api example codes' bugs and these APIs include load_inference_model, load_vars, save_vars, L1DecayRegularizer and L2DecayRegularizer. ( #17324 )
...
* fix some api example codes' bugs.
* update API.spec. test=develop test=document_preview
* add import fluid. test=develop test=document_preview
6 years ago
Tao Luo
68ec0a6f74
make parallel_executor support FLAGS_use_mkldnn ( #17341 )
...
* make parallel_executor support FLAGS_use_mkldnn
test=develop
* add warning when set mkldnn_enabled_op_types_ in non-mkldnn env
test=develop
6 years ago
liuwei1031
0863599323
Fix the uninitialized gru_value.output_value. ( #17197 )
...
test=develop
6 years ago
zhoukunsheng
2ff7ea3337
Expose sign op ( #17117 )
...
* test=develop
add sign op
* Update nn.py
test=develop
delete stop_gradient assignment
6 years ago
tianshuo78520a
f0acc36684
test=develop ( #17357 )
6 years ago
Yihua Xu
218d8d8f73
Optimize the computing kernel of sequence_reverse operator ( #17349 )
...
* Optimize the computing kernel of sequence_reverse operator.
test=develop
* Clean code
test=develop
* Fix for cpplint syntax checking.
test=develop
* Fix the compile warning issue.
test=develop
6 years ago
Yiqun Liu
dcda20233c
Optimize the elementwise op using eigen ( #15494 )
...
* Optimize the elementwise op with CUDA kernels.
test=develop
* Support setting of attr in op config file.
test=develop
* Add the support the setting dtype and initializer in config.
test=develop
* Save workspace.
* Add initializer "zeros".
test=develop
* Fix compiling error.
* Support the use of existed file to initailize tensor in op_tester.
* Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
test=develop
6 years ago
Jiabin Yang
4624d7c642
test=develop, add gradient sort backward strategy ( #17125 )
...
* test=develop, add gradient sort backward strategy
* test=develop, fix test by add FLAGS_cudnn_deterministic on new tests
6 years ago
qingqing01
1d0ba5e815
Fix the example code in some Python API ( #17333 )
...
* Fix the example code in some Python API
* Update paddle/fluid/API.spec
* Fix some examples format
6 years ago
Kaipeng Deng
8bae8590ac
add double grad for elementwise_mul op ( #17255 )
...
* add double grad for elementwise_mul. test=develop
* remove comment. test=develop
* fix grad sum. test=develop
* fix for axis expand. test=develop
* add test for axis expand. test=develop
6 years ago