Commit Graph

8169 Commits (af874a1f1d6c52265e452a7c6d0ef90b8b260fad)

Author SHA1 Message Date
yaoxuefeng ac92e4c066
fix distributed_transpiler.py api test=develop (#17668)
6 years ago
Yiqun Liu 2704479bb2
Optimize recurrent_op using Prepare and RunPreparedContext, avoiding create operators in every iter. (#17689)
6 years ago
pawelpiotrowicz 9b99876442 Enable less_than ngraph operator (#17642)
6 years ago
Zhaolong Xing a9a531fa5f
Refine python api code example note: (#17369)
6 years ago
Jiabin Yang effc555955
test=develop, layz init Grad (#17653)
6 years ago
hutuxian 4ff87c049d
remove useless input 'Softmax@GRAD' from softmax_with_cross_entropy op (#17612)
6 years ago
Tao Luo b4b169467b
add fc_mkldnn_pass in compare_mkldnn (#17712)
6 years ago
pawelpiotrowicz 70a887af63 [NGraph] Add reduce_sum operator for Ngraph (#17450)
6 years ago
baojun 29baca0dd8 add depthwise_conv2d op to ngraph engine (#17454)
6 years ago
gongweibao 0d561ef442
fix 2dconn test=develop (#17681)
6 years ago
mozga-intel ccf9e2327b [Lite] Enable cast operator test=develop (#17294)
6 years ago
tangwei12 0d3c48e0a8
fix doc in transpiler, test=develop (#17313)
6 years ago
Hongyu Liu 9f85f21880
Add new gard clip [old gradient clip not support in dy graph] (#17523)
6 years ago
Zhaolong Xing 4337009b92 fix trt ci timeout error (#17701)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Yiqun Liu 5782dddad0
Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415)
6 years ago
石晓伟 acbb4bf38d
update python API examples (#17351)
6 years ago
lidanqing 04b6c29ee0 Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Shuai Yuan 41f1186c6b
[DOC][PYTHON] Fix api docs, test=develop, test=document_preview (#17629)
6 years ago
wopeizl 058f1f1e1b
fix the api example for create_global_var, create_parameter, SGDOptim… (#17371)
6 years ago
Yibing Liu 4f4f0993c1
Bias correction for exponential moving average (#17677)
6 years ago
Tao Luo 962eed6f82
Revert "Enable SQRT operator for the nGraph Bridge (#17549)" (#17680)
6 years ago
Tao Luo 67a6297a9f
update unique_name notes and examples (#17671)
6 years ago
Krzysztof Binias f34830e2aa Enable SQRT operator for the nGraph Bridge (#17549)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
Zeng Jinle 432ac70124
clean code of py_layer in dygraph mode,test=develop (#17661)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Krzysztof Binias b1bd483a7d [NGraph] Enable gelu operator for the nGraph Bridge. (#17547)
6 years ago
Zhen Wang 8bd651b7ed
Fix the bug in the AnalysisPredictor and add more directions about io APIs. (#17639)
6 years ago
chengduo 343017324e
Polish Print Op (#17651)
6 years ago
Zeng Jinle 4aa931dd85
Code clean of Allocator (#17602)
6 years ago
Guo Sheng 430e25654b
Fix the usage of out_grad lod in sequence_slice_op. (#17625)
6 years ago
Bai Yifan bbd6e438fc
fix conflicts,test=develop (#17186)
6 years ago
bdzhuxiaoning 9f85afb7b6 test=develop (#17643)
6 years ago
chengduo 9322216170
Add data distributed_sampler (#17573)
6 years ago
hutuxian 1670db5e86
Gather Op Index Support int64_t datatype (#17610)
6 years ago
mozga-intel 2b83d75bfa Enable elementwise pow operator for ngraph (#17526)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
wopeizl 6724a652f3
add __str__ method for tensor and lodtensor to support print test=dev… (#17588)
6 years ago
Krzysztof Binias e9216d0602 Enable logical operators for the nGraph Bridge. (#17543)
6 years ago
Hongyu Liu cbaf9e5344
Fix api example [ lstm, sequence_enumerate, sequence_expand,sequence_expand_as ] (#17210)
6 years ago
guru4elephant 326bf8291a
add Run Prepared Ctx (#17616)
6 years ago
Yibing Liu e8990e64f6
Fix trust ratio in lamb (#17614)
6 years ago
Guo Sheng 2a7b321110
Fix the example code in some Python API. (#17343)
6 years ago
chengduo b5f4d5ed0e
Add broadcast operators (#17503)
6 years ago
flame 2280f185d7
BuildStrategy api comment (#17348)
6 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
6 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
chengduo 2dc1c6f25c
Add profiler in tracer (#17076)
6 years ago
mozga-intel 0d4cbdad91 [NGraph] Enable elementwise mul operator (#17552)
6 years ago
tianshuo78520a cee9dcc383 Delete LoDTensorset in API.spec (#17577)
6 years ago
mozga-intel f2694e122d [NGraph] Enable assign operator for a ngraph, test=develop (#17437)
6 years ago
mozga-intel cf02cb5e98 Enable elementwise sub operator for ngraph (#17527)
6 years ago
guru4elephant 7f8bc49d00
polish_executor_and_add_ctx_cache (#17536)
6 years ago
tensor-tang 7ae461eb13
[CPU] refine cpu softmax bwd (#17534)
6 years ago
Yibing Liu 6e11f97708
Add exponential moving average (#17562)
6 years ago
tensor-tang 0600b370ea
[CPU] refine softmax op fwd on CPU (#17522)
6 years ago
Zeng Jinle c6189637cd
Fix allocator bug (#16712)
6 years ago
mozga-intel 035771512d Enable elementwise min operator for ngraph (#17521)
6 years ago
Kaipeng Deng cf60e5a2db
fix API python example (#17226)
6 years ago
Qiao Longfei 92e7d5d7cc
fix distribute doc test=develop (#17318)
6 years ago
jerrywgz c1aae8b8d2
Fix GetExpectedKernelType in Concat op (#17459)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Zhaolong Xing 38da103034 fix trt ci bug temporary. (#17565)
6 years ago
mozga-intel 109b5aed5a [NGraph] Enable reshape operator test=develop (#17512)
6 years ago
zhang wenhui 9bb6a421e3
fix bpr_loss data_norm teacher_student_sigmoid_loss api & fix continuous_value_model (#17331)
6 years ago
lijianshe02 300bd7504d
fix api-doc related bugs test=develop test=document_preview (#17360)
6 years ago
lijianshe02 daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… (#17555)
6 years ago
Krzysztof Binias 43d15b9d96 Enable square operator for the nGraph Bridge. (#17551)
6 years ago
Sevin F. Varoglu f86f49e779 [NGraph] add increment op to ngraph engine (#16929)
6 years ago
baojun 8923612b10 NGraph enable parse serialized graph test=develop (#17453)
6 years ago
Yiqun Liu cf5d271c5a
Fix examples of fluid.layers.sums and fluid.layers.DynamicRNN (#17308)
6 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
Yibing Liu f9796b1249
Add LAMB Optimizer support (#17489)
6 years ago
mozga-intel 99ab57123c Enabled ngraph elementwise max operator (#17517)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
zhaoyuchen2018 dfdcd91869
Add api doc code examples (#17285)
6 years ago
mozga-intel 1eb151752e Enable abs operator for a ngraph test=develop (#17436)
6 years ago
lidanqing 36757ed203 Enabling resnet101, vgg16, vgg19 INT8v2 model tests (#17468)
6 years ago
liuwei1031 ba70cc499e
fix security bugs : (#17464)
6 years ago
Zhaolong Xing ff7f911b4d
add quant_dequant_moving_avg_max_abs op (#17480)
6 years ago
Qiao Longfei 287de41c04
Optimize communicator flags (#17494)
6 years ago
liuwei1031 c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop (#17491)
6 years ago
liuwei1031 f82e4d75e7
improve the doc of paddle.fluid.memory_optimize, test=develop (#17473)
6 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
6 years ago
wopeizl ca3ba378c7
fix the random compilation failure on windows test=develop (#17475)
6 years ago
lvmengsi 10b23a72c1 Double backward elementwise div (#17416)
6 years ago
qingqing01 97f0ec2357 Fix compiling error with cuDNN 5.1 (#17458)
6 years ago
Zeng Jinle 3d4e8268c6 fix recurrent fwd bug when no backward and scope clear (#17460)
6 years ago
lvmengsi 977e9fcb27
support elementwise_sub double backward (#17476)
6 years ago
jiaqi 75cda4d9df
fix data_feed_desc.py example run error (#17452)
6 years ago
chengduo 5a6ab38013 Add record event And remove CSP (#17447)
6 years ago
Yan Xu 0217555530 polish parallel dygraph code (#17164)
6 years ago
Jiabin Yang d7df4e5e5b
Fix/Fix memory leak in dygraph (#17394)
6 years ago
Qiao Longfei 728bbaa4e3
add cache_update_mutex_ for operator test=develop (#17124)
6 years ago
Bai Yifan 3a9ae28d32
fix assert,test=develop (#17445)
6 years ago
zhaoyuchen2018 b02f2aff04
Add conditional compile for gru opt (#17368)
6 years ago
liuwei1031 6a53fa95e7
improve the API Sample of DataFeeder, memory_optimize and release_memory (#17374)
6 years ago
guru4elephant 43c9561e9a
add inductive shape index (#17435)
6 years ago
Zeng Jinle 712bfb17cb
fix recurrent_op,test=develop (#17433)
6 years ago
Tao Luo 5babcd02dd
Revert "remove unnecessary prepare_data (#17080)" (#17432)
6 years ago
chengduo e336dc86bb
[Speed] Refine the Executor when the num_thread=1 (#17405)
6 years ago
Jie Fang 30e178fa2c init auto loss scaling (#17194)
6 years ago
Zhen Wang 4a1b7fec96
Add setting Scope function for the graph class (#17417)
6 years ago
mozga-intel 6ee6700fac Eanble stack operator for a Ngraph, test=develop (#17406)
6 years ago
flame e48dd92fc8
bug fix (#17392)
6 years ago
jiaqi 66d51206b1
add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118)
6 years ago
Krzysztof Binias 0823a7bc8b Optimize the sequence padding op (#17403)
6 years ago
baojun 1ce7b45b9e NGraph Added fill_zeros_like op test=develop (#17295)
6 years ago
baojun 910196524d NGraph Added dropout and dropout_grad to ngraph test=develop (#17320)
6 years ago
mozga-intel b189480734 Ngraph Enable gather operator test=develop (#17296)
6 years ago
lvmengsi 4ef631013c Double backward sqrt (#17387)
6 years ago
JesseyXujin 829fcc98fb
Fix some APIs' example
6 years ago
Zeng Jinle eab34b2df6
fix_dygraph_mem_leak, test=develop (#17396)
6 years ago
lvmengsi 5d1ac41b00 Double backward reduce mean (#17372)
6 years ago
jerrywgz 0cae5a36b6
enhance generate mask labels, test=develop (#17380)
6 years ago
Kaipeng Deng bd9bef5a4e
add elementwise_add_grad_grad op (#17366)
6 years ago
jerrywgz 1c6d064627
add collect fpn proposals op,test=develop (#16074)
6 years ago
Kaipeng Deng 60be66e2c0
support fc_op double grad (#17317)
6 years ago
Zhen Wang ad8bbe587e
Fix some api example codes' bugs and these APIs include load_inference_model, load_vars, save_vars, L1DecayRegularizer and L2DecayRegularizer. (#17324)
6 years ago
Tao Luo 68ec0a6f74
make parallel_executor support FLAGS_use_mkldnn (#17341)
6 years ago
liuwei1031 0863599323
Fix the uninitialized gru_value.output_value. (#17197)
6 years ago
zhoukunsheng 2ff7ea3337 Expose sign op (#17117)
6 years ago
Yihua Xu 218d8d8f73 Optimize the computing kernel of sequence_reverse operator (#17349)
6 years ago
Yiqun Liu dcda20233c
Optimize the elementwise op using eigen (#15494)
6 years ago
Jiabin Yang 4624d7c642
test=develop, add gradient sort backward strategy (#17125)
6 years ago
qingqing01 1d0ba5e815
Fix the example code in some Python API (#17333)
6 years ago
Kaipeng Deng 8bae8590ac
add double grad for elementwise_mul op (#17255)
6 years ago
Kaipeng Deng 11d3a38f25
add double grad for square op (#17173)
6 years ago
Jiabin Yang 31536016ea
test=develop, test=document_preview, fix 13 api doc and code (#17293)
6 years ago
chengduo bc833945a4
Add DropLocalExeScopes in ParallelExecutor (#17297)
6 years ago
zhoukunsheng d4b67e1692 Add Where Op(#16793)
6 years ago
zhoukunsheng 1bfff02047 Add Diag Op(#17027)
6 years ago
zhaoyuchen2018 8a2caacdbc
improve gru unit performance. (#16338)
6 years ago
SunGaofeng ddb24d48c5
test=develop (#17322)
6 years ago
qingqing01 e32c9888f5
Double backward of conv2d. (#17211)
6 years ago
Zeng Jinle 5e5e7b3305
fix data_type error message (#17312)
6 years ago
Zeng Jinle fff270eacd
follow comments,test=develop (#17273)
6 years ago
Zhaolong Xing 7a3bb061d8
fix: (#17279)
6 years ago
xiaoting 50ad9046c9 add import, test=develop (#17229)
6 years ago
zhoukunsheng 4292bd8687 Mod floordiv (#17251)
6 years ago
guru4elephant 5d6a1fcf16
fix infer_from_dataset and train_from_dataset (#17243)
6 years ago
chengduo 516317cf91
use sync copy (#17291)
6 years ago
Huihuang Zheng 2c4462711f
Fix API example code of save_inference_model (#17274)
6 years ago
xiaoting 9ed4aaada4 modified formula for Lrn (#17281)
6 years ago
zhaoyuchen2018 792443ef23
Refine elementwise kernel. (#16952)
6 years ago
lujun e388a1fb66
Repair api example (#17221)
6 years ago
Yiqun Liu 6b84688ba2
Optimize the cuda implementation of sum_op (#17283)
6 years ago
chengduo db5e74ab95
update assert (#17282)
6 years ago
Hongyu Liu c3195de522
Fix concat shape check (#17247)
6 years ago
lvmengsi dab71e8d97
Fix api example (#17231)
6 years ago
whs 7d7e29957f Fix bp of roi perspective transform op. (#17216)
6 years ago
baojun 7bd1d03ee5 Adding lrn op for ngraph engine (#17189)
6 years ago
Wojciech Uss 984aa90583 improved unit test output (#17266)
6 years ago
chengduo 8f534696b7
Polish Executor and Compiler doc (#17262)
6 years ago
gongweibao 91784f8ec3
Fix code in document. (#17237)
6 years ago
chengduo 04bd413acb
Code Clean: Move all pass to paddle::framework::ir (#17228)
6 years ago
Huihuang Zheng 648320bb6c
Fix some data and reader related API code (#17202)
6 years ago
Zeng Jinle f2fa3f7300
fix api doc,test=develop (#17241)
6 years ago
Zeng Jinle 4f8594088d
Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225)
6 years ago
baojun e782b54b9c update sofmax with axis arg test=develop (#17190)
6 years ago
tensor-tang 71f0c6d5bd
fix api doc of hash, relu, concat, argmin, argmax, argsoft and all activations (#17235)
6 years ago
Zeng Jinle 6fafd37e12
fix retry_allocator (#17245)
6 years ago
Tao Luo ff1661f12a
remove unused FLAGS_warpctc_dir (#17162)
6 years ago
Kaipeng Deng a71d8fdb87
Softmax_cross_entropy op add axis (#16806)
6 years ago
songhao c2e20e2a29 fix build warning like 'comparison between signed and unsigned (#17240)
6 years ago
Zhen Wang a914d9b116
Quant output scale (#17215)
6 years ago
zhaoyuchen2018 32b62c25af
optimize sum op (#16820)
6 years ago
石晓伟 a72dbe9abf
Cherry-pick benchmark related changes from release/1.4 (#17156)
6 years ago
Tao Luo 16922e0093
fix api_example of tree_conv (#17239)
6 years ago
jerrywgz ef66baedc0
Refine api doc (#17230)
6 years ago
Leo Zhao 54636a1982 call SetNumThreads everytime to avoid missing omp thread setting (#17224)
6 years ago
Yibing Liu 6b0f27e802
Fix some APIs' example (#17214)
6 years ago
ruri 5817077c99
Fix unexecutable API examples (#17218)
6 years ago
jerrywgz cc95a7516c
fix distribute fpn proposals, test=develop (#16152)
6 years ago
Tao Luo 9ec4615deb
fix profiler and name_scope API examples (#17212)
6 years ago
Zeng Jinle c5eeecca7c
Fix tensor_py.h (#17195)
6 years ago
Zeng Jinle ee2028a110
Add use_cuda to inplace pass (#17205)
6 years ago
chengduo 950aec55fd
It doesn't need sync when fetch_list nit not empty (#17201)
6 years ago
jerrywgz a72907bbf4
Enhance concat op to support empty input. (#17015)
6 years ago
wopeizl 83c4f7721f
use two GPUs to run the exclusive test test=develop (#17187)
6 years ago
chengduo 3c6ab799cd
Remove unnecessary set_devices (#17158)
6 years ago
guru4elephant f938ccec62
remove async executor python api to fix document (#17174)
6 years ago
Zeng Jinle 5dfe2ab9e8
Fix mem leak when converting Tensor to numpy array (#17182)
6 years ago
Huihuang Zheng e4a5332416
Fix a typo in gpu_info.cc (#17175)
6 years ago
tensor-tang 79ed1c76cd
fix bn fuse vardesc and add model saver (#17143)
6 years ago
Zeng Jinle 4e1bc6e805
Rewrite inplace pass and fix gc bug (#17126)
6 years ago
Zeng Jinle 08773b6069
fix reader default stream,test=develop (#17106)
6 years ago
xiaoting bc48453b73 polish the label_smooth (#17138)
6 years ago
Leo Zhao bf4b21fa3d fix assertion failure issue when test_analyzer_bert uses ngraph (#17148)
6 years ago
tangwei12 deb510d451
cvm op feature (#17081)
6 years ago
Zeng Jinle 28d69d710a
Refine dropout gpu memory (#17095)
6 years ago
Huihuang Zheng b9494058b3
Use CudnnWorkspaceHandle in exhaustive search (#17082)
6 years ago
xiaoting 7da7881c0e Detailed coordinate description for yolov3 loss (#17007)
6 years ago
chengduo 794a195881
fix fuse optimizer ops (#17102)
6 years ago
ceci3 258e000be6
test=develop, double backward leaky_relu (#17067)
6 years ago
Kaipeng Deng 10c487eb21
fix interpolate cu. test=develop (#17101)
6 years ago
Tao Luo aca60e9a20
remove unnecessary prepare_data (#17080)
6 years ago