Commit Graph

15686 Commits (1c2aae567a8863c9cdb666fc3b553b8f01281d15)

Author SHA1 Message Date
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
石晓伟 1529154821
Support Bitmain Anakin (#18542)
6 years ago
tianshuo78520a 9b3d3b8387 Cancel jacquesqiao approval authority (#18538)
6 years ago
Leo Zhao ce38bb5341 use static variable to do cache instead of thread local in thread frequent switching case (#18428)
6 years ago
gongweibao 160ddc980c
Regroup fusion by date type. (#18496)
6 years ago
Tao Luo fe32879d2a
add mkldnn shapeblob cache clear strategy (#18513)
6 years ago
chengduo e576f2667b
update docker build (#18523)
6 years ago
zhaoyuchen2018 832d8191ff
Fix topk cannot handle 1D vector bug (#18466)
6 years ago
石晓伟 280a8784f7
Remove the obsolete cmake options (#18493)
6 years ago
LielinJiang 43e17c7951 Add distributions of normal and uniform (#18023)
6 years ago
bingyanghuang 3fe6bf5ee6 fix command line bug in int8v2 readme (#18507)
6 years ago
tensor-tang 4828a5e008
core remove pycpuinfo (#18479)
6 years ago
qingqing01 7ac4818a98
Refine Infershape in activation_op for double_grad. (#18485)
6 years ago
qingqing01 602cb6a5b4
Enhance linear_lr_warmup (#18463)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
chengduo 55baeceddb
Enhance execution error info (#18482)
6 years ago
石晓伟 047bba855b
Remove the obsolete cmake options (#18481)
6 years ago
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
Jiabin Yang f72ced8814
test=develop, fix docker with paddle nccl problem (#18451)
6 years ago
Tao Luo 3f3112ceb0
add shape_blob for cache mkldnn primitive (#18454)
6 years ago
Tao Luo d234aa02cd
add transfer_scope_cache unit-test (#18467)
6 years ago
zhoukunsheng 7c6f2350b9 support Tensor input for edit_distance op (#18162)
6 years ago
zhoukunsheng 26318544d2 support Tensor input for chunk_eval op (#18226)
6 years ago
zhoukunsheng 206c44e2a8 add unique kernel and op (#17557)
6 years ago
zhoukunsheng 71af72b1c2 upgrade hash op to support Tensor and LoDTensor input (#17998)
6 years ago
zhoukunsheng d3b3443d10 add ones_like op (#17388)
6 years ago
zhoukunsheng 67b48d7fe7 add size op (#17412)
6 years ago
Leo Zhao 8f5fffca0a rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id() (#18453)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
tianshuo78520a 85b49d8473 fix the api.spec file does not get the class comment problem (#18439)
6 years ago
chengduo e0d8c6ac68
Add find_no_grad_vars in backward.py (#17942)
6 years ago
LielinJiang 449c7a9f98 Make roi_perspective_transform op return mask and transform matrix (#18371)
6 years ago
tensor-tang a3bc804f5f
fix mac ci random fail (#18430)
6 years ago
Michał Gallus 7023a86c3a Fix Pooling output scale (#18186)
6 years ago
Brian Liu 4bc2987d2f Fix bug in quantize kernel which cause crash in vgg16/19 model (#17964)
6 years ago
xsrobin 47e2ef38e9
add "import paddle.fluid as fluid" to examples lack of it
6 years ago
tianshuo78520a 92ecb305c2 test=develop (#18426)
6 years ago
hutuxian 8a39e5c110 update api format (#18413)
6 years ago
jiaqi 93a2b317f7
fix data feed ptr error (#18419)
6 years ago
tensor-tang ce7a024c6d
fix py-cpuinfo mac random fail (#18383)
6 years ago
Jie Fang 2b4ef509ea init custom black white list (#18377)
6 years ago
Leo Zhao 681d3553f1 Fix potential mkldnn concat/pool/conv kernel issues (#18393)
6 years ago
tianshuo78520a 052b044873 Fix mac build nproc command not found (#18362)
6 years ago
Zeng Jinle f5641000bb
Add a unittest to inplace elementwise_add (#18385)
6 years ago
Jiabin Yang 43f64a177e
Fix/program doc (#17908)
6 years ago
Jiabin Yang af874a1f1d
test=develop, fix multigpu hang on latest docker (#18379)
6 years ago
chengduo 871cc15e6a
Add is_compiled_with_cuda (#18356)
6 years ago
lujun fd6631ef2f
Fix dygraph show style (#18297)
6 years ago
HaoRen 9931bc64f5 add dependecy of collective_helper (#18365)
6 years ago
翟飞跃 19da59ed3f Remove all the code, API and doc of MKL-DNN INT8v1 (#18347)
6 years ago
chengduo 8ed33bf91f
Fix Bug-prone code of PE (#18354)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
tianshuo78520a cff2c2d83f add combine_avx_noavx build to dockerfile
6 years ago
kh2se2013 27fb9cad65 add WITH_COVERAGE option, default OFF (#17872)
6 years ago
Michał Gallus 8409693272 Reset DeviceContext after quantization warmup (#18182)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
lidanqing 5fd68ac154 some fixes for int8 mobilenet_ssd tester (#18112)
6 years ago
Jacek Czaja c2efdfd5bc [MKL-DNN] Extending reusing to Elementwise_add_mkldnn op (#18146)
6 years ago
qingqing01 9047ac687e
Simplify multi_box_head API in detection.py and remove assign op. (#18310)
6 years ago
Zeng Jinle 5826b72e06
Refine CUDAPlace error message. (#18343)
6 years ago
Tao Luo 3c9755bbb9
remove unused jemalloc option (#18314)
6 years ago
Yibing Liu 23941e43ec
Update lamb optimizer (#18333)
6 years ago
chengduo 135a59ed45
update reduce config (#18334)
6 years ago
tensor-tang 81ec538279
fix softrelu doc (#18324)
6 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
6 years ago
Qiao Longfei 0e08e91c18
optimize communicator merge sparse gradient test=develop (#18159)
6 years ago
chengduo e06c69c788
Fix default value of fluid.memory_optimize (#18295)
6 years ago
Zhaolong Xing 6978b2e48e
fix split and sampled softmax (#18280)
6 years ago
Yibing Liu f57ee3693b
Fix the bug of sequence_unpad op (#18290)
6 years ago
chengduo 5489216eba
Clean build strategy (#18148)
6 years ago
chengduo 14e1e165df
update alloc_continuous_space_for_grad_pass (#18287)
6 years ago
lujun 7e61baaa94
add Dygraph api to api.spec (#18235)
6 years ago
liuwei1031 a736c03b10
improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs (#18261)
6 years ago
flame fdf798f95a
fix double buffer example (#18169)
6 years ago
Bai Yifan 23b8b18e56
fix api doc example, test=develop (#18266)
6 years ago
xiaoting 2f0d68261c fix yolo_box example,test=develop (#18247)
6 years ago
songhao 6b3d96254d fix some bug when merge sparse embedding parameters, test=develop (#18223)
6 years ago
jiaqi 3f8031e256
dataset (#17973)
6 years ago
liuwei1031 5d54ed4a84
improve the doc of DataFeeder and default_main_program (#18241)
6 years ago
xiaoting b58bb80248 set src_idx > 0 for bilinear_interp_op (#18238)
6 years ago
wopeizl daa32d5383
fix package generation for inference test=develop (#18220)
6 years ago
Shuai Yuan 9a32dad811
[DOC] Fix comment code of API create_py_reader_by_data (#18193)
6 years ago
Hongyu Liu cefd0fb598
Fix slice op shape=-1 bug (#18107)
6 years ago
lijianshe02 ff4279e3b2
fix paddle.fluid.layers.io.open_files api doc bug test=develop (#18203)
6 years ago
chengduo 5588b923f3
Add multi process reader (#18115)
6 years ago
wangchaochaohu a9dc534f48
fix API example (#18153)
6 years ago
翟飞跃 de42fe8fd5 Change int8v2 CAPI unit test name and add log in the prediction stage (#18200)
6 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
6 years ago
zhoukunsheng 0569ff78fa Fix doc example for greater_equal, greater_than, less_equal, not_equal, rank, reduce_all, reduce_any, sign, where, diag (#18167)
6 years ago
Huihuang Zheng bbc292920c
Fix API example code (#18176)
6 years ago
翟飞跃 78441c5449 add mkldnn Int8v2 slim doc (#17909)
6 years ago
lvmengsi d658f1133b
Fix doc for transpose, conv3d and batch_norm. (#18035)
6 years ago
FlyingQianMM 944c3165ec
fix type error of std::pow in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h (#18152)
6 years ago
chengduo 25f3cd6486
Update execution_strategy option default value (#18183)
6 years ago
chengduo 4978db2c10
Remove nccl dep when the number of GPU is 1 (#18158)
6 years ago
Zeng Jinle 25ab23be28
Fix dygraph mem leak (#18082)
6 years ago
tensor-tang 1c6e560607
core replace x86cpu with py cpuinfo (#18151)
6 years ago
Zeng Jinle 6eec66a1b1
Fix py_reader iterable bug (#18108)
6 years ago
qingqing01 80d2e66f9e
Update backward appending stragety to support double backward and fix some bug. (#18104)
6 years ago
Wojciech Uss ca5642c850 unify FP32 vs. INT8 comparison tests output (#18111)
6 years ago
Wojciech Uss c26130f3a9 reuse C-API INT8 unit test application (#18077)
6 years ago
FlyingQianMM ff83655f7e
add detection output operator for supporting retinanet (#17896)
6 years ago
FlyingQianMM 0aee1f0074
add sigmoid focal loss operator for supporting retinanet (#17895)
6 years ago
FDInSky 9e4b9d9798 Update generate_proposal_labels_op to support CascadeRCNN. (#17200)
6 years ago
FlyingQianMM 9ed2f936f1
add target assign operator for supporting retinanet (#17893)
6 years ago
Huihuang Zheng 7faf095618
Sync Dockerfile change of PR#17889 (#18072)
6 years ago
Sylwester Fraczek accb132f0f fix slim int8 mkldnn multithreading issue (#18009)
6 years ago
tianshuo78520a 2e1d8cf7c8 add approval to requirements.txt
6 years ago
chengduo 24e988a471
Fix bug of scope_buffered_ssa_graph_executor (#18100)
6 years ago
Huihuang Zheng 3f55ab0f89
Modify format of GPU allocation failure log. (#18034)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
whs 354643d8d9
Add warning for cudnn warpctc kernel in CUDA9\CUDA10. (#18046)
6 years ago
qingqing01 e81756f1ba
Hidden paddle.fluid.layers.detection_map. (#18033)
6 years ago
Yiqun Liu 660c1a65f3
Optimize fused_elewise_activation_grad op. (#18041)
6 years ago
lidanqing 466254151a add Mobilienet ssd int8 analyzer tester (#18075)
6 years ago
石晓伟 42f12a4aca
fix ci test cmake test=develop (#18060)
6 years ago
chengduo b5a1c1463d
Update CPU_NUM config (#18059)
6 years ago
lidanqing f8ecc3de89 refactor the function ConvFwdPrimitiveDesc (#17897)
6 years ago
Michał Gallus 8462e2b805 Disable MKLDNN FC in Resnet50 test (#18030)
6 years ago
Wojciech Uss 78e932862c Added unit test for QAT FP32 & INT8 comparison (#17814)
6 years ago
tensor-tang 566bf2ec56
concat op support negative axis (#18045)
6 years ago
Yiqun Liu 7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
hutuxian f1d458daf0
add trainer_desc proto DEPS (#18019)
6 years ago
Guo Sheng a06b316b94
Fix GetExpectedKernelType of add_position_encoding_op (#17935)
6 years ago
tensor-tang 5c06bff222
combine noavx and avx package (#17889)
6 years ago
wawltor 8eb134c3c1
Fix scatter and gather op when has duplicate index (#17952)
6 years ago
lujun 75fcd29220
update load_error_info, test=develop (#18000)
6 years ago
石晓伟 04ea7cb069
modify the access level of anakin engine (#18015)
6 years ago
wawltor 2ae8decc90
test=develop (#17984)
6 years ago
ruri 9d6640ff44
Fix edit distance doc (#17947)
6 years ago
Zeng Jinle a1bdf25ecb
Add shape not match doc to data layer (#17936)
6 years ago
cjt222 871af28d6c
add deformable psroi pooling (#17827)
6 years ago
SunGaofeng 40885c225b
add unfold op (new op),test=develop (#17944)
6 years ago
Jacek Czaja 84bb45c054 [MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965)
6 years ago
gongweibao da9143c1cc
Polish codes of old prs. (#17938)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
tianshuo78520a 410907f624 added monitoring of python/requirements.txt file (#17957)
6 years ago
hutuxian 969e6378b9
Pipeline Concurrency (#17402)
6 years ago
Zhaolong Xing 4e8d5a034f
Light mem reuse strategy for inference. (#17925)
6 years ago
Tao Luo 53fd507bae
fix merge conflict of 'Remove attribute in Allocator::Allocate' and elementwise_add_mkldnn_op (#17949)
6 years ago
zhaoyuchen2018 3847d9fc2c
refine sum stack api doc (#17923)
6 years ago
jerrywgz aab4d12c0e
refine GetExpectedKernelType in conat op, test=develop (#17934)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
Yibing Liu 33d1e56506
Enable seq_pool op to accept len 0 input (#17284)
6 years ago
Yihua Xu 9b5017366a Fix the format issue when 'X' is not nchw. (#17833)
6 years ago
Hongyu Liu 8062bd510c
Reshape support tensor attribute (#17781)
6 years ago
gongweibao 972c54cd70
Fix FLAGS_fuse_parameter_memory_size unit from Bytes to MBytes. (#17924)
6 years ago
Zeng Jinle 0a96ec699c
fix conv v7 workspace size limit error, test=develop (#17902)
6 years ago
Jiabin Yang 4d5f6937c3
Feature/refine api for dygraph (#17907)
6 years ago
gongweibao dd4cd352c7
Fix sync_batch_norm_op ncclallreduce error! (#17918)
6 years ago
whs 5df65e506d
Add Ligth-NAS for PaddleSlim (#17679)
6 years ago
Zeng Jinle 3925bd81e8
Fix cuda/cudnn version detection error (#17853)
6 years ago
Yihua Xu 14a32bf0c4 Fix the accuracy issue while using float precision to get the scale. (#17884)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
mozga-intel c1379bf238 [NGraph] Bert model for a capi, ngraph's support test=develop (#17844)
6 years ago
baojun e2c1b7c354 [NGraph] cache compiled function instead test=develop (#17845)
6 years ago
石晓伟 d008260fa8
update the initialization of anakin subgraph (#17880)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
Zhaolong Xing 5efe8c7287
fix bug: the lod_tensor_to_array op will aplly a new var but not release when dong inference (#17856)
6 years ago
Jiabin Yang 022dfed4fc
Add optimizer save and load (#16986)
6 years ago
wopeizl 453a49b1bc
Make ParallelExecutor support Windows GPU (#17787)
6 years ago
pawelpiotrowicz 39bc8a55a4 [NGraph] Enable ngraph layer_norm operator (#17599)
6 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
wopeizl 841553e13f
use pyreader to read data in dygraph mode (#17314)
6 years ago
chengduo 5436d66667
close socket connect (#17862)
6 years ago
baojun a4c528a31c [NGraph] some ngraph updates to enable bert (#17739)
6 years ago
Jiabin Yang 3d3f5506d2
Feature/Fix recurrent usage of Varbase in Dygraph (#17838)
6 years ago
Zeng Jinle 674e0ce2d6
Use Python C-API to speed up dygraph trace (#17837)
6 years ago
tianshuo78520a 47cc1b51ad Change Linux CI check API
6 years ago
jerrywgz 5e4f99dd74
refine doc for prelu (#17810)
6 years ago
chengduo d1169afaa3
remove InstallFailureSignalHandler (#17828)
6 years ago
chengduo 437520474c
fix DropLocalExeScopes (#17829)
6 years ago
Leo Zhao 50326563d5 enable mkldnn primitive reuse for platform reorder (#17826)
6 years ago
baojun 7611208ab7 [NGraph] added gather_grad to ngraph test=develop (#17646)
6 years ago
tensor-tang 557452e778
update and polish hash op doc (#17809)
6 years ago
jerrywgz 92d9bdfce2
fix api doc in slice op, test=develop (#17804)
6 years ago
Hongyu Liu dfec676270
expand op supprt tensor attribute (#17773)
6 years ago
Jiabin Yang 3b70f870e2
Using Smart pointer to optimizer memory usage of dyGraph (#17768)
6 years ago
Hongyu Liu 82358bfdc1
ont hot support tensor depth (#16972)
6 years ago
Brian Liu 7cfddf22c8 Optimize bilinear interpolate op with OpenMP (#17800)
6 years ago
Yibing Liu d6d33fd748
Add update method for ema (#17812)
6 years ago
wangchaochaohu c10157a5df
revise the cudnn conv choose algorithm to improve the performance(mask rcnn benchmark) (#17753)
6 years ago
chengduo 863c75168c
polish error doc (#17772)
6 years ago
Tao Luo e089e454a1
make omp thread num default 1 after inference run (#17801)
6 years ago
mozga-intel 6a6bf597f7 [NGraph] Enable elementwise_div operator test=develop (#17515)
6 years ago
Huihuang Zheng 931698a54a
Modify doc of program_guard, py_reader, data, and clone (#17727)
6 years ago
lidanqing d7c5c2bd64 Add input format in Transpose GetHash (#17737)
6 years ago
tangwei12 659b72a97c
fix document of python api get_startup_program() (#17764)
6 years ago
AIFollowers 93de124cec modify some initializer api (#17301)
6 years ago
guru4elephant d52391094d
fix prepare context redundant code problem, optimize executor by cach… (#17743)
6 years ago
baojun 2c58f1a83c [NGraph] Added lookup table to ngraph engine test=develop (#17647)
6 years ago
pawelpiotrowicz bacc822492 [NGraph] Enable transpose ngraph operator (#17636)
6 years ago
lujun ed9d603a8a
fix api doc: Optimizer.ModelAverage (#17395)
6 years ago
baojun 90eae0b39a [NGraph] Addded slice op to ngraph test=develop (#17648)
6 years ago
baojun 2fbaa5c075 [NGraph] added matmul op to ngraph engine test=develop (#17645)
6 years ago
hong19860320 68dcb1bd7b
fix API examples of assign, reverse and array_write, etc. (#17287)
6 years ago
tianshuo78520a f144740b73 change ci ctest exit code (#17745)
6 years ago
chengduo 67c8dade58
Add Event in ScopeBuffer Executor (#17667)
6 years ago
Bai Yifan bba57cdd82
Add deformable conv v2 op,test=develop (#17145)
6 years ago
wangchaochaohu bd48950c7e fix paddlepaddle API examples (#17306)
6 years ago
YishengCheng bd15912d65 fix bug for ctr_reader for svm data (#17575)
6 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
6 years ago
yaoxuefeng ac92e4c066
fix distributed_transpiler.py api test=develop (#17668)
6 years ago
Yiqun Liu 2704479bb2
Optimize recurrent_op using Prepare and RunPreparedContext, avoiding create operators in every iter. (#17689)
6 years ago
pawelpiotrowicz 9b99876442 Enable less_than ngraph operator (#17642)
6 years ago
Zhaolong Xing a9a531fa5f
Refine python api code example note: (#17369)
6 years ago
Jiabin Yang effc555955
test=develop, layz init Grad (#17653)
6 years ago
hutuxian 4ff87c049d
remove useless input 'Softmax@GRAD' from softmax_with_cross_entropy op (#17612)
6 years ago
Tao Luo b4b169467b
add fc_mkldnn_pass in compare_mkldnn (#17712)
6 years ago
pawelpiotrowicz 70a887af63 [NGraph] Add reduce_sum operator for Ngraph (#17450)
6 years ago
baojun 29baca0dd8 add depthwise_conv2d op to ngraph engine (#17454)
6 years ago
gongweibao 0d561ef442
fix 2dconn test=develop (#17681)
6 years ago
mozga-intel ccf9e2327b [Lite] Enable cast operator test=develop (#17294)
6 years ago
tangwei12 0d3c48e0a8
fix doc in transpiler, test=develop (#17313)
6 years ago
Hongyu Liu 9f85f21880
Add new gard clip [old gradient clip not support in dy graph] (#17523)
6 years ago
Zhaolong Xing 4337009b92 fix trt ci timeout error (#17701)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Yiqun Liu 5782dddad0
Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415)
6 years ago
石晓伟 acbb4bf38d
update python API examples (#17351)
6 years ago
Jiabin Yang 7a401da52f test=develop, fix mac ci will not uninstall dependency files when error occurs (#17688)
6 years ago
lidanqing 04b6c29ee0 Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Shuai Yuan 41f1186c6b
[DOC][PYTHON] Fix api docs, test=develop, test=document_preview (#17629)
6 years ago
wopeizl 058f1f1e1b
fix the api example for create_global_var, create_parameter, SGDOptim… (#17371)
6 years ago
Yibing Liu 4f4f0993c1
Bias correction for exponential moving average (#17677)
6 years ago
Tao Luo 962eed6f82
Revert "Enable SQRT operator for the nGraph Bridge (#17549)" (#17680)
6 years ago
Tao Luo 67a6297a9f
update unique_name notes and examples (#17671)
6 years ago
Krzysztof Binias f34830e2aa Enable SQRT operator for the nGraph Bridge (#17549)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
Zeng Jinle 432ac70124
clean code of py_layer in dygraph mode,test=develop (#17661)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Krzysztof Binias b1bd483a7d [NGraph] Enable gelu operator for the nGraph Bridge. (#17547)
6 years ago
Zhen Wang 8bd651b7ed
Fix the bug in the AnalysisPredictor and add more directions about io APIs. (#17639)
6 years ago
chengduo 343017324e
Polish Print Op (#17651)
6 years ago
Zeng Jinle 4aa931dd85
Code clean of Allocator (#17602)
6 years ago
Guo Sheng 430e25654b
Fix the usage of out_grad lod in sequence_slice_op. (#17625)
6 years ago
Huihuang Zheng afc3d85da2 Remove Docker build for CI tasks (#17650)
6 years ago
Bai Yifan bbd6e438fc
fix conflicts,test=develop (#17186)
6 years ago
bdzhuxiaoning 9f85afb7b6 test=develop (#17643)
6 years ago
chengduo 9322216170
Add data distributed_sampler (#17573)
6 years ago
hutuxian 1670db5e86
Gather Op Index Support int64_t datatype (#17610)
6 years ago
Huihuang Zheng febc07f047
Add Dockerfile for cuda9 and cuda10 (#17600)
6 years ago
mozga-intel 2b83d75bfa Enable elementwise pow operator for ngraph (#17526)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago
wopeizl 6724a652f3
add __str__ method for tensor and lodtensor to support print test=dev… (#17588)
6 years ago
Krzysztof Binias e9216d0602 Enable logical operators for the nGraph Bridge. (#17543)
6 years ago
Hongyu Liu cbaf9e5344
Fix api example [ lstm, sequence_enumerate, sequence_expand,sequence_expand_as ] (#17210)
6 years ago
guru4elephant 326bf8291a
add Run Prepared Ctx (#17616)
6 years ago
Yibing Liu e8990e64f6
Fix trust ratio in lamb (#17614)
6 years ago
Guo Sheng 2a7b321110
Fix the example code in some Python API. (#17343)
6 years ago
chengduo b5f4d5ed0e
Add broadcast operators (#17503)
6 years ago
flame 2280f185d7
BuildStrategy api comment (#17348)
6 years ago
Sylwester Fraczek 5b2a3c4b12 Conv concat relu quantization (#17466)
6 years ago
Sylwester Fraczek bccb0ba49a fix quantize_squash_pass segfault when no tensor linked to Bias (#17292)
6 years ago
chengduo 2dc1c6f25c
Add profiler in tracer (#17076)
6 years ago
mozga-intel 0d4cbdad91 [NGraph] Enable elementwise mul operator (#17552)
6 years ago
tianshuo78520a cee9dcc383 Delete LoDTensorset in API.spec (#17577)
6 years ago
mozga-intel f2694e122d [NGraph] Enable assign operator for a ngraph, test=develop (#17437)
6 years ago
mozga-intel cf02cb5e98 Enable elementwise sub operator for ngraph (#17527)
6 years ago
guru4elephant 7f8bc49d00
polish_executor_and_add_ctx_cache (#17536)
6 years ago
tensor-tang 7ae461eb13
[CPU] refine cpu softmax bwd (#17534)
6 years ago
Yibing Liu 6e11f97708
Add exponential moving average (#17562)
6 years ago
tensor-tang 0600b370ea
[CPU] refine softmax op fwd on CPU (#17522)
6 years ago
Zeng Jinle c6189637cd
Fix allocator bug (#16712)
6 years ago
mozga-intel 035771512d Enable elementwise min operator for ngraph (#17521)
6 years ago
Kaipeng Deng cf60e5a2db
fix API python example (#17226)
6 years ago
Qiao Longfei 92e7d5d7cc
fix distribute doc test=develop (#17318)
6 years ago
jerrywgz c1aae8b8d2
Fix GetExpectedKernelType in Concat op (#17459)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Zhaolong Xing 38da103034 fix trt ci bug temporary. (#17565)
6 years ago
mozga-intel 109b5aed5a [NGraph] Enable reshape operator test=develop (#17512)
6 years ago
zhang wenhui 9bb6a421e3
fix bpr_loss data_norm teacher_student_sigmoid_loss api & fix continuous_value_model (#17331)
6 years ago
lijianshe02 300bd7504d
fix api-doc related bugs test=develop test=document_preview (#17360)
6 years ago
lijianshe02 daf88968e2
fix bug that saved optimal model path in test_analyzer_save_model con… (#17555)
6 years ago
Krzysztof Binias 43d15b9d96 Enable square operator for the nGraph Bridge. (#17551)
6 years ago
Sevin F. Varoglu f86f49e779 [NGraph] add increment op to ngraph engine (#16929)
6 years ago
baojun 8923612b10 NGraph enable parse serialized graph test=develop (#17453)
6 years ago
Yiqun Liu cf5d271c5a
Fix examples of fluid.layers.sums and fluid.layers.DynamicRNN (#17308)
6 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
Yibing Liu f9796b1249
Add LAMB Optimizer support (#17489)
6 years ago
mozga-intel 99ab57123c Enabled ngraph elementwise max operator (#17517)
6 years ago
Tao Luo 3d19f44a89
remove unused SERIAL compiler option (#17500)
6 years ago
zhaoyuchen2018 dfdcd91869
Add api doc code examples (#17285)
6 years ago
mozga-intel 1eb151752e Enable abs operator for a ngraph test=develop (#17436)
6 years ago
lidanqing 36757ed203 Enabling resnet101, vgg16, vgg19 INT8v2 model tests (#17468)
6 years ago
liuwei1031 ba70cc499e
fix security bugs : (#17464)
6 years ago
Zhaolong Xing ff7f911b4d
add quant_dequant_moving_avg_max_abs op (#17480)
6 years ago
Qiao Longfei 287de41c04
Optimize communicator flags (#17494)
6 years ago
liuwei1031 c3949f5699
remove two useless flags: enable_subgraph_optimize, memory_optimize_debug, test=develop (#17491)
6 years ago
liuwei1031 f82e4d75e7
improve the doc of paddle.fluid.memory_optimize, test=develop (#17473)
6 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
6 years ago
wopeizl ca3ba378c7
fix the random compilation failure on windows test=develop (#17475)
6 years ago
lvmengsi 10b23a72c1 Double backward elementwise div (#17416)
6 years ago
qingqing01 97f0ec2357 Fix compiling error with cuDNN 5.1 (#17458)
6 years ago
Zeng Jinle 3d4e8268c6 fix recurrent fwd bug when no backward and scope clear (#17460)
6 years ago
lvmengsi 977e9fcb27
support elementwise_sub double backward (#17476)
6 years ago