Commit Graph

15854 Commits (f04f2b232a22c9aba3ee4538ab708acf9f77c813)

Author SHA1 Message Date
Zeng Jinle 91a0911ca3
Make PADDLE_ENFORCE_EQ support types that cannot be converted to std::string (#19243)
6 years ago
chengduo 8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
6 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
silingtong123 a94a25867d imporve the doc of decorate_reader API (#19206)
6 years ago
zhongpu c27b081397 modify paddle/scripts/fast_install.sh about mac installation, test=develop (#19187)
6 years ago
Kaipeng Deng 2848cb791e
fix temporal_shift OP PADDLE_ENFORCE. test=develop (#19161)
6 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
Zeng Jinle 002f325dcd
add PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#19211)
6 years ago
lidanqing 07a4d8f8d6 Fix mAP problem in unit test of int8 object detection test (#18946)
6 years ago
Hao Wang d53fa53b65 CI - Improve example code check (#19170)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
Yibing Liu 50b1cab122
Add padding support for crf_decoding (#19057)
6 years ago
Aurelius84 45fb031f6b
remove is_test param of FC test=develop (#19209)
6 years ago
liym27 c8cdef37b2 change the default value of summarize from -1 to 20 in Print API to improve ease of use (#18738)
6 years ago
Yiqun Liu 77572b70cb
Enhance the error message when GrapOpMaker is null. (#19070)
6 years ago
lvmengsi c6f163cd7a
add description of sync_bn (#19056)
6 years ago
chengduo b5ba801ef0
Fix gather op bug (#19168)
6 years ago
Zeng Jinle 0f9b33954a
move python reader api to fluid.io module, test=develop (#19143)
6 years ago
Leo Chen 80eab822c1 Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166)
6 years ago
chengduo c70a97f46e Use CUDAPinnedPlace in buffered_reader (#19112)
6 years ago
jiaqi b104ea0684
add get_last_save_xbox_base/get_last_save_xbox (#19122)
6 years ago
joanna.wozna.intel 492a00f53e Add conv reqantize squash (#18754)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
zhongpu 2e76e75517 modify paddle/scripts/fast_install.sh about Mac installation to support paddle version check on MacOS (#19108)
6 years ago
Tao Luo 5f5648a8ff
Revert "Python inference API support numpy (#19009)" (#19160)
6 years ago
wawltor 0019eb376a
Fix the error of op `ones_like` document,change the output variable
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
joanna.wozna.intel bce72c7fea Replace Relu with bounded Relu in MobileNetV2 quantization (#18988)
6 years ago
chengduo e044e84264
open fuse_all_optimizer_ops (#19087)
6 years ago
wangguanzhong 1fc242a7ed
refine infer shape in box decoder and assign op, test=develop (#19118)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
flame b7e1a1d7e7 Python inference API support numpy (#19009)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Kevin 744279fe68
Refine embedding Api doc (#18820)
6 years ago
Kevin 945f3cf631
fix code too big test=develop (#19111)
6 years ago
Tao Luo 4a959883e7
remove unused aws_benchmarking and go directory (#19103)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
hutuxian 5a80cc8431
Datafeed support reading to cuda place directly. (#19071)
6 years ago
Zeng Jinle 88f111f885
remove unused inplace act codes, test=develop (#19079)
6 years ago
tianshuo78520a cfa6305303 Add check PADDLE_ENFORCE approval (#19088)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
Zhang Ting c2063217e7 optimize error message for "embedding" and "cross_entropy" OP (#18765)
6 years ago
Tao Luo 741ce8bb1a
inference_shared_library support profile (#16275)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
jiaqi a99bc64c63
add fleet util, add some interface in hdfs util (#18752)
6 years ago
mapingshuo 4ad7c9d5a7 [WIP] Add Imdb train demo (#18895)
6 years ago
tianshuo78520a 0b1025769c Add op_use_default_grad_op_maker.spec approval (#19035)
6 years ago
wangguanzhong e50f527fee
update roi doc in roi_pool and roi_align (#19036)
6 years ago
jiaqi fc038da749
fix QueueDataset queue size (#19016)
6 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Kaipeng Deng 1f46253d4a
fix natural exp decay doc. test=develop (#19025)
6 years ago
tianshuo78520a be3f469ad1 CI Add Reviewer Rules for large PRs (modify 20+ files or add 1000+ lines) (#19033)
6 years ago
Yiqun Liu a445c33552
Add the check of lod in sequence_softmax kernel. (#18996)
6 years ago
Zeng Jinle 2175d19993
fix memory_reuse_pass memory_size calculation error, test=develop (#19020)
6 years ago
tianshuo78520a de975be1ec change op_use_default_grad_op_maker.spec approval member (#19029)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Chen Weihang 81fe02c3fe
Fix config description error in cuda_profiler function document (#18750)
6 years ago
SunGaofeng 4da1c4f15d
fix g_param shape mismatch in WeightNormParamAttr (#18940)
6 years ago
liuwei1031 a43a763b54
fix warpctc.dll not found issue (#18761)
6 years ago
chengduo 01c7daade7
Add checking for the fetch_list of Executor.run (#18957)
6 years ago
pawelpiotrowicz e53f517a44 fix for multithreading test_analyzer_image_classification --num_threads=X (#18265)
6 years ago
flame 65d987527d
python inference enable_memory_optim(#18817)
6 years ago
silingtong123 fd3b666d8c test=develop,Synchronize the contents of develop with release1.5 (#18937)
6 years ago
Liufang Sang faf6890b6c support tensor input for ctc align op (#18887)
6 years ago
xsrobin 8ce902541c
fix unalign of some examples (#18943)
6 years ago
hutuxian b62c4f9b04
fix concat check info typo (#18975)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
Zhaolong Xing 3816d221ff
Fix the CE error which caused by paddle-trt version (#18941)
6 years ago
jiaqi 02c370c3dc
support filelist size < trainer num && fix pull dense (#18956)
6 years ago
chengduo e7da0940f9
Disable fuse optimization option (#18924)
6 years ago
Krzysztof Binias c2c876f718 Fix memory leak in test (#18622)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
jiaqi 768059b3a0
adjust ins weight according to nid slot (#18784)
6 years ago
Zeng Jinle 08fa98f7cc
Fix gpu_info PADDLE_ENFORCE_GT when fraction_of_gpu_memory_to_use=1.0 (#18950)
6 years ago
wawltor 3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique (#18720)
6 years ago
Jacek Czaja 5cf2d38594 - Removed passing X from FWD to GRAD via device context (#18911)
6 years ago
LielinJiang 22fa4c2d24 Fix depthwise conv gpu kernel bug (#18582)
6 years ago
Huihuang Zheng ea6ee76fa9
GPU allocation uses fraction of available memory (#18896)
6 years ago
liuwei1031 0d99690809
fix several security bugs reported by security team (#18831)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
chengduo 20859c08e8
[DyGraph] Make multi-card program faster (#18892)
6 years ago
HaoRen 24f8543106 Add center Loss Op Support (#18681)
6 years ago
lvmengsi d21c391447
replace paper link (#18861)
6 years ago
Leo Zhao 86e494eb64 use mkl to accelerate gelu_grad (#18099)
6 years ago
wopeizl dfd6a62a9a
Optimize the error report information when loadcombine fail to open model files test=develop (#18888)
6 years ago
baojun adcfc53b18 upgrade ngraph version and simplify ngraph engine (#18853)
6 years ago
whs 6cccab9203
Make lod_append support variable lod. (#18908)
6 years ago
Jacek Czaja cfcb96d2df [MKL-DNN] Fix int8 performance regression (#18758)
6 years ago
danleifeng e0a2d4dfec Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848)
6 years ago
Leo Zhao 10eeed93d1 Revert "use static variable to do cache instead of thread local in thread frequent switching case (#18428)" (#18879)
6 years ago
tianshuo78520a 6cd1b71208 add DEFINE_int32/DEFINE_bool/DEFINE_string flag (#18869)
6 years ago
Huihuang Zheng 0d3f16f53e
Try to modify external gflags to solve CI compilation (#18872)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Thunderbrook 52c1431eee
add clear_model interface in fleetwrapper (#18815)
6 years ago
Zeng Jinle 9a8a7a1ddc
fix affine_channel no_need buffer bug, test=develop (#18844)
6 years ago
lvmengsi 829ef26281
Fix drop deconv (#18813)
6 years ago
Huihuang Zheng cfce4994cf
Merge cuda 9/10 dockerfile with root dockerfile (#18693)
6 years ago
chengduo 4140fe11a4
Open fuse optimization ops (#18741)
6 years ago
Adam ee02227949 Add LeakyReLU MKLDNN support (#18762)
6 years ago
lidanqing b05bdda0cf remove unused TransposeINT8Op for higher UT coverage (#18791)
6 years ago
Zeng Jinle a802da650b
Feature/mem opt pass refactor (#18735)
6 years ago
Physher c5f47c2107 fix mul_mkldnn_op build failure (#18816)
6 years ago
Physher a5c986301c clarify MKLDNN INT8 Mul Op attributes (#18685)
6 years ago
FDInSky cff5e2c173 fix roi_align_op cpu backward's bug (#18789)
6 years ago
石晓伟 9dbb62eeb9
Fix examples of API (#18092)
6 years ago
chengduo 292dfbce63
fix build strategy doc (#18725)
6 years ago
fuyinno4 c167a4b4dd
Fix shrink-dense and add scale-datanorm (#18746)
6 years ago
Bai Yifan d3ac561d65
fix deformable_conv_op compile error, test=develop (#18793)
6 years ago
lidanqing 9ecd8ee789 change ComputeINT8 to template version to remove checking dst_datatype code (#18756)
6 years ago
JesseyXujin d9e7b5b5e9
fix bug of swish op formula,test=develop (#18772)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
whs 075e1cf78e
Add python API for appending LoD level (#18702)
6 years ago
chengduo 8259f1418f
Enhance backward process (#18700)
6 years ago
JesseyXujin 25c9b57bcd
Modify auc doc. Add output variable description, previously was the scalar type, now changed to the tuple type.test=develop (#18771)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
Thunderbrook d8396281ef
add slot to sparse table (#18686)
6 years ago
Jacek Czaja 95c1816ec0 [MKL-DNN] Extended LRN with reusing via Acquire API (#18675)
6 years ago
jiaqi d18aabb472
support patch data, add load_one_table, fix bug (#18509)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
wangchaochaohu 6b78e00da4
Cudnn convolution reconstruction (#18284)
6 years ago
Yi Liu 157211c4e1
supports distributed classification (#18690)
6 years ago
qingqing01 3429e65aa8
Fix CPU implementation of roi_align_op backward (#18728)
6 years ago
Tao Luo bd22453f20
Revert "Add LeakyRelu MKLDNN support (#18656)" (#18723)
6 years ago
tianshuo78520a 58469186c3 Change api approval people name (#18699)
6 years ago
whs 189b08dc0d
Make infer shape of pad2d support for input with negative dims in compile time. (#18695)
6 years ago
Bai Yifan 7e3963f295 add license, test=develop (#18709)
6 years ago
cjt222 ccf06a48b0
test=develop (#18701)
6 years ago
wangguanzhong 185b3acea1
fix clip_by_norm doc (#18688)
6 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
6 years ago
Jacek Czaja 0d8e6c9b8b MKL-DNN upgrade to 0.20 (#18370)
6 years ago
Adam d6b6a337a9 Add LeakyRelu MKLDNN support (#18656)
6 years ago
zhouwei25 772e09560e Optimize the content of error reporting information, print error code and official document web sites (#18671)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
hutuxian bb2f5d24a2
hash_op support int64 hash_size (#18674)
6 years ago
guru4elephant 5ed713d519
remove ctr reader, all functions are satisfied in dataset (#18672)
6 years ago
guru4elephant d714bf037c
remove async executor and add data_feed.proto to the deps of train demo (#18659)
6 years ago
Yang Zhang ce1ec33299
Add cuda implementation for `prelu` backward pass (#18633)
6 years ago
石晓伟 25d8079140
Fix Bitmain Predictor::Clone() (#18599)
6 years ago
Yihua Xu 97549a4f13 [CPU] Fix the compiling issue with AVX512F macro. (#18634)
6 years ago
baojun 256ba7cbb8 [NGraph] handle dim element 0 of ngraph op (#18568)
6 years ago
chengduo a6d468a265
fix PE fetch bug (#18644)
6 years ago
liuwei1031 759530966c
print out error code of cudaGetDeviceProperties if failed (#18643)
6 years ago
Jacek Czaja 71d883b8ef [MKL-DNN] Reimplemented pool2d mkl-dnn to use Acquire API (#18585)
6 years ago
chengduo f4ec7d54c8
fix bug of scatter op (#18640)
6 years ago
tianshuo78520a 112cf850b7 change pip install whl;test=develop (#18635)
6 years ago
guru4elephant ab57d3893e
make auc op compatible with 1 dim (#18551)
6 years ago
tianshuo78520a de22215c8f change const_cast error message (#18620)
6 years ago
Leo Zhao ff77dea969 not use transferscope cache in cpu case (#18578)
6 years ago
123malin b414645a65
fix #17430: int64类型的attr训练非预期 (#18264)
6 years ago
tangwei12 db212bb932
delete AllocatorFacade destructor (#18606)
6 years ago
Kevin 995d7d8600 Modify embedding_op input dtype to int64 (#18598)
6 years ago
kh2se2013 9ad57f2dfd 1)change to parallel mode on python coverage run (#18594)
6 years ago
Tao Luo 076f833110
add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580)
6 years ago
Hongyu Liu a20b2b43fc
fix cudnn lstm shape bug; test=develop (#18492)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
tianshuo78520a 1c10dac4f2 Add code example in CI (#18228)
6 years ago
Zeng Jinle be24e5b391
Clean unused code of dim and place (#18565)
6 years ago
Jacek Czaja 8869d7f735 Activations MKLDNN ops refactoring (#18191)
6 years ago
lujun b6d5c74f69
update dygraph api doc for web (#18550)
6 years ago
Yibing Liu b86234fc0b
Register fp16 for concat_op (#18563)
6 years ago
Physher 5e1220ef37 fix compile error which caused by gcc4.8 related commit;test=develop (#18567)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Physher 0caa08ea40 Add mkldnn int8 mul-op kernel (#17834)
6 years ago
LielinJiang 24d1c44a0c Fix roi_perspective_transform_op bug (#18522)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
石晓伟 1529154821
Support Bitmain Anakin (#18542)
6 years ago
tianshuo78520a 9b3d3b8387 Cancel jacquesqiao approval authority (#18538)
6 years ago
Leo Zhao ce38bb5341 use static variable to do cache instead of thread local in thread frequent switching case (#18428)
6 years ago
gongweibao 160ddc980c
Regroup fusion by date type. (#18496)
6 years ago
Tao Luo fe32879d2a
add mkldnn shapeblob cache clear strategy (#18513)
6 years ago
chengduo e576f2667b
update docker build (#18523)
6 years ago
zhaoyuchen2018 832d8191ff
Fix topk cannot handle 1D vector bug (#18466)
6 years ago
石晓伟 280a8784f7
Remove the obsolete cmake options (#18493)
6 years ago
LielinJiang 43e17c7951 Add distributions of normal and uniform (#18023)
6 years ago
bingyanghuang 3fe6bf5ee6 fix command line bug in int8v2 readme (#18507)
6 years ago
tensor-tang 4828a5e008
core remove pycpuinfo (#18479)
6 years ago
qingqing01 7ac4818a98
Refine Infershape in activation_op for double_grad. (#18485)
6 years ago
qingqing01 602cb6a5b4
Enhance linear_lr_warmup (#18463)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
chengduo 55baeceddb
Enhance execution error info (#18482)
6 years ago
石晓伟 047bba855b
Remove the obsolete cmake options (#18481)
6 years ago
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
Jiabin Yang f72ced8814
test=develop, fix docker with paddle nccl problem (#18451)
6 years ago
Tao Luo 3f3112ceb0
add shape_blob for cache mkldnn primitive (#18454)
6 years ago
Tao Luo d234aa02cd
add transfer_scope_cache unit-test (#18467)
6 years ago
zhoukunsheng 7c6f2350b9 support Tensor input for edit_distance op (#18162)
6 years ago
zhoukunsheng 26318544d2 support Tensor input for chunk_eval op (#18226)
6 years ago
zhoukunsheng 206c44e2a8 add unique kernel and op (#17557)
6 years ago
zhoukunsheng 71af72b1c2 upgrade hash op to support Tensor and LoDTensor input (#17998)
6 years ago
zhoukunsheng d3b3443d10 add ones_like op (#17388)
6 years ago
zhoukunsheng 67b48d7fe7 add size op (#17412)
6 years ago
Leo Zhao 8f5fffca0a rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id() (#18453)
6 years ago
Tao Luo 3123d18787
remove unused AnalysisPredictor::SetMkldnnThreadID() (#18444)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
tianshuo78520a 85b49d8473 fix the api.spec file does not get the class comment problem (#18439)
6 years ago
chengduo e0d8c6ac68
Add find_no_grad_vars in backward.py (#17942)
6 years ago
LielinJiang 449c7a9f98 Make roi_perspective_transform op return mask and transform matrix (#18371)
6 years ago
tensor-tang a3bc804f5f
fix mac ci random fail (#18430)
6 years ago
Michał Gallus 7023a86c3a Fix Pooling output scale (#18186)
6 years ago
Brian Liu 4bc2987d2f Fix bug in quantize kernel which cause crash in vgg16/19 model (#17964)
6 years ago
xsrobin 47e2ef38e9
add "import paddle.fluid as fluid" to examples lack of it
6 years ago
tianshuo78520a 92ecb305c2 test=develop (#18426)
6 years ago
hutuxian 8a39e5c110 update api format (#18413)
6 years ago
jiaqi 93a2b317f7
fix data feed ptr error (#18419)
6 years ago
tensor-tang ce7a024c6d
fix py-cpuinfo mac random fail (#18383)
6 years ago
Jie Fang 2b4ef509ea init custom black white list (#18377)
6 years ago
Leo Zhao 681d3553f1 Fix potential mkldnn concat/pool/conv kernel issues (#18393)
6 years ago
tianshuo78520a 052b044873 Fix mac build nproc command not found (#18362)
6 years ago
Zeng Jinle f5641000bb
Add a unittest to inplace elementwise_add (#18385)
6 years ago
Jiabin Yang 43f64a177e
Fix/program doc (#17908)
6 years ago
Jiabin Yang af874a1f1d
test=develop, fix multigpu hang on latest docker (#18379)
6 years ago
chengduo 871cc15e6a
Add is_compiled_with_cuda (#18356)
6 years ago
lujun fd6631ef2f
Fix dygraph show style (#18297)
6 years ago
HaoRen 9931bc64f5 add dependecy of collective_helper (#18365)
6 years ago
翟飞跃 19da59ed3f Remove all the code, API and doc of MKL-DNN INT8v1 (#18347)
6 years ago
chengduo 8ed33bf91f
Fix Bug-prone code of PE (#18354)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
tianshuo78520a cff2c2d83f add combine_avx_noavx build to dockerfile
6 years ago
kh2se2013 27fb9cad65 add WITH_COVERAGE option, default OFF (#17872)
6 years ago
Michał Gallus 8409693272 Reset DeviceContext after quantization warmup (#18182)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
lidanqing 5fd68ac154 some fixes for int8 mobilenet_ssd tester (#18112)
6 years ago
Jacek Czaja c2efdfd5bc [MKL-DNN] Extending reusing to Elementwise_add_mkldnn op (#18146)
6 years ago
qingqing01 9047ac687e
Simplify multi_box_head API in detection.py and remove assign op. (#18310)
6 years ago
Zeng Jinle 5826b72e06
Refine CUDAPlace error message. (#18343)
6 years ago
Tao Luo 3c9755bbb9
remove unused jemalloc option (#18314)
6 years ago
Yibing Liu 23941e43ec
Update lamb optimizer (#18333)
6 years ago
chengduo 135a59ed45
update reduce config (#18334)
6 years ago
tensor-tang 81ec538279
fix softrelu doc (#18324)
6 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
6 years ago
Qiao Longfei 0e08e91c18
optimize communicator merge sparse gradient test=develop (#18159)
6 years ago
chengduo e06c69c788
Fix default value of fluid.memory_optimize (#18295)
6 years ago
Zhaolong Xing 6978b2e48e
fix split and sampled softmax (#18280)
6 years ago
Yibing Liu f57ee3693b
Fix the bug of sequence_unpad op (#18290)
6 years ago
chengduo 5489216eba
Clean build strategy (#18148)
6 years ago
chengduo 14e1e165df
update alloc_continuous_space_for_grad_pass (#18287)
6 years ago
lujun 7e61baaa94
add Dygraph api to api.spec (#18235)
6 years ago
liuwei1031 a736c03b10
improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs (#18261)
6 years ago
flame fdf798f95a
fix double buffer example (#18169)
6 years ago
Bai Yifan 23b8b18e56
fix api doc example, test=develop (#18266)
6 years ago
xiaoting 2f0d68261c fix yolo_box example,test=develop (#18247)
6 years ago
songhao 6b3d96254d fix some bug when merge sparse embedding parameters, test=develop (#18223)
6 years ago
jiaqi 3f8031e256
dataset (#17973)
6 years ago
liuwei1031 5d54ed4a84
improve the doc of DataFeeder and default_main_program (#18241)
6 years ago
xiaoting b58bb80248 set src_idx > 0 for bilinear_interp_op (#18238)
6 years ago
wopeizl daa32d5383
fix package generation for inference test=develop (#18220)
6 years ago
Shuai Yuan 9a32dad811
[DOC] Fix comment code of API create_py_reader_by_data (#18193)
6 years ago
Hongyu Liu cefd0fb598
Fix slice op shape=-1 bug (#18107)
6 years ago
lijianshe02 ff4279e3b2
fix paddle.fluid.layers.io.open_files api doc bug test=develop (#18203)
6 years ago
chengduo 5588b923f3
Add multi process reader (#18115)
6 years ago
wangchaochaohu a9dc534f48
fix API example (#18153)
6 years ago
翟飞跃 de42fe8fd5 Change int8v2 CAPI unit test name and add log in the prediction stage (#18200)
6 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
6 years ago
zhoukunsheng 0569ff78fa Fix doc example for greater_equal, greater_than, less_equal, not_equal, rank, reduce_all, reduce_any, sign, where, diag (#18167)
6 years ago
Huihuang Zheng bbc292920c
Fix API example code (#18176)
6 years ago
翟飞跃 78441c5449 add mkldnn Int8v2 slim doc (#17909)
6 years ago
lvmengsi d658f1133b
Fix doc for transpose, conv3d and batch_norm. (#18035)
6 years ago
FlyingQianMM 944c3165ec
fix type error of std::pow in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h (#18152)
6 years ago
chengduo 25f3cd6486
Update execution_strategy option default value (#18183)
6 years ago
chengduo 4978db2c10
Remove nccl dep when the number of GPU is 1 (#18158)
6 years ago
Zeng Jinle 25ab23be28
Fix dygraph mem leak (#18082)
6 years ago
tensor-tang 1c6e560607
core replace x86cpu with py cpuinfo (#18151)
6 years ago
Zeng Jinle 6eec66a1b1
Fix py_reader iterable bug (#18108)
6 years ago
qingqing01 80d2e66f9e
Update backward appending stragety to support double backward and fix some bug. (#18104)
6 years ago
Wojciech Uss ca5642c850 unify FP32 vs. INT8 comparison tests output (#18111)
6 years ago
Wojciech Uss c26130f3a9 reuse C-API INT8 unit test application (#18077)
6 years ago
FlyingQianMM ff83655f7e
add detection output operator for supporting retinanet (#17896)
6 years ago
FlyingQianMM 0aee1f0074
add sigmoid focal loss operator for supporting retinanet (#17895)
6 years ago
FDInSky 9e4b9d9798 Update generate_proposal_labels_op to support CascadeRCNN. (#17200)
6 years ago
FlyingQianMM 9ed2f936f1
add target assign operator for supporting retinanet (#17893)
6 years ago
Huihuang Zheng 7faf095618
Sync Dockerfile change of PR#17889 (#18072)
6 years ago
Sylwester Fraczek accb132f0f fix slim int8 mkldnn multithreading issue (#18009)
6 years ago
tianshuo78520a 2e1d8cf7c8 add approval to requirements.txt
6 years ago
chengduo 24e988a471
Fix bug of scope_buffered_ssa_graph_executor (#18100)
6 years ago
Huihuang Zheng 3f55ab0f89
Modify format of GPU allocation failure log. (#18034)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
whs 354643d8d9
Add warning for cudnn warpctc kernel in CUDA9\CUDA10. (#18046)
6 years ago
qingqing01 e81756f1ba
Hidden paddle.fluid.layers.detection_map. (#18033)
6 years ago
Yiqun Liu 660c1a65f3
Optimize fused_elewise_activation_grad op. (#18041)
6 years ago
lidanqing 466254151a add Mobilienet ssd int8 analyzer tester (#18075)
6 years ago
石晓伟 42f12a4aca
fix ci test cmake test=develop (#18060)
6 years ago
chengduo b5a1c1463d
Update CPU_NUM config (#18059)
6 years ago
lidanqing f8ecc3de89 refactor the function ConvFwdPrimitiveDesc (#17897)
6 years ago
Michał Gallus 8462e2b805 Disable MKLDNN FC in Resnet50 test (#18030)
6 years ago
Wojciech Uss 78e932862c Added unit test for QAT FP32 & INT8 comparison (#17814)
6 years ago
tensor-tang 566bf2ec56
concat op support negative axis (#18045)
6 years ago
Yiqun Liu 7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
hutuxian f1d458daf0
add trainer_desc proto DEPS (#18019)
6 years ago
Guo Sheng a06b316b94
Fix GetExpectedKernelType of add_position_encoding_op (#17935)
6 years ago
tensor-tang 5c06bff222
combine noavx and avx package (#17889)
6 years ago
wawltor 8eb134c3c1
Fix scatter and gather op when has duplicate index (#17952)
6 years ago
lujun 75fcd29220
update load_error_info, test=develop (#18000)
6 years ago
石晓伟 04ea7cb069
modify the access level of anakin engine (#18015)
6 years ago
wawltor 2ae8decc90
test=develop (#17984)
6 years ago