Commit Graph

8321 Commits (f86fead6938efc8735412bd3489dc17a609e373c)

Author SHA1 Message Date
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
Zhang Ting c2063217e7 optimize error message for "embedding" and "cross_entropy" OP (#18765)
6 years ago
Tao Luo 741ce8bb1a
inference_shared_library support profile (#16275)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
jiaqi a99bc64c63
add fleet util, add some interface in hdfs util (#18752)
6 years ago
mapingshuo 4ad7c9d5a7 [WIP] Add Imdb train demo (#18895)
6 years ago
wangguanzhong e50f527fee
update roi doc in roi_pool and roi_align (#19036)
6 years ago
jiaqi fc038da749
fix QueueDataset queue size (#19016)
6 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Kaipeng Deng 1f46253d4a
fix natural exp decay doc. test=develop (#19025)
6 years ago
Yiqun Liu a445c33552
Add the check of lod in sequence_softmax kernel. (#18996)
6 years ago
Zeng Jinle 2175d19993
fix memory_reuse_pass memory_size calculation error, test=develop (#19020)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Chen Weihang 81fe02c3fe
Fix config description error in cuda_profiler function document (#18750)
6 years ago
SunGaofeng 4da1c4f15d
fix g_param shape mismatch in WeightNormParamAttr (#18940)
6 years ago
liuwei1031 a43a763b54
fix warpctc.dll not found issue (#18761)
6 years ago
chengduo 01c7daade7
Add checking for the fetch_list of Executor.run (#18957)
6 years ago
pawelpiotrowicz e53f517a44 fix for multithreading test_analyzer_image_classification --num_threads=X (#18265)
6 years ago
flame 65d987527d
python inference enable_memory_optim(#18817)
6 years ago
silingtong123 fd3b666d8c test=develop,Synchronize the contents of develop with release1.5 (#18937)
6 years ago
Liufang Sang faf6890b6c support tensor input for ctc align op (#18887)
6 years ago
xsrobin 8ce902541c
fix unalign of some examples (#18943)
6 years ago
hutuxian b62c4f9b04
fix concat check info typo (#18975)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
Zhaolong Xing 3816d221ff
Fix the CE error which caused by paddle-trt version (#18941)
6 years ago
jiaqi 02c370c3dc
support filelist size < trainer num && fix pull dense (#18956)
6 years ago
chengduo e7da0940f9
Disable fuse optimization option (#18924)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
jiaqi 768059b3a0
adjust ins weight according to nid slot (#18784)
6 years ago
Zeng Jinle 08fa98f7cc
Fix gpu_info PADDLE_ENFORCE_GT when fraction_of_gpu_memory_to_use=1.0 (#18950)
6 years ago
wawltor 3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique (#18720)
6 years ago
Jacek Czaja 5cf2d38594 - Removed passing X from FWD to GRAD via device context (#18911)
6 years ago
LielinJiang 22fa4c2d24 Fix depthwise conv gpu kernel bug (#18582)
6 years ago
Huihuang Zheng ea6ee76fa9
GPU allocation uses fraction of available memory (#18896)
6 years ago
liuwei1031 0d99690809
fix several security bugs reported by security team (#18831)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
chengduo 20859c08e8
[DyGraph] Make multi-card program faster (#18892)
6 years ago
HaoRen 24f8543106 Add center Loss Op Support (#18681)
6 years ago
lvmengsi d21c391447
replace paper link (#18861)
6 years ago
Leo Zhao 86e494eb64 use mkl to accelerate gelu_grad (#18099)
6 years ago
wopeizl dfd6a62a9a
Optimize the error report information when loadcombine fail to open model files test=develop (#18888)
6 years ago
baojun adcfc53b18 upgrade ngraph version and simplify ngraph engine (#18853)
6 years ago
whs 6cccab9203
Make lod_append support variable lod. (#18908)
6 years ago
Jacek Czaja cfcb96d2df [MKL-DNN] Fix int8 performance regression (#18758)
6 years ago
danleifeng e0a2d4dfec Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848)
6 years ago
Leo Zhao 10eeed93d1 Revert "use static variable to do cache instead of thread local in thread frequent switching case (#18428)" (#18879)
6 years ago
Huihuang Zheng 0d3f16f53e
Try to modify external gflags to solve CI compilation (#18872)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Thunderbrook 52c1431eee
add clear_model interface in fleetwrapper (#18815)
6 years ago