Commit Graph

16697 Commits (03deb41d736bea9c8d593b11d9aa541a056d250a)

Author SHA1 Message Date
Leo Chen aaa4fe491a
use function instead of lambda, test=develop (#22348)
5 years ago
Adam e7a9f6bbb7 [Bugfix] Preserve shape in inpalce operators (#22360)
5 years ago
qingqing01 2d20869c94 Fix infer_shape in compling for elementwise_op (#22291)
5 years ago
Yiqun Liu b7cac50b64
Implement a common python unittest to test the ir passes. (#22209)
5 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
5 years ago
wangchaochaohu 1e932eccfa
remove unused code test=develop (#22327)
5 years ago
Leo Chen 3e5744aa65
Remove unused inputs for some operators (#22284)
5 years ago
zhangchunle 805328e13b fix typo in error message (#22312)
5 years ago
lidanqing 895f8da7d6 change std::cout to log(INFO), vlog (#22316)
5 years ago
石晓伟 8cb04664b9
revert paddle_fluid.map, test=develop (#22236)
5 years ago
Chen Weihang 35efbe6d95
Speeding up dygraph DataLoader with multiprocessing (#21762)
5 years ago
Zeng Jinle 9435533adf
remove op_use_default_grad_op_maker.spec, test=develop, test=document_fix (#22300)
5 years ago
wangchaochaohu 7b76a76495
fix the conda build confilict test=develop (#22279)
5 years ago
Zeng Jinle 5e601a92ad
polish grad op check (#22290)
5 years ago
Bai Yifan faba4b116a
Remove disable flag in test_fsp_op.py (#22171)
5 years ago
Zhen Wang e40cfb1010
fix the bug of assert_is_op_output. test=develop (#22262)
5 years ago
Wojciech Uss d3a6647372 improve placement pass tests code coverage (#22197)
5 years ago
liu zhengxi 07afc29e90
Make api.cc malloc consistent with paddle_api.h for PaddleBuf (#22255)
5 years ago
silingtong123 4f1da4adcb remove the useless third_party library from C++ inference library (#22021)
5 years ago
zhouwei25 549e6de7ac faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
5 years ago
xujiaqi01 e3a457d34b
add collective communication library in fleet (#22211)
5 years ago
Zhen Wang f2522e91c4 fix the type error caused by setting bool attr in OpDesc. test=develop (#22257)
5 years ago
songyouwei 0ba1d140d4 Add CI check for sequence ops' unittests (#21615)
5 years ago
Zeng Jinle 1b76e789cf
remove cuda allocator ctor, test=develop (#22212)
5 years ago
Adam 9942d9ed5c Add caching mechanizm to requantize_mkldnn_op (#22223)
5 years ago
Wilber 1230c110cb
[fluid-lite] adjust to relative error (#22232)
5 years ago
123malin 985bceac53
Bug fix for sparse recorder (#21969)
5 years ago
Chen Weihang fc0b21e17b
Polish fetch error message of parallel executor (#22206)
5 years ago
Wojciech Uss 2e90c4eb0a improve mkldnn_quantizer_config test code coverage (#22216)
5 years ago
Wilber 5750152e80
support fluid-lite subgraph run resnet test=develop (#22191)
5 years ago
wangchaochaohu 621d3e0b66
fix the bug of profile update (#22207)
5 years ago
FlyingQianMM 443a713c9e
add backward gradient computation for op argsort (#22203)
5 years ago
Zhen Wang 46189b166d Add bn and relu fuse pass (#22048)
5 years ago
zhouwei25 2f3e2a84af fix ci rule to show Shell variables (#22177)
5 years ago
baojun 298ee7d28a Improve ngraph file line coverage (#22155)
5 years ago
zhongpu d0f0a2520c test Optimizer in dygraph (#21949)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
joanna.wozna.intel 5b2e98aa17 Add multiple quantize operators fuse (#22062)
5 years ago
Yiqun Liu 96980c2244
Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144)
5 years ago
wangchaochaohu c3876cf82d
add support for nested profiling event and printing in different level (#22061)
5 years ago
Zeng Jinle c3bcd3c1e2
fix dygraph non zero gpu bug, test=develop (#22165)
5 years ago
zhaoyuchen2018 3d4f2aa689
Refine stack op to improve xlnet performance, test=develop (#22142)
5 years ago
zhongpu cf475f95df Remove FC in dygraph, modify FC to Linear in sample code (#22082)
5 years ago
liu zhengxi 64a4044292
add double register op_data_type of pad2d and fix compile error, test=develop (#22075)
5 years ago
Liu Xudong 7ba7acd197 Add coverage tools (#21975)
5 years ago
Double_V 6ea3809143 Support prroi_pool_op with Tensor and LoDTensor rois (#20649)
5 years ago
Pei Yang d8a9b134e3
fix trt instance_norm serialize bug. test=develop (#22152)
5 years ago
zhongpu cc1a9f4238 fix sample code in paddle/fluid/imperative/README.md (#22141)
5 years ago
Zeng Jinle 4c2df8e4d4
fix allocator strategy comment, test=develop, test=document_fix (#22121)
5 years ago
bingyanghuang 7872d06ff4 Add explanation on conv grad for dims<3 (#22125)
5 years ago
liu zhengxi 724b13e459
fix xception precision problem, test=develop (#22124)
5 years ago
Yiqun Liu b1401fb74d Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094)
5 years ago
Pei Yang 50bee83f71
add TRT support for instance_norm op (#21928)
5 years ago
zhaoyuchen2018 3dbd4087fe
Fix windows build not kernel issue, test=develop (#22105)
5 years ago
Chengmo 418abc92f4
Update pyramid related OP (#21372)
5 years ago
bingyanghuang 4b4a9cc88f fix format in operator.cc (#22101)
5 years ago
Feiyu Chan 14aebc7a95
add erf op (#21785)
5 years ago
Chen Weihang ba8414d3a5
replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109)
5 years ago
silingtong123 6c20e7c4e6 test=develop, remove unused parameter from class RuntimeInferShapeContext constructors (#22046)
5 years ago
Double_V fab4b0765a support elu_op double grad (#21822)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Jacek Czaja b0b27ff699 [MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088)
5 years ago
Huihuang Zheng dd4361568e
Add ParallelExecutor Test for Cond API and Fix PE Checks Shape Bug (#22029)
5 years ago
Zeng Jinle 9587249442
polish allocator strategy doc, test=develop, test=document_fix (#22095)
5 years ago
Zeng Jinle d9f5d1eb29
ag allocator by default, test=develop (#21837)
5 years ago
123malin 7fb817d447
add distributed_strategy (#21710)
5 years ago
Jacek Czaja ad8a9cb82c [MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747)
5 years ago
Kaipeng Deng 34c57120eb polish cross_entropy ENFORCE (#22056)
5 years ago
SunAhong1993 7f4abaf2f5
register int/int64_t/float16 in pow/square kernel,test=develop (#22023)
5 years ago
Leo Chen 3f653c8323
register NoNeedBufferVarsInference for max_pool_grad_op, test=develop (#22055)
5 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
5 years ago
Michał Gallus 6192108408 [DNNL] 3D Fully-Connected (#21746)
5 years ago
FDInSky aa2ed0dcc6 fix generate_proposal_labesl op (#21793)
5 years ago
ceci3 95d79b6d00
update error log for batch_norm_grad (#22017)
5 years ago
Aurelius84 c53b62eb8e
fix integer overflow in match_matrix (#22036)
5 years ago
Chen Weihang 2e9082250d
polish default error msg & cublas error hint, test=develop (#22032)
5 years ago
wangchaochaohu 64baee4144
polish code test=develop (#22014)
5 years ago
Chen Weihang 35ff1568e9 Add error message for cublas inItizalize failed (#21995)
5 years ago
Chen Weihang fbb42173a9
fix no hint problem when use ENFORCE for cuda, test=develop (#21994)
5 years ago
zhouwei25 e66f92d1ae Modify demo_ci to support Windows, prepare for PR_Windows_Inference (#21873)
5 years ago
danleifeng b7697f6218 fix broadcast bug;test=develop (#21898)
5 years ago
liu zhengxi 196e20dfbb
Fix multi-threads memory out of bounds error for passes (#21920)
5 years ago
zhaoyuchen2018 8859ddd6cf
Refine multihead kernel, align block to 32 (#21961)
5 years ago
silingtong123 fd9b00df4b test=develop, remove unused variable (#21974)
5 years ago
zhoushiyu cee2ccb078
add shuffle batch op (#21674)
5 years ago
mapingshuo c3e1954918
make reverse op support negative axis (#21925)
5 years ago
石晓伟 03479469a7
fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
5 years ago
wangchaochaohu de9ba01f11
add conda build python script test=develop (#21943)
5 years ago
Aurelius84 10d6846900
Remove double registered dataType in Pad2d (#21942)
5 years ago
zhouwei25 2df4be5d35 Fix openblas bug to support compile on windows when WITH_MKL=OFF (#21902)
5 years ago
hutuxian 27decacb8a
fix aucop stat shape (#21846)
5 years ago
Pei Yang 3e5008ad01
fix trt calib not working bug, test=develop (#21934)
5 years ago
Aurelius84 5cb2c74127
add register op_data_type of pad/expand_as et.al (#21718)
5 years ago
qingqing01 2066745847
Pack imperative/layer into paddle_framework.so (#21921)
5 years ago
hong 30d000f8c2
fix matmul error message; test=develop (#21885)
5 years ago
zhouwei25 a01663ca1f remove patch command and file of cares to Improved quality of Paddle Repo (#21776)
5 years ago
flame 2bbc0d7d60
python zero copy inference, delete pass (#21897)
5 years ago
Aurelius84 51a86d2b6b Optimize adam speed (#21777)
5 years ago
Leo Chen 310edc0d0c
Update layers used in ptb model to use auto-generated op functions in dygraph mode (#21724)
5 years ago
lidanqing 9dff56e8e2 change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
5 years ago
FDInSky 6b9fbcf3ad Update iou_similarity op to support non-normalized bbox (#21671)
5 years ago
guofei 46f9184aff Modify the while_loop API (#21844)
5 years ago
Guo Sheng 7689b6aaa4
Fix default label dim of label_smooth_op. test=develop (#21862)
5 years ago
zhouwei25 13e4756f18 change ci check rule of deleting unit-test (#21876)
5 years ago
GaoWei8 d4dda8628e optimize fc jit (#21878)
5 years ago
zhouwei25 013225bb68 fix Execution order of ci_check_unittest, and add it to Linux_py35 (#21640)
5 years ago
Chen Weihang 2b941736f3 fix softmax_with_cross_entropy_fix bug, test=develop (#21810)
5 years ago
Thunderbrook c3cf42d0f7
add table id in cache shuffle (#21585)
5 years ago
Michał Gallus 253e664275 Disable memory opt pass when DNNL is on (#21826)
5 years ago
Chengmo a86f11b5f5
Speed GEO dense calc & communication (#21579)
5 years ago
Wojciech Uss 666c3bb9b0 handle multi-inputs with empty inputs for mkldnn_concat_op (#21827)
5 years ago
Zeng Jinle aa4d6a5d6c
Add some debug flags to auto growth allocator (#21766)
5 years ago
guofei 8b7c50f49a Make While Op could run on GPU place and add while_loop unittest (#21672)
5 years ago
WangXi 17299b8d21 fix batch_norm_grad infer shape=0 & add allreduce enforce shape, test=develop (#21801)
5 years ago
Huihuang Zheng 557bce77da
Fix Backward Bugs in Conditional Block (#21809)
5 years ago
xujiaqi01 0eb4d990c4
fix compiled error when with_pslib=on (#21769)
5 years ago
Huihuang Zheng 0677a1c1c1
Fix That conditional_block_op Doesn't Have InferShape (#21733)
5 years ago
zhaoyuchen2018 a5a8d14414
Fix softmax cuda bug (#21720)
5 years ago
Kaipeng Deng 943a44492b
yolo_box OP add Attr(clip_bbox). (#21620)
5 years ago
Michał Gallus a5159d8480 Re-anble vgg and resnet101 models download (#21713)
5 years ago
Leo Chen 7181afd75c
Fix elementwise_pow bug on CUDA place with integer (#21675)
5 years ago
石晓伟 2bb135825e
fix analysis_predictor when func is called multiple times, test=release/1.6 (#21665)
5 years ago
lidanqing d3a96632fa Add fc-dequantize squash in cpu_quantize_squash_pass for ernie model (#21714)
5 years ago
Chen Weihang 1fd1f06f11 Rename paddle throw error macro (#21657)
5 years ago
WangXi 8754cbd1f2 fix std::min type in nan_inf, test=develop (#21725)
5 years ago
Leo Chen fbe3ac217e
polish cmake, test=develop (#21681)
5 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
5 years ago
WangXi 8a0f611b64 Rewrite check nan inf tools (#21076)
5 years ago
tangwei12 9ad940fdfe
memory leak for cpu (#21174)
5 years ago
Zhaolong Xing fbbd94a6ce
there is bug for inference using auto grwoth allocator (#21621)
5 years ago
Zeng Jinle 73461a7ae6
Make OperatorWithKernel::InferShape abstract (#21633)
5 years ago
mapingshuo 686f0ecb6a
add `no_need_buffer_slots` interface to pybind (#21575)
5 years ago
Zeng Jinle 6828f3684b
fix op_registry, add ignore op_function_impl.h, test=develop (#21654)
5 years ago
GaoWei8 5af0c7ba89 Modify padding strategy: remove weight copy in fc padding (#21650)
5 years ago
Chen Weihang d96acc3363
Refine dygraph DataLoader implementation (#21634)
5 years ago
wangchaochaohu 5eec8cf5af
fix the mean grad OP performance improvement test=develop (#21658)
5 years ago
Zeng Jinle 29f64c8c9e
refine some grad op makers, test=develop (#21629)
5 years ago
mapingshuo e2d849b989
Dropout with seed (#21590)
5 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
5 years ago
rensilin 7f5d532a9c fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461)
5 years ago
xujiaqi01 f404157205
fix master patch when slot is dense (#21580)
5 years ago
xujiaqi01 c05706fe73
fix code style of fleet_wrapper (#21639)
5 years ago
wangchaochaohu 95b95a284b
Mean gpu optimize (#21643)
5 years ago
Leo Chen 48600d7f17
Add op function generator for dygraph (#21569)
5 years ago
lidanqing fbf9eca0d3 QAT Int8 document (#21360)
5 years ago
liym27 be6a639655
Add CI for checking Input/Output/Attr of modified Ops (#21522)
5 years ago
Leo Chen 4f81d1bd5f
Refine VarBase init function (#21587)
5 years ago
Leo Chen 84b7267100
dygraph_grad_maker supports varbase without grad_var (#21524)
5 years ago
xujiaqi01 88960684aa
rm optimize_for in framework.proto (#21571)
5 years ago
Zeng Jinle 0f8888360e
Polish op registry codes (#21561)
5 years ago
Aurelius84 3d9dee575e
Set lod_level of Out in compile time of sequence_pool_op (#21604)
5 years ago
zhouwei25 346705967d monitoring changes of unittest, delete one unittest will need approve (#21377)
5 years ago
Zeng Jinle 97e76cb96d
refine dev_ctx.Wait() exception throw, test=develop (#21600)
5 years ago
Huihuang Zheng 1dcf6a7212
Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532)
5 years ago
hutuxian c5aec2fe68
Paddlebox Related to Framework (#21586)
5 years ago
liym27 9da7e6b4d4
add file check_op_desc.py and add interface to get default value. (#21530)
5 years ago
Jacek Czaja 8f5a93a07b - Fix to regression in performance of ResNet-50 training (#21588)
5 years ago
Jacek Czaja 9ce0e29dc3 [MKL-DNN] Batch norm mkl-dnn NHWC support (#21553)
5 years ago
Zeng Jinle 3a7caf481c
add grad maker assert, test=develop (#21564)
5 years ago
Huihuang Zheng b241c7329c
Refine a Warning Which Can Occur Not Only During Init (#21546)
5 years ago
Pei Yang 20d61414b4
fix glog warning, test=develop (#21573)
5 years ago
wangchaochaohu 932aca162d
Add Branch to avoid CPU profiler warning print (#21556)
5 years ago
Leo Chen cdd46d7e02
Split VarBase from Python Variable for Dygraph (#21359)
5 years ago
Youwei Song cdba41af4d dygraph Embedding layer use lookuptable v2 (#21209)
5 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
5 years ago
wangchaochaohu 4c9b3dafa7
fill_constant_batch_size_like OP precious problem fix (#21337)
5 years ago
Aurelius84 fa7cff1fee
Add CI for checking registered data_type of new Op (#21488)
5 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
5 years ago
WangXi 768f9242e9 Fix dgc clip & rampup step, test=develop (#21491)
5 years ago
hong 0b75a0c10b add overrider for virtual function to avoid warning (#21503)
5 years ago
Aurelius84 54382ce497
Add get_all_kernels api of registered data_type in pybind.cc (#21499)
5 years ago
Zeng Jinle 3662fb71a7
remove eval() calls in Eigen, test=develop (#21498)
5 years ago
Jacek Czaja 18a5d30754 [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
5 years ago
GaoWei8 250a192181 Add ernie large c++ inference test (#21365)
5 years ago
zhongpu 6ebf0f47b8 support SelectedRows in dygraph, test=develop (#21078)
5 years ago
Tao Luo 70eb397677
remove unused snappy/snappystream depends in distributed codes (#21484)
5 years ago
lilong12 0bc8bdf724
set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402)
5 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
5 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
5 years ago
tangwei12 0bddb951c2
fix async mode, test=develop (#21367)
5 years ago
Zeng Jinle 81ef8b7f8f
Fix CI DefaultGradOpMaker check (#21482)
5 years ago
Huihuang Zheng a71f53d7ac
Add warning message when initialize GLOG failed. (#21487)
5 years ago
Leo Chen b3090ad406
fix synchronization problem in softmax_with_cross_entropy_op, test=develop (#21480)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
5 years ago
hutuxian 7e68bc896b
refactor AUC OP and add its CUDA Kernel (#21336)
5 years ago
wawltor dbbe6e9cb6
fix the device supported of the op unique and unique_with_counts. (#21395)
5 years ago
wangchaochaohu d4776ec027
fix the correctness of memcpy profiling result test=develop (#21458)
5 years ago
wangguanzhong 379e3febf2
fix shape check in density_prior_box, test=develop (#21414)
5 years ago
Adam 76b55da15a Fix bug in UpdatePadding for int64_t type (#21465)
5 years ago
Pei Yang 7b28d938bf
show shape diff in wrong trt input shape errmsg, test=develop (#21451)
5 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
5 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
5 years ago
Chen Weihang 664f958a02
Fix optimizer op infershape failed in dygraph multi-cards mode (#21374)
5 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
5 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
5 years ago
Leo Chen add62acfd1
remove kDepXOut for abs_grad op, test=develop (#21407)
5 years ago
Chen Weihang 407f883f5a
Add SelectedRows support for dygraph DebugString (#21415)
5 years ago
Adam 9107bf209f Add template version of UpdatePadding (#21426)
5 years ago
zhaoyuchen2018 b16274556a
Add dscending for argsort (#21400)
5 years ago
Zeng Jinle 6b09b73e17
add explicit conversion to NoNeedBufferVarsFunctor, test=develop (#21430)
5 years ago
hong ac8546701d
Add dygraph execution context (#20157)
5 years ago
hutuxian a6b089c614
add macro to ban windows (#21422)
5 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
5 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
5 years ago
tianshuo78520a d624b417d8 change make nproc on Cloud Integration (#21350)
5 years ago
wangchaochaohu 8293f21a52
Profile refine (#21258)
5 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
5 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
5 years ago
ShenLiang e2c6f434ec Add Lod information for gather_nd & scatter_nd (#21404)
5 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
5 years ago
wangchaochaohu e0e205ea2d
fix the profiling bug test=develop (#21396)
5 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
Zhaolong Xing d1a6e112e6 fix C++ multicard inference bug. (#20955)
5 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
5 years ago
Youwei Song d5ff79e55e Support numpy bridge (enabled by default in dygraph mode) (#20983)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
5 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
5 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
5 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
5 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
5 years ago
Zeng Jinle dbba9c7e4b
polish global_value_getter_setter, test=develop (#21332)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
silingtong123 45c1e7bb7b add prediction demo and script on windows (#21248)
5 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
5 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
5 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
5 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
5 years ago
Chen Weihang 952508527a
Polish some PE code details (#21274)
5 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
5 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
5 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
5 years ago
Thunderbrook 349e82d669
support general embedding params (#21217)
5 years ago
liu zhengxi 3cb6c0a059
Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
5 years ago