Commit Graph

16384 Commits (17299b8d217c0872408cc9146a58f0769d8b05ba)

Author SHA1 Message Date
Youwei Song cdba41af4d dygraph Embedding layer use lookuptable v2 (#21209)
5 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
5 years ago
wangchaochaohu 4c9b3dafa7
fill_constant_batch_size_like OP precious problem fix (#21337)
5 years ago
Aurelius84 fa7cff1fee
Add CI for checking registered data_type of new Op (#21488)
5 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
5 years ago
WangXi 768f9242e9 Fix dgc clip & rampup step, test=develop (#21491)
5 years ago
hong 0b75a0c10b add overrider for virtual function to avoid warning (#21503)
5 years ago
Aurelius84 54382ce497
Add get_all_kernels api of registered data_type in pybind.cc (#21499)
5 years ago
Zeng Jinle 3662fb71a7
remove eval() calls in Eigen, test=develop (#21498)
5 years ago
Jacek Czaja 18a5d30754 [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
5 years ago
GaoWei8 250a192181 Add ernie large c++ inference test (#21365)
5 years ago
zhongpu 6ebf0f47b8 support SelectedRows in dygraph, test=develop (#21078)
5 years ago
Tao Luo 70eb397677
remove unused snappy/snappystream depends in distributed codes (#21484)
5 years ago
lilong12 0bc8bdf724
set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402)
5 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
5 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
5 years ago
tangwei12 0bddb951c2
fix async mode, test=develop (#21367)
5 years ago
Zeng Jinle 81ef8b7f8f
Fix CI DefaultGradOpMaker check (#21482)
5 years ago
Huihuang Zheng a71f53d7ac
Add warning message when initialize GLOG failed. (#21487)
5 years ago
Leo Chen b3090ad406
fix synchronization problem in softmax_with_cross_entropy_op, test=develop (#21480)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
5 years ago
hutuxian 7e68bc896b
refactor AUC OP and add its CUDA Kernel (#21336)
5 years ago
wawltor dbbe6e9cb6
fix the device supported of the op unique and unique_with_counts. (#21395)
5 years ago
wangchaochaohu d4776ec027
fix the correctness of memcpy profiling result test=develop (#21458)
5 years ago
wangguanzhong 379e3febf2
fix shape check in density_prior_box, test=develop (#21414)
5 years ago
Adam 76b55da15a Fix bug in UpdatePadding for int64_t type (#21465)
5 years ago
Pei Yang 7b28d938bf
show shape diff in wrong trt input shape errmsg, test=develop (#21451)
5 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
5 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
5 years ago
Chen Weihang 664f958a02
Fix optimizer op infershape failed in dygraph multi-cards mode (#21374)
5 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
5 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
5 years ago
Leo Chen add62acfd1
remove kDepXOut for abs_grad op, test=develop (#21407)
5 years ago
Chen Weihang 407f883f5a
Add SelectedRows support for dygraph DebugString (#21415)
5 years ago
Adam 9107bf209f Add template version of UpdatePadding (#21426)
5 years ago
zhaoyuchen2018 b16274556a
Add dscending for argsort (#21400)
5 years ago
Zeng Jinle 6b09b73e17
add explicit conversion to NoNeedBufferVarsFunctor, test=develop (#21430)
5 years ago
hong ac8546701d
Add dygraph execution context (#20157)
5 years ago
hutuxian a6b089c614
add macro to ban windows (#21422)
5 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
5 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
5 years ago
tianshuo78520a d624b417d8 change make nproc on Cloud Integration (#21350)
5 years ago
wangchaochaohu 8293f21a52
Profile refine (#21258)
5 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
5 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
5 years ago
ShenLiang e2c6f434ec Add Lod information for gather_nd & scatter_nd (#21404)
5 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
5 years ago
wangchaochaohu e0e205ea2d
fix the profiling bug test=develop (#21396)
5 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
Zhaolong Xing d1a6e112e6 fix C++ multicard inference bug. (#20955)
5 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
5 years ago
Youwei Song d5ff79e55e Support numpy bridge (enabled by default in dygraph mode) (#20983)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
5 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
5 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
5 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
5 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
5 years ago
Zeng Jinle dbba9c7e4b
polish global_value_getter_setter, test=develop (#21332)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
silingtong123 45c1e7bb7b add prediction demo and script on windows (#21248)
5 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
5 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
5 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
5 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
5 years ago
Chen Weihang 952508527a
Polish some PE code details (#21274)
5 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
5 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
5 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
5 years ago
Thunderbrook 349e82d669
support general embedding params (#21217)
5 years ago
liu zhengxi 3cb6c0a059
Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
5 years ago
Yiqun Liu 6b1e1f0dda
Enable generating code for a given subgraph. (#21126)
5 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
5 years ago
Pei Yang 2e2f92a5b1
fix trt weight bug (#21231)
5 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
5 years ago
zhouwei25 c0dcb090a3 Determine whether to copy and link inference lib by ON_INFER (#20931)
5 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
5 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
5 years ago
Zeng Jinle a152315be7
refine Tensor method, test=develop (#21031)
5 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
5 years ago
Zeng Jinle 67e88424e5
Polish jit trace codes (#21218)
5 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
5 years ago
liuwei1031 d8b6cf2bcd
fix sporadically hang issue on windows(#21201)
5 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
5 years ago