Commit Graph

4806 Commits (b97fc16d219071de47636acd4c627046d9f1cf23)

Author SHA1 Message Date
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
6 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
6 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
6 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
6 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
6 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
6 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
6 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
6 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
6 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
6 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
6 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
6 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
6 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
6 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
6 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
6 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
6 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
6 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
6 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
6 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
6 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
6 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
6 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
6 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
6 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
6 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
6 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
6 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
6 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
6 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
6 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
6 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
6 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
6 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
6 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
6 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
6 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
6 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
6 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
6 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
6 years ago
ceci3 f62a929151
fix instance norm (#21042)
6 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
6 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
6 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
6 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
6 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
6 years ago