Commit Graph

4362 Commits (f57ee3693bfce72cac7f96bc3b9f52c712d688f6)

Author SHA1 Message Date
Yibing Liu f57ee3693b
Fix the bug of sequence_unpad op (#18290)
6 years ago
chengduo 5489216eba
Clean build strategy (#18148)
6 years ago
songhao 6b3d96254d
fix some bug when merge sparse embedding parameters, test=develop (#18223)
6 years ago
xiaoting b58bb80248
set src_idx > 0 for bilinear_interp_op (#18238)
6 years ago
Hongyu Liu cefd0fb598
Fix slice op shape=-1 bug (#18107)
6 years ago
翟飞跃 802ea50956
fix spelling errors (#17941)
6 years ago
FlyingQianMM 944c3165ec
fix type error of std::pow in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h (#18152)
6 years ago
Zeng Jinle 6eec66a1b1
Fix py_reader iterable bug (#18108)
6 years ago
qingqing01 80d2e66f9e
Update backward appending stragety to support double backward and fix some bug. (#18104)
6 years ago
FlyingQianMM ff83655f7e
add detection output operator for supporting retinanet (#17896)
6 years ago
FlyingQianMM 0aee1f0074
add sigmoid focal loss operator for supporting retinanet (#17895)
6 years ago
FDInSky 9e4b9d9798
Update generate_proposal_labels_op to support CascadeRCNN. (#17200)
6 years ago
FlyingQianMM 9ed2f936f1
add target assign operator for supporting retinanet (#17893)
6 years ago
chengduo 24e988a471
Fix bug of scope_buffered_ssa_graph_executor (#18100)
6 years ago
whs 354643d8d9
Add warning for cudnn warpctc kernel in CUDA9\CUDA10. (#18046)
6 years ago
Yiqun Liu 660c1a65f3
Optimize fused_elewise_activation_grad op. (#18041)
6 years ago
lidanqing f8ecc3de89
refactor the function ConvFwdPrimitiveDesc (#17897)
6 years ago
Wojciech Uss 78e932862c
Added unit test for QAT FP32 & INT8 comparison (#17814)
6 years ago
tensor-tang 566bf2ec56
concat op support negative axis (#18045)
6 years ago
Yiqun Liu 7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
Guo Sheng a06b316b94
Fix GetExpectedKernelType of add_position_encoding_op (#17935)
6 years ago
wawltor 8eb134c3c1
Fix scatter and gather op when has duplicate index (#17952)
6 years ago
lujun 75fcd29220
update load_error_info, test=develop (#18000)
6 years ago
wawltor 2ae8decc90
test=develop (#17984)
6 years ago
cjt222 871af28d6c
add deformable psroi pooling (#17827)
6 years ago
SunGaofeng 40885c225b
add unfold op (new op),test=develop (#17944)
6 years ago
Jacek Czaja 84bb45c054
[MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Tao Luo 53fd507bae
fix merge conflict of 'Remove attribute in Allocator::Allocate' and elementwise_add_mkldnn_op (#17949)
6 years ago
jerrywgz aab4d12c0e
refine GetExpectedKernelType in conat op, test=develop (#17934)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
Yibing Liu 33d1e56506
Enable seq_pool op to accept len 0 input (#17284)
6 years ago
Yihua Xu 9b5017366a
Fix the format issue when 'X' is not nchw. (#17833)
6 years ago
Hongyu Liu 8062bd510c
Reshape support tensor attribute (#17781)
6 years ago
Zeng Jinle 0a96ec699c
fix conv v7 workspace size limit error, test=develop (#17902)
6 years ago
Yihua Xu 14a32bf0c4
Fix the accuracy issue while using float precision to get the scale. (#17884)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
baojun e2c1b7c354
[NGraph] cache compiled function instead test=develop (#17845)
6 years ago
Zhaolong Xing ae576f3c68
fix: when use the load model from memory mode, the RAM occupy is high (#17788)
6 years ago
Zhaolong Xing 5efe8c7287
fix bug: the lod_tensor_to_array op will aplly a new var but not release when dong inference (#17856)
6 years ago
pawelpiotrowicz 39bc8a55a4
[NGraph] Enable ngraph layer_norm operator (#17599)
6 years ago
baojun a4c528a31c
[NGraph] some ngraph updates to enable bert (#17739)
6 years ago
baojun 7611208ab7
[NGraph] added gather_grad to ngraph test=develop (#17646)
6 years ago
jerrywgz 92d9bdfce2
fix api doc in slice op, test=develop (#17804)
6 years ago
Hongyu Liu dfec676270
expand op supprt tensor attribute (#17773)
6 years ago
Hongyu Liu 82358bfdc1
ont hot support tensor depth (#16972)
6 years ago
Brian Liu 7cfddf22c8
Optimize bilinear interpolate op with OpenMP (#17800)
6 years ago
wangchaochaohu c10157a5df
revise the cudnn conv choose algorithm to improve the performance(mask rcnn benchmark) (#17753)
6 years ago
mozga-intel 6a6bf597f7
[NGraph] Enable elementwise_div operator test=develop (#17515)
6 years ago