Commit Graph

15785 Commits (fe18cfdb4f22d56e48e9ee35cb390098b74d1dab)

Author SHA1 Message Date
flame fe18cfdb4f
hide with inference optim API (#17355)
6 years ago
Leo Chen 578a2f5da3
fix SplitLodTensor when batch_size = 0, test=develop (#19866)
6 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
6 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
Jie Fang d9db94d752
Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
6 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
6 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
6 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
6 years ago
Zeng Jinle 5eb381a3e2
refine reallocate of workspace size, test=develop (#19843)
6 years ago
石晓伟 71b2ed61bc
support MLU nums, test=develop (#19372)
6 years ago
Zeng Jinle 3f87464e9c
refine executor_gc_helper codes, test=develop (#19814)
6 years ago
LielinJiang 6d72a86b14
fix_roi_transform_bug (#19785)
6 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
6 years ago
Leo Chen 982e61f5ff
Update elementwise double grad to save gpu memory (#19509)
6 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
6 years ago
Adam dfdd73cbc0
Add MKLDNNhandlerT templatized class (#19801)
6 years ago
Zeng Jinle cabb9501bd
fix leaky_relu op when alpha is zero, test=develop (#19833)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
6 years ago
Thunderbrook 40c66f8df9
rm return in vfork (#19734)
6 years ago
Zhaolong Xing 110be57c1b
fix memory optimization type (#19781)
6 years ago
liym27 677e714425
fix pow op, support tensor for agument factor. (#19313)
6 years ago
liym27 bd89a27308
add tensor support for argument shape in reshape op; (#19268)
6 years ago
liym27 88628016b2
add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
6 years ago
liym27 e9e3c08777
fix expand op: (#19302)
6 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
6 years ago
Huihuang Zheng a0d80754c5
Add comments for CUDA Device Context Allocator related stuff (#19809)
6 years ago
Jiabin Yang cc311bdf95
Feature/add transform data dygraph (#19707)
6 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
6 years ago
Zeng Jinle 754fd57ed7
disable memory optimization passes when FLAGS_use_ngraph=True, test=develop (#19778)
6 years ago
翟飞跃 93c85c930a
Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
6 years ago
chengduo 8281497030
Fix warning info of build_strategy (#19805)
6 years ago
Zeng Jinle b34933d9ee
fix retry allocator bug, test=develop (#19794)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
Zeng Jinle 32b1151f5e
reduce default value of cudnn workspace size, test=develop (#19780)
6 years ago
zhongpu 52673956de
add kernel for squeeze_op, test=develop (#19656)
6 years ago
zhongpu 2a81c3679a
add kernel for unstack_op, test=develop (#19538)
6 years ago
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
6 years ago
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
6 years ago
tianshuo78520a 38f1c2fe28
change approve site (#19791)
6 years ago
Adam d4413a54bc
Add common CreateKey for mkldnn handlers (#19767)
6 years ago
Yihua Xu 0d6ea52958
Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774)
6 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
6 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
JesseyXujin e352467c1c
modify activation op API, delete use_cudnn args, test=develop, (#19758)
6 years ago
Jacek Czaja 9e4c958552
Refactoring activation mkldnn op (#19748)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
6 years ago