Commit Graph

15829 Commits (ebff68fa74c3f278b97326fec56d775a94323623)

Author SHA1 Message Date
wangchaochaohu 47af618f70
Strided slice (#19642)
5 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
5 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
5 years ago
Zeng Jinle 5eb381a3e2
refine reallocate of workspace size, test=develop (#19843)
5 years ago
石晓伟 71b2ed61bc
support MLU nums, test=develop (#19372)
5 years ago
Zeng Jinle 3f87464e9c
refine executor_gc_helper codes, test=develop (#19814)
5 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
5 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
5 years ago
Leo Chen 982e61f5ff Update elementwise double grad to save gpu memory (#19509)
5 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
5 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
5 years ago
Zeng Jinle cabb9501bd
fix leaky_relu op when alpha is zero, test=develop (#19833)
5 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
5 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
5 years ago
Thunderbrook 40c66f8df9
rm return in vfork (#19734)
5 years ago
Zhaolong Xing 110be57c1b
fix memory optimization type (#19781)
5 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
5 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
5 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
5 years ago
liym27 e9e3c08777 fix expand op: (#19302)
5 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
5 years ago
Huihuang Zheng a0d80754c5
Add comments for CUDA Device Context Allocator related stuff (#19809)
5 years ago
Jiabin Yang cc311bdf95
Feature/add transform data dygraph (#19707)
5 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
5 years ago
Zeng Jinle 754fd57ed7
disable memory optimization passes when FLAGS_use_ngraph=True, test=develop (#19778)
5 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
5 years ago
chengduo 8281497030
Fix warning info of build_strategy (#19805)
5 years ago
Zeng Jinle b34933d9ee
fix retry allocator bug, test=develop (#19794)
5 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
5 years ago
Zeng Jinle 32b1151f5e
reduce default value of cudnn workspace size, test=develop (#19780)
5 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
5 years ago
zhongpu 2a81c3679a add kernel for unstack_op, test=develop (#19538)
5 years ago
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
5 years ago
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
5 years ago
tianshuo78520a 38f1c2fe28 change approve site (#19791)
6 years ago
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
6 years ago
Yihua Xu 0d6ea52958 Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774)
6 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
6 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
JesseyXujin e352467c1c
modify activation op API, delete use_cudnn args, test=develop, (#19758)
6 years ago
Jacek Czaja 9e4c958552 Refactoring activation mkldnn op (#19748)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
6 years ago
chengduo e506c99c20
Open fuse broadcast option (#18833)
6 years ago
Jacek Czaja 47f670d58c - Softmax mkl-dnn refactoring (#19615)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
chengduo 5866a7a5fe
Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
6 years ago
Youwei Song 3e5fb6361b fix api-doc error for dygraph and backward (#19721)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Zeng Jinle bb4f8dee83
add logs to left var memory size, test=develop (#19722)
6 years ago
Adam 428b2b9e17 MKLDNN handler cleanup (#19713)
6 years ago
XiaoguangHu 27235cf222
Add document annotations for FLAGS that need to be open to external developers test=develop (#19692)
6 years ago
Zeng Jinle 1c25c88aba
refine memory usage of some operators, test=develop (#19700)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
yaoxuefeng c6756ed225 fix instag op (#19591)
6 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
6 years ago
Zeng Jinle 713c05dd60
refine tensor.mutable_data, test=develop (#19680)
6 years ago
Chen Weihang c78a4781bf
Fix train error when test_program.clone is executed after optimizer.minimize (#19397)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Zeng Jinle a7691603a5
add gpu_allocator_try_time config, test=develop (#19675)
6 years ago
JesseyXujin 0b06db9413
delete transmission args in linear_chain_crf op (#19619)
6 years ago
Tao Luo f05d2c519d paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
6 years ago
Zeng Jinle 2db40d9f60
reduce thread num of retry_allocator_test,test=develop (#19638)
6 years ago
wangchaochaohu 4440d7ced0
test=develop cuda realization of label smooth op (#19175)
6 years ago
chengduo 31c5a5ee26 Remove linear_chain_crf_op.cu (#19645)
6 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
123malin 2f037c3189
fix the diff between async mode and async_half mode (#19535)
6 years ago
Jiabin Yang e9233d1c1e Refactor dygraph (#19107)
6 years ago
mapingshuo dca9b6c5b0 add feed_var_names to Prune interface (#19589)
6 years ago
tangwei12 f45cb1c2ca
fix bug of communicator flag, test=develop (#19635)
6 years ago
Yiqun Liu 42b5bec6f9
Integrate NVRTC to support compiling CUDA kernel at runtime (#19422)
6 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
Leo Chen af692c9140 update reduce_sum and reduce_mean to save memory, test=develop (#19608)
6 years ago
tensor-tang e3e98ed678
fix scope lock bug on infer (#19624)
6 years ago
Aurelius84 6364ebc4dd
Add distributions of Categorical and MultivariateNormal (#18263)
6 years ago
Zeng Jinle 710767d894
Enable inplace support for some ops (#19612)
6 years ago
FDInSky a18cf5e119 add a argument for softshrink python api (#19396)
6 years ago
Tao Luo d6c85c96dc
paddle::framework::vectorize() templatization (#19627)
6 years ago
danleifeng 8672e15363 elementwise broadcast function enhancement (#19536)
6 years ago
Chen Weihang 8cb54ede8c
Add user-friendly error message in optimizer ops to give a hint about the position sensitive problem of run(startup_program) (#19605)
6 years ago
zhongpu 118bb897cf add kernel for flatten_op, test=develop (#19472)
6 years ago
Tao Luo 0a46d34538
refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607)
6 years ago
baojun a3a4b6e570 Enable ngraph through build_strategy (#19266)
6 years ago
ShenLiang 2cd3fa3e9a add scatter_nd op and scatter_nd_add op (#19571)
6 years ago
wawltor 364c44422e
Add the support the int64 data type of `scatter_op` input Index(#18804) (#19508)
6 years ago
Adam 8d6d95cc2b paddle::framework::vectorize() templatization (#19611)
6 years ago
Tao Luo 75d1571995
refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
zhongpu cc443675e9 modify paddle_build.sh for Paddle python3 runtime image generation, test=develop (#19218)
6 years ago
Adam e94b26daf5 using MKLDNNMemoryFormat = mkldnn::memory::format changes (#19568)
6 years ago
Zeng Jinle e045aadf9a
fix retry_allocator_test by removing glog envs, test=develop (#19596)
6 years ago
baojun f2ad30c4dd Some ngraph op and unittest fix (#19515)
6 years ago
Tao Luo 49523ea189
replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586)
6 years ago
gongweibao abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
6 years ago