Commit Graph

25186 Commits (63dd3183bcb89dbf1bf0501c70f3ded69a6feb09)
 

Author SHA1 Message Date
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
5 years ago
flame fe18cfdb4f
hide with inference optim API (#17355)
5 years ago
Leo Chen 578a2f5da3 fix SplitLodTensor when batch_size = 0, test=develop (#19866)
5 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
5 years ago
gongweibao e8d3745c0f
change _origin_program test=develop (#19863)
5 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
5 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
5 years ago
WangXi 8c2c8dc626 distribute.launch use poll to query subprocess (#19853)
5 years ago
chengduo 8e9273272f Disable test_dygraph_mnist_fp16.py (#19844)
5 years ago
Jie Fang d9db94d752 Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
5 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
5 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
5 years ago
Huihuang Zheng 3e1e1fee26
Return correct currrent block of a var (#19850)
5 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
5 years ago
Zeng Jinle 5eb381a3e2
refine reallocate of workspace size, test=develop (#19843)
5 years ago
石晓伟 71b2ed61bc
support MLU nums, test=develop (#19372)
5 years ago
Bai Yifan e2c6bada36 Support dispensable student_loss in PaddleSlim distillation (#19824)
5 years ago
Zeng Jinle 3f87464e9c
refine executor_gc_helper codes, test=develop (#19814)
5 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
5 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
5 years ago
Leo Chen 982e61f5ff Update elementwise double grad to save gpu memory (#19509)
5 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
5 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
5 years ago
Zeng Jinle cabb9501bd
fix leaky_relu op when alpha is zero, test=develop (#19833)
5 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
5 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
5 years ago
Thunderbrook 40c66f8df9
rm return in vfork (#19734)
5 years ago
chengduo b99fc38cec
Add fp16 support for dygraph (#19828)
5 years ago
Zhaolong Xing 110be57c1b
fix memory optimization type (#19781)
5 years ago
Leo Chen 5fbf03d6b4 Enhance OpTest to support double grad inplace check (#19826)
5 years ago
xujiaqi01 6045541e89
fix libps.so path problem (#19768)
5 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
5 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
5 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
5 years ago
liym27 e9e3c08777 fix expand op: (#19302)
5 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
5 years ago
Huihuang Zheng a0d80754c5
Add comments for CUDA Device Context Allocator related stuff (#19809)
5 years ago
Jiabin Yang cc311bdf95
Feature/add transform data dygraph (#19707)
5 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
5 years ago
Zeng Jinle 754fd57ed7
disable memory optimization passes when FLAGS_use_ngraph=True, test=develop (#19778)
5 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
5 years ago
chengduo 2729c17446
Fix example error of Variable and Operator (#19821)
5 years ago
chengduo 8281497030
Fix warning info of build_strategy (#19805)
5 years ago
ruri a0e9b7b9a2
add unittest for square error cost op (#19746)
5 years ago
Zeng Jinle b34933d9ee
fix retry allocator bug, test=develop (#19794)
5 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
5 years ago
zhouwei25 b5a5d93bbe fix the dependencies of third party and inference lib (#19684)
5 years ago
Zeng Jinle 32b1151f5e
reduce default value of cudnn workspace size, test=develop (#19780)
5 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
5 years ago
zhongpu 2a81c3679a add kernel for unstack_op, test=develop (#19538)
5 years ago