Commit Graph

15826 Commits (bdb3e376d07e2eece98710e0dde567e0b1940597)

Author SHA1 Message Date
whs bdb3e376d0
[PaddleSlim] Enhence compressor api in PaddleSlim (#19894)
5 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
5 years ago
Kaipeng Deng 14625ffe9e
add elementwise mod support float/double. test=develop (#19570)
5 years ago
Jacek Czaja 5b07ca9cdd - ReImplemented pooling fwd mkldnn (#19911)
5 years ago
Zeng Jinle b1e83b33b0
fix huber loss op attr type, test=develop (#19937)
5 years ago
Zeng Jinle cc157d5990
add inplace to assign op, test=develop (#19927)
5 years ago
chengduo 55ce696986
clean tensor array (#19930)
5 years ago
Leo Chen 57606205f5 Make OpTest check grad inplace even if forward has no inplace (#19847)
5 years ago
Zhang Ting cb8f3c03a7 resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914)
5 years ago
mapingshuo 9901f69677
Forward recompute3 (#19913)
5 years ago
chengduo d7251a8e1e
Delete local execution scopes (#19749)
5 years ago
wopeizl 5452b6a152
remove the useless warning for user to avoid confuse test=develop (#19871)
5 years ago
ruri d31c92a2cd
add mse_loss (#19759)
5 years ago
hong 85b398f171
Add op compatible information (#19910)
5 years ago
Kaipeng Deng 3f021781a1
fix softmax CE time limit check failed (#19846)
5 years ago
Tao Luo a4919d3688
move tree_conv to fluid.contrib.layers (#19918)
5 years ago
石晓伟 30adea0a23
tensor_array_to_tensor_op.cc, test=develop (#19289)
5 years ago
Zeng Jinle 0436efd6a3
Unify DataLoader APIs (#19305)
5 years ago
lvmengsi 4155e62559
add instance norm (#19500)
5 years ago
Zeng Jinle c7f36e7c00
Add lock to cudnn handle calls (#19845)
5 years ago
pawelpiotrowicz 2c5c636514 Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840)
5 years ago
Adam cb65439da8 Add support for other axes in MKLDNN softmax op (#19907)
5 years ago
Jiabin Yang 454254115e
Feature/auto prune in dygraph (#19757)
5 years ago
Aurelius84 418a0967f3
move match_matrix var_conv2d et.al api into fluid.contrib test=develop (#19859)
5 years ago
Pei Yang baccd7e2ca
Add TRT input shape check between model and runtime (#19864)
5 years ago
Pei Yang 74812d1c90
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825)
5 years ago
Zeng Jinle 747d44980a
Refine err msg of out of gpu memory (#19779)
5 years ago
Aurelius84 fcf53e55ff
support 2-level lod of input in sequence_pool (#19839)
5 years ago
Zeng Jinle b25d1e758d
remove enforce.h file written, test=develop (#19897)
5 years ago
Zhang Ting 93364b45c1 group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614)
5 years ago
Huihuang Zheng e117114289
Set states of recurrent op as dependent vars in prune (#19865)
5 years ago
石晓伟 d004a0f50e
fix multi-thread exec of trt, test=develop (#19338)
5 years ago
Zeng Jinle b754700fb5
fix reduce and broadcast to avoid multi-stream, test=develop (#19889)
5 years ago
Zeng Jinle 8359b415e4
add free chunks to auto growth allocator, test=develop (#19890)
5 years ago
Jacek Czaja 619c797a7f [MKL-DNN] LRN refactoring (#19798)
5 years ago
Zhang Ting 439d95e157 modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287)
5 years ago
Zhang Ting b38889413d add crop_tensor_op, test=develop, test=document_preview (#19314)
5 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
5 years ago
joanna.wozna.intel 3f1d0234ae Fix conv2d+dequantize squash for residual fusion (#19545)
5 years ago
Huihuang Zheng a35557d8f4
Fix deps of prune (#19876)
5 years ago
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
5 years ago
flame fe18cfdb4f
hide with inference optim API (#17355)
5 years ago
Leo Chen 578a2f5da3 fix SplitLodTensor when batch_size = 0, test=develop (#19866)
5 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
5 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
5 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
5 years ago
Jie Fang d9db94d752 Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
5 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
5 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
5 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
5 years ago