Commit Graph

9293 Commits (175954d894f0f7158c4f062764589096c8da11e9)

Author SHA1 Message Date
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
ShenLiang 3132681e8a
add partial_sum op in contrib (#22292)
5 years ago
wangchaochaohu 611411b90e
Fusion group profile support (#22718)
5 years ago
ShenLiang e136661304
add partial_concat op in contrib (#22528)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Yibing Liu 6e7bfe30a6
register fp16 kernel for some ops (#22650) (#22696)
5 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
flame d97475d53b
fix CPU C inference API compile bug (#22702)
5 years ago
Huihuang Zheng adfa5b8354
Add PADDLE_ENFORCE to Check Sequence Length of RecurrentOp (#22673)
5 years ago
flame 74eb82de19
fix go api bug (#22669)
5 years ago
wangchaochaohu a089072c8b
fix the profile print error (#22665)
5 years ago
lidanqing d926214535
[UT coverage] improve the mul_mkldnn_op line coverage (#22408)
5 years ago
wangchaochaohu c65c6ae534
add flag to control profile level in python API (#22319)
5 years ago
123malin 00594c1c88
support dumping params/grads in transpiler mode (#22490)
5 years ago
Zhaolong Xing a06d75a280
[Paddle-TRT] Refine the error log about runtime batch and max_batch_size. (#22535)
5 years ago
Adam 608447bfd5
Update MKLDNN to v1.2 (#22521)
5 years ago
Adam ab610a34ff
transpose_mkldnn code change to meet Paddle standards (#22591)
5 years ago
Jiawei Wang 8f035fb637
Add TopK Op Grad CPU&GPU Kernel test=develop (#22628)
5 years ago
Steffy-zxf 90ee366653
update ops's unittest data type from float32 to float64 and shape over 100 (#22544)
5 years ago
flame f7eafca828
remove python inference warning (#22602)
5 years ago
Chen Weihang fe685cc185
fix enforce test error, test=develop (#22610)
5 years ago
Wilber 9a8203aa25
fix fc_lstm_fuse when multi sub-graph use same fc_bias. test=develop (#22551)
5 years ago
Chen Weihang 266106da75
Fix mismatch with plus sign in the line (#22588)
5 years ago
flame 1d503e6a9e
Golang inference API (#22503)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago
Yiqun Liu 96770f519e
Disable fusion_group for windows and mac in build_strategy. (#22549)
5 years ago
Zeng Jinle 08033c8634
fix traced layer with non persistable vars, test=develop (#22552)
5 years ago
Guo Sheng 31b5464632
Add support for dynamic_decode(while) training. (#22231)
5 years ago
tangwei12 b0675c8193
fix bug with compiledProgram (#22495)
5 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
5 years ago
Double_V 58d99247f4
support slice double grad, test=develop (#22166)
5 years ago
hutuxian 1a7962be97
Paddlebox about box_wrapper (#22497)
5 years ago
huzhiqiang 9e29d3ebed
【OpPorting Example】DEMO OF FIX COMPILE&RUNTIME LOD_EQUALITY (#22460)
5 years ago
yaoxuefeng 2235ee1a5e
multi-loss optimization by adding a DownpourOpt worker (#22025)
5 years ago
zhaoyuchen2018 54970444ce
Improve transpose performance with tile sm copy, test=develop (#22311)
5 years ago
Wilber a90fa54092
Compile without nccl deps. [1/2] (#22509)
5 years ago
guofei 3a59a7a11f
Make assign op support LoDTensorArray and modify while_loop API (#22309)
5 years ago
Zhaolong Xing 54a325a52f
[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483)
5 years ago
Wilber de009152a7 Compile without nccl deps. [2/2] (#22484)
5 years ago
Yiqun Liu 4b2227e958 Fix dismatch of std::max's arguments type on windows. (#22507)
5 years ago
Wilber 870f465887 fix test_fusion_seqpool_concat lod level between compile and runtime (#22488)
5 years ago
Zhong Hui a61d09527b
Fix the integer overflow problem of sequence2batch (#22479)
5 years ago
cc 197913ebe1
Add weight quantization in post_training_quanzitaion (#22445)
5 years ago
Yiqun Liu dcfb603897
Enable the detection of subgraph composed of grad ops (#21223)
5 years ago
Tao Luo 7c9ce097f1
refine reshape_op shape error message (#22480)
5 years ago
LielinJiang 2b1386b2b2
optimize performance of interpolate op (#22436)
5 years ago
wangchaochaohu 77dd0d97bb
use enum class to replace the usage of enum in some condition test=develop (#22464)
5 years ago
Yiqun Liu 44b45b9f07
Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
5 years ago