9323 Commits (1644926a6c6ada630092318eabf7acf211a8f44d)

Author SHA1 Message Date
Chen Weihang 1644926a6c
Polish detail implement of dygraph data loader (#22878)
6 years ago
Wilber f686310d81
fix concat_mkldnn op. test=develop (#22692)
6 years ago
hong 5191e54494
reduce default attrs for dynamic graph (#22850)
6 years ago
Zhaolong Xing 1a533ed2de
[BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848)
6 years ago
hong c736fef93b
dygraph backward engine accelerate (#22808)
6 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
6 years ago
石晓伟 1861ca88f1
serialize the PaddleTensor, test=develop (#22810)
6 years ago
Zhang Ting 72ff5a09c3
fix print bug of profile, test=develop (#22804)
6 years ago
Zhang Ting 4e8bc02461
add fluid.device_guard to specify the device type for Op (#22254)
6 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
6 years ago
Zhen Wang 89cfa49156
Unmerged fetch list (#22635)
6 years ago
wangchaochaohu 8456c3f4dd
polish the profiler_help code (#22811)
6 years ago
Chen Weihang 7d8d573453
Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541)
6 years ago
liu zhengxi 324f2b3922
Fix inference c api PD_GetZeroCopyOutput lod (#22768)
6 years ago
wangchaochaohu 7578fcbac4
Profile code refine (#22800)
6 years ago
hutuxian 53a2b68f4e
support customized download command in dataset (#22782)
6 years ago
wangchaochaohu ca9e77a8d4
add sum op support for fusion group (#22771)
6 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
6 years ago
Kaipeng Deng ebc7ffc300
fix detection_map. test=develop (#22705)
6 years ago
zhaoyuchen2018 72dde4abde
Refine adam op to improve performance, test=develop (#22346)
6 years ago
wangguanzhong f2d1cd119a
fix lod level, test=develop (#22755)
6 years ago
FlyingQianMM 79d712346f
Correct CPU gradients of the argsort op (#22739)
6 years ago
Adam 2b80e9a719
Add cpu_info without XBYAK (#22716)
6 years ago
guofei ae8b5f11a3
Change ShareDataWith() to TensorCopy() in ref_by_trainer_id (#22717)
6 years ago
liu zhengxi 71ab0458e1
Fix pointer and c-api encapsulation (#22663)
6 years ago
Leo Chen b2c1be851a
support cond in clone, test=develop (#22657)
6 years ago
Zhang Ting f97f3f9301
add framework overhead ratio in profile report (#22590)
6 years ago
chengjuntao 15c2667143
register fp16 for assign op (#22744)
6 years ago
dyning 1c0653462d
fix generate_mask_labels lod level (#22743)
6 years ago
GaoWei8 ba140222d6
fix compile&runtime lod_equality of lod_reset (#22737)
6 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
6 years ago
ShenLiang 3132681e8a
add partial_sum op in contrib (#22292)
6 years ago
wangchaochaohu 611411b90e
Fusion group profile support (#22718)
6 years ago
ShenLiang e136661304
add partial_concat op in contrib (#22528)
6 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
6 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
6 years ago
Yibing Liu 6e7bfe30a6
register fp16 kernel for some ops (#22650) (#22696)
6 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
6 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
6 years ago
flame d97475d53b
fix CPU C inference API compile bug (#22702)
6 years ago
Huihuang Zheng adfa5b8354
Add PADDLE_ENFORCE to Check Sequence Length of RecurrentOp (#22673)
6 years ago
flame 74eb82de19
fix go api bug (#22669)
6 years ago
wangchaochaohu a089072c8b
fix the profile print error (#22665)
6 years ago
lidanqing d926214535
[UT coverage] improve the mul_mkldnn_op line coverage (#22408)
6 years ago
wangchaochaohu c65c6ae534
add flag to control profile level in python API (#22319)
6 years ago
123malin 00594c1c88
support dumping params/grads in transpiler mode (#22490)
6 years ago
Zhaolong Xing a06d75a280
[Paddle-TRT] Refine the error log about runtime batch and max_batch_size. (#22535)
6 years ago
Adam 608447bfd5
Update MKLDNN to v1.2 (#22521)
6 years ago
Adam ab610a34ff
transpose_mkldnn code change to meet Paddle standards (#22591)
6 years ago
Jiawei Wang 8f035fb637
Add TopK Op Grad CPU&GPU Kernel test=develop (#22628)
6 years ago