Commit Graph

16827 Commits (289edf3962f039394452bfccafcd70ce3c3dde0f)

Author SHA1 Message Date
ShenLiang c706ff20a3
fix conflict, test=develop (#23298)
5 years ago
ShenLiang 5223e2bbc4
Add a new DataFeed named PaddleBoxDataFeed (#23321)
5 years ago
Chen Weihang 75bd350710
Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171)
5 years ago
cc 9297f49e4b
[OP] Add randperm op (#23292)
5 years ago
Kaipeng Deng d223a24904
Fix inplace_abn compile error on Windows (#23464)
5 years ago
Tao Luo 0b583235f5
Revert "Solve the conflict of ops with the same name. (#23199)" (#23494)
5 years ago
wawltor 6577f91b74
Add the sum op to API 2.0, add some parameters for new api
5 years ago
石晓伟 36b82eae0e
refine the doc of paddle_api.h, test=develop (#23402)
5 years ago
WuHaobo c4d0305239
add tril op and triu op (#23469)
5 years ago
yongqiangma eb035f24d1
add unbind op (#23359)
5 years ago
zhangchunle fd9b7bdb3d
Op (FusedEmbeddingSeqPool) error message enhancement. (#23454)
5 years ago
Chen Weihang 16315d3d9e
Delete Ref & VectorRef and add GetDataSafely (#22997)
5 years ago
Zhen Wang abe3e6906d
Solve the conflict of ops with the same name. (#23199)
5 years ago
wawltor 0b092d05f1
Add the argmax op to API 2.0, and update some parameters
5 years ago
Leo Chen f297a33285
Dev/fix init flags (#23465)
5 years ago
Zhaolong Xing 6a23850a3f
add init value to varis in analysis config. (#23442)
5 years ago
wawltor 915341e3de
Add the zeros, ones, ones_like, zeros_like for api 2.0, test=develop (#23471)
5 years ago
Zhen Wang 56b50c97f8
Add allclose_op (#23335)
5 years ago
kinghuin 948c57d84b
move sin, sqrt, tanh, atan to paddle.tensor.math and add a new parameter "out" (#23387)
5 years ago
Chengmo a2e9af5663
Add Tdm child OP in contrib (#23241)
5 years ago
Wilber 9676ac1c5c
Add flip op. (#23255)
5 years ago
tianshuo78520a d8a21ef6f3
test=develop;fix error (#23467)
5 years ago
Feiyu Chan 81f1402f6c
Add functional convolutions in paddle.nn.functional (#23408)
5 years ago
Zhaolong Xing 70782e6379
[Inference doc]: refine paddle_api.h doc (#23354)
5 years ago
Feiyu Chan bcafe3179a
add MKL computation back to gelu's non-approximate part (#23420)
5 years ago
zhongpu dbfbd7eac4
support Exhaustive search in dygraph (#23415)
5 years ago
zhaoyuchen2018 01d7ccd4b6
Fix elementwise compile error, test=develop (#23381)
5 years ago
gongweibao 24a063f6ac
Add fleet checkpoint on local fs and remote fs(such as hdfs) for EDL (#22586)
5 years ago
Zeng Jinle 0c23e3ff4d
fix Tracer::NoGrad, test=develop (#23443)
5 years ago
channings a2e10930cf
update linspace, equal operators to API 2.0 (#23274)
5 years ago
zhaoyuchen2018 4fe9ca6959
improve elementwise performance. (#23405)
5 years ago
wangchaochaohu 5c60778731
polish the code of fusion group test=develop (#23370)
5 years ago
Leo Chen a62599a888
[feature] prune program by feed and fetch_list automatically (#22474)
5 years ago
Chen Weihang 7f1ad510bd
Add op inout check macro to simplify error message writing (#23430)
5 years ago
Yiqun Liu bc2981e998
Disable test_code_generator and test_post_training_quantization_mobilenetv1 (#23440)
5 years ago
Zeng Jinle 29337f4e17
fix conflict of inferne partial feed with gpu parallel ssa graph executor, test=develop (#23400)
5 years ago
Pei Yang 7e439780d9
add full paddle_analysis_config.h APIs. (#23215)
5 years ago
zhongpu bfb07aafe8
Revert "Exhaustive search (#22821)", test=develop (#23401)
5 years ago
liym27 b7b0b3595b
Add unittest for transformer prediction in dygraph_to_static (#23207)
5 years ago
xujiaqi01 93ea9dd27a
fix stat var in hogwild worker (#23367)
5 years ago
joanna.wozna.intel 8c463700e1
Add default pass attributes (#23042)
5 years ago
zhongpu 48144e4099
Exhaustive search (#22821)
5 years ago
Adam da7c73f847
Delete is_test attribute from activation operators (#23318)
5 years ago
Kaipeng Deng 21d95be0db
Add inplace abn op (#22806)
5 years ago
Yi Liu 821534efd3
add paralell_executor dependancy to collective_helper (#23380)
5 years ago
Zeng Jinle 3a21980b78
add reader dependency pass, test=develop (#23301)
5 years ago
wangchaochaohu 69e3f99362
refine the error message (#23212)
5 years ago
石晓伟 5c59d2139e
reverts the commit 23177, test=develop (#23363)
5 years ago
wangchaochaohu d280106007
Add support for attr type Op and add fill_constant Op and scale Op (#23163)
5 years ago
xujiaqi01 3a45767d49
add fleet pslib pull and push sparse op and push dense op (#23139)
5 years ago
songyouwei 99d30bfc36
speedup slice impl (#23340)
5 years ago
Zhaolong Xing 1a6ce8b910
add swish split gelu plugin dynamic support (#23305)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
Yi Liu 0471476a18
fix nccl comm double free bug (#23344)
5 years ago
wangchaochaohu 1ee2a9a424
Profiler refine (#23294)
5 years ago
Leo Chen 488b2387e2
Feature/expand params in auto-generated pybind functions for dygraph operators (#23181)
5 years ago
GaoWei8 20eed5401a
Change fluid.layers.where‘s C++ operator name (#23250)
5 years ago
Yi Liu 2169e6fb58
Initialize global nccl_comm in PE (#23275)
5 years ago
Jacek Czaja 012886df79
[DNNL] Softmax mkldnn op inplace support (#23197)
5 years ago
石晓伟 75ebb48a91
supports thread-binding stream, test=develop (#23177)
5 years ago
石晓伟 708ded584e
pause the io_utils_test of int64 and resume after repair, test=develop (#23234)
5 years ago
Zeng Jinle babda94c8a
Distinguish public/private global vars (#23269)
5 years ago
zhaoyuchen2018 58615a6272
Improve elementwise performance. (#23001)
5 years ago
Wojciech Uss f836c8aa8f
add check for scales and a message (#23119)
5 years ago
Zeng Jinle 8bfd62ffb7
Expose dygraph.grad api (#23124)
5 years ago
Wilber 0129f4b568
Add some inference API comments for AnalysisPredictor (#23242)
5 years ago
Tao Luo c00d427d52
simplify the cmake log of ir/CMakeLists.txt (#23262)
5 years ago
Zeng Jinle 77b4dc80c9
code polish for adding const qualifier, test=develop, test=document_fix (#23248)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
xujiaqi01 68ea1ad55b
add clear one table (#23089)
5 years ago
danleifeng ae3bb16d06
add MaskAucCalculator in paddlebox (#23157)
5 years ago
liym27 6af480ca33
Support int64 for op assign_value. test=develop (#23179)
5 years ago
Zeng Jinle 53e6f8e1da
rename macro, test=develop (#23161)
5 years ago
Zeng Jinle bba740710d
add cuda resource pool for BufferedReader, test=develop (#23152)
5 years ago
Zeng Jinle 7d8d50b6cc
rename no_need_buffer_vars macro, test=develop (#23160)
5 years ago
Liufang Sang a486a739e1
fix compile error in win gpu (#23196)
5 years ago
Zeng Jinle 7ca77a90ac
add Tensor::IsSharedBufferWith method, test=develop (#23175)
5 years ago
Zeng Jinle b8886bf122
rename no_need_buffer_vars_macro, test=develop (#23159)
5 years ago
Zeng Jinle bae5930ba1
fix graph attr copy issues, test=develop (#23191)
5 years ago
wangchaochaohu b721e23b25
transpose cudnn using cudnn v7 api (#19738)
5 years ago
Pei Yang 46b8d282dc
Add some inference API comments for AnalysisConfig (#23117)
5 years ago
Adam 4f5e4540f8
Improve SGD jit code to work with large data (#23120)
5 years ago
Liufang Sang 4db031902d
add dequantize_log_op and make pyramid hash support int8 weight (#22548)
5 years ago
Zeng Jinle e5fef8f38a
[Dygraph double grad]Code polish (#23121)
5 years ago
Zeng Jinle 9258e96094
fix read op comments, test=develop, test=document_fix (#23122)
5 years ago
Zeng Jinle acfc9b8a70
Reader sequential and inference partial feed (#22699)
5 years ago
Wilber 95b356a069
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114)
5 years ago
Zeng Jinle a31d7328b7
Add dygraph double grad implementation (#22939)
5 years ago
Yiqun Liu 3af4771122
Add the detection and code-generation of sqrt and square in fusion_group (#23095)
5 years ago
hutuxian 0c30098f8b
Add need_save_delta parameter to solve OOM (#23097)
5 years ago
songyouwei 2e2da7124b
high-performance dygraph slice (#22879)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Zhaolong Xing 8c6fde9e69
fix align error (#23090)
5 years ago
Liufang Sang 915b892a15
Fix div zero in fake quantize op (#22966)
5 years ago
Yi Liu 121b2aed4d
initialize global nccl context in dygraph (#23037)
5 years ago
Zhang Ting 880eb04d93
skip PrepareData when it is unnecessary (#22839)
5 years ago
Feiyu Chan 01ab8a0619
add approximation for gelu, test=develop (#22961)
5 years ago
Adam 5842ae6785
Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)" (#22985)
5 years ago
Pei Yang 24db750386
fix trt int8 calib precision bug. test=develop (#23036)
5 years ago
GaoWei8 1dc1f9270e
Fix lod error of concat op for axis = 0 (#22538)
5 years ago
yaoxuefeng 660ff18488
fix datsset test=develop (#23043)
5 years ago
Zhang Ting 714b0076b6
Override GetKernelTypeForVar to avoid device transform, test=develop (#23032)
5 years ago
wangchaochaohu 112e3edbf6
fix the conv group problem test=develop (#23025)
5 years ago
Wilber db40ee86db
fix unittets. test=develop (#23018)
5 years ago
wangchaochaohu 99db0cf762
remove debug log test=develop (#22994)
5 years ago
wangchaochaohu 3757e0687c
Add Unittest for backward of fusion group (#22932)
5 years ago
chengjuntao 63f3ada7b9
fix bug which input shape (#22965)
5 years ago
Zhang Ting 137d6563fc
add check for assigned data, test=develop (#22960)
5 years ago
wangchaochaohu f0d193a23c
Cast fusion for fusion group (#22876)
5 years ago
yaoxuefeng 29a7a52d38
Fix instag (#22632)
5 years ago
wangchaochaohu c979c9f2b0
refine the profiler print test=develop (#22968)
5 years ago
Wilber ff3ddbb502
add skip_layernorm pass. test=develop (#22895)
5 years ago
wawltor f154d5860f
Speed up the matmul op, use the gemm replace the batch gemm (#22926)
5 years ago
Adam 056edf3929
Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
guofei 3d8571e884
modify assign op and add unittest of assign op (#22769)
5 years ago
Zeng Jinle d33c4343e1
Imperative tracer refactoring (#22457)
5 years ago
liu zhengxi 61fef9754b
Fix fc padding bug during inference fusion (#22860)
5 years ago
tangwei12 ad9c8f6d2d
fix communicator when break under pyreder mode (#22911)
5 years ago
mapingshuo 5ba9dfc16a
add lookup_table_dequant_op (#22900)
5 years ago
zhaoyuchen2018 a020a25797
Fix model int8 quant fail, test=develop (#22891)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
tangwei12 07e13b84cd
remove vlog, test=develop (#22898)
5 years ago
Zhang Ting ca9c8b417d
fix compute ratio of profile, test=develop (#22872)
5 years ago
wangchaochaohu dbb0b9b3b6
refine the profiler print (#22823)
5 years ago
Michał Gallus 0038bfbd1d
Prevent loading of warmup data in analyzer_int8 if enable_int8 is set to false (#22857)
5 years ago
Chen Weihang 1644926a6c
Polish detail implement of dygraph data loader (#22878)
5 years ago
Wilber f686310d81
fix concat_mkldnn op. test=develop (#22692)
5 years ago
hong 5191e54494
reduce default attrs for dynamic graph (#22850)
5 years ago
Zhaolong Xing 1a533ed2de
[BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848)
5 years ago
hong c736fef93b
dygraph backward engine accelerate (#22808)
5 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
5 years ago
石晓伟 1861ca88f1
serialize the PaddleTensor, test=develop (#22810)
5 years ago
Zhang Ting 72ff5a09c3
fix print bug of profile, test=develop (#22804)
5 years ago
Zhang Ting 4e8bc02461
add fluid.device_guard to specify the device type for Op (#22254)
5 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
5 years ago
Zhen Wang 89cfa49156
Unmerged fetch list (#22635)
5 years ago
wangchaochaohu 8456c3f4dd
polish the profiler_help code (#22811)
5 years ago
zhongpu 2fd1ec1e3e
fix docker build for paddle openblas, test=develop (#22795)
5 years ago
Chen Weihang 7d8d573453
Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541)
5 years ago
liu zhengxi 324f2b3922
Fix inference c api PD_GetZeroCopyOutput lod (#22768)
5 years ago
wangchaochaohu 7578fcbac4
Profile code refine (#22800)
5 years ago
hutuxian 53a2b68f4e
support customized download command in dataset (#22782)
5 years ago
wangchaochaohu ca9e77a8d4
add sum op support for fusion group (#22771)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
Kaipeng Deng ebc7ffc300
fix detection_map. test=develop (#22705)
5 years ago
zhaoyuchen2018 72dde4abde
Refine adam op to improve performance, test=develop (#22346)
5 years ago
wangguanzhong f2d1cd119a
fix lod level, test=develop (#22755)
5 years ago
FlyingQianMM 79d712346f
Correct CPU gradients of the argsort op (#22739)
5 years ago
Adam 2b80e9a719
Add cpu_info without XBYAK (#22716)
5 years ago