889 Commits (a32e8bf1e7fb45b9bae85e80fe7742eae8739fac)

Author SHA1 Message Date
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
ceci3 52be62c5ae
fix instance norm in dy (#24717)
5 years ago
gongweibao 80f1c50738
Fix typo in interface. (#24779)
5 years ago
Chen Weihang f07b25d8e5
fix DataLoader.generrator using error, test=develop (#25355)
5 years ago
Aurelius84 494cb36d09
Modify tmp var name prefix in dygraph (#25280)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
Chen Weihang b23801a262
polish tensor set error messag, test=develop (#25113)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Leo Chen bfa46c38d5
bn supports reverse_space, test=develop (#24988)
5 years ago
Leo Chen 6190023ac9
Refine error message in pybind folder (#24886)
5 years ago
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Pei Yang 14b8540551
add default ctor for AnalysisConfig python api. test=develop (#24924)
5 years ago
Leo Chen 1e818158f5
Feature/add amp_checkout_finite_and_scale op (#24875)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Yanghello aa47356b74
Add crypto python (#24836)
5 years ago
Zhen Wang 23d253e1be
Fix out of range error for outs map. test=develop (#24774)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
Leo Chen 9c9e635c00
support tensor to varbase, test=develop (#24660)
5 years ago
Leo Chen 14dd6388c5
fix bug of varbase.__getitem__, test=develop (#24642)
5 years ago
Leo Chen d980d251f0
specify outs, test=develop (#24537)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago
hong 67f66f0904
Fix get item out of range error (#24339)
5 years ago
Tao Luo 9eedf05d2f
solve mklml memory leak on windows (#24015)
5 years ago
xujiaqi01 1034ca316f
add timeout and http store in communication (#23436)
5 years ago
Zhang Ting ab8f8fa70d
fix example code, test=develop, test=document_fix (#24139)
5 years ago
石晓伟 46f3139c7f
supports loading model from memory, test=develop (#24098)
5 years ago
Leo Chen 5cccc69f1a
update name generator, test=develop (#24048)
5 years ago
wawltor 5c669ad1c2
Add the support dygraph out attribute for the op of mm in api2.0 (#23978)
5 years ago
Kaipeng Deng 80cf3c3c4d
Refine DataLoader support multi-processing (#23107)
5 years ago
guofei 2b896c1f6b
Support LoDTensorArray in fetch (#23645)
5 years ago
hutuxian df64a96686
support set_test_mode and set comlog level(#23905)
5 years ago
xujiaqi01 d98084e7ec
add save with prefix (#23449)
5 years ago
Leo Chen b59426b52a
Enhance error msg of imperative code (#23572)
5 years ago
hutuxian 94a3789fd0
Add AfsAPI in PaddleBox (#23419)
5 years ago
Chen Weihang df538439f5
api build strategy error polish, test=develop (#23546)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Chen Weihang 45880f604b
API(Program) error message enhancement (#23519)
5 years ago
石晓伟 9b82e4c183
change the cmake and apis of lite engine, test=develop (#22934)
5 years ago
guofei ca7bd2beb1
Add a function to update FLAGS (#22851)
5 years ago
ShenLiang 5223e2bbc4
Add a new DataFeed named PaddleBoxDataFeed (#23321)
5 years ago
Chen Weihang 75bd350710
Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171)
5 years ago
gongweibao 24a063f6ac
Add fleet checkpoint on local fs and remote fs(such as hdfs) for EDL (#22586)
5 years ago
Zeng Jinle 0c23e3ff4d
fix Tracer::NoGrad, test=develop (#23443)
5 years ago
Leo Chen a62599a888
[feature] prune program by feed and fetch_list automatically (#22474)
5 years ago
songyouwei 99d30bfc36
speedup slice impl (#23340)
5 years ago
Leo Chen 488b2387e2
Feature/expand params in auto-generated pybind functions for dygraph operators (#23181)
5 years ago
Zeng Jinle babda94c8a
Distinguish public/private global vars (#23269)
5 years ago
Zeng Jinle 8bfd62ffb7
Expose dygraph.grad api (#23124)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
xujiaqi01 68ea1ad55b
add clear one table (#23089)
5 years ago
Zeng Jinle acfc9b8a70
Reader sequential and inference partial feed (#22699)
5 years ago
Zeng Jinle a31d7328b7
Add dygraph double grad implementation (#22939)
5 years ago
songyouwei 2e2da7124b
high-performance dygraph slice (#22879)
5 years ago
zhaoyuchen2018 a020a25797
Fix model int8 quant fail, test=develop (#22891)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
Zeng Jinle d41d802ba3
Add flags to limit gpu memory (#22793)
5 years ago
石晓伟 1861ca88f1
serialize the PaddleTensor, test=develop (#22810)
5 years ago
Zhang Ting 4e8bc02461
add fluid.device_guard to specify the device type for Op (#22254)
5 years ago
Zhen Wang 89cfa49156
Unmerged fetch list (#22635)
5 years ago
Chen Weihang 7d8d573453
Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541)
5 years ago
hutuxian 53a2b68f4e
support customized download command in dataset (#22782)
5 years ago
zhaoyuchen2018 72dde4abde
Refine adam op to improve performance, test=develop (#22346)
5 years ago
Leo Chen b2c1be851a
support cond in clone, test=develop (#22657)
5 years ago
hutuxian 175954d894
PaddleBox Framework Part2 (#22466)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
5 years ago
wangchaochaohu c65c6ae534
add flag to control profile level in python API (#22319)
5 years ago
tangwei12 b0675c8193
fix bug with compiledProgram (#22495)
5 years ago
yaoxuefeng 2235ee1a5e
multi-loss optimization by adding a DownpourOpt worker (#22025)
5 years ago
Wilber de009152a7
Compile without nccl deps. [2/2] (#22484)
5 years ago
Yiqun Liu dcfb603897
Enable the detection of subgraph composed of grad ops (#21223)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
Leo Chen 822e5b36ec
Support int16 for Tensor (#22423)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
xujiaqi01 371f377bea
add GeneralRoleMaker (#22295)
5 years ago
Leo Chen b96c7c9a7a
polish code, test=develop (#22380)
5 years ago
Leo Chen aaa4fe491a
use function instead of lambda, test=develop (#22348)
5 years ago
Yiqun Liu b7cac50b64
Implement a common python unittest to test the ir passes. (#22209)
5 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
5 years ago
Chen Weihang 35efbe6d95
Speeding up dygraph DataLoader with multiprocessing (#21762)
5 years ago
xujiaqi01 e3a457d34b
add collective communication library in fleet (#22211)
5 years ago
Zhen Wang 46189b166d
Add bn and relu fuse pass (#22048)
5 years ago
zhongpu cf475f95df
Remove FC in dygraph, modify FC to Linear in sample code (#22082)
5 years ago
Huihuang Zheng dd4361568e
Add ParallelExecutor Test for Cond API and Fix PE Checks Shape Bug (#22029)
6 years ago
123malin 7fb817d447
add distributed_strategy (#21710)
6 years ago
zhouwei25 2df4be5d35
Fix openblas bug to support compile on windows when WITH_MKL=OFF (#21902)
6 years ago
flame 2bbc0d7d60
python zero copy inference, delete pass (#21897)
6 years ago
Zeng Jinle aa4d6a5d6c
Add some debug flags to auto growth allocator (#21766)
6 years ago
Huihuang Zheng 557bce77da
Fix Backward Bugs in Conditional Block (#21809)
6 years ago
xujiaqi01 0eb4d990c4
fix compiled error when with_pslib=on (#21769)
6 years ago
Leo Chen fbe3ac217e
polish cmake, test=develop (#21681)
6 years ago
mapingshuo 686f0ecb6a
add `no_need_buffer_slots` interface to pybind (#21575)
6 years ago
Zeng Jinle 6828f3684b
fix op_registry, add ignore op_function_impl.h, test=develop (#21654)
6 years ago
Chen Weihang d96acc3363
Refine dygraph DataLoader implementation (#21634)
6 years ago
Leo Chen 48600d7f17
Add op function generator for dygraph (#21569)
6 years ago
Leo Chen 4f81d1bd5f
Refine VarBase init function (#21587)
6 years ago
hutuxian c5aec2fe68
Paddlebox Related to Framework (#21586)
6 years ago
liym27 9da7e6b4d4
add file check_op_desc.py and add interface to get default value. (#21530)
6 years ago
Zeng Jinle 3a7caf481c
add grad maker assert, test=develop (#21564)
6 years ago
Leo Chen cdd46d7e02
Split VarBase from Python Variable for Dygraph (#21359)
6 years ago
Aurelius84 54382ce497
Add get_all_kernels api of registered data_type in pybind.cc (#21499)
6 years ago
zhongpu 6ebf0f47b8
support SelectedRows in dygraph, test=develop (#21078)
6 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
6 years ago
Youwei Song d5ff79e55e
Support numpy bridge (enabled by default in dygraph mode) (#20983)
6 years ago
Zeng Jinle dbba9c7e4b
polish global_value_getter_setter, test=develop (#21332)
6 years ago
zhouwei25 345b67b5e2
remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
6 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
6 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
6 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
6 years ago
Zeng Jinle 67e88424e5
Polish jit trace codes (#21218)
6 years ago
xujiaqi01 9e045170c0
add copy table (#21086)
6 years ago
Zeng Jinle 5fdfbe3413
Add friendly dygraph trace API (#21091)
6 years ago
WangXi de5d3ff688
Fix dgc buffer illegal & reuse velocity (#21012)
6 years ago
Leo Chen 008ed65fd5
Add c++ global current tracer for dygraph (#20882)
6 years ago
Leo Chen 2c3c579b9b
tensor.set() supports array list and remove unused code, test=develop (#20959)
6 years ago
Leo Chen 9974e40787
Update Tensor.set() to support float16 (#19964)
6 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
6 years ago
Yiqun Liu 16e4d02675
Refine the cache of program, context and scope in executor. (#18483)
6 years ago
hong ff0886a92a
save load problem fix and new feature add (#20823)
6 years ago
Huihuang Zheng 95ba4bd2ab
Add shape and type check at read_op (#20754)
6 years ago
Zeng Jinle 8ff6b289bd
[Dygraph to static graph]JIT/Trace (#20775)
6 years ago
WangXi 507afa8a8a
Fix dgc nan by stripping nccl from sparseReduce. (#20630)
6 years ago
633WHU 12e4be0382
Dlpack support (#20039)
6 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
6 years ago
Zeng Jinle 40effc61af
Refine py_reader exit (#20331)
6 years ago
liu zhengxi f855a86c93
update the api en doc of BuildStrategy (#20445)
6 years ago
tangwei12 a010d883b4
doc fix, test=develop, test=document_fix (#20239)
6 years ago
Leo Chen 5a7142ac4e
Update en APIs of LoDTensor (#20115)
6 years ago
Jiabin Yang df102f6428
Refine en doc (#20409)
6 years ago
hong fa43e80e19
New save load interface (#20148)
6 years ago
Leo Chen f4c56e9f51
Polish en doc of LoDTensorArray, test=document_fix (#19972)
6 years ago
Youwei Song 20f68916ed
refine CUDA CPU places en doc (#20243)
6 years ago
tangwei12 c9139c3db3
trainer from dataset fetch targets (#19760)
6 years ago
Chengmo 728ec1b43d
Add GEO-SGD distribute training algorithm (#20018)
6 years ago
qingqing01 1a3eef026c
Enable users to create custom cpp op outside framework. (#19256)
6 years ago
石晓伟 01b9d07963
update operator compatible info, test=develop (#19978)
6 years ago
tangwei12 8f0b3c0516
the integrated communicator (#19849)
6 years ago
Yang Zhang cde73a7bbf
Expose `mutable_data` as python binding (#19932)
6 years ago
Huihuang Zheng 88af4ab650
Add new data layer (#19916)
6 years ago
Wojciech Uss 4286a6270d
Add support for new QAT models (#18970)
6 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
6 years ago
Zeng Jinle 0436efd6a3
Unify DataLoader APIs (#19305)
6 years ago
Jiabin Yang 454254115e
Feature/auto prune in dygraph (#19757)
6 years ago
Pei Yang 9cbc1eff2d
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
6 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
6 years ago
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
6 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
6 years ago
Tao Luo f05d2c519d
paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
Jiabin Yang e9233d1c1e
Refactor dygraph (#19107)
6 years ago
mapingshuo dca9b6c5b0
add feed_var_names to Prune interface (#19589)
6 years ago
hutuxian c756b5d231
Paddlebox Framework (#18982)
6 years ago
Thunderbrook 1fe468d319
support debug each output of each ins (#19004)
6 years ago
Leo Chen 6fb310ae29
Fix bug of getting bool Flags from os.environ (#19349)
6 years ago
liu zhengxi 32598ffd8f
Python infer api update and add unit test (#19353)
6 years ago
Leo Chen a9d5fc5142
Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
Tao Luo 5f5648a8ff
Revert "Python inference API support numpy (#19009)" (#19160)
6 years ago
flame b7e1a1d7e7
Python inference API support numpy (#19009)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
Zeng Jinle 88f111f885
remove unused inplace act codes, test=develop (#19079)
6 years ago
jiaqi a99bc64c63
add fleet util, add some interface in hdfs util (#18752)
6 years ago
Leo Chen 8f53735437
Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
liuwei1031 a43a763b54
fix warpctc.dll not found issue (#18761)
6 years ago
flame 65d987527d
python inference enable_memory_optim(#18817)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
chengduo 20859c08e8
[DyGraph] Make multi-card program faster (#18892)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Thunderbrook 52c1431eee
add clear_model interface in fleetwrapper (#18815)
6 years ago
chengduo 292dfbce63
fix build strategy doc (#18725)
6 years ago
jiaqi d18aabb472
support patch data, add load_one_table, fix bug (#18509)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
guru4elephant d714bf037c
remove async executor and add data_feed.proto to the deps of train demo (#18659)
6 years ago
123malin b414645a65
fix #17430: unexpected training behavior with int64-type attr (#18264)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
xsrobin 47e2ef38e9
add "import paddle.fluid as fluid" to examples lack of it
6 years ago
lujun fd6631ef2f
Fix dygraph show style (#18297)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
HaoRen b7128bac5f
supports collective communicated training (#18175)
6 years ago
Zeng Jinle 5826b72e06
Refine CUDAPlace error message. (#18343)
6 years ago
jiaqi 3f8031e256
dataset (#17973)
6 years ago
chengduo 25f3cd6486
Update execution_strategy option default value (#18183)
6 years ago
Zeng Jinle 25ab23be28
Fix dygraph mem leak (#18082)
6 years ago
Sylwester Fraczek accb132f0f
fix slim int8 mkldnn multithreading issue (#18009)
6 years ago
tensor-tang 5c06bff222
combine noavx and avx package (#17889)
6 years ago
Jiabin Yang 4d5f6937c3
Feature/refine api for dygraph (#17907)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
wopeizl 453a49b1bc
Make ParallelExecutor support Windows GPU (#17787)
6 years ago
翟飞跃 993c703bcc
INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
wopeizl 841553e13f
use pyreader to read data in dygraph mode (#17314)
6 years ago
Zeng Jinle 674e0ce2d6
Use Python C-API to speed up dygraph trace (#17837)
6 years ago
Jiabin Yang 3b70f870e2
Using Smart pointer to optimizer memory usage of dyGraph (#17768)
6 years ago
guru4elephant d52391094d
fix prepare context redundant code problem, optimize executor by cach… (#17743)
6 years ago