Commit Graph

570 Commits (20859c08e8c5ec83709e888bc8d87d965725fec0)

Author SHA1 Message Date
chengduo 20859c08e8
[DyGraph] Make multi-card program faster (#18892)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Thunderbrook 52c1431eee
add clear_model interface in fleetwrapper (#18815)
6 years ago
chengduo 292dfbce63
fix build strategy doc (#18725)
6 years ago
jiaqi d18aabb472
support patch data, add load_one_table, fix bug (#18509)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
guru4elephant d714bf037c
remove async executor and add data_feed.proto to the deps of train demo (#18659)
6 years ago
123malin b414645a65
fix #17430: int64类型的attr训练非预期 (#18264)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
xsrobin 47e2ef38e9
add "import paddle.fluid as fluid" to examples lack of it
6 years ago
lujun fd6631ef2f
Fix dygraph show style (#18297)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Zeng Jinle 5826b72e06
Refine CUDAPlace error message. (#18343)
6 years ago
jiaqi 3f8031e256
dataset (#17973)
6 years ago
chengduo 25f3cd6486
Update execution_strategy option default value (#18183)
6 years ago
Zeng Jinle 25ab23be28
Fix dygraph mem leak (#18082)
6 years ago
Sylwester Fraczek accb132f0f fix slim int8 mkldnn multithreading issue (#18009)
6 years ago
tensor-tang 5c06bff222
combine noavx and avx package (#17889)
6 years ago
Jiabin Yang 4d5f6937c3
Feature/refine api for dygraph (#17907)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
wopeizl 453a49b1bc
Make ParallelExecutor support Windows GPU (#17787)
6 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
wopeizl 841553e13f
use pyreader to read data in dygraph mode (#17314)
6 years ago
Zeng Jinle 674e0ce2d6
Use Python C-API to speed up dygraph trace (#17837)
6 years ago
Jiabin Yang 3b70f870e2
Using Smart pointer to optimizer memory usage of dyGraph (#17768)
6 years ago
guru4elephant d52391094d
fix prepare context redundant code problem, optimize executor by cach… (#17743)
6 years ago
Zeng Jinle 432ac70124
clean code of py_layer in dygraph mode,test=develop (#17661)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
wopeizl 6724a652f3
add __str__ method for tensor and lodtensor to support print test=dev… (#17588)
6 years ago
guru4elephant 326bf8291a
add Run Prepared Ctx (#17616)
6 years ago
flame 2280f185d7
BuildStrategy api comment (#17348)
6 years ago
guru4elephant 7f8bc49d00
polish_executor_and_add_ctx_cache (#17536)
6 years ago
Zeng Jinle c6189637cd
Fix allocator bug (#16712)
6 years ago
Qiao Longfei 92e7d5d7cc
fix distribute doc test=develop (#17318)
6 years ago
Qiao Longfei 58f7695ab2
Async exe support communicator (#17386)
6 years ago
Tao Luo 32da5e9c3d
remove unused expected_kernel_cache_pass (#17486)
7 years ago
Yan Xu 0217555530 polish parallel dygraph code (#17164)
7 years ago
Jiabin Yang d7df4e5e5b
Fix/Fix memory leak in dygraph (#17394)
7 years ago
Zhen Wang 4a1b7fec96
Add setting Scope function for the graph class (#17417)
7 years ago
jiaqi 66d51206b1
add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118)
7 years ago
Tao Luo 68ec0a6f74
make parallel_executor support FLAGS_use_mkldnn (#17341)
7 years ago
Jiabin Yang 4624d7c642
test=develop, add gradient sort backward strategy (#17125)
7 years ago
chengduo bc833945a4
Add DropLocalExeScopes in ParallelExecutor (#17297)
7 years ago
qingqing01 e32c9888f5
Double backward of conv2d. (#17211)
7 years ago