Paddle

Commit Graph

Author	SHA1	Message	Date
Chen Weihang	c78a4781bf	Fix train error when test_program.clone is executed after optimizer.minimize (#19397 ) * add prune when test_program.clone is executed after optimizer.minimize * add unittest, test=develop * add resnet and transformer test case, test=develop * add regularization for optimizer & program compare function, test=develop * add lstm unittest, test=develop * polish code based on review comment, test=develop * adapt to interface change in framework._prune, test=develop * update API.spec, test=develop	6 years ago
Jiabin Yang	e9233d1c1e	Refactor dygraph (#19107 ) * refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * test=develop, refoctor name to make it easier to understand * test=develop, refoctor name to make it easier to understand * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ * test=develop, fix ut failed on parallel se-resnext * test=develop, change one more PADDLE_ENFORCE	6 years ago
mapingshuo	dca9b6c5b0	add feed_var_names to Prune interface (#19589 ) * Fix bug: add feed_vars to the prune function	6 years ago
zhongpu	4d26274d25	add detach API for Variable in dygraph mode, test=develop (#19477 ) * add to and detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add exception check, test=develop	6 years ago
Zhen Wang	0fe72469ea	Add the max-pool2d quantization support and the partial quantization support. (#19310 ) * add pool2d quantization support, only for max-pooling. * add the partial quantization support.	6 years ago
Zeng Jinle	561232c25a	remove is_mem_optimized in Program, test=develop (#19307 )	6 years ago
zhang wenhui	539c870753	add fl_listen_and_serv &fl_transpiler,test=develop (#19091 ) add fl_listen_and_serv op for Federated_learning and fl_distribute_transpiler add this op to pserver program . This op just listen the endpoint and sum&scale.	6 years ago
gongweibao	29d8781240	Polish fleet API to support cuda collective mode and nccl2 mode. (#18966 ) Polish fleet API to support cuda collective mode and nccl2 mode	6 years ago
Jiabin Yang	af63b1184c	test=develop, fix memory leak in dygraph (#18998 )	6 years ago
chengduo	582cc29799	add warning info for CPU_NUM (#18840 ) test=develop	6 years ago
Yi Liu	157211c4e1	supports distributed classification (#18690 ) * supports distributed classification training * update API.spec * fix evenly division in python3 * change "index_range" to "index_num" in shard_index operator test=document_preview test=develop	6 years ago
gongweibao	c0a82748cf	Polish backwards optimizer dependency codes and use more default values. (#18255 )	6 years ago
xsrobin	47e2ef38e9	add "import paddle.fluid as fluid" to examples lack of it	6 years ago
Jiabin Yang	43f64a177e	Fix/program doc (#17908 ) * test=develop, add some comments for Program.clone * test=develop, add API.spec * test=develop, refine comments * refine Program doc and clone doc * test=develop, refine doc	6 years ago
chengduo	871cc15e6a	Add is_compiled_with_cuda (#18356 ) * add cuda_is_available test=develop * Fix api.spec test=develop * fix api doc test=develop	6 years ago
HaoRen	b7128bac5f	supports collective communicated training (#18175 ) * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * fix comment test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * fix comment test=develop * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * test=develop add collective op unittest standard * test=develop remove the test_collective directory * test=develop remove the test_collective directory * remove slicegather test * code format for reducescatter * update attr of shard_index_op * Modify macro nccl_helper * remove test without distribute * macro collective_helper * marcro update * test=develop update support python3.5 * test=develop change gpu memory use to 0.1 when test * test=develop update ut equal func * test=develop set flags to 1.5 * test=develop fix pickle dumple py35 * test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream * test=develop update unittest sync operator I/O	6 years ago
liuwei1031	5d54ed4a84	improve the doc of DataFeeder and default_main_program (#18241 ) * improve the doc of DataFeeder and default_main_program * update API.spec, test=develop	6 years ago
Hongyu Liu	cefd0fb598	Fix slice op shape=-1 bug (#18107 ) * fix slice op bug; test=develop * fix variabel test bug; test=develop * remove slice while true; test=develop	6 years ago
qingqing01	80d2e66f9e	Update backward appending stragety to support double backward and fix some bug. (#18104 ) * Update backward.py: - If there is no input grad var in all outputs of previous ops, do not append this op into graph. - Only apply this stragety when double backward. * Update some double backward op. * Update sum_op to judge whether a tensor is empty by numel or IsInitialized().	6 years ago
chengduo	24e988a471	Fix bug of scope_buffered_ssa_graph_executor (#18100 ) * fix code bug test=develop	6 years ago
Hongyu Liu	d9270af931	Fix getitems slice bug (#18053 ) * fix get items slice bug; test=develop * fix unique_name bug; test=develop	6 years ago
chengduo	b5a1c1463d	Update CPU_NUM config (#18059 ) * update CPU_NUM config test=develop	6 years ago
tensor-tang	5c06bff222	combine noavx and avx package (#17889 ) * support avx and noavx core * add catch and give some log test=develop * fix build test=develop * add missing package test=develop * fix pybind name test=develop * fix import error test=develop * conbime noavx core test=develop * add requirements test=develop * fix unkown message test=develop * fix api spec test=develop * refine and clean test=develop * update * pass dist ut * follow comments test=develop * refine scripts test=develop	6 years ago
hutuxian	969e6378b9	Pipeline Concurrency (#17402 ) Add Pipeline Concurrency Train Mode: - Cpp: pipeline_trainer & section_worker - Python: PipelineOptimizer - Add a new data_feed type: PrivateInstantDataFeed - Add a test demo of pipeline trainer and the test model is gnn - Do not support win32 now	6 years ago
Hongyu Liu	b888a4c57c	fix regularizer lod bug (#17848 ) * fix regularizer lod bug; test=develop * fix exception bug and one_hot expand; test=develop	6 years ago
xiaoting	545afb2d74	Add trainable_statist attr for bn in dygraph (#17881 ) * add import, test=develop * fix fill_constant * fix deconv * add trainable_statist for bn in dygraph	6 years ago
Hongyu Liu	82358bfdc1	ont hot support tensor depth (#16972 ) * support some input tensor remain on cpu; test=develop * fix input = none; test=develop * fix unfound bug; test=develop * fix proto None case; test=develop * fix bug; test=develop * fix proto null bug; test=develop * remove conv check; test=develop * fix test bug; test=develop * move fill constant; test=develop * no change in proto; test=develop * fix bug; test=develop * change attr detph name; test=develop * remove remain cpu; test=develop * fix bug; test=develop * merge develop; test=develop * fix one_hot bug; test=develop * fix bug; test=develop * fix bug; test=develop * fix bug; test=develop * fix python api bug; test=develop	6 years ago
Huihuang Zheng	931698a54a	Modify doc of program_guard, py_reader, data, and clone (#17727 ) Note the append_batch_size variable is doing prepend. We should change the name, but due to backward compatibility, I suggest to change at v2.0. Not now. test=develop	6 years ago
gongweibao	65bbf950ee	Add multi-ncclcomm and 2D ncclallreduce support. (#17263 )	6 years ago
wopeizl	6724a652f3	add __str__ method for tensor and lodtensor to support print test=dev… (#17588 ) * add __str__ method for tensor and lodtensor to support print test=develop	6 years ago
Hongyu Liu	306eadcd39	fix eval mode bug; test=develop (#17499 )	6 years ago
liuwei1031	6a53fa95e7	improve the API Sample of DataFeeder, memory_optimize and release_memory (#17374 ) * improve the API Sample of DataFeeder, memory_optimize and release_memory, test=develop * update API.spec, test=develop, test=document_preview * tweak the code format of feed API, test=develop * update API.spec, test=develop * improve doc for DataFeeder and default_main_program, test=develop	6 years ago
Zeng Jinle	eab34b2df6	fix_dygraph_mem_leak, test=develop (#17396 )	6 years ago
Jiabin Yang	4624d7c642	test=develop, add gradient sort backward strategy (#17125 ) * test=develop, add gradient sort backward strategy * test=develop, fix test by add FLAGS_cudnn_deterministic on new tests	6 years ago
Jiabin Yang	31536016ea	test=develop, test=document_preview, fix 13 api doc and code (#17293 ) * test=develop, test=document_preview, fix all 13 api doc and code * test=develop, fix rst * test=develop, refresh API.spec	6 years ago
lujun	e388a1fb66	Repair api example (#17221 ) Fix the following API examples: paddle.fluid.scope_guard paddle.fluid.backward.append_backward paddle.fluid.cpu_places paddle.fluid.cuda_pinned_places paddle.fluid.cuda_places paddle.fluid.in_dygraph_mode paddle.fluid.CUDAPlace paddle.fluid.CPUPlace paddle.fluid.CUDAPinnedPlace	6 years ago
Huihuang Zheng	648320bb6c	Fix some data and reader related API code (#17202 ) * Fix data and reader related api doc * Fix data and reader related api doc Review and fix the example code in some reader related API doc. These APIs are: Fix existing API example codes: paddle.fluid.io.PyReader paddle.fluid.layers.batch paddle.fluid.layers.data paddle.fluid.layers.Preprocessor paddle.fluid.layers.py_reader paddle.fluid.program_guard Add new example codes: paddle.fluid.io.PyReader.decorate_batch_generator paddle.fluid.io.PyReader.decorate_sample_generator paddle.fluid.io.PyReader.decorate_sample_list_generator paddle.fluid.io.PyReader.reset paddle.fluid.io.PyReader.start test=develop * Add changes to API.spec after changing doc. test=develop * Add blanks after python example code test=develop * Add blank line at py_reader example code test=develop * Merge API.spec test=develop * Modify reader.py based on reviewer's comment test=develop * Modify API.spec after changing doc test=develop * Change reader.py based on reviewer's comment * Modify example code of decorate_sample_generator test=develop * Fix example code of PyReader based on reviewer test=develop	6 years ago
Tao Luo	9ec4615deb	fix profiler and name_scope API examples (#17212 ) * fix profiler and name_scope API examples test=develop * update API.spec test=develop	6 years ago
minqiyang	9a3848a2ea	Fix attrs test=develop	6 years ago
minqiyang	20e304f2ae	Tracer does not hold op any more test=develop	6 years ago
lujun	01f4f2d7e4	merge confict, test=develop	6 years ago
lujun	e11bf2a49e	merge branch, test=develop	6 years ago
lujun	60e3e35575	merge branch, test=develop	6 years ago
Qiyang Min	12e36d38a5	Imperative deep-first backward process (#16605 ) * Fix bug of gradient interface * shrink transformer * Right transformer * Change from width-first backward to deep-first backward process test=develop * Reverse iterator op's input test=develop * Polish code * Change the iteration direction in ingrads' map slots test=develop * Polish code test=develop	6 years ago
lujun	2b32302bdf	move dygraph.nn,dygraph.layer to fluid, test=develop	6 years ago
lujun	717256755a	move dygraph.nn,dygraph.layer to fluid, test=develop	6 years ago
guru4elephant	76b49f02ee	Merge pull request #16539 from guru4elephant/train_with_pipe_reader_merge_develop Train with pipe reader merge develop	6 years ago
peizhilin	9c6eb1aa46	remove the useless check test=develop	6 years ago
wopeizl	e014950e87	add slice support for dim < 0 (#16494 ) * add slice support for dim < 0 test=develop	6 years ago
dongdaxiang	b7a202aa38	add distributed optimizer factory	6 years ago

1 2 3 4 5 ...

328 Commits (c6756ed225e304aff36bdabe7623bbaf2037306d)