* test=develop
Add the unique_with_counts op, which computes the unique elements of the input data and outputs the corresponding indices and counts.
* test=develop
Check the input and dtype in the unique_with_counts op
* test=develop
test=document_preview
update API.spec for `unique_with_counts` and, at the same time, optimize the Python API of the `unique_with_counts` op
* test=develop
test=document_preview
Fix some Python API problems in the `unique_with_counts` op, and change the error message of this op.
* Fix some API problems in the `unique_with_counts` op
test=develop
test=document_preview
* test=develop
test=document_preview
Fix the API sample of the `unique_with_counts` op, and update API.spec
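For reference, the semantics described in the entries above are close to numpy's `np.unique`; the sketch below is illustrative only, and whether the Paddle op sorts the unique values or keeps first-occurrence order (and its exact Python signature) is not restated here.

```python
import numpy as np

# Illustrative only: np.unique mirrors the three described outputs of unique_with_counts.
x = np.array([2, 3, 3, 1, 5, 3], dtype=np.int32)
out, index, count = np.unique(x, return_inverse=True, return_counts=True)
# out   -> [1 2 3 5]       unique values of the input
# index -> [1 2 2 0 3 2]   position of each input element within `out`
# count -> [1 1 3 1]       number of occurrences of each unique value
```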
(1) set fleet_send_batch_num to a default value based on the trainer count; the previous value of 80000 was fixed, and if the trainer count is much smaller or larger than 100, global shuffle may hit a timeout error.
(2) fix a load_one_table bug, add a barrier
* support center loss
* change the tensor copy API to the high-level TensorCopy API
* test=develop rewrite the center_loss CUDA kernel to make it faster,
add documentation for the center loss API, and update the test function
* test=document_preview test=develop
update document of center loss
* test=document_preview test=develop
modify API.spec, modify test code, and remove an unused const_cast
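As background for the center loss API documented above, here is a conceptual numpy sketch of center loss; it is not the Paddle kernel or its exact signature, and the update rule uses a simplified form without per-class count normalization.

```python
import numpy as np

def center_loss_step(features, labels, centers, alpha=0.5):
    # Penalize each sample's squared distance to its class center and move the
    # visited centers toward the batch features at rate alpha (simplified update).
    diff = features - centers[labels]                  # [batch, dim]
    loss = 0.5 * np.mean(np.sum(diff ** 2, axis=1))
    for c in np.unique(labels):
        centers[c] += alpha * diff[labels == c].mean(axis=0)
    return loss

features = np.random.rand(8, 4).astype('float32')
labels = np.random.randint(0, 3, size=8)
centers = np.zeros((3, 4), dtype='float32')
print(center_loss_step(features, labels, centers))
```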
* extend the matmul op to support multi-head multiplication
With multi-head support, the multiplication of two large matrices is
split into multiplications of several (head_number) smaller matrices. For example, if
Mat A is [3, 24] and Mat B is [24, 4], multiplying A and B with head_number
set to 4 splits Mat A into 4 matrices of [3, 6] and Mat B into 4 matrices of
[6, 4]. The final result is 4 matrices of [3, 4], i.e. [3, 16]; a numpy sketch of these semantics follows below.
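A small numpy sketch of the split described above; the concatenation of the per-head results along the last axis is inferred from the stated [3, 16] output shape.

```python
import numpy as np

head_number = 4
A = np.random.rand(3, 24)                    # Mat A: [3, 24]
B = np.random.rand(24, 4)                    # Mat B: [24, 4]

A_heads = np.split(A, head_number, axis=1)   # 4 matrices of [3, 6]
B_heads = np.split(B, head_number, axis=0)   # 4 matrices of [6, 4]

# Multiply head by head, then concatenate the 4 results of [3, 4] into [3, 16].
out = np.concatenate([a @ b for a, b in zip(A_heads, B_heads)], axis=1)
assert out.shape == (3, 16)
```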
The change includes 2 things:
1. Saving the delta model and shrinking the table were controlled by the same parameter before; now delete_after_unseen_days is added to control table shrinking.
2. Values in the sparse table had no slot before; now a slot is added to the sparse table, and DownpourCtrAccessor is added to support the new meta.
test=develop
(1) support patch data (merge slots of instances with the same line id; modify dense layers whose
size changes)
(2) add the fleet load_one_table interface, supporting loading from a Paddle model and from a PSLib model (a hedged sketch follows after this list)
(3) fix a push-sparse bug that made push sparse cost more time (about 10% in my test case)
(4) when some slots are not in one of your networks (join/update, etc.), data feed, collect label info, and push/pull sparse will skip these slots instead of throwing an error.
(5) add more debug info in TrainFilesWithProfiler
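A hedged sketch of the load_one_table call mentioned in (2): the import path and the (table_id, model_path) signature are assumptions based on the PSLib fleet API, the path is a placeholder, and an already-initialized fleet is assumed.

```python
# Sketch only: import path and signature are assumed; the model directory is a placeholder.
from paddle.fluid.incubate.fleet.parameter_server.pslib import fleet

# Assumes fleet.init(...) has already been called with a PSLib role maker.
# Loads a single table, either from a Paddle model or from a PSLib model dump.
fleet.load_one_table(0, "hdfs:/path/to/your/model")  # hypothetical table id and path
```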
The change includes 3 things:
1. Set CPU_NUM to 1 in the tests, because ParallelExecutor otherwise prints a warning that CPU_NUM is not set and defaults it to 1.
2. The old tests compared two RNNs, a hand-written simple RNN and the same RNN built with Paddle, but initialized the RNN weights with numpy's and Paddle's random generators separately. Fixed by setting explicit weight and bias values.
3. Also set the numpy random seed in the tests. Now the diff between the two RNNs can be smaller (rtol tightened from 0.1 and 0.2 to 0.01); a small sketch of this setup follows after this list.
test=develop
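A toy illustration of the approach in items 2 and 3 above (fixed numpy seed, explicit weights, tightened tolerance); the tanh step is only a stand-in for the RNN cell, not the actual PaddingRNN computation.

```python
import numpy as np

np.random.seed(2019)                                   # fixed numpy seed for the test
hidden = 16
w = np.full((hidden, hidden), 0.1, dtype='float32')    # explicit weights instead of random init
x = np.random.rand(4, hidden).astype('float32')

ref = np.tanh(x @ w)       # stand-in for the hand-written RNN step
out = np.tanh(x.dot(w))    # stand-in for the framework-built RNN step
np.testing.assert_allclose(out, ref, rtol=0.01)        # tightened from 0.1/0.2 to 0.01
```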
Test PaddingRNN on V100 GPU device.
Test configuration: large model, padding mode (the mode that uses RecurrentOp), one GPU.
GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR)
Speed (steps/s): 10.28 (this PR) vs 9.89 (without this PR)
* feature/auto_growth_allocator, test=develop
* add unittest of AlignedAllocator, test=develop
* try to turn on auto_growth to test on CI, test=develop
* fix segmentation fault in mixed_vector.h, test=develop
* add unittests, test=develop
1. Since the allreduce op has 4 reduce types, we split these four reduce types into four ops
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and removed the device-specific DeviceContext template parameter since the target DeviceContext is already known
3. We removed the newly added Collective op role to reduce the complexity of program and graph analysis
* fix the redundant prepare-context code problem; optimize the executor by caching create_variables
test=develop
* supports collective training in executor
* make fetch_list runnable with Variables, add more unit tests for use_program_cache (a minimal sketch follows this entry)
test=develop
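A minimal sketch of the two features in the entry above, assuming the fluid 1.x Python API (fluid.layers.data/fc and Executor.run):

```python
import numpy as np
import paddle.fluid as fluid

x = fluid.layers.data(name='x', shape=[1], dtype='float32')
y = fluid.layers.fc(input=x, size=1)

exe = fluid.Executor(fluid.CPUPlace())
exe.run(fluid.default_startup_program())

# fetch_list accepts Variable objects directly, and use_program_cache reuses the
# prepared context when the same program is run repeatedly.
out, = exe.run(fluid.default_main_program(),
               feed={'x': np.ones((2, 1), dtype='float32')},
               fetch_list=[y],
               use_program_cache=True)
```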
* fix comment
test=develop
* use unique name for nccl_id
* supports output to stream in program_to_code
* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
* set op role in collective training
* add collective op role
* remove orig file
* add build optimizer by strategy
* add collective strategy
* refine collective strategy
* add multi-process role maker
* refine the strategy-building factory so that we can easily plug in more strategies
* scale loss grad in collective sgd transpiler
* add support for distributed fc
* code format
* revert some features for dist fc
* add support for distributed fc training
* test=develop
add collective op unittest standard
* test=develop
remove the test_collective directory
* remove slicegather test
* code format for reducescatter
* update attr of shard_index_op
* Modify macro nccl_helper
* remove test without distribute
* macro collective_helper
* macro update
* test=develop
update to support Python 3.5
* test=develop change GPU memory usage to 0.1 in tests
* test=develop
update ut equal func
* test=develop
set flags to 1.5
* test=develop fix pickle dump on Python 3.5
* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
remove the shard_index op and its test
change reading input from a file to reading from memory
remove origin_program in framework and add I/O in c_sync_calc_stream
* test=develop update unittest sync operator I/O
1. fix the bug that out_put_var in SaveSelectedRows could be an empty string
2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference
3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.
(1) use Channel instead of vector/BlockingQueue in Dataset, to stay consistent with the existing implementation and make the code more readable and flexible (Dataset can have a single output channel or multiple output channels). One previous out-of-memory problem was caused by not releasing memory after training.
(2) add Record because MultiSlotType costs too much memory (80B); this fixes the out-of-memory problem
(3) add Channel and Archive in paddle/fluid/framework
(4) change Dataset from shared_ptr to unique_ptr in pybind
(5) move creating/destroying readers from the trainer to the dataset
(6) move shuffle from DataFeed to Dataset; Dataset holds the memory, DataFeed only loads data and feeds it to the network
(7) fix a thread-number bug in Dataset when the filelist size is smaller than the thread number
(8) support set_queue_num in InMemoryDataset (a hedged sketch follows after this list)
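A hedged sketch of the InMemoryDataset flow touched by (5)-(8); the method names are assumed from the fluid Dataset Python API, set_queue_num in particular is the interface added here, and the slot names and file path are placeholders.

```python
import paddle.fluid as fluid

# Sketch only: method names assumed from the fluid Dataset API; names/paths are placeholders.
slots = [fluid.layers.data(name=n, shape=[1], dtype='int64', lod_level=1)
         for n in ('slot1', 'slot2')]

dataset = fluid.DatasetFactory().create_dataset("InMemoryDataset")
dataset.set_use_var(slots)
dataset.set_thread(4)
dataset.set_queue_num(4)                       # new per-dataset channel/queue setting
dataset.set_filelist(["train_data/part-000"])  # placeholder file
dataset.load_into_memory()                     # the dataset, not the datafeed, holds memory
dataset.global_shuffle()                       # shuffle now lives in Dataset
```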
* test=develop, add add_multi_gpu_install_check
* test=develop, refine warning doc
* test=develop, refine warning doc
* test=develop, refine warning doc
* test=develop, support multi cpu
* test=develop, find right num of cuda device
* test=develop, find right num of cuda device
* test=develop, fix multigpu processing and fix type bug in dygraph
* test=develop, fix multigpu processing and fix type bug in dygraph
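A hedged sketch of the resulting check, assuming it is exposed as fluid.install_check.run_check() (the entry-point name is an assumption here):

```python
import paddle.fluid as fluid

# Assumed entry point for the install check described above; it should detect the
# available CUDA devices (falling back to CPU) and run a small program on them.
fluid.install_check.run_check()
```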