1. Since the allreduce op has four reduce types, we split it into four separate ops, one per reduce type (see the sketch after this list).
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and removed the device-specific DeviceContext template parameter, since the target DeviceContext is already known.
3. We removed the newly added Collective op role to reduce the complexity of program and graph analysis.
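A minimal sketch of the four reduce semantics the split ops implement, simulated over per-rank NumPy arrays (the four NCCL reduce types; op naming in this PR is not shown here):

```python
# Simulate allreduce across two "ranks" for each reduce type;
# every rank would receive the same reduced tensor.
import numpy as np

rank_tensors = [np.array([1., 5., -2.]), np.array([3., 2., -4.])]

reduced = {
    'sum':  np.sum(rank_tensors, axis=0),   # [ 4.,  7., -6.]
    'prod': np.prod(rank_tensors, axis=0),  # [ 3., 10.,  8.]
    'max':  np.max(rank_tensors, axis=0),   # [ 3.,  5., -2.]
    'min':  np.min(rank_tensors, axis=0),   # [ 1.,  2., -4.]
}
```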
* fix redundant code problem in prepare context, optimize executor by caching create_variables
test=develop
* supports collective training in executor
* make fetch_list runnable with variables, add more unittests for use_program_cache
test=develop
* fix comment
test=develop
* use unique name for nccl_id
* supports output to stream in program_to_code
* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
* set op role in collective training
* add collective op role
* remove orig file
* add build optimizer by strategy
* add collective strategy
* refine collective strategy
* add multi-process role maker
* refine strategy building factory so that we can easily plug in more strategies
* scale loss grad in collective sgd transpiler
* add support for distributed fc
* code format
* revert some features for dist fc
* add support for distributed fc training
* test=develop
add collective op unittest standard
* test=develop
remove the test_collective directory
* remove slicegather test
* code format for reducescatter
* update attr of shard_index_op
* Modify macro nccl_helper
* remove test without distribute
* macro collective_helper
* macro update
* test=develop
update to support Python 3.5
* test=develop change GPU memory usage to 0.1 in tests
* test=develop
update ut equal func
* test=develop
set flags to 1.5
* test=develop fix pickle dump on py35
* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify input reading: read from memory instead of from file
remove origin_program in framework and add i/o in c_sync_calc_stream
* test=develop update unittest sync operator I/O
* Update backward.py:
- If none of this op's input grad vars appears among the outputs of previous ops, do not append this op to the graph.
- Only apply this strategy for double backward (a sketch follows below).
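A schematic sketch of that pruning rule (hypothetical helper and attribute names, not the actual backward.py code):

```python
# `op.input_names` / `op.output_names` are stand-ins for however the
# real graph exposes an op's input and output var names.
def prune_double_backward_ops(candidate_ops, produced_grad_vars):
    kept = []
    for op in candidate_ops:
        # Skip an op if none of its inputs is a grad var produced
        # by the ops appended so far.
        if not any(v in produced_grad_vars for v in op.input_names):
            continue
        kept.append(op)
        produced_grad_vars.update(op.output_names)
    return kept
```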
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
* test=develop add target assign for retinanet
* test=develop
run ci
* test=develop
add test_layers
* test=develop
add API.spec
* test=develop
alter round 1
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter test_rpn_target_assign_op.py
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter API.spec
* test=develop
alter paddle/fluid/operators/detection/rpn_target_assign_op.cc
* test=develop
alter rpn_target_assign_op.cc
* test=develop
alter python/paddle/fluid/layers/detection.py
* test=develop
alter paddle/fluid/API.spec
* Remove layers.detection_map API
* Users can use fluid.metrics.DetectionMAP to calculate mAP for both the current batch and cumulative batches, whereas layers.detection_map can only calculate current-batch mAP.
* test=develop
The scatter op has a calculation bug when the indices contain duplicates: it uses overwrite mode, so later updates to the same index overwrite earlier ones. Fix this bug by using accumulate mode for duplicated indices. The gather op has the same bug when computing its gradient. We also use the OpenBLAS and Eigen libraries to optimize the time cost of accumulate mode.
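A minimal NumPy sketch (not the Paddle kernel) of the two scatter semantics on duplicated indices:

```python
import numpy as np

x = np.zeros(3)
index = np.array([1, 1])        # index 1 appears twice
updates = np.array([2.0, 3.0])

# Overwrite mode: the last update wins, so x[1] ends up 3.0.
x_overwrite = x.copy()
x_overwrite[index] = updates

# Accumulate mode: duplicated updates are summed, so x[1] ends up 5.0.
# This is the semantics the gather op's gradient requires.
x_accumulate = x.copy()
np.add.at(x_accumulate, index, updates)
```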
* test=develop
Fix some code format problems, and at the same time add test cases for the gather and scatter ops
* Cherry-pick fix for random Python 3 CI failure.
In some tests, "print('xxx').format('xxx')" was used. That syntax
is only valid in Python 2, not Python 3. However, since those
lines are related to data download, the tests pass if the CI
machines already have the data. That causes random failures.
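A minimal repro of the broken pattern (illustrative string, not the actual test code):

```python
# In Python 2, `print` is a statement, so the broken line parses as
#   print (('downloading {}').format('data.tar.gz'))
# and works. In Python 3, print('downloading {}') is a call that
# returns None, so .format(...) raises
#   AttributeError: 'NoneType' object has no attribute 'format'
# print('downloading {}').format('data.tar.gz')  # fails on Python 3

# The form that works on both:
print('downloading {}'.format('data.tar.gz'))
```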
* Cherry-pick: disable CUDNN case of test_warpctc_op
Also temporarily disable a unit test. The test will be fixed with high priority.
* add deformable psroi pooling
* test=develop
modify format
* fix bug
* test=develop run ci
* test=develop
add API.spec
* add test_layers.py
* run ci again
* test=develop
run ci again
* add space between two lines
* test=develop
add space between two lines
* test=develop
add space between lines
* test=develop
modify comment in nn.py
* update API.spec
* rerun ci
* test=develop
rerun ci
* change input shape
* run ci
* test=develop
run ci
* modify format of nn.py
* test=develop
update API.spec
* test=develop
fix API doc
* modify API comment
* test=develop
update API.spec
* test=develop
modify comment
* test=develop
modify comment
* test=develop
update API.spec
* test=develop
modify comment
* test=develop
add inference in nn.py
* test=develop
update API.spec
* test=develop
resolve conflict
* add unfold op
test=develop
* fix divide bug in python3 when calculating output width and height
test=develop
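A plausible illustration of the pitfall (the standard sliding-window output-size formula; not necessarily the exact patched line):

```python
# Python 2's `/` truncates ints; Python 3's `/` returns a float,
# which breaks code that uses the result as a tensor dimension.
in_size, padding, dilation, kernel, stride = 28, 1, 1, 3, 2

out_bad = (in_size + 2 * padding - dilation * (kernel - 1) - 1) / stride + 1
out_ok = (in_size + 2 * padding - dilation * (kernel - 1) - 1) // stride + 1

print(out_bad)  # 14.5 on Python 3
print(out_ok)   # 14 on both, usable as a shape
```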
* add name=None in python api, move redundant code into inline function
* try to trigger ci for this code
test=develop
Add Pipeline Concurrency Train Mode:
- Cpp: pipeline_trainer & section_worker
- Python: PipelineOptimizer
- Add a new data_feed type: PrivateInstantDataFeed
- Add a test demo of pipeline trainer and the test model is gnn
- Win32 is not supported for now
* Enable seq_pool op to accept len 0 input
test=develop
* Update sequence_pool's api
test=develop
* Add more unittest cases for seq_pool op
test=develop
* Remove legacy comments
test=develop
* Don't use template in op maker
test=develop
* test=develop, refine api
* test=develop, fix bug when an error occurred on save_persistable with no optimizer
* test=develop, refine warning
* test=develop, refine example code and comments
* save optimizer related vars in dygraph
* test=develop, add optimizer save and load
* test=develop, add optimizer save and load
* test=develop, merge code and add multi-optimizer save and load
* test=develop, fix test_imperative_checkpoint
* test=develop, fix include error
* test=develop, fix include error
* test=develop, renew api spec
* test=develop, refine code
* test=develop, set default value for checkpoint
* test=develop, fix ci error
* test=develop, change API.spec and make api more readable
* test=develop, refine version and time stamp
* test=develop, add example code and refine code
* test=develop, refine doc
* test=develop, change version
* for debug
* test=develop, memory optimize for dygraph using shared_ptr
* test=develop, fix error shown by travis ci
* test=develop, fix bug for recurrent usage of varbase
* test=develop, init varbase when it needs to be added
* test=develop, fix problem of recurrent gradient
* test=develop, add gradient test for recurrent varbase usage
* fix redundant code problem in prepare context, optimize executor by caching create_variables
test=develop
* cache sub_scope, program, var when use_program_cache=True is set
* make fetch_list runnable with variables, add more unittests for use_program_cache
* add gradient clip in minimize; test=develop
* fix bug; test=develop
* fix format; test=develop
* move new grad clip to dygraph/grad_clip.py; test=develop
* fix lr decay and grad clip test; test=develop
* separate dygraph grad clip; test=develop
* fix grad clip test; test=develop
* fix api spec bug; test=develop
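For reference, a generic sketch of clipping by global norm (the standard formula, not code lifted from this PR):

```python
import numpy as np

def clip_by_global_norm(grads, clip_norm):
    # Scale every grad by clip_norm / max(global_norm, clip_norm),
    # so the joint norm never exceeds clip_norm.
    global_norm = np.sqrt(sum(np.sum(g * g) for g in grads))
    scale = clip_norm / max(global_norm, clip_norm)
    return [g * scale for g in grads]
```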
* add blank line to fix format problem; test=develop,test=document_preview
* fuse mul and elementwise add to fc
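The algebraic identity the fuse relies on, in NumPy terms (illustrative, not the pass code): a mul op followed by an elementwise_add computes the same affine map as a single fc op.

```python
import numpy as np

x = np.random.rand(4, 8).astype('float32')    # input
w = np.random.rand(8, 16).astype('float32')   # mul weights
b = np.random.rand(16).astype('float32')      # elementwise_add bias

mul_out = np.dot(x, w)       # mul op
add_out = mul_out + b        # elementwise_add op
fc_out = np.dot(x, w) + b    # the single fused fc op

assert np.allclose(add_out, fc_out)
```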
* Reimplement the FC forward operator
* Fix FC MKLDNN integration by transposing weights
* Add FC MKLDNN Pass
test=develop
* FC MKLDNN Pass: change memcpy to std::copy
* Fix MKLDNN FC handling of mismatched input and weights dims
* Lower tolerance for MKL-DNN in resnet50 test
test=develop
* Adjust FC to support MKLDNN Op placement
test=develop
* Adjust Placement Op to set use_mkldnn attribute for graph
test=develop
* MKLDNN FC: fix weights format so that gemm version is called
test=develop
* FC MKLDNN: Remove tolerance decrease from tester_helper
* FC MKL-DNN: Refactor the code, change input reorder to weight reorder
* MKL-DNN FC: Introduce operator caching
test=develop
* FC MKL-DNN: Fix the tensor type in ExpectedKernelType
test=develop
* FC MKL-DNN: fix style changes
test=develop
* FC MKL-DNN: fallback to native on non-supported dim sizes
test=develop
* FC MKLDNN: fix CMake paths
test=develop
* FC MKLDNN: Refine placement pass graph mkldnn attribute
test=develop
* Fix Transpiler error for fuse_conv_eltwise
test=develop
* Fix missing STL includes in files
test=develop
* FC MKL-DNN: Enable new output size computation
Also, refine pass to comply with newest interface.
test=develop
* FC MKL-DNN: enable only when fc_mkldnn_pass is enabled
* FC MKL-DNN: Allow Weights to use oi or io format
* FC MKL-DNN: Adjust UT to work with correct dims
test=develop
* Enable MKL DEBUG for resnet50 analyzer
test=develop
* FC MKL-DNN: Improve Hashing function
test=develop
* FC MKL-DNN: Fix shape for fc weights in transpiler
* FC MKL-DNN: Update input pointer in re-used fc primitive
* Add log for not handling fc fuse for unsupported dims
test=develop
* FC MKL-DNN: Move transpose from pass to Op Kernel
test=develop
* FC MKL-DNN: Disable transpose in unit test
test=develop
* FC MKL-DNN: Remove fc_mkldnn_pass from default list
* Correct Flag for fake data analyzer tests
test=develop
* FC MKL-DNN: Add comment about fc mkldnn pass disablement
test=develop
* FC MKL-DNN: Disable fc in int8 tests
test=develop
* Relu6 is the bottleneck op for MobileNet-v2. Since MKL-DNN supports conv/relu6 fusion, we implement the fusion via a fuse pass. Int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR focuses on the fp32 optimization.
The table below shows the benchmark (FPS) measured on SKX-8180 (28 cores):
| Batch size | With fusion | Without fusion |
| -- | -- | -- |
| 1 | 214.7 | 53.4 |
| 50 | 1219.727 | 137.280 |
test=develop
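What the fused primitive computes, conceptually (a sketch, not the MKL-DNN code): relu6 is the conv output clamped to [0, 6], so conv + relu6 can run as one primitive instead of two ops.

```python
import numpy as np

conv_out = (np.random.randn(2, 8, 14, 14) * 4).astype('float32')
relu6_out = np.clip(conv_out, 0.0, 6.0)  # relu6(x) = min(max(x, 0), 6)
```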
* Fix the format issue
test=develop
* Add the missing nolint comments.
test=develop
* Fix the typos.
test=develop
* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
test=develop
* Adjust the indentation.
test=develop
* Add the test_conv_brelu_mkldnn_fuse_pass case.
test=develop
* Slightly update the code per Baidu's comments.
Embed the parameter definitions into the code; that will make the code easier to understand.
test=develop
* add double grad for elementwise_mul. test=develop
* remove comment. test=develop
* fix grad sum. test=develop
* fix for axis expand. test=develop
* add test for axis expand. test=develop
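The second-order relations being implemented, sketched numerically (standard calculus for elementwise z = x * y, not the op code): the first-order grads are dx = dz * y and dy = dz * x, so given the grads-of-grads ddx and ddy, the double-grad op emits ddz = ddx * y + ddy * x.

```python
import numpy as np

x, y = np.array([1., 2.]), np.array([3., 4.])
dz = np.array([1., 1.])          # upstream gradient of z = x * y

dx, dy = dz * y, dz * x          # first-order grads
ddx = np.array([0.1, 0.2])       # grad of dx
ddy = np.array([0.3, 0.4])       # grad of dy
ddz = ddx * y + ddy * x          # double-grad output
```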
* Add conv2d_grad_grad_op
* Extract the cuDNN conv algo searching code into conv_cudnn_helper.h.
- Now use it in conv2d_grad_grad.
- Will simplify the searching code in conv2d and conv2d_grad in a follow-up PR.
* Enhance and fix a bug in the unit tests of gradient_checker.
* Support fetching empty variables; return None in Python.
* add use_cuda to inplace pass, test=develop
* add softmax_with_xe_inplace test, test=develop
* fix potential inplace bug
test=develop
* add more skip vars in mem opt pass, test=develop
* follow comment, test=develop
* follow comments, move duplicate out arg check to program->graph, test=develop
* Add MovingAverageAbsMaxScale operator which is only used for calculating the quantization scale.
* test=develop
* change the output into inplace. test=develop
* Revert "test=develop"
This reverts commit 696cf62699ba1e1c98f61f7345ac7060010eb29a.
* Revert "change the output into inplace. test=develop"
This reverts commit a19acd20f07eee82622701a3015e6e9c073a5e0b.
* test=develop.
* update the MovingAverageAbsMaxScaleOp test. test=develop
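A hedged sketch of the idea (a simple EMA form is assumed here; not necessarily the exact update rule of MovingAverageAbsMaxScaleOp): track the quantization scale as a moving average of max(|x|).

```python
import numpy as np

def update_scale(scale, x, decay=0.9):
    # EMA of the tensor's max absolute value; the scale is later
    # used to map float activations into the int8 range.
    return decay * scale + (1.0 - decay) * float(np.max(np.abs(x)))

scale = 0.0
for _ in range(5):
    scale = update_scale(scale, np.random.randn(100).astype('float32'))
```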
* refine_dropout_mem,test=develop
* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)
# This is the 2nd commit message:
Fleet unify distributed training (#16791)
* implement distributed transpiler with fleet
# This is the 3rd commit message:
ParallelDyGraph with GPU collective mode (#16827)
implement dygraph.parallel.DataParallel to hook reduce op.
# This is the 4th commit message:
Init mixed precision training interface (#16856)
* Init mixed precision training interface
* Add fp16 test script
test=develop
* All initializers support float16
test=develop
* Code cleanup & add more code annotations
test=develop
* Update API spec
test=develop
* Add usage example in doc
test=develop
# This is the 5th commit message:
fix reference_count_pass,test=develop (#17060)
test=develop
# This is the 6th commit message:
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)
* Cache the information of linear interpolation in forward and use it in backward.
test=develop
* Fix cuda kernel.
test=develop
# This is the 7th commit message:
remove unnecessary prepare_data (#17080)
test=develop
# This is the 8th commit message:
fix interpolate cu. test=develop (#17101)
# This is the 9th commit message:
test=develop, double backward leaky_relu (#17067)
backward of backward: leaky_relu
# This is the 10th commit message:
fix fuse optimizer ops (#17102)
test=develop
# This is the 11th commit message:
truncated_gaussian_random supported in distributed training, test=develop (#17091)
# This is the 12th commit message:
Detailed coordinate description for yolov3 loss (#17007)
* Detailed coordinate description for yolov3 loss
test=develop
* modified api.spec
test=develop
* modified loss name
* fix api.spec
test=develop
* polish description
test=develop
* modified api.spec
test=develop
# This is the 13th commit message:
fix test_weight_decay (#17109)
test=develop
# This is the 14th commit message:
Path flag (#17105)
* fix python/paddle/fluid/__init__.py detecting problems
* Init mixed precision training interface
* Add fp16 test script
test=develop
* All initializers support float16
test=develop
* Code cleanup & add more code annotations
test=develop
* Update API spec
test=develop
* Add usage example in doc
test=develop
* resolve #17057
Fixed the bug that the fuse_relu/fuse_residual options couldn't be passed to class TestConv2dInt8Op.
test=develop
* Fix the bug in the test_conv2d_int8_mkldnn case, which was caused by improper parameter passing.
test=develop
* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has the decorators package.
1. Add ReluDoubleGradMaker when registering relu_grad.
2. Add a new gradient checker by comparing theoretical and numerical Jacobian. Check double gradients by double_grad_check.
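The core of such a checker, sketched generically (central finite differences vs. an analytic gradient; not the gradient_checker code):

```python
import numpy as np

def numeric_grad(f, x, eps=1e-6):
    # Central-difference approximation of df/dx for scalar-valued f.
    g = np.zeros_like(x)
    for i in range(x.size):
        d = np.zeros_like(x)
        d.flat[i] = eps
        g.flat[i] = (f(x + d) - f(x - d)) / (2 * eps)
    return g

f = lambda v: np.sum(np.maximum(v, 0.02 * v))  # leaky_relu, alpha=0.02
x = np.array([-1.5, -0.3, 0.4, 1.2, 2.0])      # kept away from the kink at 0
theoretical = np.where(x > 0, 1.0, 0.02)
assert np.allclose(numeric_grad(f, x), theoretical, atol=1e-4)
```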
* move gc test to op_test
test=develop
* Revert "move gc test to op_test"
This reverts commit cf15da65c38f57c91f53b3d8b3c2365d4aa86016.
* enable gc test in some ops
test=develop
* add parallel build script to ci test=develop
* 1. classify the test cases as single-card/two-card/multi-card types
2. run test cases according to the run type
Update the filter generation mechanism so that it can generate negative parameters (see the sketch below).
The original call (np.random.random()) couldn't simulate the conv/relu fusion case.
test=develop
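Why the original filters couldn't exercise the fusion path, sketched (one possible replacement shown; the actual test may differ): np.random.random() samples from [0, 1), so with non-negative inputs every conv output is non-negative and relu never clips anything.

```python
import numpy as np

filt_old = np.random.random((3, 3)).astype('float32')           # all in [0, 1)
filt_new = np.random.uniform(-1, 1, (3, 3)).astype('float32')   # signed values
```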
* speedup gc and inplace softmax_with_cross_entropy_grad
test=develop
* refine GPU memory usage of models
Merge skip vars and warning messages of mem opt
remove relu mem opt
test=develop
* follow comments
test=develop