Paddle

Commit Graph

Author	SHA1	Message	Date
Zhen Wang	0fe72469ea	Add the max-pool2d quantization support and the partial quantization support. (#19310 ) * add pool2d quantization support, only for max-pooling. * add the partial quantization support.	6 years ago
Leo Chen	d49c2bad71	update inplace grad test to new CompiledProgram API, test=develop (#19359 )	6 years ago
Yibing Liu	b2c4f76cf2	Fix sequence mask in dygraph (#19271 ) * Fix data parallel & sequence mask in dygraph test=develop * Revert change in data_parallel test=develop	6 years ago
chengduo	4278518fb0	Update CompiledProgram (#18919 ) * use PE for compiler test=develop	6 years ago
翟飞跃	2e3ee57954	Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop	6 years ago
Leo Chen	a9d5fc5142	Enhance OpTest to check the consistency of operators when using and not using inplace (#19101 ) * add pybind interface to get all inplace ops, test=develop * enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop * handle corner cases in op_test, test=develop * support outputs without tensor holder_, like XShape in reshape_op, test=develop * fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop * use reshape_grad instead of reshape in FlattenGradOp, test=develop * fix error debug dims info for variables like XShape, test=develop * change computational order in sum_op to relieve computation difference using inplace, test=develop * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop * follow sneaxiy's comments, test=develop * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop	6 years ago
Aurelius84	0d29cf18f4	Supports diagonal initialization in uniform_random op (#19299 ) * add diag init in Uniform_random op test=develop * modify api.spec test=develop * fix unform_batch_size_like maker test=develop * add diag_num and diag_step assert check test=develop	6 years ago
chengduo	5a579df9ba	[Speedup] Make dygraph data parallel faster (#19280 ) * update parallel.py test=develop	6 years ago
chengduo	6a1632318d	Split test_parallel_executor_seresnext to three unit test (#19239 ) * increase test_parallel_executor_seresnext time limit test=develop * split test_parallel_executor_seresnext test=develop * temporally disable reduce_and_allreduce test because of the random failure. test=develop * split gpu and cpu test=develop	6 years ago
Zeng Jinle	561232c25a	remove is_mem_optimized in Program, test=develop (#19307 )	6 years ago
lidanqing	3fdecc19b7	Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191 ) * add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] to cover avx512=True branch test=develop * change a typo. test=develop	6 years ago
xiaoting	62facc7e47	fix yolo_box python example (#18925 ) test=develop, test=document_preview	6 years ago
danleifeng	0865b5a9a0	distribute launch : add use_paddlecloud argument (#19273 ) distribute launch : add use_paddlecloud argument	6 years ago
Zhaolong Xing	76c95af000	Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213 ) * fix mask rcnn bug: 1. affine channel fuse (diff) 2. condition block op (memory leak) 3. merge lod tensor op (diff) 4. memroy optim (diff) test=develop * fix ci aboud PADDLE_ENFOCE fix merge lod infer op ut test=develop	6 years ago
lvmengsi	d08d5ab519	Fix the mistake of convolution (#19274 )	6 years ago
Aurelius84	78a3d837f8	Add match_matrix_tensor op (#18525 ) * add matrch_matrix_tensor op test=develop * fix ignore unittest if with_mkl=off test=develop * clean code and rm is_test param test=develop * modify API.spec test=develop * rm useless code in search_compute.h test=develop * modify api.spec test=develop * modify default_grad.spec test=develop * Add API test code test=develop * clean code in search_computer.h * modify PADDLE_ENFORCE and clean search_compute.h test=develop * fix code style test=develop	6 years ago
Zeng Jinle	5b6673c44d	merge develop to solve conflict, also fix API doc, test=develop (#18823 )	6 years ago
zhang wenhui	539c870753	add fl_listen_and_serv &fl_transpiler,test=develop (#19091 ) add fl_listen_and_serv op for Federated_learning and fl_distribute_transpiler add this op to pserver program . This op just listen the endpoint and sum&scale.	6 years ago
kh2se2013	27e85625b8	add python coverage launch when WITH_COVERAGE=ON (#19264 ) add python coverage launch when WITH_COVERAGE=ON	6 years ago
chengduo	8a89ca94ce	Fix REGISTER_OP_WITHOUT_GRADIENT (#19251 ) * fix REGISTER_OP_WITHOUT_GRADIENT test=develop	6 years ago
gongweibao	fd4b15a2f6	Unset unittests http_proxy env to avoid timeout. (#19269 ) Unset unittests http_proxy env to avoid timeout.	6 years ago
silingtong123	a94a25867d	imporve the doc of decorate_reader API (#19206 ) * imporve the doc of decorate_reader API, test=develop * udpate API.spec, test=develop	6 years ago
gongweibao	86f0591175	Remove node_num function. (#19167 ) node_num is not needed for users, so remove them and fix the bugs about it!	6 years ago
Tao Luo	2f8c7e021f	remove unused inference_transpiler unit-tests (#19130 ) * remove unused inference_transpiler unit-tests test=develop * remove InferenceTranspiler usage in quantize_transpiler.py test=develop	6 years ago
zhaoyuchen2018	0c71c839ec	Fix recurrent op not update grade issue (#18581 ) * Fix recurrent op fails For the variable used in outter block, copy sub block's grad variable to outter block test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Fix unicode error test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine test code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Fix seq2seq case fails test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * remove unreasonable code. test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code according to comment test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	6 years ago
Hao Wang	d53fa53b65	CI - Improve example code check (#19170 ) * add exception exit on error example codes test=develop	6 years ago
Adam	b837689e97	Add generalized Conv+Activation MKLDNN fuse pass creation (#19072 ) test=develop	6 years ago
Yibing Liu	50b1cab122	Add padding support for crf_decoding (#19057 ) * Add padding support for crf_decoding * Fixes in comupte kernel test=develop * Update API Spec test=develop * Update API.spec test=develop * Avoid using paddle_enforce test=develop * Fix enforce test=develop	6 years ago
Aurelius84	45fb031f6b	remove is_test param of FC test=develop (#19209 ) Remove is_test parameter of FC op. The parameter is_test is not used anywhere.	6 years ago
wuzewu	6fc1defd77	Fix compatibility issue of fluid.io.save_vars on windows platform (#19181 )	6 years ago
liym27	c8cdef37b2	change the default value of summarize from -1 to 20 in Print API to improve ease of use (#18738 ) * change the default value of summarize from -1 to 20 in Print op to improve ease of use, test=develop * change the doc of API Print to make the document easier to understand, test=develop	6 years ago
LielinJiang	1331c9e1f8	fix distributions unittest bug, test=develop (#19012 )	6 years ago
lvmengsi	c6f163cd7a	add description of sync_bn (#19056 )	6 years ago
Zeng Jinle	0f9b33954a	move python reader api to fluid.io module, test=develop (#19143 )	6 years ago
jiaqi	b86be13c15	fix default value (#19193 ) * fix default value in ps_pb2.py: delta_keep_days 30 -> 16 * test=develop	6 years ago
jiaqi	b104ea0684	add get_last_save_xbox_base/get_last_save_xbox (#19122 ) * add get_last_save_xbox_base/get_last_save_xbox * fix fleet_util bug of load paddle model * add doc string in fleet api	6 years ago
jiaqi	bfd514c730	fix default value of fleet desc (#19176 ) * fix default value of fleet desc, default values are same with jingpai * print log when save model	6 years ago
lidanqing	c548e370f1	UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011 ) * integrations problem test=develop * add batch_norm_mkldnn_op backward-reuse test and guassian seed=0 test test=develop	6 years ago
Jiawei Wang	6ac32d0981	Instag Implemention (#18394 ) * instag lod tensor impl * First PR for instag * First PR for instag * Before adding Selection Rows. * Change name from instag to filter_instag, add upgrade the impl of filter_instag * Change name from instag to filter_instag, add upgrade the impl of filter_instag * Fix yapf error in gradient_checker.py to pass Travis-CI * Fix Filter Instag Grad test=develop * Fix Filter Instag Grad test=develop * 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop * Impl Loss_weight and empty output handler * change Loss Weight datatype to Float32, and add Loss Weight as 2nd output * 1) Support Tensor Input(without LOD) 2) Add Unit test * Filter By Instag Final test=develop * Update API.spec for filter_by_instag test=develop * Update API.spec for filter_by_instag 2 test=develop * Add Filter By Instag Coverage * code format of test_layers.py * code format test_layers.py test=develop * Make API args more readable test=develop * Make API args more readable and pass code format test=develop * Filter By Instag Op, Rename Map to Index Map test=develop * Filter By Instag Op, code format err in filter_by_instag_op.cc test=develop * Filter by instag op: code format of cpp files test=develop * Filter by instag Op: Api spec modification test=develop * Filter by instag Op: Api spec doc id modification test=develop * Filter by instag Op: Api spec and doc preview test=develop test=document_preview * Filter By Instag Op, fix doc erro test=document_preview test=develop * Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop * Filter By Instag Op, fix Api spec test=document_preview test=develop * Filter By Instag Op, fix Paddle Encoforce deprecated warning test=document_preview test=develop * Filter By Instag Op, fix Paddle Encoforce deprecated and code format warning test=document_preview test=develop	6 years ago
wawltor	0019eb376a	Fix the error of op `ones_like` document，change the output variable test=document_preview test=develop Fix the error of op `ones_like` document, change the output variable from x to out.	6 years ago
huangjun12	20f18930ae	Add hard swish op (new op) (#19001 ) * add hard_swish activation op (new op) test=develop * remove redundancy files * modify document content of HardSwish OP * add API test in test_layers.py * add dynamic_graph for test_hard_swish	6 years ago
gongweibao	29d8781240	Polish fleet API to support cuda collective mode and nccl2 mode. (#18966 ) Polish fleet API to support cuda collective mode and nccl2 mode	6 years ago
wopeizl	80b7ef6fc8	add tensorrt support for windows (#19084 ) * add tensorrt support for windows	6 years ago
Kevin	744279fe68	Refine embedding Api doc (#18820 ) * fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * refine embedding padding_idx doc test=develop * fix math:padding_idx preview bug test=develop * modify API.spec test=develop * fix spell error test=develop * refine dtype parm desc test=develop	6 years ago
yaoxuefeng	9150cf50fc	add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871 ) * add ctr related metric layer test=develop * add save cache and slots shuffle test=develop * add save cache and slots shuffle test=develop * fix error * fix error * fix style for ci * fix for comments * change SlotsShuffle input to std::strinf for generality * fix style * fix style * fix style * fix style * fix style * fix style * fix stylr * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * change non-const reference to pointer * fix style * fix style * fix style test=develop * fix style test=develop * add return ins num in ctr metric op * change dtype to float in metric_op.py * fix error test=develop * fix style test=develop * fix API spec * fix API spec * fix API spec test=develop * add UT test=develop	6 years ago
Zeng Jinle	c51eb6bb14	remove book_memory_optimization directory, test=develop (#19117 )	6 years ago
Zeng Jinle	c194b0c835	Try to deprecate unstable python memory optimize (#18983 ) * deprecate python memory optimize, test=develop * remove memory_optimize in unittests, test=develop * add unittests to deprecated interfaces, test=develop	6 years ago
hutuxian	5a80cc8431	Datafeed support reading to cuda place directly. (#19071 ) * add a place field in DataFeed to denote which place it will feed data to. * abstract the copy process in CopyToFeedTensor function * add UT for float32 type and for CUDAPlace	6 years ago
chengduo	3f4c088ad8	prune the feed op in compiler (#18997 ) test=develop	6 years ago
chengduo	d23603322e	Remove compile from PE (#19080 ) * remove compile from PE test=develop	6 years ago
ShenLiang	4397cb318e	add eye op, kernel and unitest test=develop (#18980 ) * add eye op,test=document_preview test=develop * fix the API.spec, test=develop * fix the document, test=document_preview test=develop * add unitest for CI coverage, test=develop	6 years ago
Kaipeng Deng	f86fead693	Add trilinear_interp OP (#18711 ) * add trilinear interp. test=develop * fix unittest. test=develop * add python api and test_layers. test=develop * refine API.spec. test=develop * fix format. test=develop * add python API test. test=develop * format code. test=develop * refine code strcuture. test=develop * fix format * fix doc. test=develop * fix converage. test=develop * fix format. test=develop	6 years ago
chengduo	17d62ab220	Enhance fuse optimization op pass (#19010 ) * Enhance fuse optimization op pass test=develop	6 years ago
chengduo	21440b4d69	Add call stack info during compile time (#19067 ) * Add call stack info during runtime and compile time test=develop * Rename operator_call_stack test=develop * Add unit test test=develop * follow comment test=develop	6 years ago
jiaqi	a99bc64c63	add fleet util, add some interface in hdfs util (#18752 ) * add fleet util (fleet/utils/fleet_util.py): functions for users' convenience * add some interface in hdfs util : hdfs is_file、hdfs cat	6 years ago
mapingshuo	4ad7c9d5a7	[WIP] Add Imdb train demo (#18895 ) * add train demo for imdb text classification task * make inference library release data_feed dataset dataset_factory data_feed_factory * add String Data Generator * new feature of train demo: save model params * New feature of train demo: set training config using gflags * change code style for CI * add readme and dataset for imdb demo trainer	6 years ago
wangguanzhong	e50f527fee	update roi doc in roi_pool and roi_align (#19036 ) * update roi doc in roi_pool and roi_align, test=develop	6 years ago
Leo Chen	8f53735437	Fix memory overwriting of tensors returned by executor (#19030 ) * fix memory overlapping of fetch var (return of executor.run), test=develop * fix wrong usage of ParallelExecutor in op_test, test=develop * remove useless parameter and simplify code * avoid tensor destruct untimely, test=develop * add testcase independent of OpTest, test=develop	6 years ago
Youwei Song	95ff4fba61	specify the highest numpy version under python 2.x (#19018 ) As mentioned in this link, the last version of NumPy to support Python 2.7 is numpy 1.16.4.	6 years ago
Kaipeng Deng	1f46253d4a	fix natural exp decay doc. test=develop (#19025 )	6 years ago
LielinJiang	e5b9753a18	Fix ExponentialMovingAverage api bug in python3, test=develop (#18775 )	6 years ago
Kevin	e681d65515	Add var_conv_2d op (#18518 ) * fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * add var_conv_2d op test=develop * edit api.spec test=develop * ignore unittest if with_mkl=off test=develop * fix python3 division test=develop * fix ignore unittest bug test=develop * remove useless code test=develop * modify api.spec test=develop * modify default_grad.spec test=develop	6 years ago
Chen Weihang	81fe02c3fe	Fix config description error in cuda_profiler function document (#18750 ) * fix profiler doc error, test=develop * update API.spec, test=develop	6 years ago
Zeng Jinle	311f90f1eb	reduce_unittest_time,test=develop (#19005 )	6 years ago
lvmengsi	5d9df8c8c7	fix dropout (#18965 ) Fix dropout in nn.py	6 years ago
SunGaofeng	4da1c4f15d	fix g_param shape mismatch in WeightNormParamAttr (#18940 ) * fix g_param shape mismatch in WeightNormParamAttr * add comment to show why insert reshape in startup_program test=develop	6 years ago
Jiabin Yang	af63b1184c	test=develop, fix memory leak in dygraph (#18998 )	6 years ago
liuwei1031	a43a763b54	fix warpctc.dll not found issue (#18761 ) * fix warpctc.dll not found issue, test=develop * revert the linux platform change, test=develop * delete warpctc_lib_path.h.in, test=develop * add SetPySitePackagePath function * fix warpctc.dylib not found issue on Mac, test=develop * improve the paddle lib path setting logic, test=develop * fix mac ci issue caused by test_warpctc_op unittest, test=develop * tweak code, test=develop	6 years ago
chengduo	01c7daade7	Add checking for the fetch_list of Executor.run (#18957 ) * update exe.run	6 years ago
Liufang Sang	faf6890b6c	support tensor input for ctc align op (#18887 ) * test=develop support Tensor input for ctc_align_op * test=develop add some comment	6 years ago
Dong Daxiang	c97ea53c3e	make listen and server as exclusive run (#18990 ) make listen and server as exclusive run	6 years ago
xsrobin	8ce902541c	fix unalign of some examples (#18943 ) * test=develop test=document_preview * Update API.spec	6 years ago
Zeng Jinle	7ac748adb4	Open gc by default (#18836 ) * open gc by default, test=develop * fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop * fix conditional_block op eager deletion bug, test=develop * add some comments to reviewers, test=develop	6 years ago
hong	f745d6d9e4	fix expand op dtype build bugs; test=develop (#18932 )	6 years ago
jiaqi	02c370c3dc	support filelist size < trainer num && fix pull dense (#18956 ) * support filelist size < trainer num * pull dense when stop, to make sure local dense params are same as pserver, so save paddle model will save dense model same as pserver * enable QueueDataset train same filelist for serveral times	6 years ago
石晓伟	ee2f296ef8	Fusion: seqpool_cvm_concat (#18471 ) * add fusion_seqpool_cvm_concat test=develop * simplify pass, test=develop * fix code style, test=develop	6 years ago
jiaqi	768059b3a0	adjust ins weight according to nid slot (#18784 ) adjust ins weight according to nid slot , user can specify adjust_ins_weight in strategy	6 years ago
wawltor	3ab1866ca5	Add the op of unique_with_counts, expand count function of the op unique (#18720 ) * test=develop Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data. * test=develop Check the input and dtype in the op of unique_with_counts * test=develop test=document_preview update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count` * test=develop test=document_preview Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op. * Fix some API problem in the op of `unique_with_counts` test=develop test=document_preview * test=develop test=document_preview Fix the api sample of op `unique_with_counts`, and update api.spec	6 years ago
LielinJiang	22fa4c2d24	Fix depthwise conv gpu kernel bug (#18582 ) * fix depthwise conv gpu kernel bug, test=develop * add more depthwise conv test, test=develop	6 years ago
whs	c92b78b060	Fix unitest of light nas. (#18931 ) test=develop	6 years ago
jiaqi	233746d89d	set fleet_send_batch_num a default value according to trainer num (1) set fleet_send_batch_num a default value according to trainer num， the previous 80000 is fixed，if trainer num is much less or larger than 100，global shuffle may have timeout error. (2) fix load one table bug, add barrier	6 years ago
chengduo	20859c08e8	[DyGraph] Make multi-card program faster (#18892 ) * update parallel.py test=develop	6 years ago
HaoRen	24f8543106	Add center Loss Op Support (#18681 ) * support center loss * change tensor copy api to high level api tensorcopy * test=develop rewrite the center_loss cuda_kernel to make it faster and add document of the center loss api,also update test function * test=document_preview test=develop update document of center loss * test=document_preview test=develop modify API.spec modify test code remove nouse const_cast	6 years ago
lvmengsi	d21c391447	replace paper link (#18861 ) Update conv2d transpose link	6 years ago
Dong Daxiang	2bb296dfe9	make dist unit test exclusive run (#18865 ) make dist unit test exclusive run	6 years ago
whs	6cccab9203	Make lod_append support variable lod. (#18908 ) test=develop	6 years ago
danleifeng	e0a2d4dfec	Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848 )	6 years ago
chengduo	ecd2bdada6	add CPUInplaceTestWithFuseOptimizationOps (#18867 ) test=develop	6 years ago
Zeng Jinle	8008ab4e6b	Remove legacy C++ memory optimization codes (#18834 ) * remove legacy memory optimization codes, test=develop * follow huihuang's comments,test=develop * follow luotao's comments, test=develop	6 years ago
Thunderbrook	52c1431eee	add clear_model interface in fleetwrapper (#18815 ) * dump slot * test * proto * dump slot * test * proto * code style * code style * code style * style * add delete after unseen days * add unseen days * code style * conflict solve test=develop * add clear model * code style test=develop * code style test=develop	6 years ago
Zeng Jinle	9a8a7a1ddc	fix affine_channel no_need buffer bug, test=develop (#18844 )	6 years ago
lvmengsi	829ef26281	Fix drop deconv (#18813 ) * replace link * update api.spec * fix mistake	6 years ago
chengduo	4140fe11a4	Open fuse optimization ops (#18741 ) * open fuse optimization ops test=develop	6 years ago
chengduo	582cc29799	add warning info for CPU_NUM (#18840 ) test=develop	6 years ago
Adam	ee02227949	Add LeakyReLU MKLDNN support (#18762 )	6 years ago
Zeng Jinle	a802da650b	Feature/mem opt pass refactor (#18735 ) * first version memory optimize pass, test=develop * remove move_tensor_sharing_pass, test=develop * refine code comments, add unittests, test=develop * turn off memory_optimize by default, test=develop * follow huihuang's comments, test=develop * follow chengduoZH's comments, test=develop * fix grammar error, add const qualifier, fix pass_test exception message, test=develop * follow chengduoZH's comments 2nd, test=develop	6 years ago
石晓伟	9dbb62eeb9	Fix examples of API (#18092 ) * fix logical APIs test=develop test=document_preview * fix isfinite * update matmul comments * update API.spec test=document_preview test=develop * update API.spec test=document_preview test=develop * update API.spec test=document_preview test=develop	6 years ago
guru4elephant	30562e371b	refine launch_ps and role_maker (#18795 ) refine launch_ps and role_maker	6 years ago
fuyinno4	c167a4b4dd	Fix shrink-dense and add scale-datanorm (#18746 ) Fix FleetWrapper: 1. fix shrink dense: just scale show 2. add datanorm scale: divide datanorm's gradient by batch_size	6 years ago
guru4elephant	2efb282c86	split test_dist_se_resnext.py into 4 testcases (#18743 ) * split test_dist_se_resnext.py into 4 testcases	6 years ago
Bob Zhu	220eef602e	Extend Matmul to support matrix multiplication with multiple heads (#18570 ) * extend matmul op to support multiple head multiplication With the support of multiple head, the multiplication of two big matrixes is split into multiplication of several (head_number) small matrixes. e.g. if Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of [6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].	6 years ago
whs	075e1cf78e	Add python API for appending LoD level (#18702 ) * Make lod reset op support for append lod level. * Fix API.spec test=develop * Fix unitest. test=develop * Add python api for lod append. test=develop * Fix API.spec test=develop * Fix format of doc. test=develop * Fix unitest. test=develop * Fix doc. test=develop	6 years ago
chengduo	8259f1418f	Enhance backward process (#18700 ) * prun backward ops test=develop	6 years ago
JesseyXujin	25c9b57bcd	Modify auc doc. Add output variable description, previously was the scalar type, now changed to the tuple type.test=develop (#18771 )	6 years ago
Thunderbrook	d8396281ef	add slot to sparse table (#18686 ) The change includes 2 things: 1. save delta model and shrink table are control by the same parameter before, now add delete_after_unseen_days to control shrink table. 2. value in sparse table has no slot before, now add slot in sparse table, and add DownpureCtrAccessor to support the new meta. test=develop	6 years ago
jiaqi	d18aabb472	support patch data, add load_one_table, fix bug (#18509 ) （1）support patch data （merge slots of instances of same line id, modify dense layer which changes its size）（2）add fleet load_one_table interface, support load from paddle model and load from pslib model （3）fix push sparse bug which cause push sparse cost more time（about 10% in my testcase）（4）when some slots are not in one of your network (join/update, etc.)，data feed、collect label info、push/pull sparse will skip these slots， instead of throw error. （5）add more debug info in TrainFilesWithProfiler	6 years ago
chengduo	fd3aad6cb3	Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664 ) * support sparse gradients test=develop	6 years ago
Yi Liu	157211c4e1	supports distributed classification (#18690 ) * supports distributed classification training * update API.spec * fix evenly division in python3 * change "index_range" to "index_num" in shard_index operator test=document_preview test=develop	6 years ago
guru4elephant	70b03760fd	add parameter server launch (#18687 ) add parameter server launch so that a user can easily launch parameter server	6 years ago
Zeng Jinle	d07ad4c605	add more traceback to py_reader error msg, test=develop (#18722 )	6 years ago
Huihuang Zheng	a3028bb729	Fix random test_recurrent_op failure (#18718 ) The change includes 3 things: 1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1. 2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values. 3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests. test=develop	6 years ago
Tao Luo	bd22453f20	Revert "Add LeakyRelu MKLDNN support (#18656 )" (#18723 ) test=develop	6 years ago
tangwei12	d845848341	do some odd jobs (#18641 ) do some odd jobs, test=develop	6 years ago
guru4elephant	ebf9797ec3	split different comm method for mnist distributed training (#18715 ) * split different comm method for mnist distributed training	6 years ago
Huihuang Zheng	89bc3fd841	Support memory eager deletion on recurrent OP (#17710 ) Test PaddingRNN on V100 GPU device. Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU. GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR) Speed (steps/s): 10.28 (this PR) vs 9.89 (without this PR)	6 years ago
Adam	d6b6a337a9	Add LeakyRelu MKLDNN support (#18656 ) test=develop	6 years ago
tangwei12	0b9acb4912	add check of executor (#17986 ) * add check of executor, test=develop	6 years ago
Zeng Jinle	ae58afc546	Feature/auto_growth_allocator (#18561 ) * feature/auto_growth_allocator, test=develop * add unittest of AlignedAllocator, test=develop * try to turn on auto_growth to test on CI, test=develop * fix segmentation fault in mixed_vector.h, test=develop * add unittests, test=develop	6 years ago
hutuxian	bb2f5d24a2	hash_op support int64 hash_size (#18674 ) * hash_op support int64 hash_size * add corresponding UT	6 years ago
guru4elephant	5ed713d519	remove ctr reader, all functions are satisfied in dataset (#18672 ) * remove ctr reader, all functions are satisfied in dataset	6 years ago
guru4elephant	ab57d3893e	make auc op compatible with 1 dim (#18551 ) * make auc op compatible with 1 dim	6 years ago
guru4elephant	b71b454328	increase timeout again (#18628 ) test=develop	6 years ago
123malin	b414645a65	fix #17430 : int64类型的attr训练非预期 (#18264 ) * fix int64_t * update fill constant op unittest * add empty line	6 years ago
Kevin	995d7d8600	Modify embedding_op input dtype to int64 (#18598 )	6 years ago
kh2se2013	9ad57f2dfd	1）change to parallel mode on python coverage run (#18594 ) 2）add pip install coverage in Dockerfile.tmp test=develop	6 years ago
gongweibao	c0a82748cf	Polish backwards optimizer dependency codes and use more default values. (#18255 )	6 years ago
Zeng Jinle	d3003a1620	Feature/buffer_shared_inplace (#17911 ) * feature/buffer_shared_inplace, test=develop * refine code, test=develop * fix elementwise_add op cpu inplace and sum inplace bug, test=develop * add unittest and debug log, test=develop * fix parallel_executor scope bug, polish code, test=develop * fix sum op, activation op, single_in_place_inference bug, test=develop * remove kLocalExecScopeName, test=develop * fix unittest,test=develop * fix out_var first version bug, test=develop * follow comments,test=develop	6 years ago
tianshuo78520a	1c10dac4f2	Add code example in CI (#18228 ) * test api example * update python * add sampcd_processor.py * add if 0 * sort * test paddle * test paddle * test paddle * add whitelist * change sampcd_processor.py * change sampcd_processor.py * change sampcd_processor.py * add exit * test=develop * test=develop	6 years ago
lujun	b6d5c74f69	update dygraph api doc for web (#18550 ) remove dygraph.enable from __all__ hidden dygraph. profiler add doc to dygraph. no_grad	6 years ago
guru4elephant	9c17a899d7	upgrade collective fleet api (#18533 ) * upgrade collective fleet api	6 years ago
bingyanghuang	a25be53cb5	QAT int8 MKL-DNN transformation pass with MUL (#18322 )	6 years ago
Physher	0caa08ea40	Add mkldnn int8 mul-op kernel (#17834 )	6 years ago
LielinJiang	24d1c44a0c	Fix roi_perspective_transform_op bug (#18522 ) * fix transform matrix bug, test=develop * modify API.spec	6 years ago
guru4elephant	1f1cc2221f	add random port (#18504 ) * add random port	6 years ago
zhaoyuchen2018	832d8191ff	Fix topk cannot handle 1D vector bug (#18466 ) * Fix topk cannot handle 1D vector bug Add path to handle 1D vector test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * refine code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	6 years ago
Jiabin Yang	7586cdd545	Hide no support (#18515 ) * test=develop, fix docker with paddle nccl problem * test=develop, hide no_support api and add ut for it	6 years ago
LielinJiang	43e17c7951	Add distributions of normal and uniform (#18023 ) * add_distributions_of_normal_and_uniform * paddle/fluid/API.spec * modify API.spec * modified paddle/fluid/API.spec, test=develop * modify paddle/fluid/API.spec, test=develop * modify paddle/fluid/API.spec, test=develop * fix some comment, test=develop * modify API.spec, test=develop * add comment for init function, modify hard code, test=develop * modify API.spec, test=develop * modify API.spec, test=develop * make unit test function shorter, test=develop * modify paddle/fluid/API.spec	6 years ago
tensor-tang	4828a5e008	core remove pycpuinfo (#18479 ) remove pycpuinfo deps in core	6 years ago
qingqing01	602cb6a5b4	Enhance linear_lr_warmup (#18463 ) * make it support float/int learning as input.	6 years ago
chengduo	7453857324	Make fuse_all_reduce_op_pass support mix_precision (#17652 )	6 years ago
chengduo	55baeceddb	Enhance execution error info (#18482 ) * enhance execution error info test=develop	6 years ago
pkpk	e9c7e218f2	Nan debugger init (#18401 ) test=develop	6 years ago
zhoukunsheng	7c6f2350b9	support Tensor input for edit_distance op (#18162 )	6 years ago
zhoukunsheng	26318544d2	support Tensor input for chunk_eval op (#18226 ) * test=develop support Tensor input for chunk_eval op * test=develop fix testcase for chunk_eval op * test=develop fix typos in nn.py	6 years ago
zhoukunsheng	206c44e2a8	add unique kernel and op (#17557 )	6 years ago
zhoukunsheng	71af72b1c2	upgrade hash op to support Tensor and LoDTensor input (#17998 )	6 years ago
zhoukunsheng	d3b3443d10	add ones_like op (#17388 )	6 years ago
zhoukunsheng	67b48d7fe7	add size op (#17412 )	6 years ago
hutuxian	6e0df3102e	Refactor for Pipeline Thread Check (#18459 ) move the thread-check code from train_from_dataset to a single function add UT for the thread check function	6 years ago
Zeng Jinle	41ab76e55b	add friendly error msg to py_reader (#18316 )	6 years ago
Kaipeng Deng	823ab5e887	fix load attr error. test=develop (#18447 )	6 years ago
Yi Liu	a873fa84ce	supports collective training with programs (#18392 ) 1. Since allreduce op has 4 reduce types, We split these four reduce types into four ops 2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and remove the device specified DeviceContext parameter in template as we already knew the target DeviceContext 3. We remove the newly added Collective op role to reduce the complexity of program and graph analysis	6 years ago
guru4elephant	357311fdb7	make fleet support mpi job submit directly (#18441 ) make fleet support mpi job submit directly.	6 years ago
chengduo	e0d8c6ac68	Add find_no_grad_vars in backward.py (#17942 ) * add not_been_used_vars to no_grad_set test=develop	6 years ago
LielinJiang	449c7a9f98	Make roi_perspective_transform op return mask and transform matrix (#18371 ) * modify roi_perspective_transform_op to output mask and transform matrix * modify comment * modify comment * modify API.spec * update API.spec * remove no use header, test=develop * resolve conflict	6 years ago
tensor-tang	a3bc804f5f	fix mac ci random fail (#18430 ) * fix mac ci random fail * use platform instead	6 years ago
xiaoting	dd3f9d19cf	replace mnist dataset url, test=develop (#18429 ) replace mnist dataset url	6 years ago
xsrobin	47e2ef38e9	add "import paddle.fluid as fluid" to examples lack of it	6 years ago
hutuxian	8a39e5c110	update api format (#18413 ) * update api format test=develop * update API.spec test=develop	6 years ago
tensor-tang	ce7a024c6d	fix py-cpuinfo mac random fail (#18383 ) * fix py-cpuinfo mac random fail * differentiate version on windows	6 years ago
Jie Fang	2b4ef509ea	init custom black white list (#18377 ) test=develop	6 years ago
guru4elephant	e83f902b98	add MultiSlotStringDataGenerator for speedup of string based user inp… (#18390 ) * add MultiSlotStringDataGenerator for speedup of string based user input data	6 years ago
Jiabin Yang	43f64a177e	Fix/program doc (#17908 ) * test=develop, add some comments for Program.clone * test=develop, add API.spec * test=develop, refine comments * refine Program doc and clone doc * test=develop, refine doc	6 years ago
chengduo	871cc15e6a	Add is_compiled_with_cuda (#18356 ) * add cuda_is_available test=develop * Fix api.spec test=develop * fix api doc test=develop	6 years ago
Wojciech Uss	8ed819d8e4	Call the test_slim_int8_* tests through absolute path (#18386 ) test=develop	6 years ago
lujun	fd6631ef2f	Fix dygraph show style (#18297 ) Fix dygraph show style for FluidDoc.	6 years ago
翟飞跃	19da59ed3f	Remove all the code, API and doc of MKL-DNN INT8v1 (#18347 )	6 years ago
chengduo	8ed33bf91f	Fix Bug-prone code of PE (#18354 ) * update pe reduce config test=develop * drop the local_exe_scopes of the previous parallel_executor test=develop	6 years ago
tangwei12	999d9a59a5	fix communicator with pyreader (#18350 ) * add is_runnning in communicator, test=develop	6 years ago
kh2se2013	27fb9cad65	add WITH_COVERAGE option, default OFF (#17872 ) * add WITH_COVERAGE option, default OFF test=develop * add coverage for python sdk test=develop * fix code style * fix COVERAGE_FILE path test=develop * remove coverage package test=develop * test = develop, run coverage as module	6 years ago
HaoRen	b7128bac5f	supports collective communicated training (#18175 ) * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * fix comment test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * fix prepare context redundant code problem, optimize executor by caching create_varaiables test=develop * supports collective training in executor * make fetch_list runable with variables, add more unittest for use_program_cache test=develop * use unique name for nccl_id * supports output to stream in program_to_code * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code * set op role in collective training * add collective op role * fix comment test=develop * remove orig file * add build optimizer by strategy * add collective strategy * refine collective strategy * add multi-process role maker * refine strategy building factory so that we can easily plugin more strategy * scale loss grad in collective sgd transpiler * add support for distributed fc * code format * revert some features for dist fc * add support for distributed fc training * test=develop add collective op unittest standard * test=develop remove the test_collective directory * test=develop remove the test_collective directory * remove slicegather test * code format for reducescatter * update attr of shard_index_op * Modify macro nccl_helper * remove test without distribute * macro collective_helper * marcro update * test=develop update support python3.5 * test=develop change gpu memory use to 0.1 when test * test=develop update ut equal func * test=develop set flags to 1.5 * test=develop fix pickle dumple py35 * test=develop fix divide in slice and add sync_comm_stream update atol and rtol to 1e-05 rm shard_index op and test modify read input from file to read from memory remove origin_program in framework and add i/o in c_sync_calc_stream * test=develop update unittest sync operator I/O	6 years ago
qingqing01	9047ac687e	Simplify multi_box_head API in detection.py and remove assign op. (#18310 ) * Simplify multi_box_head API in detection.py and remove assign op.	6 years ago
hutuxian	e42057cd1a	add ut for pipeline training (#18289 )	6 years ago
Jiabin Yang	bd61d89925	test=develop, recover ocr ut on dygraph (#18166 )	6 years ago
Yibing Liu	23941e43ec	Update lamb optimizer (#18333 ) * Update lamb optimizer test=develop, test=document_preview * Regenerate api spec test=develop, test=document_preview	6 years ago
whs	1bdfd2eb85	Fix checkpoint of Light-NAS (#18330 ) Socket can't be pickled. test=develop	6 years ago
Jiabin Yang	79bcdbbf1a	test=develop, disable basic gru related ut (#18329 )	6 years ago
Jiabin Yang	831a3e62ca	Add install check for multigpu (#18323 ) * test=develop, add_install_check_for_multigpu * test=develop, refine code to use cuda_devices	6 years ago
Zeng Jinle	f88e07a0d3	fix lod_tensor.py grammar error, test=develop (#18308 )	6 years ago
Hongyu Liu	df2eee71d8	Sequence mask support tensor (#18249 ) * sequnce mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-x to elmentwise_op; test=develop add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop	6 years ago
Jiabin Yang	9cb799be50	test=develop, Revert "Add multi gpu install check" (#18313 ) * Revert "Add multi gpu install check (#18229)" This reverts commit `61ed06b29a`. * test=develop, start ci	6 years ago
Qiao Longfei	0e08e91c18	optimize communicator merge sparse gradient test=develop (#18159 ) * optimize communicator merge sparse gradient test=develop * revert multithread selected rows merge add test=develop * follow comment test=develop	6 years ago
Jie Fang	172c2facef	init black/white lists (#17847 ) test=develop	6 years ago
chengduo	e06c69c788	Fix default value of fluid.memory_optimize (#18295 ) * fix default value of fluid.memory_optimize test=develop * fix api.spec test=develop	6 years ago
Zhaolong Xing	6978b2e48e	fix split and sampled softmax (#18280 ) test=develop	6 years ago
hutuxian	6ed73830c2	add api desc for pipeline training (#18293 )	6 years ago
liuwei1031	a736c03b10	improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs (#18261 ) * improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs, test=develop * update API.spec, test=develop	6 years ago
chengduo	d54e13bbec	add random seed for recurrent op test (#18274 ) test=develop	6 years ago
liuwei1031	4151d90c16	improve the hint message of memory optimize, test=develop (#18260 )	6 years ago
guru4elephant	ff399fd720	fix paddle cloud role maker bug (#18269 ) * fix paddle cloud role maker bug	6 years ago
Yibing Liu	412951d7d2	Fix ema's example & fp16 update (#18273 ) test=develop, test=document_preview	6 years ago
flame	fdf798f95a	fix double buffer example (#18169 ) test=develop test=document_preview	6 years ago
Bai Yifan	23b8b18e56	fix api doc example, test=develop (#18266 )	6 years ago
pkpk	cd9d57f5e0	fix a bug in examples of metrics.Acc	6 years ago
tensor-tang	68da8b2a01	refine core cmake warning and print more info (#18248 ) * refine core cmake warning and print more info test=develop * fix comments test=develop	6 years ago
zhaoyuchen2018	32c95f1752	Add StaticRNN.output code example (#18251 ) refine StaticRNN api doc test=develop test=document_preview	6 years ago
xiaoting	2f0d68261c	fix yolo_box example,test=develop (#18247 )	6 years ago
songhao	6b3d96254d	fix some bug when merge sparse embedding parameters, test=develop (#18223 ) 1. fix the bug that out_put_var in SaveSelectedRows would be empty string 2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference 3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.	6 years ago
jiaqi	3f8031e256	dataset (#17973 ) (1) use channel instead of vector/BlockingQueue in Dataset，to keep same with existing implementation, and make code more readable and flexible (dataset single output channel or multi output channel). one previous memory out of limit problem is cause by not release memory after training. (2) add Record because MultiSlotType costs too much memory (80B)，fix memory out of limit problem. (3) add Channel, Archive in paddle/fluid/framework (4) change dataset from shared_ptr to unique_ptr in pybind (5) move create/destroy readers from trainer to dataset (6) move shuffle from datafeed to dataset. dataset holds memory, datafeed is only for load data and feed data to network. (7) fix thread num bug of Dataset when filelist size < thread num (8) support set_queue_num in InMemoryDataset	6 years ago
liuwei1031	5d54ed4a84	improve the doc of DataFeeder and default_main_program (#18241 ) * improve the doc of DataFeeder and default_main_program * update API.spec, test=develop	6 years ago

... 2 3 4 5 6 ...

9298 Commits (ebff68fa74c3f278b97326fec56d775a94323623)