Paddle

Commit Graph

Author	SHA1	Message	Date
ShenLiang	2cd3fa3e9a	add scatter_nd op and scatter_nd_add op (#19571 ) * add scatter_nd op, test=document_preview test=develop * fixed the document, test=document_preview test=develop * modify the notes, test=document_preview test=develop * remove the ShareDataWith, test=develop	6 years ago
wawltor	364c44422e	Add the support the int64 data type of `scatter_op` input Index(#18804 ) (#19508 ) * test=develop Fix the scatter op bug when use the add mode, and support the int64 data type of scatter_op Index(#18804). * test=develop Remove the PADDLE_ENFORCE and use PADDLE_ENFORCE_EQ * test=develop Remove the fix bug of scatter_add, and just add the support of int64 in scatter_add * test=develop Add the test case for scatter op, the test case just for index int64	6 years ago
zhongpu	4d26274d25	add detach API for Variable in dygraph mode, test=develop (#19477 ) * add to and detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add detach for Variable in dygraph, test=develop * add exception check, test=develop	6 years ago
whs	1c2aae567a	Skip start epoch and end epoch when dumping strategy in PaddleSlim (#19580 ) test=develop	6 years ago
hutuxian	66ad68ed7b	Update UT test_boxps (#19599 ) Disable test_boxps in win32. Adjust filename to avoid latent multi-thread problem.	6 years ago
baojun	f2ad30c4dd	Some ngraph op and unittest fix (#19515 ) * update ngraph ops test=develop * update unittest test=develop * increase coverage test=develop	6 years ago
Tao Luo	49523ea189	replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586 ) * remove unused PADDLE_ASSERT(_IS_NOT_ERROR) * replace PADDLE_ASSERT with PADDLE_ASSERT_MSG test=develop	6 years ago
gongweibao	abaf87be2b	Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506 ) Change backward_guard to optimize_guard to maximize the allreduce overlap	6 years ago
Zeng Jinle	635cd62d23	remove deprecated memory_optimize usages, test=develop (#19579 )	6 years ago
Youwei Song	9a577f2e41	fix batchnorm api param: data_layout (#19524 ) * fix batchnorm api param: data_layout * fix batchnorm data_layout param; test=develop	6 years ago
xiaoting	7a86706309	modified multiclass_nms example (#19553 ) test=develop, test=document_preview	6 years ago
gongweibao	57f0f0f2dc	Delete pserver complete file before executor running. (#19468 )	6 years ago
JesseyXujin	4a7e6deb63	add padding in linear_chain_crf op (#19583 ) * add padding in linear_chain_crf op * modify API.spec * add linear_chain_crf_op.cc and linear_chain_crf_op.h * remove useless unit test , test=develop * modify API.spec, test=develop * remove some blanks in nn.py , test=develop * fix some bugs on nn.py and API.spec ,test=develop * fix nn.py, test=develop * fix API.spec ,test=develop * fix bug of CI test in test_linear_chain_crf_op.py * fix bug of CI test in test_linear_chain_crf_op.py, test=develop * remove paddle_enforce, test=develop * remove paddle_enforce, test=develop * remove paddle_enforce, test=develop * remove paddle_enforce, test=develop * remove paddle_enforce, test=develop * remove paddle_enforce, test=develop * modify nn.py, test=develop * fix API.spec, test=develop * fix unittest bug, test=develop	6 years ago
hutuxian	c756b5d231	Paddlebox Framework (#18982 ) * Support looking up embeddings from BoxPS. * Add a _pull_box_sparse op, for now this op is not exposed to users. * Add a BoxHelper class, providing 'BeginPass', 'EndPass', 'FeedPass' functions and so on. * Add 'BoxPSDataset' in python code. * Add a compile options WITH_BOX_PS and a MACRO PADDLE_WITH_BOX_PS. * Add UT. * More concrete information pls refer to: https://github.com/PaddlePaddle/Paddle/pull/18982	6 years ago
Zeng Jinle	5dce1da680	remove reset recordio usage (#19519 )	6 years ago
ShenLiang	85914f7a88	add gather_nd op and unit test (#19366 ) * fixed the code for coverage * fixed the document,test=document_preview test=develop	6 years ago
Jacek Czaja	ecd9f330c9	[MKL-DNN] Fix to face model on AVX512 platforms (#19282 ) - Refactor step 1 - Compilation fix - Yet another compilation fix - Even more compilation fix - Lint fixes test=develop - Removed deprecated PADDLE_ENFORCE occurrence test=develop - Candidate fix to BN forward - Lint fixes test=develop - Refactoring in data_layout_transform - compilation fix - Another compilation fix - Step further into darkness - Yet another compilation fix - Yet another compilation fix - missing header - compilation fix - Added MKLDNN -> Paddle conversion in fetch op test=develop - Compilation fix test=develop - Lint test=develop - Mul fix - Fix to MKLDNN MUL op and Elementwise MUL UT test=develop - Workaround for different weights with groups representation Paddle vs MKL-DNN. test=develop - Candidate fix for 5D convolution with groups - Refactor of fix for conv3d and conv2d in fetch op test=develop - Compilation fix - Still same compilation fix - Compilation fix - Compilation fix - Reverted refactoring of fixes - Adapted test_conv2d_int8_mkldnn so it expects data in NCHW format not NHWC test=develop - minor fix in UT test=develop - Lint fixes test=develop	6 years ago
Liufang Sang	9dde564097	change var name padding_num to padding_value (#19498 )	6 years ago
Aurelius84	5b5379b32a	Add sequence_topk_avg_pooling Op (#19442 ) * add topk_avg_pooling * refine api doc and modify api.spec test=develop	6 years ago
chengduo	1cdd3b6985	Disable GC in test_parallel_exe_seresnext (#19408 ) * Disable GC in test_parallel_executor_se_resnext test=develop	6 years ago
yaoxuefeng	10ca3f9609	add thread scope stat accurate metrics test=develop (#19480 ) * add thread scope stat accurate metrics test=develop * fix style * fix style * fix style * fix style test=develop * fix style test=develop * fix style test=develop * fix style test=develop * fix style test=develop * fix style test=develop * fix style test=develop * fix conflict * fix style * fix style test=develop * fix error test=develop * fix error test=develop	6 years ago
Bai Yifan	6d99842bb8	fix mean_iou api example, test=develop, test=document_preview (#19503 ) Fix mean_iou api misleading example	6 years ago
Bai Yifan	8394699dbb	add stop_gradient in range_api, test=develop (#19484 )	6 years ago
chengduo	e340df013e	Support feed single persistable variable to PE (#19417 ) * update executor feed	6 years ago
lidanqing	ba368bf696	clean up intel labeled TODOs (#19476 ) test=develop	6 years ago
Thunderbrook	1fe468d319	support debug each output of each ins (#19004 ) * dump slot * test * proto * dump slot * test * proto * code style * code style * code style * style * add delete after unseen days * add unseen days * code style * conflict solve test=develop * add clear model * code style test=develop * code style test=develop * support debug tensor of each ins test=develop * support debug tensor of each ins test=develop * learning rate * code style * code style * code style * code style * code style * code style * code style * code style * code style * code style * code style * code style * code style test=develop * code style test=develop * unittest * style * style * multi phase * add channel * code style * style * style * unittest * style * define * define test=develop * style test=develop * rm define test=develop * linux * linux test=develop * style test=develop * output format test=develop * windows ci test=develop	6 years ago
zhang wenhui	bd35a7f0a6	support fc sort by number, test=develop (#19466 ) fleet_desc sort fc name by dictionary sort, but we want to sort by number.	6 years ago
Double_V	1d0f04315a	fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412 ) Support Tensor input for row_conv_op	6 years ago
Jiabin Yang	1ce0a09e60	fix conv2d transpose bias by create and init it in build_once (#18968 ) * fix conv2d transpose bias by create and init it in build_once * fix API spec * test=develop, invoke ci * fix bias_attr and act has no effect error on layer norm, conv2dTranspose, BilinearTensorProduct, sequence_conv. fix original_mode not used error on GRUunit. fix sample_weight not set error on NCE. Add ut for all those layers * test=develop, change success standard for conv2dTranspose * test=develop, fix test_layers to invoke some error branch * test=develop, fix sample code * test=develop, fix BilinearTensorProduct failed in dygraph mode * test=develop, fix test_layers segment fault error	6 years ago
Yi Liu	4ef6b8457a	adapt fleet api for localsgd and support nccl comm configuration in executor (#19443 ) test=develop	6 years ago
tangwei12	65c7368400	Fix the correctness of async mode at distributed training (#18863 ) * fix correctness of the communicator * fix a bug in send thread when sending var context is empty, test=develop * add lookup_table_prefetch_op and prefetch optimize, test=develop * remove remote prefetch GPU supported * word2vec force with CPU, test=develop * test dist remote lookup table force with CPU, test=develop	6 years ago
chengduo	e26411cec2	Open test_parallel_dygraph_se_resnext (#19342 ) * enable test_parallel_dygraph_se_resnext test=develop	6 years ago
Yi Liu	efb05ba258	supports multiple NCCL communicators preserved in NCCLCommContext (#19407 ) * supports multiple NCCL communicators preserved in NCCLCommContext test=develop * add ut for c_comm_init_all operator and fix cuda resource release problem test=develop	6 years ago
Aurelius84	a9cd513680	improve sequence_conv api doc (#19316 ) * improve sequence_conv api doc test=develop * add warning for padding param test=develop modify into deprecated	6 years ago
zhang wenhui	0d7949831b	fix fleet_desc bug && support format for abacus hotstart (#19430 ) fix fleet_desc dense_table unsort bug, not support format for abacus hotstart yet.	6 years ago
vincentXiyu	482ce818bb	Support Tensor input with padding for warpctc op (#19322 ) * support tensor input with padding for warpctc op * merge with develop * test=develop * modified python API examples test=develop * nn.py is modified for code coverage test=develop * update documents info about warpctc op in API.spec test=develop * add test_warpctc_with_padding in test_layers test=develop * add warning log for cuda_version back to warpctc_op.cc * modify API.spec for warpctc op test=develop * modify API.spec * update warpctc test to new CompiledProgram API test=develop * modify code examples for warpctc op test=develop * modify API.spec for warpctc op test=develop * modify API.spec for warpctc op test=develop	6 years ago
chengduo	bfb6ac816e	Fix optimizer bug (#19410 ) * fix optimizer bug test=develop	6 years ago
Leo Chen	6fb310ae29	Fix bug of getting bool Flags from os.environ (#19349 ) * fix bug of getting bool Flags from os.environ, test=develop * add empty loss_name in CompiledProgram for inplace grad test, test=develop	6 years ago
liu zhengxi	32598ffd8f	Python infer api update and add unit test (#19353 ) * python inference api supports numpy and add unit test, fix unit test fail in test_slim_int8_googlenet and test_slim_int8_mobilenet	6 years ago
Zeng Jinle	807c7a4747	remove recordio convert in dataset, test=develop (#19387 )	6 years ago
chengduo	11070cbff9	enable seresnext reduce test (#19341 ) test=develop	6 years ago
Ghost Under Moon	10643b4ea6	fix: raise IO error when user loads from non-existent dir test=develop (#19384 ) This PR fixes the problem in issue #18096 by raising an error that tells the user the load dir is wrong	6 years ago
mapingshuo	c2e5eaa27d	delete recordio writer (#19406 ) test=develop	6 years ago
mapingshuo	d5ac87ec22	Lookahead optimizer (#19386 ) * Add lookahead optimizer * add unittest for lookahead optimizer test=develop * add doc string for LookaheadOptimizer test=develop test=document_preview * add API spec for lookahead test=develop test=document_preview * modify api spec test=develop test=document_preview * modified doc string * modify the test file test=develop test=document_preview * modify doc string test=develop test=document_preview	6 years ago
silingtong123	da127d1110	Optimized error reporting information (#19173 ) * test=develop,Optimized error reporting information * test=develop,add importscipy unittest * test=develop, rename the file and function	6 years ago
Jiabin Yang	55931db449	fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328 ) * add default getItem * test=develop, fix has_attr disabled error in Layer * test=develop, fix GroupNorm and deepcf bug on attrs	6 years ago
tangwei12	19dac67e9f	fix distribute transpiler GRPC error code 4, RPC Deadline (#18984 ) * fix sync mode hang in transpiler * remove sync mode in send/recv * replace PADDLE_ENFORCE with PADDLE_ENFORCE_NE	6 years ago
Yibing Liu	5d1575cfe8	Fix arg do_model_average in param_attr (#19376 ) * Fix arg do_model_average in param_attr test=develop * Update api spec test=develop	6 years ago
zhang wenhui	4a3c4b8fa4	add fleet_desc config feature & multi_sparse table, test=develop (#18827 ) add fleet_desc config feature & multi_sparse table,	6 years ago
Jiancheng Li	1799c257ad	Update Light-NAS to support latency-aware search (#19050 ) * update light_nas_strategy: add latency constraint test=develop * update light_nas_strategy: update get_model_latency test=develop * update light_nas_strategy: add more check test=develop * update light_nas test test=develop * update light_nas test test=develop * minor update light_nas test test=develop * minor update light_nas test test=develop * update light_nas test test=develop * update _constrain_func of light_nas_strategy test=develop * update _constrain_func of light_nas_strategy test=develop * remove unused code test=develop	6 years ago
Zhen Wang	0fe72469ea	Add the max-pool2d quantization support and the partial quantization support. (#19310 ) * add pool2d quantization support, only for max-pooling. * add the partial quantization support.	6 years ago
Leo Chen	d49c2bad71	update inplace grad test to new CompiledProgram API, test=develop (#19359 )	6 years ago
Yibing Liu	b2c4f76cf2	Fix sequence mask in dygraph (#19271 ) * Fix data parallel & sequence mask in dygraph test=develop * Revert change in data_parallel test=develop	6 years ago
chengduo	4278518fb0	Update CompiledProgram (#18919 ) * use PE for compiler test=develop	6 years ago
翟飞跃	2e3ee57954	Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153 ) * Implement the operator with sparse matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implementation on non-Linux platforms. test=develop * optimize bp with mkl sparse matrix test=develop	6 years ago
Leo Chen	a9d5fc5142	Enhance OpTest to check the consistency of operators when using and not using inplace (#19101 ) * add pybind interface to get all inplace ops, test=develop * enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop * handle corner cases in op_test, test=develop * support outputs without tensor holder_, like XShape in reshape_op, test=develop * fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop * use reshape_grad instead of reshape in FlattenGradOp, test=develop * fix error debug dims info for variables like XShape, test=develop * change computational order in sum_op to relieve computation difference using inplace, test=develop * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop * follow sneaxiy's comments, test=develop * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop	6 years ago
Aurelius84	0d29cf18f4	Supports diagonal initialization in uniform_random op (#19299 ) * add diag init in Uniform_random op test=develop * modify api.spec test=develop * fix unform_batch_size_like maker test=develop * add diag_num and diag_step assert check test=develop	6 years ago
chengduo	5a579df9ba	[Speedup] Make dygraph data parallel faster (#19280 ) * update parallel.py test=develop	6 years ago
chengduo	6a1632318d	Split test_parallel_executor_seresnext to three unit test (#19239 ) * increase test_parallel_executor_seresnext time limit test=develop * split test_parallel_executor_seresnext test=develop * temporarily disable reduce_and_allreduce test because of the random failure. test=develop * split gpu and cpu test=develop	6 years ago
Zeng Jinle	561232c25a	remove is_mem_optimized in Program, test=develop (#19307 )	6 years ago
lidanqing	3fdecc19b7	Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191 ) * add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] to cover avx512=True branch test=develop * change a typo. test=develop	6 years ago
xiaoting	62facc7e47	fix yolo_box python example (#18925 ) test=develop, test=document_preview	6 years ago
danleifeng	0865b5a9a0	distribute launch : add use_paddlecloud argument (#19273 ) distribute launch : add use_paddlecloud argument	6 years ago
Zhaolong Xing	76c95af000	Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213 ) * fix mask rcnn bug: 1. affine channel fuse (diff) 2. condition block op (memory leak) 3. merge lod tensor op (diff) 4. memory optim (diff) test=develop * fix ci about PADDLE_ENFORCE fix merge lod infer op ut test=develop	6 years ago
lvmengsi	d08d5ab519	Fix the mistake of convolution (#19274 )	6 years ago
Aurelius84	78a3d837f8	Add match_matrix_tensor op (#18525 ) * add match_matrix_tensor op test=develop * fix ignore unittest if with_mkl=off test=develop * clean code and rm is_test param test=develop * modify API.spec test=develop * rm useless code in search_compute.h test=develop * modify api.spec test=develop * modify default_grad.spec test=develop * Add API test code test=develop * clean code in search_compute.h * modify PADDLE_ENFORCE and clean search_compute.h test=develop * fix code style test=develop	6 years ago
Zeng Jinle	5b6673c44d	merge develop to solve conflict, also fix API doc, test=develop (#18823 )	6 years ago
zhang wenhui	539c870753	add fl_listen_and_serv &fl_transpiler,test=develop (#19091 ) add fl_listen_and_serv op for Federated_learning and fl_distribute_transpiler add this op to pserver program . This op just listen the endpoint and sum&scale.	6 years ago
kh2se2013	27e85625b8	add python coverage launch when WITH_COVERAGE=ON (#19264 ) add python coverage launch when WITH_COVERAGE=ON	6 years ago
chengduo	8a89ca94ce	Fix REGISTER_OP_WITHOUT_GRADIENT (#19251 ) * fix REGISTER_OP_WITHOUT_GRADIENT test=develop	6 years ago
gongweibao	fd4b15a2f6	Unset unittests http_proxy env to avoid timeout. (#19269 ) Unset unittests http_proxy env to avoid timeout.	6 years ago
silingtong123	a94a25867d	improve the doc of decorate_reader API (#19206 ) * improve the doc of decorate_reader API, test=develop * update API.spec, test=develop	6 years ago
gongweibao	86f0591175	Remove node_num function. (#19167 ) node_num is not needed for users, so remove them and fix the bugs about it!	6 years ago
Tao Luo	2f8c7e021f	remove unused inference_transpiler unit-tests (#19130 ) * remove unused inference_transpiler unit-tests test=develop * remove InferenceTranspiler usage in quantize_transpiler.py test=develop	6 years ago
zhaoyuchen2018	0c71c839ec	Fix recurrent op not update grad issue (#18581 ) * Fix recurrent op fails For the variable used in outer block, copy sub block's grad variable to outer block test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Fix unicode error test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine test code test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Fix seq2seq case fails test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * remove unreasonable code. test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com> * Refine code according to comment test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	6 years ago
Hao Wang	d53fa53b65	CI - Improve example code check (#19170 ) * add exception exit on error example codes test=develop	6 years ago
Adam	b837689e97	Add generalized Conv+Activation MKLDNN fuse pass creation (#19072 ) test=develop	6 years ago
Yibing Liu	50b1cab122	Add padding support for crf_decoding (#19057 ) * Add padding support for crf_decoding * Fixes in compute kernel test=develop * Update API Spec test=develop * Update API.spec test=develop * Avoid using paddle_enforce test=develop * Fix enforce test=develop	6 years ago
Aurelius84	45fb031f6b	remove is_test param of FC test=develop (#19209 ) Remove is_test parameter of FC op. The parameter is_test is not used anywhere.	6 years ago
wuzewu	6fc1defd77	Fix compatibility issue of fluid.io.save_vars on windows platform (#19181 )	6 years ago
liym27	c8cdef37b2	change the default value of summarize from -1 to 20 in Print API to improve ease of use (#18738 ) * change the default value of summarize from -1 to 20 in Print op to improve ease of use, test=develop * change the doc of API Print to make the document easier to understand, test=develop	6 years ago
LielinJiang	1331c9e1f8	fix distributions unittest bug, test=develop (#19012 )	6 years ago
lvmengsi	c6f163cd7a	add description of sync_bn (#19056 )	6 years ago
Zeng Jinle	0f9b33954a	move python reader api to fluid.io module, test=develop (#19143 )	6 years ago
jiaqi	b86be13c15	fix default value (#19193 ) * fix default value in ps_pb2.py: delta_keep_days 30 -> 16 * test=develop	6 years ago
jiaqi	b104ea0684	add get_last_save_xbox_base/get_last_save_xbox (#19122 ) * add get_last_save_xbox_base/get_last_save_xbox * fix fleet_util bug of load paddle model * add doc string in fleet api	6 years ago
jiaqi	bfd514c730	fix default value of fleet desc (#19176 ) * fix default value of fleet desc, default values are same with jingpai * print log when save model	6 years ago
lidanqing	c548e370f1	UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011 ) * integrations problem test=develop * add batch_norm_mkldnn_op backward-reuse test and guassian seed=0 test test=develop	6 years ago
Jiawei Wang	6ac32d0981	Instag Implementation (#18394 ) * instag lod tensor impl * First PR for instag * First PR for instag * Before adding Selection Rows. * Change name from instag to filter_instag, add upgrade the impl of filter_instag * Change name from instag to filter_instag, add upgrade the impl of filter_instag * Fix yapf error in gradient_checker.py to pass Travis-CI * Fix Filter Instag Grad test=develop * Fix Filter Instag Grad test=develop * 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop * Impl Loss_weight and empty output handler * change Loss Weight datatype to Float32, and add Loss Weight as 2nd output * 1) Support Tensor Input(without LOD) 2) Add Unit test * Filter By Instag Final test=develop * Update API.spec for filter_by_instag test=develop * Update API.spec for filter_by_instag 2 test=develop * Add Filter By Instag Coverage * code format of test_layers.py * code format test_layers.py test=develop * Make API args more readable test=develop * Make API args more readable and pass code format test=develop * Filter By Instag Op, Rename Map to Index Map test=develop * Filter By Instag Op, code format err in filter_by_instag_op.cc test=develop * Filter by instag op: code format of cpp files test=develop * Filter by instag Op: Api spec modification test=develop * Filter by instag Op: Api spec doc id modification test=develop * Filter by instag Op: Api spec and doc preview test=develop test=document_preview * Filter By Instag Op, fix doc error test=document_preview test=develop * Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop * Filter By Instag Op, fix Api spec test=document_preview test=develop * Filter By Instag Op, fix Paddle Enforce deprecated warning test=document_preview test=develop * Filter By Instag Op, fix Paddle Enforce deprecated and code format warning test=document_preview test=develop	6 years ago
wawltor	0019eb376a	Fix the error of op `ones_like` document, change the output variable test=document_preview test=develop Fix the error of op `ones_like` document, change the output variable from x to out.	6 years ago
huangjun12	20f18930ae	Add hard swish op (new op) (#19001 ) * add hard_swish activation op (new op) test=develop * remove redundancy files * modify document content of HardSwish OP * add API test in test_layers.py * add dynamic_graph for test_hard_swish	6 years ago
gongweibao	29d8781240	Polish fleet API to support cuda collective mode and nccl2 mode. (#18966 ) Polish fleet API to support cuda collective mode and nccl2 mode	6 years ago
wopeizl	80b7ef6fc8	add tensorrt support for windows (#19084 ) * add tensorrt support for windows	6 years ago
Kevin	744279fe68	Refine embedding Api doc (#18820 ) * fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * refine embedding padding_idx doc test=develop * fix math:padding_idx preview bug test=develop * modify API.spec test=develop * fix spell error test=develop * refine dtype parm desc test=develop	6 years ago
yaoxuefeng	9150cf50fc	add save cache model api in fleet & add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871 ) * add ctr related metric layer test=develop * add save cache and slots shuffle test=develop * add save cache and slots shuffle test=develop * fix error * fix error * fix style for ci * fix for comments * change SlotsShuffle input to std::string for generality * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * change non-const reference to pointer * fix style * fix style * fix style test=develop * fix style test=develop * add return ins num in ctr metric op * change dtype to float in metric_op.py * fix error test=develop * fix style test=develop * fix API spec * fix API spec * fix API spec test=develop * add UT test=develop	6 years ago
Zeng Jinle	c51eb6bb14	remove book_memory_optimization directory, test=develop (#19117 )	6 years ago
Zeng Jinle	c194b0c835	Try to deprecate unstable python memory optimize (#18983 ) * deprecate python memory optimize, test=develop * remove memory_optimize in unittests, test=develop * add unittests to deprecated interfaces, test=develop	6 years ago
hutuxian	5a80cc8431	Datafeed support reading to cuda place directly. (#19071 ) * add a place field in DataFeed to denote which place it will feed data to. * abstract the copy process in CopyToFeedTensor function * add UT for float32 type and for CUDAPlace	6 years ago
chengduo	3f4c088ad8	prune the feed op in compiler (#18997 ) test=develop	6 years ago
chengduo	d23603322e	Remove compile from PE (#19080 ) * remove compile from PE test=develop	6 years ago
ShenLiang	4397cb318e	add eye op, kernel and unittest test=develop (#18980 ) * add eye op,test=document_preview test=develop * fix the API.spec, test=develop * fix the document, test=document_preview test=develop * add unittest for CI coverage, test=develop	6 years ago
Kaipeng Deng	f86fead693	Add trilinear_interp OP (#18711 ) * add trilinear interp. test=develop * fix unittest. test=develop * add python api and test_layers. test=develop * refine API.spec. test=develop * fix format. test=develop * add python API test. test=develop * format code. test=develop * refine code structure. test=develop * fix format * fix doc. test=develop * fix coverage. test=develop * fix format. test=develop	6 years ago
chengduo	17d62ab220	Enhance fuse optimization op pass (#19010 ) * Enhance fuse optimization op pass test=develop	6 years ago
chengduo	21440b4d69	Add call stack info during compile time (#19067 ) * Add call stack info during runtime and compile time test=develop * Rename operator_call_stack test=develop * Add unit test test=develop * follow comment test=develop	6 years ago
jiaqi	a99bc64c63	add fleet util, add some interface in hdfs util (#18752 ) * add fleet util (fleet/utils/fleet_util.py): functions for users' convenience * add some interface in hdfs util : hdfs is_file, hdfs cat	6 years ago
mapingshuo	4ad7c9d5a7	[WIP] Add Imdb train demo (#18895 ) * add train demo for imdb text classification task * make inference library release data_feed dataset dataset_factory data_feed_factory * add String Data Generator * new feature of train demo: save model params * New feature of train demo: set training config using gflags * change code style for CI * add readme and dataset for imdb demo trainer	6 years ago
wangguanzhong	e50f527fee	update roi doc in roi_pool and roi_align (#19036 ) * update roi doc in roi_pool and roi_align, test=develop	6 years ago
Leo Chen	8f53735437	Fix memory overwriting of tensors returned by executor (#19030 ) * fix memory overlapping of fetch var (return of executor.run), test=develop * fix wrong usage of ParallelExecutor in op_test, test=develop * remove useless parameter and simplify code * avoid tensor destruct untimely, test=develop * add testcase independent of OpTest, test=develop	6 years ago
Youwei Song	95ff4fba61	specify the highest numpy version under python 2.x (#19018 ) As mentioned in this link, the last version of NumPy to support Python 2.7 is numpy 1.16.4.	6 years ago
Kaipeng Deng	1f46253d4a	fix natural exp decay doc. test=develop (#19025 )	6 years ago
LielinJiang	e5b9753a18	Fix ExponentialMovingAverage api bug in python3, test=develop (#18775 )	6 years ago
Kevin	e681d65515	Add var_conv_2d op (#18518 ) * fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * add var_conv_2d op test=develop * edit api.spec test=develop * ignore unittest if with_mkl=off test=develop * fix python3 division test=develop * fix ignore unittest bug test=develop * remove useless code test=develop * modify api.spec test=develop * modify default_grad.spec test=develop	6 years ago
Chen Weihang	81fe02c3fe	Fix config description error in cuda_profiler function document (#18750 ) * fix profiler doc error, test=develop * update API.spec, test=develop	6 years ago
Zeng Jinle	311f90f1eb	reduce_unittest_time,test=develop (#19005 )	6 years ago
lvmengsi	5d9df8c8c7	fix dropout (#18965 ) Fix dropout in nn.py	6 years ago
SunGaofeng	4da1c4f15d	fix g_param shape mismatch in WeightNormParamAttr (#18940 ) * fix g_param shape mismatch in WeightNormParamAttr * add comment to show why insert reshape in startup_program test=develop	6 years ago
Jiabin Yang	af63b1184c	test=develop, fix memory leak in dygraph (#18998 )	6 years ago
liuwei1031	a43a763b54	fix warpctc.dll not found issue (#18761 ) * fix warpctc.dll not found issue, test=develop * revert the linux platform change, test=develop * delete warpctc_lib_path.h.in, test=develop * add SetPySitePackagePath function * fix warpctc.dylib not found issue on Mac, test=develop * improve the paddle lib path setting logic, test=develop * fix mac ci issue caused by test_warpctc_op unittest, test=develop * tweak code, test=develop	6 years ago
chengduo	01c7daade7	Add checking for the fetch_list of Executor.run (#18957 ) * update exe.run	6 years ago
Liufang Sang	faf6890b6c	support tensor input for ctc align op (#18887 ) * test=develop support Tensor input for ctc_align_op * test=develop add some comment	6 years ago
Dong Daxiang	c97ea53c3e	make listen and server as exclusive run (#18990 ) make listen and server as exclusive run	6 years ago
xsrobin	8ce902541c	fix unalign of some examples (#18943 ) * test=develop test=document_preview * Update API.spec	6 years ago
Zeng Jinle	7ac748adb4	Open gc by default (#18836 ) * open gc by default, test=develop * fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop * fix conditional_block op eager deletion bug, test=develop * add some comments to reviewers, test=develop	6 years ago
hong	f745d6d9e4	fix expand op dtype build bugs; test=develop (#18932 )	6 years ago
jiaqi	02c370c3dc	support filelist size < trainer num && fix pull dense (#18956 ) * support filelist size < trainer num * pull dense when stop, to make sure local dense params are same as pserver, so save paddle model will save dense model same as pserver * enable QueueDataset train same filelist for several times	6 years ago
石晓伟	ee2f296ef8	Fusion: seqpool_cvm_concat (#18471 ) * add fusion_seqpool_cvm_concat test=develop * simplify pass, test=develop * fix code style, test=develop	6 years ago
jiaqi	768059b3a0	adjust ins weight according to nid slot (#18784 ) adjust ins weight according to nid slot , user can specify adjust_ins_weight in strategy	6 years ago
wawltor	3ab1866ca5	Add the op of unique_with_counts, expand count function of the op unique (#18720 ) * test=develop Add the op of unique_with_counts, the op is calc the unique input of data, and output the corresponding indices and count of data. * test=develop Check the input and dtype in the op of unique_with_counts * test=develop test=document_preview update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count` * test=develop test=document_preview Fix some python api problem in the op of `unique_with_counts`, and change the error message in this op. * Fix some API problem in the op of `unique_with_counts` test=develop test=document_preview * test=develop test=document_preview Fix the api sample of op `unique_with_counts`, and update api.spec	6 years ago
LielinJiang	22fa4c2d24	Fix depthwise conv gpu kernel bug (#18582 ) * fix depthwise conv gpu kernel bug, test=develop * add more depthwise conv test, test=develop	6 years ago
whs	c92b78b060	Fix unittest of light nas. (#18931 ) test=develop	6 years ago
jiaqi	233746d89d	set fleet_send_batch_num a default value according to trainer num (1) set fleet_send_batch_num a default value according to trainer num, the previous 80000 is fixed, if trainer num is much less or larger than 100, global shuffle may have timeout error. (2) fix load one table bug, add barrier	6 years ago
chengduo	20859c08e8	[DyGraph] Make multi-card program faster (#18892 ) * update parallel.py test=develop	6 years ago
HaoRen	24f8543106	Add center Loss Op Support (#18681 ) * support center loss * change tensor copy api to high level api tensorcopy * test=develop rewrite the center_loss cuda_kernel to make it faster and add document of the center loss api,also update test function * test=document_preview test=develop update document of center loss * test=document_preview test=develop modify API.spec modify test code remove nouse const_cast	6 years ago
lvmengsi	d21c391447	replace paper link (#18861 ) Update conv2d transpose link	6 years ago
Dong Daxiang	2bb296dfe9	make dist unit test exclusive run (#18865 ) make dist unit test exclusive run	6 years ago
whs	6cccab9203	Make lod_append support variable lod. (#18908 ) test=develop	6 years ago
danleifeng	e0a2d4dfec	Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848 )	6 years ago
chengduo	ecd2bdada6	add CPUInplaceTestWithFuseOptimizationOps (#18867 ) test=develop	6 years ago
Zeng Jinle	8008ab4e6b	Remove legacy C++ memory optimization codes (#18834 ) * remove legacy memory optimization codes, test=develop * follow huihuang's comments,test=develop * follow luotao's comments, test=develop	6 years ago
Thunderbrook	52c1431eee	add clear_model interface in fleetwrapper (#18815 ) * dump slot * test * proto * dump slot * test * proto * code style * code style * code style * style * add delete after unseen days * add unseen days * code style * conflict solve test=develop * add clear model * code style test=develop * code style test=develop	6 years ago
Zeng Jinle	9a8a7a1ddc	fix affine_channel no_need buffer bug, test=develop (#18844 )	6 years ago
lvmengsi	829ef26281	Fix drop deconv (#18813 ) * replace link * update api.spec * fix mistake	6 years ago
chengduo	4140fe11a4	Open fuse optimization ops (#18741 ) * open fuse optimization ops test=develop	6 years ago
chengduo	582cc29799	add warning info for CPU_NUM (#18840 ) test=develop	6 years ago
Adam	ee02227949	Add LeakyReLU MKLDNN support (#18762 )	6 years ago
Zeng Jinle	a802da650b	Feature/mem opt pass refactor (#18735 ) * first version memory optimize pass, test=develop * remove move_tensor_sharing_pass, test=develop * refine code comments, add unittests, test=develop * turn off memory_optimize by default, test=develop * follow huihuang's comments, test=develop * follow chengduoZH's comments, test=develop * fix grammar error, add const qualifier, fix pass_test exception message, test=develop * follow chengduoZH's comments 2nd, test=develop	6 years ago
石晓伟	9dbb62eeb9	Fix examples of API (#18092 ) * fix logical APIs test=develop test=document_preview * fix isfinite * update matmul comments * update API.spec test=document_preview test=develop * update API.spec test=document_preview test=develop * update API.spec test=document_preview test=develop	6 years ago
guru4elephant	30562e371b	refine launch_ps and role_maker (#18795 ) refine launch_ps and role_maker	6 years ago
fuyinno4	c167a4b4dd	Fix shrink-dense and add scale-datanorm (#18746 ) Fix FleetWrapper: 1. fix shrink dense: just scale show 2. add datanorm scale: divide datanorm's gradient by batch_size	6 years ago
guru4elephant	2efb282c86	split test_dist_se_resnext.py into 4 testcases (#18743 ) * split test_dist_se_resnext.py into 4 testcases	6 years ago

9298 Commits (ebff68fa74c3f278b97326fec56d775a94323623)