Paddle

Commit Graph

Author	SHA1	Message	Date
Jiabin Yang	1ce0a09e60	fix con2d transpose bias by create and init it in build_once (#18968 ) * fix con2d transpose bias by create and init it in build_onee * fix API spec * test=develop, invoke ci * fix bias_attr and act has no effect error on layer norm, conv2dTranpose, billinearTensorProduct, sequece_conv. fix original_mode not used error on GRUunit. fix sample_weight not set error on NCE. Add ut for all thoese layer * test=develop, change success standard for conv2dTranspose * test=develop, fix test_layers to invoke some error branch * test=develop, fix sample code * test=develop, fix BilinearTensorProduct failed in dygraph mode * test=develop, fix test_layers segment fault error	6 years ago
Yi Liu	4ef6b8457a	adapte fleet api for localsgd and support nccl comm configuration in executor (#19443 ) test=develop	6 years ago
tangwei12	65c7368400	Fix the correctness of async mode at distributed training (#18863 ) * fix correctness of the communicator * fix a bug in send thread when sending var context is empty, test=develop * add lookup_table_prefetch_op and prefetch optimize, test=develop * remove remote prefetch GPU supported * word2vec force with CPU, test=develop * test dist remote lookup table force with CPU, test=develop	6 years ago
Tao Luo	61389ae5aa	make PADDLE_ENFORCE ci check rule more robust (#19445 )	6 years ago
baojun	6421c61ae2	Update ngraph engine for multiple threading (#19155 ) * update for multiple threading test=develop * remove PADDLE_ENFORCE test=develop	6 years ago
chengduo	e26411cec2	Open test_parallel_dygraph_se_resnext (#19342 ) * enabel test_parallel_dygraph_se_resnext test=develop	6 years ago
Zeng Jinle	caf59d0f3f	Add signal message to stderr (#19421 ) * add signal message to stderr, test=develop * add unittests for ugly SignalHandle, test=develop	6 years ago
Yi Liu	efb05ba258	supports multiple NCCL communicators preserved in NCCLCommContext (#19407 ) * supports multiple NCCL communicators preserved in NCCLCommContext test=develop * add ut for c_comm_init_all operator and fix cuda resource release problem test=develop	6 years ago
Huihuang Zheng	56dd76538c	Delete useless ex-scope in recurrent op (#19426 )	6 years ago
wopeizl	b8aa37d529	save the callstack information to file when exception throws test=dev… (#19324 ) * save the callstack information to file when exception throws test=develop	6 years ago
xsrobin	3f392fd4bc	test=develop (#19463 )	6 years ago
Aurelius84	a9cd513680	improve sequence_conv api doc (#19316 ) * improve sequence_conv api doc test=develop * add warning for padding param test=develop modify into deprecated	6 years ago
zhang wenhui	0d7949831b	fix fleet_desc bug && support format for abacus hotstart (#19430 ) fix fleet_desc dense_table unsort bug ，not support format for abacus hotstart yet.	6 years ago
joanna.wozna.intel	2e3ec66be0	Add conv dequant squash for int8 (#18905 )	6 years ago
vincentXiyu	482ce818bb	Support Tensor input with padding for warpctc op (#19322 ) * support tensor input with padding for warpctc op * merge with develop * test=develop * modified python API examples test=develop * nn.py is modified for code coverage test=develop * update documents info about warpctc op in API.spec test=develop * add test_warpctc_with_padding in test_layers test=develop * add warning log for cuda_version back to warpctc_op.cc * modify API.spec for warpctc op test=develop * modify API.spec * update warpctc test to new CompiledProgram API test=develop * modify code examples for warpctc op test=develop * modify API.spec for warpctc op test=develop * modify API.spec for warpctc op test=develop	6 years ago
chengduo	bfb6ac816e	Fix optimizer bug (#19410 ) * fix optimizer bug test=develop	6 years ago
Leo Chen	6fb310ae29	Fix bug of getting bool Flags from os.environ (#19349 ) * fix bug of getting bool Flags from os.environ, test=develop * add empty loss_name in CompiledProgram for inplace grad test, test=develop	6 years ago
tianshuo78520a	8048992042	add cuda10 support in fast_install.sh and add dynamic get version for release (#19106 ) add cuda10 support in fast_install.sh and add dynamic get version for release, then remove useless ave check for MacOS install check	6 years ago
liu zhengxi	32598ffd8f	Python infer api update and add unit test (#19353 ) * python inference api supports numpy and add unit test, fix unit test fail in test_slim_int8_googlenet and test_slim_int8_mobilenet	6 years ago
Zeng Jinle	807c7a4747	remove recordio convert in dataset, test=develop (#19387 )	6 years ago
chengduo	11070cbff9	enabel seresnext reduce test (#19341 ) test=develop	6 years ago
Ghost Under Moon	10643b4ea6	fix- raise io error when user load from non-existed dir test=develop (#19384 ) This PR fix problem with issue #18096 , which raise an error for user to specify the error about load dir is wrong	6 years ago
mapingshuo	c2e5eaa27d	delete recordio writer (#19406 ) test=develop	6 years ago
mapingshuo	d5ac87ec22	Lookahead optimizer (#19386 ) * Add lookahead optimizer * add unittest for lookahead optimizer test=develop * add doc string for LookaheadOptimizer test=develop test=document_preview * add API spec for lookahead test=develop test=document_preview * modify api spec test=develop test=document_preview * modified doc string * modify the test file test=develop test=document_preview * modify doc string test=develop test=document_preview	6 years ago
Huihuang Zheng	12d29f4d2a	Change TensorCopy in recurrent_op to ShareDataWith (#19319 )	6 years ago
silingtong123	da127d1110	Optimized error reporting information (#19173 ) * test=develop,Optimized error reporting information * test=develop,add importscipy unittest * test=develop, rename the file and function	6 years ago
Jiabin Yang	55931db449	fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328 ) * add default getItem * test=develop, fix has_attr disabled error in Layer * test=develop, fix GroupNorm and deepcf bug on attrs	6 years ago
tangwei12	19dac67e9f	fix distribute transpiler GRPC error code 4, RPC Deadline (#18984 ) * fix sync mode hang in transpiler * remove sync mode in send/recv * replace PADDLE_ENFORCE with PADDLE_ENFORCE_NE	6 years ago
Yibing Liu	5d1575cfe8	Fix arg do_model_average in param_attr (#19376 ) * Fix arg do_model_average in param_attr test=develop * Update api spec test=develop	6 years ago
Tao Luo	c82280e445	remove unused conv_elementwise_add2_act_fuse.cc (#19344 ) test=develop	6 years ago
zhang wenhui	4a3c4b8fa4	add fleet_desc config feature & multi_sparse table, test=develop (#18827 ) add fleet_desc config feature & multi_sparse table,	6 years ago
Jiancheng Li	1799c257ad	Update Light-NAS to support latency-aware search (#19050 ) * update light_nas_strategy: add latency constraint test=develop * update light_nas_strategy: update get_model_latency test=develop * update light_nas_strategy: add more check test=develop * update light_nas test test=develop * update light_nas test test=develop * minor update light_nas test test=develop * minor update light_nas test test=develop * update light_nas test test=develop * update _constrain_func of light_nas_strategy test=develop * update _constrain_func of light_nas_strategy test=develop * remove unused code test=develop	6 years ago
Zhen Wang	0fe72469ea	Add the max-pool2d quantization support and the partial quantization support. (#19310 ) * add pool2d quantization support, only for max-pooling. * add the partial quantization support.	6 years ago
Leo Chen	d49c2bad71	update inplace grad test to new CompiledProgram API, test=develop (#19359 )	6 years ago
Yibing Liu	b2c4f76cf2	Fix sequence mask in dygraph (#19271 ) * Fix data parallel & sequence mask in dygraph test=develop * Revert change in data_parallel test=develop	6 years ago
chengduo	4278518fb0	Update CompiledProgram (#18919 ) * use PE for compiler test=develop	6 years ago
lidanqing	9240e5325c	add local user data conversion into full_pascalvoc_test_preprocess.py (#19283 ) * add local user data conversion into full_pascalvoc_test_preprocess.py test=develop * change PADDLE_ENFORCE to PADDLE_ENFORCE_GE test=develop * change according to reviews test=develop	6 years ago
翟飞跃	2e3ee57954	Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153 ) * Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop	6 years ago
Leo Chen	a9d5fc5142	Enhance OpTest to check the consistency of operators when using and not using inplace (#19101 ) * add pybind interface to get all inplace ops, test=develop * enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop * handle corner cases in op_test, test=develop * support outputs without tensor holder_, like XShape in reshape_op, test=develop * fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop * use reshape_grad instead of reshape in FlattenGradOp, test=develop * fix error debug dims info for variables like XShape, test=develop * change computational order in sum_op to relieve computation difference using inplace, test=develop * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop * follow sneaxiy's comments, test=develop * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop	6 years ago
Aurelius84	0d29cf18f4	Supports diagonal initialization in uniform_random op (#19299 ) * add diag init in Uniform_random op test=develop * modify api.spec test=develop * fix unform_batch_size_like maker test=develop * add diag_num and diag_step assert check test=develop	6 years ago
chengduo	5a579df9ba	[Speedup] Make dygraph data parallel faster (#19280 ) * update parallel.py test=develop	6 years ago
Tao Luo	e3c68bde78	stronger the error message of tensor's mutable_data (#19303 ) * stronger the error message of tensor's mutable_data test=develop * update error message test=develop	6 years ago
chengduo	6a1632318d	Split test_parallel_executor_seresnext to three unit test (#19239 ) * increase test_parallel_executor_seresnext time limit test=develop * split test_parallel_executor_seresnext test=develop * temporally disable reduce_and_allreduce test because of the random failure. test=develop * split gpu and cpu test=develop	6 years ago
tianshuo78520a	188a5caf2e	Split and enhance assert_api_spec_approvals (#19292 )	6 years ago
chengduo	a8a9823dae	add memory profiler (#19320 ) test=develop	6 years ago
Zeng Jinle	561232c25a	remove is_mem_optimized in Program, test=develop (#19307 )	6 years ago
Adam	97d1db1874	Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237 ) * Add generalized Conv+Activation MKLDNN fuse pass creation Part2 test=develop * Undefined behaviour of GetAttrIfExists<> FIX test=develop	6 years ago
wangguanzhong	37428952c6	fix generate mask fpn, test=develop (#19301 )	6 years ago
lidanqing	3fdecc19b7	Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191 ) * add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] to cover avx512=True branch test=develop * change a typo. test=develop	6 years ago
zhaoyuchen2018	5296294dae	Fix elementwise performance poor issue (#19278 ) For small case use 1D block is better than 2D block. Refer to this issue: #19275	6 years ago

1 2 3 4 5 ...

24955 Commits (75d1571995edd5efdd31288563fc43bce4cd458b) All Branches Search

24955 Commits (75d1571995edd5efdd31288563fc43bce4cd458b)

All Branches