Paddle

Commit Graph

Author	SHA1	Message	Date
Zeng Jinle	712bfb17cb	fix recurrent_op,test=develop (#17433 )	6 years ago
Tao Luo	5babcd02dd	Revert "remove unnecessary prepare_data (#17080 )" (#17432 ) This reverts commit `aca60e9a20`.	6 years ago
chengduo	e336dc86bb	[Speed] Refine the Executor when the num_thread=1 (#17405 ) Refine the Executor when the num_thread=1	6 years ago
Zhen Wang	4a1b7fec96	Add setting Scope function for the graph class (#17417 ) * add set_not_owned function for graph * add scope set. test=develop * add scope_ptr enforce not null before setting.test=develop	6 years ago
jiaqi	66d51206b1	add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118 ) * add save/load model, shrink table, cvm, config file & fix pull dense bug test=develop * fix global shuffle bug, fix pull dense bug, fix release memeory bug, fix shrink error add client flush, add get data size test=develop * fix global shuffle bug test=develop * fix global shuffle bug test=develop * fix code style test=develop * fix code style & modify pslib cmake test=develop * fix error of _role_maker test=develop * fix code style test=develop * fix code style test=develop * fix code style test=develop * fix code style test=develop * fix code style test=develop * fix windows compile error of fleet test=develop * fix global shuffle bug * add comment test=develop * update pslib.cmake test=develop * fix fill sparse bug test=develop * fix push sparse bug test=develop	6 years ago
Tao Luo	68ec0a6f74	make parallel_executor support FLAGS_use_mkldnn (#17341 ) * make parallel_executor support FLAGS_use_mkldnn test=develop * add warning when set mkldnn_enabled_op_types_ in non-mkldnn env test=develop	6 years ago
chengduo	bc833945a4	Add DropLocalExeScopes in ParallelExecutor (#17297 ) * reset drop local scope counter test=develop	6 years ago
qingqing01	e32c9888f5	Double backward of conv2d. (#17211 ) * Add conv2d_grad_grad_op * Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h. - Now use it in conv2d_grad_grad. - Will simply the searching code in conv2d and conv2d_grad in next PR. * Enhance and fix bug in unit testing of gradient_checker. * Support to fetch empty variables，return None in Python.	6 years ago
Zeng Jinle	5e5e7b3305	fix data_type error message (#17312 ) test=develop	6 years ago
guru4elephant	5d6a1fcf16	fix infer_from_dataset and train_from_dataset (#17243 ) * fix train_from_dataset and infer_from_dataset example * add inductive dim for data_reader, example: shape=[-1, 1], then -1 will be inducted through run-time reading of number of elements	6 years ago
chengduo	516317cf91	use sync copy (#17291 ) test=develop	6 years ago
Hongyu Liu	c3195de522	Fix concat shape check (#17247 ) * fix shape_check; test=develop * fix format; test=develop * fix format; test=develop * fix ddim bug; test=develop * fix c++ format; test=develop * change function name; test=develop	6 years ago
chengduo	04bd413acb	Code Clean: Move all pass to paddle::framework::ir (#17228 ) * move pass to ir * polish code test=develop * fix dependency test=develop	6 years ago
Zeng Jinle	4f8594088d	Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225 ) * add use_cuda to inplace pass,test=develop * add test softmax_with_xe_inplace test,test=develop * fix potential inplace bug test=develop * add more skip vars in mem opt pass,test=develop * follow comment,test=develop * follow comments,move duplicate out arg check to program->graph,test=develop	6 years ago
songhao	c2e20e2a29	fix build warning like 'comparison between signed and unsigned (#17240 ) integer', test=develop	6 years ago
石晓伟	a72dbe9abf	Cherry-pick benchmark related changes from release/1.4 (#17156 ) * cherry-pick commit from `8877054` * cherry-pick commit from `3f0b97d` * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn (cherry picked from commit `8643dbc233`) * Cherry-Pick from 16662 : Anakin subgraph cpu support (cherry picked from commit `7ad182e16c`) * Cherry-pick from 1662, 16797.. : add anakin int8 support (cherry picked from commit `e14ab180fe`) * Cherry-pick from 16813 : change singleton to graph RegistBlock test=release/1.4 (cherry picked from commit `4b9fa42307`) * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2 Support ShuffleNet and MobileNet-v2, test=release/1.4 (cherry picked from commit `a6fb066f90`) * Cherry-pick : anakin subgraph add opt config layout argument #16846 test=release/1.4 (cherry picked from commit `8121b3eccb`) * 1. add shuffle_channel_detect (cherry picked from commit `6efdea8997`) * update shuffle_channel op convert, test=release/1.4 (cherry picked from commit `e4726a066f`) * Modify symbol export rules test=develop	6 years ago
Zeng Jinle	ee2028a110	Add use_cuda to inplace pass (#17205 ) * add use_cuda to inplace pass,test=develop * add test softmax_with_xe_inplace test,test=develop	6 years ago
chengduo	950aec55fd	It doesn't need sync when fetch_list nit not empty (#17201 ) test=develop	6 years ago
tensor-tang	79ed1c76cd	fix bn fuse vardesc and add model saver (#17143 ) * fix bn fuse vardesc and add model saver test=develop * unify save model in test helper test=develop * fix mkdir on windows test=develop * remove magic number use bn bias var desc test=develop	6 years ago
Zeng Jinle	4e1bc6e805	Rewrite inplace pass and fix gc bug (#17126 ) * fix op graph view test=develop * rewrite inplace pass and fix reference count pass bug test=develop * fix unittest failed test=develop * follow comments, test=develop	6 years ago
chengduo	794a195881	fix fuse optimizer ops (#17102 ) test=develop	6 years ago
Tao Luo	aca60e9a20	remove unnecessary prepare_data (#17080 ) test=develop	6 years ago
Zeng Jinle	842ded14b0	fix reference_count_pass,test=develop (#17060 ) test=develop	6 years ago
Tao Luo	d9cd989825	Merge pull request #17048 from luotao1/fix_runtime_cache_bug fix runtime_context_cache bug when gpu model has an op runs only on cpu	6 years ago
chengduo	cc31681687	use fast executor as default (#17044 ) test=develop	6 years ago
chengduo	a2be4b4d91	Add fuse momenutum ops (#16745 ) * Add fuse momenutum ops	6 years ago
luotao1	490e746269	fix runtime_context_cache bug when gpu model has an op runs only on cpu test=develop	6 years ago
wopeizl	51a0243a56	fix nccl wrapper on windows test=develop	6 years ago
Zeng Jinle	1202d3fc74	Refine model gpu memory (#16993 ) * speedup gc and inplace softmax_with_cross_entropy_grad test=develop * refine models gpu mem Merge skip vars and warning messages of mem opt remove relu mem opt test=develop * follow comments test=develop	6 years ago
Yibing Liu	3c375751f8	Support seq len equal to 0 in sequence ops (#16935 ) * Support seq len equal to 0 in sequence ops test=develop * Add more test cases * Fix some comments test=develop * Fix py3 error test=develop	6 years ago
jiaqi	8bcba3db84	Merge pull request #16896 from xjqbest/develop fix bug of num > INT_MAX	6 years ago
guru4elephant	bbc6c5714f	Merge pull request #16887 from guru4elephant/add_nccl_context_pybind Add nccl context pybind	6 years ago
gongweibao	cbdb8a17b1	Polish DGC code (#16818 )	6 years ago
dongdaxiang	2ab2869c2d	fix GPU compile error problem	6 years ago
dongdaxiang	466d177d09	add pybind dependency test=develop	6 years ago
xjqbest	10991e00a9	fix bug of num > INT_MAX	6 years ago
xjqbest	241120d94d	fix bug of num > INT_MAX	6 years ago
xjqbest	dac70ad4c5	fix bug of num > INT_MAX	6 years ago
xjqbest	74471397cf	fix bug of num > INT_MAX	6 years ago
dongdaxiang	b091139049	add nccl wrapper for python API	6 years ago
dongdaxiang	fff795e5c8	add nccl_wrapper	6 years ago
乔龙飞 Qiao Longfei	82cff5ec42	Merge pull request #16762 from jacquesqiao/add-async_sparse_param_update_recorder Add async sparse param update recorder	6 years ago
Yibing Liu	4267a81afc	Correct the lod level of compiled time in lod_reset (#16790 ) test=develop	6 years ago
chengduo	e9409665f7	Refine Fuse Optimize Ops (#16810 ) * fix bug of fuse optimize ops	6 years ago
chengduo	d105c06b50	Replace ThreadedExecutor with FastThreadedExecutor (#16650 ) * replace ThreadedExecutor with FastThreadedExecutor test=develop * Fix Travise CI test=develop * Test FastThreadedSSAGraphExecutor test=develop * refine parallel_ssa_graph_executor.cc test=develop	6 years ago
Qiao Longfei	1526a3e4da	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder test=develop	6 years ago
Yihua Xu	93cedfdb9c	Fix the order while sorting the operators (#16756 ) * Fix the order when sorting operators. test=develop * Enable transfomer compare test item. test=develop * Use set to replace vector. test=develop	6 years ago
Qiao Longfei	afc56949c1	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async_sparse_param_update_recorder	6 years ago
liuwei1031	85363848a1	Security issue (#16774 ) * disable memory_optimize and inpalce strategy by default, test=develop * fix security issue http://newicafe.baidu.com:80/issue/PaddleSec-3/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-8/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-12/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-32/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-35/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-37/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-40/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-43/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-44/show?from=page http://newicafe.baidu.com:80/issue/PaddleSec-45/show?from=page test=develop * revert piece.cc, test=develop * adjust api.cc,test=develop	6 years ago
guru4elephant	aa46caf3d9	Merge pull request #16765 from guru4elephant/gpu_dataset_train add gpu training for Executor.train_from_dataset	6 years ago

1 2 3 4 5 ...

2424 Commits (712bfb17cb6d10658d804748c84b67badf47f0d7)