Paddle

Commit Graph

Author	SHA1	Message	Date
tensor-tang	742300baa8	fix unkown omp pragmas	7 years ago
tensor-tang	b9dbb7c5cb	fix bias attri in mkldnn fc	7 years ago
tangwei12	59580a7f69	bug fix	7 years ago
tensor-tang	4b5986bb77	enable fc op in normal case	7 years ago
tensor-tang	e133df6037	enable native fc forward	7 years ago
tensor-tang	6a2a9a8350	Revert "Refine elementwise_add op"	7 years ago
Yu Yang	8dda526a45	Merge pull request #12659 from sneaxiy/refine_softmax_with_cross_entropy Fix 'softmax_with_cross_entropy_op' dependency error	7 years ago
sneaxiy	f6f5cdaa05	Merge pull request #12555 from sneaxiy/refine_layer_norm Refine layer_norm op	7 years ago
sneaxiy	c50c537732	fix arithmetic error in backward kernel	7 years ago
tensor-tang	038cbf799d	add bias for fc op	7 years ago
whs	9d6243b6fb	Fix crop op. (#12603 ) * Fix infer shape of crop op. * Speed crop op.	7 years ago
Bai Yifan	649f5d74f0	fix mine_hard_example bug (#12664 )	7 years ago
sneaxiy	2d9508f8f3	Merge pull request #12554 from sneaxiy/refine_elementwise_add Refine elementwise_add op	7 years ago
tensor-tang	171a0e2b42	add some comment	7 years ago
sneaxiy	2c560623d1	fix dependency error	7 years ago
tensor-tang	5377edd282	refine packed condition	7 years ago
tensor-tang	3bf3e77ac8	Merge remote-tracking branch 'ups/develop' into refine/op/gru	7 years ago
qiaolongfei	c0890988da	add RPCServerProfiler, replace listen and serv optimizer	7 years ago
tangwei12	64a4925cb4	Merge branch 'Pdv' into samplingIdOp	7 years ago
tangwei12	0bfd62be3d	remove gpu supported, will add it later	7 years ago
Tao Luo	5a9ae411e0	Merge pull request #12618 from sfraczek/sfraczek/fix-new-mkldnn-conv-tests fix UT for mkldnn 0.15	7 years ago
sneaxiy	cf799a6a04	Merge pull request #12553 from sneaxiy/refine_softmax_with_cross_entropy Refine softmax_with_cross_entropy op	7 years ago
dzhwinter	8499559c42	"fix style" (#12600 )	7 years ago
sneaxiy	010883689c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_layer_norm	7 years ago
sneaxiy	5d698589ce	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_elementwise_add	7 years ago
sneaxiy	19ff254d05	Merge branch 'refine_elementwise_add' of https://github.com/sneaxiy/Paddle into refine_elementwise_add	7 years ago
Sylwester Fraczek	d74bb6ab9c	fix ut for mkldnn 0.15 - added forcing layout NCHW in mkldnn conv tests	7 years ago
fengjiayi	855c9e3311	clean softmax_op code	7 years ago
fengjiayi	24d51de022	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support	7 years ago
fengjiayi	27df3a9f2b	make cross_entropy_op supporting tensors	7 years ago
fengjiayi	66be53264e	Merge pull request #12592 from JiayiFeng/fix_mac_compile_error fix mac compile error	7 years ago
fengjiayi	8e604a10aa	fix mac compile error	7 years ago
nhzlx	551c802cdc	merge develop	7 years ago
sneaxiy	ad45d39222	refine layer_norm	7 years ago
chengduo	7c8b69c700	Feature/op fusion (#12240 ) * Add Preface * Add demo code * Save file * Refine code * seems can work * use elementwise strategy * Use ElementwiseComputeEx * Add comments * extract functions from operator * Refine code * Follow comment * code refine * follow comments * follow comments	7 years ago
sneaxiy	1b4515f6db	refine softmax_with_cross_entropy	7 years ago
nhzlx	3a0caf801f	modify trt engine op test	7 years ago
nhzlx	e51d045a6d	modify trt engine op test	7 years ago
nhzlx	e8954a36f5	merge develop	7 years ago
nhzlx	32a9e050bc	mapping the variable name inside the subgraph	7 years ago
Wu Yi	2d036c47cd	polish dist unit test code (#12512 ) * polish dist se resnext ut * update * update * update * avoid cpu initializer differ * change to use executor for now * update by comment * remove lr decay use para exe, should fix para exe bug later * update by comment	7 years ago
fengjiayi	7834b4a470	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support	7 years ago
tangwei12	5bfdefae91	Merge branch 'Pdv' into samplingIdOp	7 years ago
tangwei12	b30bdde15a	random optimize	7 years ago
tangwei12	9c63fef63c	random optimize	7 years ago
Qiao Longfei	88a607c342	Merge pull request #12541 from jacquesqiao/optimize-profiler optimize profiler	7 years ago
tangwei12	5b9716d1f6	add dims check	7 years ago
tangwei12	4cd504d3b4	bug fix	7 years ago
sneaxiy	e57bc4d745	Merge branch 'refine_elementwise_add' of https://github.com/sneaxiy/Paddle into refine_elementwise_add	7 years ago
sneaxiy	222fbbedfb	Merge branch 'develop' into refine_elementwise_add	7 years ago
sneaxiy	4b83afff6e	Merge branch 'develop' into refine_elementwise_add	7 years ago
sneaxiy	b2d0ee5159	refine elementwise_add op	7 years ago
tangwei12	da2cc99f67	sampling op optimize	7 years ago
fengjiayi	7c55e08c93	stash	7 years ago
tangwei12	4973e07be3	sampling op optimize	7 years ago
tensor-tang	836068569f	Merge remote-tracking branch 'ups/develop' into refine/op/gru	7 years ago
tensor-tang	18c322c2a1	seperate cpu and gpu implementations for gru kernel compute	7 years ago
tensor-tang	54c95e49f0	fix blas	7 years ago
fengjiayi	b656d97e86	Merge pull request #12485 from JiayiFeng/dev_ops_tensor_support Make lookup_table_op and softmax_op supporting high rank tensor	7 years ago
qiaolongfei	1623f1ba4f	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler	7 years ago
tangwei12	3206970b77	sampling op rename	7 years ago
Xin Pan	99a77cfc62	Merge pull request #12468 from panyx0718/improve_profiler2 Improve profiler	7 years ago
qiaolongfei	a3f9d6a38c	optimize profiler	7 years ago
tangwei12	e0ab2f7158	new sampling op	7 years ago
tensor-tang	8c23f7c4f0	fix blas and use packed weight	7 years ago
tensor-tang	d9cc6b1866	replace gru compute with details	7 years ago
tensor-tang	43cee33a23	add mkl packed gemm	7 years ago
tangwei12	766ac488ac	sum_op selectedRows dim bug fix	7 years ago
dzhwinter	595a2c83ae	explicit gradient of elementwise_add/elementwise_sub (#11970 ) * "add gradient register" * "make some enhance" * "better format" * "fix typo" * "fix reuse" * "fix get expected kernel" * "change the mkldnn code" * "fix mkldnn" * "fix mkldnn failed test" * "add comment"	7 years ago
fengjiayi	e7d8e16a66	update softmax_mkldnn_op	7 years ago
Yu Yang	2567afa35d	Merge pull request #12462 from reyoung/feature/fix_cudnn_deterministic Fix bug in cudnn_determistic	7 years ago
fengjiayi	dc111d3476	update softmax_cudnn_op	7 years ago
fengjiayi	f7bd0b227b	Add unittests for softmax_op	7 years ago
gongweibao	819ac3df0a	Modify style (#12465 )	7 years ago
fengjiayi	b314a69523	make softmax supporting tensors	7 years ago
fengjiayi	b1af7e5d9b	Add unittests for lookup_table_op	7 years ago
tangwei12	c4c8f60bec	sum_op selectedRows dim bug fix	7 years ago
Xin Pan	486345551d	clean	7 years ago
Xin Pan	caf10b474f	make profiler use thread_id from g_thread_id Add a few more RecordEvent. Cleanup	7 years ago
Yu Yang	040fc1c39b	Fix bug in cudnn_determistic * Introduced by #11205	7 years ago
fengjiayi	7efdf05ac2	make look_up_op supporting tensor ids	7 years ago
Qiao Longfei	690625fe15	Merge pull request #12456 from jacquesqiao/add-profiler-to-pserver Add profiler to pserver	7 years ago
qiaolongfei	7e46a8d172	fix logical bug, optimize code	7 years ago
qiaolongfei	0b62f61d29	add init flag in __init__.py for listen_and_serv_profile_period	7 years ago
dzhwinter	91fb0156ca	Memory/reshape op (#12414 ) * "remove inplace in single op" * "fix ci" * "add transpiler case" * fix conflict * "fix reshape" * "delete reshape inplace attr" * "follo the comments" * "rerun ci"	7 years ago
qiaolongfei	0b861bbca9	add profiler for listen_and_serv op	7 years ago
tensor-tang	059b27840c	Merge pull request #12408 from tensor-tang/refine/im2col Refine CPU im2col padding with 1	7 years ago
qiaolongfei	147bf00ffe	clear mutable rows for the output of split_ids_op	7 years ago
qiaolongfei	91b114a787	change map to unordered_map	7 years ago
tensor-tang	d8d2dbcfac	further optimize im2col using variables	7 years ago
qiaolongfei	91f63cd401	fix split_ids_op and add unit test	7 years ago
tensor-tang	5373fe29c2	Merge remote-tracking branch 'ups/develop' into refine/im2col	7 years ago
Qiyang Min	7da453630e	Merge pull request #12403 from velconia/fix_hang_up Fix grpc destroy bug	7 years ago
Tao Luo	5a634786af	Merge pull request #12312 from luotao1/unify unify libpaddle_inference_api and libpaddle_fluid	7 years ago
Bai Yifan	e12b1d1792	Add flatten op (#12341 ) * add flatten op	7 years ago
Luo Tao	062556f938	Merge branch 'develop' into unify	7 years ago
chengduo	2409d0f710	Refine regularization for selected_rows (#12369 ) * refine regularization for selected_rows * clean lookup_table * refine rpc_server_test * temporally disable rpc_server_test * fix rpc_server_test * add unit test	7 years ago
tensor-tang	687a322267	Merge remote-tracking branch 'ups/develop' into refine/im2col	7 years ago
tensor-tang	65d418f060	complete im2col with padding==1 and speedup filter width==1	7 years ago
minqiyang	053540e199	Add volatile to stopped_ member	7 years ago
minqiyang	b78ffde6d5	Add stopped sign for grpc client	7 years ago
tensor-tang	52eb86e30f	refine im2col benchmark	7 years ago
tensor-tang	3017f46076	add more test cases	7 years ago
tensor-tang	8d6be4fb5f	refine im2col test and add benchmark	7 years ago
tensor-tang	507c143047	im2col cfo cpu code clean	7 years ago
tensor-tang	4eeed0b5e4	refine width padding and enable core copy	7 years ago
Wu Yi	73fcfc06ec	refine conv cudnn enforce (#12353 ) * refine conv cudnn enforce * update * update all cudnn ops * fix	7 years ago
tensor-tang	e3131e2d73	enable width padding	7 years ago
Xin Pan	d7e08c53c2	Merge pull request #12169 from panyx0718/ir_graph_sort construct a SSAGraph at the beginning.	7 years ago
tensor-tang	92518c519f	reuse sizes saving time	7 years ago
tensor-tang	660df122ce	enable padding!=0 and fill height padding with 0	7 years ago
tensor-tang	d8e00facf7	reuse im_size	7 years ago
tensor-tang	179dd0cb8a	Merge pull request #12337 from tensor-tang/refine/im2col refine cpu im2col no padding	7 years ago
Luo Tao	5ba4337698	unify libpaddle_inference_api into libpaddle_fluid	7 years ago
tensor-tang	b72befc5cc	reuse copy size	7 years ago
Yancey	6133efd9ed	Merge pull request #12218 from Yancey1989/rpc_complete_interface Add rpc complete interface	7 years ago
Zhaolong Xing	6169d724b9	Merge pull request #12324 from NHZlX/enhance_for_tensorrt_infer Enhance for tensorrt infer	7 years ago
nhzlx	4d49e61ab8	fix comments	7 years ago
tensor-tang	6788af4bf1	refine test cases	7 years ago
tensor-tang	b163e601b6	add gtest	7 years ago
nhzlx	bcd67bdd71	add assert for GetOutput	7 years ago
tensor-tang	aae994fd26	refine im2col no padding	7 years ago
Yancey1989	fb06ed7bdc	Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface	7 years ago
Yu Yang	21387e3c2a	Tiny refines for lod_tensor_blocking_queue and reshape_op	7 years ago
nhzlx	f42ea48996	deal with conflict	7 years ago
nhzlx	940f5dbcac	modify the tensorrt engine op to adapt to chage	7 years ago
Yan Chunwei	02cf54d331	bugfix lod cpu performance (#12297 )	7 years ago
Qiao Longfei	b41f8b9d42	Merge pull request #12295 from jacquesqiao/speedup-reduce-sum-grad-op Speedup reduce sum grad op	7 years ago
fengjiayi	eec412b230	Merge pull request #12273 from JiayiFeng/update_py_reader Some enhancement on readers	7 years ago
Xin Pan	21a45420f0	polish and test	7 years ago
Qiao Longfei	95a2b5f56a	fix mac build of sendrecvop_utils (#12272 )	7 years ago
qiaolongfei	273f737517	optimize code	7 years ago
Xin Pan	93355cc0d2	fix control deps	7 years ago
fengjiayi	ea8a375fa4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_py_reader	7 years ago
qiaolongfei	5d718a5886	optimize reduce_sum_grad op	7 years ago
Yancey1989	d4f51218ef	Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface	7 years ago
qiaolongfei	b643473d31	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-mac-build	7 years ago
fengjiayi	060f421797	Some enhancement on readers 1. Make the feeding thread of py_reader a daemon thread. 2. Update buffer_reader's destructor, fixing a bug. 3. Make pyreader demo script supporting CPU environment.	7 years ago
qingqing01	873a50ce35	Fix serious bug in nesterov momentum optimizer. (#12231 ) * Fix serious bug in nesterov momentum optimizer.	7 years ago
Yan Chunwei	b42ced8eda	bugfix/tensorrt analysis fix subgraph trigger (#12266 )	7 years ago
qiaolongfei	938390b38d	fix mac build of sendrecvop_utils	7 years ago
gongweibao	3a6213f493	Change grpc interface to compatible with brpc. (#12164 )	7 years ago
Yu Yang	b06309381b	Merge pull request #12149 from reyoung/feature/combine_open_files_and_double_buffer Change and polish readers	7 years ago
tensor-tang	be04fbff42	Merge pull request #12233 from tensor-tang/refine/mkl/gemm add option split mkl gemm	7 years ago
Qiao Longfei	2b58c62aa0	Update auc op (#12199 ) fix AUC op optimize it's test	7 years ago
Yancey1989	efd5a84986	update executor interface	7 years ago
tensor-tang	fc2b578842	add gemm_warp test	7 years ago
tensor-tang	a916c52579	refine gemm	7 years ago
tensor-tang	961e754c9f	mkl split gemm for better perf	7 years ago
Yancey1989	ade6675490	Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface	7 years ago

1 2 3 4 5 ...

1776 Commits (557be6fc58a8fad13a830df33ec77560faaa3d7c)