Paddle

Commit Graph

Author	SHA1	Message	Date
hong	bfb60efb1f	change compare forece_cpu default value; test=develop (#23888 )	5 years ago
wangchaochaohu	2270864019	Fusion group optimize for cuda codegen(#23940 )	5 years ago
ShenLiang	94dfb7d770	opt the postprocess, test=develop (#24155 )	5 years ago
Chengmo	28a558e867	Update index sample (#24109 ) * update index sample	5 years ago
Zhang Ting	ab8f8fa70d	fix example code, test=develop, test=document_fix (#24139 )	5 years ago
Feiyu Chan	e01262e691	add kron op (#24105 ) * add kron op and its python API, doc and unittests. * add kron in paddle.complex	5 years ago
Jacek Czaja	eb411613e9	[DNNL] refine activations Inplace support (#24145 )	5 years ago
wangchaochaohu	6bf26ef156	fix warning mac compiler (#24138 )	5 years ago
lijianshe02	a398464e77	API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … (#24118 ) * API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, log_loss) error message enhancement test=develop	5 years ago
Guo Sheng	a8c0fb4e86	Add cholesky_op (#23543 ) * Add cholesky_op forward part. test=develop * Complete cholesky_op forward part. test=develop * Add cholesky_op backward part. test=develop * Complete cholesky_op backward part. test=develop * Refine cholesky_op error check and docs. test=develop * Add grad_check unit test for cholesky_op. test=develop * Fix sample code in cholesky doc. test=develop * Refine some error messages of cholesky_op. test=develop * Refine some error messages of cholesky_op. test=develop * Remove unused input in cholesky_grad. test=develop * Remove unused input in cholesky_grad. test=develop * Fix stream for cusolverDnSetStream. test=develop * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code. test=develop * Add CUSOLVER ERROR in enforce.h test=develop * Fix the missing return value in cholesky. test=develop	5 years ago
Jacek Czaja	461e6a01ec	[DNNL] activations Inplace support (#24123 )	5 years ago
wangchaochaohu	6ba7c3ac92	Reduce the construction time of fuction about profiler (#24117 )	5 years ago
Zhang Ting	fb0729ee7f	avoid warnings in MAC compile (#24124 )	5 years ago
qingqing01	b059fb955d	Add trainable_statistics in attr for batch_norm. (#24072 ) * Add trainable_statistics in attr for batch_norm * Unifying behavior of dynamic graph and static graph	5 years ago
hutuxian	aa0330f451	Add some print log before running UT (#24071 )	5 years ago
Tao Luo	34b4153e97	show statistics summary of ccache (#24125 ) test=develop	5 years ago
arlesniak	d31a174f51	added fusing matmul-transpose-reshape pass (#23866 )	5 years ago
石晓伟	46f3139c7f	supports loading model from memory, test=develop (#24098 )	5 years ago
xiaoting	96ffebef55	fix bicubic, change int to floor (#24063 ) * change int to floor, test=develop * fix unittest, test=develop	5 years ago
Zeng Jinle	a67eea9f00	polish code by adding final, test=develop, test=develop (#24114 )	5 years ago
Zeng Jinle	ab2e284235	fix compilation failure (#24091 )	5 years ago
石晓伟	34d7d6aef0	declare the stream::Priority as enum class, test=develop (#24013 )	5 years ago
Pei Yang	695a53c874	remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop (#23805 )	5 years ago
0YuanZhang0	a164b10d05	API/OP error message enhancement (#23717 ) * test=develop	5 years ago
Zeng Jinle	acef55df04	fix isolated var fetch bug, test=develop (#24070 )	5 years ago
Zeng Jinle	3ca700a987	code clean, test=develop (#24060 )	5 years ago
Zhaolong Xing	35148d17f7	[BUG]: Head number can only be > 1 on multihead op (#23974 ) * support the head number == 1 test=develop * fix slice op error. test=develop	5 years ago
Jacek Czaja	c6c65c65c7	[DNNL] Added elementwise_add mkl-dnn inplace (#23477 )	5 years ago
hutuxian	9ff558a46f	Optimize DataFeed (#23957 ) * Make batch_float_feasigns & batch_uint64_feasigns as member variable	5 years ago
Leo Chen	5cccc69f1a	update name generator, test=develop (#24048 ) * update name generator, test=develop * use c++ unique name generator, test=develop	5 years ago
zhangchunle	735e9ccce0	modify paddle_build.sh (#24062 )	5 years ago
tianshuo78520a	109ffffbb2	Update code format (#24000 )	5 years ago
石晓伟	db6d867383	add boost dependency to cuda_stream (#24032 )	5 years ago
wawltor	5c669ad1c2	Add the support dygraph out attribute for the op of mm in api2.0 (#23978 ) Fix the dygraph mode in matmul, add the support in Linear Op	5 years ago
石晓伟	d2584a7082	New feature: thread local allocator, test=develop (#23989 ) * add the thread_local_allocator, test=develop * refactor the thread_local_allocator, test=develop * provides option setting strategy, test=develop	5 years ago
Kaipeng Deng	80cf3c3c4d	Refine DataLoader support multi-processing (#23107 ) * add DataLoader, Dataset, BatchSampler	5 years ago
Zhou Wei	76d78c6387	fix conv_fusion_op conflict,test=develop (#24020 )	5 years ago
wangguanzhong	931cba2e64	add clamp api, test=develop (#23273 ) * add clamp api, test=develop	5 years ago
zhaoyuchen2018	a28a63a943	OP(fusion_gru) error message enhancement. test=develop (#23591 ) * OP(fusion_gru) error message enhancement. test=develop * refine code, test=develop * Refine inout log, test=develop * Refine description, test=develop	5 years ago
Zhou Wei	7817003795	Optimize the error messages of paddle CUDA API (#23816 ) * Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop	5 years ago
silingtong123	f6dbf8e3a4	add 'seed' arguemnt of randint API (#23809 ) * test=develop, add seed arguemnt of randint API	5 years ago
Li Fuchen	87d8dc3dc0	add diag_embed op (#23385 ) * add diag_embed op, test=develop * add TestCase of diag_embed API * Modified diag embed python API teastcase from dygraph to static graph, test=develop * delete useless log and trigger ci, test=develop * modified float16 of diag_embed, test=develop * modified en doc of diag_embed * trigger ci, test=develop * add fp16 in dtype check of python API, test=develop * modified __init__ and fix a big, test=develop * modified a test bug of test_bicubic_interp_op and test_trilinear_interp_op, test=develop * modified to use one kernel on cpu and cuda, test=develop	5 years ago
liu zhengxi	8e555ba650	OP(pad, pad2d, pad_constant_like) error message enhancement (#23882 ) * enhance pad.* error message, test=develop	5 years ago
tianshuo78520a	19235e8525	add cpu ci job check_style (#23617 )	5 years ago
zhangchunle	6bd200db66	remove high level api (#23854 )	5 years ago
ShenLiang	30bd7e1c83	Add rank_attention_op attributes for GPU memory in contrib (#23915 ) * optimize rank_attention, test=develop * use the paddle memory pool, test=develop * set max size, test=develop * limit the max size, test=develop * fix the head of cu, test=develop * add AsDispensable, test=develop	5 years ago
Yiqun Liu	8d0b0cb4ae	Op(conv2d_fusion) error message enhancement. (#23596 )	5 years ago
ShenLiang	7f0b2c7407	fix memory leaking problem of dataset, test=develop (#23955 )	5 years ago
mapingshuo	7d4002e06a	restrict block num of layer_norm_grad cuda block to 128 (#23878 ) restrict block num of layer_norm_grad cuda kernel to 128, test=develop	5 years ago
Zhaolong Xing	133f1fc123	[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723 ) * refine ernie trt dynamic shape support 1. add slice op converter 2. add emb eltwise layernorm fp16 support test=develop * fix dynamic shape test ut test=develop * fix comments. test=develop * fix comments test=develop	5 years ago

1 2 3 4 5 ...

16926 Commits (25a233e46d42f6cb76220d58d89f882723a7a4fc)