Paddle

Commit Graph

Author	SHA1	Message	Date
littletomatodonkey	1c08a2136e	test=develop, add addmm op (#23384 ) add addmm op	5 years ago
zhaoyuchen2018	7b5e23c034	OP(fusion_gru) error message enhancement. test=develop (#23599 ) C++ OP enhancement.	5 years ago
Chengmo	8c0bdde934	Add Tdm sampler op in Contrib (#23290 ) * add tdm sampler op * fix compile bug * fix compile bug * fix compile bug * fix compile bug * test=develop, add tdm sampler unittest * fix tdm sampler unittest * fix input var name bug * update tdm sampler unittest * fix unittest * fix unittest * update tdm sampler unittest * add tdm exe run unittest * fix infershape * test=develop, add doc * test=develop, fix gcc8 compile bug & unittest bug * test=develop, fix unittest * test=develop, fix T one & zero * test=develop, add unittest check * test=develop, add doc smaple code & fix dype set * test=develop, fix dtype * test=develop, fix compile bug * test=develop, fix unittest * test=develop, fix unittest * test=develop, check py3 unittest * test=develop,fix unittest * test=develop, fix py3&py2 unittest diff * test=develop, fix sample code * test=develop, fix sample code * test=develop, fix sample code * test=develop, fix error message	5 years ago
GaoWei8	517929f148	Op (reorder_lod_tensor_by_rank) error message enhancement (#23552 )	5 years ago
Wilber	1ac9db4354	error message enhancement for fusion_seqpool_concat_op. test=develop (#23563 ) error message enhancement for fusion_seqpool_concat_op	5 years ago
Wilber	286c2e0ede	error message enhancement for py_func op. (#23565 ) error message enhancement for py_func op.	5 years ago
zhongpu	37fcf03af7	Op (Save/Load) error message enhancement, test=develop (#23650 )	5 years ago
Zhen Wang	84cd45f674	Solve the conflict of ops with the same name, test for CI. (#23573 ) * solve the conflict of ops with the same name. test=develop	5 years ago
wangguanzhong	c2f5a3ad34	enhance the error message of roi_align, test=develop (#23649 )	5 years ago
silingtong123	cec234b1aa	test=develop, error message of tree_conv OP enhancement (#23574 )	5 years ago
Kaipeng Deng	b465bb0de7	fix adaptive_pool2d/pool3d error message. test=develop (#23658 )	5 years ago
Zhaolong Xing	f345607115	Refine transpose flatten concat error message (#23625 ) * refine fusion_transpose_flatten_concat_op log test=develop * fix ci error test=develop	5 years ago
Zeng Jinle	7f3e0eaad1	refine error msg, test=develop (#23589 )	5 years ago
zhaoyuchen2018	42d67dacb6	OP(minus) error message enhancement. test=develop (#23621 ) C++ error message enhancement.	5 years ago
Huihuang Zheng	a82ce2b1bb	API/OP (ConditionalBlock) error message enhancement (#23480 ) API/OP (ConditionalBlock) error message enhancement (#23480)	5 years ago
Yiqun Liu	4489f0d304	Op(fetch) error message enhancement. (#23542 )	5 years ago
Zhen Wang	2cf27260ae	OP(fake_quantize) error message enhancement (#23550 ) * improve error messages of fake_quantize op. test=develop * update the bit_length error info. test=develop	5 years ago
Zhen Wang	1cf64e00fc	improve error messages of fake_dequantize_op. test=develop (#23556 )	5 years ago
mozga-intel	3baaee9aab	Remove: NGraph engine from PDPD repository (#23545 ) * Remove the NGraph engine from PDPD repository 1. Each operator was removed from the operator's directory 2. Each test was removed from the unittest directory 3. The parallel executor support was removed from the PDPD 4. The CMake file was removed from the PDPD 5. The NG flags were removed from the repository test=develop * Remove ngraph from: 1. Cmake file 2. Python file test=develop	5 years ago
wangchaochaohu	81e8fd4a3e	API(fluid.layers.array_length) error message enhancement (#23547 )	5 years ago
wangguanzhong	6bb8206d03	enhance the error message of box_clip, test=develop (#23638 )	5 years ago
liym27	8987946fe2	Api/Op (select_input/select_ouput) error message enhancement. (#23445 )	5 years ago
Wilber	5f22478a93	error message enhancement for repeated fc. test=develop (#23562 ) error message enhancement for repeated fc	5 years ago
Wilber	a5bdf485d5	fill op error message enhancement. test=develop (#23560 ) fill op error message enhancement	5 years ago
GaoWei8	2c4b57e94b	Op (concat) error message enhancement (#23523 )	5 years ago
GaoWei8	66cae9157e	Op (lod_reset) error message enhancement (#23499 )	5 years ago
liym27	dc225ed2fc	OP (tensor_array_read_write) error message enhancement. test=develop (#23468 )	5 years ago
GaoWei8	c068512f34	Implement a new C++ operator where and API tensor.where (#23220 )	5 years ago
Yiqun Liu	55d0c8fde7	Enhance the error message of feed_op. (#23526 )	5 years ago
Huihuang Zheng	71b5f1d2b2	OP (recurrent) error message enhancement (#23481 ) * OP (recurrent) error message enhancement	5 years ago
Adam	62aff0a7ac	Add DNNL GELU kernels (#22426 )	5 years ago
silingtong123	009c049e82	Add randint op API (#23337 ) * add randint op	5 years ago
qingqing01	6162cf2f2e	Make optimizer consistent in dygraph and static-graph and remove some LOG-INFO. (#23426 ) * Make optimizer consistent in dygraph and static-graph and remove some LOG-INFO	5 years ago
wangchaochaohu	29c4fae112	Tensor value support (#23491 ) * add support for value tensor support of fill_constant Op	5 years ago
Chengmo	426912df5a	Add Index sample OP (#23218 ) * add index_sample op	5 years ago
zhangchunle	638d924d89	Op (FusionSquaredMatSub) error message enhancement. (#23498 )	5 years ago
ShenLiang	c706ff20a3	fix conflict, test=develop (#23298 )	5 years ago
Chen Weihang	75bd350710	Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171 ) * static model runner basic implement, test=develop * add run program op to execute loaded program, test=develop * refactor static model runner & run program op, test=develop * reset engine.cc to resolve conflict * adapt the change of dygraph double grad, test=develop * refactor impl to solve control flow error, test=develop * clear debug code, test=develop * fix ci str compatible error & checkout dygraph grad maker & add example, test=develop * hide api & add op test, test=develop * fix run program op test places error, test=develop * fix program by review comment, test=develop * delete change var desc name, test=develop * fix other program by review comment, test=develop * remove _static_graph_guard, test=develop * add selectedrows test, test=develop * remove desc parser, test=develop * fix detail program, test=develop * change socpe create & add test, test=develop	5 years ago
cc	9297f49e4b	[OP] Add randperm op (#23292 )	5 years ago
Kaipeng Deng	d223a24904	Fix inplace_abn compile error on Windows (#23464 ) * fix inplace_abn windows compile error. test=develop	5 years ago
Tao Luo	0b583235f5	Revert "Solve the conflict of ops with the same name. (#23199 )" (#23494 ) This reverts commit `abe3e6906d`. test=develop	5 years ago
wawltor	6577f91b74	Add the sum op to API 2.0， add some parameters for new api * Add the sum op to API 2.0, test=develop * Fix the import meesage in common_ops_import	5 years ago
WuHaobo	c4d0305239	add tril op and triu op (#23469 ) add tril op and triu op	5 years ago
yongqiangma	eb035f24d1	add unbind op (#23359 ) * add unbind op unbind(tensor, dim=0): Description: removes the specified dimension and returns a list of arrays containing the slices taken along that dimension. tensor(Tensor) -- input Tensor dim(int) -- the dimension to remove. Example: Input = [[1,2], [3,4], [5,6]] axis = 0 Output[0] = [1,2] Output[1] = [3,4] Output[2] = [5,6]	5 years ago
zhangchunle	fd9b7bdb3d	Op (FusedEmbeddingSeqPool) error message enhancement. (#23454 )	5 years ago
Chen Weihang	16315d3d9e	Delete Ref & VectorRef and add GetDataSafely (#22997 ) * delete invalid check inferface Ref & VectorRef, test=develop * fix vector ref delete error, test=develop * try the new check inferface, test=develop * change all related code with new check macro, test=develop * remove static assert, test=develop * polish detail, test=develop * skip coverage problem, test=develop * add new check macro, test=develop	5 years ago
Zhen Wang	abe3e6906d	Solve the conflict of ops with the same name. (#23199 ) * solve the conflict of ops with the same name. test=develop	5 years ago
wawltor	0b092d05f1	Add the argmax op to API 2.0， and update some parameters * Add the argmax op to API 2.0, test=develop * Fix the compiler problem in arg_max op, test=develop * Fix the import meesage in common_ops_import, test=develop * Fix the default dtype of arg_min_max, test=develop	5 years ago
wawltor	915341e3de	Add the zeros, ones, ones_like, zeros_like for api 2.0, test=develop (#23471 ) Update the new api ops of creation ops to the api 2.0	5 years ago
Zhen Wang	56b50c97f8	Add allclose_op (#23335 ) * Add allclose Op, and its function is analogous to numpy.allclose. It returns True if two tensors are elementwise equal within a tolerance.	5 years ago
kinghuin	948c57d84b	move sin, sqrt, tanh, atan to paddle.tensor.math and add a new parameter "out" (#23387 ) * sin sqrt tanh atan add out, test=develop * optimize doc, test=develop * add dygraph test, test=develop	5 years ago
Chengmo	a2e9af5663	Add Tdm child OP in contrib (#23241 ) * add tdm child op	5 years ago
Wilber	9676ac1c5c	Add flip op. (#23255 ) * add flip op	5 years ago
Feiyu Chan	81f1402f6c	Add functional convolutions in paddle.nn.functional (#23408 ) * add functional conv * add test and doc for function convs, test=develop * update ConvTransposeOp's InferShape and error message, test=develop	5 years ago
Feiyu Chan	bcafe3179a	add MKL computation back to gelu's non-approximate part (#23420 )	5 years ago
zhongpu	dbfbd7eac4	support Exhaustive search in dygraph (#23415 ) * use global conv cache; test=develop * use singleton cache; test=develop * fix format error; test=develop * add cudnn helper header; test=develop * fix header error; test=develop * fix mac unitest; test=develop * fix mac unitest; test=develop * fix file format; test=develop * fix include file error, test=develop * remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop * fix test_elementwise_mul_op_dim, test=develop * fix compile error, test=develop Co-authored-by: phlrain <phliuhongyu@126.com>	5 years ago
zhaoyuchen2018	01d7ccd4b6	Fix elementwise compile error, test=develop (#23381 ) elementwise function used before definition then failed in cuda 8, move it ahead.	5 years ago
channings	a2e10930cf	update linspace, equal operators to API 2.0 (#23274 ) * update linspace, equal operators to API 2.0, test=develop * equal support higher performance CUDA kernel, test=develop * update comment of equal&linspace operator, test=develop * update comment of equal&linspace operator, test=develop	5 years ago
zhaoyuchen2018	4fe9ca6959	improve elementwise performance. (#23405 ) * improve elementwise performance. * Add contiguous check, test=develop	5 years ago
Chen Weihang	7f1ad510bd	Add op inout check macro to simplify error message writing (#23430 ) * add op inout check macro, test=develop * fix enforce_test, test=develop	5 years ago
zhongpu	bfb07aafe8	Revert "Exhaustive search (#22821 )", test=develop (#23401 ) This reverts commit `48144e4099`.	5 years ago
liym27	b7b0b3595b	Add unittest for transformer prediction in dygraph_to_static (#23207 ) * Add unittest for transformer prediction in dygraph_to_static. * fix bug in fill_constant api. * Make transpose support size 0. test=develop	5 years ago
zhongpu	48144e4099	Exhaustive search (#22821 ) * use global conv cache; test=develop * use singleton cache; test=develop * fix format error; test=develop * add cudnn helper header; test=develop * fix header error; test=develop * fix mac unitest; test=develop * fix mac unitest; test=develop * fix file format; test=develop * fix include file error, test=develop * remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop * fix test_elementwise_mul_op_dim, test=develop Co-authored-by: phlrain <phliuhongyu@126.com>	5 years ago
Adam	da7c73f847	Delete is_test attribute from activation operators (#23318 ) * Delete is_test from activation operators test=develop * Revent unneeded changes test=develop	5 years ago
Kaipeng Deng	21d95be0db	Add inplace abn op (#22806 ) * add inplace_abn_op. test=develop	5 years ago
wangchaochaohu	69e3f99362	refine the error message (#23212 ) * refine the error message of tensor_array_read_write Op	5 years ago
xujiaqi01	3a45767d49	add fleet pslib pull and push sparse op and push dense op (#23139 ) * add fleet pslib pull and push sparse op and push dense op * test=develop	5 years ago
GaoWei8	20eed5401a	Change fluid.layers.where‘s C++ operator name (#23250 )	5 years ago
Jacek Czaja	012886df79	[DNNL] Softmax mkldnn op inplace support (#23197 )	5 years ago
zhaoyuchen2018	58615a6272	Improve elementwise performance. (#23001 ) * Improve elementwise performance. Elementwise performace is poor as walk into CommonGradBroadcastCUDA, add some new kernels for different data pattern. * Add some cuda kernel to speedup common broadcast cases. test=develop * Add more test cases and fix cuda kernel bug. test=develop * Remove tests as cpu percision fails.test=develop * Refine SplitDims, test=develop * Change file mode, test=develop	5 years ago
Zhaolong Xing	430b0099c9	[Paddle-TRT]: Ernie Dynamic shape support. (#23138 ) * add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop	5 years ago
liym27	6af480ca33	Support int64 for op assign_value. test=develop (#23179 )	5 years ago
Zeng Jinle	53e6f8e1da	rename macro, test=develop (#23161 )	5 years ago
Zeng Jinle	bba740710d	add cuda resource pool for BufferedReader, test=develop (#23152 )	5 years ago
Zeng Jinle	7d8d50b6cc	rename no_need_buffer_vars macro, test=develop (#23160 )	5 years ago
Liufang Sang	a486a739e1	fix compile error in win gpu (#23196 ) * fix compile error in win gpu test=develop * fix compile error in win gpu test=develop * fix compile error in win gpu test=develop	5 years ago
Zeng Jinle	7ca77a90ac	add Tensor::IsSharedBufferWith method, test=develop (#23175 )	5 years ago
Zeng Jinle	b8886bf122	rename no_need_buffer_vars_macro, test=develop (#23159 )	5 years ago
wangchaochaohu	b721e23b25	transpose cudnn using cudnn v7 api (#19738 ) * refine the transopose conv using v7 to choose algorithm	5 years ago
Adam	4f5e4540f8	Improve SGD jit code to work with large data (#23120 )	5 years ago
Liufang Sang	4db031902d	add dequantize_log_op and make pyramid hash support int8 weight (#22548 ) * add dequantize_log_op and make pyramid hash support int8 weight test=develop * add unittest and update pyramid hash op test=develop * remove paddle_enforce test=develop * fix error message test=develop * remove incorrent commit test=develop * fix error message in log_dequantize test=develop * change 2019 to 2020 test=develop * remove useless check_grad test=develop	5 years ago
Zeng Jinle	9258e96094	fix read op comments, test=develop, test=document_fix (#23122 )	5 years ago
Zeng Jinle	acfc9b8a70	Reader sequential and inference partial feed (#22699 ) * sequential reader stage 1, test=develop * fix ut, test=develop * fix iterable=False reset bug, add some logs and polish code, test=develop * inference feed partial data, test=develop * Turn on keep_order=True for test, test=develop * enhance ut to test more cases, test=develop * test commit for reverting * Revert "test commit for reverting", test=develop This reverts commit 80aef42ef52ba1ee79627d6f663a624ec4f12f58. * add ut of merged and unmerged results, test=develop * add more uts for coverages and add en doc of api, test=develop * follow comments, test=develop * change note style, test=develop	5 years ago
Wilber	95b356a069	update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114 ) update embedding_eltwise_layernorm fuse pass and fused kernel, to support multi input	5 years ago
Zeng Jinle	a31d7328b7	Add dygraph double grad implementation (#22939 ) * add double grad implementation for dygraph, test=develop * polish code, add uts, test=develop * fix place bug, test=develop * polish codes, add more uts for coverages, test=develop * add no_grad_set, test=develop * add star gan ut, test=develop * follow comments, test=develop	5 years ago
songyouwei	2e2da7124b	high-performance dygraph slice (#22879 ) * move __getitem__ to cpp * bug fix * add type check and gil release * support negative step with omitted ends test=develop * code refine test=develop * bug fix test=develop * slice always return different pyobj test=develop	5 years ago
Sylwester Fraczek	abee05a8c8	added mkldnn swish activation (#23041 )	5 years ago
Zhaolong Xing	8c6fde9e69	fix align error (#23090 ) test=develop	5 years ago
Liufang Sang	915b892a15	Fix div zero in fake quantize op (#22966 ) * fix div zero test=develop * fix div zero test=develop * add hostdevice function test=develop * add eps when is zero test=develop	5 years ago
Feiyu Chan	01ab8a0619	add approximation for gelu, test=develop (#22961 ) add approximation for gelu, default value is False (only kernel with eigen is added, remove code for computing gelu with MKLDNN temporarily)	5 years ago
Adam	5842ae6785	Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695 )" (#22985 )	5 years ago
GaoWei8	1dc1f9270e	Fix lod error of concat op for axis = 0 (#22538 )	5 years ago
Zhang Ting	714b0076b6	Override GetKernelTypeForVar to avoid device transform, test=develop (#23032 )	5 years ago
wangchaochaohu	112e3edbf6	fix the conv group problem test=develop (#23025 )	5 years ago
wangchaochaohu	3757e0687c	Add Unittest for backward of fusion group (#22932 ) * add fusion group test for backward and refine code	5 years ago
chengjuntao	63f3ada7b9	fix bug which input shape (#22965 ) * fix bug which input shape, test=develop * add error type,test=develop	5 years ago
wangchaochaohu	f0d193a23c	Cast fusion for fusion group (#22876 ) * add support for expression type convert and add cast Op support in fusion group	5 years ago
yaoxuefeng	29a7a52d38	Fix instag (#22632 ) * update * update test=develop * update compile set test=develop * update compile set test=develop * update test=develop * update test=develop * update test=develop * update compile setting test=develop * update compile setting test=develop * update run demo test=develop * update test=develop * update test=develop * fix test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update format test=develop * update format test=develop * update style test=develop * update style test=develop * change style test=develop * change style test=develop * change style test=develop * add dataset unittest test=develop * update test=develop * update for record test=develop * udpate style for record test=develop * update for record test=develop * update for record test=develop * update for record test=develop * fix format test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * fix compile warning test=develop * add attr default test=develop * add unittest test=develop * fix style test=develop * fix style test=develop * change out_val_ifempty to out_val_if_empty test=develop	5 years ago
wawltor	f154d5860f	Speed up the matmul op, use the gemm replace the batch gemm (#22926 ) In the op of gemm, we use the gemm to replace batch gemm, speed up the matmul op	5 years ago
Adam	056edf3929	Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695 )	5 years ago
Zhaolong Xing	8d6dc102fe	[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494 ) * 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop	5 years ago
guofei	3d8571e884	modify assign op and add unittest of assign op (#22769 ) As the title.	5 years ago
Zeng Jinle	d33c4343e1	Imperative tracer refactoring (#22457 ) * refine grad maker, test=develop * refactor tracer stage 1, test=develop * merge develop to solve conflict third times, test=develop	5 years ago
tangwei12	ad9c8f6d2d	fix communicator when break under pyreder mode (#22911 ) * fix communicator when breaking under PyReader mode, test=develop * revert some vlog level to 0, test=develop	5 years ago
mapingshuo	5ba9dfc16a	add lookup_table_dequant_op (#22900 ) add lookup_table_dequant_op	5 years ago
Zhaolong Xing	dd67d44a50	[Paddle-TRT] : (Part1) Dynamic shape support (#22868 ) * change the ci trt from version 5. to 6.0 * paddle-trt dynamic shape support init * conv+bias or conv+bn dynamic shape support test=develop * modity trt engine opconvert test=develop * fix ci error test=develop	5 years ago
tangwei12	07e13b84cd	remove vlog, test=develop (#22898 )	5 years ago
Wilber	f686310d81	fix concat_mkldnn op. test=develop (#22692 ) fix concat_mkldnn op when encounter extreame conditions.	5 years ago
Zhaolong Xing	1a533ed2de	[BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848 ) test=develop	5 years ago
石晓伟	ddb9b46fec	change the function in op_teller, test=develop (#22794 ) * change the function in op_teller, test=develop * correct the commit-id, test=develop	5 years ago
tianshuo78520a	433cef03e5	fix typo word (#22784 )	5 years ago
Kaipeng Deng	ebc7ffc300	fix detection_map. test=develop (#22705 )	5 years ago
zhaoyuchen2018	72dde4abde	Refine adam op to improve performance, test=develop (#22346 ) * Refine adam op, test=develop * Fuse kernels together to reduce cpu time. * Refine paddle enforce, test=develop * Remove some comments, test=develop * Refine code,test=develop * Refine cuda kernel, test=develop * Refine code according to comments, test=develop	5 years ago
wangguanzhong	f2d1cd119a	fix lod level, test=develop (#22755 )	5 years ago
FlyingQianMM	79d712346f	Correct CPU gradients of the argsort op (#22739 ) * Correct CPU gradients of the argsort op, form a network to test its forward and backward process, test=develop * fix dynamic threshold error in test_argsort_op, test=develop	5 years ago
guofei	ae8b5f11a3	Change ShareDataWith() to TensorCopy() in ref_by_trainer_id (#22717 ) As the title	5 years ago
chengjuntao	15c2667143	register fp16 for assign op (#22744 ) * register fp16 for assign op, test=develop * add op test for fp16, test=develop	5 years ago
dyning	1c0653462d	fix generate_mask_labels lod level (#22743 )	5 years ago
GaoWei8	ba140222d6	fix compile&runtime lod_equality of lod_reset (#22737 )	5 years ago
ShenLiang	3132681e8a	add partial_sum op in contrib (#22292 ) * add partial_sum_op, test=develop * modify the Paddle Error Message, test=develop * modify the Paddle Error Message, test=develop * modify the bug for python3, test=develop * modify the ut for ci, test=develop * mv to contrib, test=develop * use check_variable_and_dtype, test=develop * fix ci, test=develop * fix conflict, test=dvelop * add partial concat, test=develop * fix the conflict, test=develop * fix the error, test=develop * rm SSE4, test=develop	5 years ago
ShenLiang	e136661304	add partial_concat op in contrib (#22528 ) * add partial_concat, test=develop * fix the grids and blocks, test=develop * fix the Paddle_Enforce, test=develop * fix the doc of op, test=develop * fix the doc, test=develop * fix the doc of the op, test=develop * replace -1 with None, test=develop	5 years ago
tianshuo78520a	d2ba91aad1	fix typo words (#22653 )	5 years ago
Yibing Liu	6e7bfe30a6	register fp16 kernel for some ops (#22650 ) (#22696 ) test=develop	5 years ago
tangwei12	66a3150135	SYNC with communicaotor (#22344 ) * add sync communicator and implement	5 years ago
Yiqun Liu	22bbd54719	Add the support of fp16 in fusion_group (#22239 )	5 years ago
Huihuang Zheng	adfa5b8354	Add PADDLE_ENFORCE to Check Sequence Length of RecurrentOp (#22673 ) 1. Add PADDLE_ENFORCE to Check Sequence Length of RecurrentOp. 2. Also enrich PADDLE_ENFORCE error messages.	5 years ago
lidanqing	d926214535	[UT coverage] improve the mul_mkldnn_op line coverage (#22408 ) * improve the mul_mkldnn_op line coverage test=develop * remove fp32 mul mkldnn kernel test=develop * locally refactoring test=develop * change according to reviews test=develop	5 years ago
Zhaolong Xing	a06d75a280	[Paddle-TRT] Refine the error log about runtime batch and max_batch_size. (#22535 ) * fix trt log test=develop * fix comments test=develop	5 years ago
Adam	608447bfd5	Update MKLDNN to v1.2 (#22521 )	5 years ago
Adam	ab610a34ff	transpose_mkldnn code change to meet Paddle standards (#22591 )	5 years ago
Jiawei Wang	8f035fb637	Add TopK Op Grad CPU&GPU Kernel test=develop (#22628 ) * Add TopK Op Grad CPU&GPU Kernel test=develop * Add TopK Op Grad, modify grad op maker test=develop * Add TopK Op Grad, modify grad op maker test=develop * Add TopK Op Grad, modify PADDLE_ENFORCE test=develop * Add TopK Op Grad, modify PADDLE_THROW test=develop * Add TopK Op Grad, modify unittest test=develop * fix ngraph top k op unittest test=develop	5 years ago
Steffy-zxf	90ee366653	update ops's unittest data type from float32 to float64 and shape over 100 (#22544 ) * update ops's unittest of elementwise_pow, elementwise_max, elementwise_min, scale and sqrt 1. update elementwise_pow, elementwise_max and scale's unitests with input data type (float32 -> float64) 2. fix bug that the elementwise_pow doesn't meet threshold requirements with tackling float64 data 3. remove sqrt from op_accuracy_white_list.py 4. update the unittests of elementwise_pow, elementwise_max and elementwise_min ops that their input data shape over 100 5. test=develop * modify the writing style according suggestions test=develop	5 years ago
Zhaolong Xing	8acd745c25	[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486 ) * 1. optim multihead matmul: fuse three fc to multihtead matmul test=develop * fix conflict test=develop * fix comments test=develop	5 years ago
Guo Sheng	31b5464632	Add support for dynamic_decode(while) training. (#22231 ) * Add support for dynamic_decode(while) training. test=develop * Fix assign_op and tensor_array_read_write_op after solving conflict. test=develop * Fix test_rnn_decode_api.py. test=develop * Refine docs for apis in rnn.py. test=develop * Adjust outputs of dynamic_decode. test=develop * Remove the force_cpu update in assign_op. test=develop * Remove the force_cpu update in assign_op. test=develop * Make RNNCell.get_initial_states support batch_dim_idx argument. test=develop * Rename _create_array_outof_while as _create_array_out_of_while in rnn.py. test=develop	5 years ago
Wojciech Uss	4cddb43c5c	Add support for Ernie NLP model to the Slim QAT (#22506 ) * a test for Ernie QAT INT8 accuracy check test=develop * Remove NLP comparison test to split PRs test=develop * Fix typo and tabs, delete commented lines test=develop * re-combine the 2 PRs, test=develop Co-authored-by: Michał Gallus <sand3r@interia.eu> Co-authored-by: bingyanghuang <33643817+bingyanghuang@users.noreply.github.com>	5 years ago
Double_V	58d99247f4	support slice double grad, test=develop (#22166 ) * support slice double grad, test=develop * merge two doublegradopmaker to one doublegradopmaker,test=develop * change the shape of slice_OP's unittest, test=develop	5 years ago
hutuxian	1a7962be97	Paddlebox about box_wrapper (#22497 ) Refine PaddleBox Framework, Main functions: * Add MetricMsg util class, which can calculate metrics like AUC, bucket_error, COPC. * Replace FeedPass with new interface: BeginFeedPass & EndFeedPass * Refactor Pull/Push Sparse Function in box_wrapper. * Use CUDA Kernel to copy keys and copy feasign between tensor and boxps struct. * Cache copied keys in pull sparse in order to reuse it in push period.	5 years ago
huzhiqiang	9e29d3ebed	【OpPorting Example】DEMO OF FIX COMPILE&RUNTIME LOD_EQUALITY (#22460 )	5 years ago
zhaoyuchen2018	54970444ce	Improve transpose performance with tile sm copy, test=develop (#22311 ) * Refine code, fix select tile error,test=develop * Refine element type and some comments, test=develop * Refine comments and gpu utils, test=develop * Remove some useless condition * Refine floor and ceil, test=develop * refine for loop. test=develop Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>	5 years ago
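Wilber	a90fa54092	Compile without nccl deps. [1/2] (#22509 ) Support compiling without the nccl dependency. [1/2] With multiple GPUs, if the build was compiled without the WITH_NCCL switch enabled, the cards cannot communicate, so only a single card can be used. Co-authored-by: 石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>	5 years ago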
Wilber	de009152a7	Compile without nccl deps. [2/2] (#22484 ) Compile without nccl deps. [1/2] Co-authored-by: 石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>	5 years ago
Yiqun Liu	4b2227e958	Fix dismatch of std::max's arguments type on windows. (#22507 ) test=develop	5 years ago
Wilber	870f465887	fix test_fusion_seqpool_concat lod level between compile and runtime (#22488 )	5 years ago
Zhong Hui	a61d09527b	Fix the integer overflow problem of sequence2batch (#22479 ) Fix the integer overflow problem in the op of sequence2batch, change the int32_t to size_t， In the /paddle/fluid/operators/math/sequence2batch.h#L122.	5 years ago
cc	197913ebe1	Add weight quantization in post_training_quanzitaion (#22445 ) * support weight quantization in post_training_quanzitaion, test=develop * add test for weight quantization, test=develop	5 years ago
Tao Luo	7c9ce097f1	refine reshape_op shape error message (#22480 ) test=develop	5 years ago
LielinJiang	2b1386b2b2	optimize performance of interpolate op (#22436 ) * optimize interpolate op, test=develop	5 years ago
Yiqun Liu	44b45b9f07	Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456 ) * Add log in memory::Copy for debug purpose. * Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one. * Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one. test=develop * Change the type of second_dim from size_t to int64_t. test=develop	5 years ago
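Wilber	7bc4b09500	add WITH_NCCL option for cmake. (#22384 ) Added WITH_NCCL to the cmake options to explicitly specify whether the NCCL-related code is compiled. WITH_NCCL is ON by default, but it is turned off if WITH_GPU is OFF. Added the PADDLE_WITH_NCCL definition. A single-machine, single-card setup can disable NCCL compilation; a multi-card setup needs NCCL enabled (the default), and if NCCL is disabled only a single card can be used. Co-authored-by: 石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>	5 years ago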
Tao Luo	943cb8c664	fix sigmoid cudnn bug (#22439 ) * Sigmoid bug fix, test=develop * fix code format test=develop Co-authored-by: Manjunath Bhat <manjunathbhat9920@gmail.com>	5 years ago

5169 Commits (b43b46e619ad2aaccf960c7dd4db7f6d15d8a399)