Paddle

Commit Graph

Author	SHA1	Message	Date
Yu Yang	81520a24cf	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor	7 years ago
Yu Yang	9bd70a1e04	Change tensor uses proto::VarType::type test=develop	7 years ago
Yu Yang	8175983ef9	Merge pull request #14814 from reyoung/feature/gprof Add gperftools supports for PE	7 years ago
Yu Yang	5e60906996	Fix compile error test=develop	7 years ago
Yu Yang	7604b1ad51	Fix Eigen macro when using GPU The macro should be defined by compiler rather than by source. test=develop	7 years ago
Yu Yang	b22d638d8f	Speed up SizeOfType test=develop	7 years ago
sneaxiy	66182abda6	add cuda cudnn version check test=develop	7 years ago
Zeng Jinle	add98c9e7d	Merge pull request #14745 from sneaxiy/fix_eigen_deallocate Fix eigen deallocate bug	7 years ago
Tao Luo	54fcafb5f6	Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt Implement conv3d with mkldnn library	7 years ago
sneaxiy	0f96c2e80f	fix thread-safety bug test=develop	7 years ago
Yihua Xu	65dbc7cca4	Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt	7 years ago
tensor-tang	4a93db9288	remove jit namespace test=develop	7 years ago
sneaxiy	900765224c	fix deallocate bug test=develop	7 years ago
liuhongyu	773dc73fbf	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support	7 years ago
liuhongyu	8daf67f90f	fix bugs; test=develop	7 years ago
Xin Pan	052cc5f538	Merge pull request #14725 from ZongwuYang/my-cool-stuff My cool stuff	7 years ago
Wu Yi	29d9fb53fc	[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661 ) * wip multi process multi gpu dist training * workable for p2p * update test=develop * change back env name test=develop * fix alloc init * fix cpu build test=devlop * fix mac tests test=develop * refine code * refine test=develop	7 years ago
liuhongyu	968dd3c078	add cudnn 5 support; test=develop	7 years ago
ZongwuYang	1560eb4a6d	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my-cool-stuff	7 years ago
ZongwuYang	deb04809bd	test=develop Fix the bug that profiler cannot trace the nccl allreduce operator	7 years ago
Yihua Xu	669191c9cc	Implement conv3d with mkldnn library (test=develop)	7 years ago
Hongyu Liu	4f71a6ee2c	Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm Add cudnn lstm	7 years ago
Yibing Liu	c7382df80f	Print assert failure id in lookup_table_op (#14698 )	7 years ago
phlrain	cf1fe61004	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm	7 years ago
Tao Luo	20120d9c97	Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn [MKL-DNN]conv2d transpose	7 years ago
Tao Luo	ea47685f91	Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum Softmax for inference MKL further changes	7 years ago
minqiyang	a02ce58f2c	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	7 years ago
Tao Luo	4ec9de0122	Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul EltwiseMul: Changes from previous PR	7 years ago
Clementine	6c71c1f8f9	Add activation gelu (#14569 )	7 years ago
Michal Gallus	9455be0ba5	EltwiseMul: Extract StringToFormat to MKLDNN helper test=develop	7 years ago
Jacek Czaja	8bfa1fa9bb	- ASUM MKL integration	7 years ago
liuhongyu	05917c3c79	add cudnn lstm; test=develop	7 years ago
peizhilin	38715e6fd0	minor fix	7 years ago
Jacek Czaja	fb24690a58	- conv2d transpose MKL-DNN test=develop - Added new header for MKLDNN reuse functionality - Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt - Buildable conv transpose mkldnn and conv mkldnn using conv template - Conv2d transpose roughlt implemented and buildable - Added modifications conv2d transpose MKLDNN unit tests - Fix to UT of conv2d transpose mkldnn op - Wrong type of MKLDNN primitive was chosen for conv2d transpose - HAcks for conv2d transpose - UT enalbed - Replaced copying loop with memcpy - Draft of passing lambda into AcquireMemory - Made reorder (IOHW->OIHW) to be called only once	7 years ago
minqiyang	be04d99fe4	Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog test=develop	7 years ago
minqiyang	53433d7f2e	Revert the changes of VLOG test=develop	7 years ago
peizhilin	36cd18b549	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
peizhilin	b2f8d4183d	Given the different fraction_of_gpu_memory_to_use depends on platform	7 years ago
Yu Yang	26af9cf90c	Merge pull request #14565 from chengduoZH/fix_cublas_warp_error Fix cublas warp error	7 years ago
chengduozh	f7847ca6a3	fix cublas warp error test=develop	7 years ago
luotao1	e21edb26f6	add Set/GetCPUNumThreads api	7 years ago
peizhilin	445fff24dc	add the bigobj option to NVCC compile fix code style	7 years ago
chengduo	00b9e9a135	Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929 ) * refine cublase test=develop * code refine * refine cublas * add GEMME_EX * add enable_cublas_tensor_op_math doc and add cublasCall test=develop * fix CublasCall for cuda version test=develop * fix error test=develop * fix GEMM_EX to be compatible with gcc 4.8 test=develop * add GEMM_EX test=develop * to compatiable with gcc4.8 test=develop	7 years ago
peizhilin	7c8c9dc9bf	fix unit test cases	7 years ago
wopeizl	d9a1f3e58e	Windows/online (#14474 ) * add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop	7 years ago
peizhilin	6e66fadb95	clean up the pre-definitions on windows	7 years ago
peizhilin	67562a6fcd	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago
peizhilin	703b26e697	add profiler, parallel_executor back	7 years ago
chengduo	a8d3aaae2a	print output log warning (#14497 ) test=develop	7 years ago
peizhilin	3a72a634cf	Merge remote-tracking branch 'upstream/develop' into windows/build	7 years ago

1 2 3 4 5 ...

463 Commits (81520a24cf0edb065231ddeecea803a8f0149eeb)