Commit Graph

5291 Commits (23dec787723f6906077e77b2d15820a78bde1344)

Author SHA1 Message Date
phlrain 4b9689379f fix cudnn lstm; test=develop
6 years ago
phlrain d1a17cadd4 fix cudnn rnn; test=develop
6 years ago
Qiao Longfei 9450048acb add PADDLE_ENABLE_REMOTE_PREFETCH to enable remote prefetch
6 years ago
Xin Pan 75939c2059 fix
6 years ago
Tao Luo 20120d9c97
Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
6 years ago
Qiao Longfei 3e45a5a5ec lookup_table gpu kernel support prefetch
6 years ago
Zhaolong Xing d215293c92
Merge pull request #14649 from NHZlX/add_params_sync_pass
6 years ago
qingqing01 731d45a39a
Enable BatchNorm to use global mean and variane during training (#14630)
6 years ago
nhzlx 49c28b8c52 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
6 years ago
nhzlx 3c83a2f720 fix comments
6 years ago
Xin Pan ad6ed5b745 fix py3
6 years ago
Xin Pan 0cc9ab3dc2 enable API check for readers
6 years ago
luotao1 4a4daa8ab4 Merge branch 'develop' into has_attr
6 years ago
Qiao Longfei 75eba6108d Add scope doc (#14582)
6 years ago
Tao Luo ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
6 years ago
Qiao Longfei 3a3cfc2d8d prefetch support gpu
6 years ago
baojun-nervana d5ee05e6c3 Replaced VarIsTensor
6 years ago
baojun-nervana e6bd53be60 Named to RuntimeInferShape
6 years ago
Sang Ik Lee 24e70920db Refactor some build settings.
6 years ago
baojun-nervana a29696146c Added annotation
6 years ago
Sang Ik Lee d6125a5eec Include ngraph in inference demo build.
6 years ago
baojun-nervana caf4b937b3 Added RunInferShape
6 years ago
baojun-nervana 1d19eb2bd4 Implemented ngraph engine
6 years ago
Qiao Longfei 4b9082a4cd follow comment
6 years ago
Tao Luo b4de023ee1
Merge pull request #14636 from Superjomn/fix/word2vec
6 years ago
luotao1 fe915901cd update Opdesc's HasAttr
6 years ago
chengduo 6776e92846
refine tensor_array_write_read (#14643)
6 years ago
nhzlx d3e140a572 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
6 years ago
nhzlx d666c8eb1d fix benchmark
6 years ago
nhzlx 900fbb83f9 add params sync pass
6 years ago
superjomn 9c665c81ae update
6 years ago
Jacek Czaja 48e1b97e8e - Coding style fixes
6 years ago
Qiao Longfei d32de7e6e1 fix code format test=develop
6 years ago
Qiao Longfei 5a660aee7d update log level in parameter prefetch test=develop
6 years ago
Qiao Longfei 8ebde595c9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
Qiao Longfei b9d3d75fc4 fix prefetch dependency test=develop
6 years ago
Qiao Longfei 145c535750 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
minqiyang 9d7c3b18c0 Polish code
6 years ago
minqiyang 2b430adaee Polish code
6 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
Jiabin Yang 12e1719f96
Merge pull request #14352 from JiabinYang/enhance_hierachical_sigmod_op
6 years ago
Qiao Longfei 40f68b1349 unit test ready
6 years ago
Qiao Longfei 36e26a53b0 Optimize bilinear tensor product op (#14485)
6 years ago
Tao Luo 4ec9de0122
Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul
6 years ago
Qiao Longfei 35b79ab865
Merge pull request #13983 from jacquesqiao/add-ctr-reader
6 years ago
Qiao Longfei da387720d7 fix infer compile test=develop
6 years ago
Jacek Czaja cf40daee58 - Building fix to softmax for inference
6 years ago
Clementine 6c71c1f8f9 Add activation gelu (#14569)
6 years ago
Michal Gallus 9455be0ba5 EltwiseMul: Extract StringToFormat to MKLDNN helper
6 years ago
Jacek Czaja 1540df51cf - Fix to test_conv2d_transpose_mkldnn for GPU
6 years ago
JiabinYang eda069068d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
JiabinYang a08dc83eb0 remove arg 'non_leaf_num', test=develop
6 years ago
chengduo 6648f5ed6f
add ShareLoD for dropout_grad (#14616)
6 years ago
Qiao Longfei 18fd2d01b7 update embedding api
6 years ago
JiabinYang 7594787deb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
JiabinYang c469334cfb polish python code and comment, test=develop
6 years ago
Xin Pan 3c77ce3751
Merge pull request #14593 from panyx0718/fix5
6 years ago
Qiao Longfei 92afbb923c fix compile problem test=develop
6 years ago
Tao Luo e8ef14d2a7
Merge pull request #14610 from Superjomn/revert/cache_fix
6 years ago
Qiao Longfei 97cbec9b74 clean code
6 years ago
Qiao Longfei 1edd435da6 fix ci problem test=develop
6 years ago
JiabinYang 87648f8edf merge develop, test=develop
6 years ago
Yiqun Liu 726f2cefe3
Fix bug of referencing a temporary variable. (#14614)
6 years ago
wopeizl db9284ecde
Merge pull request #14617 from wopeizl/windows/online
6 years ago
JiabinYang c3c3c0b33c polish code, test=develop
6 years ago
gongweibao 867c312bc4
Fix allreduce dependency order. (#14586)
6 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
6 years ago
phlrain 487ee36aec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
tangwei12 56a4912b76
Make NCE_OP more efficient and support SelectedRows (#14469)
6 years ago
liuhongyu 1ffe41d722 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
Qiao Longfei 9589babe12 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
liuhongyu 05917c3c79 add cudnn lstm; test=develop
6 years ago
Zeng Jinle 1c48d61442
Merge pull request #14599 from sneaxiy/fix_mac_unittest_bug
6 years ago
Qiao Longfei f35f3fe77a ctr reader can not be used in windows
6 years ago
peizhilin 6a85dd3278 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin 38715e6fd0 minor fix
6 years ago
JiabinYang 7389597ce2 Update API.spec, test=develop
6 years ago
peizhilin 511cc9024a fix for build issue
6 years ago
Qiao Longfei 6bef565dac clean code test=develop
6 years ago
Qiao Longfei e7d1f524f3 change log level
6 years ago
JiabinYang 7e4bd695e6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
Qiao Longfei fe54adf70c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
6 years ago
JiabinYang b10df8bcfa refine code and add none bias ut, test=develop
6 years ago
Kaipeng Deng 251a1bb0f4
Merge pull request #14588 from heavengate/revert_interpolate
6 years ago
Qiao Longfei 668ae9083e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
6 years ago
Qiyang Min 30e47bce8b
Merge branch 'develop' into revert_vlog
6 years ago
superjomn 4babc6b06c update
6 years ago
sneaxiy f3522a11d2 fix mac unittest bug
6 years ago
Qiao Longfei 87e4edd2ea fix grad_varname in remote prefetch
6 years ago
superjomn dc249d3b69 Revert "fix transfer cache thread_local bug (#14581)"
6 years ago
Qiao Longfei d98c59fd2c support none sliced variable
6 years ago
dengkaipeng bb489d4cc9 add interp_method default bilinear. test=develop
6 years ago
dengkaipeng 78f563917c revert interpolate_op to bilinear_interp_op & nearest_interp_op. test=develop
6 years ago
Jacek Czaja fb24690a58 - conv2d transpose MKL-DNN
6 years ago
tensor-tang 7a91271436
Merge branch 'develop' into fea/jit/rnn
6 years ago
minqiyang be04d99fe4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
6 years ago
wopeizl 05b7ee7eeb
Merge pull request #14545 from wopeizl/windows/online
6 years ago
JiabinYang 81e145764d refine code and comments, test=develop
6 years ago
Qiao Longfei af2f5fc824 fix some bugs
6 years ago
JiabinYang 2f6b529aff refine code and comments, test=develop
6 years ago
Xin Pan 3e665862b8 Protect important header files.
6 years ago
minqiyang e43f5bc77c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_resnet_ut_in_py36
6 years ago
minqiyang 53433d7f2e Revert the changes of VLOG
6 years ago
tensor-tang 1f0291a51e add comments and follow comments
6 years ago
tensor-tang 557229bd39 Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
6 years ago
Qiao Longfei ed9fa4b301 can run
6 years ago
peizhilin 30849d1f20 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01 6224e61fd9
Transpose-Flatten-Concat fusion operator. (#14568)
6 years ago
Yan Chunwei 5c073a4db2
fix transfer cache thread_local bug (#14581)
6 years ago
Xin Pan 87332bb18d
Merge pull request #14579 from Superjomn/fix/transfer-cache-compile-error
6 years ago
minqiyang 8b154c172f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_resnet_ut_in_py36
6 years ago
Qiao Longfei 686d15c8e0 update grpc_variable_response
6 years ago
Jiabin Yang 13bc7619f5
Merge pull request #14552 from JiabinYang/fix_mac/fix_pinned_memory
6 years ago
tangwei12 3639d99f99
Fix save and load lookup table/optimizer vars (#14301)
6 years ago
peizhilin 36cd18b549 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01 39ec80def4
Remove the memory copy of feeding data in C++ inference API (#14577)
6 years ago
peizhilin b2f8d4183d Given the different fraction_of_gpu_memory_to_use depends on platform
6 years ago
Qiao Longfei d827881502 fix pserver and prefetch rpc
6 years ago
peizhilin 1afa9492af Recover the profiler
6 years ago
Yiqun Liu bf222f197d
Use sub scope in tensor_array_to_tensor op. (#14524)
6 years ago
superjomn 4b40c0013b fix compile
6 years ago
JiabinYang 02d68051db add sparsed bias grad, test=develop
6 years ago
dzhwinter 840c1b29ad
test=develop (#14562)
6 years ago
Qiao Longfei 5856c2f332 change Var to FindVar
7 years ago
Yu Yang 26af9cf90c
Merge pull request #14565 from chengduoZH/fix_cublas_warp_error
7 years ago
Qiao Longfei 312b7786d9 clean code
7 years ago
Qiao Longfei 2b6c0c09d6 add unit test
7 years ago
Yan Chunwei 923c8e3332
add benchmark for inference (#14571)
7 years ago
Qiao Longfei 47280ef8b4 lookup table op support prefetch
7 years ago
Yan Chunwei a7188d5bc7
fix executor transfer cache bug (#14518)
7 years ago
gongweibao c1bf9664cd
Add options to disable SO_REUSEPORT of grpc. (#14269)
7 years ago
minqiyang ee73810fd5 Fix API.spec
7 years ago
Qiao Longfei 4ad5fd8f54 add parameter prefetch
7 years ago
Qiao Longfei 9d276fe8a8 add parameter prefetch
7 years ago
minqiyang d2045260a5 Change visibilities of variant_visitor of pybind11
7 years ago
minqiyang b67229187e Change to PYBIND11_MODULE because the deprecation of PYBIND11_PLUGIN
7 years ago
minqiyang 81994e84e0 Change the include files because the version changes of pybind11
7 years ago
Tao Luo e90afec47b
Merge pull request #14543 from luotao1/threads
7 years ago
qingqing01 64ca3d176c
Add bias_attr in sequence_conv_pool API. (#14553)
7 years ago
chengduozh f7847ca6a3 fix cublas warp error
7 years ago
Zhaolong Xing e52d90a35e
Merge pull request #14527 from hjchen2/develop
7 years ago
JiabinYang 47c4e65d60 test=develop
7 years ago
luotao1 116979a40a refine api name
7 years ago
luotao1 e66b4c6bff adjust tester_helper to make multi-instance multi-thread work
7 years ago
luotao1 a5c4b463c9 add SetMKLDNNThreadId api
7 years ago
luotao1 e21edb26f6 add Set/GetCPUNumThreads api
7 years ago
Qiao Longfei 9851a53478 add prefetch part in pserver
7 years ago
JiabinYang 42470f14b7 test=develop
7 years ago
peizhilin 445fff24dc add the bigobj option to NVCC compile
7 years ago
sabreshao 61c5f13fcf Fix cmake for AMDGPU platform (#13801)
7 years ago
qingqing01 36f08eef3b
CUDA kernel for density_prior_box_op. (#14513)
7 years ago
tensor-tang 6a7f83d45d enable gru jitcode and refine act and lstm jitcode
7 years ago
tensor-tang 686eaf20ba Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
7 years ago
peizhilin 81bd7eeff4 rollback the format
7 years ago
Qiao Longfei 1f87f263a2 clean code
7 years ago
Qiao Longfei 361cb0e078 lookup remote table can compile
7 years ago
JiabinYang 0fca16847c temp
7 years ago
JiabinYang e9be3366a9 test=develop
7 years ago
Zeng Jinle bfc34ac19f
Merge pull request #14536 from sneaxiy/dlpack_integration
7 years ago
chengduo 00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929)
7 years ago
peizhilin dfbac60398 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 7c8c9dc9bf fix unit test cases
7 years ago
tensor-tang 0c5ed5f6fc enable peephole jitcode
7 years ago
JiabinYang 3c6102a367 test=develop
7 years ago
Qiao Longfei 7c3ce2952d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
7 years ago
Qiao Longfei 60a4f69b3c add lookup remote table op
7 years ago
Qiao Longfei e0b48f7e29 init lookup remote table
7 years ago
tensor-tang e3b61cf52b init gru jitcode and fix lstm jitcode
7 years ago
tensor-tang 0f25446574 Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
7 years ago
Dun ae7d22862b Group Norm (#13843)
7 years ago
hjchen2 1adda8e06c Add more unit tests for split plugin
7 years ago
sneaxiy 488610a65a merge develop
7 years ago
Jiabin Yang de2db11735
Merge pull request #14537 from reyoung/feature/fix_macos_ut
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
Yu Yang 533c5d5803 fix(Cpu): fix cpu compile and unittest
7 years ago
sneaxiy 3912545ffe add dlpack support
7 years ago
JiabinYang 57a18e32a1 test=develop
7 years ago
peizhilin bef475c92b Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Tao Luo 5d4d117edc
Merge pull request #14502 from qingqing01/cudnn5_fix
7 years ago
Jiabin Yang f7b55de9e5
Merge branch 'develop' into enhance_hierachical_sigmod_op
7 years ago
Yu Yang e68c1fcd5a
Merge pull request #14522 from reyoung/feature/fix_op_header_deps
7 years ago
hjchen2 6eba5bd276 Fix direct copy and refine split ut
7 years ago
Qiao Longfei fd290c2580 fix mac compile of analysis
7 years ago
hjchen2 5857fb3014 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
tensor-tang 3562051302 add gru refer code and remove redundant avx code
7 years ago
JiabinYang af9a3301da test=develop
7 years ago
hjchen2 3e3599f3d9 Refine split tensorrt plugin
7 years ago
peizhilin f10e196fc8 fix build issue
7 years ago
Yu Yang 6a128dea32
Merge pull request #14515 from reyoung/feature/fix_macos_build
7 years ago
Zhaolong Xing ad349e770f
Merge pull request #14452 from NHZlX/fix_avg_pool_trt_bug
7 years ago
tensor-tang f913860873 jitkernel lstm refer support peephole
7 years ago
tensor-tang 2f9b5f2383
Merge branch 'develop' into fea/jit/rnn
7 years ago
JiabinYang 014e50c284 test=develop
7 years ago
peizhilin 6e66fadb95 clean up the pre-definitions on windows
7 years ago
Yu Yang 3edd32d070 fix(Compile): fix depends error when compile op using cub
7 years ago
Dang Qingqing cda60311f9 Fix compling with cuDNN v5
7 years ago
peizhilin 67562a6fcd Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 703b26e697 add profiler, parallel_executor back
7 years ago
Tao Luo 1d9b2a453c
Merge pull request #14508 from luotao1/warm_up_multi_thread
7 years ago
Yu Yang b3364d4035 fix(Macos): fix compile on macos
7 years ago
Yu Yang a685f305f8
Merge pull request #14479 from reyoung/feature/fix_macos_ut
7 years ago
tensor-tang 10fb4ceefc
Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul
7 years ago
jerrywgz 13e254faed refine code, test=develop
7 years ago
tensor-tang b4c826c548 Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
7 years ago
tensor-tang ce31deb7e9 refine refer code and add lstm refer code
7 years ago
jerrywgz 79cec53111 add ignore index for sigmoid cross entropy with logits op, test=develop
7 years ago
nhzlx e62872df8b fix conflicts
7 years ago
nhzlx a4dc1d4292 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
7 years ago
nhzlx faeb9b8aa9 fix compile rely problem
7 years ago
chengduo a8d3aaae2a
print output log warning (#14497)
7 years ago
Tao Luo eb9b9becdc add warm up in TestMultiThreadPrediction
7 years ago
tensor-tang c2cfb03a72 add lstm jitcode
7 years ago
Tao Luo 5cc7946313
Merge pull request #14499 from luotao1/disable_openblas_test
7 years ago
Houjiang Chen 10ae3ba486
Merge pull request #14493 from hjchen2/develop
7 years ago
nhzlx 2a84054372 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_trt
7 years ago
nhzlx b742d46520 fix demo ci bug on trt
7 years ago
peizhilin 25adf970b2 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Houjiang Chen 33c65517fd Update CMakeLists.txt test=develop
7 years ago
Tao Luo 1d3e9bde1e
Merge pull request #14488 from yihuaxu/develop_7a64d48f5_stack_opt
7 years ago
Houjiang Chen 01bda73116
Update CMakeLists.txt
7 years ago
Tao Luo 09ee266f8e disable two openblas test temporary
7 years ago
hjchen2 2c2a192eb1 Resolve merge conflicts
7 years ago
Yiqun Liu 8bc1c5d2ab
Implement the Tensorrt plugin for elementwise op (#14487)
7 years ago
tensor-tang 7aa3aff338
Merge pull request #14465 from tensor-tang/fea/jit/exp
7 years ago
Tao Luo 1b894e495f
Merge pull request #14437 from jczaja/prv-softmax-mkl
7 years ago
peizhilin 3f73c0a70d fix the build issue on windows
7 years ago
chengduo a94a7355f0
Refine the GraphNum check (#14144)
7 years ago
peizhilin 3a72a634cf Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yihua Xu a906a361be Add the macro for NVCC (test=develop)
7 years ago
Yihua Xu d91740acb1 Revert "Remove the remnant code (test=develop)"
7 years ago
Yihua Xu be50670348 Remove the remnant code (test=develop)
7 years ago
hjchen2 1622cb9937 Fix alpha tensor key
7 years ago
hjchen2 a8c077df7c Implement leaky relu tensorRT converter
7 years ago
qingqing01 9eefd2c766
Modify some infer-shape about detection operators in compile-time. (#14483)
7 years ago
Tao Luo cf685f361b
Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections
7 years ago
Yihua Xu f4c869d872 Optimize the layer_norm operator with AVX intrinsic function (#14417)
7 years ago
Houjiang Chen 816b464037
Merge pull request #14486 from hjchen2/develop
7 years ago
peizhilin 81f750a88c fix the dependency
7 years ago
peizhilin ee0fd78c81 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yu Yang f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
7 years ago
Yihua Xu f418f552df Merge branch 'develop' into develop_7a64d48f5_stack_opt (test=develop)
7 years ago
peizhilin 8443961a4f add warp_ctc back
7 years ago
hjchen2 2825685f2a Fix tensorrt plugin cmake dependency, test=develop
7 years ago
Superjomn e878a8e885 update
7 years ago
qingqing01 fd7e643153
Convolution fusion operator. (#14449)
7 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Yu Yang 7486b0ddec fix(Mac): fix unittest of macos
7 years ago
peizhilin 4a6769da84 re-organize the cmake file
7 years ago
dengkaipeng 8ef6280c03 Add operator double support. test=develop
7 years ago
Yu Yang d424115f9e Clean code
7 years ago