Commit Graph

3022 Commits (f5532877f7c748adf74afa51e403208a3777a7ef)

Author SHA1 Message Date
tensor-tang 194ce2e92c add benchmark
7 years ago
Yibing Liu 6951ef9a55
Fix the gelu backward to avoid nan (#14857)
7 years ago
Yu Yang c00e07cda0 Fix distribute compile
7 years ago
Qiao Longfei 3668f07965
Merge pull request #14844 from jacquesqiao/pserver-should-crash
7 years ago
sneaxiy ca84c2ca8f merge develop
7 years ago
sneaxiy 06f8aa5b97 remove while_op support temporarily
7 years ago
Yu Yang 81520a24cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
7 years ago
Yu Yang 9bd70a1e04 Change tensor uses proto::VarType::type
7 years ago
Yu Yang eeca721a99
Merge pull request #14842 from reyoung/feature/refine_eigen_tensor
7 years ago
tensor-tang d538513fce fix the compile error on mac
7 years ago
tensor-tang 28eb7d840c test all impls and all inplace cases
7 years ago
Yihua Xu acc6ae49b1 Fix the issue to run on AVX2 and AVX512F machines (#14851)
7 years ago
Michal Gallus 92daace55c MKL-DNN Concat: Fix segfault related to referencing deleter memory primitive
7 years ago
tensor-tang d4cab7d948 use jitkernel in one file
7 years ago
tensor-tang adc7ba2edd Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang 900c789a35 use jitcode and use vmul
7 years ago
Qiao Longfei 1870262ba9 pserver should crash early whe has problem
7 years ago
dengkaipeng a81fabd327 fix doc errors. test=develop
7 years ago
dengkaipeng cf06e50f1d add doc for adaptive pool. test=develop
7 years ago
dengkaipeng 266c6856c9 add adaptive pool 2d & 3d. test=develop
7 years ago
dengkaipeng eab4745965 add adaptive mode for pool.
7 years ago
Yu Yang 7604b1ad51 Fix Eigen macro when using GPU
7 years ago
Qiao Longfei 1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
7 years ago
sneaxiy 7923042365 merge develop
7 years ago
zhang wenhui c4c5f0b8ca
Merge pull request #14771 from frankwhzhang/bpr
7 years ago
tensor-tang 53709e7e61 refine names
7 years ago
Qiao Longfei abf140289f split selected rows op should always init output selected rows
7 years ago
frankwhzhang c9a653820b fix label_pos ,add test_layers.py, test=develop
7 years ago
frankwhzhang a672b291e5 fix code style, test=develop
7 years ago
frankwhzhang ea95f9c335 fix style bug, test=develop
7 years ago
gongweibao f1fb64b17f
Add reduce sparse tensor feature. (#14757)
7 years ago
frankwhzhang f4cc5881b0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
frankwhzhang 97de98cd0a update bpr_loss op code, test=develop
7 years ago
Yihua Xu 3821fc3950 Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
7 years ago
Yihua Xu 240d974ac5 Clean Code
7 years ago
Tao Luo 54fcafb5f6
Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
sneaxiy 387bac46b5 refine code
7 years ago
Yihua Xu 155328a488 Clean Code
7 years ago
Tao Luo 743cb840f1 update with comments
7 years ago
tensor-tang ce674b685f add readme doc and complete TODOs
7 years ago
Tao Luo 42359e88a4 clean code
7 years ago
Tao Luo 923b18877e Merge branch 'develop' into memory_load
7 years ago
Tao Luo 405b2486db support loading from memory
7 years ago
frankwhzhang 272f3d3111 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
frankwhzhang 570d89ec84 add bpr_loss operator , test=develop
7 years ago
qingqing01 549f165b59
Speed conv_fusion_op for identity activation. (#14744)
7 years ago
tensor-tang fab0ee8757 Merge remote-tracking branch 'ups/develop' into refine/jitkernel
7 years ago
Houjiang Chen c6b39a0099
Merge pull request #14714 from NHZlX/add_prelu_gpu
7 years ago
tensor-tang dbe451976b
Merge pull request #14753 from tensor-tang/refine/namespace
7 years ago
Jiabin Yang d9bb55a1f9
Merge pull request #14756 from JiabinYang/fix_hs_op
7 years ago
Yihua Xu 65dbc7cca4
Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
JiabinYang e05e1d7d88 fix bug in dist train on hs, test=develop
7 years ago
tensor-tang a1eb21e704 refine names
7 years ago
tensor-tang b523787f9f remove jit namespace
7 years ago
tensor-tang 191948c933 enable jitcode
7 years ago
tensor-tang 4a93db9288 remove jit namespace
7 years ago
Hongyu Liu 8cda28f345
Merge pull request #14733 from phlrain/add_cudnn_5_support
7 years ago
Xin Pan 73b4d1aa72
Merge pull request #14742 from panyx0718/infer2
7 years ago
Jiabin Yang 21c0f8749e
Merge pull request #14728 from JiabinYang/optimize_hs_op
7 years ago
tensor-tang 45bfa70cb8 complete vmul jit kernel
7 years ago
tensor-tang 77236e33fc init jitkernel
7 years ago
Xin Pan 82d68281c0 follow comments
7 years ago
liuhongyu 8b2898e201 fix bug of formate; test=develop
7 years ago
Xin Pan 41c28d54c6 allow customize kernel selection
7 years ago
liuhongyu 773dc73fbf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu 8daf67f90f fix bugs; test=develop
7 years ago
chengduo 04539d4c5d
Fix clip.py (#14718)
7 years ago
Michal Gallus 6fdbb365ce Include MKL-DNN header to concat op only when flag is set
7 years ago
Michal Gallus f2a880421e Fix style @ concat integration and tests
7 years ago
Michal Gallus 738069e491 Refactor MKL-DNN Concat
7 years ago
Michal Gallus 208f912512 Implement MKL-DNN Concat
7 years ago
liuhongyu 968dd3c078 add cudnn 5 support; test=develop
7 years ago
sneaxiy e694d0c2e4 fix while_op eager deletion bug
7 years ago
JiabinYang 8c75705984 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
7 years ago
JiabinYang b387a19410 optimize op with blas
7 years ago
Zeng Jinle ff4237309a
Merge pull request #14720 from sneaxiy/fix_seq_mask_op_infershape
7 years ago
Kaipeng Deng 934f13a70a
Merge pull request #14371 from heavengate/yolo_loss
7 years ago
sneaxiy 65867d8989 test=develop
7 years ago
sneaxiy 64ad051b9a merge develop
7 years ago
sneaxiy c47c451a00 fix bug
7 years ago
nhzlx e7abe6b654 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
7 years ago
nhzlx f75815b78c add prelu gpu inference
7 years ago
Yihua Xu ea00270fe8 Remove the dims checking when the dim is 3 (test=develop)
7 years ago
jerrywgz 96dc3d8326
Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
7 years ago
Yihua Xu 669191c9cc Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu 4f71a6ee2c
Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
7 years ago
Yibing Liu c7382df80f
Print assert failure id in lookup_table_op (#14698)
7 years ago
phlrain 4c256ca6be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
phlrain b65722d3cf fix uni test; test=develop
7 years ago
tangwei12 618f7620e2
add enforce for auc (#14687)
7 years ago
phlrain 2770ea1a73 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
chengduozh 3f4aca618f code refine
7 years ago
chengduozh af8c2cec13 fix operator.cmake
7 years ago
chengduozh 679d8fc6fe rename op name
7 years ago
jerrywgz 3df0538940 replace -100 to kIgnoreIndex
7 years ago
Wang Guibao 41e19eb431
AsyncExecutor (#14627)
7 years ago
whs 1b9753d109
Make pad2d support for variable paddings. (#14667)
7 years ago
luotao1 bcc90123f0 speedup box_coder_op for multi-threads
7 years ago
phlrain 6ce4250172 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
Qiao Longfei 44debca844
Merge pull request #14589 from jacquesqiao/refactor-prefetch
7 years ago
phlrain bd94ab0ef3 rename op; test=develop
7 years ago
phlrain 92f5be1d82 remove inputvarname in operator; test=develop
7 years ago
phlrain cf1fe61004 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
phlrain d1a17cadd4 fix cudnn rnn; test=develop
7 years ago
Tao Luo 20120d9c97
Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
7 years ago
Qiao Longfei 3e45a5a5ec lookup_table gpu kernel support prefetch
7 years ago
qingqing01 731d45a39a
Enable BatchNorm to use global mean and variane during training (#14630)
7 years ago
Tao Luo ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
7 years ago
Qiao Longfei 3a3cfc2d8d prefetch support gpu
7 years ago
Qiao Longfei 4b9082a4cd follow comment
7 years ago
chengduo 6776e92846
refine tensor_array_write_read (#14643)
7 years ago
Jacek Czaja 48e1b97e8e - Coding style fixes
7 years ago
Qiao Longfei d32de7e6e1 fix code format test=develop
7 years ago
Qiao Longfei 5a660aee7d update log level in parameter prefetch test=develop
7 years ago
Qiao Longfei 8ebde595c9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
7 years ago
Qiao Longfei b9d3d75fc4 fix prefetch dependency test=develop
7 years ago
Qiao Longfei 145c535750 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
7 years ago
minqiyang 9d7c3b18c0 Polish code
7 years ago
minqiyang 2b430adaee Polish code
7 years ago
minqiyang a02ce58f2c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
7 years ago
Jiabin Yang 12e1719f96
Merge pull request #14352 from JiabinYang/enhance_hierachical_sigmod_op
7 years ago
Qiao Longfei 40f68b1349 unit test ready
7 years ago
Qiao Longfei 36e26a53b0 Optimize bilinear tensor product op (#14485)
7 years ago
Tao Luo 4ec9de0122
Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul
7 years ago
Qiao Longfei 35b79ab865
Merge pull request #13983 from jacquesqiao/add-ctr-reader
7 years ago
Qiao Longfei da387720d7 fix infer compile test=develop
7 years ago
Jacek Czaja cf40daee58 - Building fix to softmax for inference
7 years ago
Clementine 6c71c1f8f9 Add activation gelu (#14569)
7 years ago
Michal Gallus 9455be0ba5 EltwiseMul: Extract StringToFormat to MKLDNN helper
7 years ago
Jacek Czaja 1540df51cf - Fix to test_conv2d_transpose_mkldnn for GPU
7 years ago
JiabinYang eda069068d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
7 years ago
JiabinYang a08dc83eb0 remove arg 'non_leaf_num', test=develop
7 years ago
chengduo 6648f5ed6f
add ShareLoD for dropout_grad (#14616)
7 years ago
JiabinYang c469334cfb polish python code and comment, test=develop
7 years ago
Qiao Longfei 92afbb923c fix compile problem test=develop
7 years ago
Qiao Longfei 97cbec9b74 clean code
7 years ago
Qiao Longfei 1edd435da6 fix ci problem test=develop
7 years ago
JiabinYang 87648f8edf merge develop, test=develop
7 years ago
wopeizl db9284ecde
Merge pull request #14617 from wopeizl/windows/online
7 years ago
JiabinYang c3c3c0b33c polish code, test=develop
7 years ago
Jacek Czaja 8bfa1fa9bb - ASUM MKL integration
7 years ago
phlrain 487ee36aec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
tangwei12 56a4912b76
Make NCE_OP more efficient and support SelectedRows (#14469)
7 years ago
liuhongyu 1ffe41d722 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
7 years ago
Qiao Longfei 9589babe12 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
7 years ago
liuhongyu 05917c3c79 add cudnn lstm; test=develop
7 years ago
Qiao Longfei f35f3fe77a ctr reader can not be used in windows
7 years ago
peizhilin 6a85dd3278 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin 38715e6fd0 minor fix
7 years ago
Qiao Longfei 6bef565dac clean code test=develop
7 years ago