Commit Graph

14104 Commits (05993c3ff3f8bfce449550e1d02dc0fe48dffaab)

Author SHA1 Message Date
tensor-tang 6057f36208
Merge pull request from tensor-tang/op/embgrad 6 years ago
chengduo c67afb0f76
Fix reshape bug () 6 years ago
Tao Luo 14b4337663
Merge pull request from luotao1/num_threads 6 years ago
sneaxiy 33138a421d remove match check 6 years ago
wopeizl 7fbf52daa3
remove the ignored from is_empty and less_than test=develop () 6 years ago
Zhen Wang 8063b31e2d Reduce redundant code for channel wise dequant op. test=develop 6 years ago
Tao Luo 6375fe45d7
Merge pull request from luotao1/execution_context 6 years ago
Zhen Wang e8f9dac7ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into channel_wise_quant_op 6 years ago
Zhen Wang 806832e091 update the input format of channel wise dequantize op. 6 years ago
jerrywgz f0177a1ed1 refine doc, test=develop 6 years ago
jerrywgz 072eca348a refine doc, test=develop 6 years ago
Kaipeng Deng 6d8771b55c
Merge pull request from heavengate/spectral_norm 6 years ago
sneaxiy 814a759061 merge develop 6 years ago
sneaxiy 597dc65e76 enhance gc 6 years ago
liuwei1031 caadd0581d
add IfElse test case for ir memory optimize () 6 years ago
luotao1 06aab1b493 refine SetCpuMathLibraryNumThreads 6 years ago
baojun da45fbdaf5 fix tanh typo test=develop () 6 years ago
whs 0f99d24083
Make sequence_erase op support for input with multi-level LoD. () 6 years ago
Zhen Wang 89dee160d1 add channel wise dequantize op. 6 years ago
Tao Luo f4587789d8 remove legacy function in ExecutionContext 6 years ago
jerrywgz b4f5180299 fix doc, test=develop 6 years ago
jerrywgz 21e0d35ce3 fix formula, test=develop 6 years ago
jerrywgz d1901f27bc refine doc 6 years ago
jerrywgz a1ef7df865 refine code, test=develop 6 years ago
tensor-tang 12eb9aecde Merge remote-tracking branch 'ups/develop' into op/embgrad 6 years ago
jerrywgz e64921c79a fix API.spec,test=develop 6 years ago
jerrywgz d497bd9079 resolve conflict, test=develop 6 years ago
jerrywgz 41471d28ac add box_coder_and_assign, test=develop 6 years ago
lidanqing 02c106c717 MKLDNN: Add UT for conv_transpose_mkldnn op. () 6 years ago
sneaxiy 7e5a4a3d63 test=develop 6 years ago
dengkaipeng 3eab9e4b95 fix statement. test=develop 6 years ago
dengkaipeng e37f5ab5b1 fix API.spec. test=develop 6 years ago
dengkaipeng 54bbbfa71f fix doc statement. test=develop 6 years ago
dengkaipeng c1a69e3ea0 refine doc. test=develop 6 years ago
dengkaipeng 65d375a09f fix format. test=develop 6 years ago
dengkaipeng 82d514345c fix spectral_norm doc. test=develop 6 years ago
dengkaipeng 2ea5843cbf add doc and test_layers. test=develop 6 years ago
dengkaipeng 037855f42d fix attr dim calc. test=develop 6 years ago
dengkaipeng 70dbd59839 add grad kernel for spectral_norm. test=develop 6 years ago
dengkaipeng 72509ec3bd add unittest for spectral_norm. test=develop 6 years ago
dengkaipeng 3bf1ae9b59 add spectral_norm forwarn kenel 6 years ago
Zhen Wang 545247d7b4 add channel wise quantize op. 6 years ago
sneaxiy f0634da4b5 test=develop 6 years ago
ceci3 44a4ac0f8c fix API.spec and testfile 6 years ago
tensor-tang b16dabd7e0 refine vbroadcast jitcode 6 years ago
tensor-tang c2e56e6bbc Merge remote-tracking branch 'ups/develop' into op/embgrad 6 years ago
ceci3 3b96aa0839 conflict fix 6 years ago
ceci3 06d8e1a15d test=develop 6 years ago
chengduo 92438f6132 Revert "Add Event for TensorCopy" () 6 years ago
baojun 742839f8f4 fix cpplint test=develop () 6 years ago
chengduo d4b461eb10 Unified ParallelExecutor and Compiler () 6 years ago
chengduo 06f3c8575d Add Event for TensorCopy () 6 years ago
Tink_Y 8949a94691 refine image_resize annotation () 6 years ago
tangwei12 7b0875e9f8 add op type in check nan/inf () 6 years ago
Yiqun Liu 2bdf44641c Add the include of cudnn.h to enable the use of CUDNN_VERSION. () 6 years ago
Yiqun Liu b94307a919 Revert "Optimize while_op when is_test is true. ()" () 6 years ago
flame eeb70edd9a add anakin fc op converter () 6 years ago
minqiyang ab5a648481 Add missing headers 6 years ago
minqiyang 94c8ce3f13 reduce ut time 6 years ago
Yiqun Liu c90b82a637 Fix error in CUDA kernel of beam_search. () 6 years ago
minqiyang 3723dcc301 Polish code 6 years ago
flame afc3fcd509 anakin subgraph engine () 6 years ago
minqiyang 212242c4e4 Polish code 6 years ago
Yiqun Liu 1b10a7843c Optimize while_op when is_test is true. () 6 years ago
xiaolil1 91838c3214 Optimize Quantize Op with primitive reuse. () 6 years ago
luotao1 1c58eee9b2 refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 6 years ago
minqiyang 3f4aeed57f Polish code 6 years ago
minqiyang b754bf30fb Reset output var's pre_op pointer when op was destructed 6 years ago
baojun ac72bcd065 Added adam op test=develop () 6 years ago
mozga-intel b29acec815 Register sum operator () 6 years ago
dzhwinter 4449e85528 polish cudnn related code and fix bug. () 6 years ago
Xin Pan 8e094f7117 polish 6 years ago
Xin Pan 90b17d28ec have no time for cmake/externel 6 years ago
mozga-intel 06a7f741f0 The flag of mkldnn is enabled iff it is necessary 6 years ago
baojun-nervana 320b27988c added concat op test=develop 6 years ago
minqiyang b71af29fb4 Remove var op deps in imperative mode 6 years ago
Tao Luo 690be0bb09 fix cpplint error of async_executor.h 6 years ago
Tao Luo 6e87843e26 enable cpplint, remove go_fmt 6 years ago
tensor-tang 0eefad0a2d fix jitcodekey and refine test 6 years ago
tensor-tang ce4cc482a4 add sgd jitcode and op test 6 years ago
tensor-tang 1bfc565ffe add benchmark and mkl sgd implement 6 years ago
shippingwang a0834044fc add API.spec. test=develop 6 years ago
shippingwang 7d4feb2fc5 fix api.spec, test=develop 6 years ago
minqiyang 9035887bc9 Add gperftools into imperative tracer 6 years ago
Yihua Xu b48d56e87f Optimize gelu operation with mkl erf. 6 years ago
xiaoli.liu@intel.com f8cbc4f34b Optimize INT8 DeQuantize Op with primitive reuse. 6 years ago
minqiyang 701af43958 Fix bugs 6 years ago
baojun-nervana dea34134e8 Update ngraph version to v0.14 test=develop 6 years ago
minqiyang f1a2d20430 invoke backward_hooks after reduce op's depcounts map 6 years ago
minqiyang e0a2b472f4 Move ClearBlock into OpBase and VarBase's destructor 6 years ago
minqiyang 9abf40c9e2 Add imperative python tracer 6 years ago
tensor-tang 92f3cf42cb enable sgd jitkernel refer code and test 6 years ago
shippingwang 13e891516b add cosine decay op, test=develop 6 years ago
jerrywgz b2ce832021 change default option related to softmax, test=develop 6 years ago
chengduo e2da3a5b22
Revert "Add Event for TensorCopy" () 6 years ago
sneaxiy a9ea99d700 merge develop 6 years ago
baojun 9aaea38c0a fix cpplint test=develop () 6 years ago
tianshuo78520a 26e3842d40 Update detection API add new check document () 6 years ago
chengduo ae37f82964
Unified ParallelExecutor and Compiler () 6 years ago
chengduo 7235fd662b
Add Event for TensorCopy () 6 years ago
Tink_Y 31d830de9f refine image_resize annotation () 6 years ago
nhzlx 3c40cb767b 7 refine zero copy 6 years ago
tensor-tang 641b3cccce add vbroadcast mkl code and jitcode 6 years ago
tensor-tang 41a1270856 add vbroadcast jitkernel refer code and use it 6 years ago
tensor-tang 867e93b21a add jitkernel vcopy and speedup unit test time 6 years ago
tangwei12 6d5a04c1e7
add op type in check nan/inf () 6 years ago
Qiyang Min 187cffd019
Merge pull request from velconia/imperative_backward_hooks 6 years ago
Yiqun Liu 1616c32acf
Add the include of cudnn.h to enable the use of CUDNN_VERSION. () 6 years ago
jerrywgz c31da7899a refine code, test=develop 6 years ago
Yiqun Liu 798925453e
Revert "Optimize while_op when is_test is true. ()" () 6 years ago
flame b187e3728e
add anakin fc op converter () 6 years ago
minqiyang e5f3435dd5 Add missing headers 6 years ago
minqiyang fa1ff1d2f1 reduce ut time 6 years ago
Yiqun Liu 87248281f7
Fix error in CUDA kernel of beam_search. () 6 years ago
Tao Luo c494f64a0f
Merge pull request from mozga-intel/mozga-intel/enable_mkldnn_framework 6 years ago
jerrywgz e8a8fe07e7 fix code for windows CI, test=develop 6 years ago
jerrywgz 149411762a add gpu kernel, test=develop 6 years ago
Tao Luo 4efdebc6f6
Merge pull request from yihuaxu/develop_2c5c7b2a7_gelu_mkl_opt 6 years ago
tensor-tang e5f9d3a47c
Merge pull request from tensor-tang/jit/sgd 6 years ago
Tao Luo e6bab55f1b
Merge pull request from luotao1/infershape_refine 6 years ago
minqiyang 50639fafdb Polish code 6 years ago
ruri 72efef6358
Merge pull request from shippingwang/cosine_decay_op 6 years ago
flame e40d56c3d3
anakin subgraph engine () 6 years ago
minqiyang fe406b98c9 Polish code 6 years ago
Yiqun Liu 613d9d0756
Optimize while_op when is_test is true. () 6 years ago
xiaolil1 1abddd8d97 Optimize Quantize Op with primitive reuse. () 6 years ago
Tao Luo 7ec97a0a7e
Merge pull request from xiaolil1/dequantize-reuse 6 years ago
nhzlx 2eff3e26b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt 6 years ago
nhzlx 06a088a199 fix comments and fix cpplint 6 years ago
luotao1 34404f9c31 refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool 6 years ago
Xin Pan a63e2a0a4f
Merge pull request from panyx0718/api2 6 years ago
minqiyang f469bb6b36 Polish code 6 years ago
minqiyang ac88c62a5b Reset output var's pre_op pointer when op was destructed 6 years ago
baojun f285191fb3 Added adam op test=develop () 6 years ago
jerrywgz b92ef45fe9
Merge pull request from jerrywgz/refine_softmax_with_cross_entropy 6 years ago
mozga-intel 558f94cd77 Register sum operator () 6 years ago
tensor-tang 58b8231338
added concat op test=develop () 6 years ago
Tao Luo 47d36b2008
Merge pull request from baojun-nervana/ngraph_v14 6 years ago
Qiyang Min 1c9cfb01df
Merge pull request from velconia/imperative_gperftools 6 years ago
jerrywgz 0f652f304c add distribute fpn proposals op, test=develop 6 years ago
dzhwinter 225c11a91f polish cudnn related code and fix bug. () 6 years ago
Tao Luo 6e3624442e
Merge pull request from luotao1/pre_commit2 6 years ago
Xin Pan 0c277ac6e9 polish 6 years ago
ceci3 4b7bf06e1f test=develop 6 years ago
Xin Pan 4d80db838a have no time for cmake/externel 6 years ago
Yiqun Liu 454f4f2140
Rewrite is_empty op to avoid unnecessary data transform. () 6 years ago
xiaolil1 6724be2b0d INT8 Pool kernel Key Creation Optimization. () 6 years ago
xiaoli.liu@intel.com c4187dbd7c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dequantize-reuse 6 years ago
Tao Luo d5a888e15c
Merge pull request from kbinias/kbinias/add-placement-pass-tester 6 years ago
Tao Luo ba90e05281
Merge pull request from jczaja/prv-tensor-mkldnn-ops 6 years ago