Commit Graph

6753 Commits (6010361c7af1f24b84ac906c71cf8a500e706726)

Author SHA1 Message Date
tensor-tang 6010361c7a add vbroadcast mkl code and jitcode
6 years ago
tensor-tang 2e96da453a add vbroadcast jitkernel refer code and use it
6 years ago
tensor-tang 020540948f add jitkernel vcopy and speedup unit test time
6 years ago
sneaxiy 7e5a4a3d63 test=develop
6 years ago
ceci3 44a4ac0f8c fix API.spec and testfile
6 years ago
ceci3 3b96aa0839 conflict fix
6 years ago
ceci3 06d8e1a15d test=develop
6 years ago
chengduo 92438f6132 Revert "Add Event for TensorCopy" (#16022)
6 years ago
baojun 742839f8f4 fix cpplint test=develop (#16028)
6 years ago
chengduo d4b461eb10 Unified ParallelExecutor and Compiler (#15970)
6 years ago
chengduo 06f3c8575d Add Event for TensorCopy (#15953)
6 years ago
Tink_Y 8949a94691 refine image_resize annotation (#15976)
6 years ago
tangwei12 7b0875e9f8 add op type in check nan/inf (#15986)
6 years ago
Yiqun Liu 2bdf44641c Add the include of cudnn.h to enable the use of CUDNN_VERSION. (#15961)
6 years ago
Yiqun Liu b94307a919 Revert "Optimize while_op when is_test is true. (#15811)" (#15968)
6 years ago
flame eeb70edd9a add anakin fc op converter (#15965)
6 years ago
minqiyang ab5a648481 Add missing headers
6 years ago
minqiyang 94c8ce3f13 reduce ut time
6 years ago
Yiqun Liu c90b82a637 Fix error in CUDA kernel of beam_search. (#15957)
6 years ago
minqiyang 3723dcc301 Polish code
6 years ago
flame afc3fcd509 anakin subgraph engine (#15774)
6 years ago
minqiyang 212242c4e4 Polish code
6 years ago
Yiqun Liu 1b10a7843c Optimize while_op when is_test is true. (#15811)
6 years ago
xiaolil1 91838c3214 Optimize Quantize Op with primitive reuse. (#15929)
6 years ago
luotao1 1c58eee9b2 refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool
6 years ago
minqiyang 3f4aeed57f Polish code
6 years ago
minqiyang b754bf30fb Reset output var's pre_op pointer when op was destructed
6 years ago
baojun ac72bcd065 Added adam op test=develop (#15710)
6 years ago
mozga-intel b29acec815 Register sum operator (#15889)
6 years ago
dzhwinter 4449e85528 polish cudnn related code and fix bug. (#15164)
6 years ago
mozga-intel 06a7f741f0 The flag of mkldnn is enabled iff it is necessary
6 years ago
baojun-nervana 320b27988c added concat op test=develop
6 years ago
minqiyang b71af29fb4 Remove var op deps in imperative mode
6 years ago
Tao Luo 690be0bb09 fix cpplint error of async_executor.h
6 years ago
Tao Luo 6e87843e26 enable cpplint, remove go_fmt
6 years ago
tensor-tang 0eefad0a2d fix jitcodekey and refine test
6 years ago
tensor-tang ce4cc482a4 add sgd jitcode and op test
6 years ago
tensor-tang 1bfc565ffe add benchmark and mkl sgd implement
6 years ago
shippingwang a0834044fc add API.spec. test=develop
6 years ago
shippingwang 7d4feb2fc5 fix api.spec, test=develop
6 years ago
minqiyang 9035887bc9 Add gperftools into imperative tracer
6 years ago
Yihua Xu b48d56e87f Optimize gelu operation with mkl erf.
6 years ago
xiaoli.liu@intel.com f8cbc4f34b Optimize INT8 DeQuantize Op with primitive reuse.
6 years ago
minqiyang 701af43958 Fix bugs
6 years ago
baojun-nervana dea34134e8 Update ngraph version to v0.14 test=develop
6 years ago
minqiyang f1a2d20430 invoke backward_hooks after reduce op's depcounts map
6 years ago
minqiyang e0a2b472f4 Move ClearBlock into OpBase and VarBase's destructor
6 years ago
minqiyang 9abf40c9e2 Add imperative python tracer
6 years ago
tensor-tang 92f3cf42cb enable sgd jitkernel refer code and test
6 years ago
shippingwang 13e891516b add cosine decay op, test=develop
6 years ago