Commit Graph

2676 Commits (2e76e75517f2afb2933aaf108905b128366b73fd)

Author SHA1 Message Date
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
Zeng Jinle c51eb6bb14
remove book_memory_optimization directory, test=develop (#19117)
6 years ago
Zeng Jinle c194b0c835
Try to deprecate unstable python memory optimize (#18983)
6 years ago
hutuxian 5a80cc8431
Datafeed support reading to cuda place directly. (#19071)
6 years ago
chengduo 3f4c088ad8
prune the feed op in compiler (#18997)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Zeng Jinle 311f90f1eb
reduce_unittest_time,test=develop (#19005)
6 years ago
Liufang Sang faf6890b6c support tensor input for ctc align op (#18887)
6 years ago
Dong Daxiang c97ea53c3e make listen and server as exclusive run (#18990)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
wawltor 3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique (#18720)
6 years ago
LielinJiang 22fa4c2d24 Fix depthwise conv gpu kernel bug (#18582)
6 years ago
HaoRen 24f8543106 Add center Loss Op Support (#18681)
6 years ago
Dong Daxiang 2bb296dfe9
make dist unit test exclusive run (#18865)
6 years ago
danleifeng e0a2d4dfec Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848)
6 years ago
chengduo ecd2bdada6
add CPUInplaceTestWithFuseOptimizationOps (#18867)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Zeng Jinle 9a8a7a1ddc
fix affine_channel no_need buffer bug, test=develop (#18844)
6 years ago
chengduo 4140fe11a4
Open fuse optimization ops (#18741)
6 years ago
Adam ee02227949 Add LeakyReLU MKLDNN support (#18762)
6 years ago
Zeng Jinle a802da650b
Feature/mem opt pass refactor (#18735)
6 years ago
guru4elephant 2efb282c86
split test_dist_se_resnext.py into 4 testcases (#18743)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
whs 075e1cf78e
Add python API for appending LoD level (#18702)
6 years ago
chengduo 8259f1418f
Enhance backward process (#18700)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Yi Liu 157211c4e1
supports distributed classification (#18690)
6 years ago
Huihuang Zheng a3028bb729
Fix random test_recurrent_op failure (#18718)
6 years ago
Tao Luo bd22453f20
Revert "Add LeakyRelu MKLDNN support (#18656)" (#18723)
6 years ago
guru4elephant ebf9797ec3
split different comm method for mnist distributed training (#18715)
6 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
6 years ago
Adam d6b6a337a9 Add LeakyRelu MKLDNN support (#18656)
6 years ago
tangwei12 0b9acb4912
add check of executor (#17986)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
hutuxian bb2f5d24a2
hash_op support int64 hash_size (#18674)
6 years ago
guru4elephant ab57d3893e
make auc op compatible with 1 dim (#18551)
6 years ago
guru4elephant b71b454328 increase timeout again (#18628)
6 years ago
123malin b414645a65
fix #17430: int64类型的attr训练非预期 (#18264)
6 years ago
kh2se2013 9ad57f2dfd 1)change to parallel mode on python coverage run (#18594)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
lujun b6d5c74f69
update dygraph api doc for web (#18550)
6 years ago