Commit Graph

2717 Commits (11f2f784589c32add30f42c97eaaac2f6c98fe68)

Author SHA1 Message Date
Thunderbrook 1fe468d319
support debug each output of each ins (#19004)
6 years ago
Double_V 1d0f04315a fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412)
6 years ago
Jiabin Yang 1ce0a09e60
fix con2d transpose bias by create and init it in build_once (#18968)
6 years ago
Yi Liu 4ef6b8457a
adapte fleet api for localsgd and support nccl comm configuration in executor (#19443)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
chengduo e26411cec2
Open test_parallel_dygraph_se_resnext (#19342)
6 years ago
Yi Liu efb05ba258
supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
Aurelius84 a9cd513680
improve sequence_conv api doc (#19316)
6 years ago
vincentXiyu 482ce818bb Support Tensor input with padding for warpctc op (#19322)
6 years ago
chengduo bfb6ac816e
Fix optimizer bug (#19410)
6 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ (#19349)
6 years ago
liu zhengxi 32598ffd8f
Python infer api update and add unit test (#19353)
6 years ago
chengduo 11070cbff9
enabel seresnext reduce test (#19341)
6 years ago
Ghost Under Moon 10643b4ea6 fix- raise io error when user load from non-existed dir test=develop (#19384)
6 years ago
mapingshuo d5ac87ec22
Lookahead optimizer (#19386)
6 years ago
silingtong123 da127d1110 Optimized error reporting information (#19173)
6 years ago
Jiabin Yang 55931db449
fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328)
6 years ago
zhang wenhui 4a3c4b8fa4
add fleet_desc config feature & multi_sparse table, test=develop (#18827)
6 years ago
Leo Chen d49c2bad71 update inplace grad test to new CompiledProgram API, test=develop (#19359)
6 years ago
chengduo 4278518fb0
Update CompiledProgram (#18919)
6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Aurelius84 0d29cf18f4
Supports diagonal initialization in uniform_random op (#19299)
6 years ago
chengduo 5a579df9ba
[Speedup] Make dygraph data parallel faster (#19280)
6 years ago
chengduo 6a1632318d
Split test_parallel_executor_seresnext to three unit test (#19239)
6 years ago
lidanqing 3fdecc19b7 Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191)
6 years ago
danleifeng 0865b5a9a0 distribute launch : add use_paddlecloud argument (#19273)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
zhang wenhui 539c870753
add fl_listen_and_serv &fl_transpiler,test=develop (#19091)
6 years ago
kh2se2013 27e85625b8 add python coverage launch when WITH_COVERAGE=ON (#19264)
6 years ago
chengduo 8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
6 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
6 years ago
zhaoyuchen2018 0c71c839ec
Fix recurrent op not update grade issue (#18581)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
Yibing Liu 50b1cab122
Add padding support for crf_decoding (#19057)
6 years ago
LielinJiang 1331c9e1f8 fix distributions unittest bug, test=develop (#19012)
6 years ago
lidanqing c548e370f1 UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
Zeng Jinle c51eb6bb14
remove book_memory_optimization directory, test=develop (#19117)
6 years ago
Zeng Jinle c194b0c835
Try to deprecate unstable python memory optimize (#18983)
6 years ago
hutuxian 5a80cc8431
Datafeed support reading to cuda place directly. (#19071)
6 years ago
chengduo 3f4c088ad8
prune the feed op in compiler (#18997)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago