Commit Graph

2724 Commits (ecd9f330c9ae86414ef31e6808f87a48ce99dac3)

Author SHA1 Message Date
Jacek Czaja ecd9f330c9 [MKL-DNN] Fix to face model on AVX512 platforms (#19282)
7 years ago
Liufang Sang 9dde564097 change var name padding_num to padding_value (#19498)
7 years ago
Aurelius84 5b5379b32a
Add sequence_topk_avg_pooling Op (#19442)
7 years ago
chengduo 1cdd3b6985
Disable GC in test_parallel_exe_seresnext (#19408)
7 years ago
yaoxuefeng 10ca3f9609
add thread scope stat accurate metrics test=develop (#19480)
7 years ago
chengduo e340df013e
Support feed single persistable variable to PE (#19417)
7 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
7 years ago
Thunderbrook 1fe468d319
support debug each output of each ins (#19004)
7 years ago
Double_V 1d0f04315a fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412)
7 years ago
Jiabin Yang 1ce0a09e60
fix con2d transpose bias by create and init it in build_once (#18968)
7 years ago
Yi Liu 4ef6b8457a
adapte fleet api for localsgd and support nccl comm configuration in executor (#19443)
7 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
7 years ago
chengduo e26411cec2
Open test_parallel_dygraph_se_resnext (#19342)
7 years ago
Yi Liu efb05ba258
supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
7 years ago
Aurelius84 a9cd513680
improve sequence_conv api doc (#19316)
7 years ago
vincentXiyu 482ce818bb Support Tensor input with padding for warpctc op (#19322)
7 years ago
chengduo bfb6ac816e
Fix optimizer bug (#19410)
7 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ (#19349)
7 years ago
liu zhengxi 32598ffd8f
Python infer api update and add unit test (#19353)
7 years ago
chengduo 11070cbff9
enabel seresnext reduce test (#19341)
7 years ago
Ghost Under Moon 10643b4ea6 fix- raise io error when user load from non-existed dir test=develop (#19384)
7 years ago
mapingshuo d5ac87ec22
Lookahead optimizer (#19386)
7 years ago
silingtong123 da127d1110 Optimized error reporting information (#19173)
7 years ago
Jiabin Yang 55931db449
fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328)
7 years ago
zhang wenhui 4a3c4b8fa4
add fleet_desc config feature & multi_sparse table, test=develop (#18827)
7 years ago
Leo Chen d49c2bad71 update inplace grad test to new CompiledProgram API, test=develop (#19359)
7 years ago
chengduo 4278518fb0
Update CompiledProgram (#18919)
7 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
7 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
7 years ago
Aurelius84 0d29cf18f4
Supports diagonal initialization in uniform_random op (#19299)
7 years ago
chengduo 5a579df9ba
[Speedup] Make dygraph data parallel faster (#19280)
7 years ago
chengduo 6a1632318d
Split test_parallel_executor_seresnext to three unit test (#19239)
7 years ago
lidanqing 3fdecc19b7 Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191)
7 years ago
danleifeng 0865b5a9a0 distribute launch : add use_paddlecloud argument (#19273)
7 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
7 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
7 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
7 years ago
zhang wenhui 539c870753
add fl_listen_and_serv &fl_transpiler,test=develop (#19091)
7 years ago
kh2se2013 27e85625b8 add python coverage launch when WITH_COVERAGE=ON (#19264)
7 years ago
chengduo 8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
7 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
7 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
7 years ago
zhaoyuchen2018 0c71c839ec
Fix recurrent op not update grade issue (#18581)
7 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
7 years ago
Yibing Liu 50b1cab122
Add padding support for crf_decoding (#19057)
7 years ago
LielinJiang 1331c9e1f8 fix distributions unittest bug, test=develop (#19012)
7 years ago
lidanqing c548e370f1 UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011)
7 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
7 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
7 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
7 years ago