Commit Graph

24955 Commits (75d1571995edd5efdd31288563fc43bce4cd458b)

Author SHA1 Message Date
Jiabin Yang 1ce0a09e60 fix con2d transpose bias by create and init it in build_once () 6 years ago
Yi Liu 4ef6b8457a adapte fleet api for localsgd and support nccl comm configuration in executor () 6 years ago
tangwei12 65c7368400 Fix the correctness of async mode at distributed training () 6 years ago
Tao Luo 61389ae5aa make PADDLE_ENFORCE ci check rule more robust () 6 years ago
baojun 6421c61ae2 Update ngraph engine for multiple threading () 6 years ago
chengduo e26411cec2 Open test_parallel_dygraph_se_resnext () 6 years ago
Zeng Jinle caf59d0f3f Add signal message to stderr () 6 years ago
Yi Liu efb05ba258 supports multiple NCCL communicators preserved in NCCLCommContext () 6 years ago
Huihuang Zheng 56dd76538c Delete useless ex-scope in recurrent op () 6 years ago
wopeizl b8aa37d529 save the callstack information to file when exception throws test=dev… () 6 years ago
xsrobin 3f392fd4bc test=develop () 6 years ago
Aurelius84 a9cd513680 improve sequence_conv api doc () 6 years ago
zhang wenhui 0d7949831b fix fleet_desc bug && support format for abacus hotstart () 6 years ago
joanna.wozna.intel 2e3ec66be0 Add conv dequant squash for int8 () 6 years ago
vincentXiyu 482ce818bb Support Tensor input with padding for warpctc op () 6 years ago
chengduo bfb6ac816e Fix optimizer bug () 6 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ () 6 years ago
tianshuo78520a 8048992042 add cuda10 support in fast_install.sh and add dynamic get version for release () 6 years ago
liu zhengxi 32598ffd8f Python infer api update and add unit test () 6 years ago
Zeng Jinle 807c7a4747 remove recordio convert in dataset, test=develop () 6 years ago
chengduo 11070cbff9 enabel seresnext reduce test () 6 years ago
Ghost Under Moon 10643b4ea6 fix- raise io error when user load from non-existed dir test=develop () 6 years ago
mapingshuo c2e5eaa27d delete recordio writer () 6 years ago
mapingshuo d5ac87ec22 Lookahead optimizer () 6 years ago
Huihuang Zheng 12d29f4d2a Change TensorCopy in recurrent_op to ShareDataWith () 6 years ago
silingtong123 da127d1110 Optimized error reporting information () 6 years ago
Jiabin Yang 55931db449 fix problem that get_attr method can't using default mode when we call has_attr in dygraph () 6 years ago
tangwei12 19dac67e9f fix distribute transpiler GRPC error code 4, RPC Deadline () 6 years ago
Yibing Liu 5d1575cfe8 Fix arg do_model_average in param_attr () 6 years ago
Tao Luo c82280e445 remove unused conv_elementwise_add2_act_fuse.cc () 6 years ago
zhang wenhui 4a3c4b8fa4 add fleet_desc config feature & multi_sparse table, test=develop () 6 years ago
Jiancheng Li 1799c257ad Update Light-NAS to support latency-aware search () 6 years ago
Zhen Wang 0fe72469ea Add the max-pool2d quantization support and the partial quantization support. () 6 years ago
Leo Chen d49c2bad71 update inplace grad test to new CompiledProgram API, test=develop () 6 years ago
Yibing Liu b2c4f76cf2 Fix sequence mask in dygraph () 6 years ago
chengduo 4278518fb0 Update CompiledProgram () 6 years ago
lidanqing 9240e5325c add local user data conversion into full_pascalvoc_test_preprocess.py () 6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel () 6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace () 6 years ago
Aurelius84 0d29cf18f4 Supports diagonal initialization in uniform_random op () 6 years ago
chengduo 5a579df9ba [Speedup] Make dygraph data parallel faster () 6 years ago
Tao Luo e3c68bde78 stronger the error message of tensor's mutable_data () 6 years ago
chengduo 6a1632318d Split test_parallel_executor_seresnext to three unit test () 6 years ago
tianshuo78520a 188a5caf2e Split and enhance assert_api_spec_approvals () 6 years ago
chengduo a8a9823dae add memory profiler () 6 years ago
Zeng Jinle 561232c25a remove is_mem_optimized in Program, test=develop () 6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 () 6 years ago
wangguanzhong 37428952c6 fix generate mask fpn, test=develop () 6 years ago
lidanqing 3fdecc19b7 Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] () 6 years ago
zhaoyuchen2018 5296294dae Fix elementwise performance poor issue () 6 years ago