8548 Commits (d13dd3b6a7ee81d4c106035ec0bad2c581ea795c)

Author SHA1 Message Date
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago
tensor-tang c00843f4e8 enable multi-threads
7 years ago
tensor-tang 400f5e7c3c add threads test
7 years ago
tensor-tang ce20dfa236 enable more choices
7 years ago
tensor-tang 602e28bf1c use the actual data
7 years ago
tensor-tang 1b8b253ec1 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 98fb8e58fd test infer nlp
7 years ago
Tao Luo ec7f1e99ab Merge pull request #10907 from reyoung/feature/remove_cpplint_while_compile
7 years ago
Tomasz Patejko 3e876b3e49 MKL optimized elementwise add: fix style check
7 years ago
Tomasz Patejko 9241011b31 MKL elementwise add backward: backward works for integral types with fall back to default impl
7 years ago
Tomasz Patejko fde47aae61 MKL elementwise add backward: grad inputs copied when they are not null
7 years ago
Tomasz Patejko 996d12f16d MKL optimized elementwise add backward: coding style fixes
7 years ago
Tomasz Patejko 5a622c2906 MKL elementwise add backward: Initial implementation with vector copy
7 years ago
Tomasz Patejko 01fb2be908 MKL elementwise add: default implementation used for integral types, float16 and/or GPU
7 years ago
Tomasz Patejko 6f932482f4 MKL elementwise_add: BLAS version compiles with integral types
7 years ago
Tomasz Patejko e43c8f33cd MKL elementwise add: elementwise_add uses vAdd VML function when MKL is used
7 years ago
Tao Luo 174d884d2c Merge pull request #10904 from luotao1/ci
7 years ago
yuyang18 a229734cbd Remove cpplint in cmake
7 years ago
Yan Chunwei b1d446856c fix inference api (#10867)
7 years ago
Guo Sheng 6d2deedf1d Merge pull request #10814 from guoshengCS/fix-ElementwiseOpInferVarType
7 years ago
Luo Tao 654229b232 refine fluid deploy on teamcity
7 years ago
Yu Yang d406c76a30 Merge pull request #10744 from reyoung/feature/refine_parallel_executor
7 years ago
Yu Yang 8653cf3004 Merge pull request #10656 from reyoung/feature/support_op_role
7 years ago
Xi Chen e68d196097 update cudnn version to cuda version
7 years ago
Xi Chen 1087bb0ce3 fix cudnn9 production dockerfile
7 years ago
Xin Pan 0d598cf9f6 Merge pull request #10822 from panyx0718/dist_opt
7 years ago
tangwei12 397a69d942 Merge pull request #10532 from seiriosPlus/checkpoint
7 years ago
Yibing Liu 20bdc3e109 Merge pull request #10846 from kuke/deconv_group
7 years ago
fengjiayi 530556dd97 Merge pull request #10864 from JiayiFeng/dev_expose_random_gen
7 years ago
Xin Pan 2643868c66 follow comments
7 years ago
Yibing Liu 4bafbf411c Enable groups for conv3d transpose op
7 years ago
Xin Pan 08e4970e45 follow comments
7 years ago
yuyang18 fcbf19bf93 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18 ad6a8715ff Disable unstable unittests
7 years ago
Wu Yi c8919d8ee2 Merge pull request #10821 from typhoonzero/use_pinned_memory
7 years ago
Yan Chunwei 1153144fbb Inference analysis/init data flow graph analysis (#10776)
7 years ago
fengjiayi c2436f2ca5 expose random_data_generator
7 years ago
Yu Yang a9f9fbadd9 Merge pull request #10832 from typhoonzero/fix_server_port_file_conflict
7 years ago
yuyang18 9ab12df62d Disable no good tests
7 years ago
yuyang18 3b04f0099c Fix bug
7 years ago
whs 8655904be2 Enhance reduce op (#10708)
7 years ago
Yibing Liu adbf97b42d Merge branch 'develop' of upstream into deconv_group
7 years ago
yuyang18 39a44e3834 Add Debug log
7 years ago
Yibing Liu 6e13c86dff Enable multiple groups for cudnn conv transpose
7 years ago
Lei Wang 01c0a81b4d Dockerfile: add --allow-downgrades option when install libnccl. (#10850)
7 years ago
Lei Wang faedee0d7a Dockerfile: fix production image libnccl install error. (#10848)
7 years ago
yuyang18 fb370f4411 Refine code
7 years ago
yuyang18 53a250e45e Disable hanged tests
7 years ago