Commit Graph

373 Commits (7c041e48f48e111b3ab0d5876c2b419e90c7b591)

Author SHA1 Message Date
Tao Luo b62874429d
Merge pull request #8910 from Xreki/core_inference_profile
7 years ago
Yu Yang 43d09a1c5f Extract Prepare from Executor
7 years ago
Yu Yang 225efa671f Remove dims in base class
7 years ago
QI JUN f7e9fe57d3
[Memory]More memory optimization policy (#8690)
7 years ago
Yu Yang 2ea4a5d96c Polish double buffer reader
7 years ago
kexinzhao 607eec30a8
Merge pull request #8946 from kexinzhao/fix_cuda_arch_fp16
7 years ago
Yancey b5ef315cf1
Fix dist compile error (#8987)
7 years ago
qingqing01 b3d26cd3ad
Fix bug in detection_output and mAP calculation in SSD. (#8985)
7 years ago
Yu Yang 46ae4075ee Polish ShuffleReader and test
7 years ago
Kexin Zhao c88f58dbd8 add comment
7 years ago
chengduoZH f1c3ecb2b2 add concat rows
7 years ago
chengduo 685f03762e
Merge pull request #8890 from chengduoZH/feature/fix_bug_of_elementwise
7 years ago
Kexin Zhao 3b44b849d3 address comments
7 years ago
fengjiayi dd1244f3c9
Merge pull request #8943 from JiayiFeng/fix_bugs_in_readers
7 years ago
Yu Yang 7eedced82a Polish RecordIO
7 years ago
Yu Yang cfca8a3a26 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Yu Yang fea43077f6 Refine
7 years ago
pzelazko-intel 4730a4be24 MKLDNN pool2d OP kernel added (#8879)
7 years ago
Kexin Zhao 95de7617eb fix bug
7 years ago
Kexin Zhao 1998d5afa2 add gpu info func to get compute cap
7 years ago
Kexin Zhao d400b4192d fix math function arch mismatch for older GPU
7 years ago
fengjiayi 614c33fb3a fix a potential bug in the c++ reader
7 years ago
chengduoZH 1509ce6638 enhancement look_up_table
7 years ago
fengjiayi aa3f5058d3
Merge pull request #8841 from JiayiFeng/dev_double_buffer_for_cpp_reader
7 years ago
QI JUN b341bac7e1
Refine cast op (#8923)
7 years ago
Yancey 8468037918
Fix sparse update memory error for distributed training (#8837)
7 years ago
fengjiayi 35e1e0d521 uses channel to replace the traditional buffer
7 years ago
fengjiayi b3a11fdf3a Merge branch 'rm_reader_HasNext' into dev_double_buffer_for_cpp_reader
7 years ago
fengjiayi 6e5736e270 fix a compile error
7 years ago
fengjiayi 4e517881f7 remove HasNext
7 years ago
Liu Yiqun a8e8507767 Refine the profile codes for inference.
7 years ago
武毅 9dd34e4169 update unpushed commits for zerocopy grpc (#8900)
7 years ago
zhouhanqing 9d78971d8b Some comments have been modified.
7 years ago
Xin Pan b825c79261
Merge pull request #8897 from panyx0718/message
7 years ago
zhouhanqing 3ca968441d Merge branch 'develop' into Paddle-ReduceProd
7 years ago
kexinzhao 90215b7844
Add float16 GEMM math function on GPU (#8695)
7 years ago
武毅 45af8c1e99 Performance/zero copy variable seriralization (#8839)
7 years ago
Xin Pan 9a27d3af23 Print exception message from threads
7 years ago
chengduoZH 53d19f5b1e Add ElementwiseOpInferVarType
7 years ago
qingqing01 ffda2c414d
Clipping bbox in the mAP evaluator calculation. (#8872)
7 years ago
Yiqun Liu fecc9a38c6
Add test for nested RecordEvent. (#8773)
7 years ago
Xin Pan a9b9ec45ab
Merge pull request #8775 from panyx0718/test2
7 years ago
Yu Yang 9d4c93a0a7 Fix CI
7 years ago
chengduo abb10556e8
Merge pull request #8859 from chengduoZH/feature/refine_exe_log
7 years ago
Yu Yang b536799af0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/recordio_file_reader
7 years ago
Xin Pan 30e556d675 Use vlog instead.
7 years ago
Yu Yang db46778bdd Polish codes and comments
7 years ago
Yu Yang 5cb79524d2 Fix CI
7 years ago
QI JUN 47ca1814f3
fix mac build error (#8856)
7 years ago
chengduoZH f7c7135673 Add log before op Run
7 years ago
chengduo f3cdeb9a29
Merge pull request #8820 from chengduoZH/feature/refine_elementwise_
7 years ago
Xin Pan eb46845313 Add warning
7 years ago
Yiqun Liu a032f56f7c
Add profiling information for inference example (#8748)
7 years ago
qingqing01 ded34b2c0f
Fix detection_map_op for multi-device. (#8845)
7 years ago
kexinzhao 7f00716c87
Add context wait in type_transform (#8850)
7 years ago
Tao Luo 6f50dee4d5 compile and install the static library of fluid inference (#7827)
7 years ago
Yu Yang 72be7a6151 Complete RecordIO reader op
7 years ago
fengjiayi b1f647fd6d fix errors
7 years ago
zhouhanqing 732eebb286 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into Paddle-ReduceProd
7 years ago
zhouhanqing 15306ffdc3 add product reduction for reduce_op
7 years ago
fengjiayi e8d21b6349 fix an error
7 years ago
fengjiayi 4fb7b96756 Add basic double buffer reader
7 years ago
Luo Tao 49f3f1db07 add back framework_proto depends
7 years ago
Luo Tao 3ddc997182 rename concat_functor to concat, refine CMakeLists based on comments
7 years ago
Luo Tao 1ef97fa7b1 Merge branch 'develop' into math_function
7 years ago
Yu Yang bcb80756af Add Writer/Scanner
7 years ago
chengduo 84aea8a8a1
Merge pull request #8669 from chengduoZH/feature/concat_op
7 years ago
pzelazko-intel 8c71adaa8c MKLDNN conv2d kernel added (#8451)
7 years ago
Yan Chunwei 049383c615
add inplace to reshape (#8747)
7 years ago
fengjiayi 77200a7057 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_double_buffer_for_cpp_reader
7 years ago
fengjiayi af64f39bfd fix compile errors
7 years ago
Yu Yang 42e65a20c6
Merge pull request #8791 from reyoung/feature/extract_reader_ops
7 years ago
Yu Yang 10343123e3 Merge branch 'feature/extract_reader_ops' into feature/recordio_file_reader
7 years ago
Yu Yang f1d61e675c Add magic number in recordio
7 years ago
chengduoZH 8b30fadac3 refine elementwise sub,div,min,max
7 years ago
Yu Yang 4690b9c987 FIX CI
7 years ago
chengduo c43995ed36
Merge pull request #8810 from chengduoZH/feature/refine_elementwise_mul
7 years ago
kexinzhao 266ccaa843
Integrate float16 into data_type_transform (#8619)
7 years ago
Abhinav Arora 78c884d7a7
Redesign channel implementation for Select Op (#8814)
7 years ago
chengduoZH a1331f9877 refine elementwise_mul_op
7 years ago
fengjiayi 3fcd16ede3 init double buffer
7 years ago
qingqing01 0e1f82fd2a
Fix bug in detection mAP evaluator. (#8778)
7 years ago
Yu Yang 4d8345e3ac Extract create_reader_op to three files
7 years ago
Yu Yang 82b149cd29
Merge pull request #8780 from reyoung/feature/recordio
7 years ago
whs f608bb2756
Merge pull request #8651 from wanghaoshuang/fix_pool
7 years ago
Xin Pan f10152df78 Fix nullptr when doing nested profileing
7 years ago
Xin Pan cf6244c1b8 Improve profiler
7 years ago
Yu Yang 9dc69582de Make recordio simple
7 years ago
Xin Pan c7b7291b9f
Merge pull request #8758 from panyx0718/nccl
7 years ago
Xin Pan a4d68ed3d8 Add lock
7 years ago
kexinzhao 6720681cc2 Enable is_test attr of batch norm and drop out op for test program (#8642)
7 years ago
qingqing01 608feea204
Implement detection mAP evaluator wrapper and unify label format between SSD loss and mAP evaluator (#8736)
7 years ago
chengduoZH c3864eab99 if axis == 0; directly copy D->D
7 years ago
dongzhihong fe18341585 "seperate internal library and exported library"
7 years ago
dongzhihong 7364348d04 "move from recordio repo to paddle"
7 years ago
chengduoZH 131ec276ed fix bug for big number; float->double and code refine
7 years ago
Xin Pan d054cfeae6 Avoid init_nccl for every steps.
7 years ago
dongzhihong 7016979cd5 "add crc32 encoder"
7 years ago
dongzhihong 69c7991108 "add snappy library"
7 years ago
chengduoZH 82bd82c186 follow comments and refine code
7 years ago
Tao Luo b02e12e8e4
Merge pull request #8740 from luotao1/warning
7 years ago
Luo Tao ac149bc948 fix warning: statement is unreachable
7 years ago
qingqing01 457d012a8e
Enable device automatically switching in mine_hard_examples_op. (#8706)
7 years ago
dongzhihong 6540cda109 Merge remote-tracking branch 'origin/develop' into feature/recordio
7 years ago
dongzhihong af5dcda481 "add testing"
7 years ago
chengduoZH 00e596edbe get max threads of GPU
7 years ago
Luo Tao 8c691c8de0 fix only shared variables could be declared as static in the device code
7 years ago
typhoonzero f94a758c16 fix fluid distribute build
7 years ago
Luo Tao f67275a920 refine operator/math/CMakeLists.txt, seperate im2col from math_function
7 years ago
Xin Pan 92974d4186
Merge pull request #8679 from panyx0718/test2
7 years ago
Xin Pan f3cbfc021c Add MEMCPY information
7 years ago
chengduoZH 60e7ee0611 refine concat_op
7 years ago
qingqing01 9e1ec8c919
Enable device switching automatically for serveral operators (#8684)
7 years ago
Xin Pan 55b2d3d032 Add CPU time to the timeline.
7 years ago
Xin Pan 6720198731
Merge pull request #8663 from panyx0718/test2
7 years ago
Abhinav Arora 01536e8dc5
Adding more unit tests for ChannelHolder class (#8668)
7 years ago
chengduo 12a3cea087 Add tuple type (#8519)
7 years ago
Xin Pan 12843a3a53 Firt timeline version
7 years ago
Yu Yang db77006923
Merge pull request #8657 from reyoung/feature/fix_compile
7 years ago
emailweixu eac2c3cf36
Merge pull request #8505 from emailweixu/math_op
7 years ago
Abhinav Arora 0d878e4c09
Add Go_op, Channel_create, channel_close, channel_send and channel_receive ops (#8593)
7 years ago
Yancey 718642e93f
Merge pull request #8659 from Yancey1989/fix_dist_bug
7 years ago
chengduo e9f2033175
Merge pull request #8539 from chengduoZH/feature/refine_elementwise_op_function.h
7 years ago
Yancey1989 7bd16fe13b registry var type infer
7 years ago
Yuan Gao e84615bae6 Fix box coder op (#8647)
7 years ago
wanghaoshuang 0ffec514cd Fix comments
7 years ago
Yu Yang 22b5c07a7d Fix the compilation on CUDA 9.1/GCC 5.3
7 years ago
Yibing Liu ace512a3ed
Merge pull request #8596 from kuke/profiler_multi_gpu
7 years ago
wanghaoshuang 1e510d9914 Add ceil_mode option for pool2d and pool3d
7 years ago
Yibing Liu c0876cf686 update due to upstream's change
7 years ago
dzhwinter 313454dfae "init"
7 years ago
chengduoZH 90dc33b5ff Add todo for reduceSum
7 years ago
chengduo cf883d9caf
Merge pull request #8613 from chengduoZH/feature/fix_bug_conv_op
7 years ago
qingqing01 f449180b1c
Register more data type for reshape operator. (#8617)
7 years ago
chengduoZH a779b42450 follow comments
7 years ago
xuwei06 e9b8ebf42c Correctly handling variable with batch dimension for math ops.
7 years ago
xuwei06 7d56c6d078 Adding Dim<0>
7 years ago
chengduoZH b5c920920c fix conv_op bug
7 years ago
Luo Tao 6dd3a61b2f combine batch_size_like.cc into batch_size_like.h
7 years ago
Xin Pan decaad5ccc
Merge pull request #8542 from panyx0718/test
7 years ago
Tao Luo 1ac31d3d08
Merge pull request #8591 from chengduoZH/feature/refine_cmake_for_cudnn
7 years ago
chengduoZH 62fe2f28a1 follow comments
7 years ago
qingqing01 4948f7b3fe
Enhance bipartite_match_op to support argmax matching after bipartite matching. (#8580)
7 years ago
Yibing Liu ee88855ddf Fix the profiler's bug in multi-gpu mode
7 years ago
Tao Luo dce0383f89
Merge pull request #8404 from Xreki/core_refine_inference
7 years ago
chengduoZH 16fc5e3893 refine cmake for cudnn
7 years ago
武毅 0250e8083d
Merge pull request #8586 from Yancey1989/fix_dist_unittest
7 years ago
Xin Pan 9bbce49353 Fix version date.
7 years ago
Xin Pan b9ec24c6e9 Extend current profiler for timeline and more features.
7 years ago
dzhwinter 34605d2641
accelerate the cuda concat op, avoid many times copy (#8585)
7 years ago