Commit Graph

1057 Commits (b587a7f66e2a4c3adebad29376c477cf470b150d)

Author SHA1 Message Date
qingqing01 ca5ea65ad1 Fix reshape op. (#10641)
7 years ago
tangwei12 3c820064de remove overwrite judge to test load
7 years ago
tangwei12 2f4c039e62 rename, modify ckpt structure
7 years ago
Yancey1989 b35ea1a4d6 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
typhoonzero 872e55bce5 remove comments
7 years ago
typhoonzero 373a2e66eb remove comments
7 years ago
tangwei12 461d2fc0d7 rename ckpt -> checkpoint
7 years ago
typhoonzero 7b0c0273f4 update by comments
7 years ago
Tao Luo 8c7d2e2984
Merge pull request #10576 from jczaja/prv-reuse-mkldnn-softmax-primitives
7 years ago
typhoonzero 928418a9ac Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
baiyf 43b6d4f8cb put detection op together (#10595)
7 years ago
Yu Yang 05a96db67f
Merge branch 'develop' into feature/matmul_support_float16_double
7 years ago
yi.wu 5ae0c664b0 fix build and merge develop
7 years ago
yi.wu 6ef60de6f1 update
7 years ago
tangwei12 a1419f1062 test add op declare
7 years ago
tangwei12 5e74db3f2a add build and test make
7 years ago
Jacek Czaja 7bf00c3a93 - First draft of reusing of softmax mkldnn primitives
7 years ago
typhoonzero 7a7d27b33e update op
7 years ago
typhoonzero 0ae726f060 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero f5840d8925 follow comments
7 years ago
tangwei12 d1bd3fdefc add build and test make
7 years ago
tangwei12 802d10cf53 rename cpkt_save_op
7 years ago
tangwei12 dc534fc195 add checkpoint save op test
7 years ago
tangwei12 87a0856384 add checkpoint save op
7 years ago
tangwei12 2a05b3d5a3 delete checkpoint function
7 years ago
typhoonzero 04bde96e4c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
Yu Yang 046405e091
Merge pull request #10486 from reyoung/feature/clean_op_maker
7 years ago
yuyang18 66590a0b88 Fix typo in blas_impl.h
7 years ago
yuyang18 ad2e420623 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/matmul_support_float16_double
7 years ago
tangwei12 e21a72d1b9 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into checkpoint
7 years ago
Siddharth Goyal 283c4dbe57
Add FP16 option in save_combine_op (#10471)
7 years ago
Siddharth Goyal 28a6037bb8
Fix lod check in FP16 test for save_op (#10508)
7 years ago
Wu Yi 61343fbf53
Merge pull request #10531 from typhoonzero/refine_grpc_serde_code
7 years ago
tangwei12 77c6b71ec4 add ckpt to sync loop
7 years ago
yuyang18 27197290dc matmul support float16/double
7 years ago
Yu Yang 705e7345d0
Merge pull request #10449 from reyoung/feature/clean_matmul
7 years ago
yuyang18 613d3ef084 Fix compile error
7 years ago
fengjiayi e15d616e29 Complete the C++ core of 'CustomReader'
7 years ago
yuyang18 ad594b9b70 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
typhoonzero 796a448ce4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_grpc_serde_code
7 years ago
reyoung b0ca371f11 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_op_maker
7 years ago
tangwei12 1fabbbade2 modify const to const &
7 years ago
Yancey1989 b1e5183627 overlap sendop and backward ops
7 years ago
typhoonzero 602aa43322 cast data type
7 years ago
fengjiayi cf3b3d6024 fix warpctc
7 years ago
Kexin Zhao aa2635fe65 clean code
7 years ago
Kexin Zhao cbf502e5d4 fix error
7 years ago
Kexin Zhao 270a87fb66 add load op fp16 mode test
7 years ago
Kexin Zhao eb95417e05 initial commit
7 years ago
tangwei12 568a329c83 add checkpoint util class and implement
7 years ago
typhoonzero a2de156dfa refine serde code
7 years ago
fengjiayi e61a38daa3 init CustomReader
7 years ago
Kexin Zhao 170ac721b6 remove unnecessary tensor copy in save op
7 years ago
Yu Yang 0e78cb69fb Clean OpProtoAndCheckerMaker
7 years ago
Yu Yang fcd31d6161 Follow comments and polish code names
7 years ago
chengduoZH 187e23a79c fix MatMul parameter
7 years ago
Yu Yang 0a13d3c67a Move MatMul to blas_impl.h
7 years ago
typhoonzero a135fec1fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero 0f86397d81 fix build
7 years ago
Yu Yang 3dd01823a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
typhoonzero 17009d0627 workable version
7 years ago
Xin Pan dce0732d5e
Merge pull request #10380 from panyx0718/dist_timeline
7 years ago
Yu Yang c6a6d87f96 Rewrite Matmul, make code cleaner
7 years ago
typhoonzero a529d790b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero 82c61dbde3 fix testing
7 years ago
Xin Pan d1ea74d3b9 follow comments
7 years ago
fengjiayi b708ec0ae1
Merge pull request #10412 from JiayiFeng/correct_TensorCopy_misuse
7 years ago
Kexin Zhao 55e714e0d2 add float16 support to pool3d
7 years ago
Kexin Zhao 8b16927230 add fp16 support to conv3d
7 years ago
Darcy 8f8a4768dc adding device_context to blas deps list (#10420)
7 years ago
dzhwinter a28dffbb0b
Fix/adam float64 (#10407)
7 years ago
typhoonzero 0598a4b366 fix ci
7 years ago
fengjiayi 0c99cd7bbb fix errors in sequence_padding_test
7 years ago
Lei Wang 6418c42148 Travis: fix check style error.
7 years ago
Kexin Zhao 4e3fac4129 fix sign unsigned comparison (#10424)
7 years ago
Siddharth Goyal b65282168c Fix cpplint errors in lstm kernel (#10394)
7 years ago
chengduoZH d36af62c1e wrap_shfl_x_sync
7 years ago
fengjiayi bf99396a04 fix errors in sequence_slice_op
7 years ago
fengjiayi baa9f50da5 fix errors in multiplex_op
7 years ago
fengjiayi 2e617334eb fix errors in lod_reset_op
7 years ago
fengjiayi e309f42293 fix errors in concat_test
7 years ago
Yu Yang 0285a2b95d
Merge pull request #10371 from reyoung/refine_code
7 years ago
typhoonzero d9320dcd94 complete code
7 years ago
Xin Pan 5a9f17f02b clean up
7 years ago
Qingsheng Li 3bb99c4f66
Added auto transform to beam_search_decode_op (#10286)
7 years ago
Abhinav Arora c9f55dfafc
Fix CPPLint issues in /math/detail/gru_kernel.h (#10390)
7 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Kexin Zhao ccc594e4c4
need to copy LoD info (#10392)
7 years ago
Xin Pan 76d8b14bce Add timeline support for distributed training
7 years ago
Yu Yang d0785ce982 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang 815d888468 Clean MatMul
7 years ago
Yu Yang 2abcf37958
Merge pull request #10327 from reyoung/feature/clean_blas
7 years ago
chengduo 54797abd53
Merge pull request #10347 from chengduoZH/replace___shfl_with__shfl_sync
7 years ago
chengduo 62fed4cbb3 fix __shfl_down (#10362)
7 years ago
Yu Yang bc8160350b Fix compile
7 years ago
Yu Yang a6edeb39b3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
typhoonzero 7237323c5d fix compile
7 years ago
dzhwinter f63ff90b03
Fix/fp64 (#10346)
7 years ago
Wu Yi 88d79dfe95
Merge pull request #10292 from typhoonzero/fix_grpc_server_ready_condition
7 years ago
chengduoZH 0cc635497c merge develop
7 years ago
Yu Yang 8a0c7e2e70
Merge pull request #10280 from reyoung/feature/add_stable_test_of_cross_entropy
7 years ago
Tomasz Patejko 4a497b826d MKLDNN implementation of batch normalization (#9904)
7 years ago
chengduo 4fbde42cdf Fix __shfl_down_sync_ of cross_entropy (#10345)
7 years ago
yi.wu 6422c0e4f6 update by comment
7 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
7 years ago
typhoonzero eeed7af5c3 add gen_nccl_id_op
7 years ago
Yu Yang 5e151b2c83 Follow comment
7 years ago
Yu Yang 9a4c1a39f0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_stable_test_of_cross_entropy
7 years ago
Yu Yang caa4027d9d Follow comments
7 years ago
dzhwinter 57be5c6c74
"fix double type error" (#10322)
7 years ago
Qiao Longfei faebadd938
Merge pull request #10228 from jacquesqiao/use-multi-thread-todo-update
7 years ago
Yancey ff99d94197
Merge pull request #10164 from Yancey1989/lookup_sparse_table_op
7 years ago
typhoonzero b3cf429e02 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
typhoonzero e7ac709b4b done
7 years ago
Abhinav Arora 1945b729b6
Fix CPPLint issues with math/sequence_padding (#10317)
7 years ago
chengduo 9bcd9f661b fix cpplint error (#10329)
7 years ago
Yu Yang 4db43c6c9f Naive implement cblas
7 years ago
fengjiayi a1a401eb26 fix
7 years ago
typhoonzero a131c73fcf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
Kexin Zhao 4613aeba0e
Merge pull request #10272 from kexinzhao/save_fp16
7 years ago
Yu Yang 60d6348e69 Revert develop
7 years ago
Yu Yang 86af6bdc81 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
Yu Yang 49dedfad17 Polish code and tests
7 years ago
qiaolongfei d86626df84 optimize log
7 years ago
qiaolongfei ebf0027391 use IOThreadPool to dispatch async update task
7 years ago
qiaolongfei ea372b3452 add more log
7 years ago
Abhinav Arora 738585476d
Fix more CPPLint issues in fluid/operators/math (#10276)
7 years ago
Helin Wang d25fdb0a47 fix build: cuda_helper.h not found
7 years ago
dzhwinter eb6f9dd5de
Feature/cuda9 cudnn7 (#10140)
7 years ago
typhoonzero 008f6df9b2 update
7 years ago
chengduo f61dfeedcc
Merge pull request #10263 from chengduoZH/add_FLAGS_use_deterministic_algo
7 years ago
typhoonzero ef48f3c766 wip
7 years ago
Yu Yang c888e01660 Refactor GEMM in blas
7 years ago
Qingsheng Li 79be1bb3df
Merge branch 'develop' into fix-10026
7 years ago
ktlichkid 48466b4424 auto => auto*
7 years ago
chengduoZH 9fda5c92cd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_FLAGS_use_deterministic_algo
7 years ago
Yu Yang 76174ec0e9 Clean cross entropy and add sync in executor
7 years ago
chengduoZH c5774e3282 add FLAGS_use_deterministic_algo
7 years ago
Abhinav Arora e735359631
Fix more CPPlint issues in fluid/operators/math (#10249)
7 years ago
Kexin Zhao efba1c7dcb address comments
7 years ago
Kexin Zhao 6c88f1ae6e add save op float16 support
7 years ago
qiaolongfei f82cb635cf optimize code, add more log
7 years ago
fengjiayi 71fa3ca9c4
Merge pull request #10232 from JiayiFeng/fix_unittests
7 years ago
qingqing01 76c4ae856f
Fix reshape op. (#10253)
7 years ago
Yancey1989 1a93253f16 fix unittest
7 years ago
fengjiayi 30f9dc92e5 fix errors
7 years ago
ktlichkid 9997c916fc Pull origin
7 years ago
ktlichkid 709a9edd46 Code clean up
7 years ago
fengjiayi 330fa95cbd Follow comments
7 years ago
dyning 4a5bfa89c3 Modify RoI pooling op to use LoDTensor and expose it into Python API (#10208)
7 years ago
Tomasz Patejko e498e1fc56 Adam operator optimized with Eigen (#10229)
7 years ago
Abhinav Arora 83b1a8f6bf
Pending more CPPLint errors in fluid/operators/math (#10243)
7 years ago
baiyf c816121d11 optimized iou_similarity_op (#10231)
7 years ago
fengjiayi bcf260e1e8 fix several unit tests
7 years ago
qiaolongfei b058189941 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into use-multi-thread-todo-update
7 years ago
Qiao Longfei 6d934560c7
Merge pull request #10042 from jacquesqiao/add-async-listen-and-serv-op
7 years ago
Abhinav Arora f457d5da06
Fix more CPPLint errors (#10218)
7 years ago
qiaolongfei 0d491b670a use-multi-thread-todo-update
7 years ago
qiaolongfei 3295f31076 optimize naming
7 years ago
qiaolongfei 46342a2306 delete useless code
7 years ago
fengjiayi 9c7fa6ff69
Merge pull request #10206 from JiayiFeng/blocking_queue_for_reader
7 years ago
qiaolongfei 0264ec3957 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei 63bd38bd74 code optimize
7 years ago
Yancey1989 dccd013bd3 refine distribute transpiler
7 years ago
fengjiayi 8bd34664f1 fix unit test error
7 years ago
fengjiayi 17c51d69d1 fix unit test error
7 years ago
fengjiayi 304b6b7138 Follow comments
7 years ago
Siddharth Goyal 5fe1fe3a27
Fix signed/unsigned comparison warning (#10211)
7 years ago
fengjiayi 4cb63d8451 Remove unnecessary header files
7 years ago
Yancey1989 e393c86c4a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into lookup_sparse_table_op
7 years ago
fengjiayi e057ba6877 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into blocking_queue_for_reader
7 years ago
fengjiayi e2ca42408b Replace Channel in DoubleBufferReader with BlockingQueue
7 years ago
fengjiayi a786611314 Replace Channel in MultiFileReader with BlockingQueue
7 years ago
fengjiayi 1a25f3cd07 Add reader blocking queue
7 years ago
Xin Pan 64babc9aeb
Merge pull request #10189 from reyoung/feature/fix_matmul_bug
7 years ago
Yu Yang 580dad0c2c Fix compile when there is no mkl
7 years ago
Yu Yang 2a06e307d0 Fix batch_gemm bugs
7 years ago
Yancey1989 8aea5cac0a add attr auto_grown_table
7 years ago
gongweibao 2f53cd0a76
Fix beam_search memory leak. (#10185)
7 years ago
ktlichkid 5afc2a9972 Keep up with upstream
7 years ago
Qiao Longfei 63bf82ddea
Merge branch 'develop' into add-async-listen-and-serv-op
7 years ago
Tao Luo 8b2391858f
Merge pull request #10181 from abhinavarora/cpplint_advanced
7 years ago
Wu Yi 3fdfa940be
Merge pull request #10135 from typhoonzero/unify_blocking_queue
7 years ago
Abhinav Arora edd3587e50 Fix CPPLint errors with op_desc
7 years ago
wangyang59 72ee737f3f
Merge pull request #9308 from wangyang59/bilinear
7 years ago
Yang Yang(Tony) 2182ecfbbd
remove duplicated ShareLoD in gru_op and sequence_conv_op (#10149)
7 years ago
gongweibao fc025f5265
Fix memory leak of pserver (#10173)
7 years ago
Yu Yang 4ecc9b7bae
Merge pull request #10166 from reyoung/feature/train_and_test_recordio
7 years ago
chengduo bfbbe19fbb
Merge pull request #10150 from chengduoZH/fix_elementwise_gradient
7 years ago
chengduoZH 0f5d5b1ffc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_elementwise_gradient
7 years ago
qiaolongfei 8081e15774 fix send_recv_op_test
7 years ago
Yu Yang 54ada9449e Add demo for recordio train/test and parallel executor
7 years ago
Tao Luo 44fa823841
Merge pull request #9949 from mozga-intel/mozga-intel/Mul_mkldnn
7 years ago
Yancey1989 e8d802159e add lookup_sparse_table_op
7 years ago
chengduoZH d06c79c7a7 fix elementwise_grad op kernel and add unit test
7 years ago
wangyang59 469a349ae3 polishing after qingqing's comments
7 years ago
wangyang59 7436b36875 make bilinear_op registration up-to-date
7 years ago
wangyang59 4a3c99f334 after rebase
7 years ago
wangyang59 d61738311a remove dropout and nccl test due to frequent CI failures
7 years ago
wangyang59 3e6718e2de simplified include structure
7 years ago
wangyang59 d87ac4de34 GPU of bilinear_interp_op done
7 years ago
wangyang59 ad3b3d9dc1 ported old paddle gpu bilinear_interp
7 years ago
wangyang59 67ce586453 gpu implementation of bilinear interp
7 years ago
wangyang59 f67f0cae50 finished testing cpu bilinear_interp_op
7 years ago
wangyang59 c7cd6d130b cpu implement of bilinear interp
7 years ago
qiaolongfei 3503c47f9a listen and serv default sync mode
7 years ago
fengjiayi 9f11da5931 Add synchronous TensorCopy and use it in double buffer
7 years ago
ktlichkid 64509fd93b Style fix
7 years ago
ktlichkid 294b58a9ba Changed registered type
7 years ago
ktlichkid df80b6ea8c Added InferVarType
7 years ago
ktlichkid f57efeb6d1 Added GetExpectedKernelType and Debug message
7 years ago
ktlichkid 6f06b32258 Added GetExpectedKernelType and Debug message
7 years ago
qiaolongfei a29e352b80 optimize code
7 years ago
qiaolongfei a0ced3df82 async update can run
7 years ago
typhoonzero 251e4a8ee5 unify fluid blocking queue
7 years ago
qiaolongfei 42a15a43b7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei 63055a3e08 complete grad_to_id
7 years ago
Yancey1989 8023c6d749 Create sub socpe when it is necessary
7 years ago
qiaolongfei 4b86b49ecd Merge branch 'fix-build-activation_op' of ssh://github.com/jacquesqiao/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei 108e71cc94 fix build activation_op.cc on mac
7 years ago
qiaolongfei c6937abdd1 tmp
7 years ago
qiaolongfei 1d75674614 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
Siddharth Goyal cb7f096da1 Fix cpplint error in mkldnn_activation (#10105)
7 years ago
Qiao Longfei 7a993ee4f7
Merge pull request #10080 from jacquesqiao/refine-listen-and-serve-op
7 years ago
Yu Yang f2e400d65b Revert "accelerate dropout (#9902)" (#10082)
7 years ago
qiaolongfei 0763ae9a1a remove unused file
7 years ago
qiaolongfei dc3d2dc8ff rename grad_map to grad_to_id
7 years ago
qiaolongfei 260bf5aceb add sync_mode
7 years ago
qiaolongfei 63fbdcf979 update send_recv_op_test
7 years ago
qiaolongfei e2ace032ae rename RunAsyncUpdate to RunAsyncLoop
7 years ago
qiaolongfei f997c9b702 Merge branch 'refine-listen-and-serve-op' of ssh://github.com/jacquesqiao/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei 0f5a9cc9fc change RunSyncUpdate to RunSyncLoop
7 years ago
ktlichkid df70d5f1ce Fixed some bugs
7 years ago
qiaolongfei 0a881a1ecf init RunAsyncUpdate
7 years ago
qiaolongfei 36083018c1 Merge branch 'refine-listen-and-serve-op' of ssh://github.com/jacquesqiao/Paddle into add-async-listen-and-serv-op
7 years ago
Yu Yang f738691777
Merge pull request #9740 from dzhwinter/memory/activation
7 years ago
qiaolongfei d144dba4a1 simplify code
7 years ago
qiaolongfei 9c2d7df8ad optimize code
7 years ago
qiaolongfei 8f7c77309d refine listen_and_serv_op
7 years ago
qiaolongfei 1e30c41e7b add split string
7 years ago
qiaolongfei d002aa7abf update
7 years ago
qiaolongfei a39e607798 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
Qiao Longfei ba927b8811
Merge pull request #10060 from jacquesqiao/update-variable-response
7 years ago
Abhinav Arora 324ab7a39a
Fix CPPLint issues with select_op (#10072)
7 years ago
Siddharth Goyal 122141249d Fix cpplint for print_op (#10070)
7 years ago
Abhinav Arora 8113de9425
Fix more CPPLint errors (#10069)
7 years ago
qiaolongfei 65b3138e98 add check
7 years ago
Qiao Longfei bb4b9af7d4
Merge pull request #10056 from typhoonzero/fix_splitbyref_macbuild
7 years ago
ktlichkid d060b5dfac Registered beam search op
7 years ago
ktlichkid b94c518884 Implemented BeamSearchKernel
7 years ago