Commit Graph

1578 Commits (35e5563695ac9a227a1d78965b417ee45202b457)

Author SHA1 Message Date
typhoonzero a135fec1fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH aff8a26d71 check generated_op_
7 years ago
typhoonzero 0f86397d81 fix build
7 years ago
Luo Tao 53b401d589 refine io_convert and op_convert
7 years ago
chengduoZH 2e5d44f102 fix fetch op
7 years ago
Yu Yang 3dd01823a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
typhoonzero 17009d0627 workable version
7 years ago
Xin Pan dce0732d5e Merge pull request #10380 from panyx0718/dist_timeline
7 years ago
Xin Pan 0c518888fa Merge pull request #10430 from panyx0718/infer
7 years ago
Yu Yang c6a6d87f96 Rewrite Matmul, make code cleaner
7 years ago
typhoonzero a529d790b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
Yan Chunwei 2a2c83b9e6 feature/convert tensorrt io (#10440)
7 years ago
typhoonzero 82c61dbde3 fix testing
7 years ago
Xin Pan 9fccf46270 reword comments
7 years ago
Xin Pan d1ea74d3b9 follow comments
7 years ago
fengjiayi b708ec0ae1 Merge pull request #10412 from JiayiFeng/correct_TensorCopy_misuse
7 years ago
Kexin Zhao 55e714e0d2 add float16 support to pool3d
7 years ago
Kexin Zhao 8b16927230 add fp16 support to conv3d
7 years ago
Yiqun Liu fd1971caa0 Add the call of DropKids at the end of executor.Run to delete the local scopes created in operators (#10403)
7 years ago
chengduo 99acf1da4c Merge pull request #10351 from chengduoZH/feature/update_sparse_parameter
7 years ago
Darcy 8f8a4768dc adding device_context to blas deps list (#10420)
7 years ago
dzhwinter a28dffbb0b Fix/adam float64 (#10407)
7 years ago
Xin Pan cdd52f3a30 Add comment to explain how to run inference test
7 years ago
typhoonzero 0598a4b366 fix ci
7 years ago
typhoonzero 3667578ec2 testing
7 years ago
chengduoZH 881e063ee2 follow comments
7 years ago
chengduoZH ff599b9218 use Reduce and Broadcast
7 years ago
chengduoZH 0441c2cc45 fix ci
7 years ago
fengjiayi 0c99cd7bbb fix errors in sequence_padding_test
7 years ago
Lei Wang 6418c42148 Travis: fix check style error.
7 years ago
Kexin Zhao 4e3fac4129 fix sign unsigned comparison (#10424)
7 years ago
Siddharth Goyal b65282168c Fix cpplint errors in lstm kernel (#10394)
7 years ago
chengduo 4558c0ec0a Merge pull request #10414 from chengduoZH/wrap_shfl_x_sync
7 years ago
chengduoZH d36af62c1e wrap_shfl_x_sync
7 years ago
Yancey 2d98a418d7 fix remove op (#10410)
7 years ago
fengjiayi bf99396a04 fix errors in sequence_slice_op
7 years ago
fengjiayi baa9f50da5 fix errors in multiplex_op
7 years ago
fengjiayi 2e617334eb fix errors in lod_reset_op
7 years ago
fengjiayi e309f42293 fix errors in concat_test
7 years ago
Yu Yang 0285a2b95d Merge pull request #10371 from reyoung/refine_code
7 years ago
chengduoZH f9c680c43e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/update_sparse_parameter
7 years ago
typhoonzero d9320dcd94 complete code
7 years ago
chengduoZH 7722baa8e3 follow comments and clean code
7 years ago
Xin Pan 5a9f17f02b clean up
7 years ago
Tao Luo 4646c0f35d Merge pull request #10144 from luotao1/tr_convert_init
7 years ago
Qingsheng Li 3bb99c4f66 Added auto transform to beam_search_decode_op (#10286)
7 years ago
Abhinav Arora c9f55dfafc Fix CPPLint issues in /math/detail/gru_kernel.h (#10390)
7 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Kexin Zhao ccc594e4c4 need to copy LoD info (#10392)
7 years ago
Kexin Zhao 7a86069422 Add float16 demo code and put float16 work in contrib/float16 folder (#10331)
7 years ago
Luo Tao beb1245560 add relu converter and unit-test
7 years ago
Xin Pan 76d8b14bce Add timeline support for distributed training
7 years ago
Yu Yang d0785ce982 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang 815d888468 Clean MatMul
7 years ago
Yu Yang 2abcf37958 Merge pull request #10327 from reyoung/feature/clean_blas
7 years ago
chengduo 54797abd53 Merge pull request #10347 from chengduoZH/replace___shfl_with__shfl_sync
7 years ago
chengduo 62fed4cbb3 fix __shfl_down (#10362)
7 years ago
Yu Yang 9d7279b953 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang bc8160350b Fix compile
7 years ago
chengduoZH e97c1a8ca0 fix __shfl
7 years ago
Yu Yang a6edeb39b3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
typhoonzero 7237323c5d fix compile
7 years ago
dzhwinter f63ff90b03 Fix/fp64 (#10346)
7 years ago
Wu Yi 88d79dfe95 Merge pull request #10292 from typhoonzero/fix_grpc_server_ready_condition
7 years ago
chengduoZH 0cc635497c merge develop
7 years ago
Yiqun Liu 6084af47ef Fix the bug when a input variable of op is dispensable. (#10268)
7 years ago
Yu Yang 8a0c7e2e70 Merge pull request #10280 from reyoung/feature/add_stable_test_of_cross_entropy
7 years ago
Tomasz Patejko 4a497b826d MKLDNN implementation of batch normalization (#9904)
7 years ago
chengduo 4fbde42cdf Fix __shfl_down_sync_ of cross_entropy (#10345)
7 years ago
chengduoZH c891189568 update sparse gradient parameter with reduce and broadcast
7 years ago
yi.wu 6422c0e4f6 update by comment
7 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
7 years ago
chengduoZH 5ff1ef36ee update sparse parameter
7 years ago
typhoonzero eeed7af5c3 add gen_nccl_id_op
7 years ago
Yu Yang 5e151b2c83 Follow comment
7 years ago
Yu Yang 9a4c1a39f0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_stable_test_of_cross_entropy
7 years ago
Yu Yang caa4027d9d Follow comments
7 years ago
dzhwinter 57be5c6c74 "fix double type error" (#10322)
7 years ago
Qiao Longfei faebadd938 Merge pull request #10228 from jacquesqiao/use-multi-thread-todo-update
7 years ago
Yancey ff99d94197 Merge pull request #10164 from Yancey1989/lookup_sparse_table_op
7 years ago
typhoonzero b3cf429e02 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
typhoonzero e7ac709b4b done
7 years ago
Abhinav Arora 1945b729b6 Fix CPPLint issues with math/sequence_padding (#10317)
7 years ago
chengduo 9bcd9f661b fix cpplint error (#10329)
7 years ago
Abhinav Arora 55f0d84029 Fix Cpplint Issues in fluid/inference/tensorrt/ (#10318)
7 years ago
Yu Yang 4db43c6c9f Naive implement cblas
7 years ago
fengjiayi a1a401eb26 fix
7 years ago
fengjiayi d11b8e56e5 fix
7 years ago
Luo Tao 9945265f09 Merge branch 'develop' into tr_convert_init
7 years ago
typhoonzero a131c73fcf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
chengduo 3222cf16f7 Merge pull request #10325 from chengduoZH/fix_shfl_sync
7 years ago
Kexin Zhao 4613aeba0e Merge pull request #10272 from kexinzhao/save_fp16
7 years ago
Yu Yang 60d6348e69 Revert develop
7 years ago
Yu Yang 86af6bdc81 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
Yang yaming 9a8be9daca Merge pull request #10223 from pkuyym/fix-10219
7 years ago
Yu Yang 49dedfad17 Polish code and tests
7 years ago
chengduoZH 90d73c79c3 fix shfl_sync for CUDA8.0
7 years ago
qiaolongfei d86626df84 optimize log
7 years ago
qiaolongfei ebf0027391 use IOThreadPool to dispatch async update task
7 years ago
qiaolongfei ea372b3452 add more log
7 years ago
Abhinav Arora 738585476d Fix more CPPLint issues in fluid/operators/math (#10276)
7 years ago
Helin Wang d25fdb0a47 fix build: cuda_helper.h not found
7 years ago
dzhwinter eb6f9dd5de Feature/cuda9 cudnn7 (#10140)
7 years ago
typhoonzero 008f6df9b2 update
7 years ago
chengduo f61dfeedcc Merge pull request #10263 from chengduoZH/add_FLAGS_use_deterministic_algo
7 years ago
typhoonzero ef48f3c766 wip
7 years ago
Yu Yang c888e01660 Refactor GEMM in blas
7 years ago
yangyaming 13fac4232a Fix to pass CI.
7 years ago
Yu Yang c0ac0cd6b3 Complete rename
7 years ago
Qingsheng Li 79be1bb3df Merge branch 'develop' into fix-10026
7 years ago
ktlichkid 48466b4424 auto => auto*
7 years ago
Yu Yang 6c18410487 Revert code to develop
7 years ago
chengduoZH 9fda5c92cd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_FLAGS_use_deterministic_algo
7 years ago
yangyaming f456cd8079 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
Yu Yang 76174ec0e9 Clean cross entropy and add sync in executor
7 years ago
Yu Yang 25779c982d Merge pull request #10265 from reyoung/feature/polish_code
7 years ago
chengduoZH c5774e3282 add FLAGS_use_deterministic_algo
7 years ago
Abhinav Arora e735359631 Fix more CPPlint issues in fluid/operators/math (#10249)
7 years ago
Kexin Zhao efba1c7dcb address comments
7 years ago
Kexin Zhao 6c88f1ae6e add save op float16 support
7 years ago
qiaolongfei f82cb635cf optimize code, add more log
7 years ago
Yu Yang 9612c7e599 Add comments and polish code
7 years ago
Yu Yang deabc8ca0b Merge branch 'develop' into feature/clean_memcpy_async
7 years ago
fengjiayi 71fa3ca9c4 Merge pull request #10232 from JiayiFeng/fix_unittests
7 years ago
qingqing01 76c4ae856f Fix reshape op. (#10253)
7 years ago
Yancey1989 1a93253f16 fix unittest
7 years ago
fengjiayi 30f9dc92e5 fix errors
7 years ago
whs 2f9fa9b721 Merge pull request #10167 from wanghaoshuang/fluid_init
7 years ago
Luo Tao 6f6f330423 update the register method
7 years ago
ktlichkid 9997c916fc Pull origin
7 years ago
ktlichkid 709a9edd46 Code clean up
7 years ago
fengjiayi 330fa95cbd Follow comments
7 years ago
dyning 4a5bfa89c3 Modify RoI pooling op to use LoDTensor and expose it into Python API (#10208)
7 years ago
Tomasz Patejko e498e1fc56 Adam operator optimized with Eigen (#10229)
7 years ago
Kexin Zhao 0ecc6fa8f3 Add float16 transpiler and image classification example (#10109)
7 years ago
Abhinav Arora 83b1a8f6bf Pending more CPPLint errors in fluid/operators/math (#10243)
7 years ago
baiyf c816121d11 optimized iou_similarity_op (#10231)
7 years ago
fengjiayi b88721213f fix broadcast_op_test and reduce_op_test
7 years ago
fengjiayi bcf260e1e8 fix several unit tests
7 years ago
qiaolongfei b058189941 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into use-multi-thread-todo-update
7 years ago
Qiao Longfei 6d934560c7 Merge pull request #10042 from jacquesqiao/add-async-listen-and-serv-op
7 years ago
Abhinav Arora f457d5da06 Fix more CPPLint errors (#10218)
7 years ago
qiaolongfei 0d491b670a use-multi-thread-todo-update
7 years ago
qiaolongfei 3295f31076 optimize naming
7 years ago
yangyaming 18d6254d44 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
qiaolongfei 46342a2306 delete useless code
7 years ago
wanghaoshuang 848fb00215 Fix comments.
7 years ago
fengjiayi 9c7fa6ff69 Merge pull request #10206 from JiayiFeng/blocking_queue_for_reader
7 years ago
qiaolongfei 0264ec3957 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei 63bd38bd74 code optimize
7 years ago
Yu Yang c02ba51de0 Merge pull request #10191 from reyoung/feature/strict_dynload
7 years ago
Yancey1989 dccd013bd3 refine distribute transpiler
7 years ago
fengjiayi 8bd34664f1 fix unit test error
7 years ago
fengjiayi 17c51d69d1 fix unit test error
7 years ago
fengjiayi 304b6b7138 Follow comments
7 years ago
yangyaming 82571deb89 Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Luo Tao 326221acec Merge branch 'develop' into tr_convert_init
7 years ago
Abhinav Arora 4c8ff72615 Fix CPPLint errors with rxecutor (#10212)
7 years ago
Siddharth Goyal 5fe1fe3a27 Fix signed/unsigned comparison warning (#10211)
7 years ago
fengjiayi 4cb63d8451 Remove unnecessary header files
7 years ago
Yancey1989 e393c86c4a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into lookup_sparse_table_op
7 years ago
Luo Tao c4e3010b14 use template to do registry
7 years ago
Yan Chunwei 2d57158e2b fea/init tensorrt engine (#10003)
7 years ago
Luo Tao d599de5c41 auto registray op converters
7 years ago
fengjiayi e057ba6877 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into blocking_queue_for_reader
7 years ago
fengjiayi e2ca42408b Replace Channel in DoubleBufferReader with BlockingQueue
7 years ago
fengjiayi a786611314 Replace Channel in MultiFileReader with BlockingQueue
7 years ago
fengjiayi 1a25f3cd07 Add reader blocking queue
7 years ago
Yu Yang 0c24b3f937 Clean memcpy async
7 years ago
Xin Pan 64babc9aeb Merge pull request #10189 from reyoung/feature/fix_matmul_bug
7 years ago
Yu Yang 580dad0c2c Fix compile when there is no mkl
7 years ago
Yu Yang 3d53631bad Make dyload strictly use the same ABI in header
7 years ago
Yu Yang 2a06e307d0 Fix batch_gemm bugs
7 years ago
Yancey1989 8aea5cac0a add attr auto_grown_table
7 years ago
gongweibao 2f53cd0a76 Fix beam_search memory leak. (#10185)
7 years ago
ktlichkid 5afc2a9972 Keep up with upstream
7 years ago
Qiao Longfei 63bf82ddea Merge branch 'develop' into add-async-listen-and-serv-op
7 years ago
Tao Luo 8b2391858f Merge pull request #10181 from abhinavarora/cpplint_advanced
7 years ago
Wu Yi 3fdfa940be Merge pull request #10135 from typhoonzero/unify_blocking_queue
7 years ago
wanghaoshuang ad3f6f4ad5 Fix devices 'not undefined' error.
7 years ago
Abhinav Arora edd3587e50 Fix CPPLint errors with op_desc
7 years ago
Yang Yang(Tony) 81dfc0cf0e Clean up unused code in operator class (#10035)
7 years ago
Abhinav Arora f09aed0475 Fix CPPLint issues in framework/data_transform framework/prune.cc (#10178)
7 years ago
wangyang59 72ee737f3f Merge pull request #9308 from wangyang59/bilinear
7 years ago
Yang Yang(Tony) 2182ecfbbd remove duplicated ShareLoD in gru_op and sequence_conv_op (#10149)
7 years ago
gongweibao 6171705a2c Potential bug in paddle/fluid/platform/CMakeLists.txt (#9723)
7 years ago
gongweibao fc025f5265 Fix memory leak of pserver (#10173)
7 years ago
wanghaoshuang 3d96b3811a Fix InitGflags.
7 years ago
Luo Tao 48473dddf4 Merge branch 'develop' into tr_convert_init
7 years ago
wanghaoshuang a4b452a2d6 Remove initP2P(bool) and init function in framework.
7 years ago
Yu Yang 4ecc9b7bae Merge pull request #10166 from reyoung/feature/train_and_test_recordio
7 years ago
wanghaoshuang e4708565f4 Fix cpplint format.
7 years ago
wanghaoshuang a0b258278e Reuse 'initP2P(bool, std::vector)' in 'initP2P(bool)'
7 years ago
wanghaoshuang f31bb1476c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fluid_init
7 years ago
Tao Luo 5a433ecb60 Merge pull request #10134 from luotao1/tensorrt_include
7 years ago
chengduo bfbbe19fbb Merge pull request #10150 from chengduoZH/fix_elementwise_gradient
7 years ago
wanghaoshuang 48b7b54321 Refine code.
7 years ago
Abhinav Arora 5ce57555ee Fix CPPLint issues in init.cc, init.h and library_type.h (#10148)
7 years ago
chengduoZH 0f5d5b1ffc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_elementwise_gradient
7 years ago
wanghaoshuang 1bdea0a8d2 Add init interface for customize devices.
7 years ago