Commit Graph

8548 Commits (d13dd3b6a7ee81d4c106035ec0bad2c581ea795c)

Author SHA1 Message Date
yuyang18 64305b3ffc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_visit_data_type
7 years ago
yuyang18 e5281b3c2d Clean code & add execution strategy
7 years ago
Tao Luo 8c7d2e2984 Merge pull request #10576 from jczaja/prv-reuse-mkldnn-softmax-primitives
7 years ago
yangyaming 1c446508a4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10605
7 years ago
typhoonzero 928418a9ac Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
Yu Yang 9923be5d10 Merge pull request #10546 from chengduoZH/feature/change_pe_strategy
7 years ago
yangyaming ca7c5a24e0 Fix order of prior boxes.
7 years ago
baiyf 43b6d4f8cb put detection op together (#10595)
7 years ago
Yu Yang 05a96db67f Merge branch 'develop' into feature/matmul_support_float16_double
7 years ago
yi.wu 5ae0c664b0 fix build and merge develop
7 years ago
yi.wu 6ef60de6f1 update
7 years ago
tangwei12 a1419f1062 test add op declare
7 years ago
tangwei12 5e74db3f2a add build and test make
7 years ago
Jacek Czaja 7bf00c3a93 - First draft of reusing of softmax mkldnn primitives
7 years ago
typhoonzero 7a7d27b33e update op
7 years ago
typhoonzero 0ae726f060 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero f5840d8925 follow comments
7 years ago
tangwei12 d1bd3fdefc add build and test make
7 years ago
tangwei12 802d10cf53 rename cpkt_save_op
7 years ago
tangwei12 dc534fc195 add checkpoint save op test
7 years ago
tangwei12 87a0856384 add checkpoint save op
7 years ago
tangwei12 2a05b3d5a3 delete checkpoint function
7 years ago
typhoonzero 04bde96e4c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
yuyang18 54cbf799d6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/change_pe_strategy
7 years ago
yuyang18 9f705a4bb9 Use int instead of VarType as unordered_map key
7 years ago
chengduoZH 97cb5479ae change PE strategy
7 years ago
chengduo 18f527bfa3 Merge pull request #10586 from jacquesqiao/fix-compile
7 years ago
yuyang18 c4d6daac58 Polish SizeOfType
7 years ago
Yu Yang 046405e091 Merge pull request #10486 from reyoung/feature/clean_op_maker
7 years ago
yuyang18 711d86bb56 Polish data_type.h
7 years ago
qiaolongfei 3c2a6109ec fix build computation_op_handle
7 years ago
qiaolongfei 07d94bdd82 fix compile fetch_op_handle on mac
7 years ago
yuyang18 66590a0b88 Fix typo in blas_impl.h
7 years ago
yuyang18 ad2e420623 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/matmul_support_float16_double
7 years ago
tangwei12 e21a72d1b9 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into checkpoint
7 years ago
Lei Wang f3ffec23cf CI: exit when fail any step. (#10579)
7 years ago
Siddharth Goyal 283c4dbe57 Add FP16 option in save_combine_op (#10471)
7 years ago
Siddharth Goyal 28a6037bb8 Fix lod check in FP16 test for save_op (#10508)
7 years ago
chengduo ce72c3ff26 Merge pull request #10476 from chengduoZH/refine_parallel_exe
7 years ago
chengduoZH a89cd46700 Wait VarDummyHandle generated
7 years ago
Wu Yi 61343fbf53 Merge pull request #10531 from typhoonzero/refine_grpc_serde_code
7 years ago
tangwei12 77c6b71ec4 add ckpt to sync loop
7 years ago
yuyang18 27197290dc matmul support float16/double
7 years ago
Yu Yang 705e7345d0 Merge pull request #10449 from reyoung/feature/clean_matmul
7 years ago
yuyang18 613d3ef084 Fix compile error
7 years ago
Yu Yang 36653587e9 Merge pull request #10475 from gmcather/develop
7 years ago
Kexin Zhao cc75e84d05 Merge pull request #10541 from kexinzhao/load_fp16
7 years ago
yuyang18 ad594b9b70 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
reyoung a0c413833d Polish Dctor
7 years ago
typhoonzero 796a448ce4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_grpc_serde_code
7 years ago
reyoung b0ca371f11 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_op_maker
7 years ago
Tao Luo 28de0ea404 Merge pull request #10545 from luotao1/fix_tensorrt_engine
7 years ago
tangwei12 1fabbbade2 modify const to const &
7 years ago
Xin Pan 01a2773df4 Merge pull request #10547 from panyx0718/reduce_parall
7 years ago
Tao Luo d2b0c4897c Merge pull request #10511 from wanglei828/fixdockerfile
7 years ago
typhoonzero 602aa43322 cast data type
7 years ago
Luo Tao 4a5ebb6806 fix tensorrt_engine compiler error
7 years ago
fengjiayi cf3b3d6024 fix warpctc
7 years ago
Xin Pan e3aff13451 Our tests interfere with each other and cause random failure.
7 years ago
Kexin Zhao aa2635fe65 clean code
7 years ago
Kexin Zhao cbf502e5d4 fix error
7 years ago
Kexin Zhao 270a87fb66 add load op fp16 mode test
7 years ago
Kexin Zhao eb95417e05 initial commit
7 years ago
Xin Pan dcb77813aa Revert "CI: rerun failed tests. (#10536)"
7 years ago
Lei Wang 61eb3aa6b3 Dockerfile: fix production dockefile when WITH_GOLONG=OFF.
7 years ago
Lei Wang 0446220e01 CI: rerun failed tests. (#10536)
7 years ago
tangwei12 568a329c83 add checkpoint util class and implement
7 years ago
chengduoZH 830532213a extract method from broadcast::RunImpl
7 years ago
typhoonzero a2de156dfa refine serde code
7 years ago
Tao Luo 303277f002 Merge pull request #10437 from panyx0718/infer2
7 years ago
Xin Pan f093a7b332 allow inference test to generate timeline
7 years ago
Luo Tao 40b8b634f9 Merge branch 'develop' into refine_relu_test
7 years ago
Yan Chunwei 819038113e Feature/engine refactor (#10497)
7 years ago
Yan Chunwei 6eeb819538 feature/inference analysis dot (#10494)
7 years ago
Xin Pan 6728d96d89 follow comments
7 years ago
Xin Pan 3de43a87ef Add a multi-dim add layer test.
7 years ago
chengduoZH 93368aac23 Merge develop
7 years ago
chengduoZH 9eec2c7509 refine pe
7 years ago
Kexin Zhao 170ac721b6 remove unnecessary tensor copy in save op
7 years ago
Yao Cheng 9ca8124f99 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
chengduo 8b1918f5c7 Merge pull request #10454 from chengduoZH/fix_fetchop
7 years ago
fengjiayi 2bff03bc1e fix a compile error (#10488)
7 years ago
Luo Tao 0ae97e8a5b Merge branch 'develop' into refine_relu_test
7 years ago
chengduoZH f4851f14b8 clean code
7 years ago
chengduoZH a459764de7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_fetchop
7 years ago
chengduoZH e00c1ee10f fix split var test
7 years ago
Yu Yang 0e78cb69fb Clean OpProtoAndCheckerMaker
7 years ago
Luo Tao 89dcb0bd15 refine EngineIOConverter, and use io_convert in test_trt_activation_op
7 years ago
Yu Yang 96b703c007 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
Yao Cheng 8cbb49ce63 rm space
7 years ago
Yao Cheng e97645c959 rm block_desc.h whitespace
7 years ago
chengduoZH 345737d0fe add sync
7 years ago
Yu Yang fcd31d6161 Follow comments and polish code names
7 years ago
chengduoZH 187e23a79c fix MatMul parameter
7 years ago
Yu Yang 0a13d3c67a Move MatMul to blas_impl.h
7 years ago
gmcather be1f058f52 Merge branch 'develop' into develop
7 years ago
Yao Cheng 55218dafbc add prependAllocatedop
7 years ago
Tao Luo 3356fb3c6e Merge pull request #10461 from luotao1/refine_convert
7 years ago
chengduoZH 676dfd1875 follow comments
7 years ago
typhoonzero a135fec1fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH aff8a26d71 check generated_op_
7 years ago
typhoonzero 0f86397d81 fix build
7 years ago
Luo Tao 53b401d589 refine io_convert and op_convert
7 years ago
chengduoZH 2e5d44f102 fix fetch op
7 years ago
Yu Yang 3dd01823a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
typhoonzero 17009d0627 workable version
7 years ago
Xin Pan dce0732d5e Merge pull request #10380 from panyx0718/dist_timeline
7 years ago
Xin Pan 0c518888fa Merge pull request #10430 from panyx0718/infer
7 years ago
Yu Yang c6a6d87f96 Rewrite Matmul, make code cleaner
7 years ago
typhoonzero a529d790b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
Yan Chunwei 2a2c83b9e6 feature/convert tensorrt io (#10440)
7 years ago
typhoonzero 82c61dbde3 fix testing
7 years ago
Xin Pan 9fccf46270 reword comments
7 years ago
Xin Pan d1ea74d3b9 follow comments
7 years ago
fengjiayi b708ec0ae1 Merge pull request #10412 from JiayiFeng/correct_TensorCopy_misuse
7 years ago
Kexin Zhao 55e714e0d2 add float16 support to pool3d
7 years ago
Kexin Zhao 8b16927230 add fp16 support to conv3d
7 years ago
Yiqun Liu fd1971caa0 Add the call of DropKids at the end of executor.Run to delete the local scopes created in operators (#10403)
7 years ago
chengduo 99acf1da4c Merge pull request #10351 from chengduoZH/feature/update_sparse_parameter
7 years ago
Darcy 8f8a4768dc adding device_context to blas deps list (#10420)
7 years ago
dzhwinter a28dffbb0b Fix/adam float64 (#10407)
7 years ago
Xin Pan cdd52f3a30 Add comment to explain how to run inference test
7 years ago
typhoonzero 0598a4b366 fix ci
7 years ago
typhoonzero 3667578ec2 testing
7 years ago
chengduoZH 881e063ee2 follow comments
7 years ago
chengduoZH ff599b9218 use Reduce and Broadcast
7 years ago
chengduoZH 0441c2cc45 fix ci
7 years ago
fengjiayi 0c99cd7bbb fix errors in sequence_padding_test
7 years ago
Lei Wang 6418c42148 Travis: fix check style error.
7 years ago
Kexin Zhao 4e3fac4129 fix sign unsigned comparison (#10424)
7 years ago
Siddharth Goyal b65282168c Fix cpplint errors in lstm kernel (#10394)
7 years ago
chengduo 4558c0ec0a Merge pull request #10414 from chengduoZH/wrap_shfl_x_sync
7 years ago
chengduoZH d36af62c1e wrap_shfl_x_sync
7 years ago
Yancey 2d98a418d7 fix remove op (#10410)
7 years ago
fengjiayi bf99396a04 fix errors in sequence_slice_op
7 years ago
fengjiayi baa9f50da5 fix errors in multiplex_op
7 years ago
fengjiayi 2e617334eb fix errors in lod_reset_op
7 years ago
fengjiayi e309f42293 fix errors in concat_test
7 years ago
Yu Yang 0285a2b95d Merge pull request #10371 from reyoung/refine_code
7 years ago
chengduoZH f9c680c43e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/update_sparse_parameter
7 years ago
typhoonzero d9320dcd94 complete code
7 years ago
chengduoZH 7722baa8e3 follow comments and clean code
7 years ago
Xin Pan 5a9f17f02b clean up
7 years ago
Tao Luo 4646c0f35d Merge pull request #10144 from luotao1/tr_convert_init
7 years ago
Qingsheng Li 3bb99c4f66 Added auto transform to beam_search_decode_op (#10286)
7 years ago
Abhinav Arora c9f55dfafc Fix CPPLint issues in /math/detail/gru_kernel.h (#10390)
7 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Lei Wang 20fa848076 Travis: using ccache for docker build.
7 years ago
Kexin Zhao ccc594e4c4 need to copy LoD info (#10392)
7 years ago
Kexin Zhao 7a86069422 Add float16 demo code and put float16 work in contrib/float16 folder (#10331)
7 years ago
Luo Tao beb1245560 add relu converter and unit-test
7 years ago
Xin Pan 76d8b14bce Add timeline support for distributed training
7 years ago
Yu Yang d0785ce982 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang 815d888468 Clean MatMul
7 years ago
chengduo 4c58da2c60 Merge pull request #10367 from NHZlX/fix_maxpool_with_mask_layer
7 years ago
Yu Yang 2abcf37958 Merge pull request #10327 from reyoung/feature/clean_blas
7 years ago
chengduo 54797abd53 Merge pull request #10347 from chengduoZH/replace___shfl_with__shfl_sync
7 years ago
chengduo 62fed4cbb3 fix __shfl_down (#10362)
7 years ago
Yu Yang 9d7279b953 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang bc8160350b Fix compile
7 years ago
xzl 9ab8faaf76 fix pool with mask layer bug
7 years ago
chengduoZH e97c1a8ca0 fix __shfl
7 years ago
Yu Yang a6edeb39b3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
typhoonzero 7237323c5d fix compile
7 years ago
dzhwinter f63ff90b03 Fix/fp64 (#10346)
7 years ago
Wu Yi 88d79dfe95 Merge pull request #10292 from typhoonzero/fix_grpc_server_ready_condition
7 years ago
chengduoZH 0cc635497c merge develop
7 years ago
Yiqun Liu 6084af47ef Fix the bug when a input variable of op is dispensable. (#10268)
7 years ago
Yu Yang 8a0c7e2e70 Merge pull request #10280 from reyoung/feature/add_stable_test_of_cross_entropy
7 years ago
Tomasz Patejko 4a497b826d MKLDNN implementation of batch normalization (#9904)
7 years ago
chengduo 4fbde42cdf Fix __shfl_down_sync_ of cross_entropy (#10345)
7 years ago
Lei Wang 753ea15d26 Build: add cicheck task.
7 years ago
chengduoZH c891189568 update sparse gradient parameter with reduce and broadcast
7 years ago
yi.wu 6422c0e4f6 update by comment
7 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
7 years ago
chengduoZH 5ff1ef36ee update sparse parameter
7 years ago
typhoonzero eeed7af5c3 add gen_nccl_id_op
7 years ago
fengjiayi c7e23bb77d Merge pull request #10338 from JiayiFeng/fix_cpplint_format_check
7 years ago
Yu Yang 5e151b2c83 Follow comment
7 years ago
Yu Yang 9a4c1a39f0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_stable_test_of_cross_entropy
7 years ago
Yu Yang caa4027d9d Follow comments
7 years ago
dzhwinter 57be5c6c74 "fix double type error" (#10322)
7 years ago
Qiao Longfei faebadd938 Merge pull request #10228 from jacquesqiao/use-multi-thread-todo-update
7 years ago
fengjiayi 1e28ba7639 follow comments
7 years ago
Yancey ff99d94197 Merge pull request #10164 from Yancey1989/lookup_sparse_table_op
7 years ago
typhoonzero b3cf429e02 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
typhoonzero e7ac709b4b done
7 years ago
Abhinav Arora 1945b729b6 Fix CPPLint issues with math/sequence_padding (#10317)
7 years ago
chengduo 9bcd9f661b fix cpplint error (#10329)
7 years ago
fengjiayi ac266dfb8d follow cpplint
7 years ago
Abhinav Arora 55f0d84029 Fix Cpplint Issues in fluid/inference/tensorrt/ (#10318)
7 years ago
fengjiayi d946d01e2b follow cpplint
7 years ago
Yu Yang 4db43c6c9f Naive implement cblas
7 years ago
fengjiayi a1a401eb26 fix
7 years ago
fengjiayi d11b8e56e5 fix
7 years ago
Luo Tao 9945265f09 Merge branch 'develop' into tr_convert_init
7 years ago
typhoonzero a131c73fcf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
chengduo 3222cf16f7 Merge pull request #10325 from chengduoZH/fix_shfl_sync
7 years ago
Kexin Zhao 4613aeba0e Merge pull request #10272 from kexinzhao/save_fp16
7 years ago
Yu Yang 60d6348e69 Revert develop
7 years ago
Yu Yang 86af6bdc81 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
Yang yaming 9a8be9daca Merge pull request #10223 from pkuyym/fix-10219
7 years ago
Yu Yang 49dedfad17 Polish code and tests
7 years ago
chengduoZH 90d73c79c3 fix shfl_sync for CUDA8.0
7 years ago
qiaolongfei d86626df84 optimize log
7 years ago
qiaolongfei ebf0027391 use IOThreadPool to dispatch async update task
7 years ago
qiaolongfei ea372b3452 add more log
7 years ago
Abhinav Arora 738585476d Fix more CPPLint issues in fluid/operators/math (#10276)
7 years ago
Lei Wang 95d2651bc2 Build: simplify travis CI script. (#10245)
7 years ago
Helin Wang d25fdb0a47 fix build: cuda_helper.h not found
7 years ago
dzhwinter eb6f9dd5de Feature/cuda9 cudnn7 (#10140)
7 years ago
typhoonzero 008f6df9b2 update
7 years ago
chengduo f61dfeedcc Merge pull request #10263 from chengduoZH/add_FLAGS_use_deterministic_algo
7 years ago
typhoonzero ef48f3c766 wip
7 years ago
Yu Yang c888e01660 Refactor GEMM in blas
7 years ago
yangyaming 13fac4232a Fix to pass CI.
7 years ago
Yu Yang c0ac0cd6b3 Complete rename
7 years ago
Qingsheng Li 79be1bb3df Merge branch 'develop' into fix-10026
7 years ago
ktlichkid 48466b4424 auto => auto*
7 years ago
Yu Yang 6c18410487 Revert code to develop
7 years ago
chengduoZH 9fda5c92cd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_FLAGS_use_deterministic_algo
7 years ago
yangyaming f456cd8079 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
Yu Yang 76174ec0e9 Clean cross entropy and add sync in executor
7 years ago
Yu Yang 25779c982d Merge pull request #10265 from reyoung/feature/polish_code
7 years ago
chengduoZH c5774e3282 add FLAGS_use_deterministic_algo
7 years ago
Abhinav Arora e735359631 Fix more CPPlint issues in fluid/operators/math (#10249)
7 years ago
Kexin Zhao efba1c7dcb address comments
7 years ago
Kexin Zhao 6c88f1ae6e add save op float16 support
7 years ago
qiaolongfei f82cb635cf optimize code, add more log
7 years ago
Yu Yang 01da25845e Merge pull request #10202 from reyoung/feature/clean_memcpy_async
7 years ago
Yu Yang 9612c7e599 Add comments and polish code
7 years ago
fengjiayi 31373370ce fix mac compile errors
7 years ago
Yu Yang deabc8ca0b Merge branch 'develop' into feature/clean_memcpy_async
7 years ago
fengjiayi 71fa3ca9c4 Merge pull request #10232 from JiayiFeng/fix_unittests
7 years ago
qingqing01 76c4ae856f Fix reshape op. (#10253)
7 years ago
Yancey1989 1a93253f16 fix unittest
7 years ago
fengjiayi 30f9dc92e5 fix errors
7 years ago
whs 2f9fa9b721 Merge pull request #10167 from wanghaoshuang/fluid_init
7 years ago
Luo Tao 6f6f330423 update the register method
7 years ago
ktlichkid 9997c916fc Pull origin
7 years ago
ktlichkid 709a9edd46 Code clean up
7 years ago
fengjiayi 330fa95cbd Follow comments
7 years ago
dyning 4a5bfa89c3 Modify RoI pooling op to use LoDTensor and expose it into Python API (#10208)
7 years ago
Tomasz Patejko e498e1fc56 Adam operator optimized with Eigen (#10229)
7 years ago
Kexin Zhao 0ecc6fa8f3 Add float16 transpiler and image classification example (#10109)
7 years ago
Abhinav Arora 83b1a8f6bf Pending more CPPLint errors in fluid/operators/math (#10243)
7 years ago
baiyf c816121d11 optimized iou_similarity_op (#10231)
7 years ago
fengjiayi b88721213f fix broadcast_op_test and reduce_op_test
7 years ago
fengjiayi bcf260e1e8 fix several unit tests
7 years ago
qiaolongfei b058189941 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into use-multi-thread-todo-update
7 years ago