Commit Graph

1604 Commits (27e4ce728741e6acfa0308b8dfa2ce129bd24e22)

Author SHA1 Message Date
Yu Yang 05a96db67f
Merge branch 'develop' into feature/matmul_support_float16_double
7 years ago
yi.wu 5ae0c664b0 fix build and merge develop
7 years ago
yi.wu 6ef60de6f1 update
7 years ago
Jacek Czaja 7bf00c3a93 - First draft of reusing of softmax mkldnn primitives
7 years ago
typhoonzero 7a7d27b33e update op
7 years ago
typhoonzero 0ae726f060 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
typhoonzero f5840d8925 follow comments
7 years ago
typhoonzero 04bde96e4c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
yuyang18 54cbf799d6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/change_pe_strategy
7 years ago
yuyang18 9f705a4bb9 Use int instead of VarType as unordered_map key
7 years ago
chengduoZH 97cb5479ae change PE strategy
7 years ago
chengduo 18f527bfa3
Merge pull request #10586 from jacquesqiao/fix-compile
7 years ago
yuyang18 c4d6daac58 Polish SizeOfType
7 years ago
Yu Yang 046405e091
Merge pull request #10486 from reyoung/feature/clean_op_maker
7 years ago
yuyang18 711d86bb56 Polish data_type.h
7 years ago
qiaolongfei 3c2a6109ec fix build computation_op_handle
7 years ago
qiaolongfei 07d94bdd82 fix compile fetch_op_handle on mac
7 years ago
yuyang18 66590a0b88 Fix typo in blas_impl.h
7 years ago
yuyang18 ad2e420623 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/matmul_support_float16_double
7 years ago
Lei Wang f3ffec23cf
CI: exit when fail any step. (#10579)
7 years ago
Siddharth Goyal 283c4dbe57
Add FP16 option in save_combine_op (#10471)
7 years ago
Siddharth Goyal 28a6037bb8
Fix lod check in FP16 test for save_op (#10508)
7 years ago
chengduo ce72c3ff26
Merge pull request #10476 from chengduoZH/refine_parallel_exe
7 years ago
chengduoZH a89cd46700 Wait VarDummyHandle generated
7 years ago
Wu Yi 61343fbf53
Merge pull request #10531 from typhoonzero/refine_grpc_serde_code
7 years ago
yuyang18 27197290dc matmul support float16/double
7 years ago
Yu Yang 705e7345d0
Merge pull request #10449 from reyoung/feature/clean_matmul
7 years ago
yuyang18 613d3ef084 Fix compile error
7 years ago
Yu Yang 36653587e9
Merge pull request #10475 from gmcather/develop
7 years ago
Kexin Zhao cc75e84d05
Merge pull request #10541 from kexinzhao/load_fp16
7 years ago
yuyang18 ad594b9b70 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
reyoung a0c413833d Polish Dctor
7 years ago
typhoonzero 796a448ce4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_grpc_serde_code
7 years ago
reyoung b0ca371f11 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_op_maker
7 years ago
Tao Luo 28de0ea404
Merge pull request #10545 from luotao1/fix_tensorrt_engine
7 years ago
typhoonzero 602aa43322 cast data type
7 years ago
Luo Tao 4a5ebb6806 fix tensorrt_engine compiler error
7 years ago
fengjiayi cf3b3d6024 fix warpctc
7 years ago
Kexin Zhao aa2635fe65 clean code
7 years ago
Kexin Zhao cbf502e5d4 fix error
7 years ago
Kexin Zhao 270a87fb66 add load op fp16 mode test
7 years ago
Kexin Zhao eb95417e05 initial commit
7 years ago
Xin Pan dcb77813aa Revert "CI: rerun failed tests. (#10536)"
7 years ago
Lei Wang 0446220e01
CI: rerun failed tests. (#10536)
7 years ago
chengduoZH 830532213a extract method from broadcast::RunImpl
7 years ago
typhoonzero a2de156dfa refine serde code
7 years ago
Tao Luo 303277f002
Merge pull request #10437 from panyx0718/infer2
7 years ago
Xin Pan f093a7b332 allow inference test to generate timeline
7 years ago
Luo Tao 40b8b634f9 Merge branch 'develop' into refine_relu_test
7 years ago
Yan Chunwei 819038113e Feature/engine refactor (#10497)
7 years ago
Yan Chunwei 6eeb819538
feature/inference analysis dot (#10494)
7 years ago
Xin Pan 6728d96d89 follow comments
7 years ago
Xin Pan 3de43a87ef Add a multi-dim add layer test.
7 years ago
chengduoZH 93368aac23 Merge develop
7 years ago
chengduoZH 9eec2c7509 refine pe
7 years ago
Kexin Zhao 170ac721b6 remove unnecessary tensor copy in save op
7 years ago
Yao Cheng 9ca8124f99 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
7 years ago
chengduo 8b1918f5c7
Merge pull request #10454 from chengduoZH/fix_fetchop
7 years ago
fengjiayi 2bff03bc1e fix a compile error (#10488)
7 years ago
Luo Tao 0ae97e8a5b Merge branch 'develop' into refine_relu_test
7 years ago
chengduoZH f4851f14b8 clean code
7 years ago
chengduoZH a459764de7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_fetchop
7 years ago
chengduoZH e00c1ee10f fix split var test
7 years ago
Yu Yang 0e78cb69fb Clean OpProtoAndCheckerMaker
7 years ago
Luo Tao 89dcb0bd15 refine EngineIOConverter, and use io_convert in test_trt_activation_op
7 years ago
Yu Yang 96b703c007 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
Yao Cheng 8cbb49ce63 rm space
7 years ago
Yao Cheng e97645c959 rm block_desc.h whitespace
7 years ago
chengduoZH 345737d0fe add sync
7 years ago
Yu Yang fcd31d6161 Follow comments and polish code names
7 years ago
chengduoZH 187e23a79c fix MatMul parameter
7 years ago
Yu Yang 0a13d3c67a Move MatMul to blas_impl.h
7 years ago
gmcather be1f058f52
Merge branch 'develop' into develop
7 years ago
Yao Cheng 55218dafbc add prependAllocatedop
7 years ago
Tao Luo 3356fb3c6e
Merge pull request #10461 from luotao1/refine_convert
7 years ago
chengduoZH 676dfd1875 follow comments
7 years ago
typhoonzero a135fec1fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
chengduoZH aff8a26d71 check generated_op_
7 years ago
typhoonzero 0f86397d81 fix build
7 years ago
Luo Tao 53b401d589 refine io_convert and op_convert
7 years ago
chengduoZH 2e5d44f102 fix fetch op
7 years ago
Yu Yang 3dd01823a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_matmul
7 years ago
typhoonzero 17009d0627 workable version
7 years ago
Xin Pan dce0732d5e
Merge pull request #10380 from panyx0718/dist_timeline
7 years ago
Xin Pan 0c518888fa
Merge pull request #10430 from panyx0718/infer
7 years ago
Yu Yang c6a6d87f96 Rewrite Matmul, make code cleaner
7 years ago
typhoonzero a529d790b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gen_nccl_id_op
7 years ago
Yan Chunwei 2a2c83b9e6 feature/convert tensorrt io (#10440)
7 years ago
typhoonzero 82c61dbde3 fix testing
7 years ago
Xin Pan 9fccf46270 reword comments
7 years ago
Xin Pan d1ea74d3b9 follow comments
7 years ago
fengjiayi b708ec0ae1
Merge pull request #10412 from JiayiFeng/correct_TensorCopy_misuse
7 years ago
Kexin Zhao 55e714e0d2 add float16 support to pool3d
7 years ago
Kexin Zhao 8b16927230 add fp16 support to conv3d
7 years ago
Yiqun Liu fd1971caa0
Add the call of DropKids at the end of executor.Run to delete the local scopes created in operators (#10403)
7 years ago
chengduo 99acf1da4c
Merge pull request #10351 from chengduoZH/feature/update_sparse_parameter
7 years ago
Darcy 8f8a4768dc adding device_context to blas deps list (#10420)
7 years ago
dzhwinter a28dffbb0b
Fix/adam float64 (#10407)
7 years ago
Xin Pan cdd52f3a30 Add comment to explain how to run inference test
7 years ago
typhoonzero 0598a4b366 fix ci
7 years ago
typhoonzero 3667578ec2 testing
7 years ago
chengduoZH 881e063ee2 follow comments
7 years ago
chengduoZH ff599b9218 use Reduce and Broadcast
7 years ago
chengduoZH 0441c2cc45 fix ci
7 years ago
fengjiayi 0c99cd7bbb fix errors in sequence_padding_test
7 years ago
Lei Wang 6418c42148 Travis: fix check style error.
7 years ago
Kexin Zhao 4e3fac4129 fix sign unsigned comparison (#10424)
7 years ago
Siddharth Goyal b65282168c Fix cpplint errors in lstm kernel (#10394)
7 years ago
chengduo 4558c0ec0a
Merge pull request #10414 from chengduoZH/wrap_shfl_x_sync
7 years ago
chengduoZH d36af62c1e wrap_shfl_x_sync
7 years ago
Yancey 2d98a418d7
fix remove op (#10410)
7 years ago
fengjiayi bf99396a04 fix errors in sequence_slice_op
7 years ago
fengjiayi baa9f50da5 fix errors in multiplex_op
7 years ago
fengjiayi 2e617334eb fix errors in lod_reset_op
7 years ago
fengjiayi e309f42293 fix errors in concat_test
7 years ago
Yu Yang 0285a2b95d
Merge pull request #10371 from reyoung/refine_code
7 years ago
chengduoZH f9c680c43e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/update_sparse_parameter
7 years ago
typhoonzero d9320dcd94 complete code
7 years ago
chengduoZH 7722baa8e3 follow comments and clean code
7 years ago
Xin Pan 5a9f17f02b clean up
7 years ago
Tao Luo 4646c0f35d
Merge pull request #10144 from luotao1/tr_convert_init
7 years ago
Qingsheng Li 3bb99c4f66
Added auto transform to beam_search_decode_op (#10286)
7 years ago
Abhinav Arora c9f55dfafc
Fix CPPLint issues in /math/detail/gru_kernel.h (#10390)
7 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Kexin Zhao ccc594e4c4
need to copy LoD info (#10392)
7 years ago
Kexin Zhao 7a86069422 Add float16 demo code and put float16 work in contrib/float16 folder (#10331)
7 years ago
Luo Tao beb1245560 add relu converter and unit-test
7 years ago
Xin Pan 76d8b14bce Add timeline support for distributed training
7 years ago
Yu Yang d0785ce982 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang 815d888468 Clean MatMul
7 years ago
Yu Yang 2abcf37958
Merge pull request #10327 from reyoung/feature/clean_blas
7 years ago
chengduo 54797abd53
Merge pull request #10347 from chengduoZH/replace___shfl_with__shfl_sync
7 years ago
chengduo 62fed4cbb3 fix __shfl_down (#10362)
7 years ago
Yu Yang 9d7279b953 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_code
7 years ago
Yu Yang bc8160350b Fix compile
7 years ago
chengduoZH e97c1a8ca0 fix __shfl
7 years ago
Yu Yang a6edeb39b3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
typhoonzero 7237323c5d fix compile
7 years ago
dzhwinter f63ff90b03
Fix/fp64 (#10346)
7 years ago
Wu Yi 88d79dfe95
Merge pull request #10292 from typhoonzero/fix_grpc_server_ready_condition
7 years ago
chengduoZH 0cc635497c merge develop
7 years ago
Yiqun Liu 6084af47ef
Fix the bug when a input variable of op is dispensable. (#10268)
7 years ago
Yu Yang 8a0c7e2e70
Merge pull request #10280 from reyoung/feature/add_stable_test_of_cross_entropy
7 years ago
Tomasz Patejko 4a497b826d MKLDNN implementation of batch normalization (#9904)
7 years ago
chengduo 4fbde42cdf Fix __shfl_down_sync_ of cross_entropy (#10345)
7 years ago
chengduoZH c891189568 update sparse gradient parameter with reduce and broadcast
7 years ago
yi.wu 6422c0e4f6 update by comment
7 years ago
chengduoZH b8f7fa97b6 replace __shfl with __shfl_sync
7 years ago
chengduoZH 5ff1ef36ee update sparse parameter
7 years ago
typhoonzero eeed7af5c3 add gen_nccl_id_op
7 years ago