Commit Graph

17607 Commits (b6d261dff51b216551b6a886faa67406a93849f9)
 

Author SHA1 Message Date
qiaolongfei 1623f1ba4f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
tangwei12 3206970b77 sampling op rename
7 years ago
qiaolongfei 903b2c0162 optimize code
7 years ago
Xin Pan 99a77cfc62
Merge pull request #12468 from panyx0718/improve_profiler2
7 years ago
qiaolongfei fd73f2bb26 dist lookup table only support sgd while other dense table support all optimizer
7 years ago
qiaolongfei 4c5bcd7859 add guard to profiler
7 years ago
Xin Pan 19f40dab03 polish doc
7 years ago
qiaolongfei d553e2ff3f revert inference
7 years ago
qiaolongfei a3f9d6a38c optimize profiler
7 years ago
qiaolongfei 3e0477b504 revert inference
7 years ago
qiaolongfei 9a7084c1ae fix test_dist_transpiler
7 years ago
tangwei12 e0ab2f7158 new sampling op
7 years ago
qiaolongfei 25ab4e0d6d fix typo
7 years ago
qiaolongfei 28c4443c22 add test for remove duplicated init op
7 years ago
qiaolongfei 8e9f338c0f add _is_duplicated_init_op
7 years ago
qiaolongfei 37e069eba3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into remove-duplicated-init-op
7 years ago
minqiyang 5656f64b58 Remove duplicate handle of dist_transformer
7 years ago
minqiyang a58dd3e557 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_python3_syntax
7 years ago
minqiyang 850c3945f2 Make transformer unittest serial
7 years ago
tensor-tang 8c23f7c4f0 fix blas and use packed weight
7 years ago
tensor-tang d9cc6b1866 replace gru compute with details
7 years ago
tensor-tang 43cee33a23 add mkl packed gemm
7 years ago
Qiyang Min 0964de119b
Merge pull request #12505 from velconia/fix_pserver_shutdown
7 years ago
Qiyang Min a7d921a537
Merge pull request #12521 from velconia/port_python3_syntax
7 years ago
minqiyang c3e187896d Port contrib to python3
7 years ago
minqiyang cd27e53667 Fix python code in contrib
7 years ago
minqiyang f9ef0ee8a9 Polish code
7 years ago
minqiyang 2e6bce597d Add CUDA_VERSION to cudnn location detection
7 years ago
minqiyang c4d000a990 Make code more efficient
7 years ago
JiabinYang 4af5d3e3d3 fix the paddle script causes 'command not found' error'
7 years ago
minqiyang 9812bb8b48 Fix pserver can NOT start with DebugString problem
7 years ago
Chen Weihang 3185d38af4
Merge pull request #12467 from chenwhql/batch_size_check
7 years ago
tangwei12 766ac488ac sum_op selectedRows dim bug fix
7 years ago
Zhaolong Xing d7dd0868db
Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
7 years ago
nhzlx d50f776b27 merge develop
7 years ago
Bai Yifan 900d61dd98
Clean python api (#12406)
7 years ago
dzhwinter 0c8fde7dce
"cherry picked cpp tests" (#12182)
7 years ago
dzhwinter 595a2c83ae
explicit gradient of elementwise_add/elementwise_sub (#11970)
7 years ago
fengjiayi 23aebf0ea7 update softmax layer comment
7 years ago
chenweihang bcc2fbe687 add type conversion for batch_size
7 years ago
nhzlx 64a08f840f increase the test batch
7 years ago
Zhaolong Xing f37f875f1f
Merge pull request #12349 from NHZlX/add_tensorrt_conv2d_converter
7 years ago
Zhaolong Xing 7e6bac3ea6
Merge pull request #12479 from NHZlX/fix_gtest_test_eq_warning
7 years ago
chengduo c237795100
Reopen test_parallel_executor_transformer (#12452)
7 years ago
fengjiayi e7d8e16a66 update softmax_mkldnn_op
7 years ago
chenweihang 8627ef3330 refactor: simplify unittest function
7 years ago
nhzlx c7e6a11bc1 merge develop
7 years ago
nhzlx 0015df1b12 modify op converter for conv2d
7 years ago
chengduo b5a3e40bb5
close random fail test (#12482)
7 years ago
Yu Yang 2567afa35d
Merge pull request #12462 from reyoung/feature/fix_cudnn_deterministic
7 years ago