Commit Graph

294 Commits (f58fe6d3edb78c96547d91bf9da1492d0709b63d)

Author SHA1 Message Date
chengduo f58fe6d3ed
Merge pull request #6601 from chengduoZH/profiling/cosine_op
7 years ago
武毅 0bd7f97b4b
Merge pull request #7045 from typhoonzero/adam_selectedrows
7 years ago
chengduoZH 812c5f60eb remove conflict
7 years ago
chengduoZH 24cf2fcd90 move cos_sim_functor to math
7 years ago
typhoonzero 1039c1e3b7 scatter optimizers
7 years ago
typhoonzero 641b4c0fe6 wip
7 years ago
Yancey 2cdef424d9
Implement selectedrows serialize and deserialize (#7042)
7 years ago
typhoonzero 74b122889c wip
7 years ago
typhoonzero d48a0e4eae WIP: adding generic scattor functors
7 years ago
qingqing01 95da78a6df
Merge pull request #7047 from qingqing01/rowwise_add
7 years ago
qingqing01 19367389c0 Update the CUDA kernel.
7 years ago
qingqing01 41372ded20 Resume CPU implenmentation.
7 years ago
qingqing01 32d881beab Optimize the rowwise add function.
7 years ago
Tao Luo c77b696b8e
Merge pull request #7022 from luotao1/license
7 years ago
Luo Tao 761b329793 unify the indentation of license
7 years ago
qingqing01 f839154542
Merge pull request #6996 from qingqing01/lstm_active_type
7 years ago
dangqingqing a8e18549c2 Fix the clang format.
7 years ago
qingqing01 d760b6a58d Refine the activation type getting in the LSTM operator to speed.
7 years ago
QI JUN efd3726929
remove unused place (#6972)
7 years ago
dzhwinter 0d2235aadf GPUPlace to CUDAPlace (#6960)
7 years ago
qiaolongfei 682eee40cb fix math_function warning
7 years ago
Yu Yang 7e214b4985
Speed up ColwiseSum in CPU (#6834)
7 years ago
chengduo e19032fb4e
Merge pull request #6743 from chengduoZH/profiling/02.recognize_digits
7 years ago
chengduoZH cb3a74e436 revert im2col
7 years ago
chengduoZH 7b0744edcf refine im2col
7 years ago
chengduoZH f1ab13bd0e refine
7 years ago
chengduoZH 293b292e0f refine im2col
7 years ago
QI JUN 93a2d9c59d
add more place test and rename Cudnn to CUDNN (#6621)
7 years ago
tensor-tang 7728c53448 Merge remote-tracking branch 'upstream/develop' into fluid
7 years ago
tensor-tang f271210595 fix undefined issue when with_gpu
7 years ago
dzhwinter 0e9b393b34
"derived cudnnDevice context" (#6585)
7 years ago
QI JUN 61ec0b9516
Refine device context (#6433)
7 years ago
tensor-tang 69b44f2f19 unify MKL macro definition
7 years ago
qingqing01 e5b51c4d10 Make lstm_op follow google code style.
7 years ago
sweetsky0901 966a6ce6db
Merge pull request #5826 from sweetsky0901/my_unpool_max_2d
7 years ago
sweetsky0901 4ffb73fd3b format ..
7 years ago
sweetsky0901 3206094b5e format code
7 years ago
guosheng 1ab03d49d5 Fix gru_op related code style in gpu_kernel
7 years ago
guosheng 3e552cdcac Fix gru_op related code style
7 years ago
sweetsky0901 d2ee3c98df format code
7 years ago
sweetsky0901 c52ed8de37 format code
7 years ago
sweetsky0901 bd561384bc format code
7 years ago
sweetsky0901 d9673cad15 format code
7 years ago
sweetsky0901 6fc9a9fd69 modify for del T2 and doc update
7 years ago
sweetsky0901 ee0a794c27 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
sweetsky0901 57e68e5740 modify for code review by qingqing 2nd
7 years ago
QI JUN b28b2f172b refine test_recognize_digits_mlp and format codes (#5937)
7 years ago
sweetsky0901 20654cf78a modify for type check rewrite
7 years ago
sweetsky0901 27cf7f3376 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into my_unpool_max_2d
7 years ago
Yu Yang a619695b06
Feature/enhance evaluator (#5824)
7 years ago