Commit Graph

109 Commits (0575fd4647bf414662d31c02371a68689273b22c)

Author SHA1 Message Date
fengjiayi 0575fd4647 simplify shape inference code
7 years ago
dzhwinter 80eff2662b
"unify flags" (#7973)
7 years ago
Qiao Longfei 50ac67fc63
Bugfix/check if kernel for type exist (#7657)
7 years ago
dzhwinter 5ad1aef051
"cudnn operators change to cudnn kernel" (#6660)
7 years ago
Qiao Longfei 23df6c4478
Add get lod for debug (#7375)
7 years ago
ranqiu92 95c0c12641
Merge pull request #7384 from dzhwinter/feature/sync_wait
7 years ago
Qiao Longfei 377424bf21
reorganize data transform related code (#7391)
7 years ago
dzhwinter a6edc0389e "fix CI"
7 years ago
dzhwinter f0316bdbbd "add flags"
7 years ago
Qiao Longfei d762e07ecc
Merge pull request #7294 from jacquesqiao/add-back-priority
7 years ago
qiaolongfei 8b1a81a9bf fix GetDims bug
7 years ago
qiaolongfei 0b52cc886f fix priority
7 years ago
qiaolongfei ca90356b0e add back priority
7 years ago
dzhwinter e94db381ba
Feature/add shared layout (#7233)
7 years ago
Qiao Longfei 0f353ab46e
cpu gpu transform function (#7191)
7 years ago
emailweixu 8814bec0c5 Show argument dimensions with operator::DebugStringEx (#7268)
7 years ago
Yu Yang 894236a128
Merge pull request #6730 from tonyyang-svail/parallel_do
7 years ago
dzhwinter 5593858dd9
Feature/use cudnn (#7141)
7 years ago
Yang Yang 97dc451f4a clean up
7 years ago
Yang Yang 9313233297 merge develop
7 years ago
dzhwinter 899a79cceb
Feature/transform (#7111)
7 years ago
QI JUN 5036cf0387
add helper function to get appropriate DeviceContext (#7066)
7 years ago
Yu Yang 15e8c80ee0 Rename API of DeviceContext (#7055)
7 years ago
QI JUN 7aed7eb539
cache memory in local scope (#7058)
7 years ago
QI JUN 94096ae554
add memory switch mechanism in operator kernel switch (#6991)
7 years ago
Qiao Longfei af0c4c45a3
Impl kernel hint (#6883)
7 years ago
qiaolongfei 313afc9cce add op_kernel_type_test
7 years ago
QI JUN 37e9626437 refine OpKernelType (#6879)
7 years ago
dzhwinter 735eba2976
Feature/operator run place (#6783)
7 years ago
Yang Yang f899150e0a pass forward runtime
7 years ago
Yu Yang e445b3ff20
Move framework.proto to proto namespace (#6718)
7 years ago
QI JUN 61ec0b9516
Refine device context (#6433)
7 years ago
dangqingqing 4e451a34db Remove the cuda stream synchronization between each operator.
7 years ago
Yang Yang(Tony) 18f0c40a97 feature/while_grad_op (#5554)
7 years ago
Yu Yang bbdac7f7d8 Polish OpWithKernel
8 years ago
qingqing01 58db07b7bb Check errors for the cuda kernel calls. (#5436)
8 years ago
Yu Yang 6cde889b5e
Add unittest, backward of array read/write op (#5409)
8 years ago
Yu Yang 0a32e74d13
Rewrite StaticRNN with Executor (#5224)
8 years ago
Cao Ying 8401039feb
Merge pull request #5084 from lcy-seso/crf
8 years ago
Qiao Longfei ee11f00642
add shareLod (#5259)
8 years ago
caoying03 dd2be3daba Merge branch 'develop' into crf
8 years ago
caoying03 86fd6b6373 add gpu kernel by copying inputs/outputs between cpu and gpu.
8 years ago
Yu Yang 46a13e37d7 Polish Accuracy Op (#5191)
8 years ago
Yu Yang 8f6c0a0fad
Extract InferShape to many cc files (#5174)
8 years ago
QI JUN 7f8574c0f5 add sparse support for sum op (#5093)
8 years ago
Yu Yang 86437a8dda Global function, op_support_gpu (#4980)
8 years ago
qiaolongfei a0767228bd merge InferShapeContext and ExecutionContext
8 years ago
Yi Wang 4558807c48 Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU
8 years ago
Yu Yang 84500f9487 Change `PADDLE_ONLY_CPU` to `PADDLE_WITH_GPU`
8 years ago
Qiao Longfei 87efa600df add some check to operator.run (#4544)
8 years ago