Yu Yang
186b2b138d
Revert "Merge pull request #13201 from reyoung/revert_callstack" ( #13697 )
...
This reverts commit 21bb9e91fc, reversing changes made to 3fa68dc101.
test=develop
7 years ago
Yan Chunwei
c8744d118d
fea/infer executor and concurrency performance issue bug fix ( #13451 )
...
- add naive executor
- fix concurrency performance issue
7 years ago
Yu Yang
7119d6c3cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
7 years ago
dzhwinter
97636a9fcf
"fix link error" ( #13545 )
7 years ago
tensor-tang
d61c11764a
follow comment add enforce
7 years ago
tensor-tang
8bb824bb93
refine infershape hasinput and hasoutput
7 years ago
tensor-tang
e0436ad8bb
refine fusion lstm infershape
7 years ago
Yu Yang
cda7842e26
Revert "Revert "Add Python Callstacks when Op::Run error ( #12759 )""
...
This reverts commit 1f270275a6.
7 years ago
Qiao Longfei
d0b713493e
enhance DebugStringEx ( #12949 )
7 years ago
guochaorong
1f270275a6
Revert "Add Python Callstacks when Op::Run error ( #12759 )"
...
This reverts commit b2df17003f.
7 years ago
Yu Yang
b2df17003f
Add Python Callstacks when Op::Run error ( #12759 )
...
* Add Python Callstacks when Op::Run error
* Skip op with sub-block
* refactor: refine callstack info's format
* Reshape only support matrix
* Polish Python code
* Fix UT
* Fix Py3
7 years ago
Qiao Longfei
c1446342ff
Merge pull request #12577 from jacquesqiao/optimize-vlog-before-and-after-op-run
...
optimize vlog before and after op run, move into op.run
7 years ago
Yu Yang
c4f8afa258
Fix bug when memopt optimize loss.grad and use ParallelExecutor
7 years ago
qiaolongfei
b4d48531e4
optimize vlog before and after op run, move into op.run
7 years ago
qiaolongfei
903b2c0162
optimize code
7 years ago
qiaolongfei
a3f9d6a38c
optimize profiler
7 years ago
minqiyang
f9ef0ee8a9
Polish code
7 years ago
minqiyang
c4d000a990
Make code more efficient
7 years ago
minqiyang
9812bb8b48
Fix pserver can NOT start with DebugString problem
7 years ago
dzhwinter
0c8fde7dce
"cherry picked cpp tests" ( #12182 )
...
* "cherry picked cpp tests"
* "cherry picked"
* "cherry picked tests"
* "merge develop branch"
7 years ago
qiaolongfei
3033841b4a
CheckTensorNANOrInf support checking SelectedRows
7 years ago
Tao Luo
436bb4500b
Merge pull request #11699 from pzelazko-intel/pzelazko/workaround-for-missing-mklnn-kernels
...
workaround for no MKLDNN kernel
7 years ago
Paweł Żelazko
05ffc1331e
log for fallback added
7 years ago
Paweł Żelazko
ac323343a0
typos fix
7 years ago
Paweł Żelazko
bf0090a1ef
workaround for missing MKLDNN kernel
7 years ago
yuyang18
3b00ed81a9
Make Kernel registered as a function
7 years ago
sneaxiy
3f9292c6e6
fix merge conflict
7 years ago
sneaxiy
dd70fb4393
fix type comparison bugs
7 years ago
guochaorong
6a35899131
Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )"
...
This reverts commit bc28cf613f.
7 years ago
yuyang18
5e23a5ec18
Rename TransferData -> TransformData
7 years ago
yuyang18
5999201346
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refactor_op
7 years ago
Haichao Zhang
bc28cf613f
Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )
...
* Add fill_zeros_array op. This op is used for zero-filling an LoDTensorArray.
* merge fill_zeros_array_op with fill_zeros_like_op
* add unit_test for fill_zeros_like for array
7 years ago
yuyang18
9faf5a39c5
Refactor Operator.cc, and clean code
7 years ago
Yang Yang(Tony)
d827c6e87a
Dynamic Graph first prototype ( #11415 )
7 years ago
qiaolongfei
7ebef493d5
add row_size for selected rows in DebugStringEx
7 years ago
fengjiayi
5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
...
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
dzhwinter
f7c96f079b
Big data op_test benchmark, for checking output consistency across different runs. ( #10646 )
...
* "init benchmark ops"
* "untrack outputs"
* "delete some unused code"
* "benchmark"
* "fix ci"
* "fix op test"
* "fix uint16 missing"
* "fix ci"
* "follow comments"
* "fix ci"
* "follow comments"
* "conflicts. merge develop branch"
* repick
* "merge develop branch"
7 years ago
fengjiayi
dc8e0b494d
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
mozga-intel
3ff9ba0e6b
Mkldnn layout ( #11040 )
...
* Add MKLDNN layout support in Paddle
Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded for all MKLDNN op kernels. As a result,
a non-optimized execution path was selected in the MKLDNN primitive, which
brought worse performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.
* Add MKLDNN layout support in activation OP
* Don't populate layout from input to output when kMKLDNN in
* Refine pool mkldnn op kernel
* MKLDNN layout
* Remove the inheritance from tensor file
* MKLDNN layout: refactoring
* Remove additional #define to register new operator
* Prepare mkldnn tests to work with layout
7 years ago
Yu Yang
14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
...
Add random crop op
7 years ago
fengjiayi
3bce3dbce1
fix a bug
7 years ago
chengduoZH
cb1c657ce4
move check_nan_inf to operator
7 years ago
Yiqun Liu
6084af47ef
Fix the bug when an input variable of op is dispensable. ( #10268 )
...
* Fix the bug when an input variable of op is dispensable.
* Add HasInputs/Outputs interfaces to OperatorBase.
* Remove the unreferenced header file.
7 years ago
Yang Yang(Tony)
81dfc0cf0e
Clean up unused code in operator class ( #10035 )
...
* delete unused IsNetOp() and Rename()
* rm OperatorBase::Rename implementation
* delete Operator::InputVars()
* remove unused OperatorBase::ShareLoD; ShareLoD has been implemented in infershape
* organize operatorbase; remove unused set_type
* add comments
* fix comment
7 years ago
Abhinav Arora
f09aed0475
Fix CPPLint issues in framework/data_transform framework/prune.cc ( #10178 )
...
* Fix CPPLint issues with data_transform
* Fix CPPLint issues with prune.cc
7 years ago
Qiao Longfei
4c55a6022a
Dist transpiler support prefetch ( #9714 )
...
* init
* add some check
* add dist transpile logic
* add insert op for block
* init change get_pserver_program
* optimize code
* fix a bug
* can run now
* start to do table split
* start to process table gradient
* complete pserver part
* can send_vars now
* revert cpplint
* fix a bug
* optimize code
* move dist test to models
* revert the interface of distribute_transpiler.transpile
* fix prefetch_block
* optimize transpiler code
* add comment to sum_op
* add warning log
* fix comment
* fix test_send_recv
* fix test_send_recv
* fix train with no distributed table
* optimize GetDims
7 years ago
qiaolongfei
e66bd4cb73
add GetDataTypeOfVar
7 years ago
Qiao Longfei
23bab34ca3
Fix data transform when inplace ( #9450 )
...
* fix data transform when op has inplace in/out
* add log
* should not delete scope because Compute may be async
* optimize code
7 years ago
Yibing Liu
90afbd2856
Move back operator's event to RunImpl()
7 years ago
Yu Yang
a8c076e577
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/shuffle_reader
7 years ago