qiaolongfei
3033841b4a
CheckTensorNANOrInf support checking SelectedRows
7 years ago
Tao Luo
436bb4500b
Merge pull request #11699 from pzelazko-intel/pzelazko/workaround-for-missing-mklnn-kernels
...
workaround for no MKLDNN kernel
7 years ago
Paweł Żelazko
05ffc1331e
log for fallback added
7 years ago
Paweł Żelazko
ac323343a0
typos fix
7 years ago
Paweł Żelazko
bf0090a1ef
workaround for missing MKLDNN kernel
7 years ago
yuyang18
3b00ed81a9
Make Kernel registered as a function
7 years ago
sneaxiy
3f9292c6e6
fix merge conflict
7 years ago
sneaxiy
dd70fb4393
fix type comparison bugs
7 years ago
guochaorong
6a35899131
Revert "Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )"
...
This reverts commit bc28cf613f.
7 years ago
yuyang18
5e23a5ec18
Rename TransferData -> TransformData
7 years ago
yuyang18
5999201346
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refactor_op
7 years ago
Haichao Zhang
bc28cf613f
Extend fill_zeros_like_op for zero-filling an LoDTensorArray ( #11496 )
...
* Add fill_zeros_array op. This op is used for zero-filling an LoDTensorArray.
* merge fill_zeros_array_op with fill_zeros_like_op
* add unit_test for fill_zeros_like for array
7 years ago
yuyang18
9faf5a39c5
Refactor Operator.cc, and clean code
7 years ago
Yang Yang(Tony)
d827c6e87a
Dynamic Graph first prototype ( #11415 )
7 years ago
qiaolongfei
7ebef493d5
add row_size for selected rows in DebugStringEx
7 years ago
fengjiayi
5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
...
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
dzhwinter
f7c96f079b
Big data op_test benchmark, for checking output consistency across different runs. ( #10646 )
...
* "init benchmark ops"
* "untrack outputs"
* "delete some unused code"
* "benchmark"
* "fix ci"
* "fix op test"
* "fix uint16 missing"
* "fix ci"
* "follow comments"
* "fix ci"
* "follow comments"
* "conflicts. merge develop branch"
* repick
* "merge develop branch"
7 years ago
fengjiayi
dc8e0b494d
fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
mozga-intel
3ff9ba0e6b
Mkldnn layout ( #11040 )
...
* Add MKLDNN layout support in Paddle
Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded in all MKLDNN op kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitive, which
brought worse performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.
* Add MKLDNN layout support in activation OP
* Don't populate layout from input to output when kMKLDNN in
* Refine pool mkldnn op kernel
* MKLDNN layout
* Remove the inheritance from tensor file
* MKLDNN layout: refactoring
* Remove additional #define to register new operator
* Prepare mkldnn tests to work with layout
7 years ago
Yu Yang
14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
...
Add random crop op
7 years ago
fengjiayi
3bce3dbce1
fix a bug
7 years ago
chengduoZH
cb1c657ce4
move check_nan_inf to operator
7 years ago
Yiqun Liu
6084af47ef
Fix the bug when an input variable of an op is dispensable. ( #10268 )
...
* Fix the bug when an input variable of an op is dispensable.
* Add HasInputs/Outputs interfaces to OperatorBase.
* Remove the unreferenced header file.
7 years ago
Yang Yang(Tony)
81dfc0cf0e
Clean up unused code in operator class ( #10035 )
...
* delete unused IsNetOp() and Rename()
* rm OperatorBase::Rename implementation
* delete Operator::InputVars()
* remove unused OperatorBase::ShareLoD; ShareLoD has been implemented in infershape
* organize operatorbase; remove unused set_type
* add comments
* fix comment
7 years ago
Abhinav Arora
f09aed0475
Fix CPPLint issues in framework/data_transform framework/prune.cc ( #10178 )
...
* Fix CPPLint issues with data_transform
* Fix CPPLint issues with prune.cc
7 years ago
Qiao Longfei
4c55a6022a
Dist transpiler support prefetch ( #9714 )
...
* init
* add some check
* add dist transpile logic
* add insert op for block
* init change get_pserver_program
* optimize code
* fix a bug
* can run now
* start to do table split
* start to process table gradient
* complete pserver part
* can send_vars now
* revert cpplint
* fix a bug
* optimize code
* move dist test to models
* revert the interface of distribute_transpiler.transpile
* fix prefetch_block
* optimize transpiler code
* add comment to sum_op
* add warning log
* fix comment
* fix test_send_recv
* fix test_send_recv
* fix train with no distributed table
* optimize GetDims
7 years ago
qiaolongfei
e66bd4cb73
add GetDataTypeOfVar
7 years ago
Qiao Longfei
23bab34ca3
Fix data transform when inplace ( #9450 )
...
* fix data transform when op has inplace in/out
* add log
* should not delete scope because Compute may be async
* optimize code
7 years ago
Yibing Liu
90afbd2856
Move back operator's event to RunImpl()
7 years ago
Yu Yang
a8c076e577
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/shuffle_reader
7 years ago
Yu Yang
225efa671f
Remove dims in base class
7 years ago
Liu Yiqun
a8e8507767
Refine the profile codes for inference.
7 years ago
Yibing Liu
ee88855ddf
Fix the profiler's bug in multi-gpu mode
7 years ago
Abhinav Arora
c7ad26d6a4
[WIP] Move DataType enum inside VarType ( #8447 )
...
* Move Pod Types from DataType enum to Type enum
* Fixed data_type.h
* Fix type in TensorDesc
* Add comment to framework.proto
* Fixed type in data_type.h
* Updated format of type in data_type.h
* Fix var_desc.h
* Fix op_kernel_type.h
* Fixed data_type_transform_test.cc
* Fix operator.h
* Fixed data_type_transform.cc
* Fixed op_kernel_type_test.cc
* Fix operator.cc
* Fixed data_layout_transform_test.cc
* Fix var_desc.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* fixed protobuf.cc
* Fix data_layout_transform_test.cc and op_kernel_type_test.cc
* Fixed rnn_memory_helper_op.cc
* Fix program_desc_test.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fix operator_test.cc
* Fixed fill_constant_op.cc
* Fixed gaussian_random_op.cc
* Fixed uniform_random_op.cc
* Fixed edit_distance_op.cc
* Fixed fill_constant_batch_size_like_op.cc
* Fixed rnn_memory_helper_op.cc
* Fixed chunk_eval_op.cc
* Fixed assign_value_op.cc
* Fixed assign_value_op.h
* Fixed cast_op.h
* Fixed cast_op.h
* Fix fill constant op
* Fixed clang for assign_value_op.cc
* Fix one_hot_op.h
* Fix one_hot_op.cc
* Fix fill_op.cc
* Fixed sum_op.cc
* Fixed sum_op clang
* Fix uniform_random_op.cc
* Fix gaussian_random_op.cc
* Fix backward.cc
* Fix protobuf.cc
* Fixed prune_test.cc
* Fixed op_registry_test.cc
* Fix data_device_transform_test.cu
* Fix travis error
* Fixed one_hot_op.cu
* Fixed op_registry_test.cc
* Fixed nccl_op.cc
* Fixing python tests
* Revert "Fixing python tests"
This reverts commit fccaa4c5818ed9f379ea1ce4315066cc78076c64.
* Fixing Pybind to remove data type
* Fixing tensor.py
* Updated the new files:
* Resolve error in merge conflict of fill_constant_batch_size_like_op.cc
7 years ago
Abhinav Arora
fcadb45251
Separate VarType from VarDesc in framework.proto and fix all related compiler errors ( #8414 )
...
* Refine Type system
* Fixing type inference
* Fixed create_reader_op.cc
* Fix var_desc.h
* Fixed executor.cc
* Fix shape_inference.h
* Fixed create_reader_op.cc
* Fix tensor_util.h
* Fixed var_type_inference_test.cc
* Fix shape_inference.cc
* Fixed sum_op.cc
* Fixed read_op.cc
* Fix var_type.h
* Fixed beam_search_decode_op.cc
* sendrecvop_utils.cc
* Fix operator.cc
* Fixed lookup_table_op.cc
* Fixed op_desc.cc
* Fixed get_places_op.cc
* Fixed lod_rank_table_op.cc
* Fixed beam_search_op.cc
* Fix var_desc.cc
* Fixed lod_tensor_to_array_op.cc
* Fixed while_op.cc
* Fix program_desc_test.cc
* tensor_array_read_write_op.cc
* Fix assign_op.cc
* Fix executor.cc
* Fix protobuf.cc
* Fix protobuf.cc
7 years ago
qingqing01
24509f4af9
Fix the grammar in copyright. ( #8403 )
7 years ago
Yi Wang
40c7972d46
Merge branch 'develop' of http://github.com/paddlepaddle/paddle into operator_set_device
7 years ago
Yi Wang
a6c1bffa47
Merge with upstream
7 years ago
Yi Wang
fc374821dd
Correct #include path
7 years ago
Yi Wang
90648f336d
Move file to fluid/; Edit CMakeLists.txt
7 years ago