* don't expose numerous Tensor.set(), test=develop
* fix condition, test=develop
* fix float16 bug, test=develop
* feed should be Tensor or np.array, not Variable or number, test=develop
* use forcecast to copy numpy slice to new array, test=develop
* remove float16-uint16 hacking, test=develop
* Inception fusion operator.
* Support horizontal layer fusion in conv_fusion_op.
* Search conv algo strategy for variable-length input.
  Search N times and cache the searched algos. For any other input, choose the cached algo of the input whose area is closest to it.
* add lod_tensor util and modify pybind
* refine pybind LoDTensor API and modify LoDTensor and DataFeeder tests
* fix test error
* fix detection map op test
* fix reorder_lod_tensor test
* fix seq_concat_op
* fix chunk eval op test
* fix target assign op
* fix warp ctc op
* address comments step 1: reverse reset_lod op
* step 2: modify op test
* add warning message
* remove has_valid_lod
* add back has_valid_lod
* address comments
* add exception catching trial
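
The conv algo strategy noted above (search N times, cache the results, then reuse the algo of the closest-area input) can be sketched roughly as follows. This is only an illustration, not the code in conv_fusion_op: `ConvAlgoCache`, `search_fn`, and `search_times` are hypothetical names, and it assumes that "area" means height times width and that the N searches are spent on the first N distinct areas seen.

```python
class ConvAlgoCache:
    """Hypothetical cache mapping an input area (H * W) to a searched algo."""

    def __init__(self, search_fn, search_times):
        # search_fn stands in for the expensive algorithm search (assumption).
        self.search_fn = search_fn
        # search_times is the "N" from the note above.
        self.search_times = search_times
        self.algos = {}  # area -> searched algo

    def get(self, height, width):
        area = height * width
        # Exact hit: this area was searched before.
        if area in self.algos:
            return self.algos[area]
        # Still within the search budget: search once and cache the result.
        if len(self.algos) < self.search_times:
            algo = self.search_fn(height, width)
            self.algos[area] = algo
            return algo
        # Budget spent: reuse the algo of the cached input with the closest area.
        nearest = min(self.algos, key=lambda cached: abs(cached - area))
        return self.algos[nearest]
```

With `search_times=2`, the first two distinct input sizes trigger a search; every later size reuses whichever cached entry is nearest in area, so the search cost stays bounded for variable-length input.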