* Rename API of DeviceContext
Make them as usual names.
* Rename API of DeviceContext
Make them as usual names.
* Fix compile
* Fix compile
* Fix compile
* Fix compile
* Fix compile
* Use stable_sort in lod_rank_table
It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.
* Add LoDTensorArray
* Stash
* Better debug message for IsInitialized
* Stash
* Better debug message for IsInitialized
* Complete array read/write op unittests
* Simplize Gradient Check
* Stash
* Extract apply_backward_pass to backward.py
Rename apply_backward_pass to append_backward_ops
* Use graph API to check gradient
* Fix ci
* Fix CI
* Fix backward for double precision
* Stash
* Fix CI
* Fix ci
* Ignore GRU test
* Ignore xe op
* Fix CI
* Fix softmax with xe gradient
The correct equation should be IG = OG * (d_softmax_with_xe())
* Fix typo
* Fix merge error
* Disable LRN