History

Zeng Jinle 25ab23be28 Fix dygraph mem leak (#18082 ) * fix dygraph mem leak, test=develop * polish msg, test=develop		6 years ago
..
CMakeLists.txt	Fix dygraph mem leak (#18082 )	6 years ago
README.md	add doc	6 years ago
backward_strategy.h	test=develop, add gradient sort backward strategy (#17125 )	6 years ago
engine.cc	Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"	6 years ago
engine.h	Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"	6 years ago
flags.cc	Fix dygraph mem leak (#18082 )	6 years ago
flags.h	Fix dygraph mem leak (#18082 )	6 years ago
layer.cc	Fix dygraph mem leak (#18082 )	6 years ago
layer.h	Fix dygraph mem leak (#18082 )	6 years ago
nccl_context.cc	close socket connect (#17862 )	6 years ago
nccl_context.h	fix win gpu test=develop (#16694 )	6 years ago
nccl_context_test.cc	[Imperative] implement imperative NCCLParallelContext (#16477 )	6 years ago
profiler.cc	Polish code	6 years ago
profiler.h	1. Add imperative gperf profiler	6 years ago
tracer.cc	Feature/Fix recurrent usage of Varbase in Dygraph (#17838 )	6 years ago
tracer.h	Using Smart pointer to optimizer memory usage of dyGraph (#17768 )	6 years ago
type_defs.h	Feature/Fix recurrent usage of Varbase in Dygraph (#17838 )	6 years ago

README.md

Unescape Escape

Overview

Imperative Programming is easier to learn, debug and try new ideas.

Pytorch

https://pytorch.org/

TensorFlow Eager

https://www.tensorflow.org/guide/eager

Design

API

class Layer(object):

  def __call__(inputs):
    # build some parameter once.
    # ...
    return self.apply(inputs):

  def forward(inputs):
    # forward logic with paddle operators. backward auto-generated.


class PyLayer(core.PyLayer):

  def __call__(cls, inputs):
    # trace the logic.

  @staticmethod
  def forward(inputs):
    # any forward logic implemented with numpy io.

  @staticmethod
  def backward(inputs):
    # any backward logic implemented with numpy io.

Tracer

Current: Python Variable -> C++ VarBase -> C++ Variable -> C++ Tensor

Longer term.


# Parent class.
class PyVarBase(object):
  pass

# Current python variable.
class Variable(PyVarBase):
  pass

class IVariable(PyVarBase):
  def __init__(self):
    self._ivar = core.VarBase()

  # Move var to a device.
  def to(device): pass
  # Get var value.
  def value(): pass
  # Trigger backward.
  def backward(): pass
  # Get var's gradient value.
  def gradient_value(): pass
  # operators to override.

class Tracer {
 public:
  explicit Tracer(framework::BlockDesc* root_block) : root_block_(root_block) {}

  virtual ~Tracer() {}

  void Trace(OpBase* op,
             const std::map<std::string, std::vector<VarBase*>>& inputs,
             const std::map<std::string, std::vector<VarBase*>>& outputs,
             framework::BlockDesc* block, const bool stop_gradient = false);

  std::vector<VarBase*> PyTrace(OpBase* op, const std::vector<VarBase*>& inputs,
                                bool stop_gradient = false);
};

Trace forward operations
Perform quick shape/type infer, push kernel execution engine and return to user.
Perform autograd to generate gradients.
Clear trace.
Apply gradients with optimizers

Autodiff

Lots of research already. https://autodiff-workshop.github.io/ https://en.wikipedia.org/wiki/Automatic_differentiation

Basically, trace the forward execution, and perform autodiff when needed.

Can be triggered by backward().
Can select a block of code to trace and autodiff.
Use require_grad to drop some forward subgraph that doesn't need autodiff.

Execution Engine

Lazy execution of pushed C++ operations.

Device Placement

Operator executes on the inputs' device.
All inputs should live on the same device.
use Var.to() to explicitly move var to a device.

Save/Load Models

TODO

I/O

TODO

Refactor

All function layers with parameters converted to class Layers.
Existing models converted to imperative mode.
All op tests run once in static graph, once in imperative mode.

Examples

class MyLayer(fluid.imperative.Layer):
    def __init__(self):
        super(MyLayer, self).__init__()

    def forward(self, inputs):
        x = fluid.layers.relu(inputs)
        x = fluid.layers.elementwise_mul(x, x)
        x = fluid.layers.reduce_sum(x)
        return [x]


class MyPyLayer(fluid.imperative.PyLayer):
    def __init__(self):
        super(MyPyLayer, self).__init__()

    @staticmethod
    def forward(inputs):
        return np.tanh(inputs[0])

    @staticmethod
    def backward(inputs):
        return np.array(dout) * (1 - np.square(np.array(out)))


np_inp = np.ones([2, 2], np.float32)
with fluid.imperative.guard():
    my_py_layer = MyPyLayer()
    outs = my_py_layer(np_inp)
    dy_out = np.sum(outs[0]._numpy())
    outs[0]._backward()
    dy_grad = var_inp._gradient()


class MLP(fluid.imperative.Layer):
    def __init__(self):
        super(MLP, self).__init__()
        self._fc1 = FC(3,
                       fluid.ParamAttr(
                           initializer=fluid.initializer.Constant(value=0.1)))
        self._fc2 = FC(4,
                       fluid.ParamAttr(
                           initializer=fluid.initializer.Constant(value=0.1)))

    def forward(self, inputs):
        x = self._fc1(inputs)
        x = self._fc2(x)
        x = fluid.layers.reduce_sum(x)
        return x


 np_inp = np.array([[1.0, 2.0], [3.0, 4.0]], dtype=np.float32)
 with fluid.imperative.guard():
     var_inp = fluid.imperative.base.to_variable(np_inp)
     mlp = MLP()
     out = mlp(var_inp)
     dy_out = out._numpy()
     out._backward()

Plan

2.1，3 fulltime, Can run a few simple models. (Currently, 2 20% engs)

4.1, 4 fulltime, Can run 6 models, Performance 70% Pytorch. Release alpha.

6.1, 5 fulltime, Performance close to Pytorch, can run multi-devices. Release Beta.

8.1, 5 fulltime, Works in general. Update existing models. Can compile to static graph, support more optimizations.

12.1 Done.

Discussion

TODO.

README.md Unescape Escape