commit
e65960ede8
@ -1 +1,157 @@
|
|||||||
./doc/howto/dev/contribute_to_paddle_en.md
|
# Contribute Code
|
||||||
|
|
||||||
|
We sincerely appreciate your contribution. This document explains our workflow and work style.
|
||||||
|
|
||||||
|
## Workflow
|
||||||
|
|
||||||
|
PaddlePaddle uses this [Git branching model](http://nvie.com/posts/a-successful-git-branching-model/). The following steps guide usual contributions.
|
||||||
|
|
||||||
|
1. Fork
|
||||||
|
|
||||||
|
Our development community has been growing fastly; it doesn't make sense for everyone to write into the official repo. So, please file Pull Requests from your fork. To make a fork, just head over to the GitHub page and click the ["Fork" button](https://help.github.com/articles/fork-a-repo/).
|
||||||
|
|
||||||
|
1. Clone
|
||||||
|
|
||||||
|
To make a copy of your fork to your local computers, please run
|
||||||
|
|
||||||
|
```bash
|
||||||
|
git clone https://github.com/your-github-account/paddle
|
||||||
|
cd paddle
|
||||||
|
```
|
||||||
|
|
||||||
|
1. Create the local feature branch
|
||||||
|
|
||||||
|
For daily works like adding a new feature or fixing a bug, please open your feature branch before coding:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
git checkout -b my-cool-stuff
|
||||||
|
```
|
||||||
|
|
||||||
|
1. Commit
|
||||||
|
|
||||||
|
Before issuing your first `git commit` command, please install [`pre-commit`](http://pre-commit.com/) by running the following commands:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
pip install pre-commit
|
||||||
|
pre-commit install
|
||||||
|
```
|
||||||
|
|
||||||
|
Our pre-commit configuration requires clang-format 3.8 for auto-formating C/C++ code and yapf for Python.
|
||||||
|
|
||||||
|
Once installed, `pre-commit` checks the style of code and documentation in every commit. We will see something like the following when you run `git commit`:
|
||||||
|
|
||||||
|
```
|
||||||
|
➜ git commit
|
||||||
|
CRLF end-lines remover...............................(no files to check)Skipped
|
||||||
|
yapf.................................................(no files to check)Skipped
|
||||||
|
Check for added large files..............................................Passed
|
||||||
|
Check for merge conflicts................................................Passed
|
||||||
|
Check for broken symlinks................................................Passed
|
||||||
|
Detect Private Key...................................(no files to check)Skipped
|
||||||
|
Fix End of Files.....................................(no files to check)Skipped
|
||||||
|
clang-formater.......................................(no files to check)Skipped
|
||||||
|
[my-cool-stuff c703c041] add test file
|
||||||
|
1 file changed, 0 insertions(+), 0 deletions(-)
|
||||||
|
create mode 100644 233
|
||||||
|
```
|
||||||
|
|
||||||
|
1. Build and test
|
||||||
|
|
||||||
|
Users can build PaddlePaddle natively on Linux and Mac OS X. But to unify the building environment and to make it easy for debugging, the recommended way is [using Docker](https://github.com/PaddlePaddle/Paddle/blob/develop/doc/howto/dev/build_en.md).
|
||||||
|
|
||||||
|
1. Keep pulling
|
||||||
|
|
||||||
|
An experienced Git user pulls from the official repo often -- daily or even hourly, so they notice conflicts with others work early, and it's easier to resolve smaller conflicts.
|
||||||
|
|
||||||
|
```bash
|
||||||
|
git remote add upstream https://github.com/PaddlePaddle/Paddle
|
||||||
|
git pull upstream develop
|
||||||
|
```
|
||||||
|
|
||||||
|
1. Push and file a pull request
|
||||||
|
|
||||||
|
You can "push" your local work into your forked repo:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
git push origin my-cool-stuff
|
||||||
|
```
|
||||||
|
|
||||||
|
The push allows you to create a pull request, requesting owners of this [official repo](https://github.com/PaddlePaddle/Paddle) to pull your change into the official one.
|
||||||
|
|
||||||
|
To create a pull request, please follow [these steps](https://help.github.com/articles/creating-a-pull-request/).
|
||||||
|
|
||||||
|
If your change is for fixing an issue, please write ["Fixes <issue-URL>"](https://help.github.com/articles/closing-issues-using-keywords/) in the description section of your pull request. Github would close the issue when the owners merge your pull request.
|
||||||
|
|
||||||
|
Please remember to specify some reviewers for your pull request. If you don't know who are the right ones, please follow Github's recommendation.
|
||||||
|
|
||||||
|
|
||||||
|
1. Delete local and remote branches
|
||||||
|
|
||||||
|
To keep your local workspace and your fork clean, you might want to remove merged branches:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
git push origin :my-cool-stuff
|
||||||
|
git checkout develop
|
||||||
|
git pull upstream develop
|
||||||
|
git branch -d my-cool-stuff
|
||||||
|
```
|
||||||
|
|
||||||
|
### Code Review
|
||||||
|
|
||||||
|
- Please feel free to ping your reviewers by sending them the URL of your pull request via IM or email. Please do this after your pull request passes the CI.
|
||||||
|
|
||||||
|
- Please answer reviewers' every comment. If you are to follow the comment, please write "Done"; please give a reason otherwise.
|
||||||
|
|
||||||
|
- If you don't want your reviewers to get overwhelmed by email notifications, you might reply their comments by [in a batch](https://help.github.com/articles/reviewing-proposed-changes-in-a-pull-request/).
|
||||||
|
|
||||||
|
- Reduce the unnecessary commits. Some developers commit often. It is recommended to append a sequence of small changes into one commit by running `git commit --amend` instead of `git commit`.
|
||||||
|
|
||||||
|
|
||||||
|
## Coding Standard
|
||||||
|
|
||||||
|
### Code Style
|
||||||
|
|
||||||
|
Our C/C++ code follows the [Google style guide](http://google.github.io/styleguide/cppguide.html).
|
||||||
|
|
||||||
|
Our Python code follows the [PEP8 style guide](https://www.python.org/dev/peps/pep-0008/).
|
||||||
|
|
||||||
|
Our build process helps to check the code style. In [`build.sh`](https://github.com/PaddlePaddle/Paddle/blob/b84e8226514b8bb4405c3c28e54aa5077193d179/paddle/scripts/docker/build.sh#L42), the entry point of our [builder Docker image](https://github.com/PaddlePaddle/Paddle/blob/b84e8226514b8bb4405c3c28e54aa5077193d179/Dockerfile#L88), the CMake argument `WITH_STYLE_CHECK` is set to `ON` by default. This flag is on
|
||||||
|
|
||||||
|
Please install pre-commit, which automatically reformat the changes to C/C++ and Python code whenever we run `git commit`. To check the whole codebase, we can run the command `pre-commit run -a`, as in the [`check_style.sh` file](https://github.com/PaddlePaddle/Paddle/blob/b84e8226514b8bb4405c3c28e54aa5077193d179/paddle/scripts/travis/check_style.sh#L30), which is invoked by [our Travis CI configuration](https://github.com/PaddlePaddle/Paddle/blob/b84e8226514b8bb4405c3c28e54aa5077193d179/.travis.yml#L43).
|
||||||
|
|
||||||
|
### Unit Tests
|
||||||
|
|
||||||
|
Please remember to add related unit tests.
|
||||||
|
|
||||||
|
- For C/C++ code, please follow [`google-test` Primer](https://github.com/google/googletest/blob/master/googletest/docs/Primer.md).
|
||||||
|
|
||||||
|
- For Python code, please use [Python's standard `unittest` package](http://pythontesting.net/framework/unittest/unittest-introduction/).
|
||||||
|
|
||||||
|
|
||||||
|
### Writing Logs
|
||||||
|
|
||||||
|
We use [glog](https://github.com/google/glog) for logging in our C/C++ code.
|
||||||
|
|
||||||
|
For general information, please use `LOG`. For debug information, please use [`VLOG`](http://htmlpreview.github.io/?https://github.com/google/glog/blob/master/doc/glog.html#verbose). The reason is at [here](https://groups.google.com/a/chromium.org/d/msg/chromium-dev/3NDNd1KzXeY/AZKMMx37fdQJ).
|
||||||
|
|
||||||
|
`VLOG` requires a *verbose level* parameter. For example:
|
||||||
|
|
||||||
|
```c++
|
||||||
|
VLOG(3) << "Operator FC is taking " << num_inputs << "inputs."
|
||||||
|
```
|
||||||
|
|
||||||
|
When we run a PaddlePaddle application or test, we can specify a verbose threshold. For example:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
GLOG_vmodule=buddy_allocator=2 \
|
||||||
|
GLOG_v=10 \
|
||||||
|
python \
|
||||||
|
../python/paddle/v2/framework/tests/test_recurrent_op.py
|
||||||
|
```
|
||||||
|
|
||||||
|
This will enable VLOG messages generated by `buddy_allocator.{h,cc}` and in the verbose range of 0 to 3, so you will see above example VLOG message, which is in level 3. This suggests that we output overall messages in lower verbose levels, so they display with higher probability. When coding C++, please follow the verbose level convention as follows:
|
||||||
|
|
||||||
|
- verbose level 1: [framework](https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/framework)
|
||||||
|
- verbose level 3: [operators](https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/operators)
|
||||||
|
- verbose level 5: [memory](https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/memory), [platform](https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/platform)
|
||||||
|
- verbose level 7: [math](https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/math)
|
||||||
|
@ -0,0 +1,48 @@
|
|||||||
|
# Benchmark
|
||||||
|
|
||||||
|
Machine:
|
||||||
|
|
||||||
|
- Server
|
||||||
|
- Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz, 2 Sockets, 20 Cores per socket
|
||||||
|
- Laptop
|
||||||
|
- DELL XPS15-9560-R1745: i7-7700HQ 8G 256GSSD
|
||||||
|
- i5 MacBook Pro (Retina, 13-inch, Early 2015)
|
||||||
|
- Desktop
|
||||||
|
- i7-6700k
|
||||||
|
|
||||||
|
System: CentOS release 6.3 (Final), Docker 1.12.1.
|
||||||
|
|
||||||
|
PaddlePaddle: paddlepaddle/paddle:latest (TODO: will rerun after 0.11.0)
|
||||||
|
|
||||||
|
- MKL-DNN tag v0.10
|
||||||
|
- MKLML 2018.0.20170720
|
||||||
|
- OpenBLAS v0.2.20
|
||||||
|
|
||||||
|
On each machine, we will test and compare the performance of training on single node using MKL-DNN / MKLML / OpenBLAS respectively.
|
||||||
|
|
||||||
|
## Benchmark Model
|
||||||
|
|
||||||
|
### Server
|
||||||
|
Test on batch size 64, 128, 256 on Intel(R) Xeon(R) Gold 6148M CPU @ 2.40GHz
|
||||||
|
|
||||||
|
Input image size - 3 * 224 * 224, Time: images/second
|
||||||
|
|
||||||
|
- VGG-19
|
||||||
|
|
||||||
|
| BatchSize | 64 | 128 | 256 |
|
||||||
|
|--------------|-------| -----| --------|
|
||||||
|
| OpenBLAS | 7.82 | 8.62 | 10.34 |
|
||||||
|
| MKLML | 11.02 | 12.86 | 15.33 |
|
||||||
|
| MKL-DNN | 27.69 | 28.8 | 29.27 |
|
||||||
|
|
||||||
|
|
||||||
|
chart on batch size 128
|
||||||
|
TBD
|
||||||
|
|
||||||
|
- ResNet
|
||||||
|
- GoogLeNet
|
||||||
|
|
||||||
|
### Laptop
|
||||||
|
TBD
|
||||||
|
### Desktop
|
||||||
|
TBD
|
@ -0,0 +1,49 @@
|
|||||||
|
include(ExternalProject)
|
||||||
|
|
||||||
|
set(NCCL_SOURCE_DIR ${THIRD_PARTY_PATH}/nccl)
|
||||||
|
|
||||||
|
include_directories(${NCCL_SOURCE_DIR}/src/extern_nccl/src)
|
||||||
|
|
||||||
|
if(WITH_DSO)
|
||||||
|
# If we use DSO, we do not build nccl, just download the dependencies
|
||||||
|
set(NCCL_BUILD_COMMAND "")
|
||||||
|
set(NCCL_INSTALL_COMMAND "")
|
||||||
|
set(NCCL_INSTALL_DIR "")
|
||||||
|
else()
|
||||||
|
# otherwise, we build nccl and link it.
|
||||||
|
set(NCCL_INSTALL_DIR ${THIRD_PARTY_PATH}/install/nccl)
|
||||||
|
# Note: cuda 8.0 is needed to make nccl
|
||||||
|
# When cuda is not installed on the system directory, need to set CUDA_HOME to your cuda root
|
||||||
|
set(NCCL_BUILD_COMMAND "make -j 8")
|
||||||
|
set(NCCL_INSTALL_COMMAND "make install PREFIX=${NCCL_INSTALL_DIR}")
|
||||||
|
endif()
|
||||||
|
|
||||||
|
ExternalProject_Add(
|
||||||
|
extern_nccl
|
||||||
|
${EXTERNAL_PROJECT_LOG_ARGS}
|
||||||
|
GIT_REPOSITORY "https://github.com/NVIDIA/nccl.git"
|
||||||
|
GIT_TAG "v1.3.4-1"
|
||||||
|
PREFIX "${NCCL_SOURCE_DIR}"
|
||||||
|
UPDATE_COMMAND ""
|
||||||
|
CONFIGURE_COMMAND ""
|
||||||
|
BUILD_COMMAND "${NCCL_BUILD_COMMAND}"
|
||||||
|
INSTALL_COMMAND "${NCCL_INSTALL_COMMAND}"
|
||||||
|
INSTALL_DIR "${NCCL_INSTALL_DIR}"
|
||||||
|
TEST_COMMAND ""
|
||||||
|
)
|
||||||
|
|
||||||
|
if(WITH_DSO)
|
||||||
|
if(${CMAKE_VERSION} VERSION_LESS "3.3.0")
|
||||||
|
set(dummyfile ${CMAKE_CURRENT_BINARY_DIR}/lib_nccl_dummy.c)
|
||||||
|
file(WRITE ${dummyfile} "const char * dummy_nccl = \"${dummyfile}\";")
|
||||||
|
add_library(nccl STATIC ${dummyfile})
|
||||||
|
else()
|
||||||
|
add_library(nccl INTERFACE)
|
||||||
|
endif()
|
||||||
|
else()
|
||||||
|
add_library(nccl STATIC IMPORTED GLOBAL)
|
||||||
|
set_property(TARGET nccl PROPERTY IMPORTED_LOCATION
|
||||||
|
${NCCL_INSTALL_DIR}/lib/libnccl_static.a)
|
||||||
|
endif()
|
||||||
|
|
||||||
|
add_dependencies(nccl extern_nccl)
|
File diff suppressed because it is too large
Load Diff
Binary file not shown.
After Width: | Height: | Size: 57 KiB |
@ -0,0 +1,23 @@
|
|||||||
|
# Executor Design Doc
|
||||||
|
|
||||||
|
## Motivation
|
||||||
|
|
||||||
|
We use executor to do the runtime evaluation of a `ProgramDesc`.
|
||||||
|
|
||||||
|
## Overview
|
||||||
|
|
||||||
|
An executor takes a `ProgramDesc`, a `block_id` and a `Scope`. The `ProgramDesc` is a list of blocks and each block contains the protobuf definition of all the parameters and operators. The `block_id` specifies the entrance block. And the `Scope` is the container of all the variable instance, which is persistent throughout different runs.
|
||||||
|
|
||||||
|
### What does executor do?
|
||||||
|
|
||||||
|
It evaluates all the operators in the `block_id`th block of a `ProgramDesc`.
|
||||||
|
|
||||||
|
### What does executor NOT do?
|
||||||
|
|
||||||
|
It does not do runtime optimization, meaning intelligently parse the dependency of each op a choose which one to be run and in which order they should be run.
|
||||||
|
|
||||||
|
It does not do graph partitioning, meaning dividing the `ProgramDesc` into several small pieces and executing them on different devices.
|
||||||
|
|
||||||
|
## Implementation
|
||||||
|
|
||||||
|
`Executor` evaluates a `ProgramDesc`. Essentially, it instantiates Variables and Operators, then run all the operators in sequence. [[code]](https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/framework/executor.cc)
|
Some files were not shown because too many files have changed in this diff Show More
Loading…
Reference in new issue