fengjiayi
24d51de022
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
fengjiayi
27df3a9f2b
make cross_entropy_op supporting tensors
7 years ago
Chen Weihang
b2435a3a11
Merge pull request #12374 from chenwhql/py_calc_memory
...
Add memory usage estimate API
7 years ago
fengjiayi
66be53264e
Merge pull request #12592 from JiayiFeng/fix_mac_compile_error
...
fix mac compile error
7 years ago
Yu Yang
cb79b0233e
Merge pull request #12595 from reyoung/fix_scale_loss_with_memopt
...
Fix bug when memopt optimize loss.grad and use ParallelExecutor
7 years ago
Yu Yang
c4f8afa258
Fix bug when memopt optimize loss.grad and use ParallelExecutor
7 years ago
fengjiayi
8e604a10aa
fix mac compile error
7 years ago
nhzlx
551c802cdc
merge develop
7 years ago
nhzlx
c69ae865db
fix comments
7 years ago
chengduo
7c8b69c700
Feature/op fusion ( #12240 )
...
* Add Preface
* Add demo code
* Save file
* Refine code
* seems can work
* use elementwise strategy
* Use ElementwiseComputeEx
* Add comments
* extract functions from operator
* Refine code
* Follow comment
* code refine
* follow comments
* follow comments
7 years ago
nhzlx
8f9e704f94
merge develop
7 years ago
nhzlx
3a0caf801f
modify trt engine op test
7 years ago
nhzlx
e51d045a6d
modify trt engine op test
7 years ago
Qiao Longfei
88e47e1e2d
Merge pull request #12570 from jacquesqiao/add-flag-to-disable-inference
...
add WITH_INFERENCE flag
7 years ago
nhzlx
e8954a36f5
merge develop
7 years ago
nhzlx
32a9e050bc
mapping the variable name inside the subgraph
7 years ago
Wu Yi
2d036c47cd
polish dist unit test code ( #12512 )
...
* polish dist se resnext ut
* update
* update
* update
* avoid cpu initializer differ
* change to use executor for now
* update by comment
* remove lr decay use para exe, should fix para exe bug later
* update by comment
7 years ago
qiaolongfei
9331ba752f
add WITH_INFERENCE flag
7 years ago
chengduo
97a77512b4
Fix the order of sum ( #12562 )
...
* fix the order of sum
* add doc
* check whether need to copy
* follow comments
7 years ago
fengjiayi
7834b4a470
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
Jeff Wang
4713f0a9e4
Simplify the travis script. ( #12557 )
...
* Simplify the travis script.
Now use docker to deploy documentations
* Check for the pull request
* Update paddle_build.sh
* Update paddle_build.sh
7 years ago
Qiao Longfei
88a607c342
Merge pull request #12541 from jacquesqiao/optimize-profiler
...
optimize profiler
7 years ago
qiaolongfei
954d680b40
fix test_parallel_do.py
7 years ago
Tao Luo
0fd2f713a4
Merge pull request #12548 from Superjomn/bugfix/disable-anakin-test
...
Bugfix/disable anakin test
7 years ago
fengjiayi
7c55e08c93
stash
7 years ago
superjomn
ebe1920626
add comment
7 years ago
superjomn
3c5e15de03
disable anakin test
7 years ago
fengjiayi
b656d97e86
Merge pull request #12485 from JiayiFeng/dev_ops_tensor_support
...
Make lookup_table_op and softmax_op supporting high rank tensor
7 years ago
qiaolongfei
52576c5f38
revert inference
7 years ago
qiaolongfei
1623f1ba4f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
qiaolongfei
903b2c0162
optimize code
7 years ago
Xin Pan
99a77cfc62
Merge pull request #12468 from panyx0718/improve_profiler2
...
Improve profiler
7 years ago
qiaolongfei
4c5bcd7859
add guard to profiler
7 years ago
qiaolongfei
d553e2ff3f
revert inference
7 years ago
qiaolongfei
a3f9d6a38c
optimize profiler
7 years ago
minqiyang
f9ef0ee8a9
Polish code
7 years ago
minqiyang
c4d000a990
Make code more efficient
7 years ago
minqiyang
9812bb8b48
Fix pserver can NOT start with DebugString problem
7 years ago
Zhaolong Xing
d7dd0868db
Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
...
Add tensorrt elementwise add
7 years ago
nhzlx
d50f776b27
merge develop
7 years ago
Bai Yifan
900d61dd98
Clean python api ( #12406 )
...
* api clean
* update API.spec
7 years ago
dzhwinter
0c8fde7dce
"cherry picked cpp tests" ( #12182 )
...
* "cherry picked cpp tests"
* "cherry picked"
* "cherry picked tests"
* "merge develop branch"
7 years ago
dzhwinter
595a2c83ae
explicit gradient of elementwise_add/elementwise_sub ( #11970 )
...
* "add gradient register"
* "make some enhance"
* "better format"
* "fix typo"
* "fix reuse"
* "fix get expected kernel"
* "change the mkldnn code"
* "fix mkldnn"
* "fix mkldnn failed test"
* "add comment"
7 years ago
nhzlx
64a08f840f
increase the test batch
7 years ago
Zhaolong Xing
f37f875f1f
Merge pull request #12349 from NHZlX/add_tensorrt_conv2d_converter
...
add conv2d trt converter
7 years ago
Zhaolong Xing
7e6bac3ea6
Merge pull request #12479 from NHZlX/fix_gtest_test_eq_warning
...
fix warning
7 years ago
fengjiayi
e7d8e16a66
update softmax_mkldnn_op
7 years ago
nhzlx
c7e6a11bc1
merge develop
7 years ago
nhzlx
0015df1b12
modify op converter for conv2d
7 years ago
Yu Yang
2567afa35d
Merge pull request #12462 from reyoung/feature/fix_cudnn_deterministic
...
Fix bug in cudnn_determistic
7 years ago