tensor-tang
2dd331cc21
Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
...
test=develop
6 years ago
tensor-tang
316636404f
add seqpool concat unit test
6 years ago
Yan Chunwei
6ccf8685f7
refactor tensorrt node teller ( #15181 )
6 years ago
Tao Luo
7dc0181c46
run analyzer_tester serial in multi-thread
...
test=develop
6 years ago
xiaolil1
c8f101e5da
Conv int8 relu ( #15130 )
...
* Enable basic MKL-DNN INT8 Conv OP
test=develop
* Modify test case
test=develop
* Clean unittest code
test=develop
* Fix test
test=develop
* Modify test
test=develop
* Enable MKL-DNN INT8 Conv with Relu Fusion OP
test=develop
* Modify basic INT8 Conv
test=develop
* fix type
test=develop
* Modify test
test=develop
6 years ago
sneaxiy
9793a0b6a6
fix_cudnn_compatible_check
6 years ago
Zeng Jinle
ccb322d6a5
merge develop
6 years ago
Xin Pan
0d0bc61248
update api
...
test=develop
6 years ago
tensor-tang
7923d7271f
add fusion seqpool concat op
6 years ago
Zeng Jinle
f3a13512fc
Merge pull request #15139 from sneaxiy/remove_op_handle_lock
...
Remove op handle lock
6 years ago
Qiao Longfei
44b300556d
change min_row_size_to_use_multithread to parameter of adam
...
test=develop
6 years ago
Qiao Longfei
87b4eb1da4
change min_param_size_to_use_multithread to min_row_size_to_use_multithread
6 years ago
minqiyang
0f94c1ac14
Polish code
...
test=develop
6 years ago
minqiyang
00e4de04bf
Polish code
6 years ago
minqiyang
4bfa110fd8
Add no lock optimize pass
...
test=develop
6 years ago
Qiyang Min
1df2399e00
Merge pull request #15180 from velconia/add_pyramid_dnn_support
...
Add JeMalloc
6 years ago
chengduo
eabb2105fa
Refactor MultiDevSSAGraphBuilder ( #15090 )
...
* Refactor ParallelExecutor
test=develop
* extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
test=develop
* Refactor MultiDevSSAGraphBuilder
test=developt
* Remove enable_data_balance
test=develop
* code refine
test=develop
* remove data balance
test=develop
* refine ScaleLossGradOp
test=develop
* remove uncessary file
test=develop
* code refine
test=develop
* modify function name
test=develop
* follow comments
test=develop
* add is_distribution field
test=develop
* set is_distribution
test=develop
* fix DistSSAGraphBuilder
test=develop
6 years ago
Yan Chunwei
875a07c32d
refactor inference analysis api ( #14634 )
6 years ago
minqiyang
c09a379015
remove const_cast
...
test=develop
6 years ago
tensor-tang
102d93712e
Merge remote-tracking branch 'ups/develop' into jit/seqpool
...
test=develop
6 years ago
tensor-tang
123b98f417
refine heigth and codesize and support all pool
...
test=develop
6 years ago
tensor-tang
0145f40f45
use height from params of jitcode
6 years ago
tensor-tang
e0591deebc
enhance seqpool jitcode
6 years ago
Zeng Jinle
99e6e8b00f
Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
...
Fix crf grad lod share
6 years ago
minqiyang
db8eb9b688
Polish code
...
test=develop
6 years ago
minqiyang
f4c990e7b8
Add fused embedding ops
6 years ago
minqiyang
39b98709b1
Move fused ops to fused dir
...
test=develop
6 years ago
minqiyang
920d4a8b78
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
...
test=develop
6 years ago
minqiyang
b2716909b4
Add changes to paddle_build
...
test=develop
6 years ago
Tao Luo
5ee596cae5
Merge pull request #15175 from baojun-nervana/intel/mkldnn
...
Update ngraph to resolve issue with mkldnn upgrade
6 years ago
乔龙飞 Qiao Longfei
7c891e1ecc
Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
...
Fix adam tmp var on cpu
6 years ago
mozga-intel
e77956c920
Enable mean operator for a ngraph
...
test=develop
6 years ago
mozga-intel
dd768714ab
Enable scale operator for a ngraph
...
test=develop
6 years ago
sneaxiy
be425461a1
fix crf grad lod share
...
test=develop
6 years ago
Qiao Longfei
3e1b914fcb
update gru op forward kernel
6 years ago
Qiao Longfei
7a81ab8607
complete gru_unite_op and test
6 years ago
Qiao Longfei
72618c8da5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
6 years ago
Qiao Longfei
17b1b660fc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
...
test=develop
6 years ago
Qiao Longfei
c15270c5b2
optimize multi thread adam
6 years ago
乔龙飞 Qiao Longfei
e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
...
Feature/add prefech hs
6 years ago
baojun-nervana
f0cde74564
Update ngraph with elt-wise relu test=develop
6 years ago
tensor-tang
92201d3956
support avg and sqrt pool and add mkl impl
...
test=develop
6 years ago
tensor-tang
c50060bb26
add jitcode impl and use it
6 years ago
tensor-tang
142bb41748
add seqpool jitkernel test and benchmark
6 years ago
tensor-tang
e58a569c6c
use seqpool jitkernel
6 years ago
tensor-tang
3e01a4048f
add refer seqpool jitkernel
6 years ago
Qiao Longfei
4ecb9c93f0
update API.spec
...
test=develop
6 years ago
Xin Pan
f1c7f4b016
Merge pull request #15142 from tianshuo78520a/tools
...
test=develop
6 years ago
xiaolil1
bbc9336878
Enable basic MKL-DNN INT8 Conv OP ( #15124 )
...
* Enable basic MKL-DNN INT8 Conv OP
test=develop
* Modify test case
test=develop
* Clean unittest code
test=develop
* Fix test
test=develop
* Modify test
test=develop
* Modify basic INT8 Conv
test=develop
6 years ago
Xin Pan
8ae9094e07
polish and resolve conflicts
...
test=develop
6 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
...
test=develop
6 years ago
Qiao Longfei
e10af895de
update gru grad op
...
test=develop
6 years ago
Qiao Longfei
78ec7c0f99
gru add origin mode
...
test=develop
6 years ago
peizhilin
c919b2f31d
Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
6 years ago
peizhilin
fd4f4d0e5f
fix build issue test=develop
6 years ago
Yan Xu
a1e60ab19b
Merge pull request #14791 from Yancey1989/parallel_graph_mode
...
[Feature] Add ParallelGraph executor mode in parallelexecutor to improve performance
6 years ago
peizhilin
9ae50dd07d
fix gpu buils issue on windows test=develop
6 years ago
Qiao Longfei
0e747e8d02
change the limit of thead num
6 years ago
qingqing01
c981bf0f9d
Fix compling error with cuDNN v5 ( #15148 )
...
test=develop
6 years ago
Yancey1989
4ad9de74dd
disable sync nccl by default test=develop
6 years ago
Yancey1989
449bf58ea6
disable parallelgraph mode by default test=develop
6 years ago
Yancey1989
db603398b7
disable parallel graph executor by default
6 years ago
tianshuo78520a
af61582543
test=develop
6 years ago
wopeizl
67093da398
Merge pull request #15122 from wopeizl/windows/fixhuberloss
...
fix the huber loss compile issue on windows test=develop
6 years ago
sneaxiy
d0a8a1e950
remove_op_handle_lock
...
test=develop
6 years ago
Xin Pan
087af6a686
Merge pull request #15131 from panyx0718/clean
...
hide temp tensor allocation
6 years ago
Yancey1989
e65436103f
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
6 years ago
Yancey1989
94c80347b6
update by comment
6 years ago
sneaxiy
6f06e6cdac
Merge remote origin
...
test=develop
6 years ago
Qiyang Min
23761beaef
Merge pull request #14971 from velconia/imperative_mnist
...
Imperative Optimizer
6 years ago
xiaolil1
8eb1f26211
Enable INT8 pool OP ( #15046 )
...
* Enable INT8 pool OP
test=develop
* fix unittest
test=develop
* Clean unittest code.
test=develop
6 years ago
Wu Yi
227e0c4518
fix nccl2 mode startup test=develop ( #15132 )
6 years ago
Xin Pan
9186451f60
hide GetTensor
...
test=develop
6 years ago
wopeizl
7305fc2ff9
Merge pull request #15112 from wopeizl/windows/fixsaveandloadops
...
fix save and load ops on windows test=develop
6 years ago
tensor-tang
a1d709572d
Merge pull request #15108 from tensor-tang/refine/seqpool
...
Refine/seqpool with test data
6 years ago
peizhilin
dba009dbbf
fix script issue
...
test=develop
6 years ago
peizhilin
cd2d60b4c8
fix build issue for density prior box op on windows test=develop
6 years ago
Yancey1989
35cda13e9f
fix unittest test=develop
6 years ago
peizhilin
1f423f84ac
fix the huber loss compile issue on windows test=develop
6 years ago
sneaxiy
d25395fc98
remove tensor core lock
...
test=develop
6 years ago
tensor-tang
516fe301ee
add comment in case of empty name
...
test=develop
6 years ago
peizhilin
b3688100ad
fix unittest
...
test=develop
6 years ago
tianshuo78520a
bf518ec872
update CI rules for checking change of python reference ( #15104 )
...
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
6 years ago
minqiyang
2547f9d1b8
Polish code
...
test=develop
6 years ago
tensor-tang
b9c645639b
workaround with third party cache
...
test=develop
6 years ago
peizhilin
5d8f281397
restore the memory mode
...
test=develop
6 years ago
tensor-tang
c02165d23a
Merge remote-tracking branch 'ups/develop' into refine/seqpool
...
test=develop
6 years ago
tensor-tang
dca68cdf97
throw error when name not find
...
test=develop
6 years ago
peizhilin
33b7821a75
fix save and load ops on windows test=develop
6 years ago
Qiao Longfei
dfe85fb358
fix build
6 years ago
Qiao Longfei
f057bbd1d1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-adam-tmp-var
6 years ago
Qiao Longfei
f1c973b014
adam op should not create tmp var in compute
6 years ago
Yancey1989
82b42e31f0
polish unittest test=develop
6 years ago
wopeizl
10bedbdeaa
Merge pull request #15105 from wopeizl/windows/fixtimer
...
fix timer test=develop
6 years ago
minqiyang
09e2e66236
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
Yancey1989
0a885ac12a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
6 years ago
Yancey1989
ca8c77d966
selecte execution according to strategy test=develop
6 years ago
tensor-tang
484085693e
update url and num_ops
...
test=develop
6 years ago
tensor-tang
cd94df8679
fix load and refine
6 years ago
tensor-tang
8e271896ae
add test data for seqpool1
6 years ago
minqiyang
858e903231
Add unittest for operator
...
test=develop
6 years ago
gongweibao
ce70229ba6
Add max_body_size flags to brpc ( #15084 )
6 years ago
qingqing01
6f0a1d7b47
Inception fusion operator. ( #14968 )
...
* Inception fusion operator.
* Support horizontal layer fusion in conv_fusion_op.
* Search conv algo strategy for variable-length input.
search N times and cache the searched algos. For other input, choose the algo of input whose area is closest to this input.
6 years ago
peizhilin
813c2ce539
fix timer test=develop
6 years ago
Qiao Longfei
25d44d40ac
sum op support empty selected rows as input
6 years ago
wopeizl
7ab501264d
Merge pull request #15069 from wopeizl/windows/dsosupport
...
add cuda dso support for windows
6 years ago
minqiyang
6a5f604607
Support stop_gradients var in imperative backward
...
test=develop
6 years ago
guru4elephant
ff739449ab
Merge pull request #15018 from guru4elephant/add_timer
...
Add debug thread function for async executor
6 years ago
Qiyang Min
e29cbfe4f7
Merge pull request #14829 from velconia/accelerate_ddpg
...
Accelerate little models
6 years ago
Tao Luo
9c2cbfb89e
Merge pull request #15093 from baojun-nervana/intel/cmake
...
Upgrade ngraph & clean up cmake
6 years ago
Zeng Jinle
25b49a0896
Merge pull request #14933 from sneaxiy/rewrite_ddim
...
Rewrite ddim
6 years ago
Wu Yi
a8bc05b5ff
Refactor distributed RPC ( #15075 )
...
* wip
* wip
* refactor no.1 dir structure test=develop
* fix linking test=develop
* fix includes test=develop
* fix build test=develop
* fix build test=develop
6 years ago
baojun-nervana
555fbc10d8
upgrade ngraph to v0.10.1 test=develop
6 years ago
baojun-nervana
c714c36482
simplify logic test=develop
6 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
6bb84490af
Fix imperative unit test
...
test=develop
6 years ago
Xin Pan
3e8408429d
Merge pull request #15053 from panyx0718/imperative_hold
...
refactor to avoid scope.
6 years ago
sneaxiy
73896eeb94
merge develop
...
test=develop
6 years ago
Wu Yi
e26cced7cc
refine batch merge pass ( #14777 )
...
* refine batch merge pass
* refine batch merge pass test=develop
6 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
6 years ago
Yancey1989
4743c9cd5d
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
sneaxiy
9a3a246cb5
fix py35 compile error
...
test=develop
6 years ago
Xin Pan
f7294f8b25
register float16
...
test=develop
6 years ago
Zhaolong Xing
4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
...
Add conv+ affine channel fuse pass
6 years ago
minqiyang
ef7d563db9
Add changes back
...
test=develop
6 years ago
Yu Yang
efa630eadb
Refine Dockerfile ( #14908 )
...
* Refine Dockerfile
* Add tasks, cmake gen
* Fix code error
* Disable compile after paddle_build.sh
* Refine
* Skip on PY35 CI
* Change env
* Refine paddle_build.sh
* Expose gen_fluid_lib
* Refine mkldnn.cmake
* Refine mkldnn.cmake
* Refine mkldnnlib
* Skip unstable tests
6 years ago
minqiyang
a318a490ab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
...
test=develop
6 years ago
Zeng Jinle
c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
...
Rewrite variable type
6 years ago
chengduo
fe8495a758
[WIP] Refine MultiDevSSAGraph ( #15040 )
...
* refine parallel_exe
test=develop
* rename shared_var_device
* code refine
* add test_weight_decay
* remove Sort
test=develop
* Add SortForReduce
test=develop
* code refine
test=develop
* follow comment
test=develop
6 years ago
Qiao Longfei
d161215332
optimize adam multi thread
6 years ago
dongdaxiang
82335cd88c
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
Tao Luo
85471533e0
Merge pull request #15079 from luotao1/analysis_test
...
simplify analysis tests
6 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
5822f7f1d8
Polish code
...
test=develop
6 years ago
wopeizl
719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
...
fix test issues on windows
6 years ago
Qiao Longfei
7a58ad5c79
lazy mode have higher priority then multithread
...
test=develop
6 years ago
Xin Pan
c132c79011
address comments and resolve conflicts.
...
test=develop
6 years ago
Xin Pan
b91a7a9d30
clear operator changes
...
test=develop
6 years ago
Xin Pan
f52b514dcd
call kernel
6 years ago
Xin Pan
4e80e04f23
fix
...
test=develop
6 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
6 years ago
Xin Pan
61491ce250
clean
...
test=develop
6 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
6 years ago
Qiyang Min
0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
...
Accelerate PADDLE_ENFORCE
6 years ago
Houjiang Chen
242d3c71a6
Merge pull request #15031 from hjchen2/develop
...
Fix conv_elementwise_add2_act pass
6 years ago
Qiao Longfei
d0572bf02e
add log for lazy mode test=develop
6 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
6 years ago
minqiyang
68e9b841ab
Add support for optimizer
6 years ago
Qiao Longfei
1177b0bc84
update multi thread adam
6 years ago
Qiao Longfei
3b294e2e2e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Zeng Jinle
988bc2b5a7
Merge pull request #15060 from dzhwinter/fix/nccl
...
fix ci error. test=develop
6 years ago
sneaxiy
c4ce2e7b21
merge develop, solve conflict
...
test=develop
6 years ago
minqiyang
8ed0233924
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
...
test=develop
6 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
6 years ago
tensor-tang
656c672cdd
Merge pull request #15051 from tensor-tang/test/seq_pool1
...
Add seq pool1 inference test
6 years ago
Tao Luo
ecae157edf
simplify some data record in analyzer_tester
...
test=develop
6 years ago
sneaxiy
b56aca82e9
merge develop
...
test=develop
6 years ago
Tao Luo
05f1b65da3
simplify prepere_input in analyzer_test
...
test=develop
6 years ago
sneaxiy
ee83ce75bf
try to fix py35 compile error
...
test=develop
6 years ago
sneaxiy
10a6bc9675
modify API.spec
...
test=develop
6 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
6 years ago
nhzlx
02e17396c2
fix comments
...
test=develop
6 years ago
jerrywgz
ef2d292bfc
Merge pull request #14956 from jerrywgz/fix_bug_in_ifelse
...
fix bug in if-else op
6 years ago
Yancey1989
1a4f79a7de
fix unittest test=develop
6 years ago
peizhilin
e49276e731
restore the huber_loss_op
...
test=develop
6 years ago
Yancey1989
86bb583881
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989
495e73d766
enable gc
6 years ago
Yancey1989
28cdfbc2b0
delete comment code
6 years ago
Yancey1989
845bfd5807
cleanup code
6 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb.
6 years ago
nhzlx
71636e677d
add min_subgraph_size attr to tensorrt config
...
test=develop
6 years ago
peizhilin
01c00b07dd
fix test issues on windows
...
test=develop
6 years ago
peizhilin
1e7f83e60a
add cuda dso support for windows
...
test=develop
6 years ago
tangwei12
dc8eca826e
code style fix, test=develop ( #15045 )
...
* code style fix, test=develop
6 years ago
Yancey1989
41a64f6a2a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Jiabin Yang
55e3c6518e
Merge pull request #15024 from JiabinYang/fix/ci_requirements_install
...
test=develop, fix ci by install requirement and add pip install valid…
6 years ago
nhzlx
a6aa8ea771
faster rcnn input is presistable. (fix it in paddle-trt)
...
test=develop
6 years ago
hjchen2
956cf92145
Fix conv_elementwise_add2_act pass
...
test=develop
6 years ago
Tao Luo
69659f4ae2
Merge pull request #15037 from jianhang-liu/fix/abnormal_stack_op_time
...
Fix/abnormal stack op time
6 years ago
whs
2314f2ebb3
Make topk op support variable k. ( #15044 )
...
* Make topk op support variable k.
test=develop
* Fix tensor type.
test=develop
6 years ago
sneaxiy
179acc60b3
fix conflict with develop
...
test=develop
6 years ago
wopeizl
09bd8fa67a
Merge pull request #15035 from wopeizl/debug/improvement1
...
cherry-pick the #12759
6 years ago
sneaxiy
dde3afe7b7
Merge develop
...
test=develop
6 years ago
dzhwinter
3ea2f415dc
fix ci error. test=develop
6 years ago
dongdaxiang
2df1d80767
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
Wu Yi
856f0da0fe
Fp16 training ( #14992 )
...
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
* make fp16 lr schedule simple test=develop
* fix ut test=develop
* fix tests test=develop
* remove fp16 learning rate cast test=develop
6 years ago
Brian Liu
e821b12f57
Fix issue which cause abnormal CPU usage in stack op
...
Stack OP has much higher CPU cost than expected in release mode.
Caused by DebugStringEx() in base class OperatorWithKernel. Actually
this issue occur for each OP which hasn't implement it's own
GetExpectedKernelType().
test=develop
6 years ago
chengduo
b9fb03cf54
Move GetTensor to tensor_util ( #15011 )
...
* refine tensor
test=develop
* refine tensor
test=develop
* fix device_context log
test=develop
6 years ago
Yihua Xu
0b0acfaa88
Add mkldnn item for porfile and compare usage.
...
test=develop
6 years ago
Yihua Xu
dbb90a76f0
Merge remote-tracking branch 'paddle/develop' into develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
tensor-tang
d46a140dd9
add seq pool inference test
...
test=develop
6 years ago
tensor-tang
d4931a2abc
support more input fake data
6 years ago
nhzlx
73b47df1f4
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
...
test=develop
6 years ago
nhzlx
ce3782c193
add affine_channel fuse.
...
fix conv+elemenwise fuse bug.
6 years ago
peizhilin
170e78b397
restore the top-k
...
test=develop
6 years ago
dongdaxiang
ab2abfc5b2
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
Tao Luo
bc16bcda49
Merge pull request #14998 from luotao1/mm_dnn
...
add MM_DNN inference test
6 years ago
JiabinYang
fa33eae9aa
test=develop, fix python exetension on python3.x
6 years ago
Qiyang Min
aba1f9b06e
Merge pull request #14891 from velconia/accelerate_adam
...
Remove BinarySearch from Adam Op (CPU part)
6 years ago
minqiyang
8ec3d863b0
Fix throw_on_error direct call bug
...
test=develop
6 years ago
peizhilin
e05fb128bc
fix code style
...
test=develop
6 years ago
peizhilin
7f6d8acecb
cherry-pick the #12759
...
test=develop
6 years ago
sneaxiy
3a2afbf02e
polish code
...
test=develop
6 years ago
dongdaxiang
4cb833d2de
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
JiabinYang
3e40c79c4f
test=develop, using absolute dir
6 years ago
tensor-tang
05d1121b22
Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
...
Enable fill_constant operator for a ngraph engine
6 years ago
tensor-tang
9d4f1d468a
Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
...
Enable top_k operator for a ngraph engine
6 years ago
tensor-tang
8a6ac4dba7
Merge pull request #14973 from xiaolil1/dequantize
...
Add Dequantize OP
6 years ago
tensor-tang
f0e02a65ed
Merge pull request #14974 from xiaolil1/quantize
...
Add Quantize OP
6 years ago
Tao Luo
91408e3122
fix analyzer_mm_dnn_tester fails when bs > 1
...
test=develop
6 years ago
Tao Luo
f01c966800
Merge branch 'develop' into mm_dnn
6 years ago
sneaxiy
68d91cd594
add copy ctor
...
test=develop
6 years ago
dongdaxiang
68a2d1f3d7
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
add timer_test
test=develop
6 years ago
dongdaxiang
2e5ebc4594
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
dongdaxiang
5dfd9c9aa9
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
dongdaxiang
d0a5159946
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
JiabinYang
ea6e057e40
test=develop, fix bug
6 years ago
dongdaxiang
f9b8168508
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
6 years ago
dongdaxiang
3b3cb4ea55
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
JiabinYang
e9c86ac41d
test=develop, install requirements.txt with user previlige
6 years ago
JiabinYang
f8fc6ba595
test=develop, fix ci by install requirement and add pip install validation
6 years ago
minqiyang
52b4821a6e
Fix Sprintf problem
...
test=develop
6 years ago
qingqing01
51a9fca323
Async memory copy ( #15013 )
6 years ago
minqiyang
010f657b33
Polish code
...
test=develop
6 years ago
JiabinYang
1a8cbb6799
test=develop, accelerate_hs_op and add prefetch with is_sparse
6 years ago
sneaxiy
e02f67eff7
rewrite unsafe_cast
...
test=develop
6 years ago
minqiyang
45acfbd011
1. Add specific condition for one or no arg in PADDLE_ENFORCE
...
2. Add unit test for new enforce feature
test=develop
6 years ago
minqiyang
68b86d6665
Change default value to align with the original react
...
test=develop
6 years ago
whs
938705745e
Init paddle slim ( #14834 )
...
* Init slim.
* Remove distillation demo.
* Fix import errors.
test=develop
* Fix some issues.
test=develop
* Fix configs.
test=develop
* Modify API.spec.
test=develop
* Fix format.
test=develop
* Fix format.
test=develop
* Add some comments.
6 years ago
dongdaxiang
2dee8f6cd5
add TrainFilesWithTimer in async_executor
6 years ago
xiaoli.liu@intel.com
869d444b92
Fix comments misunderstanding
...
test=develop
6 years ago
xiaoli.liu@intel.com
d83d0f33fd
extract templated function
...
test=develop
6 years ago
dongdaxiang
d434fcbaa6
add TrainFilesWithTimer in async_executor
6 years ago
Yihua Xu
d4606bcb22
Fix the exception when tensor format is x
...
test=develop
6 years ago
minqiyang
250e893745
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
...
test=develop
6 years ago
minqiyang
8b6b0da062
Use adam_update
...
test=develop
6 years ago
minqiyang
f4e7a47381
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
...
test=develop
6 years ago
minqiyang
b1d0a14c14
Change the ut back
...
test=develop
6 years ago
minqiyang
7d1533216d
Fix syntax error in unit test
...
test=develop
6 years ago
tensor-tang
641313ea77
Merge pull request #15000 from tensor-tang/doc/en/jit
...
add eng doc of jit kernel and follow comments
6 years ago
minqiyang
e811e06555
Avoid comma in macro
...
test=develop
6 years ago
minqiyang
0cf1461ccc
Avoid comma in macro
...
test=develop
6 years ago
wopeizl
b117a5f208
Merge pull request #14931 from wopeizl/windows/mkl
...
add mkl support for windows
6 years ago
Xin Pan
103f08f50e
Merge pull request #14910 from panyx0718/clean3
...
further updates
6 years ago
dongdaxiang
cf6188a823
add a linux timer
6 years ago
Zeng Jinle
0021b05b19
Merge pull request #14993 from sneaxiy/fix_check_lod
...
Fix CheckLoD bug
6 years ago
tensor-tang
68ab16444a
add eng doc of jit kernel and follow comments
...
test=develop
6 years ago
chengduo
79bd6dfa18
[Feature] Add Temporary Allocator ( #14875 )
...
* Add Temporal Allocator
* add Temporay Allocator to DeviceContext
test=develop
* code refine
test=develop
* fix mean_iou
test=develop
* Add DeviceTemporaryAllocator
test=develop
* fix conv_op bug
test=develop
* small fix
test=develop
* code refine
test=develop
* log refine
test=develop
* fix unit test
test=develop
* move double check
* refine concat_and_split
test=develop
* add limit_of_temporary_allocation
test=develop
* fix name
test=develop
6 years ago
minqiyang
e4719eb462
Fix bug in Windows VC 2010
...
test=develop
6 years ago
sneaxiy
a30c5373eb
use std::is_sorted
...
fix comment
test=develop
6 years ago
minqiyang
5a5c577529
Polish code
...
test=develop
6 years ago
minqiyang
099186cd41
Support one argument PADDLE_ENFORCE
...
test=develop
6 years ago
peizhilin
2e35290ffa
fix build issue
...
test=develop
6 years ago
Xin Pan
484c24b756
Merge pull request #14996 from tensor-tang/fix/deps
...
fix deps of jit benchmark
6 years ago
tensor-tang
e99e4e994d
fix deps of jit benchmark
...
test=develop
6 years ago
Tao Luo
22c71398e3
add MM_DNN inference test
...
test=develop
6 years ago
minqiyang
8149a07a41
Fix wait stream two times bug
...
test=develop
6 years ago
Tao Luo
d4f384767a
Merge pull request #14987 from jczaja/prv-transpose-mkldnn-grad
...
[MKL-DNN] Added transpose/transpose2 MKLDNN grad ops
6 years ago
minqiyang
4af97c6946
Polish code
6 years ago
peizhilin
201283f95c
fix code style
...
test=develop
6 years ago
minqiyang
41b81293ab
Polish code
...
test=develop
6 years ago
peizhilin
9e60c58666
Merge remote-tracking branch 'upstream/develop' into windows/mkl
...
test=develop
6 years ago
minqiyang
bc66401566
Polish code
...
test=develop
6 years ago
tensor-tang
693e5e65ce
Merge pull request #14958 from tensor-tang/refine/jit
...
enhance jit
6 years ago
minqiyang
74ead6ff35
Polish code
6 years ago
sneaxiy
b8051e7927
merge develop
...
test=develop
6 years ago
Tao Luo
df1e4e2f10
fix check_lod
...
test=develop
6 years ago
minqiyang
0a4b6fc056
Remove unnessesary code
...
test=develop
6 years ago
minqiyang
53619a79b4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
minqiyang
6fabbd8fb8
Polish code and remove spin lock
...
test=develop
6 years ago
minqiyang
3de0f612e8
Polish code
...
test=develop
6 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Featue/py_func op
6 years ago
mozga-intel
969f669493
Static specifier was removed
...
test=develop
6 years ago
mozga-intel
7048caf9a0
Enable top_k operator for a ngraph
...
test=develop
6 years ago
Tao Luo
d49990e447
Merge pull request #14982 from luotao1/diff_test
...
add deterministic result unit-test
6 years ago
mozga-intel
394635b2ee
Static specifier was removed
...
test=develop
6 years ago
mozga-intel
ecfa68ecaa
Enable fill_constant operator for a ngraph
...
test=develop
6 years ago
sneaxiy
600f6d8272
polish code
...
test=develop
6 years ago
minqiyang
9e24fa3aeb
Polish code
...
test=develop
6 years ago
minqiyang
1c116462cc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
6 years ago
minqiyang
29697c2e25
Add stop_gradient to VarBase to support loss function
...
test=develop
6 years ago
Jacek Czaja
d05094eec3
- Added transpose/transpose2 MKLDNN grad ops
...
test=develop
6 years ago
Tao Luo
55af116861
Merge pull request #14966 from jczaja/prv-transpose-mkldnn-reuse-PR
...
[MKL-DNN] Extending Transpose Op to reuse MKL-dNN primitives
6 years ago
sneaxiy
7f6e513b1f
fix mac ci bug
...
make forward declaration
test=develop
6 years ago
sneaxiy
c1f7e54f62
merge develop
...
test=develop
6 years ago
Xin Pan
2537ac512e
Merge pull request #14983 from typhoonzero/revert_14850
...
Revert "[Feature] Fp16 training for resnet50 (#14850 )"
6 years ago
chengduo
a015a8a39d
Refine merge_selected_rows Doc ( #14748 )
...
* add doc for MergeSelectedRows
test=develop
* checkout selected_rows
test=develop
6 years ago
typhoonzero
da87f7a698
Revert "[Feature] Fp16 training for resnet50 ( #14850 )"
...
This reverts commit 3d750f9c5a.
6 years ago
peizhilin
b06ce129bc
some not so useful adjust
...
test=develop
6 years ago
sneaxiy
89b9d86d9d
fix windows compile bug
...
test=develop
6 years ago
tensor-tang
1aaec571c2
fix enum style
...
test=develop
6 years ago
Qiao Longfei
d76bda50c4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
...
test=develop
6 years ago
sneaxiy
490eb9061f
polish infer shape of py_func op
...
test=develop
6 years ago
tensor-tang
facfecbdd0
follow comment: reuse time function and change to upper case
...
test=develop
6 years ago
tensor-tang
f5532877f7
Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
Yihua Xu
3babc80160
Fix the regression issue and add the group unitest for conv2d ( #14932 )
...
* Add test items for mkldnn conv2d
* Fix the regression issue and pass the unit test for conv2d and conv3d
test=develop
6 years ago
Xin Pan
969ad966ba
all converted
...
test=develop
6 years ago
Xin Pan
a872eb90c2
Merge pull request #14959 from panyx0718/clean2
...
Further op RunImpl refactor
6 years ago
minqiyang
6467eda534
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
...
test=develop
6 years ago
sneaxiy
13429c3e9f
clean code, remove void registration
...
test why MAC CI fail again
test=develop
6 years ago