Liu Yiqun
|
7ffbcbcaf0
|
Add flush of program desc to update the proto information.
|
7 years ago |
typhoonzero
|
ba8b0a5dc2
|
fix mac build
|
7 years ago |
whs
|
aa4d51b095
|
Merge pull request #10029 from wanghaoshuang/fix_gru_unit
Remove intermediate output's gradient from inputs of grad_op.
|
7 years ago |
dzhwinter
|
2e331c6593
|
accelerate dropout (#9902)
* accelerate dropout
* accelerate dropout
* "fix the dropout test"
* "rerun ci"
* "fix ci"
* "rerun ci"
* "fix ci"
* "fix"
* "stage"
* disable
|
7 years ago |
Yancey
|
0b8630b9b4
|
Merge pull request #9897 from Yancey1989/auto_grwon_sparse_table
Auto-grown sparse table
|
7 years ago |
Tao Luo
|
82b192a3fd
|
Merge pull request #10041 from abhinavarora/cpplint_framework
Fix CPPLint errors in some framework files
|
7 years ago |
chengduo
|
88f8183576
|
Merge branch 'develop' into feature/refine_gather_reduce
|
7 years ago |
dzhwinter
|
1fc9fe773d
|
merge develop
|
7 years ago |
chengduoZH
|
035712822c
|
fix VisitVariable
|
7 years ago |
wanghaoshuang
|
216443de15
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_gru_unit
|
7 years ago |
Abhinav Arora
|
cbbf08aee9
|
Fix CPPLint errors in some framework files
|
7 years ago |
Yang Yang(Tony)
|
e04c43d543
|
add semicolon to op registry (#10034)
* script to add semicolon
* fix typo
|
7 years ago |
Abhinav Arora
|
c5c7dc2e82
|
Fix CPPLint errors in multiclass_nms, nccl, nce, reduce and save_load_combine (#10032)
* Fix CPPLint errors in multiclass_nms, nccl, nce, reduce and save_load_combine
* Fix
|
7 years ago |
Yiqun Liu
|
598035f985
|
Fix a bug in save_inference_model and prune when the program is initailized by load_inference_model (#10011)
* Fix bug in save_inference_model and prune when the program is initialized by load_inference_program.
* Save the transpiled program instead.
|
7 years ago |
Abhinav Arora
|
9ca578d49e
|
Fix CPPLint issues in expand_op, gather_op and get_places_op (#10000)
|
7 years ago |
Xin Pan
|
98c12b1a08
|
Clean up C++ codes. (#10022)
* Privatize OpHandleBase
* Clean up a few private members
|
7 years ago |
dzhwinter
|
9b8ca0cfa8
|
"fix after merge"
|
7 years ago |
dzhwinter
|
7a517dc93e
|
merge develop
|
7 years ago |
wanghaoshuang
|
00548a1601
|
Remove intermediate output's gradient from inputs of grad_op.
|
7 years ago |
Tao Luo
|
d84cdb7b59
|
Merge pull request #9911 from tonyyang-svail/unify_op_registry
Unify REGISTER_OP and REGISTER_OPERATOR
|
7 years ago |
Wu Yi
|
d655417f90
|
Merge pull request #9956 from typhoonzero/split_byref_op
Split byref op
|
7 years ago |
Yancey1989
|
f12b3f3617
|
use memcpy
|
7 years ago |
dzhwinter
|
61cb4f2fdb
|
"fix ci"
|
7 years ago |
Yu Yang
|
60e0d1aabd
|
Merge pull request #10023 from reyoung/feature/DtorOfPE
Correctly implement destructor of ParallelExecutor
|
7 years ago |
Yu Yang
|
28013ef9e2
|
Merge pull request #10019 from reyoung/feature/feed_fetch_tensor_on_cpu
Fix FetchTensor on CPU
|
7 years ago |
Yancey1989
|
13e7194ebd
|
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into auto_grwon_sparse_table
|
7 years ago |
Yancey1989
|
70bf732f82
|
refine get interface
|
7 years ago |
typhoonzero
|
ff0d9341ea
|
remove not used code
|
7 years ago |
chengduoZH
|
fbb75c6bb6
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_gather_reduce
|
7 years ago |
Yu Yang
|
5305c5f845
|
Correctly implement destructor of ParallelExecutor
|
7 years ago |
Yu Yang
|
38d75c912c
|
Follow comments
|
7 years ago |
chengduoZH
|
4760ac44f1
|
check the generate_op is null or not and add DEPS of broadcast_op_handle and gather_op_handle
|
7 years ago |
mozga-intel
|
171471eada
|
Merge branch 'develop' into mozga-intel/Mul_mkldnn
|
7 years ago |
Yu Yang
|
6de5fd9e8a
|
Fix FetchTensor on CPU
|
7 years ago |
Yu Yang
|
35ebe2eca7
|
Clean MultiDevicesGraphBuilder
|
7 years ago |
typhoonzero
|
184835856c
|
fix copy size
|
7 years ago |
Yu Yang
|
d24ef931b5
|
Clean Code
|
7 years ago |
Yancey1989
|
b920b51686
|
rename parameter
|
7 years ago |
Yu Yang
|
c3c7b7bd1b
|
Merge pull request #9928 from reyoung/feature/stablize_code
Use mutex to stablize ncclCtxMap
|
7 years ago |
typhoonzero
|
788636f078
|
update by comments
|
7 years ago |
Yancey1989
|
2917a75b7a
|
add some comments for sparse table
|
7 years ago |
fengjiayi
|
fbe562478d
|
Merge pull request #9994 from reyoung/feature/debug
Fix bugs in local_scopes
|
7 years ago |
qingqing01
|
ee9832a7a9
|
Add Top-k Python API. (#9973)
* Add topk Python API.
* Add unit test.
* Remove the repeated API.
|
7 years ago |
chengduo
|
e5b3eb9899
|
Merge pull request #10001 from abhinavarora/disable_test_channel
Disabling Channel Test to fix CI
|
7 years ago |
chengduoZH
|
e63013a86f
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
|
7 years ago |
dzhwinter
|
425a1e76a2
|
Merge remote-tracking branch 'origin/develop' into memory/activation_memory
|
7 years ago |
Tao Luo
|
2617d8bcda
|
Merge pull request #9993 from jczaja/prv-softmax-mkldnn-fix
- Added Epsilon (preventing softmax output from being too small) for softmax MKLDNN op
|
7 years ago |
typhoonzero
|
e2d56832e1
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into split_byref_op
|
7 years ago |
Abhinav Arora
|
74ff0be234
|
Merge remote-tracking branch 'origin/develop' into disable_test_channel
|
7 years ago |
chengduoZH
|
1eeb2e0042
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
|
7 years ago |
Abhinav Arora
|
ed681d5235
|
Fix conv_mkldnn_op.cc which is causing CI failure
|
7 years ago |
Abhinav Arora
|
1b5c1bcb57
|
Disabling Channel Test to fix CI
|
7 years ago |
Yang Yang
|
68d96385e4
|
remove REGISTER_OP and REGISTER_OP_EX
|
7 years ago |
Abhinav Arora
|
6f83142353
|
Fix cpplint issues with beam_search_op and beam_search_decode_op (#9962)
* Fix cpplint warnings in beam_search_decode_op
* Fix cpplint warnings in beam_search_op
* Fix test
* fix
|
7 years ago |
Yang Yang
|
4b1a32db34
|
fix pybind.h generator
|
7 years ago |
Abhinav Arora
|
2d1a6f8d4a
|
Fix cpplint issues in Detection_map_op (#9969)
* Fix conv_op.h
* Fix conv_mkldnn_op
* Fix cpplint issues in detection_map_op
|
7 years ago |
Abhinav Arora
|
d08791d11a
|
Fix CPPLint issues with Chunk_eval_op (#9964)
|
7 years ago |
Wu Yi
|
8352f93809
|
Merge pull request #9933 from typhoonzero/add_dist_unit_test
Add dist unitest data compare, ensure that dist train have same behavior as local training
|
7 years ago |
Jacek Czaja
|
acdf7cbd19
|
- Added EPS for softmax MKLDNN op
- EPS added to softmax mkldnn primitive outcome is limited to training
phase
Fixes after review
clang format fixes
clang format fixes
|
7 years ago |
dzhwinter
|
626227eb72
|
"fix ci"
|
7 years ago |
Yu Yang
|
9b60d0decb
|
Merge pull request #9975 from reyoung/feature/VarHandleCtor
Using constructor for VarHandle
|
7 years ago |
Yu Yang
|
06fb055a2f
|
New group
|
7 years ago |
typhoonzero
|
c6d7c2bdff
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into split_byref_op
|
7 years ago |
mozga-intel
|
6e7b883bdd
|
Initial implementation of multiplication operator for MKLDNN
|
7 years ago |
Yancey1989
|
ca327508cc
|
update
|
7 years ago |
chengduoZH
|
e4de957f19
|
code refine
|
7 years ago |
Tao Luo
|
038dbb386e
|
Merge pull request #9958 from luotao1/find_tensorrt
auto find tensorrt library and install in user root
|
7 years ago |
Yu Yang
|
71a2e6b73c
|
Reverse create var
|
7 years ago |
Yu Yang
|
2e8459be71
|
DebugCode
|
7 years ago |
Yu Yang
|
fc6f0be276
|
Merge pull request #9942 from reyoung/feature/tuning_pe_trans
Feature/tuning pe trans
|
7 years ago |
Yu Yang
|
d7527681c6
|
Fix scale grad
|
7 years ago |
Yu Yang
|
89728f8e66
|
update
|
7 years ago |
Yu Yang
|
eb2e4eeade
|
Debug
|
7 years ago |
typhoonzero
|
948628563f
|
update
|
7 years ago |
Yu Yang
|
64c139e854
|
Using constructor for VarHandle
|
7 years ago |
chengduoZH
|
4abef50123
|
code refine
|
7 years ago |
Yu Yang
|
b4aaa00a8a
|
Polish logic of ParallelExecutor
|
7 years ago |
Yu Yang
|
2ab12ca248
|
Add comments and clean code
|
7 years ago |
Yu Yang
|
a822f8ddbb
|
Add wait
|
7 years ago |
Yu Yang
|
e9e27e0f32
|
Revert
|
7 years ago |
Yu Yang
|
7289e75836
|
Udpate
|
7 years ago |
Yu Yang
|
e6bc7bb09a
|
Tuning code
|
7 years ago |
Yu Yang
|
0ca28b8507
|
Sync Copy
|
7 years ago |
Yancey1989
|
ed6241cd44
|
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into auto_grwon_sparse_table
|
7 years ago |
Yu Yang
|
4999f85f6a
|
Clean RunDelayedOp method
|
7 years ago |
Yu Yang
|
c0f4e0332c
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/simplify_delay_logic
|
7 years ago |
Kexin Zhao
|
64bf3df0f9
|
add print support to float16 (#9960)
|
7 years ago |
Yang Yang
|
411e888c56
|
fix duplication
|
7 years ago |
Yang Yang
|
ce7c2e86e5
|
script to fix all
|
7 years ago |
Yang Yang
|
dafe06af46
|
first commit
|
7 years ago |
chengduoZH
|
3301d4402e
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
|
7 years ago |
Luo Tao
|
d4682247e1
|
auto find tensorrt library
|
7 years ago |
chengduoZH
|
3c5bbf42c4
|
make unit test to work
|
7 years ago |
Yan Chunwei
|
186659798f
|
add tensorrt build support(#9891)
|
7 years ago |
typhoonzero
|
04c559e3aa
|
wip split byref op
|
7 years ago |
chengduoZH
|
2aaa75ecf8
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_gather_reduce
|
7 years ago |
typhoonzero
|
f86d35a269
|
add sharable tensor
|
7 years ago |
chengduoZH
|
e39adc8600
|
add reduce op handle
|
7 years ago |
guru4elephant
|
a097d082c7
|
Merge pull request #9938 from guru4elephant/fix_testcase
Fix testcase
|
7 years ago |
dzhwinter
|
b92b408e50
|
"fix ci"
|
7 years ago |
typhoonzero
|
745aacfc38
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dist_unit_test
|
7 years ago |
dzhwinter
|
877e2aed03
|
"fix ci"
|
7 years ago |
Yu Yang
|
5b84c9b59c
|
CreateOpHandleIOs
|
7 years ago |
typhoonzero
|
8c1b257596
|
add dist unitest data compare
|
7 years ago |
dongdaxiang
|
fb873713e7
|
update program_desc_test testcase
|
7 years ago |
Yu Yang
|
d0342f12aa
|
Simplify DelayOps Logic
|
7 years ago |
dzhwinter
|
e54f203c55
|
"move to a new PR"
|
7 years ago |
chengduoZH
|
690cd1f748
|
refine gather and broadcast
|
7 years ago |
Yu Yang
|
093d227a77
|
Use mutex to stablize ncclCtxMap
|
7 years ago |
whs
|
494c262a26
|
Fix average_accumulate_op for parallel executor. (#9852)
|
7 years ago |
Yi Wang
|
630943c7a7
|
Update documentation (#9918)
|
7 years ago |
Yi Wang
|
b48cf1712b
|
Fix cpplint errors in transform_test.cu (#9915)
* Fix cpplint errors with transformer_test.cu
* Update
|
7 years ago |
Tao Luo
|
b66893829b
|
Merge pull request #9914 from abhinavarora/split_ids_op_warning
Fix compiler warnings in split_op and chunks_test
|
7 years ago |
Abhinav Arora
|
7b86da7195
|
Fix CPPLint errors in operators (#9826)
* Fix CPPLint errors in operators
* Fix cast in softmax
* Fix softmax_mkldnn
* Fix send_recv_op_test
* Send_recv
* Fix softmax mkldnn
|
7 years ago |
Kexin Zhao
|
92913027fc
|
fix unused var error (#9908)
|
7 years ago |
Yi Wang
|
47609ab2b8
|
Document transform.h and fix cpplint errors (#9913)
|
7 years ago |
chengduo
|
b43d87c917
|
Merge pull request #9825 from chengduoZH/feature/add_gather_and_BCast_op_handle
feature/Add Broadcast and Gather op handle
|
7 years ago |
Yu Yang
|
e4cfe477db
|
Merge pull request #9898 from reyoung/feature/mix_cpu_gpu_op
Feature/mix cpu gpu op
|
7 years ago |
fengjiayi
|
c1bf06f9d9
|
Merge pull request #9289 from dzhwinter/speed/sequence_expand
Speed/sequence expand
|
7 years ago |
chengduoZH
|
384d6ee8ac
|
follow comments
|
7 years ago |
Yu Yang
|
ed2d7d7d26
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/mix_cpu_gpu_op
|
7 years ago |
Yu Yang
|
925c17aba7
|
Merge pull request #9895 from reyoung/feature/fix_transformer_hang
Fix Transformer Hang Problem
|
7 years ago |
Yu Yang
|
4452ff76b7
|
Fix CPU compile
|
7 years ago |
fengjiayi
|
51c219c9cd
|
Merge pull request #9892 from JiayiFeng/refine_reshape_op
Add Wait() for reshape_op
|
7 years ago |
chengduoZH
|
02842cfc25
|
enhance broadcast_op_handle and gather_op_handle
|
7 years ago |
Yancey1989
|
19152541b2
|
fix ci
|
7 years ago |
Yu Yang
|
79be06045c
|
Support CPU/GPU mixture for ParallelExecutor
|
7 years ago |
Yancey1989
|
ac7cb949d0
|
auto-grown sparse table
|
7 years ago |
Yu Yang
|
7083c2a6b3
|
Merge branch 'feature/fix_transformer_hang' into feature/mix_cpu_gpu_op
|
7 years ago |
Yu Yang
|
cedade9494
|
Stash
|
7 years ago |
Yancey
|
41a9146de3
|
Merge pull request #9787 from Yancey1989/random_selected_rows_value
Initialize large table value randomly
|
7 years ago |
fengjiayi
|
121166750e
|
Merge pull request #9894 from JiayiFeng/refine_double_buffer_code
Refine double_buffer code
|
7 years ago |
Wu Yi
|
1bdc7261a5
|
Merge pull request #9578 from typhoonzero/threadpool_for_io
Multi stream thread pool
|
7 years ago |
Tao Luo
|
2c552d4e61
|
Merge pull request #9630 from Xreki/core_inference_prepare
Split Executor.Run to Executor.Prepare and Executor.RunPreparedContext for inference
|
7 years ago |
Yu Yang
|
6b20b35589
|
Fix Transformer Hang Problem
|
7 years ago |
fengjiayi
|
3fa0ef3d71
|
Refine double_buffer code
|
7 years ago |
typhoonzero
|
a08bf76f74
|
refine name
|
7 years ago |
fengjiayi
|
c20cc2bd8a
|
Add Wait() for reshape_op
|
7 years ago |
Tao Luo
|
5a4d93286d
|
Merge pull request #9888 from abhinavarora/fix_warnings_
Fix warnings in sgd_op.h
|
7 years ago |
Yu Yang
|
06ddaa73f2
|
Merge pull request #9840 from reyoung/feature/polish_reshape_op
Polish reshape op
|
7 years ago |
chengduoZH
|
b0267ac93a
|
refine broadcast op
|
7 years ago |
chengduoZH
|
8b597d9ddc
|
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_gather_and_BCast_op_handle
|
7 years ago |
Tao Luo
|
f7a54b514d
|
Merge pull request #9881 from mozga-intel/mozga-intel/fc_without_mkldnn
The fully connected: the operator is removed when the MKLDNN flag is OFF
|
7 years ago |
Abhinav Arora
|
855992dab0
|
Fix warnings in chunk_test
|
7 years ago |
Abhinav Arora
|
c241959e48
|
Fix CPPLint errors in operators (#9828)
* Fix CPPLint errors in operators
* Fix prior box op
* Fix Prior Box op
* Fix top_k_op.cu
* Fix pool mkmldnn
* Fix pool mkmldnn
|
7 years ago |
Abhinav Arora
|
9b63b7dde0
|
Fix warnings in split_ids_op
|
7 years ago |
Abhinav Arora
|
3794027d7f
|
Fix warnings in sgd_op.h
|
7 years ago |
Kexin Zhao
|
617e790a59
|
fix cuda 7.5 compile error (#9885)
|
7 years ago |
Tao Luo
|
859fedf3e7
|
Merge pull request #9871 from qingqing01/fix_bn
Refine batch_norm_op.
|
7 years ago |
mozga-intel
|
d24b5e060f
|
The fully connected: the operator is removed when the MKLDNN flag is OFF
|
7 years ago |