Leo Chen
b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop ( #24733 )
5 years ago
Leo Chen
a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop ( #24734 )
5 years ago
wangchaochaohu
355caee18b
fix conv_transpose Op fp16 error test=develop ( #24695 )
5 years ago
Chen Weihang
19e5f7879c
Append error op hint for GradOpMaker ( #24750 )
...
* append error op hint for grad op maker, test=develop
* add unittests for coverage, test=develop
5 years ago
Pei Yang
181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop ( #24746 )
5 years ago
wangchaochaohu
79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop ( #24744 )
5 years ago
Adam
56a714a19b
Add isCached() machinism to oneDNN pooling primitive ( #24724 )
5 years ago
Aurelius84
6db0e2b1a9
[Dy2stat] Support len syntax ( #24638 )
...
* [dy2stat] Support len
* add convert_call func
* refine code test=develop
5 years ago
ShenLiang
950892044f
fix conflict, test=develop ( #24238 )
5 years ago
lidanqing
c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops ( #24333 )
...
* Update PADDLE_ENFORCE in DNNL related ops
test=develop
* Abstract macro of OP_GET_PLACE_CHECK
test=develop
* update according to reviews
* update GET_PLACE_CPU_CHECK
* fix typo
test=develop
* revert macro
test=develop
5 years ago
Wilber
dbe2497768
fix mkldnn concat bug. test=develop ( #24722 )
5 years ago
Leo Chen
de8b4f42cf
rename inplace/no_need_buffer inferer, part 1, test=develop ( #24711 )
5 years ago
hutuxian
e6b87b3193
Support AucRunner in PaddleBox ( #22884 )
...
* Support AucRunner in PaddleBox
* update some code style
5 years ago
Chen Weihang
7fa9f16c17
Polish reader folder error message ( #24698 )
...
* polish reader error message, test=develop
* fix detail error, test=develop
* reset activation dcudnn change, test=develop
5 years ago
wangchaochaohu
dbfe5333c5
Add pe profiler Event ( #24611 )
5 years ago
Liufang Sang
55b664a131
remove pow to speed up in dequantize_log op ( #24607 )
...
* remove pow in speed up in dequantize_log test=develop
* remove pow in speed up in dequantize_log test=develop
* fix unittest test=develop
5 years ago
Jacek Czaja
ca68b13f85
[oneDNN] Fix to elementwise_add grad ( #24639 )
5 years ago
Adam
586b587519
Add isCached() check in Softmax handler ( #24637 )
...
* Update isCached() to be thread freindly
test=develop
* Add isCached() check inside Softmax handler
test=develop
* Fix PaddleEnforce() message
test=develop
5 years ago
Leo Chen
9c9e635c00
support tensor to varbase, test=develop ( #24660 )
5 years ago
zlsh80826
fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop ( #24544 )
5 years ago
Leo Chen
14dd6388c5
fix bug of varbase.__getitem__, test=develop ( #24642 )
...
* fix bug of varbase.__getitem__, test=develop
* fix bug of float and other type, test=develop
5 years ago
jinyuKING
6ca44cbaa3
API/OP(sequence_first_step, sequence_last_step, sequence_mask, beam_search, beam_search_decode) error message enhancement ( #24590 )
5 years ago
Wilber
ba2f8f0ce4
fix embedding_eltwise_layernorm_fuse_pass. test=develop ( #24592 )
5 years ago
Huihuang Zheng
f831f61216
Make buddy_allocator_test Exclusive ( #24636 )
...
The background is that buddy_allocator_test depends on some big GPU allocations and current machine may not have it. It can cause random failure on cloud machine. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
5 years ago
hutuxian
0ec3a42e97
Random Dump ( #24477 )
...
* Refactor code for dump_field & dump_param: abstracting the common function in base class.
* Support dump randomly & random with lineid
* Support specify the random interval, which avoids printing too much logs.
5 years ago
Leo Chen
1d03469685
use vector instead of pointer, test=develop ( #24620 )
5 years ago
Yiqun Liu
6b464f969a
Add an operator node in unittest to make the fusing result unique. ( #24617 )
5 years ago
ShenLiang
837dd47a96
Add lod in gather/scatter ( #24613 )
...
* add lod msg in gather and scatter_op, test=develop
5 years ago
Jacek Czaja
3292f0ef58
[onednn] elementwise add broadcasting support ( #24594 )
5 years ago
Yiqun Liu
560c815390
Add some check for CUDA Driver API and NVRTC ( #22719 )
...
* Add the check for whether CUDA Driver and NVRTC is available for the runtime system.
* Call cuInit to initialize the CUDA Driver API before all CUDA callings.
test=develop
* Change the behavior when libnvrtc.so can not be found, printing a warning instead of exiting.
test=develop
* Do not initialize CUDA Driver API for windows and macos.
test=develop
* Remove the call of cuInit when entering paddle and enable the test_code_generator.
test=develop
* Add some built-in functions for __half.
test=develop
* Change save_intermediate_out to false in unittest.
test=develop
* Fix error reference to tempropary variable when seting including path for device_code.
test=develop
5 years ago
Guo Sheng
fe8d006f7e
API/OP(sequence_expand_as) error message enhancement ( #23712 )
...
* API/OP(sequence_expand_as) error message enhancement.
test=develop
Co-authored-by: FrostML <380185688@qq.com>
5 years ago
Leo Chen
d980d251f0
specify outs, test=develop ( #24537 )
5 years ago
yaoxuefeng
16817c70fc
OP(datanorm lookupsparsetable lookuptable) error message enhancement ( #24506 )
...
* OP(datanorm lookupsparsetable lookuptable) error message enhancement
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
5 years ago
liu zhengxi
fff9faae72
API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement ( #24513 )
...
* dynamic_gru err_msg enhancement, test=develop
* chunk_eval err_msg enhancement and fix crf_decoding output type, test=develop
* BeamSearchDecoder err msg enhancement, test=develop
* fix doc for chunk_eval, test=develop
* refine lod err msg for chunk_eval, test=develop
5 years ago
wangchaochaohu
c4dd596de6
cusum op optimization for GPU kernel ( #24321 )
5 years ago
Leo Chen
d43e4047ff
refine err msg, test=develop ( #24550 )
5 years ago
liym27
615a8bfcf4
Support int32 int64 and fix bug ( #24407 )
...
* Compatible int32 and int64 for attr in op slice/strided_slice. test=develop
* Polish code in nn.py test=develop
* Fix bug: set the same dtype for the inputs of elementwise_add. test=develop
* Convert int32 to int64 in slice op to avoid data overflow. test=develop
* Convert int32 to int64 in strided_slice_op to avoid data overflow. test=develop
5 years ago
Zhaolong Xing
f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All ( #24517 )
...
test=develop
5 years ago
lilong12
a13a4dbc7b
Improving error reporting messages for ops ( #24438 )
...
* improve error reporting message
5 years ago
123malin
45ef6ff338
test=develop, error info improvement ( #24496 )
5 years ago
Adam
dcf17f4813
Add isCached() mechanism to elementwise_add DNNL ( #24563 )
...
* Add isCached() mechanism to elementwise_add
test=develop
* Hide code inside handler
test=develop
5 years ago
pawelpiotrowicz
db2b6b6568
Hide globals & redesign restore PR ( #24279 )
...
test=develop
5 years ago
Jiawei Wang
4a105f803e
SamplingID Op fix error print ( #24521 )
...
* fix error print for sampling_id_op
* fix spell err
* fix spell err test=develop
5 years ago
ShenLiang
53e3c53423
fix error message, test=develop ( #24425 )
5 years ago
ruri
ea2c498712
Fix err message ( #24507 )
...
* fix error message, test=develop
5 years ago
Wilber
4ec7287602
fix compile when with_nccl=off. test=develop ( #24444 )
5 years ago
chajchaj
e5a624935f
Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op ( #24485 )
...
* error message of cross_entropy_op, test=develop
* fix bug : can't use platform::errors::InvalidArgument in HOSTDEVICE, test=develop
* fix bug: recovery the check_variable_and_dtype for rank_loss and bpr_loss, test=develop
5 years ago
FDInSky
526a211774
update conv error info ( #24430 )
...
* test=develop update conv error info
* test=develop update iou_similarity error info
* test=develop update some error info based review
5 years ago
Yibing Liu
a9520db24e
Format error message for ops ( #24482 )
...
* Format error message for ops, test=develop
* Fix check in sequence_expand, test=develop
5 years ago
Yang Zhang
2644cb819d
Remove extraneous comma in error messages ( #24478 )
...
which messes up the formatting
test=develop
5 years ago
suytingwan
05c9642dc3
Update paddle enforce message ( #24498 )
...
* test=develop error message update
5 years ago
lijianshe02
9f83f0fe69
API/OP (group_norm, layer_norm, random_crop, unpool) error message enhancement ( #24413 )
...
* API/OP (group_norm, layer_norm, unpool) error message enhancement test=develop
5 years ago
hutuxian
7f85dfeb18
Upgrade Error Message for AucOP & MultiplexOP ( #24458 )
5 years ago
WangXi
027f9953bc
Optimize error message, include dgc, nccl, size op ( #24456 )
5 years ago
gongweibao
f1c57d648c
Enhance error message of prefetch_op, proximal_adagrad_op, proximal_gd_op ( #24436 )
5 years ago
danleifeng
d98e1182a1
fix check and error message for flatten hash is_empty op ( #24434 )
...
fix check info for flatten hash is_empty op; test=develop
5 years ago
liu zhengxi
417b576c34
API(dynamic_lstm, dynamic_lstmp) error message enhancement ( #24450 )
...
* update err msg for dynamic_lstm and dynamic_lstmp, test=develop
5 years ago
wangchaochaohu
53bdee64e6
add tensor support for gaussian_random_op test=develop ( #24389 )
5 years ago
qingqing01
da4a1db7bb
Refine error message in some OPs ( #24443 )
...
test=develop
5 years ago
Xing Wu
479c47f316
fix error info for transpose sequence_conv_pool max_sequence_len sequ… ( #24437 )
...
* fix error info for transpose sequence_conv_pool max_sequence_len sequence_erase, test=develop
* fix error info for transpose sequence_conv_pool max_sequence_len sequence_erase, test=develop
* update modify, test=develop
* update modify, test=develop
* fixed some modifications, test=develop
5 years ago
WuHaobo
1c00732dd0
Polish PADDLE_ENFORCE of unfold_op ( #24423 )
5 years ago
MRXLT
71ff32b65d
update error message for unstack op and lamb op; test=develop ( #24439 )
5 years ago
ForFishes
05c3bc3bf6
fix error message, test=develop ( #24447 )
...
fix scatter and scatter_nd op error message
5 years ago
ceci3
633aaea07e
Update err message ( #24422 )
...
* update err info, test=develop
* update, test=develop
5 years ago
Chen Weihang
05d20e5793
API/OP (Some SL API) error message enhancement ( #24441 )
...
* polish some sl api error message, test=develop
* polish python input check of stride slice, test=develop
* fix unittest bugs, test=develop
5 years ago
tangwei12
a97d5a6153
fix op error, test=develop ( #24451 )
5 years ago
Yang Zhang
7c17ed57e0
Refine `squared_l2_distance_grad` error message ( #24409 )
...
test=develop
5 years ago
huangjun12
046b7ebc03
refine Error message of modified_huber_loss ( #24417 )
...
* refine Error message of modified_huber_loss, test=develop
* refine Error message style, test=develop
5 years ago
zhang wenhui
621a4085b9
enhance cvm bpr_loss adam adagrad adamax ftrl error message, test=develop ( #24452 )
5 years ago
Li Fuchen
e58619295e
OP(rank_loss, similarity_focus, squeeze) error message enhancement ( #24448 )
...
* enhance rank_loss error message, test=develop
* enhance similarity_focus error message, test=develop
* enhance squeeze error message, test=develop
5 years ago
wangguanzhong
abb49df26f
Enhance yolo_box & yolov3_loss ( #24370 )
...
* add scale_x_y for yolo_box, test=develop
* refine eps in iou_similarity, test=develop
5 years ago
mapingshuo
d1bb76a2f7
fix error log, test=develop ( #24419 )
...
* fix error log: resahpe, range, reverse.
5 years ago
Jacek Czaja
8b88cd5167
[oneDNN] Fix to inplace pass ( #24442 )
...
* - Disabling inplace pass
test=develop
- Disable cycles
test=develop
- fix
test=develop
- Enhancement to in-place
- Lint fixes
test=develop
* - Lint fixes
test=develop
5 years ago
Bai Yifan
f1c4c14c7a
Fix fsp_op error message,test=develop ( #24405 )
...
* fix fsp_op error message,test=develop
5 years ago
Jacek Czaja
d0307145a3
[oneDNN] disabling oneDNN inplace pass ( #24406 )
5 years ago
Kaipeng Deng
6435d1b4b8
fix PADDLE_THROW in spectral_norm_op.h. test=develop ( #24414 )
5 years ago
wangguanzhong
cd327e66df
optimize error message, test=develop ( #24420 )
5 years ago
FDInSky
7fedf26b87
add linear interpolate operator ( #23357 )
...
* test=develop add linear interpolate operator
5 years ago
Liufang Sang
e24575c8ba
fix error message in accuracy_op.h test=develop ( #24416 )
5 years ago
wawltor
2de5075a6d
Fix the elementwise ops in broadcast in the process of backward ( #24319 )
...
* Remove the error in the elementwise op, use the backup mode to calculate
5 years ago
Wojciech Uss
fddf442470
add batch size to the mkldnn matmul cache key ( #24408 )
...
test=develop
5 years ago
Tao Luo
72c370c8d2
remove unused test_multi_thread_helper.h ( #24399 )
...
test=develop
5 years ago
hutuxian
123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel ( #24377 )
...
* Add InitializeGPUAndLoadModel to solve random hang when downloading sparse parameters.
* Update SaveBase to solve test problem.
5 years ago
Tao Luo
63da846de0
remove old inference C++ tests ( #24368 )
5 years ago
Chen Weihang
aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get ( #24175 )
...
* add new macro BOOST_GET_SAFELY & unittests, test=develop
* add different macro type, test=develop
* fix get macro type in executor, test=develop
* four macro part change backup
* using one macro for all case, test=develop
* revert attribute change, test=develop
* change to three func to solve gcc4.8 bug, test=develop
* polish some details, test=develop
5 years ago
Pei Yang
9c073bbd53
skip pyramid_hash on jetson platforms to avoid compile error. test=develop ( #24371 )
5 years ago
guofei
70bc488979
Fix the error of recurrnet op in multithreading in eval process ( #24357 )
...
CreateStepScopes in recurrent op also clears scopes, which can cause segmentation fault un multi-threading. We add a lock in this PR but it may slow the computation process. We will fix in another way in next PR.
5 years ago
hong
67f66f0904
Fix get item out of range error ( #24339 )
...
* raise index error when slice out of range; test=develop
* add uni test; test=develop
* fix format error; test=develop
* add comment for py::index_error; test=develop
* polish error message; test=develop
* polish error message; test=develop
5 years ago
Wojciech Uss
db052009c7
Enabled quantize all and skip missing in QAT ( #24281 )
...
* Enabled quantize all and skip missing in QAT
5 years ago
Aurelius84
1ed8baf9c8
[dy2static] Support for static graph training with @declarative decorator ( #24259 )
...
* support to train in static
* support to independent decorator
* remove in_dygraph_mode condition in ProgramTranslator
* fix import param_guard and add train/eval test=develop
* Modify into ShareVarsFromScope and rm __all__ in partial_program test=develop
5 years ago
Huihuang Zheng
8a1a2af82e
Add Assert Op ( #24280 )
...
1. To make ProgramTranslator to support `assert` grammar, this PR adds `assert` python API and C++ code.
2. Fix a bug: graph_pattern_detector.h #include <gtest/gtest_prod.h> but didn't declared dependency at CMakeLists, which can cause single build failure.
3. Refactoring `Formatter` in print_op to make it reusable and reuse the formatter to print in assert op.
5 years ago
Pei Yang
8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop ( #24329 )
5 years ago
Guo Sheng
4a5de14426
Remove cusolver potrfBatched support on Windows. ( #24338 )
...
test=develop
test=win_gpu
5 years ago
Tao Luo
c1df7048c7
add UT for mkldnn_cache_capacity ( #24336 )
...
* add UT for mkldnn_cache_capacity
test=develop
* fix comparison of integer expressions of different signedness
test=develop
5 years ago
Feiyu Chan
420707c26e
API/OP (margin_rank_loss, nce, row_conv, positive_negative_pair) erro… ( #24246 )
...
* API/OP (margin_rank_loss, nce, row_conv, glu, positive_negative_pair) error message enhancement, test=develop
* add unittest for glu, test=develop
5 years ago
joanna.wozna.intel
356f5ee220
[Refactoring] Unify op-dequant squashes ( #24277 )
5 years ago
liym27
ac9a7eeea4
[Dy2Stat]Support list pop ( #24250 )
...
* Replace dygraph_to_static_func with @declarative or program_translator.get_func in test_list.py
* Add comments in ConditionalBlock.
* Support list pop last item.
* Support pop the i-th item.
* Support an empty tensor array as Input in assign op and set the kernel type is float.
5 years ago
Tao Luo
9eedf05d2f
solve mklml memory leak on windows ( #24015 )
...
* solve mklml memory leak on windows
test=develop
* remove unused msvcr120.dll
test=develop
5 years ago
Zhang Ting
b71abeee1b
use 32 bit index to improve activation ops ( #24206 )
...
* improve activation ops performance, test=develop
* use 32bit only GPU computation, test=develop
5 years ago
xujiaqi01
1034ca316f
add timeout and http store in communication ( #23436 )
...
* add timeout and http store in communication, add revert and confirm in fleet
* test=develop
5 years ago
Guo Sheng
1fc6cc502a
Fix cusolver loader for Windows ( #24157 )
...
* Fix cusolver loader for Windows in dynamic_loader.cc. test=develop
* Fix missing CUSOLVER_ROUTINE_EACH_R1.
test=gpu
test=develop
* Add unsupprot for cusolver on Windows temporarily. test=develop
* Fix GetCusolverDsoHandle error message. test=develop
5 years ago
wawltor
d1e1d85881
add the graph batch reader for pslib mode ( #24178 )
...
Add the pslib graph batch reader mode, add the test case for this change
5 years ago
liym27
803559499d
[Dy2Stat]Support LoDTensorArray for slice op ( #23091 )
...
* Support LoDTensorArray for slice op.
* Support read elements of list in dygraph_to_static
* Fix infershape add test for infershape.
* Support Tensor for Attr(starts) and Attr(ends).
* Use new interfaces in VarTypeInference.
5 years ago
Feiyu Chan
78cd3dd507
fix kron_op: when only one input needs gradient, test=develop ( #24269 )
...
fix kron_op: when only one input needs gradient
5 years ago
Li Fuchen
5dc069d050
OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement ( #24261 )
...
* enhance add_position_encoding error message, test=develop
* enhance warpctc & scaled_dot_product_attention error message, test=develop
* modified error message and ctest of scaled_dot_product_attention, test=develop
5 years ago
liym27
19511dfa25
Fix bug in assign op: support to infer shape for LOD_TENSOR_ARRAY. ( #24268 )
5 years ago
joanna.wozna.intel
b43b46e619
[INT8] Add requant-op squash ( #24143 )
5 years ago
石晓伟
17ac6e2580
update the analysis predictor for multi-stream support, test=develop ( #24046 )
...
* update the analysis predictor, test=develop
* update the unit test, test=develop
* no priority set before the inferface determined, test=develop
* interface name generalization, test=develop
5 years ago
hutuxian
3e2bc8715f
Try to fix UT Random Fail ( #24223 )
5 years ago
lidanqing
61ec30f030
Update QAT INT8 2.0 doc ( #24127 )
...
* update local data preprocess doc
* update for 2.0 QAT
test=develop
test=document_fix
* update benchmark data
test=develop
test=document_fix
Co-authored-by: Wojciech Uss <wojciech.uss@intel.com>
5 years ago
Sylwester Fraczek
e1a7a88057
added reshape transpose matmul fuse pass ( #23754 )
5 years ago
Tao Luo
d7850dcdc0
add noavx_axpy and noavx_axpy_noadd ( #24207 )
...
* remove double registery for pyramid_hash op
* add noavx_axpy and noavx_axpy_noadd
test=develop
5 years ago
Huihuang Zheng
3047b69f94
Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure ( #24205 )
...
test=develop
5 years ago
Chen Weihang
9b851ba216
[dy2static] Add print transformer and unify print format ( #24068 )
...
* add print transformer & unify print format, test=develop
* remove using of dygraph_to_static_func, test=develop
* remove python stdout capture, test=develop
* fix compatibility problems for PY2, test=develop
* fix detail error, test=develop
* fix type analysis bug, test=develop
* fix print tuple compatible error in PY2, test=develop
* replace get_func to declarative, test=develop
* fix detail bug, test=develop
* fix some detail problems, test=develop
* change visit_call in print transformer, test=develop
5 years ago
Li Fuchen
077e5a0fe5
Add trace op ( #23873 )
...
* add trace op, test=develop
* Optimized the implementation of trace op, test=develop
* fix a bug of include in trace_op.h, test=develop
* move trace API from creation to math, test=develop
* modified en doc. test=develop
* add complex trace api
* add complex sum api, test=develop
* modified en doc of complex sum and trace, test=develop
* modified doc and trace API, test=develop
* modified en doc of trace and sum, test=develop
* modified comment in complex kron API, test=develop
* OP Should Not Have Unused Input, test=develop
* add GetExpectedKernelType, test=develop
5 years ago
wangchaochaohu
fa43d74a3a
fix the intermediate node of graph for fusion group test=develop ( #24184 )
5 years ago
Yiqun Liu
ecfddebbef
Add the implementation of inverse ( #23310 )
5 years ago
HappyAngel
8ea325aa53
c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop ( #23631 )
5 years ago
ShenLiang
0fb9b208ab
Add batch_fc op in contrib ( #24017 )
...
* add batch fc op, test=develop
* add batch_fc_op, test=develop
* fix untest, test=develop
* rm check_dygraph, test=develop
* fix comment, test=develop
* fix comment, test=develop
5 years ago
liuwei1031
9a93f6aae0
improve efficiency of runtime InferVarType ( #22778 )
...
* save InferVarType changes, test=develop
* remove code comments, test=develop
* tweak code, test=develop
* fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop
* modify fused_bn_activation_op, test=develop
* fix error of fused_bn_activation_op, test=develop
* fix PADDLE_ENFORCE and unittest coverage issue, test=develop
* tweak PADDLE_ENFORCE messages, test=develop
* improve unittest coverage, test=develop
* add StaticGraphInferVarType class, test=develop
* rebase develop branch, test=develop
* fix unittest error, test=develop
* remove comments, test=develop
* improve unittest coverage, test=develop
* imporve error message and imporve unittest coverage, test=develop
* upgrade InferVarType API, test=develop
* tweak pyfunc error message, test=develop
* fix compilation conflict - save_combine_op, test=develop
5 years ago
hong
bfb60efb1f
change compare forece_cpu default value; test=develop ( #23888 )
5 years ago
wangchaochaohu
2270864019
Fusion group optimize for cuda codegen( #23940 )
5 years ago
ShenLiang
94dfb7d770
opt the postprocess, test=develop ( #24155 )
5 years ago
Chengmo
28a558e867
Update index sample ( #24109 )
...
* update index sample
5 years ago
Zhang Ting
ab8f8fa70d
fix example code, test=develop, test=document_fix ( #24139 )
5 years ago
Feiyu Chan
e01262e691
add kron op ( #24105 )
...
* add kron op and its python API, doc and unittests.
* add kron in paddle.complex
5 years ago
Jacek Czaja
eb411613e9
[DNNL] refine activations Inplace support ( #24145 )
5 years ago
wangchaochaohu
6bf26ef156
fix warning mac compiler ( #24138 )
5 years ago
lijianshe02
a398464e77
API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … ( #24118 )
...
* API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, log_loss) error message enhancement test=develop
5 years ago
Guo Sheng
a8c0fb4e86
Add cholesky_op ( #23543 )
...
* Add cholesky_op forward part. test=develop
* Complete cholesky_op forward part. test=develop
* Add cholesky_op backward part. test=develop
* Complete cholesky_op backward part. test=develop
* Refine cholesky_op error check and docs. test=develop
* Add grad_check unit test for cholesky_op. test=develop
* Fix sample code in cholesky doc. test=develop
* Refine some error messages of cholesky_op. test=develop
* Refine some error messages of cholesky_op. test=develop
* Remove unused input in cholesky_grad. test=develop
* Remove unused input in cholesky_grad. test=develop
* Fix stream for cusolverDnSetStream. test=develop
* Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
test=develop
* Add CUSOLVER ERROR in enforce.h
test=develop
* Fix the missing return value in cholesky. test=develop
5 years ago
Jacek Czaja
461e6a01ec
[DNNL] activations Inplace support ( #24123 )
5 years ago
wangchaochaohu
6ba7c3ac92
Reduce the construction time of fuction about profiler ( #24117 )
5 years ago
Zhang Ting
fb0729ee7f
avoid warnings in MAC compile ( #24124 )
5 years ago
qingqing01
b059fb955d
Add trainable_statistics in attr for batch_norm. ( #24072 )
...
* Add trainable_statistics in attr for batch_norm
* Unifying behavior of dynamic graph and static graph
5 years ago
arlesniak
d31a174f51
added fusing matmul-transpose-reshape pass ( #23866 )
5 years ago
石晓伟
46f3139c7f
supports loading model from memory, test=develop ( #24098 )
5 years ago
xiaoting
96ffebef55
fix bicubic, change int to floor ( #24063 )
...
* change int to floor, test=develop
* fix unittest, test=develop
5 years ago
Zeng Jinle
a67eea9f00
polish code by adding final, test=develop, test=develop ( #24114 )
5 years ago
Zeng Jinle
ab2e284235
fix compilation failure ( #24091 )
5 years ago
石晓伟
34d7d6aef0
declare the stream::Priority as enum class, test=develop ( #24013 )
5 years ago
Pei Yang
695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop ( #23805 )
5 years ago
0YuanZhang0
a164b10d05
API/OP error message enhancement ( #23717 )
...
* test=develop
5 years ago
Zeng Jinle
acef55df04
fix isolated var fetch bug, test=develop ( #24070 )
5 years ago
Zeng Jinle
3ca700a987
code clean, test=develop ( #24060 )
5 years ago
Zhaolong Xing
35148d17f7
[BUG]: Head number can only be > 1 on multihead op ( #23974 )
...
* support the head number == 1
test=develop
* fix slice op error.
test=develop
5 years ago
Jacek Czaja
c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace ( #23477 )
5 years ago
hutuxian
9ff558a46f
Optimize DataFeed ( #23957 )
...
* Make batch_float_feasigns & batch_uint64_feasigns as member variable
5 years ago
Leo Chen
5cccc69f1a
update name generator, test=develop ( #24048 )
...
* update name generator, test=develop
* use c++ unique name generator, test=develop
5 years ago
石晓伟
db6d867383
add boost dependency to cuda_stream ( #24032 )
5 years ago
wawltor
5c669ad1c2
Add the support dygraph out attribute for the op of mm in api2.0 ( #23978 )
...
Fix the dygraph mode in matmul, add the support in Linear Op
5 years ago
石晓伟
d2584a7082
New feature: thread local allocator, test=develop ( #23989 )
...
* add the thread_local_allocator, test=develop
* refactor the thread_local_allocator, test=develop
* provides option setting strategy, test=develop
5 years ago
Kaipeng Deng
80cf3c3c4d
Refine DataLoader support multi-processing ( #23107 )
...
* add DataLoader, Dataset, BatchSampler
5 years ago
Zhou Wei
76d78c6387
fix conv_fusion_op conflict,test=develop ( #24020 )
5 years ago
wangguanzhong
931cba2e64
add clamp api, test=develop ( #23273 )
...
* add clamp api, test=develop
5 years ago
zhaoyuchen2018
a28a63a943
OP(fusion_gru) error message enhancement. test=develop ( #23591 )
...
* OP(fusion_gru) error message enhancement. test=develop
* refine code, test=develop
* Refine inout log, test=develop
* Refine description, test=develop
5 years ago
Zhou Wei
7817003795
Optimize the error messages of paddle CUDA API ( #23816 )
...
* Optimize the error messages of paddle CUDA API, test=develop
* fix the error messages of paddle CUDA API, test=develop
* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
* remove build_ex_string,test=develop
* merge conflict,test=develop
5 years ago
silingtong123
f6dbf8e3a4
add 'seed' arguemnt of randint API ( #23809 )
...
* test=develop, add seed arguemnt of randint API
5 years ago
Li Fuchen
87d8dc3dc0
add diag_embed op ( #23385 )
...
* add diag_embed op, test=develop
* add TestCase of diag_embed API
* Modified diag embed python API teastcase from dygraph to static graph, test=develop
* delete useless log and trigger ci, test=develop
* modified float16 of diag_embed, test=develop
* modified en doc of diag_embed
* trigger ci, test=develop
* add fp16 in dtype check of python API, test=develop
* modified __init__ and fix a big, test=develop
* modified a test bug of test_bicubic_interp_op and test_trilinear_interp_op, test=develop
* modified to use one kernel on cpu and cuda, test=develop
5 years ago
liu zhengxi
8e555ba650
OP(pad, pad2d, pad_constant_like) error message enhancement ( #23882 )
...
* enhance pad.* error message, test=develop
5 years ago
ShenLiang
30bd7e1c83
Add rank_attention_op attributes for GPU memory in contrib ( #23915 )
...
* optimize rank_attention, test=develop
* use the paddle memory pool, test=develop
* set max size, test=develop
* limit the max size, test=develop
* fix the head of cu, test=develop
* add AsDispensable, test=develop
5 years ago
Yiqun Liu
8d0b0cb4ae
Op(conv2d_fusion) error message enhancement. ( #23596 )
5 years ago
ShenLiang
7f0b2c7407
fix memory leaking problem of dataset, test=develop ( #23955 )
5 years ago
mapingshuo
7d4002e06a
restrict block num of layer_norm_grad cuda block to 128 ( #23878 )
...
restrict block num of layer_norm_grad cuda kernel to 128, test=develop
5 years ago
Zhaolong Xing
133f1fc123
[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support ( #23723 )
...
* refine ernie trt dynamic shape support
1. add slice op converter
2. add emb eltwise layernorm fp16 support
test=develop
* fix dynamic shape test ut
test=develop
* fix comments.
test=develop
* fix comments
test=develop
5 years ago
guofei
2b896c1f6b
Support LoDTensorArray in fetch ( #23645 )
...
* Support LoDTEnsorArray in fetch op
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
* Support LoDTensorArray in fetch
test=develop
5 years ago
Qinghe JING
ad7ac4c607
create bmm op and move several api from fluid.layers to tensor ( #23457 )
...
* add gradient check to reduce ops
* add skip gradient check to reduce ops test=develop
* modify stack api test=develop
* add bmm op and move serval ops from fluid.layers to tensor test=develop
5 years ago
Yiqun Liu
071a702060
Fix the error misjudgment when there are control nodes in graph. ( #23943 )
5 years ago
GaoWei8
490db7f325
add paddle.tensor interface ( #23801 )
...
* add paddle.tensor
test=develop
* polish gpu where codes
test=develop
* polish test code
test=develop
5 years ago
lidanqing
2291634c5c
Add user local data preprocess support ( #23692 )
...
* add local data preprocess support for imagenet
test=develop
* add local data2bin tests
test=develop
* locally two tests passed
test=develop
* change according to reviews
test=develop
5 years ago
Liufang Sang
771c3b297f
update enhance error message for Initializer, smooth_l1 ( #23912 )
...
* update enhance error message test=develop
* update details test=develop
5 years ago
Zhang Ting
b89dd86fb6
Update eigen ( #23203 )
...
* update eigen, test=develop
* remove patches, test=develop
* add definition of -fabi-version, test=develop
* add patch for TensorBlock.h, test=develop
* test windows, test=develop
* only update eigen for Linux, test=develop
* add code comments, test=develop
5 years ago
GaoWei8
0be4b04d13
Api (lod_append) error message enhancement ( #23541 )
5 years ago
hutuxian
df64a96686
support set_test_mode and set comlog level( #23905 )
5 years ago
石晓伟
2d01cc85c4
DeviceContext Split, test=develop ( #23737 )
...
* supports thread-binding stream, test=develop
* avoid using thread_local variables in dtor, test=develop
* modify the stream priority enum, test=develop
5 years ago
huangjun12
8af85922d0
Error message enhancement of 6 op ( #23759 )
...
* enhance error message test=develop
* fix CI test of 3 op test=develop
* fix bug caused by the diff of long type in py2 and py3, test=develop
5 years ago
Zhang Ting
318dfa0d4f
remove eval in eigen function when dtype is fp16 ( #23845 )
5 years ago
Zhong Hui
361c6ccc01
OP error message enhancement of l2_normalize, matmul, mean, etc
...
* fix error message of l2_normalize, matmul, mean, etc.
* add the test case for those ops
5 years ago
chenhaoze
b7d185d6ca
OP clip, merge_lod_tensor, convert/elementwise error message enhancement ( #23742 )
...
* OP clip, merge_lod_tensor, convert/elementwise error message enhancement. test=develop
5 years ago
gfwm0502
d8ca66dae2
Modify documents of executor and randn and fix other errors ( #23879 )
...
test=develop
5 years ago
gfwm0502
a756360263
OP/API (While/while_loop/DynamicRNN) : Error Message Enhancement ( #23896 )
...
As the title
5 years ago
Zhang Ting
b88662254b
use 32 bit index to improve expand op ( #23899 )
...
* use 32 bit index to improve expand op, test=develop
* remove redundant code, test=develop
5 years ago
lijianshe02
e21b3c273e
add nll_loss op test=develop ( #23758 )
...
* add nll_loss op test=develop
5 years ago
yiicy
f5f76e610d
fusion_seqconv_eltadd_relu error message enhancement. ( #23554 )
5 years ago
Double_V
b4daea13f4
API/OP (center_loss, fluid.one_hot, prroi_pool, roi_pool, ctc_greed_decoder) error message enhancement ( #23794 )
...
* error message enchanced, test=develop
* error message enchanced for APIs, test=develop
* error message enhanced for roi_pool, test=develop
* update added code, test=develop
* update fluid/input.py, test=develop
* update fluid/input.py ,test=develop
* fix code style, test=develop
* fix lower converage, test=develop
* error message enhanced for roi pool, test=develop
5 years ago
zhupengyang
56c54cccbd
Op(prelu/relu/f.relu/f.log_softmax) error message enhancement ( #23792 )
5 years ago
guofei
c2a60bb1fa
Correct the wrong name in the flag comment ( #22977 )
...
Correct the name [`FLAGS_sync_nccl_allreduce`](https://www.paddlepaddle.org.cn/documentation/docs/zh/advanced_guide/flags/others_cn.html#flags-sync-nccl-allreduce ) based on the information from our official website.
5 years ago
gfwm0502
af149f2595
OP(compare/get_places/shrink_rnn_memory) error message enhancement ( #23780 )
...
As the title.
5 years ago
zhaoyuchen2018
f0b08123b2
OP(fused_embedding_fc_lstm) error message enhancement. test=develop ( #23527 )
...
* API(fused_embedding_fc_lstm) error message enhancement. test=develop
C++ API enhancement.
* Refine code, test=develop
* Refine code. test=develop
5 years ago
zhaoyuchen2018
ef72de6fe0
API(Print) error message enhancement. test=develop ( #23521 )
...
Enhance print op c++ error message.
5 years ago
Zhong Hui
1066f59bf7
fix compile problem on windows and some invalid argument check
...
Fix the compile problem in norm op in the windows env, add delete some invalid argument
5 years ago
Zeng Jinle
82f3348ad0
remove useless comment in layer.cc, test=develop, test=document_fix ( #23847 )
5 years ago
Bai Yifan
fc1b140aba
Refine error message, test=develop ( #23823 )
...
* refine error message, test=develop
5 years ago
yiicy
de3e299dbb
fusion_seqexpand_concat_fc error message enhancement, test=develop ( #23558 )
5 years ago
yiicy
a1e7387919
Variable error message enhancement, test=develop ( #23548 )
5 years ago
wangguanzhong
93aa66902d
enhance the error message, test=develop ( #23646 )
...
* enhance the error message of box_coder, test=develop
* enhance error message in box_decoder_and_assign
* enhance error message of collect_fpn_proposals
* enhance errror message of distribute_fpn_proposals
* enhance error message of multiclass_nms, test=develop
5 years ago
xiaoting
7de0a25b5e
Add `nn.interpolate ` ( #23434 )
...
* add nn.interpolate, test=develop
* fix interpolate typo, test=develop
* formate code, test=develop
* fix unitest, test=develop
* add test layers, test=develop
* add test layers, test=develop
* extract common function, test=develop
* reduce the threads for cuda10, test=develop
* update unitest, test=develop
* polish unitest, test=develop
* add dygraph test, test=develop
* format description, test=develop
* add 5D input check, test=develop
* fix doc, test=develop
5 years ago
huzhiqiang
5fe3b63824
[error message enhancement] fused_elemwise_activation_op and fusion_conv_inception_op ( #23686 )
5 years ago
Liufang Sang
9549b78691
OP Normal, Uniform, Xavier Initializer, smooth_l1, mean_iou error message enhancement ( #23751 )
...
* enhance error message test=develop
* enhance error message test=develop
* change to INOUT_CHECK test=develop
5 years ago
yaoxuefeng
5b69242fab
modify datanorm op test=develop ( #23030 )
5 years ago
suytingwan
3e1676fa9a
Add meshgrid op ( #23736 )
...
* test=develop fix api doc error
5 years ago