zlsh80826
479c8834f7
[Paddle-TRT] Fixes #24731 , opt for SoftmaxKernelWithEltadd kernel, test=develop ( #24834 )
...
* blockReduce opt
* launch threads align to warpSize
* reduce unnecessary shared memory for broadcast reduced value
* vectorize SoftmaxKernelWithEltadd
* add fp16 constrain
* test=develop
5 years ago
hutuxian
5822862d8a
Monitor Framework ( #24079 )
...
* Add a StatValue class in the backend to represent a stat.
* Add a singleton StatRegistry to maintain the collection of stats.
* For the sake of code neatness, we only support type of int and float, which can cover most of the scenarios.
5 years ago
Leo Chen
028de857d4
fix dtype error of compare op, test=develop ( #25059 )
5 years ago
Jeng Bai-Cheng
bef4afa6de
bugfix for unique_ptr of IOptimizationProfile ( #23917 )
...
This commit fixs the compiling bug regarding unique_ptr of IOptimizationProfile.
IOptimizationProfile has protected dtor and is controlled by TensorRT
internally. Application shouldn't delete the pointer of IOptimizationProfile.
See TensorRT document: https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_builder.html#a9ac47e100454151d8206ac91d543299a
test=develop
5 years ago
zlsh80826
49e4ee27e1
[Paddle-TRT] slice kernel optimization ( #24783 )
...
* parallel move shared data test=develop
* test=develop
5 years ago
tianshuo78520a
770c11a117
fix make device_context error ( #25045 )
...
* test=develop
* test=develop
* fix bug
* test=develop
* test=develop
5 years ago
tangwei12
be6a315fbd
Fix/sync barrier ( #25016 )
...
* fix sync barrier with barrier monitor, test=develop
5 years ago
ceci3
8db66fc3f6
fix cos_sim, test=develop ( #25017 )
5 years ago
Leo Chen
25a4dac4c2
Use allow list instead of white list ( #25002 )
...
* use allow list instead of white list, test=develop
* reduce include, test=develop
5 years ago
Zhang Ting
621b638550
improve performance of instance_norm, test=develop ( #25005 )
5 years ago
hutuxian
1c224e26af
support CMatchAuc ( #24990 )
...
Support CMatchAucCalculator based on CMatchRankAucCalculator with a new parameter ignore_rank
5 years ago
Zhou Wei
ff8ca52f88
windows publish package scripts ( #24851 )
...
* windows publish package scripts,test=develop
* windows publish package scripts,test=develop
* windows publish package scripts,test=develop
5 years ago
Leo Chen
bfa46c38d5
bn supports reverse_space, test=develop ( #24988 )
5 years ago
wangchaochaohu
613303dbf6
refine the slice Op to improve the performance of xlnet for fp16 training ( #24967 )
5 years ago
silingtong123
37bdb5269f
test=develop, add log message in the function UpdateDllFlag ( #24937 )
...
* test=develop, add log message in the function UpdateDllFlag
* test=develop, add the test
5 years ago
Chen Weihang
d152d7231e
clear old var in scope, test=develop ( #24976 )
5 years ago
Sylwester Fraczek
53d563a0fe
Reshape transpose matmul coverage ( #24970 )
...
* remove gmock from ut
test=develop
* coverage enabled for r+t+m fuse pass
test=develop
5 years ago
wawltor
0eb1b0bc01
Add support the 5d, 6d tensor support for the reduce ops
...
Add the support the 5d,6d tensor support for the reduce ops;
Add the same time, the compile time, it was 22 minutes, it was 21 minutes after fixed.
5 years ago
liuwei1031
8603b5fb72
fix randomly hang issue of PaddleDetection training task on windows ( #24977 )
5 years ago
silingtong123
640196c446
test=develop, remove the tensorrt dll file from windows package ( #24922 )
5 years ago
wangchaochaohu
feba131893
fix the sgement fault error of profiler in seqseq model test=develop ( #24952 )
5 years ago
Sylwester Fraczek
a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free ( #24929 )
...
test=develop
5 years ago
mapingshuo
24e24987f0
fixes the place info in the Print op ( #24934 )
...
fixes the CUDAPlace info in the Print op
5 years ago
Aurelius84
6be0ee159e
Support LoDTensorArray in reverse_op ( #24797 )
...
* Support LoDTensorArray in reverse_op test=develop
* polish en doc and unittest code test=develop
* refine sample code test=develop
* add example of LoDTensorArray test=develop
* fix typo test=develop
5 years ago
Leo Chen
6190023ac9
Refine error message in pybind folder ( #24886 )
...
* refine err_msg of pybind.cc, test=develop
* refine err_msg in tensor_py.h, test=develop
* refine error msg, test=develop
* fix test_exception, test=develop
* follow comments, test=develop
5 years ago
Zhou Wei
4058e736ff
temporarily disable these unittests failed on windows ( #24942 )
5 years ago
Leo Chen
a7cb97a1a5
Fix/isfinite on windows ( #24927 )
...
* refine isfinite, test=develop
* use namespace std of isfinite, test=develop, test=win_gpu
5 years ago
silingtong123
ef9b36873d
test=develop, remove the gflags/gflags.h form paddle_api.h ( #24921 )
5 years ago
whs
4c01d6d53e
Enhance checking in some operator. ( #24473 )
5 years ago
Chen Weihang
4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode ( #24690 )
...
* support selectedrows allreduce in multi-cards dygraph, test=develop
* remove useless import modules in unittests, test=develop
* add nccl cmake to get nccl version, test=develop
* add if-condition to compiled correctly, test=develop
* add detail version parseing for old nccl, test=develop
* polish camke details, test=develop
* fix remove test cmake error, test=develop
* fix cmake condition, test=develop
* change unittest camke list, test=develop
* fix unittest cmake rule, test=develop, test=framep0
5 years ago
Pei Yang
14b8540551
add default ctor for AnalysisConfig python api. test=develop ( #24924 )
5 years ago
silingtong123
fc4435174b
test=develop, fix the bug of tensorrt package can't compile on windows ( #24860 )
...
* test=develop, fix a bug
* test=develop, remove the macro of PADDLE_DLL_INFERENCE
5 years ago
lilong12
29de0d97a5
add the support to specify device index for device_guard ( #24555 )
...
* add the support of device index for device_guard.
5 years ago
lilong12
6e10022781
add queue_generator_op, dequeue_op, enqueue_op and ut ( #24481 )
...
* add queue_generator_op, dequeue_op, enqueue_op and ut, test=develop
5 years ago
hutuxian
b8f17a049d
fix problem in dump and add log ( #24891 )
...
* Fix the field length in LoD scenario
* Fix the missed lod info when copy tensor in dump field
* Add some log to make debug easy
5 years ago
石晓伟
76cdbb84b0
ignore warnings of external libraries, test=develop ( #24193 )
5 years ago
Leo Chen
1e818158f5
Feature/add amp_checkout_finite_and_scale op ( #24875 )
...
* add amp_check_finite_and_scale op, test=develop
* add cpu kernel, test=develop
* use bool, test=develop
* follow comments, test=develop
5 years ago
zhangchunle
576d68083e
generate ci index ( #24792 )
5 years ago
leesusu
a6beb96dd0
FTRL with sparse update, test=develop ( #22092 )
5 years ago
Chen Weihang
6aae034f8d
add dep for fs.cc, test=develop, test=document_fix ( #24881 )
5 years ago
Jacek Czaja
40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor ( #24756 )
5 years ago
Chen Weihang
d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW ( #24759 )
...
* remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibit LOG(FATAL) using, test=develop
* remove ci test case, test=develop
* replace all LOG(FATAL) & polish message, test=develop
* fix typo, test=develop
* polish error info detail, test=develop
5 years ago
Chen Weihang
a4f6003404
append try-catch to opbase run, test=develop ( #24870 )
5 years ago
Michał Gallus
23a85f030c
Remove old mkldnn_elementwise_mul test ( #24855 )
...
test=develop
5 years ago
123malin
9d2bd0ac38
downpour_worker增加try_catch机制,打印program所有参数 ( #24700 )
...
* test=develop, add try_catch for debug
5 years ago
Michał Gallus
b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail ( #24853 )
5 years ago
Zhang Ting
7d0cbfd045
fix negative framework overhead in Profiling Report ( #24850 )
...
* fix negative framework overhead, test=develop
* use overhead summary, test=develop
5 years ago
Yanghello
aa47356b74
Add crypto python ( #24836 )
...
* add crypto helper for paddle, test=develop
* cryptopp.cmake bug fixed, test=develop
* remove debug build type, test=develop
* fixed CMakeLists for new target, test=develop
* fix CI bug, test=develop
* add cmake option flag DWITH_CRYPTO, test=develop
* add crypto api for python, test=develop
* Revert "add crypto api for python, test=develop"
This reverts commit 3a1cfa9d055fab357f46e653a8786f96336f6b47.
* Revert "Add crypto api (#24694 )"
This reverts commit 5a7a517cde
.
* Revert "Revert "Add crypto api (#24694 )""
This reverts commit f952b19fa7e8b7f9c57d31d78b9ffee1041c43ed.
* fixed cryptopp cmake building error, test=develop
* change WITH_CRYPTO building option to OFF, test=develop
* âfixed cipher test failed, test=develop
* "add crypto api for python, test=develop"
This reverts commit 83fb55c0668d59afad2ad1e7e04d425c7c7dd189.
* travis CI bug fixed, test=develop
* fixed test in python3
* test=develop
* fixed unittest, test=develop
5 years ago
Leo Chen
b67ded04f2
Support gradient accumulation of fp16 in imperative mode ( #24823 )
...
* support gradient accumulation of fp16 in imperative mode, test=develop
* enhance coverage test, test=develop
* follow comments, test=develop
5 years ago
Wilber
1e190a9e02
[Inference] [unittest] Inference unit tests rely on dynamic libraries (2) ( #24859 )
5 years ago
Chen Weihang
0aed095188
The third time to simplify the C ++ error stack ( #24831 )
...
* simply C++ error stack once again, test=develop
* refactor code remove string pointer and recursive, test=develop
5 years ago
Qi Li
704cad6a66
Add histc op ( #24562 )
...
* add histc operator, test=develop
* update english doc to 2.0 API, test=develop
* update API from histc to histogram, test=develop
Co-authored-by: root <root@yq01-gpu-255-129-15-00.epc.baidu.com>
5 years ago
Qi Li
a125697e2c
fix ut test_recognize_digits ( #24794 )
5 years ago
Yi Liu
12bffdc086
Enhance error message of checkpoint_notify_op, fake_init_op gen_nccl_id_op and listen_and_serv_op ( #24554 )
...
test=develop
5 years ago
Wojciech Uss
78d4f0cc91
add option to exclude ops by id from quantization ( #24689 )
5 years ago
Yanghello
62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed ( #24816 )
5 years ago
Adam
b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators ( #24798 )
...
* Add isCached() mechanism for BatchNorm and LRN oneDNN operators
test=develop
* Formatting fix
test=develop
5 years ago
zhangchunle
f9c7f58ef6
remove duplicated mac_test build ( #24793 )
5 years ago
Wilber
f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries ( #24743 )
5 years ago
silingtong123
126d3d693b
support C++ inference shared library on windows ( #24672 )
...
* add SetCommandLineOption
* add the print_FLAGS function
* remove the test demo
* modify the location of macro
* add the 'WITH_STATIC_LIB' option on windows
* modify the macro of PD_INFER_DECL
* modify the the fuction name
* modify the unittest
* modify the code style
5 years ago
Zhen Wang
23d253e1be
Fix out of range error for outs map. test=develop ( #24774 )
5 years ago
Zhou Wei
d1047d0a69
add WITH_GPU for cudaerror download ( #24056 )
5 years ago
Aurelius84
a7e21cbed3
Move input_size check into RunTime phrase of gru_unit_op and refine error message ( #24776 )
...
* Add IsRuntime judgement in GRUUnit test=develop
* add IsRuntime judgement is GradOp test=develop
* Refine Error Message of SelecteInput/Output test=develop
* refine Error Message of RNNMemoryHelperOp test=develop
5 years ago
Zhou Wei
f66594a558
fix bug that diag API can't use on Windows( #24762 )
5 years ago
Leo Chen
c0911fdd32
rename inplace/no_need_buffer inferer, part4, test=develop ( #24781 )
5 years ago
Chen Weihang
be82de4c79
polish two error message, test=develop ( #24778 )
5 years ago
Yanghello
5a7a517cde
Add crypto api ( #24694 )
5 years ago
Leo Chen
b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop ( #24733 )
5 years ago
Leo Chen
a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop ( #24734 )
5 years ago
wangchaochaohu
355caee18b
fix conv_transpose Op fp16 error test=develop ( #24695 )
5 years ago
Chen Weihang
19e5f7879c
Append error op hint for GradOpMaker ( #24750 )
...
* append error op hint for grad op maker, test=develop
* add unittests for coverage, test=develop
5 years ago
Pei Yang
181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop ( #24746 )
5 years ago
wangchaochaohu
79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop ( #24744 )
5 years ago
Adam
56a714a19b
Add isCached() machinism to oneDNN pooling primitive ( #24724 )
5 years ago
randytli
d15fc95ef5
Delete unused Cmake variables( #24707 )
5 years ago
Aurelius84
6db0e2b1a9
[Dy2stat] Support len syntax ( #24638 )
...
* [dy2stat] Support len
* add convert_call func
* refine code test=develop
5 years ago
ShenLiang
950892044f
fix conflict, test=develop ( #24238 )
5 years ago
lidanqing
c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops ( #24333 )
...
* Update PADDLE_ENFORCE in DNNL related ops
test=develop
* Abstract macro of OP_GET_PLACE_CHECK
test=develop
* update according to reviews
* update GET_PLACE_CPU_CHECK
* fix typo
test=develop
* revert macro
test=develop
5 years ago
Wilber
dbe2497768
fix mkldnn concat bug. test=develop ( #24722 )
5 years ago
Leo Chen
de8b4f42cf
rename inplace/no_need_buffer inferer, part 1, test=develop ( #24711 )
5 years ago
hutuxian
e6b87b3193
Support AucRunner in PaddleBox ( #22884 )
...
* Support AucRunner in PaddleBox
* update some code style
5 years ago
Chen Weihang
7fa9f16c17
Polish reader folder error message ( #24698 )
...
* polish reader error message, test=develop
* fix detail error, test=develop
* reset activation dcudnn change, test=develop
5 years ago
wangchaochaohu
dbfe5333c5
Add pe profiler Event ( #24611 )
5 years ago
Liufang Sang
55b664a131
remove pow to speed up in dequantize_log op ( #24607 )
...
* remove pow in speed up in dequantize_log test=develop
* remove pow in speed up in dequantize_log test=develop
* fix unittest test=develop
5 years ago
tianshuo78520a
37ae661ca1
conda install cuda10 cudnn7.6 update ( #24665 )
5 years ago
Jacek Czaja
ca68b13f85
[oneDNN] Fix to elementwise_add grad ( #24639 )
5 years ago
Adam
586b587519
Add isCached() check in Softmax handler ( #24637 )
...
* Update isCached() to be thread freindly
test=develop
* Add isCached() check inside Softmax handler
test=develop
* Fix PaddleEnforce() message
test=develop
5 years ago
Leo Chen
9c9e635c00
support tensor to varbase, test=develop ( #24660 )
5 years ago
zlsh80826
fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop ( #24544 )
5 years ago
Leo Chen
14dd6388c5
fix bug of varbase.__getitem__, test=develop ( #24642 )
...
* fix bug of varbase.__getitem__, test=develop
* fix bug of float and other type, test=develop
5 years ago
jinyuKING
6ca44cbaa3
API/OP(sequence_first_step, sequence_last_step, sequence_mask, beam_search, beam_search_decode) error message enhancement ( #24590 )
5 years ago
Wilber
ba2f8f0ce4
fix embedding_eltwise_layernorm_fuse_pass. test=develop ( #24592 )
5 years ago
Huihuang Zheng
f831f61216
Make buddy_allocator_test Exclusive ( #24636 )
...
The background is that buddy_allocator_test depends on some big GPU allocations and current machine may not have it. It can cause random failure on cloud machine. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
5 years ago
hutuxian
0ec3a42e97
Random Dump ( #24477 )
...
* Refactor code for dump_field & dump_param: abstracting the common function in base class.
* Support dump randomly & random with lineid
* Support specify the random interval, which avoids printing too much logs.
5 years ago
Leo Chen
1d03469685
use vector instead of pointer, test=develop ( #24620 )
5 years ago
Yiqun Liu
6b464f969a
Add an operator node in unittest to make the fusing result unique. ( #24617 )
5 years ago
ShenLiang
837dd47a96
Add lod in gather/scatter ( #24613 )
...
* add lod msg in gather and scatter_op, test=develop
5 years ago
Jacek Czaja
3292f0ef58
[onednn] elementwise add broadcasting support ( #24594 )
5 years ago
Yiqun Liu
560c815390
Add some check for CUDA Driver API and NVRTC ( #22719 )
...
* Add the check for whether CUDA Driver and NVRTC is available for the runtime system.
* Call cuInit to initialize the CUDA Driver API before all CUDA callings.
test=develop
* Change the behavior when libnvrtc.so can not be found, printing a warning instead of exiting.
test=develop
* Do not initialize CUDA Driver API for windows and macos.
test=develop
* Remove the call of cuInit when entering paddle and enable the test_code_generator.
test=develop
* Add some built-in functions for __half.
test=develop
* Change save_intermediate_out to false in unittest.
test=develop
* Fix error reference to tempropary variable when seting including path for device_code.
test=develop
5 years ago
Guo Sheng
fe8d006f7e
API/OP(sequence_expand_as) error message enhancement ( #23712 )
...
* API/OP(sequence_expand_as) error message enhancement.
test=develop
Co-authored-by: FrostML <380185688@qq.com>
5 years ago
Leo Chen
d980d251f0
specify outs, test=develop ( #24537 )
5 years ago
yaoxuefeng
16817c70fc
OP(datanorm lookupsparsetable lookuptable) error message enhancement ( #24506 )
...
* OP(datanorm lookupsparsetable lookuptable) error message enhancement
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
* fix test=develop
5 years ago
liu zhengxi
fff9faae72
API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement ( #24513 )
...
* dynamic_gru err_msg enhancement, test=develop
* chunk_eval err_msg enhancement and fix crf_decoding output type, test=develop
* BeamSearchDecoder err msg enhancement, test=develop
* fix doc for chunk_eval, test=develop
* refine lod err msg for chunk_eval, test=develop
5 years ago
wangchaochaohu
c4dd596de6
cusum op optimization for GPU kernel ( #24321 )
5 years ago
Leo Chen
d43e4047ff
refine err msg, test=develop ( #24550 )
5 years ago
liym27
615a8bfcf4
Support int32 int64 and fix bug ( #24407 )
...
* Compatible int32 and int64 for attr in op slice/strided_slice. test=develop
* Polish code in nn.py test=develop
* Fix bug: set the same dtype for the inputs of elementwise_add. test=develop
* Convert int32 to int64 in slice op to avoid data overflow. test=develop
* Convert int32 to int64 in strided_slice_op to avoid data overflow. test=develop
5 years ago
Zhaolong Xing
f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All ( #24517 )
...
test=develop
5 years ago
lilong12
a13a4dbc7b
Improving error reporting messages for ops ( #24438 )
...
* improve error reporting message
5 years ago
123malin
45ef6ff338
test=develop, error info improvement ( #24496 )
5 years ago
Adam
dcf17f4813
Add isCached() mechanism to elementwise_add DNNL ( #24563 )
...
* Add isCached() mechanism to elementwise_add
test=develop
* Hide code inside handler
test=develop
5 years ago
pawelpiotrowicz
db2b6b6568
Hide globals & redesign restore PR ( #24279 )
...
test=develop
5 years ago
Jiawei Wang
4a105f803e
SamplingID Op fix error print ( #24521 )
...
* fix error print for sampling_id_op
* fix spell err
* fix spell err test=develop
5 years ago
ShenLiang
53e3c53423
fix error message, test=develop ( #24425 )
5 years ago
ruri
ea2c498712
Fix err message ( #24507 )
...
* fix error message, test=develop
5 years ago
Wilber
4ec7287602
fix compile when with_nccl=off. test=develop ( #24444 )
5 years ago
chajchaj
e5a624935f
Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op ( #24485 )
...
* error message of cross_entropy_op, test=develop
* fix bug : can't use platform::errors::InvalidArgument in HOSTDEVICE, test=develop
* fix bug: recovery the check_variable_and_dtype for rank_loss and bpr_loss, test=develop
5 years ago
FDInSky
526a211774
update conv error info ( #24430 )
...
* test=develop update conv error info
* test=develop update iou_similarity error info
* test=develop update some error info based review
5 years ago
Yibing Liu
a9520db24e
Format error message for ops ( #24482 )
...
* Format error message for ops, test=develop
* Fix check in sequence_expand, test=develop
5 years ago
Yang Zhang
2644cb819d
Remove extraneous comma in error messages ( #24478 )
...
which messes up the formatting
test=develop
5 years ago
suytingwan
05c9642dc3
Update paddle enforce message ( #24498 )
...
* test=develop error message update
5 years ago
lijianshe02
9f83f0fe69
API/OP (group_norm, layer_norm, random_crop, unpool) error message enhancement ( #24413 )
...
* API/OP (group_norm, layer_norm, unpool) error message enhancement test=develop
5 years ago
hutuxian
7f85dfeb18
Upgrade Error Message for AucOP & MultiplexOP ( #24458 )
5 years ago
WangXi
027f9953bc
Optimize error message, include dgc, nccl, size op ( #24456 )
5 years ago
gongweibao
f1c57d648c
Enhance error message of prefetch_op, proximal_adagrad_op, proximal_gd_op ( #24436 )
5 years ago
danleifeng
d98e1182a1
fix check and error message for flatten hash is_empty op ( #24434 )
...
fix check info for flatten hash is_empty op; test=develop
5 years ago
liu zhengxi
417b576c34
API(dynamic_lstm, dynamic_lstmp) error message enhancement ( #24450 )
...
* update err msg for dynamic_lstm and dynamic_lstmp, test=develop
5 years ago
wangchaochaohu
53bdee64e6
add tensor support for gaussian_random_op test=develop ( #24389 )
5 years ago
qingqing01
da4a1db7bb
Refine error message in some OPs ( #24443 )
...
test=develop
5 years ago
Xing Wu
479c47f316
fix error info for transpose sequence_conv_pool max_sequence_len sequ… ( #24437 )
...
* fix error info for transpose sequence_conv_pool max_sequence_len sequence_erase, test=develop
* fix error info for transpose sequence_conv_pool max_sequence_len sequence_erase, test=develop
* update modify, test=develop
* update modify, test=develop
* fixed some modifications, test=develop
5 years ago
WuHaobo
1c00732dd0
Polish PADDLE_ENFORCE of unfold_op ( #24423 )
5 years ago
MRXLT
71ff32b65d
update error message for unstack op and lamb op; test=develop ( #24439 )
5 years ago
ForFishes
05c3bc3bf6
fix error message, test=develop ( #24447 )
...
fix scatter and scatter_nd op error message
5 years ago
ceci3
633aaea07e
Update err message ( #24422 )
...
* update err info, test=develop
* update, test=develop
5 years ago
Chen Weihang
05d20e5793
API/OP (Some SL API) error message enhancement ( #24441 )
...
* polish some sl api error message, test=develop
* polish python input check of stride slice, test=develop
* fix unittest bugs, test=develop
5 years ago
tangwei12
a97d5a6153
fix op error, test=develop ( #24451 )
5 years ago
Yang Zhang
7c17ed57e0
Refine `squared_l2_distance_grad` error message ( #24409 )
...
test=develop
5 years ago
huangjun12
046b7ebc03
refine Error message of modified_huber_loss ( #24417 )
...
* refine Error message of modified_huber_loss, test=develop
* refine Error message style, test=develop
5 years ago
zhang wenhui
621a4085b9
enhance cvm bpr_loss adam adagrad adamax ftrl error message, test=develop ( #24452 )
5 years ago
Li Fuchen
e58619295e
OP(rank_loss, similarity_focus, squeeze) error message enhancement ( #24448 )
...
* enhance rank_loss error message, test=develop
* enhance similarity_focus error message, test=develop
* enhance squeeze error message, test=develop
5 years ago
wangguanzhong
abb49df26f
Enhance yolo_box & yolov3_loss ( #24370 )
...
* add scale_x_y for yolo_box, test=develop
* refine eps in iou_similarity, test=develop
5 years ago
mapingshuo
d1bb76a2f7
fix error log, test=develop ( #24419 )
...
* fix error log: resahpe, range, reverse.
5 years ago
Jacek Czaja
8b88cd5167
[oneDNN] Fix to inplace pass ( #24442 )
...
* - Disabling inplace pass
test=develop
- Disable cycles
test=develop
- fix
test=develop
- Enhancement to in-place
- Lint fixes
test=develop
* - Lint fixes
test=develop
5 years ago
Bai Yifan
f1c4c14c7a
Fix fsp_op error message,test=develop ( #24405 )
...
* fix fsp_op error message,test=develop
5 years ago
Jacek Czaja
d0307145a3
[oneDNN] disabling oneDNN inplace pass ( #24406 )
5 years ago
Kaipeng Deng
6435d1b4b8
fix PADDLE_THROW in spectral_norm_op.h. test=develop ( #24414 )
5 years ago
wangguanzhong
cd327e66df
optimize error message, test=develop ( #24420 )
5 years ago
FDInSky
7fedf26b87
add linear interpolate operator ( #23357 )
...
* test=develop add linear interpolate operator
5 years ago
Liufang Sang
e24575c8ba
fix error message in accuracy_op.h test=develop ( #24416 )
5 years ago
wawltor
2de5075a6d
Fix the elementwise ops in broadcast in the process of backward ( #24319 )
...
* Remove the error in the elementwise op, use the backup mode to calculate
5 years ago
Wojciech Uss
fddf442470
add batch size to the mkldnn matmul cache key ( #24408 )
...
test=develop
5 years ago
Tao Luo
72c370c8d2
remove unused test_multi_thread_helper.h ( #24399 )
...
test=develop
5 years ago
hutuxian
123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel ( #24377 )
...
* Add InitializeGPUAndLoadModel to solve random hang when downloading sparse parameters.
* Update SaveBase to solve test problem.
5 years ago
qingqing01
43625bdabd
Add a high-level API with traning and inference into Paddle. ( #24293 )
...
* Merge hapi into Paddle
Hapi is a high level API for training and inference.
The main modules include Model, Loss, Metrics, Dataset.
Also includes common modules and models in NLP and computer vision, such as BERT, ResNet.
These modules are developed by:
0YuanZhang0, guoshengCS heavengate, LielinJiang, qingqing01, xyzhou-puck huangjun12, wangxiao1021, zhangyang.
5 years ago
Tao Luo
63da846de0
remove old inference C++ tests ( #24368 )
5 years ago
Chen Weihang
aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get ( #24175 )
...
* add new macro BOOST_GET_SAFELY & unittests, test=develop
* add different macro type, test=develop
* fix get macro type in executor, test=develop
* four macro part change backup
* using one macro for all case, test=develop
* revert attribute change, test=develop
* change to three func to solve gcc4.8 bug, test=develop
* polish some details, test=develop
5 years ago
Pei Yang
9c073bbd53
skip pyramid_hash on jetson platforms to avoid compile error. test=develop ( #24371 )
5 years ago
guofei
70bc488979
Fix the error of recurrnet op in multithreading in eval process ( #24357 )
...
CreateStepScopes in recurrent op also clears scopes, which can cause segmentation fault un multi-threading. We add a lock in this PR but it may slow the computation process. We will fix in another way in next PR.
5 years ago
wangchaochaohu
01e45a06f6
add dependency for conda test=develop ( #24318 )
5 years ago
hong
67f66f0904
Fix get item out of range error ( #24339 )
...
* raise index error when slice out of range; test=develop
* add uni test; test=develop
* fix format error; test=develop
* add comment for py::index_error; test=develop
* polish error message; test=develop
* polish error message; test=develop
5 years ago
Wojciech Uss
db052009c7
Enabled quantize all and skip missing in QAT ( #24281 )
...
* Enabled quantize all and skip missing in QAT
5 years ago
Aurelius84
1ed8baf9c8
[dy2static] Support for static graph training with @declarative decorator ( #24259 )
...
* support to train in static
* support to independent decorator
* remove in_dygraph_mode condition in ProgramTranslator
* fix import param_guard and add train/eval test=develop
* Modify into ShareVarsFromScope and rm __all__ in partial_program test=develop
5 years ago
Huihuang Zheng
8a1a2af82e
Add Assert Op ( #24280 )
...
1. To make ProgramTranslator to support `assert` grammar, this PR adds `assert` python API and C++ code.
2. Fix a bug: graph_pattern_detector.h #include <gtest/gtest_prod.h> but didn't declared dependency at CMakeLists, which can cause single build failure.
3. Refactoring `Formatter` in print_op to make it reusable and reuse the formatter to print in assert op.
5 years ago
Pei Yang
8c296dea75
fix compile error(cpuid.h not found) on nvidia jetson platforms. test=develop ( #24329 )
5 years ago
Guo Sheng
4a5de14426
Remove cusolver potrfBatched support on Windows. ( #24338 )
...
test=develop
test=win_gpu
5 years ago
Tao Luo
c1df7048c7
add UT for mkldnn_cache_capacity ( #24336 )
...
* add UT for mkldnn_cache_capacity
test=develop
* fix comparison of integer expressions of different signedness
test=develop
5 years ago
Feiyu Chan
420707c26e
API/OP (margin_rank_loss, nce, row_conv, positive_negative_pair) erro… ( #24246 )
...
* API/OP (margin_rank_loss, nce, row_conv, glu, positive_negative_pair) error message enhancement, test=develop
* add unittest for glu, test=develop
5 years ago
joanna.wozna.intel
356f5ee220
[Refactoring] Unify op-dequant squashes ( #24277 )
5 years ago
liym27
ac9a7eeea4
[Dy2Stat]Support list pop ( #24250 )
...
* Replace dygraph_to_static_func with @declarative or program_translator.get_func in test_list.py
* Add comments in ConditionalBlock.
* Support list pop last item.
* Support pop the i-th item.
* Support an empty tensor array as Input in assign op and set the kernel type is float.
5 years ago
Tao Luo
9eedf05d2f
solve mklml memory leak on windows ( #24015 )
...
* solve mklml memory leak on windows
test=develop
* remove unused msvcr120.dll
test=develop
5 years ago
Zhang Ting
b71abeee1b
use 32 bit index to improve activation ops ( #24206 )
...
* improve activation ops performance, test=develop
* use 32bit only GPU computation, test=develop
5 years ago
xujiaqi01
1034ca316f
add timeout and http store in communication ( #23436 )
...
* add timeout and http store in communication, add revert and confirm in fleet
* test=develop
5 years ago
Guo Sheng
1fc6cc502a
Fix cusolver loader for Windows ( #24157 )
...
* Fix cusolver loader for Windows in dynamic_loader.cc. test=develop
* Fix missing CUSOLVER_ROUTINE_EACH_R1.
test=gpu
test=develop
* Add unsupprot for cusolver on Windows temporarily. test=develop
* Fix GetCusolverDsoHandle error message. test=develop
5 years ago
wawltor
d1e1d85881
add the graph batch reader for pslib mode ( #24178 )
...
Add the pslib graph batch reader mode, add the test case for this change
5 years ago
liym27
803559499d
[Dy2Stat]Support LoDTensorArray for slice op ( #23091 )
...
* Support LoDTensorArray for slice op.
* Support read elements of list in dygraph_to_static
* Fix infershape add test for infershape.
* Support Tensor for Attr(starts) and Attr(ends).
* Use new interfaces in VarTypeInference.
5 years ago
Feiyu Chan
78cd3dd507
fix kron_op: when only one input needs gradient, test=develop ( #24269 )
...
fix kron_op: when only one input needs gradient
5 years ago
Li Fuchen
5dc069d050
OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement ( #24261 )
...
* enhance add_position_encoding error message, test=develop
* enhance warpctc & scaled_dot_product_attention error message, test=develop
* modified error message and ctest of scaled_dot_product_attention, test=develop
5 years ago
liym27
19511dfa25
Fix bug in assign op: support to infer shape for LOD_TENSOR_ARRAY. ( #24268 )
5 years ago
joanna.wozna.intel
b43b46e619
[INT8] Add requant-op squash ( #24143 )
5 years ago
石晓伟
17ac6e2580
update the analysis predictor for multi-stream support, test=develop ( #24046 )
...
* update the analysis predictor, test=develop
* update the unit test, test=develop
* no priority set before the inferface determined, test=develop
* interface name generalization, test=develop
5 years ago
hutuxian
3e2bc8715f
Try to fix UT Random Fail ( #24223 )
5 years ago
lidanqing
61ec30f030
Update QAT INT8 2.0 doc ( #24127 )
...
* update local data preprocess doc
* update for 2.0 QAT
test=develop
test=document_fix
* update benchmark data
test=develop
test=document_fix
Co-authored-by: Wojciech Uss <wojciech.uss@intel.com>
5 years ago
Sylwester Fraczek
e1a7a88057
added reshape transpose matmul fuse pass ( #23754 )
5 years ago
Tao Luo
d7850dcdc0
add noavx_axpy and noavx_axpy_noadd ( #24207 )
...
* remove double registery for pyramid_hash op
* add noavx_axpy and noavx_axpy_noadd
test=develop
5 years ago
Huihuang Zheng
3047b69f94
Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure ( #24205 )
...
test=develop
5 years ago
Chen Weihang
9b851ba216
[dy2static] Add print transformer and unify print format ( #24068 )
...
* add print transformer & unify print format, test=develop
* remove using of dygraph_to_static_func, test=develop
* remove python stdout capture, test=develop
* fix compatibility problems for PY2, test=develop
* fix detail error, test=develop
* fix type analysis bug, test=develop
* fix print tuple compatible error in PY2, test=develop
* replace get_func to declarative, test=develop
* fix detail bug, test=develop
* fix some detail problems, test=develop
* change visit_call in print transformer, test=develop
5 years ago
Li Fuchen
077e5a0fe5
Add trace op ( #23873 )
...
* add trace op, test=develop
* Optimized the implementation of trace op, test=develop
* fix a bug of include in trace_op.h, test=develop
* move trace API from creation to math, test=develop
* modified en doc. test=develop
* add complex trace api
* add complex sum api, test=develop
* modified en doc of complex sum and trace, test=develop
* modified doc and trace API, test=develop
* modified en doc of trace and sum, test=develop
* modified comment in complex kron API, test=develop
* OP Should Not Have Unused Input, test=develop
* add GetExpectedKernelType, test=develop
5 years ago
wangchaochaohu
fa43d74a3a
fix the intermediate node of graph for fusion group test=develop ( #24184 )
5 years ago
Yiqun Liu
ecfddebbef
Add the implementation of inverse ( #23310 )
5 years ago
HappyAngel
8ea325aa53
c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop ( #23631 )
5 years ago
Tao Luo
e3179ea2f5
refine ccache statistics show ( #24167 )
...
test=develop
5 years ago
ShenLiang
0fb9b208ab
Add batch_fc op in contrib ( #24017 )
...
* add batch fc op, test=develop
* add batch_fc_op, test=develop
* fix untest, test=develop
* rm check_dygraph, test=develop
* fix comment, test=develop
* fix comment, test=develop
5 years ago
liuwei1031
9a93f6aae0
improve efficiency of runtime InferVarType ( #22778 )
...
* save InferVarType changes, test=develop
* remove code comments, test=develop
* tweak code, test=develop
* fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop
* modify fused_bn_activation_op, test=develop
* fix error of fused_bn_activation_op, test=develop
* fix PADDLE_ENFORCE and unittest coverage issue, test=develop
* tweak PADDLE_ENFORCE messages, test=develop
* improve unittest coverage, test=develop
* add StaticGraphInferVarType class, test=develop
* rebase develop branch, test=develop
* fix unittest error, test=develop
* remove comments, test=develop
* improve unittest coverage, test=develop
* imporve error message and imporve unittest coverage, test=develop
* upgrade InferVarType API, test=develop
* tweak pyfunc error message, test=develop
* fix compilation conflict - save_combine_op, test=develop
5 years ago
hong
bfb60efb1f
change compare forece_cpu default value; test=develop ( #23888 )
5 years ago
wangchaochaohu
2270864019
Fusion group optimize for cuda codegen( #23940 )
5 years ago
ShenLiang
94dfb7d770
opt the postprocess, test=develop ( #24155 )
5 years ago
Chengmo
28a558e867
Update index sample ( #24109 )
...
* update index sample
5 years ago
Zhang Ting
ab8f8fa70d
fix example code, test=develop, test=document_fix ( #24139 )
5 years ago
Feiyu Chan
e01262e691
add kron op ( #24105 )
...
* add kron op and its python API, doc and unittests.
* add kron in paddle.complex
5 years ago
Jacek Czaja
eb411613e9
[DNNL] refine activations Inplace support ( #24145 )
5 years ago
wangchaochaohu
6bf26ef156
fix warning mac compiler ( #24138 )
5 years ago