liym27
803559499d
[Dy2Stat]Support LoDTensorArray for slice op ( #23091 )
...
* Support LoDTensorArray for slice op.
* Support read elements of list in dygraph_to_static
* Fix infershape and add test for infershape.
* Support Tensor for Attr(starts) and Attr(ends).
* Use new interfaces in VarTypeInference.
5 years ago
Feiyu Chan
78cd3dd507
fix kron_op: when only one input needs gradient, test=develop ( #24269 )
...
fix kron_op: when only one input needs gradient
5 years ago
Li Fuchen
5dc069d050
OP(warpctc, add_position_encoding, scaled_dot_product_attention) error message enhancement ( #24261 )
...
* enhance add_position_encoding error message, test=develop
* enhance warpctc & scaled_dot_product_attention error message, test=develop
* modified error message and ctest of scaled_dot_product_attention, test=develop
5 years ago
liym27
19511dfa25
Fix bug in assign op: support to infer shape for LOD_TENSOR_ARRAY. ( #24268 )
5 years ago
joanna.wozna.intel
b43b46e619
[INT8] Add requant-op squash ( #24143 )
5 years ago
石晓伟
17ac6e2580
update the analysis predictor for multi-stream support, test=develop ( #24046 )
...
* update the analysis predictor, test=develop
* update the unit test, test=develop
* no priority set before the interface is determined, test=develop
* interface name generalization, test=develop
5 years ago
hutuxian
3e2bc8715f
Try to fix UT Random Fail ( #24223 )
5 years ago
lidanqing
61ec30f030
Update QAT INT8 2.0 doc ( #24127 )
...
* update local data preprocess doc
* update for 2.0 QAT
test=develop
test=document_fix
* update benchmark data
test=develop
test=document_fix
Co-authored-by: Wojciech Uss <wojciech.uss@intel.com>
5 years ago
Sylwester Fraczek
e1a7a88057
added reshape transpose matmul fuse pass ( #23754 )
5 years ago
Tao Luo
d7850dcdc0
add noavx_axpy and noavx_axpy_noadd ( #24207 )
...
* remove double registration for pyramid_hash op
* add noavx_axpy and noavx_axpy_noadd
test=develop
5 years ago
Huihuang Zheng
3047b69f94
Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure ( #24205 )
...
test=develop
5 years ago
Chen Weihang
9b851ba216
[dy2static] Add print transformer and unify print format ( #24068 )
...
* add print transformer & unify print format, test=develop
* remove using of dygraph_to_static_func, test=develop
* remove python stdout capture, test=develop
* fix compatibility problems for PY2, test=develop
* fix detail error, test=develop
* fix type analysis bug, test=develop
* fix print tuple compatible error in PY2, test=develop
* replace get_func to declarative, test=develop
* fix detail bug, test=develop
* fix some detail problems, test=develop
* change visit_call in print transformer, test=develop
5 years ago
Li Fuchen
077e5a0fe5
Add trace op ( #23873 )
...
* add trace op, test=develop
* Optimized the implementation of trace op, test=develop
* fix a bug of include in trace_op.h, test=develop
* move trace API from creation to math, test=develop
* modified en doc. test=develop
* add complex trace api
* add complex sum api, test=develop
* modified en doc of complex sum and trace, test=develop
* modified doc and trace API, test=develop
* modified en doc of trace and sum, test=develop
* modified comment in complex kron API, test=develop
* OP Should Not Have Unused Input, test=develop
* add GetExpectedKernelType, test=develop
5 years ago
wangchaochaohu
fa43d74a3a
fix the intermediate node of graph for fusion group test=develop ( #24184 )
5 years ago
Yiqun Liu
ecfddebbef
Add the implementation of inverse ( #23310 )
5 years ago
HappyAngel
8ea325aa53
c++ API ( average_accumulates, tensor_array_to_tensor and average_accumulates) error message enhance. test=develop ( #23631 )
5 years ago
ShenLiang
0fb9b208ab
Add batch_fc op in contrib ( #24017 )
...
* add batch fc op, test=develop
* add batch_fc_op, test=develop
* fix untest, test=develop
* rm check_dygraph, test=develop
* fix comment, test=develop
* fix comment, test=develop
5 years ago
liuwei1031
9a93f6aae0
improve efficiency of runtime InferVarType ( #22778 )
...
* save InferVarType changes, test=develop
* remove code comments, test=develop
* tweak code, test=develop
* fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop
* modify fused_bn_activation_op, test=develop
* fix error of fused_bn_activation_op, test=develop
* fix PADDLE_ENFORCE and unittest coverage issue, test=develop
* tweak PADDLE_ENFORCE messages, test=develop
* improve unittest coverage, test=develop
* add StaticGraphInferVarType class, test=develop
* rebase develop branch, test=develop
* fix unittest error, test=develop
* remove comments, test=develop
* improve unittest coverage, test=develop
* improve error message and improve unittest coverage, test=develop
* upgrade InferVarType API, test=develop
* tweak pyfunc error message, test=develop
* fix compilation conflict - save_combine_op, test=develop
5 years ago
hong
bfb60efb1f
change compare force_cpu default value; test=develop ( #23888 )
5 years ago
wangchaochaohu
2270864019
Fusion group optimize for cuda codegen ( #23940 )
5 years ago
ShenLiang
94dfb7d770
opt the postprocess, test=develop ( #24155 )
5 years ago
Chengmo
28a558e867
Update index sample ( #24109 )
...
* update index sample
5 years ago
Zhang Ting
ab8f8fa70d
fix example code, test=develop, test=document_fix ( #24139 )
5 years ago
Feiyu Chan
e01262e691
add kron op ( #24105 )
...
* add kron op and its python API, doc and unittests.
* add kron in paddle.complex
5 years ago
Jacek Czaja
eb411613e9
[DNNL] refine activations Inplace support ( #24145 )
5 years ago
wangchaochaohu
6bf26ef156
fix warning mac compiler ( #24138 )
5 years ago
lijianshe02
a398464e77
API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, … ( #24118 )
...
* API/OP (affine_channel, group_norm, layer_norm, random_crop, unpool, log_loss) error message enhancement test=develop
5 years ago
Guo Sheng
a8c0fb4e86
Add cholesky_op ( #23543 )
...
* Add cholesky_op forward part. test=develop
* Complete cholesky_op forward part. test=develop
* Add cholesky_op backward part. test=develop
* Complete cholesky_op backward part. test=develop
* Refine cholesky_op error check and docs. test=develop
* Add grad_check unit test for cholesky_op. test=develop
* Fix sample code in cholesky doc. test=develop
* Refine some error messages of cholesky_op. test=develop
* Refine some error messages of cholesky_op. test=develop
* Remove unused input in cholesky_grad. test=develop
* Remove unused input in cholesky_grad. test=develop
* Fix stream for cusolverDnSetStream. test=develop
* Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
test=develop
* Add CUSOLVER ERROR in enforce.h
test=develop
* Fix the missing return value in cholesky. test=develop
5 years ago
Jacek Czaja
461e6a01ec
[DNNL] activations Inplace support ( #24123 )
5 years ago
wangchaochaohu
6ba7c3ac92
Reduce the construction time of function about profiler ( #24117 )
5 years ago
Zhang Ting
fb0729ee7f
avoid warnings in MAC compile ( #24124 )
5 years ago
qingqing01
b059fb955d
Add trainable_statistics in attr for batch_norm. ( #24072 )
...
* Add trainable_statistics in attr for batch_norm
* Unifying behavior of dynamic graph and static graph
5 years ago
arlesniak
d31a174f51
added fusing matmul-transpose-reshape pass ( #23866 )
5 years ago
石晓伟
46f3139c7f
supports loading model from memory, test=develop ( #24098 )
5 years ago
xiaoting
96ffebef55
fix bicubic, change int to floor ( #24063 )
...
* change int to floor, test=develop
* fix unittest, test=develop
5 years ago
Zeng Jinle
a67eea9f00
polish code by adding final, test=develop, test=develop ( #24114 )
5 years ago
Zeng Jinle
ab2e284235
fix compilation failure ( #24091 )
5 years ago
石晓伟
34d7d6aef0
declare the stream::Priority as enum class, test=develop ( #24013 )
5 years ago
Pei Yang
695a53c874
remove conv_bn_fuse_pass and fc_fuse_pass in trt int8 calibration. test=develop ( #23805 )
5 years ago
0YuanZhang0
a164b10d05
API/OP error message enhancement ( #23717 )
...
* test=develop
5 years ago
Zeng Jinle
acef55df04
fix isolated var fetch bug, test=develop ( #24070 )
5 years ago
Zeng Jinle
3ca700a987
code clean, test=develop ( #24060 )
5 years ago
Zhaolong Xing
35148d17f7
[BUG]: Head number can only be > 1 on multihead op ( #23974 )
...
* support the head number == 1
test=develop
* fix slice op error.
test=develop
5 years ago
Jacek Czaja
c6c65c65c7
[DNNL] Added elementwise_add mkl-dnn inplace ( #23477 )
5 years ago
hutuxian
9ff558a46f
Optimize DataFeed ( #23957 )
...
* Make batch_float_feasigns & batch_uint64_feasigns as member variable
5 years ago
Leo Chen
5cccc69f1a
update name generator, test=develop ( #24048 )
...
* update name generator, test=develop
* use c++ unique name generator, test=develop
5 years ago
石晓伟
db6d867383
add boost dependency to cuda_stream ( #24032 )
5 years ago
wawltor
5c669ad1c2
Add the support dygraph out attribute for the op of mm in api2.0 ( #23978 )
...
Fix the dygraph mode in matmul, add the support in Linear Op
5 years ago
石晓伟
d2584a7082
New feature: thread local allocator, test=develop ( #23989 )
...
* add the thread_local_allocator, test=develop
* refactor the thread_local_allocator, test=develop
* provides option setting strategy, test=develop
5 years ago
Kaipeng Deng
80cf3c3c4d
Refine DataLoader support multi-processing ( #23107 )
...
* add DataLoader, Dataset, BatchSampler
5 years ago
Zhou Wei
76d78c6387
fix conv_fusion_op conflict,test=develop ( #24020 )
5 years ago
wangguanzhong
931cba2e64
add clamp api, test=develop ( #23273 )
...
* add clamp api, test=develop
5 years ago
zhaoyuchen2018
a28a63a943
OP(fusion_gru) error message enhancement. test=develop ( #23591 )
...
* OP(fusion_gru) error message enhancement. test=develop
* refine code, test=develop
* Refine inout log, test=develop
* Refine description, test=develop
5 years ago
Zhou Wei
7817003795
Optimize the error messages of paddle CUDA API ( #23816 )
...
* Optimize the error messages of paddle CUDA API, test=develop
* fix the error messages of paddle CUDA API, test=develop
* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
* remove build_ex_string,test=develop
* merge conflict,test=develop
5 years ago
silingtong123
f6dbf8e3a4
add 'seed' argument of randint API ( #23809 )
...
* test=develop, add seed argument of randint API
5 years ago
Li Fuchen
87d8dc3dc0
add diag_embed op ( #23385 )
...
* add diag_embed op, test=develop
* add TestCase of diag_embed API
* Modified diag embed python API test case from dygraph to static graph, test=develop
* delete useless log and trigger ci, test=develop
* modified float16 of diag_embed, test=develop
* modified en doc of diag_embed
* trigger ci, test=develop
* add fp16 in dtype check of python API, test=develop
* modified __init__ and fix a bug, test=develop
* modified a test bug of test_bicubic_interp_op and test_trilinear_interp_op, test=develop
* modified to use one kernel on cpu and cuda, test=develop
5 years ago
liu zhengxi
8e555ba650
OP(pad, pad2d, pad_constant_like) error message enhancement ( #23882 )
...
* enhance pad.* error message, test=develop
5 years ago
ShenLiang
30bd7e1c83
Add rank_attention_op attributes for GPU memory in contrib ( #23915 )
...
* optimize rank_attention, test=develop
* use the paddle memory pool, test=develop
* set max size, test=develop
* limit the max size, test=develop
* fix the head of cu, test=develop
* add AsDispensable, test=develop
5 years ago
Yiqun Liu
8d0b0cb4ae
Op(conv2d_fusion) error message enhancement. ( #23596 )
5 years ago
ShenLiang
7f0b2c7407
fix memory leaking problem of dataset, test=develop ( #23955 )
5 years ago
mapingshuo
7d4002e06a
restrict block num of layer_norm_grad cuda block to 128 ( #23878 )
...
restrict block num of layer_norm_grad cuda kernel to 128, test=develop
5 years ago
Zhaolong Xing
133f1fc123
[Ernie TRT]: add slice op and add emb eltwise layernorm fp16 support ( #23723 )
...
* refine ernie trt dynamic shape support
1. add slice op converter
2. add emb eltwise layernorm fp16 support
test=develop
* fix dynamic shape test ut
test=develop
* fix comments.
test=develop
* fix comments
test=develop
5 years ago
guofei
2b896c1f6b
Support LoDTensorArray in fetch ( #23645 )
...
* Support LoDTensorArray in fetch op
test=develop
* Support LoDTensorArray in fetch
test=develop
5 years ago
Qinghe JING
ad7ac4c607
create bmm op and move several api from fluid.layers to tensor ( #23457 )
...
* add gradient check to reduce ops
* add skip gradient check to reduce ops test=develop
* modify stack api test=develop
* add bmm op and move several ops from fluid.layers to tensor test=develop
5 years ago
Yiqun Liu
071a702060
Fix the error misjudgment when there are control nodes in graph. ( #23943 )
5 years ago
GaoWei8
490db7f325
add paddle.tensor interface ( #23801 )
...
* add paddle.tensor
test=develop
* polish gpu where codes
test=develop
* polish test code
test=develop
5 years ago
lidanqing
2291634c5c
Add user local data preprocess support ( #23692 )
...
* add local data preprocess support for imagenet
test=develop
* add local data2bin tests
test=develop
* locally two tests passed
test=develop
* change according to reviews
test=develop
5 years ago
Liufang Sang
771c3b297f
update enhance error message for Initializer, smooth_l1 ( #23912 )
...
* update enhance error message test=develop
* update details test=develop
5 years ago
Zhang Ting
b89dd86fb6
Update eigen ( #23203 )
...
* update eigen, test=develop
* remove patches, test=develop
* add definition of -fabi-version, test=develop
* add patch for TensorBlock.h, test=develop
* test windows, test=develop
* only update eigen for Linux, test=develop
* add code comments, test=develop
5 years ago
GaoWei8
0be4b04d13
Api (lod_append) error message enhancement ( #23541 )
5 years ago
hutuxian
df64a96686
support set_test_mode and set comlog level ( #23905 )
5 years ago
石晓伟
2d01cc85c4
DeviceContext Split, test=develop ( #23737 )
...
* supports thread-binding stream, test=develop
* avoid using thread_local variables in dtor, test=develop
* modify the stream priority enum, test=develop
5 years ago
huangjun12
8af85922d0
Error message enhancement of 6 op ( #23759 )
...
* enhance error message test=develop
* fix CI test of 3 op test=develop
* fix bug caused by the diff of long type in py2 and py3, test=develop
5 years ago
Zhang Ting
318dfa0d4f
remove eval in eigen function when dtype is fp16 ( #23845 )
5 years ago
Zhong Hui
361c6ccc01
OP error message enhancement of l2_normalize, matmul, mean, etc
...
* fix error message of l2_normalize, matmul, mean, etc.
* add the test case for those ops
5 years ago
chenhaoze
b7d185d6ca
OP clip, merge_lod_tensor, convert/elementwise error message enhancement ( #23742 )
...
* OP clip, merge_lod_tensor, convert/elementwise error message enhancement. test=develop
5 years ago
gfwm0502
d8ca66dae2
Modify documents of executor and randn and fix other errors ( #23879 )
...
test=develop
5 years ago
gfwm0502
a756360263
OP/API (While/while_loop/DynamicRNN) : Error Message Enhancement ( #23896 )
...
As the title
5 years ago
Zhang Ting
b88662254b
use 32 bit index to improve expand op ( #23899 )
...
* use 32 bit index to improve expand op, test=develop
* remove redundant code, test=develop
5 years ago
lijianshe02
e21b3c273e
add nll_loss op test=develop ( #23758 )
...
* add nll_loss op test=develop
5 years ago
yiicy
f5f76e610d
fusion_seqconv_eltadd_relu error message enhancement. ( #23554 )
5 years ago
Double_V
b4daea13f4
API/OP (center_loss, fluid.one_hot, prroi_pool, roi_pool, ctc_greed_decoder) error message enhancement ( #23794 )
...
* error message enhanced, test=develop
* error message enhanced for APIs, test=develop
* error message enhanced for roi_pool, test=develop
* update added code, test=develop
* update fluid/input.py, test=develop
* update fluid/input.py ,test=develop
* fix code style, test=develop
* fix lower coverage, test=develop
* error message enhanced for roi pool, test=develop
5 years ago
zhupengyang
56c54cccbd
Op(prelu/relu/f.relu/f.log_softmax) error message enhancement ( #23792 )
5 years ago
guofei
c2a60bb1fa
Correct the wrong name in the flag comment ( #22977 )
...
Correct the name [`FLAGS_sync_nccl_allreduce`](https://www.paddlepaddle.org.cn/documentation/docs/zh/advanced_guide/flags/others_cn.html#flags-sync-nccl-allreduce) based on the information from our official website.
5 years ago
gfwm0502
af149f2595
OP(compare/get_places/shrink_rnn_memory) error message enhancement ( #23780 )
...
As the title.
5 years ago
zhaoyuchen2018
f0b08123b2
OP(fused_embedding_fc_lstm) error message enhancement. test=develop ( #23527 )
...
* API(fused_embedding_fc_lstm) error message enhancement. test=develop
C++ API enhancement.
* Refine code, test=develop
* Refine code. test=develop
5 years ago
zhaoyuchen2018
ef72de6fe0
API(Print) error message enhancement. test=develop ( #23521 )
...
Enhance print op c++ error message.
5 years ago
Zhong Hui
1066f59bf7
fix compile problem on windows and some invalid argument check
...
Fix the compile problem of the norm op in the Windows environment, and delete some invalid argument checks
5 years ago
Zeng Jinle
82f3348ad0
remove useless comment in layer.cc, test=develop, test=document_fix ( #23847 )
5 years ago
Bai Yifan
fc1b140aba
Refine error message, test=develop ( #23823 )
...
* refine error message, test=develop
5 years ago
yiicy
de3e299dbb
fusion_seqexpand_concat_fc error message enhancement, test=develop ( #23558 )
5 years ago
yiicy
a1e7387919
Variable error message enhancement, test=develop ( #23548 )
5 years ago
wangguanzhong
93aa66902d
enhance the error message, test=develop ( #23646 )
...
* enhance the error message of box_coder, test=develop
* enhance error message in box_decoder_and_assign
* enhance error message of collect_fpn_proposals
* enhance error message of distribute_fpn_proposals
* enhance error message of multiclass_nms, test=develop
5 years ago
xiaoting
7de0a25b5e
Add `nn.interpolate` ( #23434 )
...
* add nn.interpolate, test=develop
* fix interpolate typo, test=develop
* format code, test=develop
* fix unittest, test=develop
* add test layers, test=develop
* add test layers, test=develop
* extract common function, test=develop
* reduce the threads for cuda10, test=develop
* update unittest, test=develop
* polish unittest, test=develop
* add dygraph test, test=develop
* format description, test=develop
* add 5D input check, test=develop
* fix doc, test=develop
5 years ago
huzhiqiang
5fe3b63824
[error message enhancement] fused_elemwise_activation_op and fusion_conv_inception_op ( #23686 )
5 years ago
Liufang Sang
9549b78691
OP Normal, Uniform, Xavier Initializer, smooth_l1, mean_iou error message enhancement ( #23751 )
...
* enhance error message test=develop
* enhance error message test=develop
* change to INOUT_CHECK test=develop
5 years ago
yaoxuefeng
5b69242fab
modify datanorm op test=develop ( #23030 )
5 years ago
suytingwan
3e1676fa9a
Add meshgrid op ( #23736 )
...
* test=develop fix api doc error
5 years ago
Yang Zhang
648f1d5b3f
API/OP error message enhancement ( #23684 )
...
* `l1_norm` error message enhancement
test=develop
* `norm` error message enhancement
test=develop
* `squared_l2_norm` error message enhancement
test=develop
* `squared_l2_distance` error message enhancement
test=develop
* `conv_shift_op` error message enhancement
test=develop
* `sample_logits_op` error message enhancement
test=develop
* Fix format
test=develop
* use `PADDLE_ENFORCE_EQ` for '||' conditional
test=develop
* Fix string format
test=develop
* Add test for `sample_logits`
test=develop
* Add error message
test=develop
* Use ordered dict for outputs
test=develop
* Hard code fetch list so that order is deterministic
test=develop
* Fix license header year
test=develop
5 years ago
Yi Liu
14e7041c6d
Fix CUDAHandleHolder destruction problem. ( #23772 )
...
eagerly release cuda resources before the cuda environment is destroyed
test=develop
5 years ago