iducn
35ae10272e
add shell of CPU's version info ( #26937 )
5 years ago
Feiyu Chan
c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatibility ( #26985 )
5 years ago
wangchaochaohu
3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) ( #26963 )
5 years ago
Chen Weihang
209273e605
Support load state dict from `inference model` format save result ( #26718 )
...
* support load infer model format state dict
* add unittests
* remove keep name table
* resolve circular import
* fix compatible problem
* recover unittest
* polish doc and comment
5 years ago
joanna.wozna.intel
95e1434bb2
Add bfloat16 data type ( #25402 )
5 years ago
Yang Zhang
29b844ad5e
Fix clip op attr ( #26924 )
5 years ago
LoveAn
26c698e2c9
Fix catch exit code failed caused by ( #26934 )
5 years ago
Shang Zhizhou
61fc7a3e45
Pass version check ( #26887 )
5 years ago
Zhou Wei
f772540d80
add time when test failed ( #26935 )
...
show unittest time even if unittest failed
5 years ago
huangjun12
e480168fae
fix dropout bug in backward when input is 1d tensor ( #26837 )
...
* fix dropout bug in backward when input is 1d tensor, test=develop
* add test case and refine error message, test=develop
* refine error message, test=develop
5 years ago
YUNSHEN XIE
d8984a6b90
limit timeout value setting on linux ( #26923 )
5 years ago
Zhou Wei
1771d9f880
fix cache judge more safe ( #26910 )
5 years ago
joanna.wozna.intel
0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " ( #26882 )
...
* Add mkldnn bfloat16 option to C-API
* Add test for bfloat16 gpu
* Change coverage test
* Repair capi_gpu test
5 years ago
Jacek Czaja
5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines ( #26891 )
...
test=develop
5 years ago
wanghuancoder
2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor ( #26643 )
...
* optimized transformation from tensor to numpy, test=develop
* Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop
* modify CUDAPinnedPlace to CPUPlace, test=develop
* modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop
* revert fetch_op_handle, add fetch_async_op_handle, test=develop
* revert fetch_op_handle, add fetch_async_op_handle, test=develop
* fix error msg report, test=develop
* fix bug in cpuplace, test=develop
* fix bug in unmerge and tensorarray modle, test=develop
* fix bug, double copy gpu memory, test=develop
* fix chenweihang's review advice, test=develop
5 years ago
Thunderbrook
5205748481
fix eigen in push sparse; fix hadoop command ( #26872 )
...
* fix eigen in push sparse; fix hadoop command
test=develop
* add log in load_combine_op
test=develop
5 years ago
Zhaolong Xing
932bbe955b
fix pool trt plugin bug ( #26463 )
...
test=develop
5 years ago
wawltor
0a29fc85d6
fix the argmin,argmax op for the paddlepaddle 2.0
...
* fix the argmin,argmax op for the paddlepaddle 2.0, add checkPoint for the argmax/argmin
5 years ago
LoveAn
d067e66d39
Show more possible problems with build_and_check in file paddle_build.sh ( #26846 )
...
* Show more possible problems with build_and_check in file paddle_build.sh, test=develop
* Remove test codes modified in file device.py for build_and_check, test=document_fix
* Fix missing blank space in file device.py, test=document_fix
* Final process via summary_check_problems function, test=document_fix
5 years ago
Chengmo
d0962abd20
supplement bug fix of parameter server ( #26217 )
...
* fix fluid.embedding
5 years ago
zlsh80826
ad6e3dd69c
[Paddle-TRT] Stack op plugin ( #25605 )
...
* add stack_op to CMakeLists
* add dim=3 support for scale op
* add trt stack op, test=develop
* remove debug message
* add stack plugin serialize
* remove slice, scale op, will add later
* enhance error message
* revise trt ernie test to cover the stack op CI test, test=develop
* add stack op serialization
* fix test shape after adding stack op
* remove slice op, will add after implementing serialization
* roll back to min_graph=5 to avoid using slice op
* fix scale op output layer
* implement stack op createPlugin
* use workspace and move the definition to .cu
* move stack plugin creator definition to .cu, test=develop
5 years ago
Leo Chen
60ffc22026
Refine bernoulli and unsqueeze op ( #26842 )
...
* add check for bernoulli and register bool for unsqueeze
* follow comments
5 years ago
YUNSHEN XIE
1e50b2a635
fix retry error with blank ( #26835 )
5 years ago
石晓伟
ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API ( #26676 )" ( #26854 )
...
This reverts commit 02083bda40.
5 years ago
tangwei12
ebc5f99789
add embedding 2.0 ( #26649 )
...
* add embedding 2.0
* add embedding support input int32
5 years ago
Zhou Wei
d85410109d
Count the time and packet size for Windows monitor ( #26678 )
...
* turn on WITH_INFERENCE_API_TEST
* Count the time and packet size on windows
* fix conflict
* fix conflict
* fix conflict
* fix date-time function
5 years ago
hong19860320
40378edfa8
Add the AddCheckpoint macro to softplus op ( #26809 )
5 years ago
GaoWei8
11fb8a1c10
Refine cudnn softmax ( #25757 )
...
* refine cudnn softmax
5 years ago
arlesniak
885c61f086
Add use of global flag 'use_mkldnn' to layer_helper ( #26497 )
...
* get use of global 'use_mkldnn' in layer_helper
* update for CI
* update for CI, relu test
* update for CI, relu test added, make FLAGS_use_mkldnn a public flag
* added more strict tests, fixes after review
* fixes after review
* fixes after review, CI stuff
5 years ago
swtkiwi
f44420c874
test=develop ( #26710 )
5 years ago
Pei Yang
78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models ( #26536 )
...
* support trt dynamic shape int8
* add unittest
* add support for sigmoid; adapt to trt6+ api
5 years ago
wawltor
7ee70a47b8
update the doc for some ops
...
update the doc for some ops: ceil, asin, atan
5 years ago
yaoxuefeng
a47d92d868
fleet add save with whitelist test=develop ( #23376 )
5 years ago
zhupengyang
0f1ad9b06c
leaky_relu and hardshrink add checkpoint for behavior changed ( #26802 )
5 years ago
Chengmo
7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server ( #25998 )
...
* Support Heter Parameter Server
5 years ago
zlsh80826
ac63c7cdef
fix a skip_layernorm bug, test=develop ( #26800 )
5 years ago
Jiawei Wang
a1b99fae07
Adadelta Optimizer ( #26590 )
...
* add doc; notest
* fix doc; notest
* update doc; notest
* refine optimizer && adam
* refine optimizer; notest
* add adam
* fix doc
* fix doc && add adamw; notest
* add error message
* bug fix
* refine rmsprop && adamax
* fix ci
* bug fix
* update comment
* unify arguments place; notest
* fix ut, test=develop
* bug fix
* fix conflicts, test=develop
* add examples code
* bug fix
* fix comments
* fix sample code
* add sample code for Optimizer
* add adamax ut, test=develop
* fix rmsprop ut, test=develop
* add ut for optimizer.py and adamw.py
* first commit of adadelta optimizer
* fix learning rate
* fix adadelta doc and add sgd momentum
* remove unused fluid
* fix codestyle
* Update test_adam_op.py
* Update test_adam_op.py
* fix SGD in 2 unittests
* fix SGD in 2 unittests
* fix ci
* fix ut
Co-authored-by: MRXLT <xlt2024@gmail.com>
Co-authored-by: mapingshuo <mps2012@yeah.net>
5 years ago
LielinJiang
346689c6f1
Register conv_transpose Op version for compatible Op upgrades ( #26745 )
...
* fix bug
* add version check
* fix docs, test=document_fix
* fix formula, test=document_fix
5 years ago
Adam
8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it ( #26779 )
5 years ago
Wilber
68e0560c2f
refine paddle inference api ( #26774 )
...
* refine paddle inference api
Co-authored-by: nhzlx <nhzlx.dragon@gmail.com>
5 years ago
iducn
64df9b99a9
add shell of GPU version ( #26589 )
5 years ago
Wojciech Uss
7afb1df11e
Decouple weights and bias from fc primitive in MKLDNN cache ( #26708 )
...
* decouple weights and bias from fc primitive in cache
* removed redundant update of pointers
5 years ago
Zhen Wang
f32ae272ec
Remove `sorted_sum_gradient_` from BasicEngine and PartialGradTask. ( #26766 )
...
Use `Tensor` instead of `Variable` in the doc of paddle.grad.
5 years ago
Leo Chen
844583c8fd
Refine paddle.manual_seed ( #26496 )
...
* refine manual seed
* fix ci problem
* fix unittests
* fix unittest
* set is_init_py=false in manual_seed
* fix unittest
* fix bernoulli_op
* fix(unittest): change random_seed to manual_seed
* 🐞 fix(unittest): fix manual_seed
* trigger ci
* fix test_sentiment
* fix test_imperative_save_load
* fix test_uniform_random_op
* fix test_uniform_random_op
* fix test_jit_save_load
* merge develop
* fix manual_seed
* fix manual_seed
* use global engine
* use shared_ptr
* fix double free
* fix bug
* fix bug
* fix bug
* fix test bug
* fix test bug
* fix test bug
* fix ci
5 years ago
Zhou Wei
2d88b9ffe7
turn on WITH_INFERENCE_API_TEST ( #26746 )
5 years ago
Pei Yang
e3f8e5cf5c
trt int8 support conv2d_transpose ( #26636 )
5 years ago
ShenLiang
29494d703d
fix remainder, floor_div ( #26732 )
...
* fix remainder, floordiv
5 years ago
zhangchunle
623a4c2e56
fix ci coverage build error ( #26761 )
5 years ago
lilong12
5f524efe56
modify error report message, test=develop ( #26743 )
5 years ago
wangchaochaohu
4561fc37e2
Add check point for gather Op ( #26696 )
5 years ago
joanna.wozna.intel
eb097d64f6
Fix int8 performance drop cpu_quantize_placement_pass ( #26715 )
...
* Fix cpu quantize placement pass
* Include string lib
5 years ago
joanna.wozna.intel
02083bda40
Add mkldnn bfloat16 option to C-API ( #26676 )
...
* Add mkldnn bfloat16 option to C-API
* Add test for bfloat16 gpu
* Change coverage test
5 years ago
LutaoChu
1ec30cb160
register cumsum Op version for compatible Op upgrades ( #26734 )
...
register cumsum Op version for compatible Op upgrades
5 years ago
Jack Zhou
c282db3a93
add broadcast feature for elementwise logical op
...
add broadcast feature for elementwise logical op
5 years ago
Yang Zhang
63eef7632e
Fix clip input check ( #26683 )
...
* Fix clip input check
* Fix default min/max value
* Allow both max and min to be None
* Register op change
* Revert OP signature change
5 years ago
Zhen Wang
f9066e6a6f
Update the demo code and the doc of varbase.backward. ( #26506 )
...
* update the demo code and the doc of varbase.backward.
* update the doc of the fake interface `paddle.fluid.Variable`.
* remove BackwardStrategy.
5 years ago
Wilber
1c898b66d6
add bug fix enum. ( #26736 )
5 years ago
Zhou Wei
8071d23073
fix bug that can't print int8_t ( #26712 )
...
fix bug that can't print int8_t
5 years ago
joejiong
f311d3c1cf
Fix pow api type error with python side method, merge elementwise_pow and pow. ( #26163 )
...
As the title
5 years ago
yongqiangma
e4cc6a28b0
Norm op support 2-axis ( #26492 )
5 years ago
chalsliu
dc56c89822
Add the option to execute unit tests only at night ( #26669 )
...
* Add the option to execute unit tests only at night
* set ut nightly label for 3 cases.
5 years ago
xiaoting
89d7d86684
add intepolte_v2 ( #26520 )
...
* add intepolte_v2
* fix linear interp
* polish unittest, test=develop
* update code samples to 2.0 API, test=develop
* remove warning, test_develop
* add name in attrs, test=develop
* polish code, test=develop
* change Align to align, test=develop
* fix unittest in py3,test=develop
* fix coverage, test=develop
* fix coverage, test=develop
* fix for windows ci, test=develop
* fix coverage, test=develop
5 years ago
Adam Osewski
c2c689582e
Update Paddle-Lite commit hash. ( #26413 )
...
* Update Paddle-Lite commit hash.
* Add BF16 data type to VarTyp protobuf message.
5 years ago
Zhang Ting
97cebfa4d3
add dtype for unique ( #26655 )
...
* update doc, test=document_fix
* add attr(dtype)
* refine code
5 years ago
lilong12
1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis ( #26552 )
...
add collective op for cpu using gloo and paddle.distributed.* apis
5 years ago
joanna.wozna.intel
559e43eee4
Small change in conv2d and quantize pass ( #26671 )
5 years ago
Bai Yifan
8986a82131
fix adaptive gpu grad bug, add doc refine ( #26660 )
5 years ago
wawltor
286eca2d9e
update the code for the topk v2
...
add the top v2 for the paddlepaddle api 2.0
5 years ago
whs
f82384113b
Fix atomicAdd in grid sample op and affine grid op ( #26647 )
...
test=develop
5 years ago
Wilber
32ba8602c6
Enhance py_func error info message. ( #26557 )
5 years ago
chalsliu
cb3f131f1c
Set timeout property for a few unittests
5 years ago
石晓伟
32ceacf317
update op_version_registry, test=develop ( #26644 )
5 years ago
RandyLi
2f5bdd8dc7
Remove WOBOQ, gen_html() and sphinx ( #26128 )
5 years ago
Dong Daxiang
08d736ad78
【paddle.fleet】add cudnn related strategies to DistributedStrategy ( #26598 )
...
* add cudnn related strategies to DistributedStrategy
5 years ago
Zhang Ting
0a895bc0df
improve unique op ( #26537 )
...
* add unique_v2 op
* remove unique_v2 op
* update doc
5 years ago
whs
a004dfde3d
Use atomicAdd defined in paddle framework ( #26631 )
...
test=develop
5 years ago
LoveAn
02fc1fef8b
Fix the cmake-function named inference_download_and_uncompress on Windows ( #26512 )
...
* Fix the cmake-function named inference_download_and_uncompress with Windows, test=develop
* Fix some problems when remove limit of unittests on Windows, test=develop
* Using URL to download file instead of DOWNLOAD_COMMAND. test=develop
5 years ago
YUNSHEN XIE
a8b5741fb4
add a few unittests for setting timeout property ( #26630 )
5 years ago
zhangchunle
ef317b4b14
add mac tests failed exitcode ( #26611 )
5 years ago
wanghuancoder
c1f5df5269
optimized transformation from tensor to numpy ( #26447 )
...
* optimized transformation from tensor to numpy, test=develop
* optimized transformation from tensor to numpy, pass pre-commit, test=develop
* modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop
* modify py:array construct, test=develop
* fix _fetch_var to use deep copy, test=develop
5 years ago
zhupengyang
c80fcf901e
reduce_mean error if keepdim=True and reduce_all=True ( #26614 )
5 years ago
whs
a065a24232
【2.0 API】Enhance affine grid operator ( #26385 )
...
* Enhance affine grid operator:
1. Add cuda kernel
2. Add align corners options
test=develop
* Move new affine_grid api to functional
test=develop
* Add CUDA kernel for affine_grid.
test=develop
* Add more unittests for grid sample API
test=develop
5 years ago
Qi Li
6f69fbc8ea
fix elu grad when alpha is less than zero, test=develop ( #26543 )
5 years ago
whs
786373ba29
Use atomicAdd defined in paddle framework ( #26628 )
...
test=develop
5 years ago
ruri
1f82c0cd62
[Api2.0] add pixel shuffle ( #26071 )
5 years ago
Zhou Wei
1ed74aae7c
fix msbuild log level ( #26607 )
5 years ago
wanghuancoder
422a162019
api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear ( #26399 )
...
* api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear, test=develop
* api2.0 fix code examples, test=develop
* modify test_bilinear_api, about place,to_tensor , test=develop
* re pass pre-commit, test=develop
* Update common.py
* fix BilinearTensorProduct ci error, test=develop
5 years ago
wanghuancoder
6e823cfec3
add op_function_generator.exe retry in windows, test=develop ( #26591 )
...
add op_function_generator.exe retry in windows
5 years ago
石晓伟
fa08a834be
update op_version_registry, test=develop ( #26592 )
5 years ago
whs
79539cf198
【2.0 API】Add CUDA kernel and enhance options for grid_sample ( #26576 )
...
This PR enhances the CPU kernel and adds a new CUDA kernel so that grid_sample supports:
- align_corners: a bool.
- padding mode: one of ['zeros', 'reflect', 'border'].
- interpolation mode: one of ['bilinear', 'nearest'].
The old CPU and CUDNN versions only support align_corners=true, padding_mode='zeros' and interpolation_mode='bilinear'.
The behavior of the new version op in default mode is compatible with the old version.
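For readers unfamiliar with the align_corners option, the sketch below illustrates the coordinate convention commonly used by grid sampling operators (for example PyTorch's grid_sample). It is an assumption that this PR follows the same convention; the function names unnormalize and clamp_border are hypothetical and are not taken from the Paddle source.

```python
def unnormalize(coord, size, align_corners):
    """Map a normalized grid coordinate in [-1, 1] to a fractional pixel index.

    Assumed convention (not confirmed against the Paddle kernels):
    - align_corners=True:  -1 and 1 point at the centers of the first and last pixels.
    - align_corners=False: -1 and 1 point at the outer edges of the first and last pixels.
    """
    if align_corners:
        return (coord + 1.0) / 2.0 * (size - 1)
    return ((coord + 1.0) * size - 1.0) / 2.0


def clamp_border(index, size):
    """Sketch of 'border' padding: clamp an index into the valid range [0, size - 1]."""
    return min(max(index, 0.0), float(size - 1))


if __name__ == "__main__":
    for align in (True, False):
        mapped = [unnormalize(c, 5, align) for c in (-1.0, 0.0, 1.0)]
        print(f"align_corners={align}: {mapped}")
```

With align_corners=False the extreme coordinates fall half a pixel outside the image, which is where the padding mode decides what value is sampled.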
5 years ago
Guanghua Yu
8645591d66
support fp64 in huber_loss cuda kernel ( #26583 )
5 years ago
yaoxuefeng
efee426742
support generator seed in related kernels test=develop ( #26495 )
5 years ago
Zhong Hui
bf4a4636f1
change to use bce_loss op, add shape check for bce_loss
...
change to use bce_loss op, add numel check for bce_loss.
5 years ago
ShenLiang
0e81626081
add div, floor_div, remainder ( #26562 )
...
* add div, floor_div, remainder
5 years ago
石晓伟
656e60b18f
new class: op_version_registry, test=develop ( #26542 )
5 years ago
qingqing01
24566e951c
Support empty bbox in bipartite match op ( #26488 )
5 years ago
Jack Zhou
199b0c7c1b
Add isfinite v2 op ( #26344 )
...
add the isnan, isfinite, isinf api for the paddle 2.0
5 years ago
Zhou Wei
28554c3f85
add --user for pip ( #26440 )
5 years ago
wangchaochaohu
ebf9b2125e
add paddle.gather for API2.0 ( #26455 )
5 years ago
wangchaochaohu
9219b79104
gather_nd Op for API 2.0 refine ( #26540 )
5 years ago
zhupengyang
9b14117cac
logsumexp: impl kernel, refine docs ( #26307 )
5 years ago
Wojciech Uss
5c2b9258a6
Fix (de/re)quantize cache keys ( #26549 )
5 years ago
YUNSHEN XIE
df7fe1fe23
fix unittests run with error of Expression too big ( #26573 )
5 years ago
wawltor
6b28456ed0
add the argmax, argmin for the api2.0
...
* add the new api and op for the argmax, argmin
5 years ago
LielinJiang
d26ae9ad87
Update conv_transpose api ( #26427 )
...
* update conv_transpose api
5 years ago
lilong12
faa9b97b78
fix cscatter, test=develop ( #26554 )
5 years ago
WangXi
45711dade7
【API】rename div to divide, add floor_divide, remainder ( #26434 )
5 years ago
LutaoChu
4e0c6d91aa
add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel
...
add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel.
5 years ago
zhupengyang
f8863e0603
leaky_relu and LeakyReLU: alpha->negative_slope ( #26216 )
5 years ago
ShenLiang
c609066074
Add Matmul op ( #26411 )
...
* add matmul_v2
5 years ago
Leo Chen
aa2a9b5d89
add bernoulli op ( #26511 )
...
* add bernoulli op
* fix cuda kernel and add unit test
* refine doc
* fix uniform
5 years ago
Adam
f3909020de
Add mechanism for blocking oneDNN cache clearing ( #26502 )
...
* Add mechanism for blocking oneDNN cache clearing
* Review changes and Add thread guards
5 years ago
ShenLiang
b6eb37f5b3
add error message for cholesky ( #26444 )
...
* add error message
5 years ago
QingshuChen
138ecf24aa
support Baidu Kunlun AI Accelerator ( #25959 )
...
* support Baidu AI Accelerator
* test=kunlun
* minor
* test=kunlun
* support xpu op in separate file
* test=kunlun
* update XPU error message and remove duplicated code
* test=kunlun
* minor
* test=kunlun
* minor
* test=kunlun
5 years ago
yaoxuefeng
4f259354d2
mod cvm test=develop ( #25146 )
...
* mod cvm test=develop
* mod code format test=develop
5 years ago
wangchaochaohu
e167e87974
【API2.0】add masked_select Op for API2.0 ( #26374 )
5 years ago
Pei Yang
379222c3f1
add output scale and trt op teller support for hard_swish and hard_sigmoid ( #26499 )
5 years ago
zhupengyang
6e5670b8bd
mean: not support int32, int64; add check for axis ( #26401 )
5 years ago
zhupengyang
4ad504e7c7
hardshrink: support threshold < 0 ( #26403 )
5 years ago
lilong12
e92f770c42
Add collective ops (reduce) ( #26340 )
5 years ago
wangchaochaohu
bdb805505e
【API2.0】add numel API for paddle test=develop ( #26311 )
5 years ago
wangchaochaohu
2073ffc04d
Enhance the data type of linspace API ( #26247 )
5 years ago
hong19860320
40d193ed17
Add the ReLU6, Tanhshrink, SELU, Softplus, Softshrink and Softsign for the api 2.0 ( #26376 )
5 years ago
Chen Weihang
9108282883
Polish framework error message part 5 ( #26204 )
...
* polish framework error msg part 5
* revert enforce change
* refine error type
* trigger ci check
* polish details by review comment
5 years ago
Zhaolong Xing
f00f982a02
add cub impl for arg max, min ( #25941 )
...
test=develop
5 years ago
Zhang Ting
6914a12f82
rename the inputs of allclose ( #26360 )
...
* rename input
* add unittest, test=develop
* use paddle.data instead of fluid.data, test=develop
5 years ago
YUNSHEN XIE
e3612de8d7
add failed unittests retry ( #26342 )
5 years ago
littletomatodonkey
bcf03273f6
add pad func ( #26106 )
...
* add pad func
* add pad
* test=develop, add pad op and apis
* restore pad2d
* test=develop, fix paddl declare
* fix pad interface
* test=develop, fix pad
* test=develop, add all pad api and cos_sim
* test=develop, remove padding default value
* test=develop, rename var to tensor
* test=develop, add more tests
* test=develop, rename tovar to totensor
* test=develop, fix init
* test=develop, add more test
* test=develop, add more tests
5 years ago
Chengmo
eeeef957c7
Fix ps gpu ( #26218 )
...
* support ps-gpu
5 years ago
Zhong Hui
6cbeafb6c0
add zero norm, inf norm support for p_norm op ( #26364 )
...
* add zero norm, inf norm support for p_norm op
* fix the invalid argument check, fix the dtype problem in test case.
5 years ago
tianshuo78520a
029390b1d2
fix ci bug ( #26276 )
5 years ago
Tao Luo
1b03ab3899
set opencv-python <=4.2.0.32 ( #26415 )
5 years ago
Zhaolong Xing
b7a86e92a8
fix dy shape bug in trt7.1 ( #26273 )
...
test=develop
5 years ago
ceci3
56890dc729
Add SyncBatchNorm ( #26032 )
...
* add SyncBatchNorm,test=develop
5 years ago
GaoWei8
1fbee267d4
remove scope in cudnn lstm ( #25188 )
5 years ago
Zhou Wei
da29760d58
add msvc log from quiet to minimal ( #26383 )
5 years ago
Pei Yang
b757466b0d
fix trt dynamic ernie serialization unit test ( #26228 )
5 years ago
Wilber
3ec0bcbbb8
[Bug] Fix prune for save_inference_model about transformer ( #25347 )
5 years ago
cc
3f816bc8b4
[Quantization] Conv2d_transpose and mul support channelwise quantization ( #25639 )
...
* Conv2d_transpose and mul support channelwise quantization, test=develop
* Skip collecting out threshold for output tensor of which the type is not fp32 or fp64, test=develop
* Fix error in test_user_defined_quantization, test=develop
* Add depthwise_conv_bn_fuse, test=develop
* Add conv_transpose_bn_fuse_pass for post_training_quant, test=develop
5 years ago
lilong12
638bbb6153
Improve expand as ( #26290 )
...
align expand_as op to expand.
5 years ago
Thunderbrook
a83e0f264c
fix heter proto ( #26093 )
...
test=develop
5 years ago
Leo Chen
049ac56c08
Print user-friendly error message in core.ops [part 2] ( #26377 )
5 years ago
zhupengyang
586a6dd358
log_softmax and LogSoftmax: impl kernel and refine docs ( #26088 )
5 years ago
yaoxuefeng
23261ff44b
add cpu random Generator ( #26013 )
5 years ago
Sylwester Fraczek
69742bd9a4
Enable mkldnn layout conversion ( #25778 )
...
* enable mkldnn layout conversion
* review fix: remove tmp_place
* fix test mkldnn swish
* add UT for PrepareData CPU->MKLDNN
* add #ifdef PADDLE_WITH_MKLDNN
* Force-push commit
Co-authored-by: grygielski <adam.grygielski@gmail.com>
5 years ago
Leo Chen
672578a797
Print user-friendly error message in core.ops ( #26261 )
...
* print user-friendly error message
* adjust error summary
5 years ago
Zhou Wei
5017aa76e6
set default python3,fix incompatible,cache dir for third party,unify error code,for windows ( #26178 )
...
* set default python3 for paddle windows,test=win
* set default python3,cache dir for third party,error code,test=win
* fix some incompatible
* fix some error
* set virtual environment,test=win
5 years ago
Jack Zhou
6d22f5c73e
Add PADDLE_ENFORCE in nll loss cuda kernel ( #26294 )
...
* add nll loss API, update demo code of the comment
5 years ago
wangchaochaohu
0b81d76310
[API2.0] add op for cudnn version query test=develop ( #26180 )
5 years ago
lilong12
241b44db14
[API 2.0] adaptive expand op to use shape instead of expand_times ( #26206 )
...
* adaptive expand op to 2.0 (align to torch.expand) , test=develop
5 years ago
wangchaochaohu
bb11cbc250
[API2.0] add Device api (set_device and get_device)( #26103 )
5 years ago
Zhou Wei
6de463d3d1
expose and unify the Tensor concepts to the user ( #25978 )
...
* expose and unify the Tensor concepts to the user
* expose tensor to user
* add copy place for Tensor
* add copy place for Tensor
* add note
* add macro PADDLE_WITH_CUDA
* remove RUN_TYPE=DIST
* fix some error
5 years ago
lilong12
fbd4d3cc97
[API 2.0] add paddle.tile op ( #26245 )
...
* add tile_op, test=develop
5 years ago
Zhou Wei
20147ace3f
fix_copy_if_different ( #25868 )
5 years ago
Wilber
c84aa9c61f
update diff val. ( #26242 )
5 years ago
Yang Zhang
a2d3e5c03b
Fix `paddle.abs` docstring ( #25942 )
...
test=document_fix
remove activation wording
5 years ago
Yang Zhang
22165934bc
Fix `paddle.acos` docstring ( #25958 )
...
test=develop,test=document_fix
remove activation wording
5 years ago
Yang Zhang
a5b5b00e02
Fix `paddle.asin` docstring ( #25967 )
...
test=develop,test=document_fix
remove activation wording
5 years ago
Yang Zhang
c758765769
Fix `paddle.atan` docstring ( #25968 )
...
test=develop,test=document_fix
remove activation wording
tanh -> tan
5 years ago
Yang Zhang
c4e480efc5
Fix `paddle.cos` docstring ( #25969 )
...
test=develop,test=document_fix
explain input/output range and out-of-bounds behavior
5 years ago
liuyuhui
935da32d25
【paddle.fleet】upgrade fleet: modify role_maker ( #26038 )
...
* add unittest for paddlerolemaker with gloo
5 years ago
wawltor
2d6cc0b125
support the tuple for attribute of axis in min, max for api2.0
...
Update the code for the min,max, test=develop
5 years ago
Dong Daxiang
50a5bcfc9d
【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. ( #26186 )
...
* move paddle.fleet to paddle.distributed.fleet
5 years ago
Leo Chen
ffe52b4452
[OpDevOptimize] Add common infershape functions ( #26096 )
...
* add unchanged infershape function
* add broadcast infershape function
* fix bug
* rename infershape functions
* add UnaryOpUnchangedInferShapeCheckAxis
* add error message
* add test for common infer shape functions
* dont update existed ops
* dont update op_desc.h
* add more test
* add error check, refine error message
5 years ago
Leo Chen
2d95280e1f
Feature/Enable Auto-Mixed-Precision in dynamic graph ( #24903 )
...
* add auto_cast, test=develop
* add loss scaler, test=develop
* add comments, test=develop
* refine code, test=develop
* refine code, test=develop
* do not set flags automatically, test=develop
* fix custom op bug, test=develop
* add more test, test=develop
* refine enable logic, test=develop
* enable amp test with GPU, test=develop
* add unittest
* add test for found_inf
* follow comments
* follow comments
* remove global variable, use singleton
* add some notes
* update comments
* update comments
* update comments
* add use_dynamic_loss_scaling argument
* refine found_inf
* refine found_inf
5 years ago
Chen Weihang
838e36e9ed
Fix loaded variable suffix repeat error ( #26169 )
...
* fix loaded var suffix repeat error
* use new dygraph name for loaded param
5 years ago
Jack Zhou
dea41da715
add nll loss API for the paddlepaddle api2.0
...
* add nll loss API, update demo code of the comment
5 years ago
Wilber
fb72b192e7
[DOC] Fix dead link ( #26154 )
5 years ago
wawltor
9c17b3c9f8
Add the max, min, maximum, minimum api for the API 2.0
...
* Add the max, min, maximum, minimum api for the API 2.0, test=develop
5 years ago
JZ-LIANG
54003b873e
【paddle.fleet】add lamb to fleet meta optimizer ( #26025 )
...
add lamb to fleet meta optimizer
5 years ago
Yiqun Liu
1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. ( #26121 )
5 years ago
lidanqing
65b97d6215
GRU model xnli dataset C++ tester ( #25534 )
...
* Add lexical GRU unit test
performance works
* Get model accuracy
* model and data name to be confirmed
test=develop
* update model name and output format
test=develop
* update according to reviews
test=develop
* add accuracy check
* accuracy check between native and analysis
test=develop
* fix a reading bug, fix gru passes sequence
test=develop
* fix passes sequence
test=develop
5 years ago
Zhen Wang
a86e8c0eef
add more error info for these ops without double grad ops. ( #25987 )
5 years ago
tianshuo78520a
75a1311400
Fix inference CI bug ( #26080 )
...
* Fix inference bug
* fix inference lib
5 years ago
MRXLT
6559229b7e
fix encryption infer ( #25979 )
...
* add encrypt for inference lib
* fix code;test=develop
* fix test; test=develop
* bug fix; test=develop
* add MakeCipher;test=develop
* fix bug;test=develop
* move MakeCipher to paddle space; test=develop
* fix include dir ;test=develop
* add include dir; test=develop
* move include; test=develop
* move include; test=develop
* fix for windows ci
* fix cmake; test=develop
* fix bug
bug fix
5 years ago
lilong12
8caee2ad51
【paddle.fleet】add the support for multi-node training for pipeline ( #25907 )
...
* add the support for multi-node training
5 years ago
LutaoChu
bf2db646de
fix cumsum op for API 2.0, optimize performance
...
update cumsum api and fix up the cumsum op
5 years ago
Adam
1893cd6bb8
Add oneDNN relu6 op ( #26037 )
...
* Add oneDNN relu6 op
* Lint fixes
5 years ago
Zhaolong Xing
50f149a48e
fix cudnn workspace size problem during inference. ( #26021 )
...
test=develop
5 years ago
Zhou Wei
1f74b94d3f
fix compile warning on windows MSVC, fix paddle_build.bat more safe ( #25933 )
...
* Fixed compile warning about incorrect compile options,fix paddle_build.bat
* fix paddle_build.bat to more safe
5 years ago
tangwei12
c14ec8782b
【paddle.fleet】Feature/fleet ps api 2.0 ( #25857 )
...
* add paddle.fleet.AsyncOptimizer
Co-authored-by: dongdaxiang <dongdaxiang@baidu.com>
5 years ago
Chen Weihang
3c8daa9b89
Add pin memory control for BufferedReader ( #26026 )
...
* add pin memory control
* fix buffered reader init problem
* fix unittest error
* add unittest for coverage
5 years ago
Chen Weihang
ad4a0466a5
Add cuda pinned place branch in slice op GetExpectedKernelType ( #26027 )
...
* add cuda pinned place branch
* add unittest
* add skip when not gpu
5 years ago
zhangchunle
86794cccbd
separate approve ( #26035 )
5 years ago
Feiyu Chan
e853ece0a2
update document template for unary elementwise layers ( #25896 )
...
1. update document template for unary elementwise layers(a.k.a. activation layer);
2. remove generate_op_noattr and use generate_activation instead; remove redundant function copies;
3. minor update for docstring to fix rst format errors.
4. fix doc for Rsqrt OP
5. add sample code for each activation separately;
6. remove the unused deprecated decorator.
5 years ago
joanna.wozna.intel
734cf1c3e9
Change use_quantizer attribute name and data type ( #25838 )
...
* Change use_quantizer attribute name and data type
* Fix problem with setting attribute
* Add changes due to review
* Small change in function
* Restore use_quantizer attr for compatibility
5 years ago
Leo Chen
5258d53d65
refine unsqueeze, test=develop ( #25470 )
...
* refine unsqueeze, test=develop
* update unsqueeze, test=develop
* refine unsqueeze, test=develop
* refine unsqueeze, test=develop
* update
* remove None, test=develop
* follow comments
* support bool
* update doc
* follow comments
* merge develop
5 years ago
tangwei12
3755564ae1
Fix/large scale fix ( #25999 )
...
* fix large scale KV
* fix single training using async ssa graph
5 years ago
Leo Chen
751305ecf0
Add flags to control call stack of error message ( #25997 )
...
* add flags_call_stack_level
* update
* refine code
5 years ago
Thunderbrook
fd2947babf
fix compile error with mkl ( #26030 )
...
test=develop
5 years ago
Leo Chen
0a47387bd8
Use static local variable instead of global variable for safety ( #26018 )
...
* remove global variable
* refine code
5 years ago
Pei Yang
beb0ca5fab
Fix TRT plugin registry without TRT lib ( #25982 )
...
* fix trt plugin registry without trt lib
* support trt4
* refine code style
5 years ago
123malin
2191a08317
【paddle.fleet】fleet_util move to paddle.fleet ( #25805 )
...
* test=develop,test=document_fix, remove the out args
* fleet_util move to paddle.fleet
Co-authored-by: WuHaobo <wuhaobo1994@gmail.com>
Co-authored-by: tangwei12 <tangwei12@baidu.com>
5 years ago
yaoxuefeng
224620071b
add new flatten op test=develop ( #25393 )
5 years ago
Adam
68c6160e63
Add oneDNN fusion_gru kernel ( #25594 )
...
* Add oneDNN fusion_gru kernel and fix fc+gru pass
test=develop
* Formatting changes
test=develop
* Lint fixes
test=develop
* Add memory::format_tag::any to GRU weights
test=develop
* Fix build with CUDA
* Fix build with CUDA v2
5 years ago
Thunderbrook
0cb60c700d
add heter ps mode ( #25682 )
...
* add heter ps mode
* code style
test=develop
* add with_pslib
test=develop
* unittest
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* test monitor
test=develop
* prepare trainer
test=develop
* code style
test=develop
5 years ago
Zhong Hui
dca56f47f5
fix invalid read of pnorm gradient function
...
fix invalid read of pnorm gradient function and delete the unused code
5 years ago
WangXi
2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer ( #25738 )
...
Add dgc to fleet meta optimizer, rm dgc from optimizer all
5 years ago
Zhaolong Xing
358bc06c72
[CUDNN8 support] : support CUDNN8 ( #25664 )
...
* cudnn8 support
test=develop
* fix ci error
test=develop
5 years ago
Zhaolong Xing
5970871a64
add eltwise clip cuda impl. ( #25689 )
...
test=develop
5 years ago