Chen Weihang
813b2ade34
Enrich the python error types of paddle & polish format ( #28124 )
...
* add multiple exception type
* define all exception & polish compile pystack
* mapping paddle error to python exception
* polish static mode error format
* fix failed unittests
* fix dytostatic test_error
* fix check_nan_inf failed
* add unittest for coverage
* revert some code try to solve compile error
* refactor enforce & error change
* polish code & add unittest
4 years ago
Adam Osewski
7db747d9e8
oneDNN BatchNorm + Act fusion pass. ( #27912 )
4 years ago
mapingshuo
81244fbfab
add sharding strategy in fleet( #27900 )
...
* add sharding
4 years ago
Chen Weihang
2babd6ff67
Add compile limit for PADDLE_ENFORCE without error message ( #28221 )
...
* add compile limit for paddle enforce
* polish elementwise_op_function.cu.h
* fix failed unittest
* fix windows compile failed
* detail polish
* revert no type constructor
4 years ago
Leo Chen
1f3be85914
Fix bug of fetch_async_op_handle when fetching the feed variable ( #28194 )
...
* fix bug of fetch_async_op_handle
* revert some changes of test_buffer_shared_memory_reuse_pass
* revert some changes of test_buffer_shared_memory_reuse_pass
4 years ago
lidanqing
7cb4a8b8f2
[oneDNN] Conv dilation support ( #27914 )
...
* conv dilated mkldnn support: forward and backward pass
* add mkldnn conv_transpose dilation UT
test=develop
* remove unnecessary PADDLE_ENFORCE
* add int8 and bf16 dilated conv UT
* update according to reviews
4 years ago
Zhou Wei
2ac6c6c3af
fix bug of tensor copy of CUDAPinnedPlace ( #27966 )
4 years ago
guofei
6bbb6e7f45
Implement the function of OutScaleForTraining/OutScaleForInference in dygraph ( #26601 )
...
* Implement the function of OutScaleForTraining/OutScaleForInference in dygraph
test=develop
4 years ago
Thunderbrook
3ee6ad6ec5
solve bug in pull_dense_worker ( #27918 )
...
* op error info
* style
* code format
* create pin var bug
4 years ago
zhang wenhui
5a83496c8d
Multi task ( #26002 )
...
* add multitask
* add multitask, test=develop
* fix code style, test=develop
* add partial push dense, test=develop
* fix has_key in py3, test=develop
* fix, test=develop
* fix, test=develop
* fix, test=develop
4 years ago
wanghuancoder
41aad9bfcd
revert 4 files, from clear include by iwyu, test=develop ( #27895 )
4 years ago
Leo Chen
049696bf67
Refine the format of printing tensor ( #27673 )
...
* add summary feature
* refine printing tensor
* add sci_mode
* add sample code
* fix indent error
* fix _format_item
* polish code
* support item indent
* add ut
* set place for ut
* fix py2 issue
* fix ut
4 years ago
Chengmo
c5f2802d56
【paddle.fleet】Update fleetrun & ps-heter ( #27472 )
...
* refine fleetrun.ps_launch
* update fleet run for multi device support
* ps_graph support ps-gpu
* fix heter save
* add heter save unittest
* fix unittest & simple code
* update fleetrun
* fix fleetrun
* fix launch barrier
* fix role maker
* add paddlecloud rolemaker unittest
* rename heter_worker_device_guard
4 years ago
石晓伟
0d27591642
save operator version information to program desc, test=develop ( #27668 )
4 years ago
Jacek Czaja
631c1f3018
- Fix to 27398 ( #27770 )
...
test=develop
- compilation fix
test=develop
4 years ago
Jacek Czaja
606611d351
[oneDNN] GRU BF16 kernel ( #27731 )
4 years ago
Jacek Czaja
b9fda2ff09
Fix to issue #25537 ( #27546 )
...
* - candidate fix to issue #25537
test=develop
* - UT for transpose NHWC
test=develop
4 years ago
Wojciech Uss
966447e338
Added support for quantization of fusion_gru ( #27518 )
4 years ago
Pei Yang
8a4f85feb9
Add unittests and OP version registry for quant_conv2d_dequant_fuse_pass ( #27689 )
4 years ago
AshburnLee
c3a3df6466
Add cuda support for unique op ( #27646 )
...
* unique op for cuda is added
* add support for cuda
* Add cuda support for unique op.
* Add support for int32_t and int64_t.
* For old version, process by cpu
* Add VisitDataType for thrust
4 years ago
Leo Chen
35074963e3
Refine error msg in paddle/fluid/framework/details [part 2] ( #27429 )
...
* refine broadcast_op_handle
* refine some error messages
* refine some files
* fix bug
* fix bug
* fix bug
* follow comments
* follow comments
4 years ago
Chengmo
0e101c4f6f
Fix test dist fleet heter ctr ( #27513 )
...
* fix test_dist_fleet_heter_ctr & performance update
4 years ago
joanna.wozna.intel
b0ee1405f7
Add conv2d bfloat16 support ( #27325 )
4 years ago
Thunderbrook
6f69a4cb05
add xpu in heter mode ( #27000 )
...
* add xpu in heter mode
test=develop
* BOOST_CONST_GET; PADDLE_THROW
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* refine
test=develop
* refine
test=develop
* refine
test=develop
* refine code
test=develop
4 years ago
WangXi
e550fc02ae
fleet2.0 add fp16 grad compression ( #27480 )
4 years ago
cc
c5c13473c6
Add compatibility check for four mkldnn pass ( #27364 )
...
* Add pass compatibility check for four mkldnn pass, test=develop
4 years ago
Wilber
3d5522146e
register seq_concat_fc_fuse pass. ( #27479 )
4 years ago
wanghuancoder
df43905f12
use iwyu clean include ( #27267 )
...
* use iwyu clean include, test=develop, test=win
* compilation error, test=develop
* fix compilation error2, test=develop
* fix compilation error3, test=develop
* fix compilation error4, test=develop
* fix compilation error5, test=develop
* fix compilation error6, test=develop
* fix compilation error7, test=develop
* fix compilation error8, test=develop
* fix compilation error8, test=develop
* fix compilation error10, test=develop
* fix compilation error11, test=develop
4 years ago
Pei Yang
8182337096
clear pass logs ( #27434 )
4 years ago
Shang Zhizhou
d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass ( #27404 )
...
* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0
* fix seqconv_eltadd_relu_fuse_pass unittest error
4 years ago
Leo Chen
aba759ba16
[Feature] Enhance inplace addto strategy for gradient accumulation in static graph ( #27112 )
...
* support use add instead of sum to do gradient accumulation
* add inplace addto pass
* add grad_add op and inplace addto pass
* remove debug code
* code refine
* fix bug when several sum ops are inserted at the same op_idx
* fix Flags type
* add addto attribute for conv3d
* fix ut
* code clean
* fix type
4 years ago
Wilber
39546aa2f3
Add pass compatible and unit test. ( #27377 )
4 years ago
Leo Chen
bbc84e0fe0
Refine error msg in paddle/fluid/framework/details [part 1] ( #25631 )
...
* refine error msg in var_handle.h, test=develop
* refine all_reduce_op_handle
* fix some error msg
* refine variable_visitor
* refine threaded_ssa_graph_executor
* refine inplace related files
* refine executor related files
* refine fetch_op_handle.cc
* fix bug
* follow comments
4 years ago
tangwei12
99626502f7
【paddle.fleet】gloo and util ( #27213 )
...
* fix worker endpoints
* fix gloo wrapper for hdfs
* GPU fleetrun support gloo
* parameterserver fleetrun support gloo
* fix get server endpoint
4 years ago
yaoxuefeng
d726fd5e86
enhance dataset err msg ( #27363 )
4 years ago
Pei Yang
fd7ab4e63c
register pass compatibility ( #27357 )
...
* pass compatibility
* add compatibility registry
* add unittests for different padding
* add assert
* drop errmsg
4 years ago
haozech
7e6dfcf9b2
Add 3 pass version check ( #27283 )
4 years ago
Shang Zhizhou
3c11717988
add op version checker to ir passes ( #27329 )
4 years ago
lilong12
9f9d15e285
fix the bug of non-exit, test=develop ( #27350 )
4 years ago
ShenLiang
54b81fa32c
add adaptivelsgd in meta_optimizer ( #27289 )
...
* add adaptivelsgd
* Todo fix the code to avoid the conflict.
4 years ago
Chen Weihang
4f9d6529fe
Polish framework error message part 7 ( #27266 )
...
* polish framework error message part 7
* fix typo
* polish by reviewer's comments
4 years ago
Wilber
f827665ae6
[Pass Compatible] Bind python compatible. ( #27262 )
4 years ago
Chen Weihang
dafb0e3bb7
Polish framework error message part 6 ( #27257 )
...
* polish framework error msg part 6
* polish missed items
* fix failed unittest
* polish by reviewer comments
4 years ago
joanna.wozna.intel
1483ea2304
Add bfloat16 passes ( #26999 )
4 years ago
Chen Weihang
79149c8ee6
polish framework error message part 8 ( #27269 )
4 years ago
ShenLiang
2b6a5793fe
remove auto mode from localsgd optimizer ( #27237 )
...
* rm auto from localsgd
4 years ago
JZ-LIANG
5d039f4086
modified the implement of Lars optimizer ( #26733 )
...
add lars to fleet meta optimizer
5 years ago
WeiXin
13804ed80c
Error msg/polish tensor error msg ( #26976 )
...
* polish one line error message in tensor.cc
* polish error messages in tensor.cc,tensor.h tensor_impl.h
* polish error messages in tensor.cc tensor.h tensor_impl.h
* polish error messages in tensor.cc,tensor.h tensor_impl.h
* polish error messages in tensor.cc tensor.h tensor_impl.h tensor_test.cc
* polish error messages in tensor.cc tensor.h tensor_impl.h
5 years ago
Pei Yang
5fb8c92054
fix multihead matmul shared params ( #27121 )
5 years ago
yaoxuefeng
7f3e6ca596
add cuda generator ( #26786 )
5 years ago
Feiyu Chan
c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatibility ( #26985 )
5 years ago
joanna.wozna.intel
95e1434bb2
Add bfloat16 data type ( #25402 )
5 years ago
Shang Zhizhou
61fc7a3e45
Pass version check ( #26887 )
5 years ago
wanghuancoder
2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor ( #26643 )
...
* optimized transformation from tensor to numpy, test=develop
* Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop
* modify CUDAPinnedPlace to CPUPlace, test=develop
* modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop
* revert fetch_op_handle, add fetch_async_op_handle, test=develop
* revert fetch_op_handle, add fetch_async_op_handle, test=develop
* fix error msg report, test=develop
* fix bug in cpuplace, test=develop
* fix bug in unmerge and tensorarray mode, test=develop
* fix bug, double copy gpu memory, test=develop
* fix chenweihang's review advice, test=develop
5 years ago
Thunderbrook
5205748481
fix eigen in push sparse; fix hadoop command ( #26872 )
...
* fix eigen in push sparse; fix hadoop command
test=develop
* add log in load_combine_op
test=develop
5 years ago
yaoxuefeng
a47d92d868
fleet add save with whitelist test=develop ( #23376 )
5 years ago
Adam
8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it ( #26779 )
5 years ago
Leo Chen
844583c8fd
Refine paddle.manual_seed ( #26496 )
...
* refine manual seed
* fix ci problem
* fix unittests
* fix unittest
* set is_init_py=false in manual_seed
* fix unittest
* fix bernoulli_op
* fix(unittest): change random_seed to manual_seed
* 🐞 fix(unittest): fix manual_seed
* trigger ci
* fix test_sentiment
* fix test_imperative_save_load
* fix test_uniform_random_op
* fix test_uniform_random_op
* fix test_jit_save_load
* merge develop
* fix manual_seed
* fix manual_seed
* use global engine
* use shared_ptr
* fix double free
* fix bug
* fix bug
* fix bug
* fix test bug
* fix test bug
* fix test bug
* fix ci
5 years ago
Pei Yang
e3f8e5cf5c
trt int8 support conv2d_transpose ( #26636 )
5 years ago
zhangchunle
623a4c2e56
fix ci coverage build error ( #26761 )
5 years ago
joanna.wozna.intel
eb097d64f6
Fix int8 performance drop cpu_quantize_placement_pass ( #26715 )
...
* Fix cpu quantize placement pass
* Include string lib
5 years ago
Wilber
1c898b66d6
add bug fix enum. ( #26736 )
5 years ago
Zhou Wei
8071d23073
fix bug that can't print int8_t ( #26712 )
...
fix bug that can't print int8_t
5 years ago
Adam Osewski
c2c689582e
Update Paddle-Lite commit hash. ( #26413 )
...
* Update Paddle-Lite commit hash.
* Add BF16 data type to VarType protobuf message.
5 years ago
lilong12
1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis ( #26552 )
...
add collective op for cpu using gloo and paddle.distributed.* apis
5 years ago
joanna.wozna.intel
559e43eee4
Small change in conv2d and quantize pass ( #26671 )
5 years ago
石晓伟
32ceacf317
update op_version_registry, test=develop ( #26644 )
5 years ago
Dong Daxiang
08d736ad78
【paddle.fleet】add cudnn related strategies to DistributedStrategy ( #26598 )
...
* add cudnn related strategies to DistributedStrategy
5 years ago
wanghuancoder
c1f5df5269
optimized transformation from tensor to numpy ( #26447 )
...
* optimized transformation from tensor to numpy, test=develop
* optimized transformation from tensor to numpy, pass pre-commit, test=develop
* modify fetchophandle zerocopy to deepcopy in PE&CPU, test=develop
* modify py:array construct, test=develop
* fix _fetch_var to use deep copy, test=develop
5 years ago
石晓伟
fa08a834be
update op_version_registry, test=develop ( #26592 )
5 years ago
石晓伟
656e60b18f
new class: op_version_registry, test=develop ( #26542 )
5 years ago
Jack Zhou
199b0c7c1b
Add isfinite v2 op ( #26344 )
...
add the isnan, isfinite, isinf api for the paddle 2.0
5 years ago
QingshuChen
138ecf24aa
support Baidu Kunlun AI Accelerator ( #25959 )
...
* support Baidu AI Accelerator
* test=kunlun
* minor
* test=kunlun
* support xpu op in separate file
* test=kunlun
* update XPU error message and remove duplicated code
* test=kunlun
* minor
* test=kunlun
* minor
* test=kunlun
5 years ago
Chen Weihang
9108282883
Polish framework error message part 5 ( #26204 )
...
* polish framework error msg part 5
* revert enforce change
* refine error type
* trigger ci check
* polish details by review comment
5 years ago
Pei Yang
b757466b0d
fix trt dynamic ernie serialization unit test ( #26228 )
5 years ago
Wilber
3ec0bcbbb8
[Bug] Fix prune for save_inference_model about transformer ( #25347 )
5 years ago
cc
3f816bc8b4
[Quantization] Conv2d_transpose and mul support channelwise quantization ( #25639 )
...
* Conv2d_transpose and mul support channelwise quantization, test=develop
* Skip collecting out threshold for output tensor of which the type is not fp32 or fp64, test=develop
* Fix error in test_user_defined_quantization, test=develop
* Add depthwise_conv_bn_fuse, test=develop
* Add conv_transpose_bn_fuse_pass for post_training_quant, test=develop
5 years ago
Thunderbrook
a83e0f264c
fix heter proto ( #26093 )
...
test=develop
5 years ago
yaoxuefeng
23261ff44b
add cpu random Generator ( #26013 )
5 years ago
Zhou Wei
6de463d3d1
expose and unify the Tensor concepts to the user ( #25978 )
...
* expose and unify the Tensor concepts to the user
* expose tensor to user
* add copy place for Tensor
* add copy place for Tensor
* add note
* add macro PADDLE_WITH_CUDA
* remove RUN_TYPE=DIST
* fix some error
5 years ago
Dong Daxiang
50a5bcfc9d
【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. ( #26186 )
...
* move paddle.fleet to paddle.distributed.fleet
5 years ago
Leo Chen
ffe52b4452
[OpDevOptimize] Add common infershape functions ( #26096 )
...
* add unchanged infershape function
* add broadcast infershape function
* fix bug
* rename infershape functions
* add UnaryOpUnchangedInferShapeCheckAxis
* add error message
* add test for common infer shape functions
* dont update existed ops
* dont update op_desc.h
* add more test
* add error check, refine error message
5 years ago
Chen Weihang
838e36e9ed
Fix loaded variable suffix repeat error ( #26169 )
...
* fix loaded var suffix repeat error
* use new dygraph name for loaded param
5 years ago
JZ-LIANG
54003b873e
【paddle.fleet】add lamb to fleet meta optimizer ( #26025 )
...
add lamb to fleet meta optimizer
5 years ago
Yiqun Liu
1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. ( #26121 )
5 years ago
MRXLT
6559229b7e
fix encryption infer ( #25979 )
...
* add encrypt for inference lib
* fix code;test=develop
* fix test; test=develop
* bug fix; test=develop
* add MakeCipher;test=develop
* fix bug;test=develop
* move MakeCipher to paddle space; test=develop
* fix include dir ;test=develop
* add include dir; test=develop
* move include; test=develop
* move include; test=develop
* fix for windows ci
* fix cmake; test=develop
* fix bug
bug fix
5 years ago
tangwei12
c14ec8782b
【paddle.fleet】Feature/fleet ps api 2.0 ( #25857 )
...
* add paddle.fleet.AsyncOptimizer
Co-authored-by: dongdaxiang <dongdaxiang@baidu.com>
5 years ago
joanna.wozna.intel
734cf1c3e9
Change use_quantizer attribute name and data type ( #25838 )
...
* Change use_quantizer attribute name and data type
* Fix problem with setting attribute
* Add changes due to review
* Small change in function
* Restore use_quantizer attr for compatibility
5 years ago
tangwei12
3755564ae1
Fix/large scale fix ( #25999 )
...
* fix large scale KV
* fix single training using async ssa graph
5 years ago
Thunderbrook
fd2947babf
fix compile error with mkl ( #26030 )
...
test=develop
5 years ago
Leo Chen
0a47387bd8
Use static local variable instead of global variable for safety ( #26018 )
...
* remove global variable
* refine code
5 years ago
123malin
2191a08317
【paddle.fleet】fleet_util move to paddle.fleet ( #25805 )
...
* test=develop,test=document_fix, remove the out args
* fleet_util move to paddle.fleet
Co-authored-by: WuHaobo <wuhaobo1994@gmail.com>
Co-authored-by: tangwei12 <tangwei12@baidu.com>
5 years ago
Thunderbrook
0cb60c700d
add heter ps mode ( #25682 )
...
* add heter ps mode
* code style
test=develop
* add with_pslib
test=develop
* unittest
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* code style
test=develop
* test monitor
test=develop
* prepare trainer
test=develop
* code style
test=develop
5 years ago
WangXi
2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer ( #25738 )
...
Add dgc to fleet meta optimizer, rm dgc from optimizer all
5 years ago
WangXi
a6c87fd091
Add amp to fleet meta optimizer, test=develop ( #25770 )
5 years ago
xujiaqi01
d11c140e28
fix dump, fix cvm check ( #25400 )
...
* fix dump, fix cvm check
test=develop
* fix
test=develop
* fix
test=develop
* fix
test=develop
5 years ago
JZ-LIANG
8ebffc78c9
add lars to fleet meta optimizer ( #25884 )
5 years ago
Dong Daxiang
8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies ( #25844 )
...
* split meta optimizer files
* add graph execution in execution, update two properties in DistributedStrategy, unit tests for these features
5 years ago
Chen Weihang
48b9a56f1c
Polish framework error message - part 4 ( #25807 )
...
* polish framework error message part 4
* fix type error
* fix message error
* polish by review comments
5 years ago
lilong12
5f30e57cdd
fix test_pipeline, test=develop ( #25808 )
...
* fix test_pipeline, test=develop
5 years ago
Chen Weihang
d47304e6d9
Refine paddle error stack format ( #25790 )
...
* refine error stack format
* polish compile traceback format
* polish detail format
5 years ago
tangwei12
caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) ( #22957 )
...
* Integrated Trainer of Parameter Server
5 years ago
Dong Daxiang
a96d54ac19
Generate final strategy ( #25782 )
...
* refine strategy compiler and meta optimizers
make async as a_sync
5 years ago
Chen Weihang
1b3081b1b4
Simplify BufferedReader to improve DataLoader performance ( #25648 )
...
* simplify buffered reader to improve DataLoader performance
* fix 22 failed unittests
* fix cuda pinned context condition
* fix test_reader_reset failed
* fix two failed unittests
* change unittest place
* polish error message
* polish cast op GetExpectedKernelType
* remove debug info in unittest
5 years ago
Zhou Wei
e0a9115e28
fix random compile failure due to missing file ( #25661 )
5 years ago
Dong Daxiang
920d998f1e
add more settings for distributed strategy ( #25685 )
...
* add more settings for distributed strategy
Basically, DistributedStrategy has several parts of configurations:
- BuildStrategy: the same as paddle.fluid.BuildStrategy, but the distributed arguments are moved out of BuildStrategy
- ExecutionStrategy: the same as paddle.fluid.ExecutionStrategy
- collective communication configs: nccl_comm_num, hierarchical allreduce and so on
- distributed algorithms: async_update(mainly used in PS), lars, lamb and so on
5 years ago
Chen Weihang
c34c80d302
Polish framework error message part3 ( #25701 )
...
* polish framework error message part3
* polish details
* fix error message print error
5 years ago
arlesniak
e52df3b125
Added DNNL cache management for DyGraph ( #25624 )
...
* Added DNNL cache management for DyGraph
* move FLAGS_use_mkldnn to more general CMakeLists, make use of the flag in ClearGradients
* missing file
* Fixes after review
* Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform and imperative.
* Removed duplicate and added docs
* Fixes for CI
5 years ago
Chen Weihang
364cc53618
Polish paddle fluid framework error message - part2 ( #25667 )
...
* polish framework error msg part2
* polish details
5 years ago
Adam
98899b73d2
Fix FC + GRU fuse pass ( #25687 )
5 years ago
wanghuancoder
1917b38099
fix some errmsg report,in framework/ir/, about 21 files ( #25525 )
...
* fix error msg report in ir/, about 19 files, test=develop
* modified some unclear descriptions, test=develop
* modified some unclear descriptions, test=develop
* modify unit test pass_test.cc, because the error report in pass.cc is used by pass_test.cc, test=develop
5 years ago
joanna.wozna.intel
e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict ( #25637 )
...
test=develop
5 years ago
石晓伟
7206417259
supports xpu runtime, test=develop ( #25554 )
...
* update ResetHolder, test=develop
* add TensorShare for lite engine, test=develop
* tensor data changed from copying to sharing, test=develop
* supports xpu runtime, test=develop
* fix code styles, test=develop
5 years ago
Chen Weihang
dfb3ae1b9b
Polish some error message in framework holder - Part 1 ( #25509 )
...
* polish some error message in framework, test=develop
* fix unittest error, test=develop
* replace PADDLE_ENFORCE, test=develop
* polish details based review comment, test=develop
5 years ago
wanghuancoder
9b46fe0440
fix some errmsg report,in framework/ir/, about 5 files ( #25539 )
...
* fix error msg report in ir/, about 5 files, test=develop
* fix error msg report in ir/, about 5 files, test=develop
* fix error msg report in ir/, about 5 files, test=develop
5 years ago
Dong Daxiang
e657d7062d
fleet base initial implementation and the API ( #25442 )
...
refactor fleet api under paddle.fleet
update DistributedStrategy
5 years ago
wanghuancoder
e65c5b8e83
fix some errmsg report, in framework/ir/ ( #25471 )
...
* fix paddle/fluid/framework/ir/ error msg report, test=develop
* modify error msg report in ir/, about errortype, grammar, supplementary info, test=develop
* modified some unclear descriptions, test=develop
* Modify the problem that report msg is less than 20 characters, test=develop
5 years ago
wanghuancoder
6c0982b942
fix some errmsg report, in framework/ir/mkldnn ( #25467 )
...
* fix paddle/fluid/framework/ir/mkldnn/ error msg report, test=develop
* modify error msg report, about errortype, grammar, supplementary info, test=develop
* modified some error descriptions, test=develop
5 years ago
wanghuancoder
fce6466217
fix some errmsg report, in framework/ir/ subdir(memory,optimizer,multi_device) ( #25460 )
...
* fix paddle/fluid/framework/ir/multi_devices_graph_pass/ error msg report, test=develop
* fix paddle/fluid/framework/ir/memory_optimize_pass/ error msg report, test=develop
* fix paddle/fluid/framework/ir/fuse_optimizer_ops_pass/ error msg report, test=develop
* fix paddle/fluid/framework/ir/memory_optimize_pass/ error msg report about PADDLE_ENFORCE, test=develop
* modify error msg report, about errortype, grammar. test=develop
* modify error msg report, about PADDLE_ENFORCE to PADDLE_ENFORCE_XXX, test=develop
* modify error msg report, about PADDLE_ENFORCE to PADDLE_ENFORCE_XXX, and %s to %d, test=develop
* modified some error descriptions, test=develop
5 years ago
Chen Weihang
0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment ( #25435 )
...
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop
* replace old macro & for condition, test=develop
* polish details, test=develop
5 years ago
Chen Weihang
4061aa6488
Polish ParallelExecutor exception process logic ( #25449 )
...
* polish pe exception process logic, test=develop
* fix unittest, test=develop
* add unittests, test=develop
5 years ago
lilong12
e39aa70ec7
add the support for pipeline ( #24560 )
...
* add device_worker for pipeline, test=develop
5 years ago
hong
70d7d07fea
catch bad alloc exception ( #25140 )
...
* catch bad alloc exception; test=develop
* add unittest; test=develop
* move bad alloc catch to the first place; test=develop
* polish error message; test=develop
* polish error message; test=develop
* add mutex header; test=develop
5 years ago
gongweibao
80f1c50738
Fix typo in interface. ( #24779 )
5 years ago
Zhaolong Xing
7b7e605189
[Fix BUGs]: fix multihead matmul pass's instability bug ( #25123 )
...
* fix multihead matmul's instability
test=develop
* fix multihead matmul bug
test=develop
* fix coverage problem
test=develop
5 years ago
Dong Daxiang
d5e40d1ba9
Paddle fleet distributed strategy ( #25379 )
...
* add paddle.fleet.DistributedStrategy for 2.0
5 years ago
Wojciech Uss
d0a921ba98
Quant2 updates and fixes ( #25313 )
5 years ago
Jacek Czaja
17c751bec6
[oneDNN] Fix to #25078 ( #25256 )
5 years ago
MRXLT
3b8f0a64c2
Encryption infer ( #25119 )
...
* add encrypt api for inference lib
5 years ago
Sylwester Fraczek
36abeff44f
adding elementwiseadd quantization ( #25178 )
5 years ago
Pei Yang
b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models ( #25097 )
...
* Paddle-TensorRT support slim QAT. test=develop
* add comments. test=develop
* use RenameInput instead of ResetInputs. test=develop
5 years ago
Shibo Tao
19c4db1b56
don't re-generate header file if content doesn't change ( #25130 )
...
* don't re-generate header file if content doesn't change. test=develop
* add copy_if_different function. test=develop
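
The idea behind this commit can be sketched in a few lines. This is only an illustration, not the code from the PR: the Python helper below borrows the name `copy_if_different` from the commit message, and its signature and return value are assumptions. It leaves the destination untouched when the bytes already match, so a build system that keys on file modification time does not rebuild dependents of an unchanged generated header.

```python
def copy_if_different(src: str, dst: str) -> bool:
    """Copy src over dst only when dst is missing or its content differs.

    Hypothetical helper for illustration. Returns True when dst was written.
    """
    with open(src, "rb") as f:
        new_content = f.read()
    try:
        with open(dst, "rb") as f:
            if f.read() == new_content:
                # Same bytes: do not touch dst, so its mtime is preserved.
                return False
    except FileNotFoundError:
        pass  # dst does not exist yet, fall through and create it
    with open(dst, "wb") as f:
        f.write(new_content)
    return True
```

A generator would write to a temporary file first and then call `copy_if_different(tmp_path, header_path)`.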
5 years ago
Jacek Czaja
a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support ( #24984 )
...
* Start implementing int8 eltwise add
test=develop
* - Fix to Michal PR
* - Fix
test=develop
* - Lint fixes
test=develop
* - Added checking if elementwise_mul can be used
test=develop
* - Added attribs to skip_attrs_set
test=develop
* - Improved broadcasting
test=develop
- fixes to compilation
- fix
- fix
- Lint fixes
test=develop
* - removed redundant condition
test=develop
Co-authored-by: Michal Gallus <michal.gallus@intel.com>
5 years ago
hutuxian
5822862d8a
Monitor Framework ( #24079 )
...
* Add a StatValue class in the backend to represent a stat.
* Add a singleton StatRegistry to maintain the collection of stats.
* For the sake of code neatness, we only support type of int and float, which can cover most of the scenarios.
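
The design described in these bullets can be illustrated with a small sketch. It is written in Python for brevity and is not the backend implementation from the PR: only the names `StatValue` and `StatRegistry` come from the commit message, while every method name (`increase`, `reset`, `get`, `instance`, `snapshot`) is an assumption made for illustration.

```python
import threading
from typing import Dict, Union

Number = Union[int, float]


class StatValue:
    """One stat holding an int or a float (hypothetical sketch)."""

    def __init__(self, initial: Number = 0) -> None:
        # Mirror the "int and float only" restriction; bool is rejected too.
        if isinstance(initial, bool) or not isinstance(initial, (int, float)):
            raise TypeError("only int and float stats are supported")
        self._value = initial
        self._lock = threading.Lock()

    def increase(self, delta: Number) -> Number:
        with self._lock:
            self._value += delta
            return self._value

    def reset(self, value: Number = 0) -> None:
        with self._lock:
            self._value = value

    def get(self) -> Number:
        with self._lock:
            return self._value


class StatRegistry:
    """Process-wide singleton that owns all stats, keyed by name."""

    _instance = None
    _instance_lock = threading.Lock()

    def __init__(self) -> None:
        self._stats: Dict[str, StatValue] = {}
        self._lock = threading.Lock()

    @classmethod
    def instance(cls) -> "StatRegistry":
        with cls._instance_lock:
            if cls._instance is None:
                cls._instance = cls()
            return cls._instance

    def get(self, name: str, initial: Number = 0) -> StatValue:
        # Create the stat on first use, return the existing one afterwards.
        with self._lock:
            if name not in self._stats:
                self._stats[name] = StatValue(initial)
            return self._stats[name]

    def snapshot(self) -> Dict[str, Number]:
        with self._lock:
            return {name: stat.get() for name, stat in self._stats.items()}
```

Callers anywhere in the process would reach the same stat through `StatRegistry.instance().get("some_name")`, which is the reason for making the registry a singleton.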
5 years ago
Leo Chen
25a4dac4c2
Use allow list instead of white list ( #25002 )
...
* use allow list instead of white list, test=develop
* reduce include, test=develop
5 years ago
hutuxian
1c224e26af
support CMatchAuc ( #24990 )
...
Support CMatchAucCalculator based on CMatchRankAucCalculator with a new parameter ignore_rank
5 years ago
Sylwester Fraczek
53d563a0fe
Reshape transpose matmul coverage ( #24970 )
...
* remove gmock from ut
test=develop
* coverage enabled for r+t+m fuse pass
test=develop
5 years ago
Sylwester Fraczek
a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free ( #24929 )
...
test=develop
5 years ago
Zhou Wei
4058e736ff
temporarily disable these unittests failed on windows ( #24942 )
5 years ago
lilong12
29de0d97a5
add the support to specify device index for device_guard ( #24555 )
...
* add the support of device index for device_guard.
5 years ago
hutuxian
b8f17a049d
fix problem in dump and add log ( #24891 )
...
* Fix the field length in LoD scenario
* Fix the missed lod info when copy tensor in dump field
* Add some logs to make debugging easier
5 years ago
Chen Weihang
6aae034f8d
add dep for fs.cc, test=develop, test=document_fix ( #24881 )
5 years ago
Jacek Czaja
40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor ( #24756 )
5 years ago
Chen Weihang
d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW ( #24759 )
...
* remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibit LOG(FATAL) using, test=develop
* remove ci test case, test=develop
* replace all LOG(FATAL) & polish message, test=develop
* fix typo, test=develop
* polish error info detail, test=develop
5 years ago
123malin
9d2bd0ac38
downpour_worker: add try_catch mechanism, print all program parameters ( #24700 )
...
* test=develop, add try_catch for debug
5 years ago
Michał Gallus
b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail ( #24853 )
5 years ago
Wojciech Uss
78d4f0cc91
add option to exclude ops by id from quantization ( #24689 )
5 years ago
Yanghello
62b4ff7dd2
Fix Aes_cipher_test and cipher_utils_test failures ( #24816 )
5 years ago
Yanghello
5a7a517cde
Add crypto api ( #24694 )
5 years ago
Chen Weihang
19e5f7879c
Append error op hint for GradOpMaker ( #24750 )
...
* append error op hint for grad op maker, test=develop
* add unittests for coverage, test=develop
5 years ago