Commit Graph

16509 Commits (7e2665c58b95c7ec98527ab64911355a38e0587d)

Author SHA1 Message Date
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
5 years ago
rensilin 7f5d532a9c fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461)
5 years ago
xujiaqi01 f404157205
fix master patch when slot is dense (#21580)
5 years ago
xujiaqi01 c05706fe73
fix code style of fleet_wrapper (#21639)
5 years ago
wangchaochaohu 95b95a284b
Mean gpu optimize (#21643)
5 years ago
Leo Chen 48600d7f17
Add op function generator for dygraph (#21569)
5 years ago
lidanqing fbf9eca0d3 QAT Int8 document (#21360)
5 years ago
liym27 be6a639655
Add CI for checking Input/Output/Attr of modified Ops (#21522)
5 years ago
Leo Chen 4f81d1bd5f
Refine VarBase init function (#21587)
5 years ago
Leo Chen 84b7267100
dygraph_grad_maker supports varbase without grad_var (#21524)
5 years ago
xujiaqi01 88960684aa
rm optimize_for in framework.proto (#21571)
5 years ago
Zeng Jinle 0f8888360e
Polish op registry codes (#21561)
5 years ago
Aurelius84 3d9dee575e
Set lod_level of Out in compile time of sequence_pool_op (#21604)
5 years ago
zhouwei25 346705967d monitoring changes of unittest, delete one unittest will need approve (#21377)
5 years ago
Zeng Jinle 97e76cb96d
refine dev_ctx.Wait() exception throw, test=develop (#21600)
5 years ago
Huihuang Zheng 1dcf6a7212
Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532)
5 years ago
hutuxian c5aec2fe68
Paddlebox Related to Framework (#21586)
5 years ago
liym27 9da7e6b4d4
add file check_op_desc.py and add interface to get default value. (#21530)
5 years ago
Jacek Czaja 8f5a93a07b - Fix to regression in performance of ResNet-50 training (#21588)
5 years ago
Jacek Czaja 9ce0e29dc3 [MKL-DNN] Batch norm mkl-dnn NHWC support (#21553)
5 years ago
Zeng Jinle 3a7caf481c
add grad maker assert, test=develop (#21564)
5 years ago
Huihuang Zheng b241c7329c
Refine a Warning Which Can Occur Not Only During Init (#21546)
5 years ago
Pei Yang 20d61414b4
fix glog warning, test=develop (#21573)
5 years ago
wangchaochaohu 932aca162d
Add Branch to avoid CPU profiler warning print (#21556)
5 years ago
Leo Chen cdd46d7e02
Split VarBase from Python Variable for Dygraph (#21359)
5 years ago
Youwei Song cdba41af4d dygraph Embedding layer use lookuptable v2 (#21209)
5 years ago
Pei Yang 122b37ce62
make config option DisableGlogInfo() able to mute all inference logs (#21318)
5 years ago
wangchaochaohu 4c9b3dafa7
fill_constant_batch_size_like OP precious problem fix (#21337)
5 years ago
Aurelius84 fa7cff1fee
Add CI for checking registered data_type of new Op (#21488)
5 years ago
Zhaolong Xing da7748c53d
add conv, depthwise_conv, pooling (#20966)
5 years ago
WangXi 768f9242e9 Fix dgc clip & rampup step, test=develop (#21491)
5 years ago
hong 0b75a0c10b add overrider for virtual function to avoid warning (#21503)
5 years ago
Aurelius84 54382ce497
Add get_all_kernels api of registered data_type in pybind.cc (#21499)
5 years ago
Zeng Jinle 3662fb71a7
remove eval() calls in Eigen, test=develop (#21498)
5 years ago
Jacek Czaja 18a5d30754 [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
5 years ago
GaoWei8 250a192181 Add ernie large c++ inference test (#21365)
5 years ago
zhongpu 6ebf0f47b8 support SelectedRows in dygraph, test=develop (#21078)
5 years ago
Tao Luo 70eb397677
remove unused snappy/snappystream depends in distributed codes (#21484)
5 years ago
lilong12 0bc8bdf724
set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402)
5 years ago
Zhaolong Xing c5f0293cf3
NV jetson(nano, tx2, xavier) inference compile support (#21393)
5 years ago
Zhaolong Xing b39c011637
specify the auto growth allocator for inference. (#21448)
5 years ago
tangwei12 0bddb951c2
fix async mode, test=develop (#21367)
5 years ago
Zeng Jinle 81ef8b7f8f
Fix CI DefaultGradOpMaker check (#21482)
5 years ago
Huihuang Zheng a71f53d7ac
Add warning message when initialize GLOG failed. (#21487)
5 years ago
Leo Chen b3090ad406
fix synchronization problem in softmax_with_cross_entropy_op, test=develop (#21480)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
5 years ago
hutuxian 7e68bc896b
refactor AUC OP and add its CUDA Kernel (#21336)
5 years ago
wawltor dbbe6e9cb6
fix the device supported of the op unique and unique_with_counts. (#21395)
5 years ago
wangchaochaohu d4776ec027
fix the correctness of memcpy profiling result test=develop (#21458)
5 years ago
wangguanzhong 379e3febf2
fix shape check in density_prior_box, test=develop (#21414)
5 years ago
Adam 76b55da15a Fix bug in UpdatePadding for int64_t type (#21465)
5 years ago
Pei Yang 7b28d938bf
show shape diff in wrong trt input shape errmsg, test=develop (#21451)
5 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
5 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
5 years ago
Chen Weihang 664f958a02
Fix optimizer op infershape failed in dygraph multi-cards mode (#21374)
5 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
5 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
5 years ago
Leo Chen add62acfd1
remove kDepXOut for abs_grad op, test=develop (#21407)
5 years ago
Chen Weihang 407f883f5a
Add SelectedRows support for dygraph DebugString (#21415)
5 years ago
Adam 9107bf209f Add template version of UpdatePadding (#21426)
5 years ago
zhaoyuchen2018 b16274556a
Add dscending for argsort (#21400)
5 years ago
Zeng Jinle 6b09b73e17
add explicit conversion to NoNeedBufferVarsFunctor, test=develop (#21430)
5 years ago
hong ac8546701d
Add dygraph execution context (#20157)
5 years ago
hutuxian a6b089c614
add macro to ban windows (#21422)
5 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
5 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
5 years ago
tianshuo78520a d624b417d8 change make nproc on Cloud Integration (#21350)
5 years ago
wangchaochaohu 8293f21a52
Profile refine (#21258)
5 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
5 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
5 years ago
ShenLiang e2c6f434ec Add Lod information for gather_nd & scatter_nd (#21404)
5 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
5 years ago
wangchaochaohu e0e205ea2d
fix the profiling bug test=develop (#21396)
5 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
Zhaolong Xing d1a6e112e6 fix C++ multicard inference bug. (#20955)
5 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
5 years ago
Youwei Song d5ff79e55e Support numpy bridge (enabled by default in dygraph mode) (#20983)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
5 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
5 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
5 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
5 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
5 years ago
Zeng Jinle dbba9c7e4b
polish global_value_getter_setter, test=develop (#21332)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
silingtong123 45c1e7bb7b add prediction demo and script on windows (#21248)
5 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
5 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
Thunderbrook 9a7832f8be
print table stat info for pslib (#21296)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
Dong Daxiang 691ced87c0
Refactor fetch handler (#21264)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
5 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
5 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
5 years ago
Chen Weihang 952508527a
Polish some PE code details (#21274)
5 years ago
Thunderbrook 0d17c1b816
solve pslib core in stop worker (#21263)
5 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
5 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
5 years ago
Thunderbrook 349e82d669
support general embedding params (#21217)
5 years ago
liu zhengxi 3cb6c0a059
Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240)
5 years ago
Yiqun Liu 6b1e1f0dda
Enable generating code for a given subgraph. (#21126)
5 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
5 years ago
Pei Yang 2e2f92a5b1
fix trt weight bug (#21231)
5 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
5 years ago
zhouwei25 c0dcb090a3 Determine whether to copy and link inference lib by ON_INFER (#20931)
5 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
5 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
5 years ago
Zeng Jinle a152315be7
refine Tensor method, test=develop (#21031)
5 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
5 years ago
Zeng Jinle 67e88424e5
Polish jit trace codes (#21218)
5 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
5 years ago
liuwei1031 d8b6cf2bcd
fix sporadically hang issue on windows(#21201)
5 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
5 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
5 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
5 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
5 years ago
xujiaqi01 23876de55b
fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)
5 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
5 years ago
GaoWei8 a9d4eed3a8 fix cmake fails on inference_download_and_uncompress (#21185)
5 years ago
xujiaqi01 9e045170c0
add copy table (#21086)
5 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
5 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
5 years ago
Zeng Jinle 5fdfbe3413
Add friendly dygraph trace API (#21091)
5 years ago
Chen Weihang 4bd9463630
fix detail error message error, test=develop (#21170)
5 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
5 years ago
Adam d74ea0855f Add relative error measure when (value > 1) (#21144)
5 years ago
Chen Weihang b3a3e6f60c change cuda enforce & add example (#21142)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
5 years ago
Chen Weihang 27fa9c100b
add examples for resource exhausted error, test=develop (#21140)
5 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
ceci3 f62a929151
fix instance norm (#21042)
5 years ago
Zeng Jinle d625aaf0c1
remove so many logs of parallel executor, test=develop (#21105)
5 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
5 years ago
Chen Weihang edd6680a71
Further simplify the C++ error info stack (#21093)
5 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
5 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Yiqun Liu 9091f8cdf9
Support generating code for grad_op (#21066)
5 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
LielinJiang 06063b7001 add op locality_aware_nms, test=develop (#20976)
5 years ago
wangchaochaohu fc385777e4
fix the compile cost long time test=develop (#21064)
5 years ago
Chen Weihang 2f27b10331
Add dependency for error_codes.proto (#21084)
5 years ago
wangchaochaohu 149a1e3124
Expand refine (#21063)
5 years ago
Wojciech Uss af3ff422cc Fix dst memory allocation in elementwise_add (#21059)
5 years ago
liym27 26a6e27afe fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)
5 years ago
GaoWei8 829bf871d7 Add ernie c++ inference test (#21015)
5 years ago
mapingshuo b592deec90 add dlpack to imdb demo, test=develop (#21069)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
5 years ago
Huihuang Zheng 1957192f05
Add select_input_op and select_output_op (#21016)
5 years ago
Liufang Sang e5e699ecc0 set lod level for compile time test=develop (#21022)
5 years ago
liym27 f0e95a6049 Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017)
5 years ago
ddokupil c98712d56e add ending message in paddle_build.sh (#20334)
5 years ago
Zeng Jinle a710ccc0cb
refine error message of allocator again, test=develop (#21023)
5 years ago
tianshuo78520a d89ca2ffb5 split api_spec document (#20999)
5 years ago
zhaoyuchen2018 0059404e77
Fix ce ocr_recognition test fails (#20987)
5 years ago
Zeng Jinle f56967c483
refine error message of gpu allocator, test=develop (#21008)
5 years ago
Chengmo bc8e600ce5
Fix rpc not wait in GEO communicator (#20967)
5 years ago
Leo Chen 008ed65fd5 Add c++ global current tracer for dygraph (#20882)
5 years ago
Zeng Jinle 5aae595902
fix no_need_buffer_vars_dep, test=develop, test=document_fix (#21007)
5 years ago
xujiaqi01 1d1a07937a
simplify master+patch,remove ins when size != merge_size or has conflict slot (#20913)
5 years ago
Tao Luo 25ffa8445d
refine murmurhash3_x64_128 for bloom_filter (#20996)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
wangchaochaohu bf379fef96
refine code for code reuse test=develop (#20988)
5 years ago
Yucheng 98f1cebd38 add sample code test under python3 and enabled multi-thread (#20950)
5 years ago
Zhang Ting de9bec607e lrn supports channel_last input, test=develop (#20954)
5 years ago
Liufang Sang 9b666cae67 fix diff in dequantize op between cpu and gpu test=develop (#20953)
5 years ago
zhongpu 065804d39e fix bug in grad_op compute for dygraph, test=develop (#20975)
5 years ago
Wilber c534149642
fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960)
5 years ago
Zhang Ting f4f85831d3 fix the bug of conv_transpose cudnn kernel, test=develop (#20958)
5 years ago
wangchaochaohu 7695b713e1
gpu info query refine test=develop (#20904)
5 years ago
Leo Chen 2c3c579b9b tensor.set() supports array list and remove unused code, test=develop (#20959)
5 years ago
WangXi eec4fa9099 And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935)
5 years ago
Leo Chen 9974e40787 Update Tensor.set() to support float16 (#19964)
5 years ago
zhaoyuchen2018 7f3a445e9a
Fix gru as small frame_size has error. (#20922)
5 years ago
Zeng Jinle b0c0ffb9ae
refine pe when exception raises, test=develop (#20894)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
Chengmo 16596f6498
Fix Paddle Cloud role maker (#20860)
5 years ago
liym27 59de8e1214 Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912)
5 years ago
Zhang Ting 8d1e9f0f7e maxout supports channel_last input (#20846)
5 years ago
Yihua Xu b6260f3866 Optimize the kernel implementation of layernorm with openmp (#20895)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Thunderbrook 59bcdc8a19
support dump param of model into afs (#20302)
5 years ago