Commit Graph

15991 Commits (76a581972881084ab369e8d850830f0f7d5f0906)

Author SHA1 Message Date
Zhang Ting 439d95e157 modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287)
5 years ago
Zhang Ting b38889413d add crop_tensor_op, test=develop, test=document_preview (#19314)
5 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
5 years ago
joanna.wozna.intel 3f1d0234ae Fix conv2d+dequantize squash for residual fusion (#19545)
5 years ago
Huihuang Zheng a35557d8f4 Fix deps of prune (#19876)
5 years ago
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
5 years ago
flame fe18cfdb4f hide with inference optim API (#17355)
5 years ago
Leo Chen 578a2f5da3 fix SplitLodTensor when batch_size = 0, test=develop (#19866)
5 years ago
Aurelius84 b125e327aa Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
5 years ago
wopeizl a7c440d303 add precise roi pooling op test=develop (#18960)
5 years ago
Yiqun Liu 3cd985a669 Add a pass to fuse fc+elementwise_add+layernorm (#19776)
5 years ago
Jie Fang d9db94d752 Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
5 years ago
wangchaochaohu 47af618f70 Strided slice (#19642)
5 years ago
Zeng Jinle 13ca364ceb remove some flags and add comments to some flags, test=develop (#19813)
5 years ago
123malin 1bc285a53a add retry function to try to solve grpc error code 14 (#19661)
5 years ago
Zeng Jinle 5eb381a3e2 refine reallocate of workspace size, test=develop (#19843)
5 years ago
石晓伟 71b2ed61bc support MLU nums, test=develop (#19372)
5 years ago
Zeng Jinle 3f87464e9c refine executor_gc_helper codes, test=develop (#19814)
5 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
5 years ago
Zeng Jinle 3fd3b663a8 fix gc bug in controlflow ops, test=develop (#19827)
5 years ago
Leo Chen 982e61f5ff Update elementwise double grad to save gpu memory (#19509)
5 years ago
Zeng Jinle db26de8389 [Bug fix] Disable memory reuse on feeded variables (#19835)
5 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
5 years ago
Zeng Jinle cabb9501bd fix leaky_relu op when alpha is zero, test=develop (#19833)
5 years ago
Pei Yang 9cbc1eff2d zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
5 years ago
chengjuntao 00efd1d8a9 add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
5 years ago
Thunderbrook 40c66f8df9 rm return in vfork (#19734)
5 years ago
Zhaolong Xing 110be57c1b fix memory optimization type (#19781)
5 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
5 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
5 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
5 years ago
liym27 e9e3c08777 fix expand op: (#19302)
5 years ago
xujiaqi01 6bf298bf09 support preload thread, optimize hdfs log, fix master+patch bug (#19695)
5 years ago
Huihuang Zheng a0d80754c5 Add comments for CUDA Device Context Allocator related stuff (#19809)
5 years ago
Jiabin Yang cc311bdf95 Feature/add transform data dygraph (#19707)
5 years ago
lvmengsi b76343c3b7 cpu Conv double grad (#19672)
5 years ago
Zeng Jinle 754fd57ed7 disable memory optimization passes when FLAGS_use_ngraph=True, test=develop (#19778)
5 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
5 years ago
chengduo 8281497030 Fix warning info of build_strategy (#19805)
5 years ago
Zeng Jinle b34933d9ee fix retry allocator bug, test=develop (#19794)
5 years ago
Yiqun Liu c67c8758cb Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
5 years ago
Zeng Jinle 32b1151f5e reduce default value of cudnn workspace size, test=develop (#19780)
6 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
6 years ago
zhongpu 2a81c3679a add kernel for unstack_op, test=develop (#19538)
6 years ago
Chen Weihang 00d5375e0c Add prune_backward function to cover complicated test_program.clone situation (#19772)
6 years ago
Kaipeng Deng 99c78b772a fix softmax axis!=-1. test=develop (#19800)
6 years ago
tianshuo78520a 38f1c2fe28 change approve site (#19791)
6 years ago
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
6 years ago
Yihua Xu 0d6ea52958 Fix the definition issue when used mkl_scsrmm and mkl_dcsrmm functions. (#19774)
6 years ago
chengduo 056fdedde3 Open fuse all reduce option (#19765)
6 years ago
Aurelius84 8c7e411908 Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
JesseyXujin e352467c1c modify activation op API, delete use_cudnn args, test=develop, (#19758)
6 years ago
Jacek Czaja 9e4c958552 Refactoring activation mkldnn op (#19748)
6 years ago
Huihuang Zheng 12542320c5 Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Zeng Jinle 0daa5c9772 Make leaky relu inplacable (#19676)
6 years ago
Zeng Jinle 078a678219 refine math_op_patch, test=develop (#19727)
6 years ago
chengduo e506c99c20 Open fuse broadcast option (#18833)
6 years ago
Jacek Czaja 47f670d58c - Softmax mkl-dnn refactoring (#19615)
6 years ago
Yiqun Liu a65c728e5d Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0 Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
chengduo 5866a7a5fe Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
6 years ago
Youwei Song 3e5fb6361b fix api-doc error for dygraph and backward (#19721)
6 years ago
Tao Luo ec9bc1bd9f paddle::framework::vectorize() templatization (#19730)
6 years ago
Zeng Jinle bb4f8dee83 add logs to left var memory size, test=develop (#19722)
6 years ago
Adam 428b2b9e17 MKLDNN handler cleanup (#19713)
6 years ago
XiaoguangHu 27235cf222 Add document annotations for FLAGS that need to be open to external developers test=develop (#19692)
6 years ago
Zeng Jinle 1c25c88aba refine memory usage of some operators, test=develop (#19700)
6 years ago
wangguanzhong 25dcd74d34 merge empty lod tensor, test=develop (#19228)
6 years ago
yaoxuefeng c6756ed225 fix instag op (#19591)
6 years ago
gongweibao 6c2bc29cc0 Fix float16 optimizer. (#19682)
6 years ago
Zeng Jinle 713c05dd60 refine tensor.mutable_data, test=develop (#19680)
6 years ago
Chen Weihang c78a4781bf Fix train error when test_program.clone is executed after optimizer.minimize (#19397)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Zeng Jinle a7691603a5 add gpu_allocator_try_time config, test=develop (#19675)
6 years ago
JesseyXujin 0b06db9413 delete transmission args in linear_chain_crf op (#19619)
6 years ago
Tao Luo f05d2c519d paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
hutuxian 1ca6ea0318 fix cmakelist deps (#19668)
6 years ago
Tao Luo bcddbc78d4 remove -Wmaybe-uninitialized warning (#19653)
6 years ago
Zeng Jinle 2db40d9f60 reduce thread num of retry_allocator_test,test=develop (#19638)
6 years ago
wangchaochaohu 4440d7ced0 test=develop cuda realization of label smooth op (#19175)
6 years ago
chengduo 31c5a5ee26 Remove linear_chain_crf_op.cu (#19645)
6 years ago
123malin a25a716e87 Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
wangchaochaohu ed8f44ea21 codegen for fused elementwise operation (#19520)
6 years ago
Chen Weihang 73daa3d6c0 Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
123malin 2f037c3189 fix the diff between async mode and async_half mode (#19535)
6 years ago
Jiabin Yang e9233d1c1e Refactor dygraph (#19107)
6 years ago
mapingshuo dca9b6c5b0 add feed_var_names to Prune interface (#19589)
6 years ago
tangwei12 f45cb1c2ca fix bug of communicator flag, test=develop (#19635)
6 years ago
Yiqun Liu 42b5bec6f9 Integrate NVRTC to support compiling CUDA kernel at runtime (#19422)
6 years ago
Tao Luo 3ae939e48a unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
Leo Chen af692c9140 update reduce_sum and reduce_mean to save memory, test=develop (#19608)
6 years ago
tensor-tang e3e98ed678 fix scope lock bug on infer (#19624)
6 years ago
Aurelius84 6364ebc4dd Add distributions of Categorical and MultivariateNormal (#18263)
6 years ago
Zeng Jinle 710767d894 Enable inplace support for some ops (#19612)
6 years ago
FDInSky a18cf5e119 add a argument for softshrink python api (#19396)
6 years ago
Tao Luo d6c85c96dc paddle::framework::vectorize() templatization (#19627)
6 years ago
danleifeng 8672e15363 elementwise broadcast function enhancement (#19536)
6 years ago
Chen Weihang 8cb54ede8c Add user-friendly error message in optimizer ops to give a hint about the position sensitive problem of run(startup_program) (#19605)
6 years ago
zhongpu 118bb897cf add kernel for flatten_op, test=develop (#19472)
6 years ago
Tao Luo 0a46d34538 refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607)
6 years ago
baojun a3a4b6e570 Enable ngraph through build_strategy (#19266)
6 years ago
ShenLiang 2cd3fa3e9a add scatter_nd op and scatter_nd_add op (#19571)
6 years ago
wawltor 364c44422e Add the support the int64 data type of `scatter_op` input Index(#18804) (#19508)
6 years ago
Adam 8d6d95cc2b paddle::framework::vectorize() templatization (#19611)
6 years ago
Tao Luo 75d1571995 refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
Yiqun Liu c5548178b0 A a pass to enable the use of cudnn (#19346)
6 years ago
zhongpu cc443675e9 modify paddle_build.sh for Paddle python3 runtime image generation, test=develop (#19218)
6 years ago
Adam e94b26daf5 using MKLDNNMemoryFormat = mkldnn::memory::format changes (#19568)
6 years ago
Zeng Jinle e045aadf9a fix retry_allocator_test by removing glog envs, test=develop (#19596)
6 years ago
baojun f2ad30c4dd Some ngraph op and unittest fix (#19515)
6 years ago
Tao Luo 49523ea189 replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586)
6 years ago
gongweibao abaf87be2b Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
6 years ago
Zeng Jinle 578cccd48c fix parallel compilation error of allocator (#19581)
6 years ago
Zeng Jinle f4562c3468 fix typo of allocator, test=develop (#19578)
6 years ago
xiaoting 7a86706309 modified multiclass_nms example (#19553)
6 years ago
gongweibao 57f0f0f2dc Delete pserver complete file before executor running. (#19468)
6 years ago
JesseyXujin 4a7e6deb63 add padding in linear_chain_crf op (#19583)
6 years ago
Zeng Jinle 19474019c2 fix fast pe to run highest priority ops first, test=develop (#19575)
6 years ago
zhouwei25 84c728013c fix the compilation issue on windows caused by mkl_CSRMM (#19533)
6 years ago
mapingshuo f4ee60b7d0 Imdb train demo2 (#19572)
6 years ago
Zeng Jinle 0af8549750 fix seg fault of share lod, test=develop (#19573)
6 years ago
Jacek Czaja cef95ee30d [MKL-DNN] Refactoring Softmax (#19312)
6 years ago
Zeng Jinle 0a73f7202a Add retry_allocator for gpu (#19409)
6 years ago
hutuxian c756b5d231 Paddlebox Framework (#18982)
6 years ago
Zeng Jinle 5dce1da680 remove reset recordio usage (#19519)
6 years ago
ShenLiang 85914f7a88 add gather_nd op and unit test (#19366)
6 years ago
Jacek Czaja ecd9f330c9 [MKL-DNN] Fix to face model on AVX512 platforms (#19282)
6 years ago
GaoWei8 e8405e5c61 Modify the dropout op to multi-thread (#19504)
6 years ago
Huihuang Zheng 2916caa2c4 Change ugly PADDLE_ENFORCE_EQ in recurrent_op.cc (#19470)
6 years ago
Liufang Sang 9dde564097 change var name padding_num to padding_value (#19498)
6 years ago
Aurelius84 5b5379b32a Add sequence_topk_avg_pooling Op (#19442)
6 years ago
yaoxuefeng 10ca3f9609 add thread scope stat accurate metrics test=develop (#19480)
6 years ago
liuwei1031 d6cb1a4122 add dynamic C runtime support on windows, test=develop (#19502)
6 years ago
Bai Yifan 6d99842bb8 fix mean_iou api example, test=develop, test=document_preview (#19503)
6 years ago
Tao Luo 02270b3eb1 remove unused assert.h (#19529)
6 years ago
chengduo e340df013e Support feed single persistable variable to PE (#19417)
6 years ago
Yiqun Liu fcec365d29 Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
hong e169538886 fix kernel config bug in dygraph mode; test=develop (#19532)
6 years ago
Zeng Jinle c2c5b1b941 remove signal raise msg, test=develop (#19527)
6 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
6 years ago
Zeng Jinle 11f2f78458 fix sofmax seg fault in AVX, test=develop (#19487)
6 years ago
Thunderbrook 1fe468d319 support debug each output of each ins (#19004)
6 years ago
Zeng Jinle 5c8f210ce3 refine inplace inference registry, test=develop (#19032)
6 years ago
chengduo b6d1d8901f Increase num_iteration_per_drop_scope (#19075)
6 years ago
Double_V 1d0f04315a fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412)
6 years ago
Jiabin Yang 1ce0a09e60 fix con2d transpose bias by create and init it in build_once (#18968)
6 years ago
tangwei12 65c7368400 Fix the correctness of async mode at distributed training (#18863)
6 years ago
baojun 6421c61ae2 Update ngraph engine for multiple threading (#19155)
6 years ago
Zeng Jinle caf59d0f3f Add signal message to stderr (#19421)
6 years ago
Yi Liu efb05ba258 supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
Huihuang Zheng 56dd76538c Delete useless ex-scope in recurrent op (#19426)
6 years ago
wopeizl b8aa37d529 save the callstack information to file when exception throws test=dev… (#19324)
6 years ago
Aurelius84 a9cd513680 improve sequence_conv api doc (#19316)
6 years ago
joanna.wozna.intel 2e3ec66be0 Add conv dequant squash for int8 (#18905)
6 years ago
vincentXiyu 482ce818bb Support Tensor input with padding for warpctc op (#19322)
6 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ (#19349)
6 years ago
tianshuo78520a 8048992042 add cuda10 support in fast_install.sh and add dynamic get version for release (#19106)
6 years ago
liu zhengxi 32598ffd8f Python infer api update and add unit test (#19353)
6 years ago
mapingshuo d5ac87ec22 Lookahead optimizer (#19386)
6 years ago
Huihuang Zheng 12d29f4d2a Change TensorCopy in recurrent_op to ShareDataWith (#19319)
6 years ago
tangwei12 19dac67e9f fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
6 years ago
Yibing Liu 5d1575cfe8 Fix arg do_model_average in param_attr (#19376)
6 years ago
Tao Luo c82280e445 remove unused conv_elementwise_add2_act_fuse.cc (#19344)
6 years ago
chengduo 4278518fb0 Update CompiledProgram (#18919)
6 years ago
lidanqing 9240e5325c add local user data conversion into full_pascalvoc_test_preprocess.py (#19283)
6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Aurelius84 0d29cf18f4 Supports diagonal initialization in uniform_random op (#19299)
6 years ago
Tao Luo e3c68bde78 stronger the error message of tensor's mutable_data (#19303)
6 years ago
tianshuo78520a 188a5caf2e Split and enhance assert_api_spec_approvals (#19292)
6 years ago
chengduo a8a9823dae add memory profiler (#19320)
6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
wangguanzhong 37428952c6 fix generate mask fpn, test=develop (#19301)
6 years ago
zhaoyuchen2018 5296294dae Fix elementwise performance poor issue (#19278)
6 years ago
Tao Luo 6527a7df67 replace part of PADDLE_ASSERT to PADDLE_ENFORCE (#19285)
6 years ago
xiaoting 62facc7e47 fix yolo_box python example (#18925)
6 years ago
Yihua Xu b920395842 Use sparse matrix to implement fused emb_seq_pool operator (#19064)
6 years ago
wangchaochaohu 6e326ca2c6 optimize the realization of cuda dropout (#19136)
6 years ago
Zhaolong Xing 76c95af000 Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
qingqing01 5fc8de449a Remove warning in batch_norm_op (#19260)
6 years ago
lvmengsi d08d5ab519 Fix the mistake of convolution (#19274)
6 years ago
Aurelius84 78a3d837f8 Add match_matrix_tensor op (#18525)
6 years ago
Zeng Jinle 5b6673c44d merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
liuwei1031 50582071dc fix compilation issue in windows vs2017 (#19183)
6 years ago
zhang wenhui 539c870753 add fl_listen_and_serv &fl_transpiler,test=develop (#19091)
6 years ago
juncaipeng 5368b36512 remove the warning for reminding user to avoid using the OriginProgram method, test=develop (#19244)
6 years ago
silingtong123 af0fbd9012 change PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS (#19205)
6 years ago
Zeng Jinle 91a0911ca3 Make PADDLE_ENFORCE_EQ support types that cannot be converted to std::string (#19243)
6 years ago
chengduo 8a89ca94ce Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
6 years ago
gongweibao fd4b15a2f6 Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
silingtong123 a94a25867d imporve the doc of decorate_reader API (#19206)
6 years ago
zhongpu c27b081397 modify paddle/scripts/fast_install.sh about mac installation, test=develop (#19187)
6 years ago
Kaipeng Deng 2848cb791e fix temporal_shift OP PADDLE_ENFORCE. test=develop (#19161)
6 years ago
Tao Luo 2f8c7e021f remove unused inference_transpiler unit-tests (#19130)
6 years ago
Zeng Jinle 708bd9798d move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
Zeng Jinle 002f325dcd add PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#19211)
6 years ago
lidanqing 07a4d8f8d6 Fix mAP problem in unit test of int8 object detection test (#18946)
6 years ago
Hao Wang d53fa53b65 CI - Improve example code check (#19170)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
Yibing Liu 50b1cab122 Add padding support for crf_decoding (#19057)
6 years ago
Aurelius84 45fb031f6b remove is_test param of FC test=develop (#19209)
6 years ago
liym27 c8cdef37b2 change the default value of summarize from -1 to 20 in Print API to improve ease of use (#18738)
6 years ago
Yiqun Liu 77572b70cb Enhance the error message when GrapOpMaker is null. (#19070)
6 years ago
lvmengsi c6f163cd7a add description of sync_bn (#19056)
6 years ago
chengduo b5ba801ef0 Fix gather op bug (#19168)
6 years ago
Zeng Jinle 0f9b33954a move python reader api to fluid.io module, test=develop (#19143)
6 years ago
Leo Chen 80eab822c1 Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166)
6 years ago
chengduo c70a97f46e Use CUDAPinnedPlace in buffered_reader (#19112)
6 years ago
jiaqi b104ea0684 add get_last_save_xbox_base/get_last_save_xbox (#19122)
6 years ago
joanna.wozna.intel 492a00f53e Add conv reqantize squash (#18754)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
zhongpu 2e76e75517 modify paddle/scripts/fast_install.sh about Mac installation to support paddle version check on MacOS (#19108)
6 years ago
Tao Luo 5f5648a8ff Revert "Python inference API support numpy (#19009)" (#19160)
6 years ago
wawltor 0019eb376a Fix the error of op `ones_like` document,change the output variable
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
joanna.wozna.intel bce72c7fea Replace Relu with bounded Relu in MobileNetV2 quantization (#18988)
6 years ago
chengduo e044e84264 open fuse_all_optimizer_ops (#19087)
6 years ago
wangguanzhong 1fc242a7ed refine infer shape in box decoder and assign op, test=develop (#19118)
6 years ago
gongweibao 29d8781240 Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
flame b7e1a1d7e7 Python inference API support numpy (#19009)
6 years ago
wopeizl 80b7ef6fc8 add tensorrt support for windows (#19084)
6 years ago
Kevin 744279fe68 Refine embedding Api doc (#18820)
6 years ago
Kevin 945f3cf631 fix code too big test=develop (#19111)
6 years ago
Tao Luo 4a959883e7 remove unused aws_benchmarking and go directory (#19103)
6 years ago
yaoxuefeng 9150cf50fc add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
hutuxian 5a80cc8431 Datafeed support reading to cuda place directly. (#19071)
6 years ago
Zeng Jinle 88f111f885 remove unused inplace act codes, test=develop (#19079)
6 years ago
tianshuo78520a cfa6305303 Add check PADDLE_ENFORCE approval (#19088)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693 Add trilinear_interp OP (#18711)
6 years ago
Zhang Ting c2063217e7 optimize error message for "embedding" and "cross_entropy" OP (#18765)
6 years ago
Tao Luo 741ce8bb1a inference_shared_library support profile (#16275)
6 years ago
chengduo 17d62ab220 Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69 Add call stack info during compile time (#19067)
6 years ago
jiaqi a99bc64c63 add fleet util, add some interface in hdfs util (#18752)
6 years ago
mapingshuo 4ad7c9d5a7 [WIP] Add Imdb train demo (#18895)
6 years ago
tianshuo78520a 0b1025769c Add op_use_default_grad_op_maker.spec approval (#19035)
6 years ago
wangguanzhong e50f527fee update roi doc in roi_pool and roi_align (#19036)
6 years ago
jiaqi fc038da749 fix QueueDataset queue size (#19016)
6 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Kaipeng Deng 1f46253d4a fix natural exp decay doc. test=develop (#19025)
6 years ago
tianshuo78520a be3f469ad1 CI Add Reviewer Rules for large PRs (modify 20+ files or add 1000+ lines) (#19033)
6 years ago
Yiqun Liu a445c33552 Add the check of lod in sequence_softmax kernel. (#18996)
6 years ago
Zeng Jinle 2175d19993 fix memory_reuse_pass memory_size calculation error, test=develop (#19020)
6 years ago
tianshuo78520a de975be1ec change op_use_default_grad_op_maker.spec approval member (#19029)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Chen Weihang 81fe02c3fe Fix config description error in cuda_profiler function document (#18750)
6 years ago
SunGaofeng 4da1c4f15d fix g_param shape mismatch in WeightNormParamAttr (#18940)
6 years ago
liuwei1031 a43a763b54 fix warpctc.dll not found issue (#18761)
6 years ago
chengduo 01c7daade7 Add checking for the fetch_list of Executor.run (#18957)
6 years ago