Commit Graph

5218 Commits (b82e6520e1339e4127a7ecf0b05ee598312f5b6d)

Author SHA1 Message Date
Yang Zhang ebff68fa74
Add float16 support to `sync_batch_norm_op` (#19681)
5 years ago
Aurelius84 039b9710d5
Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735)
5 years ago
whs bdb3e376d0
[PaddleSlim] Enhence compressor api in PaddleSlim (#19894)
5 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
5 years ago
Kaipeng Deng 14625ffe9e
add elementwise mod support float/double. test=develop (#19570)
5 years ago
Ghost Under Moon 790d5226b5 give warnings when save a model without any parameters (#19931)
5 years ago
Zeng Jinle f254b477d1
Add py_reader combination unittest (#19923)
5 years ago
Leo Chen 57606205f5 Make OpTest check grad inplace even if forward has no inplace (#19847)
5 years ago
juncaipeng b0ceed6fb4
add fake_quant_dequant_op for average pool2d, test=develop (#19880)
5 years ago
Zhang Ting cb8f3c03a7 resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914)
5 years ago
mapingshuo 9901f69677
Forward recompute3 (#19913)
5 years ago
chengduo d7251a8e1e
Delete local execution scopes (#19749)
5 years ago
Ghost Under Moon 4836ee68fb warning when user save a inference model which contains auc op test=develop (#19838)
5 years ago
wopeizl e606b1754e
optimize the error information when the input for while op has a wron… (#19872)
5 years ago
ruri d31c92a2cd
add mse_loss (#19759)
5 years ago
Tao Luo a4919d3688
move tree_conv to fluid.contrib.layers (#19918)
5 years ago
Zeng Jinle 0436efd6a3
Unify DataLoader APIs (#19305)
5 years ago
tangwei12 278dd00322
paddle cloud role maker fix (#19646)
5 years ago
lvmengsi 4155e62559
add instance norm (#19500)
6 years ago
Adam cb65439da8 Add support for other axes in MKLDNN softmax op (#19907)
6 years ago
Jiabin Yang 454254115e
Feature/auto prune in dygraph (#19757)
6 years ago
Aurelius84 418a0967f3
move match_matrix var_conv2d et.al api into fluid.contrib test=develop (#19859)
6 years ago
Zeng Jinle e237275017
add py_reader may be deprecated msg, test=develop (#19891)
6 years ago
Zeng Jinle cee0079a4d
fix readers bug, test=develop (#19868)
6 years ago
Aurelius84 fcf53e55ff
support 2-level lod of input in sequence_pool (#19839)
6 years ago
chengduo ae31faaa87
refine optimier function (#19886)
6 years ago
Zhang Ting 93364b45c1 group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614)
6 years ago
Zhang Ting 439d95e157 modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287)
6 years ago
Zhang Ting b38889413d add crop_tensor_op, test=develop, test=document_preview (#19314)
6 years ago
chengduo 1f68674483
refine executor bug info (#19887)
6 years ago
flame fe18cfdb4f
hide with inference optim API (#17355)
6 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
6 years ago
gongweibao e8d3745c0f
change _origin_program test=develop (#19863)
6 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
WangXi 8c2c8dc626 distribute.launch use poll to query subprocess (#19853)
6 years ago
chengduo 8e9273272f Disable test_dygraph_mnist_fp16.py (#19844)
6 years ago
Jie Fang d9db94d752 Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
6 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
6 years ago
Zeng Jinle 13ca364ceb
remove some flags and add comments to some flags, test=develop (#19813)
6 years ago
Huihuang Zheng 3e1e1fee26
Return correct currrent block of a var (#19850)
6 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
6 years ago
Bai Yifan e2c6bada36 Support dispensable student_loss in PaddleSlim distillation (#19824)
6 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
6 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
6 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
6 years ago
chengduo b99fc38cec
Add fp16 support for dygraph (#19828)
6 years ago
Leo Chen 5fbf03d6b4 Enhance OpTest to support double grad inplace check (#19826)
6 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
6 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
6 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
6 years ago
liym27 e9e3c08777 fix expand op: (#19302)
6 years ago
xujiaqi01 6bf298bf09
support preload thread, optimize hdfs log, fix master+patch bug (#19695)
6 years ago
Jiabin Yang cc311bdf95
Feature/add transform data dygraph (#19707)
6 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
6 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
6 years ago
chengduo 2729c17446
Fix example error of Variable and Operator (#19821)
6 years ago
ruri a0e9b7b9a2
add unittest for square error cost op (#19746)
6 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
6 years ago
Chen Weihang 00d5375e0c
Add prune_backward function to cover complicated test_program.clone situation (#19772)
6 years ago
tangwei12 6a1db2044c
fix sync_with_distributed_lookup_table, test=develop (#19737)
6 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
JesseyXujin e352467c1c
modify activation op API, delete use_cudnn args, test=develop, (#19758)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
chengduo c308c88d71
Fix test_parallel_executor_test_while_train (#19723)
6 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
Youwei Song 3e5fb6361b fix api-doc error for dygraph and backward (#19721)
6 years ago
Thunderbrook bda92434db remove trainer desc test in windows temporarily (#19753)
6 years ago
Youwei Song 810a33c8bd fix dygraph partitial backward problem (#19625)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Bai Yifan aa63d5ac6d Make the distillation process not save teacher variables in PaddleSlim (#19633)
6 years ago
Zeng Jinle bb4f8dee83
add logs to left var memory size, test=develop (#19722)
6 years ago
chengduo 2c30e64b2f
increase timelimit test_pe_serexnext (#19702)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
6 years ago
Chen Weihang c78a4781bf
Fix train error when test_program.clone is executed after optimizer.minimize (#19397)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Zeng Jinle a7691603a5
add gpu_allocator_try_time config, test=develop (#19675)
6 years ago
JesseyXujin 0b06db9413
delete transmission args in linear_chain_crf op (#19619)
6 years ago
123malin a25a716e87
Optimize fleet API: add input check for some interfaces (#18971)
6 years ago
Aurelius84 25c0eb28cc
fix unittest random failed test=develop (#19654)
6 years ago
chengduo 5c4eb394e9 Make test_pe_seresnext serial (#19634)
6 years ago
chengduo ac34aac33f Disable the test_parallel_executor_test_while_train temporarily (#19641)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
Jie Fang c6a598a276 init new amp, optimize inserting cast op for batchnorm (#18596)
6 years ago
123malin 2f037c3189
fix the diff between async mode and async_half mode (#19535)
6 years ago
Jiabin Yang e9233d1c1e Refactor dygraph (#19107)
6 years ago
mapingshuo dca9b6c5b0 add feed_var_names to Prune interface (#19589)
6 years ago
whs 3abe84500e
Fix data type of variable in edit distance evaluator (#19618)
6 years ago
Aurelius84 6364ebc4dd
Add distributions of Categorical and MultivariateNormal (#18263)
6 years ago
Zeng Jinle 710767d894
Enable inplace support for some ops (#19612)
6 years ago
FDInSky a18cf5e119 add a argument for softshrink python api (#19396)
6 years ago
danleifeng 8672e15363 elementwise broadcast function enhancement (#19536)
6 years ago
zhongpu 118bb897cf add kernel for flatten_op, test=develop (#19472)
6 years ago
baojun a3a4b6e570 Enable ngraph through build_strategy (#19266)
6 years ago
ShenLiang 2cd3fa3e9a add scatter_nd op and scatter_nd_add op (#19571)
6 years ago
wawltor 364c44422e
Add the support the int64 data type of `scatter_op` input Index(#18804) (#19508)
6 years ago
zhongpu 4d26274d25 add detach API for Variable in dygraph mode, test=develop (#19477)
6 years ago
whs 1c2aae567a
Skip start epoch and end epoch when dumping strategy in PaddleSlim (#19580)
6 years ago
hutuxian 66ad68ed7b
Update UT test_boxps (#19599)
6 years ago
baojun f2ad30c4dd Some ngraph op and unittest fix (#19515)
6 years ago
Tao Luo 49523ea189
replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586)
6 years ago
gongweibao abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
6 years ago
Zeng Jinle 635cd62d23
remove deprecated memory_optimize usages, test=develop (#19579)
6 years ago
Youwei Song 9a577f2e41 fix batchnorm api param: data_layout (#19524)
6 years ago
xiaoting 7a86706309 modified multiclass_nms example (#19553)
6 years ago
gongweibao 57f0f0f2dc
Delete pserver complete file before executor running. (#19468)
6 years ago
JesseyXujin 4a7e6deb63
add padding in linear_chain_crf op (#19583)
6 years ago
hutuxian c756b5d231
Paddlebox Framework (#18982)
6 years ago
Zeng Jinle 5dce1da680 remove reset recordio usage (#19519)
6 years ago
ShenLiang 85914f7a88 add gather_nd op and unit test (#19366)
6 years ago
Jacek Czaja ecd9f330c9 [MKL-DNN] Fix to face model on AVX512 platforms (#19282)
6 years ago
Liufang Sang 9dde564097 change var name padding_num to padding_value (#19498)
6 years ago
Aurelius84 5b5379b32a
Add sequence_topk_avg_pooling Op (#19442)
6 years ago
chengduo 1cdd3b6985
Disable GC in test_parallel_exe_seresnext (#19408)
6 years ago
yaoxuefeng 10ca3f9609
add thread scope stat accurate metrics test=develop (#19480)
6 years ago
Bai Yifan 6d99842bb8
fix mean_iou api example, test=develop, test=document_preview (#19503)
6 years ago
Bai Yifan 8394699dbb add stop_gradient in range_api, test=develop (#19484)
6 years ago
chengduo e340df013e
Support feed single persistable variable to PE (#19417)
6 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
6 years ago
Thunderbrook 1fe468d319
support debug each output of each ins (#19004)
6 years ago
zhang wenhui bd35a7f0a6
support fc sort by number, test=develop (#19466)
6 years ago
Double_V 1d0f04315a fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412)
6 years ago
Jiabin Yang 1ce0a09e60
fix con2d transpose bias by create and init it in build_once (#18968)
6 years ago
Yi Liu 4ef6b8457a
adapte fleet api for localsgd and support nccl comm configuration in executor (#19443)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
chengduo e26411cec2
Open test_parallel_dygraph_se_resnext (#19342)
6 years ago
Yi Liu efb05ba258
supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
Aurelius84 a9cd513680
improve sequence_conv api doc (#19316)
6 years ago
zhang wenhui 0d7949831b
fix fleet_desc bug && support format for abacus hotstart (#19430)
6 years ago
vincentXiyu 482ce818bb Support Tensor input with padding for warpctc op (#19322)
6 years ago
chengduo bfb6ac816e
Fix optimizer bug (#19410)
6 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ (#19349)
6 years ago
liu zhengxi 32598ffd8f
Python infer api update and add unit test (#19353)
6 years ago
chengduo 11070cbff9
enabel seresnext reduce test (#19341)
6 years ago
Ghost Under Moon 10643b4ea6 fix- raise io error when user load from non-existed dir test=develop (#19384)
6 years ago
mapingshuo c2e5eaa27d
delete recordio writer (#19406)
6 years ago
mapingshuo d5ac87ec22
Lookahead optimizer (#19386)
6 years ago
silingtong123 da127d1110 Optimized error reporting information (#19173)
6 years ago
Jiabin Yang 55931db449
fix problem that get_attr method can't using default mode when we call has_attr in dygraph (#19328)
6 years ago
tangwei12 19dac67e9f
fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
6 years ago
Yibing Liu 5d1575cfe8
Fix arg do_model_average in param_attr (#19376)
6 years ago
zhang wenhui 4a3c4b8fa4
add fleet_desc config feature & multi_sparse table, test=develop (#18827)
6 years ago
Jiancheng Li 1799c257ad Update Light-NAS to support latency-aware search (#19050)
6 years ago
Zhen Wang 0fe72469ea
Add the max-pool2d quantization support and the partial quantization support. (#19310)
6 years ago
Leo Chen d49c2bad71 update inplace grad test to new CompiledProgram API, test=develop (#19359)
6 years ago
Yibing Liu b2c4f76cf2
Fix sequence mask in dygraph (#19271)
6 years ago
chengduo 4278518fb0
Update CompiledProgram (#18919)
6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Aurelius84 0d29cf18f4
Supports diagonal initialization in uniform_random op (#19299)
6 years ago
chengduo 5a579df9ba
[Speedup] Make dygraph data parallel faster (#19280)
6 years ago
chengduo 6a1632318d
Split test_parallel_executor_seresnext to three unit test (#19239)
6 years ago
Zeng Jinle 561232c25a
remove is_mem_optimized in Program, test=develop (#19307)
6 years ago
lidanqing 3fdecc19b7 Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191)
6 years ago
xiaoting 62facc7e47 fix yolo_box python example (#18925)
6 years ago
danleifeng 0865b5a9a0 distribute launch : add use_paddlecloud argument (#19273)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
lvmengsi d08d5ab519
Fix the mistake of convolution (#19274)
6 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
zhang wenhui 539c870753
add fl_listen_and_serv &fl_transpiler,test=develop (#19091)
6 years ago
kh2se2013 27e85625b8 add python coverage launch when WITH_COVERAGE=ON (#19264)
6 years ago
chengduo 8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
6 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
silingtong123 a94a25867d imporve the doc of decorate_reader API (#19206)
6 years ago
gongweibao 86f0591175
Remove node_num function. (#19167)
6 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
6 years ago
zhaoyuchen2018 0c71c839ec
Fix recurrent op not update grade issue (#18581)
6 years ago
Hao Wang d53fa53b65 CI - Improve example code check (#19170)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
Yibing Liu 50b1cab122
Add padding support for crf_decoding (#19057)
6 years ago
Aurelius84 45fb031f6b
remove is_test param of FC test=develop (#19209)
6 years ago
wuzewu 6fc1defd77 Fix compatibility issue of fluid.io.save_vars on windows platform (#19181)
6 years ago
liym27 c8cdef37b2 change the default value of summarize from -1 to 20 in Print API to improve ease of use (#18738)
6 years ago
LielinJiang 1331c9e1f8 fix distributions unittest bug, test=develop (#19012)
6 years ago
lvmengsi c6f163cd7a
add description of sync_bn (#19056)
6 years ago
Zeng Jinle 0f9b33954a
move python reader api to fluid.io module, test=develop (#19143)
6 years ago
jiaqi b86be13c15
fix default value (#19193)
6 years ago
jiaqi b104ea0684
add get_last_save_xbox_base/get_last_save_xbox (#19122)
6 years ago
jiaqi bfd514c730
fix default value of fleet desc (#19176)
6 years ago
lidanqing c548e370f1 UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
wawltor 0019eb376a
Fix the error of op `ones_like` document,change the output variable
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
Kevin 744279fe68
Refine embedding Api doc (#18820)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
Zeng Jinle c51eb6bb14
remove book_memory_optimization directory, test=develop (#19117)
6 years ago
Zeng Jinle c194b0c835
Try to deprecate unstable python memory optimize (#18983)
6 years ago
hutuxian 5a80cc8431
Datafeed support reading to cuda place directly. (#19071)
6 years ago
chengduo 3f4c088ad8
prune the feed op in compiler (#18997)
6 years ago
chengduo d23603322e
Remove compile from PE (#19080)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
jiaqi a99bc64c63
add fleet util, add some interface in hdfs util (#18752)
6 years ago