Commit Graph

2544 Commits (9901f69677e9b85f1a5b8f6ac97ea1f3e2887375)

Author SHA1 Message Date
zhang wenhui 4a3c4b8fa4
add fleet_desc config feature & multi_sparse table, test=develop (#18827)
6 years ago
Leo Chen d49c2bad71 update inplace grad test to new CompiledProgram API, test=develop (#19359)
6 years ago
chengduo 4278518fb0
Update CompiledProgram (#18919)
6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Aurelius84 0d29cf18f4
Supports diagonal initialization in uniform_random op (#19299)
6 years ago
chengduo 5a579df9ba
[Speedup] Make dygraph data parallel faster (#19280)
6 years ago
chengduo 6a1632318d
Split test_parallel_executor_seresnext to three unit test (#19239)
6 years ago
lidanqing 3fdecc19b7 Add elementwise_mul_mkldnn UT with [conv + elt_mul + conv] (#19191)
6 years ago
danleifeng 0865b5a9a0 distribute launch : add use_paddlecloud argument (#19273)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
zhang wenhui 539c870753
add fl_listen_and_serv &fl_transpiler,test=develop (#19091)
6 years ago
kh2se2013 27e85625b8 add python coverage launch when WITH_COVERAGE=ON (#19264)
6 years ago
chengduo 8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT (#19251)
6 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
zhaoyuchen2018 0c71c839ec
Fix recurrent op not update grade issue (#18581)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
Yibing Liu 50b1cab122
Add padding support for crf_decoding (#19057)
6 years ago
LielinJiang 1331c9e1f8 fix distributions unittest bug, test=develop (#19012)
6 years ago
lidanqing c548e370f1 UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
Zeng Jinle c194b0c835
Try to deprecate unstable python memory optimize (#18983)
6 years ago
hutuxian 5a80cc8431
Datafeed support reading to cuda place directly. (#19071)
6 years ago
chengduo 3f4c088ad8
prune the feed op in compiler (#18997)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Zeng Jinle 311f90f1eb
reduce_unittest_time,test=develop (#19005)
6 years ago
Liufang Sang faf6890b6c support tensor input for ctc align op (#18887)
6 years ago
Dong Daxiang c97ea53c3e make listen and server as exclusive run (#18990)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
wawltor 3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique (#18720)
6 years ago
LielinJiang 22fa4c2d24 Fix depthwise conv gpu kernel bug (#18582)
6 years ago
HaoRen 24f8543106 Add center Loss Op Support (#18681)
6 years ago
Dong Daxiang 2bb296dfe9
make dist unit test exclusive run (#18865)
6 years ago
danleifeng e0a2d4dfec Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848)
6 years ago
chengduo ecd2bdada6
add CPUInplaceTestWithFuseOptimizationOps (#18867)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Zeng Jinle 9a8a7a1ddc
fix affine_channel no_need buffer bug, test=develop (#18844)
6 years ago
chengduo 4140fe11a4
Open fuse optimization ops (#18741)
6 years ago
Adam ee02227949 Add LeakyReLU MKLDNN support (#18762)
6 years ago
Zeng Jinle a802da650b
Feature/mem opt pass refactor (#18735)
6 years ago
guru4elephant 2efb282c86
split test_dist_se_resnext.py into 4 testcases (#18743)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
whs 075e1cf78e
Add python API for appending LoD level (#18702)
6 years ago
chengduo 8259f1418f
Enhance backward process (#18700)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Yi Liu 157211c4e1
supports distributed classification (#18690)
6 years ago
Huihuang Zheng a3028bb729
Fix random test_recurrent_op failure (#18718)
6 years ago
Tao Luo bd22453f20
Revert "Add LeakyRelu MKLDNN support (#18656)" (#18723)
6 years ago
guru4elephant ebf9797ec3
split different comm method for mnist distributed training (#18715)
6 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
6 years ago
Adam d6b6a337a9 Add LeakyRelu MKLDNN support (#18656)
6 years ago
tangwei12 0b9acb4912
add check of executor (#17986)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
hutuxian bb2f5d24a2
hash_op support int64 hash_size (#18674)
6 years ago
guru4elephant ab57d3893e
make auc op compatible with 1 dim (#18551)
6 years ago
guru4elephant b71b454328 increase timeout again (#18628)
6 years ago
123malin b414645a65
fix #17430: int64类型的attr训练非预期 (#18264)
6 years ago
kh2se2013 9ad57f2dfd 1)change to parallel mode on python coverage run (#18594)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
lujun b6d5c74f69
update dygraph api doc for web (#18550)
6 years ago
Physher 0caa08ea40 Add mkldnn int8 mul-op kernel (#17834)
6 years ago
LielinJiang 24d1c44a0c Fix roi_perspective_transform_op bug (#18522)
6 years ago
zhaoyuchen2018 832d8191ff
Fix topk cannot handle 1D vector bug (#18466)
6 years ago
Jiabin Yang 7586cdd545
Hide no support (#18515)
6 years ago
LielinJiang 43e17c7951 Add distributions of normal and uniform (#18023)
6 years ago
qingqing01 602cb6a5b4
Enhance linear_lr_warmup (#18463)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
zhoukunsheng 7c6f2350b9 support Tensor input for edit_distance op (#18162)
6 years ago
zhoukunsheng 26318544d2 support Tensor input for chunk_eval op (#18226)
6 years ago
zhoukunsheng 206c44e2a8 add unique kernel and op (#17557)
6 years ago
zhoukunsheng 71af72b1c2 upgrade hash op to support Tensor and LoDTensor input (#17998)
6 years ago
zhoukunsheng d3b3443d10 add ones_like op (#17388)
6 years ago
zhoukunsheng 67b48d7fe7 add size op (#17412)
6 years ago
hutuxian 6e0df3102e
Refactor for Pipeline Thread Check (#18459)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
chengduo e0d8c6ac68
Add find_no_grad_vars in backward.py (#17942)
6 years ago
LielinJiang 449c7a9f98 Make roi_perspective_transform op return mask and transform matrix (#18371)
6 years ago
kh2se2013 27fb9cad65 add WITH_COVERAGE option, default OFF (#17872)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
hutuxian e42057cd1a
add ut for pipeline training (#18289)
6 years ago
Jiabin Yang bd61d89925
test=develop, recover ocr ut on dygraph (#18166)
6 years ago
Yibing Liu 23941e43ec
Update lamb optimizer (#18333)
6 years ago
Jiabin Yang 79bcdbbf1a test=develop, disable basic gru related ut (#18329)
6 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
6 years ago
chengduo d54e13bbec
add random seed for recurrent op test (#18274)
6 years ago
xiaoting b58bb80248 set src_idx > 0 for bilinear_interp_op (#18238)
6 years ago
guru4elephant 7d76e34ec2
add more print function for timeout issue, make timeout value larger (#18219)
6 years ago
Hongyu Liu cefd0fb598
Fix slice op shape=-1 bug (#18107)
6 years ago
Jiabin Yang b3cbc5be76
test=develop, fix test_imperative_transformer and ocr (#18127)
6 years ago
Qiao Longfei 778f6acf4d
disable test_async_ssa_graph_executor_mnist test=develop (#18165)
6 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
6 years ago
Jiabin Yang 991c94f135
test=develop, add add_multi_gpu_install_check (#18157)
6 years ago
chengduo 4978db2c10
Remove nccl dep when the number of GPU is 1 (#18158)
6 years ago
Zeng Jinle 6eec66a1b1
Fix py_reader iterable bug (#18108)
6 years ago
qingqing01 80d2e66f9e
Update backward appending stragety to support double backward and fix some bug. (#18104)
6 years ago
FlyingQianMM ff83655f7e
add detection output operator for supporting retinanet (#17896)
6 years ago
guru4elephant 0941e3e013
add class name and timeline for test_dist_base.py (#18122)
6 years ago
FlyingQianMM 0aee1f0074
add sigmoid focal loss operator for supporting retinanet (#17895)
6 years ago
FDInSky 9e4b9d9798 Update generate_proposal_labels_op to support CascadeRCNN. (#17200)
6 years ago
FlyingQianMM 9ed2f936f1
add target assign operator for supporting retinanet (#17893)
6 years ago
chengduo 24e988a471
Fix bug of scope_buffered_ssa_graph_executor (#18100)
6 years ago
wopeizl 26a7c1a396
add unit test to cover all parameters for print op test=develop (#18089)
6 years ago
guru4elephant b2cfdc3891
Refine unittest log (#18084)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
chengduo b5a1c1463d
Update CPU_NUM config (#18059)
6 years ago
tensor-tang 566bf2ec56
concat op support negative axis (#18045)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
wawltor 8eb134c3c1
Fix scatter and gather op when has duplicate index (#17952)
6 years ago
Huihuang Zheng 0bf2535158
Cherry-pick: fix random CI failure. (#18011)
6 years ago
Kaipeng Deng 96ee528e3e
fix logging basicConfig cannot be setting after import paddle (#17786)
6 years ago
cjt222 871af28d6c
add deformable psroi pooling (#17827)
6 years ago
SunGaofeng 40885c225b
add unfold op (new op),test=develop (#17944)
6 years ago
Huihuang Zheng 9f519bafe7
Ignore a unit test which failed on cuda9/10 python3 ci task (#17950)
6 years ago
Yibing Liu 33d1e56506
Enable seq_pool op to accept len 0 input (#17284)
6 years ago
Hongyu Liu 2a9d74f67c
Add comment for dygraph api (#17869)
6 years ago
Hongyu Liu 8062bd510c
Reshape support tensor attribute (#17781)
6 years ago
gongweibao f3e5a5cf67
Unset https_proxy and http_proxy in our launch.py (#17915)
6 years ago
Jiabin Yang fba10b6bb5
test=develop, refine api (#17883)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
Jiabin Yang 4cb7d32c9b
test=develop, add dygraph_not_support and refine ocr (#17868)
6 years ago
Huihuang Zheng 83e51ded21
SERIAL flaky imperative unit tests for CI cuda9 (#17892)
6 years ago
Jiabin Yang 3bfb92c32b
test=develop, hide build_once (#17871)
6 years ago
Jiabin Yang 022dfed4fc
Add optimizer save and load (#16986)
6 years ago
pawelpiotrowicz 39bc8a55a4 [NGraph] Enable ngraph layer_norm operator (#17599)
6 years ago
gongweibao 6a1df46991
Fine tuning launch.py (#17223)
6 years ago
wopeizl 841553e13f
use pyreader to read data in dygraph mode (#17314)
6 years ago
Jiabin Yang 3d3f5506d2
Feature/Fix recurrent usage of Varbase in Dygraph (#17838)
6 years ago
Jiabin Yang eaf049c4b8 test=develop, refine ocr attention model (#17763)
6 years ago
Hongyu Liu dfec676270
expand op supprt tensor attribute (#17773)
6 years ago
Jiabin Yang 3b70f870e2
Using Smart pointer to optimizer memory usage of dyGraph (#17768)
6 years ago
Hongyu Liu 82358bfdc1
ont hot support tensor depth (#16972)
6 years ago
mozga-intel 6a6bf597f7 [NGraph] Enable elementwise_div operator test=develop (#17515)
6 years ago
Zeng Jinle 3a6ead24ad
Add no_grad decorator to dygraph (#17790)
6 years ago
lilong12 bfcc97d924
Split the unittest test_dist_mmist into multiple unittests (test_dist_mnist, test_dist_mnist_nccl and test_dist_mnist_lars) to avoid timeout (#17707)
6 years ago
guru4elephant d52391094d
fix prepare context redundant code problem, optimize executor by cach… (#17743)
6 years ago
baojun 2c58f1a83c [NGraph] Added lookup table to ngraph engine test=develop (#17647)
6 years ago
pawelpiotrowicz bacc822492 [NGraph] Enable transpose ngraph operator (#17636)
6 years ago
baojun 90eae0b39a [NGraph] Addded slice op to ngraph test=develop (#17648)
6 years ago