Commit Graph

9211 Commits (73daa3d6c0001855aea95cdd552b6398de50c349)

Author SHA1 Message Date
lidanqing c548e370f1 UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
wawltor 0019eb376a
Fix the error of op `ones_like` document,change the output variable
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
wopeizl 80b7ef6fc8
add tensorrt support for windows (#19084)
6 years ago
Kevin 744279fe68
Refine embedding Api doc (#18820)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
Zeng Jinle c51eb6bb14
remove book_memory_optimization directory, test=develop (#19117)
6 years ago
Zeng Jinle c194b0c835
Try to deprecate unstable python memory optimize (#18983)
6 years ago
hutuxian 5a80cc8431
Datafeed support reading to cuda place directly. (#19071)
6 years ago
chengduo 3f4c088ad8
prune the feed op in compiler (#18997)
6 years ago
chengduo d23603322e
Remove compile from PE (#19080)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
chengduo 21440b4d69
Add call stack info during compile time (#19067)
6 years ago
jiaqi a99bc64c63
add fleet util, add some interface in hdfs util (#18752)
6 years ago
mapingshuo 4ad7c9d5a7 [WIP] Add Imdb train demo (#18895)
6 years ago
wangguanzhong e50f527fee
update roi doc in roi_pool and roi_align (#19036)
6 years ago
Leo Chen 8f53735437 Fix memory overwriting of tensors returned by executor (#19030)
6 years ago
Youwei Song 95ff4fba61 specify the highest numpy version under python 2.x (#19018)
6 years ago
Kaipeng Deng 1f46253d4a
fix natural exp decay doc. test=develop (#19025)
6 years ago
LielinJiang e5b9753a18 Fix ExponentialMovingAverage api bug in python3, test=develop (#18775)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
Chen Weihang 81fe02c3fe
Fix config description error in cuda_profiler function document (#18750)
6 years ago
Zeng Jinle 311f90f1eb
reduce_unittest_time,test=develop (#19005)
6 years ago
lvmengsi 5d9df8c8c7
fix dropout (#18965)
6 years ago
SunGaofeng 4da1c4f15d
fix g_param shape mismatch in WeightNormParamAttr (#18940)
6 years ago
Jiabin Yang af63b1184c test=develop, fix memory leak in dygraph (#18998)
6 years ago
liuwei1031 a43a763b54
fix warpctc.dll not found issue (#18761)
6 years ago
chengduo 01c7daade7
Add checking for the fetch_list of Executor.run (#18957)
6 years ago
Liufang Sang faf6890b6c support tensor input for ctc align op (#18887)
6 years ago
Dong Daxiang c97ea53c3e make listen and server as exclusive run (#18990)
6 years ago
xsrobin 8ce902541c
fix unalign of some examples (#18943)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
hong f745d6d9e4
fix expand op dtype build bugs; test=develop (#18932)
6 years ago
jiaqi 02c370c3dc
support filelist size < trainer num && fix pull dense (#18956)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
jiaqi 768059b3a0
adjust ins weight according to nid slot (#18784)
6 years ago
wawltor 3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique (#18720)
6 years ago
LielinJiang 22fa4c2d24 Fix depthwise conv gpu kernel bug (#18582)
6 years ago
whs c92b78b060
Fix unitest of light nas. (#18931)
6 years ago
jiaqi 233746d89d
set fleet_send_batch_num a default value according to trainer num
6 years ago
chengduo 20859c08e8
[DyGraph] Make multi-card program faster (#18892)
6 years ago
HaoRen 24f8543106 Add center Loss Op Support (#18681)
6 years ago
lvmengsi d21c391447
replace paper link (#18861)
6 years ago
Dong Daxiang 2bb296dfe9
make dist unit test exclusive run (#18865)
6 years ago
whs 6cccab9203
Make lod_append support variable lod. (#18908)
6 years ago
danleifeng e0a2d4dfec Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848)
6 years ago
chengduo ecd2bdada6
add CPUInplaceTestWithFuseOptimizationOps (#18867)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
Thunderbrook 52c1431eee
add clear_model interface in fleetwrapper (#18815)
6 years ago
Zeng Jinle 9a8a7a1ddc
fix affine_channel no_need buffer bug, test=develop (#18844)
6 years ago
lvmengsi 829ef26281
Fix drop deconv (#18813)
6 years ago
chengduo 4140fe11a4
Open fuse optimization ops (#18741)
6 years ago
chengduo 582cc29799
add warning info for CPU_NUM (#18840)
6 years ago
Adam ee02227949 Add LeakyReLU MKLDNN support (#18762)
6 years ago
Zeng Jinle a802da650b
Feature/mem opt pass refactor (#18735)
6 years ago
石晓伟 9dbb62eeb9
Fix examples of API (#18092)
6 years ago
guru4elephant 30562e371b
refine launch_ps and role_maker (#18795)
6 years ago
fuyinno4 c167a4b4dd
Fix shrink-dense and add scale-datanorm (#18746)
6 years ago
guru4elephant 2efb282c86
split test_dist_se_resnext.py into 4 testcases (#18743)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
whs 075e1cf78e
Add python API for appending LoD level (#18702)
6 years ago
chengduo 8259f1418f
Enhance backward process (#18700)
6 years ago
JesseyXujin 25c9b57bcd
Modify auc doc. Add output variable description, previously was the scalar type, now changed to the tuple type.test=develop (#18771)
6 years ago
Thunderbrook d8396281ef
add slot to sparse table (#18686)
6 years ago
jiaqi d18aabb472
support patch data, add load_one_table, fix bug (#18509)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Yi Liu 157211c4e1
supports distributed classification (#18690)
6 years ago
guru4elephant 70b03760fd
add parameter server launch (#18687)
6 years ago
Zeng Jinle d07ad4c605
add more traceback to py_reader error msg, test=develop (#18722)
6 years ago
Huihuang Zheng a3028bb729
Fix random test_recurrent_op failure (#18718)
6 years ago
Tao Luo bd22453f20
Revert "Add LeakyRelu MKLDNN support (#18656)" (#18723)
6 years ago
tangwei12 d845848341
do some odd jobs (#18641)
6 years ago
guru4elephant ebf9797ec3
split different comm method for mnist distributed training (#18715)
6 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
6 years ago
Adam d6b6a337a9 Add LeakyRelu MKLDNN support (#18656)
6 years ago
tangwei12 0b9acb4912
add check of executor (#17986)
6 years ago
Zeng Jinle ae58afc546
Feature/auto_growth_allocator (#18561)
6 years ago
hutuxian bb2f5d24a2
hash_op support int64 hash_size (#18674)
6 years ago
guru4elephant 5ed713d519
remove ctr reader, all functions are satisfied in dataset (#18672)
6 years ago
guru4elephant ab57d3893e
make auc op compatible with 1 dim (#18551)
6 years ago
guru4elephant b71b454328 increase timeout again (#18628)
6 years ago
123malin b414645a65
fix #17430: int64类型的attr训练非预期 (#18264)
6 years ago
Kevin 995d7d8600 Modify embedding_op input dtype to int64 (#18598)
6 years ago
kh2se2013 9ad57f2dfd 1)change to parallel mode on python coverage run (#18594)
6 years ago
gongweibao c0a82748cf
Polish backwards optimizer dependency codes and use more default values. (#18255)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
tianshuo78520a 1c10dac4f2 Add code example in CI (#18228)
6 years ago
lujun b6d5c74f69
update dygraph api doc for web (#18550)
6 years ago
guru4elephant 9c17a899d7
upgrade collective fleet api (#18533)
6 years ago
bingyanghuang a25be53cb5 QAT int8 MKL-DNN transformation pass with MUL (#18322)
6 years ago
Physher 0caa08ea40 Add mkldnn int8 mul-op kernel (#17834)
6 years ago
LielinJiang 24d1c44a0c Fix roi_perspective_transform_op bug (#18522)
6 years ago
guru4elephant 1f1cc2221f
add random port (#18504)
6 years ago
zhaoyuchen2018 832d8191ff
Fix topk cannot handle 1D vector bug (#18466)
6 years ago
Jiabin Yang 7586cdd545
Hide no support (#18515)
6 years ago
LielinJiang 43e17c7951 Add distributions of normal and uniform (#18023)
6 years ago
tensor-tang 4828a5e008
core remove pycpuinfo (#18479)
6 years ago
qingqing01 602cb6a5b4
Enhance linear_lr_warmup (#18463)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
chengduo 55baeceddb
Enhance execution error info (#18482)
6 years ago
pkpk e9c7e218f2
Nan debugger init (#18401)
6 years ago
zhoukunsheng 7c6f2350b9 support Tensor input for edit_distance op (#18162)
6 years ago
zhoukunsheng 26318544d2 support Tensor input for chunk_eval op (#18226)
6 years ago
zhoukunsheng 206c44e2a8 add unique kernel and op (#17557)
6 years ago
zhoukunsheng 71af72b1c2 upgrade hash op to support Tensor and LoDTensor input (#17998)
6 years ago
zhoukunsheng d3b3443d10 add ones_like op (#17388)
6 years ago
zhoukunsheng 67b48d7fe7 add size op (#17412)
6 years ago
hutuxian 6e0df3102e
Refactor for Pipeline Thread Check (#18459)
6 years ago
Zeng Jinle 41ab76e55b add friendly error msg to py_reader (#18316)
6 years ago
Kaipeng Deng 823ab5e887
fix load attr error. test=develop (#18447)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
guru4elephant 357311fdb7
make fleet support mpi job submit directly (#18441)
6 years ago
chengduo e0d8c6ac68
Add find_no_grad_vars in backward.py (#17942)
6 years ago
LielinJiang 449c7a9f98 Make roi_perspective_transform op return mask and transform matrix (#18371)
6 years ago
tensor-tang a3bc804f5f
fix mac ci random fail (#18430)
6 years ago
xiaoting dd3f9d19cf replace mnist dataset url, test=develop (#18429)
6 years ago
xsrobin 47e2ef38e9
add "import paddle.fluid as fluid" to examples lack of it
6 years ago
hutuxian 8a39e5c110 update api format (#18413)
6 years ago
tensor-tang ce7a024c6d
fix py-cpuinfo mac random fail (#18383)
6 years ago
Jie Fang 2b4ef509ea init custom black white list (#18377)
6 years ago
guru4elephant e83f902b98
add MultiSlotStringDataGenerator for speedup of string based user inp… (#18390)
6 years ago
Jiabin Yang 43f64a177e
Fix/program doc (#17908)
6 years ago
chengduo 871cc15e6a
Add is_compiled_with_cuda (#18356)
6 years ago
Wojciech Uss 8ed819d8e4 Call the test_slim_int8_* tests through absolute path (#18386)
6 years ago
lujun fd6631ef2f
Fix dygraph show style (#18297)
6 years ago
翟飞跃 19da59ed3f Remove all the code, API and doc of MKL-DNN INT8v1 (#18347)
6 years ago
chengduo 8ed33bf91f
Fix Bug-prone code of PE (#18354)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
kh2se2013 27fb9cad65 add WITH_COVERAGE option, default OFF (#17872)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
qingqing01 9047ac687e
Simplify multi_box_head API in detection.py and remove assign op. (#18310)
6 years ago
hutuxian e42057cd1a
add ut for pipeline training (#18289)
6 years ago
Jiabin Yang bd61d89925
test=develop, recover ocr ut on dygraph (#18166)
6 years ago
Yibing Liu 23941e43ec
Update lamb optimizer (#18333)
6 years ago
whs 1bdfd2eb85
Fix checkpoint of Light-NAS (#18330)
6 years ago
Jiabin Yang 79bcdbbf1a test=develop, disable basic gru related ut (#18329)
6 years ago
Jiabin Yang 831a3e62ca
Add install check for multigpu (#18323)
6 years ago
Zeng Jinle f88e07a0d3
fix lod_tensor.py grammar error, test=develop (#18308)
6 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
6 years ago
Jiabin Yang 9cb799be50
test=develop, Revert "Add multi gpu install check" (#18313)
6 years ago
Qiao Longfei 0e08e91c18
optimize communicator merge sparse gradient test=develop (#18159)
6 years ago
Jie Fang 172c2facef init black/white lists (#17847)
6 years ago
chengduo e06c69c788
Fix default value of fluid.memory_optimize (#18295)
6 years ago
Zhaolong Xing 6978b2e48e
fix split and sampled softmax (#18280)
6 years ago
hutuxian 6ed73830c2
add api desc for pipeline training (#18293)
6 years ago
liuwei1031 a736c03b10
improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs (#18261)
6 years ago