Zhaolong Xing
76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. ( #19213 )
...
* fix mask rcnn bug:
1. affine channel fuse (diff)
2. condition block op (memory leak)
3. merge lod tensor op (diff)
4. memroy optim (diff)
test=develop
* fix ci aboud PADDLE_ENFOCE
fix merge lod infer op ut
test=develop
6 years ago
lvmengsi
d08d5ab519
Fix the mistake of convolution ( #19274 )
6 years ago
Aurelius84
78a3d837f8
Add match_matrix_tensor op ( #18525 )
...
* add matrch_matrix_tensor op test=develop
* fix ignore unittest if with_mkl=off test=develop
* clean code and rm is_test param test=develop
* modify API.spec test=develop
* rm useless code in search_compute.h test=develop
* modify api.spec test=develop
* modify default_grad.spec test=develop
* Add API test code test=develop
* clean code in search_computer.h
* modify PADDLE_ENFORCE and clean search_compute.h test=develop
* fix code style test=develop
6 years ago
Zeng Jinle
5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop ( #18823 )
6 years ago
zhang wenhui
539c870753
add fl_listen_and_serv &fl_transpiler,test=develop ( #19091 )
...
add fl_listen_and_serv op for Federated_learning and fl_distribute_transpiler add this op to pserver program . This op just listen the endpoint and sum&scale.
6 years ago
kh2se2013
27e85625b8
add python coverage launch when WITH_COVERAGE=ON ( #19264 )
...
add python coverage launch when WITH_COVERAGE=ON
6 years ago
chengduo
8a89ca94ce
Fix REGISTER_OP_WITHOUT_GRADIENT ( #19251 )
...
* fix REGISTER_OP_WITHOUT_GRADIENT
test=develop
6 years ago
gongweibao
fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. ( #19269 )
...
Unset unittests http_proxy env to avoid timeout.
6 years ago
silingtong123
a94a25867d
imporve the doc of decorate_reader API ( #19206 )
...
* imporve the doc of decorate_reader API, test=develop
* udpate API.spec, test=develop
6 years ago
gongweibao
86f0591175
Remove node_num function. ( #19167 )
...
node_num is not needed for users, so remove them and fix the bugs about it!
6 years ago
Tao Luo
2f8c7e021f
remove unused inference_transpiler unit-tests ( #19130 )
...
* remove unused inference_transpiler unit-tests
test=develop
* remove InferenceTranspiler usage in quantize_transpiler.py
test=develop
6 years ago
zhaoyuchen2018
0c71c839ec
Fix recurrent op not update grade issue ( #18581 )
...
* Fix recurrent op fails
For the variable used in outter block,
copy sub block's grad variable to outter block
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Fix unicode error
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Refine test code
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Fix seq2seq case fails
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* remove unreasonable code.
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* Refine code according to comment
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
Hao Wang
d53fa53b65
CI - Improve example code check ( #19170 )
...
* add exception exit on error example codes test=develop
6 years ago
Adam
b837689e97
Add generalized Conv+Activation MKLDNN fuse pass creation ( #19072 )
...
test=develop
6 years ago
Yibing Liu
50b1cab122
Add padding support for crf_decoding ( #19057 )
...
* Add padding support for crf_decoding
* Fixes in comupte kernel
test=develop
* Update API Spec
test=develop
* Update API.spec
test=develop
* Avoid using paddle_enforce
test=develop
* Fix enforce
test=develop
6 years ago
Aurelius84
45fb031f6b
remove is_test param of FC test=develop ( #19209 )
...
Remove is_test parameter of FC op. The parameter is_test is not used anywhere.
6 years ago
wuzewu
6fc1defd77
Fix compatibility issue of fluid.io.save_vars on windows platform ( #19181 )
6 years ago
liym27
c8cdef37b2
change the default value of summarize from -1 to 20 in Print API to improve ease of use ( #18738 )
...
* change the default value of summarize from -1 to 20 in Print op to improve ease of use, test=develop
* change the doc of API Print to make the document easier to understand, test=develop
6 years ago
LielinJiang
1331c9e1f8
fix distributions unittest bug, test=develop ( #19012 )
6 years ago
lvmengsi
c6f163cd7a
add description of sync_bn ( #19056 )
6 years ago
Zeng Jinle
0f9b33954a
move python reader api to fluid.io module, test=develop ( #19143 )
6 years ago
jiaqi
b86be13c15
fix default value ( #19193 )
...
* fix default value in ps_pb2.py: delta_keep_days 30 -> 16
* test=develop
6 years ago
jiaqi
b104ea0684
add get_last_save_xbox_base/get_last_save_xbox ( #19122 )
...
* add get_last_save_xbox_base/get_last_save_xbox
* fix fleet_util bug of load paddle model
* add doc string in fleet api
6 years ago
jiaqi
bfd514c730
fix default value of fleet desc ( #19176 )
...
* fix default value of fleet desc, default values are same with jingpai
* print log when save model
6 years ago
lidanqing
c548e370f1
UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op ( #19011 )
...
* integrations problem
test=develop
* add batch_norm_mkldnn_op backward-reuse test and guassian seed=0 test
test=develop
6 years ago
Jiawei Wang
6ac32d0981
Instag Implemention ( #18394 )
...
* instag lod tensor impl
* First PR for instag
* First PR for instag
* Before adding Selection Rows.
* Change name from instag to filter_instag, add upgrade the impl of filter_instag
* Change name from instag to filter_instag, add upgrade the impl of filter_instag
* Fix yapf error in gradient_checker.py to pass Travis-CI
* Fix Filter Instag Grad test=develop
* Fix Filter Instag Grad test=develop
* 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop
* Impl Loss_weight and empty output handler
* change Loss Weight datatype to Float32, and add Loss Weight as 2nd output
* 1) Support Tensor Input(without LOD) 2) Add Unit test
* Filter By Instag Final test=develop
* Update API.spec for filter_by_instag test=develop
* Update API.spec for filter_by_instag 2 test=develop
* Add Filter By Instag Coverage
* code format of test_layers.py
* code format test_layers.py test=develop
* Make API args more readable test=develop
* Make API args more readable and pass code format test=develop
* Filter By Instag Op, Rename Map to Index Map test=develop
* Filter By Instag Op, code format err in filter_by_instag_op.cc test=develop
* Filter by instag op: code format of cpp files test=develop
* Filter by instag Op: Api spec modification test=develop
* Filter by instag Op: Api spec doc id modification test=develop
* Filter by instag Op: Api spec and doc preview test=develop test=document_preview
* Filter By Instag Op, fix doc erro test=document_preview test=develop
* Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop
* Filter By Instag Op, fix Api spec test=document_preview test=develop
* Filter By Instag Op, fix Paddle Encoforce deprecated warning test=document_preview test=develop
* Filter By Instag Op, fix Paddle Encoforce deprecated and code format warning test=document_preview test=develop
6 years ago
wawltor
0019eb376a
Fix the error of op `ones_like` document,change the output variable
...
test=document_preview
test=develop
Fix the error of op `ones_like` document, change the output variable from x to out.
6 years ago
huangjun12
20f18930ae
Add hard swish op (new op) ( #19001 )
...
* add hard_swish activation op (new op)
test=develop
* remove redundancy files
* modify document content of HardSwish OP
* add API test in test_layers.py
* add dynamic_graph for test_hard_swish
6 years ago
gongweibao
29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. ( #18966 )
...
Polish fleet API to support cuda collective mode and nccl2 mode
6 years ago
wopeizl
80b7ef6fc8
add tensorrt support for windows ( #19084 )
...
* add tensorrt support for windows
6 years ago
Kevin
744279fe68
Refine embedding Api doc ( #18820 )
...
* fix overflow by int32 mul test=develop
* fix reference nullptr
* fix codestyle test=develop
* modify to point in ContextProjectFunctor test=develop
* modify to point in ContextProjectFunctor test=develop
* modify . to -> test=develop
* refine embedding padding_idx doc test=develop
* fix math:padding_idx preview bug test=develop
* modify API.spec test=develop
* fix spell error test=develop
* refine dtype parm desc test=develop
6 years ago
yaoxuefeng
9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics ( #18871 )
...
* add ctr related metric layer test=develop
* add save cache and slots shuffle test=develop
* add save cache and slots shuffle test=develop
* fix error
* fix error
* fix style for ci
* fix for comments
* change SlotsShuffle input to std::strinf for generality
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix stylr
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* change non-const reference to pointer
* fix style
* fix style
* fix style test=develop
* fix style test=develop
* add return ins num in ctr metric op
* change dtype to float in metric_op.py
* fix error test=develop
* fix style test=develop
* fix API spec
* fix API spec
* fix API spec test=develop
* add UT test=develop
6 years ago
Zeng Jinle
c51eb6bb14
remove book_memory_optimization directory, test=develop ( #19117 )
6 years ago
Zeng Jinle
c194b0c835
Try to deprecate unstable python memory optimize ( #18983 )
...
* deprecate python memory optimize, test=develop
* remove memory_optimize in unittests, test=develop
* add unittests to deprecated interfaces, test=develop
6 years ago
hutuxian
5a80cc8431
Datafeed support reading to cuda place directly. ( #19071 )
...
* add a place field in DataFeed to denote which place it will feed data to.
* abstract the copy process in CopyToFeedTensor function
* add UT for float32 type and for CUDAPlace
6 years ago
chengduo
3f4c088ad8
prune the feed op in compiler ( #18997 )
...
test=develop
6 years ago
chengduo
d23603322e
Remove compile from PE ( #19080 )
...
* remove compile from PE
test=develop
6 years ago
ShenLiang
4397cb318e
add eye op, kernel and unitest test=develop ( #18980 )
...
* add eye op,test=document_preview test=develop
* fix the API.spec, test=develop
* fix the document, test=document_preview test=develop
* add unitest for CI coverage, test=develop
6 years ago
Kaipeng Deng
f86fead693
Add trilinear_interp OP ( #18711 )
...
* add trilinear interp. test=develop
* fix unittest. test=develop
* add python api and test_layers. test=develop
* refine API.spec. test=develop
* fix format. test=develop
* add python API test. test=develop
* format code. test=develop
* refine code strcuture. test=develop
* fix format
* fix doc. test=develop
* fix converage. test=develop
* fix format. test=develop
6 years ago
chengduo
17d62ab220
Enhance fuse optimization op pass ( #19010 )
...
* Enhance fuse optimization op pass
test=develop
6 years ago
chengduo
21440b4d69
Add call stack info during compile time ( #19067 )
...
* Add call stack info during runtime and compile time
test=develop
* Rename operator_call_stack
test=develop
* Add unit test
test=develop
* follow comment
test=develop
6 years ago
jiaqi
a99bc64c63
add fleet util, add some interface in hdfs util ( #18752 )
...
* add fleet util (fleet/utils/fleet_util.py): functions for users' convenience
* add some interface in hdfs util : hdfs is_file、hdfs cat
6 years ago
mapingshuo
4ad7c9d5a7
[WIP] Add Imdb train demo ( #18895 )
...
* add train demo for imdb text classification task
* make inference library release data_feed dataset dataset_factory data_feed_factory
* add String Data Generator
* new feature of train demo: save model params
* New feature of train demo: set training config using gflags
* change code style for CI
* add readme and dataset for imdb demo trainer
6 years ago
wangguanzhong
e50f527fee
update roi doc in roi_pool and roi_align ( #19036 )
...
* update roi doc in roi_pool and roi_align, test=develop
6 years ago
Leo Chen
8f53735437
Fix memory overwriting of tensors returned by executor ( #19030 )
...
* fix memory overlapping of fetch var (return of executor.run), test=develop
* fix wrong usage of ParallelExecutor in op_test, test=develop
* remove useless parameter and simplify code
* avoid tensor destruct untimely, test=develop
* add testcase independent of OpTest, test=develop
6 years ago
Youwei Song
95ff4fba61
specify the highest numpy version under python 2.x ( #19018 )
...
As mentioned in this link, the last version of NumPy to support Python 2.7 is numpy 1.16.4.
6 years ago
Kaipeng Deng
1f46253d4a
fix natural exp decay doc. test=develop ( #19025 )
6 years ago
LielinJiang
e5b9753a18
Fix ExponentialMovingAverage api bug in python3, test=develop ( #18775 )
6 years ago
Kevin
e681d65515
Add var_conv_2d op ( #18518 )
...
* fix overflow by int32 mul test=develop
* fix reference nullptr
* fix codestyle test=develop
* modify to point in ContextProjectFunctor test=develop
* modify to point in ContextProjectFunctor test=develop
* modify . to -> test=develop
* add var_conv_2d op test=develop
* edit api.spec test=develop
* ignore unittest if with_mkl=off test=develop
* fix python3 division test=develop
* fix ignore unittest bug test=develop
* remove useless code test=develop
* modify api.spec test=develop
* modify default_grad.spec test=develop
6 years ago
Chen Weihang
81fe02c3fe
Fix config description error in cuda_profiler function document ( #18750 )
...
* fix profiler doc error, test=develop
* update API.spec, test=develop
6 years ago
Zeng Jinle
311f90f1eb
reduce_unittest_time,test=develop ( #19005 )
6 years ago
lvmengsi
5d9df8c8c7
fix dropout ( #18965 )
...
Fix dropout in nn.py
6 years ago
SunGaofeng
4da1c4f15d
fix g_param shape mismatch in WeightNormParamAttr ( #18940 )
...
* fix g_param shape mismatch in WeightNormParamAttr
* add comment to show why insert reshape in startup_program
test=develop
6 years ago
Jiabin Yang
af63b1184c
test=develop, fix memory leak in dygraph ( #18998 )
6 years ago
liuwei1031
a43a763b54
fix warpctc.dll not found issue ( #18761 )
...
* fix warpctc.dll not found issue, test=develop
* revert the linux platform change, test=develop
* delete warpctc_lib_path.h.in, test=develop
* add SetPySitePackagePath function
* fix warpctc.dylib not found issue on Mac, test=develop
* improve the paddle lib path setting logic, test=develop
* fix mac ci issue caused by test_warpctc_op unittest, test=develop
* tweak code, test=develop
6 years ago
chengduo
01c7daade7
Add checking for the fetch_list of Executor.run ( #18957 )
...
* update exe.run
6 years ago
Liufang Sang
faf6890b6c
support tensor input for ctc align op ( #18887 )
...
* test=develop support Tensor input for ctc_align_op
* test=develop add some comment
6 years ago
Dong Daxiang
c97ea53c3e
make listen and server as exclusive run ( #18990 )
...
make listen and server as exclusive run
6 years ago
xsrobin
8ce902541c
fix unalign of some examples ( #18943 )
...
* test=develop test=document_preview
* Update API.spec
6 years ago
Zeng Jinle
7ac748adb4
Open gc by default ( #18836 )
...
* open gc by default, test=develop
* fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop
* fix conditional_block op eager deletion bug, test=develop
* add some comments to reviewers, test=develop
6 years ago
hong
f745d6d9e4
fix expand op dtype build bugs; test=develop ( #18932 )
6 years ago
jiaqi
02c370c3dc
support filelist size < trainer num && fix pull dense ( #18956 )
...
* support filelist size < trainer num
* pull dense when stop, to make sure local dense params are same as pserver, so save paddle model will save dense model same as pserver
* enable QueueDataset train same filelist for serveral times
6 years ago
石晓伟
ee2f296ef8
Fusion: seqpool_cvm_concat ( #18471 )
...
* add fusion_seqpool_cvm_concat test=develop
* simplify pass, test=develop
* fix code style, test=develop
6 years ago
jiaqi
768059b3a0
adjust ins weight according to nid slot ( #18784 )
...
adjust ins weight according to nid slot , user can specify adjust_ins_weight in strategy
6 years ago
wawltor
3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique ( #18720 )
...
* test=develop
Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data.
* test=develop
Check the input and dtype in the op of unique_with_counts
* test=develop
test=document_preview
update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count`
* test=develop
test=document_preview
Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op.
* Fix some API problem in the op of `unique_with_counts`
test=develop
test=document_preview
* test=develop
test=document_preview
Fix the api sample of op `unique_with_counts`, and update api.spec
6 years ago
LielinJiang
22fa4c2d24
Fix depthwise conv gpu kernel bug ( #18582 )
...
* fix depthwise conv gpu kernel bug, test=develop
* add more depthwise conv test, test=develop
6 years ago
whs
c92b78b060
Fix unitest of light nas. ( #18931 )
...
test=develop
6 years ago
jiaqi
233746d89d
set fleet_send_batch_num a default value according to trainer num
...
(1) set fleet_send_batch_num a default value according to trainer num, the previous 80000 is fixed,if trainer num is much less or larger than 100,global shuffle may have timeout error.
(2) fix load one table bug, add barrier
6 years ago
chengduo
20859c08e8
[DyGraph] Make multi-card program faster ( #18892 )
...
* update parallel.py
test=develop
6 years ago
HaoRen
24f8543106
Add center Loss Op Support ( #18681 )
...
* support center loss
* change tensor copy api to high level api tensorcopy
* test=develop rewrite the center_loss cuda_kernel to make it faster
and add document of the center loss api,also update test function
* test=document_preview test=develop
update document of center loss
* test=document_preview test=develop
modify API.spec modify test code remove nouse const_cast
6 years ago
lvmengsi
d21c391447
replace paper link ( #18861 )
...
Update conv2d transpose link
6 years ago
Dong Daxiang
2bb296dfe9
make dist unit test exclusive run ( #18865 )
...
make dist unit test exclusive run
6 years ago
whs
6cccab9203
Make lod_append support variable lod. ( #18908 )
...
test=develop
6 years ago
danleifeng
e0a2d4dfec
Add elementwise_pow_op backward implementation and the unit test codes of it. ( #18848 )
6 years ago
chengduo
ecd2bdada6
add CPUInplaceTestWithFuseOptimizationOps ( #18867 )
...
test=develop
6 years ago
Zeng Jinle
8008ab4e6b
Remove legacy C++ memory optimization codes ( #18834 )
...
* remove legacy memory optimization codes, test=develop
* follow huihuang's comments,test=develop
* follow luotao's comments, test=develop
6 years ago
Thunderbrook
52c1431eee
add clear_model interface in fleetwrapper ( #18815 )
...
* dump slot
* test
* proto
* dump slot
* test
* proto
* code style
* code style
* code style
* style
* add delete after unseen days
* add unseen days
* code style
* conflict solve
test=develop
* add clear model
* code style
test=develop
* code style
test=develop
6 years ago
Zeng Jinle
9a8a7a1ddc
fix affine_channel no_need buffer bug, test=develop ( #18844 )
6 years ago
lvmengsi
829ef26281
Fix drop deconv ( #18813 )
...
* replace link
* update api.spec
* fix mistake
6 years ago
chengduo
4140fe11a4
Open fuse optimization ops ( #18741 )
...
* open fuse optimization ops
test=develop
6 years ago
chengduo
582cc29799
add warning info for CPU_NUM ( #18840 )
...
test=develop
6 years ago
Adam
ee02227949
Add LeakyReLU MKLDNN support ( #18762 )
6 years ago
Zeng Jinle
a802da650b
Feature/mem opt pass refactor ( #18735 )
...
* first version memory optimize pass, test=develop
* remove move_tensor_sharing_pass, test=develop
* refine code comments, add unittests, test=develop
* turn off memory_optimize by default, test=develop
* follow huihuang's comments, test=develop
* follow chengduoZH's comments, test=develop
* fix grammar error, add const qualifier, fix pass_test exception message, test=develop
* follow chengduoZH's comments 2nd, test=develop
6 years ago
石晓伟
9dbb62eeb9
Fix examples of API ( #18092 )
...
* fix logical APIs
test=develop
test=document_preview
* fix isfinite
* update matmul comments
* update API.spec
test=document_preview
test=develop
* update API.spec
test=document_preview
test=develop
* update API.spec
test=document_preview
test=develop
6 years ago
guru4elephant
30562e371b
refine launch_ps and role_maker ( #18795 )
...
refine launch_ps and role_maker
6 years ago
fuyinno4
c167a4b4dd
Fix shrink-dense and add scale-datanorm ( #18746 )
...
Fix FleetWrapper:
1. fix shrink dense: just scale show
2. add datanorm scale: divide datanorm's gradient by batch_size
6 years ago
guru4elephant
2efb282c86
split test_dist_se_resnext.py into 4 testcases ( #18743 )
...
* split test_dist_se_resnext.py into 4 testcases
6 years ago
Bob Zhu
220eef602e
Extend Matmul to support matrix multiplication with multiple heads ( #18570 )
...
* extend matmul op to support multiple head multiplication
With the support of multiple head, the multiplication of two big matrixes is
split into multiplication of several (head_number) small matrixes. e.g. if
Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number
as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of
[6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].
6 years ago
whs
075e1cf78e
Add python API for appending LoD level ( #18702 )
...
* Make lod reset op support for append lod level.
* Fix API.spec
test=develop
* Fix unitest.
test=develop
* Add python api for lod append.
test=develop
* Fix API.spec
test=develop
* Fix format of doc.
test=develop
* Fix unitest.
test=develop
* Fix doc.
test=develop
6 years ago
chengduo
8259f1418f
Enhance backward process ( #18700 )
...
* prun backward ops
test=develop
6 years ago
JesseyXujin
25c9b57bcd
Modify auc doc. Add output variable description, previously was the scalar type, now changed to the tuple type.test=develop ( #18771 )
6 years ago
Thunderbrook
d8396281ef
add slot to sparse table ( #18686 )
...
The change includes 2 things:
1. save delta model and shrink table are control by the same parameter before, now add delete_after_unseen_days to control shrink table.
2. value in sparse table has no slot before, now add slot in sparse table, and add DownpureCtrAccessor to support the new meta.
test=develop
6 years ago
jiaqi
d18aabb472
support patch data, add load_one_table, fix bug ( #18509 )
...
(1)support patch data (merge slots of instances of same line id, modify dense layer which
changes its size)
(2)add fleet load_one_table interface, support load from paddle model and load from pslib model
(3)fix push sparse bug which cause push sparse cost more time(about 10% in my testcase)
(4)when some slots are not in one of your network (join/update, etc.),data feed、collect label info、push/pull sparse will skip these slots, instead of throw error.
(5)add more debug info in TrainFilesWithProfiler
6 years ago
chengduo
fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. ( #18664 )
...
* support sparse gradients
test=develop
6 years ago
Yi Liu
157211c4e1
supports distributed classification ( #18690 )
...
* supports distributed classification training
* update API.spec
* fix evenly division in python3
* change "index_range" to "index_num" in shard_index operator
test=document_preview
test=develop
6 years ago
guru4elephant
70b03760fd
add parameter server launch ( #18687 )
...
add parameter server launch so that a user can easily launch parameter server
6 years ago
Zeng Jinle
d07ad4c605
add more traceback to py_reader error msg, test=develop ( #18722 )
6 years ago
Huihuang Zheng
a3028bb729
Fix random test_recurrent_op failure ( #18718 )
...
The change includes 3 things:
1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.
2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.
3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
test=develop
6 years ago
Tao Luo
bd22453f20
Revert "Add LeakyRelu MKLDNN support ( #18656 )" ( #18723 )
...
test=develop
6 years ago
tangwei12
d845848341
do some odd jobs ( #18641 )
...
do some odd jobs, test=develop
6 years ago
guru4elephant
ebf9797ec3
split different comm method for mnist distributed training ( #18715 )
...
* split different comm method for mnist distributed training
6 years ago
Huihuang Zheng
89bc3fd841
Support memory eager deletion on recurrent OP ( #17710 )
...
Test PaddingRNN on V100 GPU device.
Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR)
Speed (steps/s): 10.28 (this PR) vs 9.89 (without this PR)
6 years ago
Adam
d6b6a337a9
Add LeakyRelu MKLDNN support ( #18656 )
...
test=develop
6 years ago
tangwei12
0b9acb4912
add check of executor ( #17986 )
...
* add check of executor, test=develop
6 years ago
Zeng Jinle
ae58afc546
Feature/auto_growth_allocator ( #18561 )
...
* feature/auto_growth_allocator, test=develop
* add unittest of AlignedAllocator, test=develop
* try to turn on auto_growth to test on CI, test=develop
* fix segmentation fault in mixed_vector.h, test=develop
* add unittests, test=develop
6 years ago
hutuxian
bb2f5d24a2
hash_op support int64 hash_size ( #18674 )
...
* hash_op support int64 hash_size
* add corresponding UT
6 years ago
guru4elephant
5ed713d519
remove ctr reader, all functions are satisfied in dataset ( #18672 )
...
* remove ctr reader, all functions are satisfied in dataset
6 years ago
guru4elephant
ab57d3893e
make auc op compatible with 1 dim ( #18551 )
...
* make auc op compatible with 1 dim
6 years ago
guru4elephant
b71b454328
increase timeout again ( #18628 )
...
test=develop
6 years ago
123malin
b414645a65
fix #17430 : int64类型的attr训练非预期 ( #18264 )
...
* fix int64_t
* update fill constant op unittest
* add empty line
6 years ago
Kevin
995d7d8600
Modify embedding_op input dtype to int64 ( #18598 )
6 years ago
kh2se2013
9ad57f2dfd
1)change to parallel mode on python coverage run ( #18594 )
...
2)add pip install coverage in Dockerfile.tmp
test=develop
6 years ago
gongweibao
c0a82748cf
Polish backwards optimizer dependency codes and use more default values. ( #18255 )
6 years ago
Zeng Jinle
d3003a1620
Feature/buffer_shared_inplace ( #17911 )
...
* feature/buffer_shared_inplace, test=develop
* refine code, test=develop
* fix elementwise_add op cpu inplace and sum inplace bug, test=develop
* add unittest and debug log, test=develop
* fix parallel_executor scope bug, polish code, test=develop
* fix sum op, activation op, single_in_place_inference bug, test=develop
* remove kLocalExecScopeName, test=develop
* fix unittest,test=develop
* fix out_var first version bug, test=develop
* follow comments,test=develop
6 years ago
tianshuo78520a
1c10dac4f2
Add code example in CI ( #18228 )
...
* test api example
* update python
* add sampcd_processor.py
* add if 0
* sort
* test paddle
* test paddle
* test paddle
* add whitelist
* change sampcd_processor.py
* change sampcd_processor.py
* change sampcd_processor.py
* add exit
* test=develop
* test=develop
6 years ago
lujun
b6d5c74f69
update dygraph api doc for web ( #18550 )
...
remove dygraph.enable from __all__
hidden dygraph. profiler
add doc to dygraph. no_grad
6 years ago
guru4elephant
9c17a899d7
upgrade collective fleet api ( #18533 )
...
* upgrade collective fleet api
6 years ago
bingyanghuang
a25be53cb5
QAT int8 MKL-DNN transformation pass with MUL ( #18322 )
6 years ago
Physher
0caa08ea40
Add mkldnn int8 mul-op kernel ( #17834 )
6 years ago
LielinJiang
24d1c44a0c
Fix roi_perspective_transform_op bug ( #18522 )
...
* fix transform matrix bug, test=develop
* modify API.spec
6 years ago
guru4elephant
1f1cc2221f
add random port ( #18504 )
...
* add random port
6 years ago
zhaoyuchen2018
832d8191ff
Fix topk cannot handle 1D vector bug ( #18466 )
...
* Fix topk cannot handle 1D vector bug
Add path to handle 1D vector
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
* refine code
test=develop
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
6 years ago
Jiabin Yang
7586cdd545
Hide no support ( #18515 )
...
* test=develop, fix docker with paddle nccl problem
* test=develop, hide no_support api and add ut for it
6 years ago
LielinJiang
43e17c7951
Add distributions of normal and uniform ( #18023 )
...
* add_distributions_of_normal_and_uniform
* paddle/fluid/API.spec
* modify API.spec
* modified paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* fix some comment, test=develop
* modify API.spec, test=develop
* add comment for init function, modify hard code, test=develop
* modify API.spec, test=develop
* modify API.spec, test=develop
* make unit test function shorter, test=develop
* modify paddle/fluid/API.spec
6 years ago
tensor-tang
4828a5e008
core remove pycpuinfo ( #18479 )
...
remove pycpuinfo deps in core
6 years ago
qingqing01
602cb6a5b4
Enhance linear_lr_warmup ( #18463 )
...
* make it support float/int learning as input.
6 years ago
chengduo
7453857324
Make fuse_all_reduce_op_pass support mix_precision ( #17652 )
6 years ago
chengduo
55baeceddb
Enhance execution error info ( #18482 )
...
* enhance execution error info
test=develop
6 years ago
pkpk
e9c7e218f2
Nan debugger init ( #18401 )
...
test=develop
6 years ago
zhoukunsheng
7c6f2350b9
support Tensor input for edit_distance op ( #18162 )
6 years ago
zhoukunsheng
26318544d2
support Tensor input for chunk_eval op ( #18226 )
...
* test=develop
support Tensor input for chunk_eval op
* test=develop
fix testcase for chunk_eval op
* test=develop
fix typos in nn.py
6 years ago
zhoukunsheng
206c44e2a8
add unique kernel and op ( #17557 )
6 years ago
zhoukunsheng
71af72b1c2
upgrade hash op to support Tensor and LoDTensor input ( #17998 )
6 years ago
zhoukunsheng
d3b3443d10
add ones_like op ( #17388 )
6 years ago
zhoukunsheng
67b48d7fe7
add size op ( #17412 )
6 years ago
hutuxian
6e0df3102e
Refactor for Pipeline Thread Check ( #18459 )
...
move the thread-check code from train_from_dataset to a single function
add UT for the thread check function
6 years ago
Zeng Jinle
41ab76e55b
add friendly error msg to py_reader ( #18316 )
6 years ago
Kaipeng Deng
823ab5e887
fix load attr error. test=develop ( #18447 )
6 years ago
Yi Liu
a873fa84ce
supports collective training with programs ( #18392 )
...
1. Since allreduce op has 4 reduce types, We split these four reduce types into four ops
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and remove the device specified DeviceContext parameter in template as we already knew the target DeviceContext
3. We remove the newly added Collective op role to reduce the complexity of program and graph analysis
6 years ago
guru4elephant
357311fdb7
make fleet support mpi job submit directly ( #18441 )
...
make fleet support mpi job submit directly.
6 years ago
chengduo
e0d8c6ac68
Add find_no_grad_vars in backward.py ( #17942 )
...
* add not_been_used_vars to no_grad_set
test=develop
6 years ago
LielinJiang
449c7a9f98
Make roi_perspective_transform op return mask and transform matrix ( #18371 )
...
* modify roi_perspective_transform_op to output mask and transform matrix
* modify comment
* modify comment
* modify API.spec
* update API.spec
* remove no use header, test=develop
* resolve conflict
6 years ago
tensor-tang
a3bc804f5f
fix mac ci random fail ( #18430 )
...
* fix mac ci random fail
* use platform instead
6 years ago
xiaoting
dd3f9d19cf
replace mnist dataset url, test=develop ( #18429 )
...
replace mnist dataset url
6 years ago
xsrobin
47e2ef38e9
add "import paddle.fluid as fluid" to examples lack of it
6 years ago
hutuxian
8a39e5c110
update api format ( #18413 )
...
* update api format
test=develop
* update API.spec
test=develop
6 years ago
tensor-tang
ce7a024c6d
fix py-cpuinfo mac random fail ( #18383 )
...
* fix py-cpuinfo mac random fail
* differentiate version on windows
6 years ago
Jie Fang
2b4ef509ea
init custom black white list ( #18377 )
...
test=develop
6 years ago
guru4elephant
e83f902b98
add MultiSlotStringDataGenerator for speedup of string based user inp… ( #18390 )
...
* add MultiSlotStringDataGenerator for speedup of string based user input data
6 years ago
Jiabin Yang
43f64a177e
Fix/program doc ( #17908 )
...
* test=develop, add some comments for Program.clone
* test=develop, add API.spec
* test=develop, refine comments
* refine Program doc and clone doc
* test=develop, refine doc
6 years ago