Shixiaowei02
7b9fc71076
update tensorrt subgraph_util test=develop
6 years ago
dengkaipeng
0f7411a1ae
round down for scale. test=develop
6 years ago
dongdaxiang
87027a2eef
fix API.spec problem and executor's docstring
...
test=develop
6 years ago
sneaxiy
8c869a865d
update develop ops
...
test=develop
6 years ago
sneaxiy
33473890f3
Merge develop
...
test=develop
6 years ago
dongdaxiang
ade9337486
fix API.spec
...
test=develop
6 years ago
liuwei1031
278debab71
fix comments of 16410, test=develop ( #16499 )
...
* fix comments of 16410, test=develop
* modify inplace_op_inference_test according to pass interface change, test=develop
6 years ago
Wojciech Uss
2498395132
remove profiling from int8 test
...
test=develop
6 years ago
Zhaolong Xing
3e6aa498d6
Merge pull request #16526 from NHZlX/refine_trt_anakin
...
refine subgraph trt and anakin
6 years ago
Tao Luo
8f7b5883b8
Merge pull request #16529 from lidanqing-intel/lidanqing/preprocess-data
...
preprocess with PIL the full val dataset and save binary
6 years ago
Tao Luo
5b24002389
Merge pull request #16399 from sfraczek/sfraczek/analyzer_int8_resnet50_test
...
create test for quantized resnet50
6 years ago
nhzlx
7cde2d9e84
fix trt engine test error.
...
test=develop
6 years ago
zhoukunsheng
3c4f5f0368
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
zhoukunsheng
ead3c0a8fc
update api.spec
6 years ago
dongdaxiang
720647e17f
rebase current develop and fix conflict
...
test=develop
6 years ago
zhoukunsheng
2336d5ca5d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
6 years ago
dongdaxiang
3a79be6eb3
refine API spec
...
test=develop
6 years ago
dongdaxiang
98dda08a85
fix pull sparse slow problem
...
test=develop
6 years ago
dongdaxiang
93c3c7f9b3
fix dataset testcase problem
...
test=develop
6 years ago
dongdaxiang
d739bab844
fix async_executor problem and remove some unnecessary testcase, fix trainer_desc import problem
...
test=develop
6 years ago
dongdaxiang
241d8808be
add timer to distributed executor
...
test=develop
6 years ago
dongdaxiang
3c73859eec
add trainer_desc.proto to distributed executor
...
test=develop
6 years ago
dongdaxiang
60b7bf6fa6
add infer_from_dataset for inference
6 years ago
xjqbest
030c7e7e9d
fix FillSparseValue error
...
test=develop
6 years ago
dongdaxiang
88880d9b69
fix import trainer_desc_pb2 error
...
test=develop
6 years ago
dongdaxiang
0030eb2a61
fix distributed building
...
test=develop
6 years ago
dongdaxiang
ed31874397
undefine rand_r()
...
test=develop
6 years ago
dongdaxiang
f7e4813804
add WIN32 for rand_r and usleep
...
test=develop
6 years ago
dongdaxiang
cedbc161da
add more _LINUX maroc on data_feed.cc for mac and window compile
...
test=develop
6 years ago
dongdaxiang
c5980c3566
add _LINUX macro
...
test=develop
6 years ago
dongdaxiang
433301fbc2
remove glog in shell.h
...
test=develop
6 years ago
dongdaxiang
9e51ad4a65
fix io and fs compile on mac
...
test=develop
6 years ago
dongdaxiang
6eca88ac76
fix io and fs compile on mac
...
test=develop
6 years ago
dongdaxiang
2708108a08
fix fleet_wrapper compile on windows
...
test=develop
6 years ago
dongdaxiang
4ce35815fb
fix windows GLOG problem
...
test=develop
6 years ago
dongdaxiang
e3107a6ae0
fix windows compile problem
...
test=develop
6 years ago
dongdaxiang
398004ece0
disable sys/wait.h to fix windows compile problem, include scope in lodtensor_printer
...
test=develop
6 years ago
dongdaxiang
d4514949bf
remove local random engine in fleet with rand_r()
...
test=develop
6 years ago
dongdaxiang
e82969eeb0
remove getdelim in windows
...
test=develop
6 years ago
dongdaxiang
45eb6f0765
run pre-commit check files and fix code style problem
...
test=develop
6 years ago
dongdaxiang
d87ba58c14
refine document of python API, make device_worker and trainer's API private
...
test=develop
6 years ago
dongdaxiang
5687f234bf
fix trainer_desc.proto error
6 years ago
dongdaxiang
b95b80bc76
add doc string for executor and update API.spec
...
test=develop
6 years ago
dongdaxiang
6be9f719e2
make string_helper dependency work
...
test=develop
6 years ago
xjqbest
e95cafd9a7
fix code style & add dataset testcase
...
test=develop
6 years ago
dongdaxiang
39362a8415
move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
...
test=develop
6 years ago
dongdaxiang
ba15d6b164
move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids
...
test=develop
6 years ago
xjqbest
be74de2c61
fix code style & fix register bug & add release_memory
...
test=develop
6 years ago
dongdaxiang
a0b59773af
fix code style
6 years ago
dongdaxiang
f39b323ed7
remove trainer_library in CMakeLists
...
test=develop
6 years ago
dongdaxiang
365be5d559
support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
...
test=develop
6 years ago
dongdaxiang
dc8cf36e4b
add more example on datagenerator
...
test=develop
6 years ago
dongdaxiang
6bf796df14
refine print fetch list
6 years ago
xjqbest
589467f24c
fix bug
6 years ago
xjqbest
b7940c2918
fix bug of gen_worker_desc and set_filelist, add some doc
6 years ago
dongdaxiang
68d7bf3de5
add fetch var function
...
test=develop
6 years ago
xjqbest
a34fe6248f
add some doc
6 years ago
xujiaqi01
f5c6a14b54
fix runtime error
6 years ago
xujiaqi01
a5b1a0e12b
support multi dataset && add init model && fix bug
6 years ago
dongdaxiang
3c65cc1bbd
add document for role_maker and fleet parameter, data_generator
6 years ago
dongdaxiang
f6c9232a3d
fix dataset float32 type problem
6 years ago
dongdaxiang
73b1f396d7
add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
...
test=develop
6 years ago
dongdaxiang
73544e8b8d
add training speed log
6 years ago
dongdaxiang
9419de521f
add IO percent for multi_trainer
6 years ago
dongdaxiang
6af697adb0
add trainfileswithprofiler for downpour worker
6 years ago
dongdaxiang
2644b88685
add comment for MPI Symetric role maker
...
test=develop
6 years ago
dongdaxiang
cf45c54340
add distributed optimizer factory
6 years ago
dongdaxiang
b7a202aa38
add distributed optimizer factory
6 years ago
xujiaqi01
70a5d4f797
fix error
6 years ago
xujiaqi01
d25389fefd
add some log && fix error
6 years ago
dongdaxiang
f612877797
add incubate for unified API
6 years ago
dongdaxiang
317eb0aad3
add incubate for unified API
6 years ago
xujiaqi01
39449ba0b9
fix bug && add DestroyReaders in trainer
6 years ago
dongdaxiang
e657c127a8
hide opt_info in distirbuted optimizer
6 years ago
xujiaqi01
ecfc7df913
add dataset factory && fix style
6 years ago
dongdaxiang
328f11b8b6
refactor downpour optimization
...
test=develop
6 years ago
xujiaqi01
3cea00bd52
store memory data in Dataset && fix bug
6 years ago
dongdaxiang
ff87698a44
refactor downpour optimization
6 years ago
dongdaxiang
b66f0074b6
fix data reading bugs in api, add VLOG(3) log for setup
6 years ago
dongdaxiang
b415ec27e8
make Dataset* as an argument
6 years ago
xjqbest
dd67ad08a2
modify c++ and python dataset related code & fix bug
6 years ago
dongdaxiang
cc4def6ba5
fix some conflict for compilation
6 years ago
heqiaozhi
9bca1926c1
refactor & fix bug
6 years ago
xjqbest
2e9a836c6f
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
2486389793
add RunFromDataset in executor
6 years ago
dongdaxiang
e36bbcc871
fix some typo and CMakefile.txt
6 years ago
xjqbest
824b84d185
add DataSet and InMemoryDataFeed, support load data into memory and shuffle data
6 years ago
dongdaxiang
08c25995a2
add run from dataset in executor.
6 years ago
dongdaxiang
c28bbdf8ba
add dataset_generator.py
...
dataset_generator.py is a framework for generating data with python
the generated data with a fixed format will be feeded into c++ reader
test=develop
6 years ago
dongdaxiang
be757096da
add pybind for fleet
6 years ago
dongdaxiang
687cb79dbb
add pipe command io interface
6 years ago
dongdaxiang
1fe54416c9
move fs.cc and shell.cc into paddle/fluid/framework/io
...
test=develop
6 years ago
dongdaxiang
53fbab5d33
add fs_local_open example
6 years ago
dongdaxiang
afaf937010
add fs_local_open example
6 years ago
dongdaxiang
cf1360643f
add printer for fetch variable
6 years ago
dongdaxiang
d65cb13ad5
add pslib flag on fleet_wrapper CMakefile
6 years ago
dongdaxiang
6de9ebc65c
refine VLOG in fleet_wrapper.h
...
test=develop
6 years ago
dongdaxiang
97d5cd30f0
make pull dense worker work
6 years ago
dongdaxiang
39014b9f9f
fix class register problem
6 years ago
dongdaxiang
f0dd1201cc
fix destructor problem
...
test=develop
6 years ago
dongdaxiang
f2bde9c241
fix destructor problem
6 years ago
dongdaxiang
54f047a126
fix ngraph compile option
6 years ago
dongdaxiang
dd1dc9bcf0
add common.h.in back
6 years ago
dongdaxiang
378037c535
make s_instance_ private to ensure singleton
6 years ago
dongdaxiang
a446d26e8a
add todo for asynce executor
6 years ago
dongdaxiang
c165012031
refine device_worker and trainer code
...
test=develop
6 years ago
dongdaxiang
8a335b50be
add downpour device_worker pb configuration
6 years ago
dongdaxiang
24a8001142
make -DWITH_PSLIB=ON compilable
6 years ago
dongdaxiang
67b1d6d721
add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
dongdaxiang
855bf579d2
add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
6 years ago
lujun
d4f63d82ac
Merge pull request #16475 from junjun315/fix-doc-multiplex
...
refine multiplex-doc
6 years ago
Qiao Longfei
d8974e6da0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
...
test=develop
6 years ago
Shixiaowei02
bddb2cd315
resolve conflicts with the develop branch test=develop
6 years ago
lidanqing
0d656996bf
fix some bugs of unzip and reading val list
...
test=develop
6 years ago
chengduo
1096746cbf
Fuse Adam And SGD ops ( #15933 )
...
* fuse optimizer
6 years ago
Jacek Czaja
2632327429
[MKL-DNN] Tensor modifications revert ( #16462 )
...
* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233 )"
This reverts commit 13816dd4ac
.
Apart from enabling transformer for MKL-DNN
* Revert "- MKL-DNN pooling updated to set_prim_desc"
This reverts commit c63f6b2039
.
Conflicts:
paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc
* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429 )"
test=develop
This reverts commit dec9cf53c8
.
* - concat compilation fix
- lint
test=develop
- Lint fixes
test=develop
- Lint fixes
test=develop
- Fix Transpose MKLDNN op
test=develop
6 years ago
Zeng Jinle
4143a1c216
Merge pull request #16491 from sneaxiy/feature/advance_gc
...
Fix grad op makers
6 years ago
chengduo
2265d091e6
Fix threaded executor bug ( #16508 )
...
* fix threaded executor bug
test=develop
* change the order of class member
test=develop
* Fix Travis CI
test=develop
6 years ago
sneaxiy
2c836ff914
check default grad maker
...
test=develop
6 years ago
nhzlx
d065b5bf2b
Anakin ssd support
...
refine trt first run
add quant dequant fuse pass
omit simplify_anakin_priorbox_detection template
omit transpose_flatten_concat_fuse template
test=develop
6 years ago
zhoukunsheng
beb4a86d13
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rank
6 years ago
zhoukunsheng
b06e9b773d
test=develop
...
add rank op
6 years ago
Zeng Jinle
69cb9792ea
Merge pull request #16506 from sneaxiy/revert-16424-fix_allocator_bug
...
Revert "Fix allocator bug"
6 years ago
lidanqing
b46e467abc
add wget and unzip part and change data_dir
...
test=develop
6 years ago
zhoukunsheng
2f9e562100
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
dengkaipeng
2078f4207f
fix API.spec. test=develop
6 years ago
lidanqing
894aa9b235
change script file name and data_dir location
...
test=develop
6 years ago
lidanqing
57f51e5b08
preprocess with PIL the full val dataset and save binary
...
test=develop
6 years ago
dengkaipeng
8160a66193
fix doc priority. test=develop
6 years ago
chengduo
ed61d67c73
Fix the interface of Pass::Apply ( #16484 )
...
* modify the interface of Pass::Allay
test=develop
* Polish code
test=develop
* Fix Travis CI
test=develop
* fix Pass::Apply interface
test=develop
* Fix Travis CI
test=develop
6 years ago
dengkaipeng
193185b840
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shift
6 years ago
Zeng Jinle
5f1c92a81c
Merge pull request #16450 from zhhsplendid/del-redundant-op-var-reg
...
Add SpectralNormGradOpDescMaker
6 years ago
Zeng Jinle
2aa18e2bda
Merge pull request #16496 from sneaxiy/fix_gc_bug
...
Fix gc bug
6 years ago
Sylwester Fraczek
8ece7a9708
fixed url to dataset
...
test=develop
6 years ago
sneaxiy
5656fa9f7c
fix travis ci
...
test=develop
6 years ago
Zeng Jinle
174d0d0b90
Revert "Fix allocator bug"
...
add include headers to fix travis-ci
test=develop
6 years ago
gongweibao
eb83abeac3
Add DGC(Deep Gradient Compression) interface. ( #15841 )
6 years ago
Qiao Longfei
34890fd3b1
fix gpu build for lookup_table_op test=develop
6 years ago
Sylwester Fraczek
fe21578a44
create test for quantized resnet50
...
test=develop
6 years ago
Michał Gallus
2d8b7b3a76
Refine default MKL-DNN Pass order ( #16490 )
...
* Refine default MKL-DNN Pass order
test=develop
* Add comment to default MKL-DNN Pass list
test=develop
6 years ago
Wojciech Uss
09dfc7a2aa
C-API quantization core 2 ( #16396 )
...
* C-API quantization core
test=develop
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
* Decouple Quantizer from AnalysisPredictor
test=develop
* fixes after review
test=develop
* renamed mkldnn quantize stuff
test=develop
* remove ifdef from header file
test=develop
6 years ago
Jiabin Yang
e41d581304
test=develop, fix space_to_depth_doc ( #16293 )
...
* test=develop, fix space_to_depth_doc
* test=develop, refine indent
* test=develop, refine code
* test=develop, add api spec
6 years ago
sneaxiy
4c8254e3bf
revert some loop op revision
...
test=develop
6 years ago
Zeng Jinle
644e8af4cf
Merge pull request #16424 from sneaxiy/fix_allocator_bug
...
Fix allocator bug
6 years ago
sneaxiy
c4c6205268
fix gc bug
...
test=develop
6 years ago
zhoukunsheng
874b5d8362
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into linspace
6 years ago
zhoukunsheng
83c7bca13f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
a55111b869
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into zeros_like
6 years ago
zhoukunsheng
848ec97ab3
test=develop
...
add zeros_like op
6 years ago
sneaxiy
16f0994728
Merge develop
...
test=develop
6 years ago
sneaxiy
63651c1968
fix grad desc maker
...
test=develop
6 years ago
Yihua Xu
57dc3c1943
Disable compare for Issue#16316 ( #16466 )
...
* Disable compare for accuracy issue.
test=develop
* Add todo comments to show more information.
test=develop
6 years ago
Zeng Jinle
c7c6eeb44e
Merge pull request #16409 from sneaxiy/feature/advance_gc
...
Enhance gc to support deleting tensor buffer in advance
6 years ago
Qiao Longfei
33be014535
fix distribute compile problem test=develop
6 years ago
Jiabin Yang
54a73578a8
Feature/install check ( #16044 )
...
* test=develop, add install check
* test=develop, add install check scripts
* test=develop, refine language
* test=develop, add api spec
* test=develop, change cdn to bj to pass ci
6 years ago
Qiao Longfei
b542639dc0
code clean test=develop
6 years ago
wopeizl
c300b1ba69
Tensor index ( #16223 )
...
* extend the slice function for python
test=develop
6 years ago
Jiabin Yang
0d9d25d40f
Feature/refactor layers to Layers ( #16337 )
...
* test=develop, add some Layers and tests
* test=develop, add more layers
* test=develop, add more layers
* test=develop, add force cpu option
* Update test_layers.py
remove pdb
* test=develop, refine code
6 years ago
dengkaipeng
3e352388eb
fix format. test=develop
6 years ago
dengkaipeng
eb2123e12d
fix doc and jit. test=develop
6 years ago
liuwei1031
8d22bc17a4
Memory optimize ( #16410 )
...
* fix cdn issue, test=develop
* fix memory optimize bugs, test=develop
* fix memory optimize bugs, test=develop
* remove add/sub_2 op, test=develop
* disable memory_optimize by default, test=develop
* disable inplace activation in python, test=develop
* fix unittests, test=develop
* fix unittests, test=develop
* bug-fix, test=develop
6 years ago
Xin Pan
f8c279b11c
Merge pull request #16454 from panyx0718/imperative2
...
polish deepCF model to support real dataset
6 years ago
Zhaolong Xing
fa1796a30a
Merge pull request #16330 from NHZlX/merge_anakin_branch_to_dev
...
Cherry-pick from PaddlePaddle:feature/anakin-engine: Anakin subgraph support.
6 years ago
lujun
3f8b2f5ff5
fix multiplex doc, test=develop
6 years ago
sneaxiy
a0f4fefb60
delete source file no_need_buffer_vars_inference.cc
...
test=develop
6 years ago
Qiao Longfei
392e97aae5
fix cpplint test=develop
6 years ago
Qiao Longfei
37f6b9ab7a
fix build test=develop
6 years ago
tensor-tang
1eff834e97
update jitkernel doc ( #16327 )
...
* update jitkernel doc
test=develop
* follow comments
* follow comments
test=develop
6 years ago
Qiao Longfei
30618409db
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor-communicator
6 years ago
Yiqun Liu
98802e1f75
Optimize the implementation of while_op again, for cases when is_test is true. ( #16359 )
...
test=develop
6 years ago
lujun
c34b24ede7
Merge pull request #16425 from junjun315/checkpoint-hotfix
...
Checkpoint hotfix
6 years ago
Wu Yi
9ffd5eecef
test fix fetch bar place for ce ( #16406 )
...
* test fix fetch bar place for ce
* fix ps mode dist train in develop test=develop
* fix style check test=develop
* update test=develop
6 years ago
sneaxiy
318072c26b
add comments of allocator design
...
test=develop
6 years ago
chengduo
4f2278f032
Add doc for CPUPlace CUDAPlace CUDAPinPlace ( #16442 )
...
test=develop
6 years ago
dengkaipeng
1ef30c230d
fix API.spec. test=develop
6 years ago
nhzlx
953bdde058
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
...
test=develop
6 years ago
Tao Luo
e0a3a49096
Merge pull request #16438 from wojtuss/wojtuss/move-cpu-quantize-passes
...
Move cpu_quantize_* passes into mkldnn subfolder
6 years ago
gongweibao
ec6519e806
Fix allreducedep bug ( #16443 )
6 years ago
sneaxiy
78fb3a62e0
fix env variable settting bug
...
test=develop
6 years ago
Qiao Longfei
b65adf7f65
add communicator_send_wait_times
6 years ago
sneaxiy
2d92b6be98
merge develop
...
test=develop
6 years ago
Jiabin Yang
f735102eab
add layer norm to Layers, add transformer test in imperative mode ( #16092 )
...
* add layer norm to Layers, add transformer prepare encoding
* little change
* finish encoder part
* add decoder part
* finish model part
* add test case and part of data feed
* add transformer test
* add to_parameter, add remove in set_attr
* test=develop, fix pos encoding bug, create_parameter with stantard name
* test=develop, rm dropout test in imperative
* test=develop, fix cpu error
* test=develop, fix minize bug
* test=develop, fix one hot not stop gradient
* test=develop, fix one hot not stop gradient
* test=develop, refine parameter name
* test=develop, fix transformer test in imperative mode
* test=develop, fix transformer test in imperative mode
* test=develop, fix boost and mkl download error
* test=develop, fix boost and mkl download error
* test=develop, fix ci and refine code
* test=develop, fix ci and refine code
6 years ago
Xin Pan
fd24ab47ab
polish
...
test=develop
6 years ago
Xin Pan
1f89249a95
update DeepCF model
...
test=develop
6 years ago
sneaxiy
a7d0ac50b8
Merge develop
6 years ago
sneaxiy
7000ec85d9
fix some op grad maker
...
fix ctest eager deletion disable bug
test=develop
6 years ago
nhzlx
45b3766fdf
fix comments
...
test=develop
6 years ago
zhaoyuchen2018
cdb315e9d8
Merge branch 'develop' into docrefine
6 years ago
zhhsplendid
3909108cae
Add SpectralNormGradOpDescMaker
...
Use SpectralNormGradOpDescMaker instead of DefaultGradOpDescMaker
to avoid registering useless variables to improve GPU usage.
test=develop
6 years ago
dengkaipeng
ceb31d30f0
fix formax. test=develop
6 years ago
dengkaipeng
cfef382a85
fix format. test=develop
6 years ago
Zeng Jinle
4cc9809cae
Merge pull request #15799 from sneaxiy/feature/decoupled_reader
...
Try to decouple reader with program_desc
6 years ago
whs
e9bec9369b
[slim] Add quantization strategy and distillation strategy. ( #16408 )
...
* Add fsp operator.
1 Add unitest.
2. Add python API.
3. Add layer test.
* Add quantization strategy.
1. Add API.
2. Add unitest.
* Add distillatoin strategy.
* Add unitest config file for quantization
* Fix Copyright
test=develop
* Fix setup.py
* Fix document of layers.py.
test=develop
* Fix unitest in python3.
test=develop
* Fix documents.
test=develop
* 1. refine fsp op by batched gemm
2. remove unused import
test=develop
* Fix test_dist_se_resnext.
1. disable test distillation.
2. reset framework.py
test=develop
* Enable unitest of distillation after fixing Block._clone_variable
test=develop
* Fix cdn issue.
test=develop
6 years ago
dengkaipeng
d54005a7f4
fix unittest. test=develop
6 years ago
liuwei1031
de3b70a101
fix cdn issue, test=develop ( #16423 )
...
* fix cdn issue, test=develop
* fix cdn issue, test=develop
6 years ago
dengkaipeng
90bd038d35
fix format. test=develop
6 years ago
Qiao Longfei
63acbe7a65
fix bug
6 years ago
zhoukunsheng
d3d31a5894
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into all_any
6 years ago
zhoukunsheng
664c342ca0
test=develop
...
split reduce_all_any_op.h into two files
add unit test for reduce_all, reduce_any
6 years ago
lujun
bc4d1c7246
fix mix input type error, test=develop
6 years ago
Qiao Longfei
0ff1e64fab
fix a bug
6 years ago
zhoukunsheng
43060084a4
test=develop
...
add linspace, modify interface comments in tensor.py, merge with develop branch
6 years ago
Qiao Longfei
0997cf8f65
add more check
6 years ago
sneaxiy
f8ed2c229e
try to fix ci error
...
test=develop
6 years ago
zhoukunsheng
8e9ebebcef
test=develop
...
add linspace op
6 years ago
lujun
18aa59493e
fix mix input type error, test=develop
6 years ago
Wojciech Uss
46677fb080
Move cpu_quantize_* passes into mkldnn subfolder
...
test=develop
6 years ago
dengkaipeng
cfda1fdea7
add attr scale. test=develop
6 years ago
sneaxiy
c20db6357b
split PR
...
test=develop
6 years ago
Zeng Jinle
c64d959343
Merge pull request #16295 from zhhsplendid/zhenghuihuang-dev-2
...
Add support for init_memory and re-allocate_memory
6 years ago
lujun
1b6a2a09e8
fix mix input type error, test=develop
6 years ago
nhzlx
a1d11bb175
fix ci bug: cudnn handler in multi card
...
test=develop
6 years ago
Qiao Longfei
93464b25ac
update async_sparse_param_update_recorder
6 years ago
Qiao Longfei
542b52fac3
fix trainer_id
6 years ago
Qiao Longfei
be0c482304
update trainer_id
6 years ago
sneaxiy
2f54d9f995
Merge develop
...
test=develop
6 years ago
Qiao Longfei
c60f312d1b
add trick
6 years ago
Qiao Longfei
103c9bb376
update rpc_client
6 years ago
sneaxiy
c75a880386
fix windows bug
...
test=develop
6 years ago
sneaxiy
072d95d8f6
Merge develop
...
test=develop
6 years ago
sneaxiy
a93a9eef8f
add op registry type
...
refine gc code
test=develop
6 years ago
dengkaipeng
f45aced59b
add jit test. develop=test
6 years ago
Qiao Longfei
b7661d7e56
add some log
6 years ago
Qiao Longfei
e8fe5186a1
complete parameter_recv
6 years ago
Qiao Longfei
d5c7898201
complete pserver side update
6 years ago
Qiao Longfei
de65398cb8
update transpiler and listen and serv op
6 years ago
whs
2e5831f0dc
[slim] Refine framework of slim and add filter pruning strategy ( #16226 )
...
* First pr of paddle slim.
1. Add framework of paddle slim
2. Add filter pruning strategy
test=develop
* Rename unitest to tests.
test=develop
* Add prettytable into requirements.
test=develop
* Change in_nodes and out_nodes to odered dict.
test=develop
* Remove distillation.
test=develop
* Fix API.spec
test=develop
* Fix unitest.
test=develop
* Fix unitest in windows.
test=develop
* Fix unitest in windows.
test=develop
* Fix unitest.
test=develop
* Hide some functions.
test=develop
* Fix python import in python3.5
test=develop
* Fix compress pass.
test=develop
* Fix unitest of test_dist_ctr.
test=develop
* Enhence flops.
* use os.path.join
* Fix pickle for python3
Fix log and comments.
test=develop
* 1. Remove feed_reader in compress pass
2. Fix cache reader
3. Rename CompressPass to Compressor
4. Add comments for distiller optimizer
5. Remove unused pruner currently
6. Add some comments.
7. Change API.spec
test=develop
* Fix pruning in python3.
test=develop
* Fix unitest in python3.
test=develop
* Fix format in python3.
test=develop
6 years ago
whs
18779b5b8f
[Operator] Add range op. ( #15431 )
...
* Add range op.
test=develop
* Add more unitests.
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
6 years ago
Qiao Longfei
25e2b41729
add AsyncSparseParamUpdateRecorder test
6 years ago
Qiao Longfei
c6e82785aa
init async_sparse_param_update_recorder
6 years ago
phlrain
7dc4a7f4f8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_var_name_in_opt_2
6 years ago
Zhen Wang
ec11135d54
Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass
...
Add channel wise in quant pass.
6 years ago
xiaolil1
e235882c18
Enable MKL-DNN INT8 Concat Kernel. ( #16156 )
...
* Enable INT8 Concat Kernel to improve the performance of MobileNet-SSD.
test=develop
* Optimize UT format.
test=develop
* Fix UT file address issue.
test=develop
* Refine the license year.
test=develop
* Optimize code for new API.
test=develop
* Restructure INT8 Concat kernel.
test=develop
6 years ago
Qiyang Min
171df5b56b
Merge pull request #16303 from junjun315/checkpoint
...
for Checkpoint save and load
6 years ago
Hongyu Liu
e3bca9011c
Merge pull request #16357 from phlrain/fix_concat_check
...
Fix concat check
6 years ago
Hongyu Liu
e5478ab5c8
Merge pull request #16346 from phlrain/add_floordiv_and_mod
...
add elementwise floordiv, mod
6 years ago
chengduo
a6a3b2fbbc
[Speed]Refine ParallelExecutor ( #16190 )
...
* refine parallelExecutor
test=develop
* Polish op_handle
test=develop
* Remove unnecessary op_handle
test=develop
* Fix Travis CI
test=develop
* Fix fetch bug
test=develop
* Remove WaitInputVarGenerated
* Fix OpHandleBase::Run
test=develop
* debug
test=develop
* use origin fetch_op_handle
test=develop
* Revert op_handle_base.cc
test=develop
* Polish code
test=develop
* Fix OpHandleBase::Run
test=develop
* code refine
* test CI and CE
test=develop
* fix OpHandle::Run
test=develop
* refine AllReduceOpHandle
test=develop
* Polish code
test=develop
6 years ago
nhzlx
3df7b98a0f
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into HEAD
6 years ago
nhzlx
f3a2e4b3d8
1. Add ANAKIN_ROOT compile option
...
2. refine trt code
test=develop
6 years ago
phlrain
77a08750e9
add var name in optimizer; test=develop
6 years ago
chengduo
33965527fd
Add unit test for fuse all reduce ( #16354 )
...
* refine fused_all_reduce_op
* add unit test in test_parallel_executor_seresnext
test=develop
6 years ago
Hongyu Liu
18a0f6d97a
Merge pull request #16351 from phlrain/fix_topk_shape_check
...
Fix topk shape check
6 years ago
Hongyu Liu
15444430b0
Merge pull request #16348 from phlrain/fix_squeeze_check
...
fix squeeze shape check
6 years ago
phlrain
5dc9b51994
fix time; test=develop
6 years ago
phlrain
686b8935fe
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_floordiv_and_mod
6 years ago
phlrain
18d107c27a
add floordiv and mod op; test=develop
6 years ago
phlrain
ff112813de
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_concat_check
6 years ago
phlrain
8274d9d733
fix concat shape check; test=develop
6 years ago
Hongyu Liu
0d779f15f6
Merge pull request #16261 from phlrain/fix_sequence_pad_2
...
Fix sequence pad 2
6 years ago
Hongyu Liu
8c81d9949e
Merge pull request #16347 from phlrain/fix_matmul_check
...
fix matmul shape check
6 years ago