Bai Yifan
e2c6bada36
Support dispensable student_loss in PaddleSlim distillation ( #19824 )
...
* support_dispensable_student_loss, test=develop
* add distillation test, test=develop
* fix distillation test non convergence problem, test=develop
* fix test_distillation fail problem, test=develop
6 years ago
Bai Yifan
aa63d5ac6d
Make the distillation process not save teacher variables in PaddleSlim ( #19633 )
...
* split teacher checkpoints with student checkpoints, test=develop
* add unittest for graph.merge(), test=develop
6 years ago
gongweibao
6c2bc29cc0
Fix float16 optimizer. ( #19682 )
...
Fix float16 optimizer
6 years ago
Chen Weihang
73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D ( #19644 )
...
* delete useless raw variables in Conv2D, test=develop
* adjust the vars number in test_graph_wrapper to pass unittest, test=develop
6 years ago
Jie Fang
c6a598a276
init new amp, optimize inserting cast op for batchnorm ( #18596 )
...
init new amp, optimize inserting cast op for batchnorm
6 years ago
123malin
2f037c3189
fix the diff between async mode and async_half mode ( #19535 )
...
* test=develop, communicator merge add => merge average
6 years ago
whs
1c2aae567a
Skip start epoch and end epoch when dumping strategy in PaddleSlim ( #19580 )
...
test=develop
6 years ago
gongweibao
abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. ( #19506 )
...
Change backward_guard to optimize_guard to maximize the allreduce overlap
6 years ago
Zeng Jinle
5dce1da680
remove reset recordio usage ( #19519 )
6 years ago
lidanqing
ba368bf696
clean up intel labeled TODOs ( #19476 )
...
test=develop
6 years ago
Leo Chen
6fb310ae29
Fix bug of getting bool Flags from os.environ ( #19349 )
...
* fix bug of getting bool Flags from os.environ, test=develop
* add empty loss_name in CompiledProgram for inplace grad test, test=develop
6 years ago
liu zhengxi
32598ffd8f
Python infer api update and add unit test ( #19353 )
...
* python inference api supports numpy and add unit test, fix unit test fail in test_slim_int8_googlenet and test_slim_int8_mobilenet
6 years ago
Jiancheng Li
1799c257ad
Update Light-NAS to support latency-aware search ( #19050 )
...
* update light_nas_strategy: add latency constraint
test=develop
* update light_nas_strategy: update get_model_latency
test=develop
* update light_nas_strategy: add more check
test=develop
* update light_nas test
test=develop
* update light_nas test
test=develop
* minor update light_nas test
test=develop
* minor update light_nas test
test=develop
* update light_nas test
test=develop
* update _constrain_func of light_nas_strategy
test=develop
* update _constrain_func of light_nas_strategy
test=develop
* remove unused code
test=develop
6 years ago
Zhen Wang
0fe72469ea
Add the max-pool2d quantization support and the partial quantization support. ( #19310 )
...
* add pool2d quantization support, only for max-pooling.
* add the partial quantization support.
6 years ago
Tao Luo
2f8c7e021f
remove unused inference_transpiler unit-tests ( #19130 )
...
* remove unused inference_transpiler unit-tests
test=develop
* remove InferenceTranspiler usage in quantize_transpiler.py
test=develop
6 years ago
yaoxuefeng
9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics ( #18871 )
...
* add ctr related metric layer test=develop
* add save cache and slots shuffle test=develop
* add save cache and slots shuffle test=develop
* fix error
* fix error
* fix style for ci
* fix for comments
* change SlotsShuffle input to std::strinf for generality
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix stylr
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* fix style
* change non-const reference to pointer
* fix style
* fix style
* fix style test=develop
* fix style test=develop
* add return ins num in ctr metric op
* change dtype to float in metric_op.py
* fix error test=develop
* fix style test=develop
* fix API spec
* fix API spec
* fix API spec test=develop
* add UT test=develop
6 years ago
chengduo
01c7daade7
Add checking for the fetch_list of Executor.run ( #18957 )
...
* update exe.run
6 years ago
whs
c92b78b060
Fix unitest of light nas. ( #18931 )
...
test=develop
6 years ago
tangwei12
d845848341
do some odd jobs ( #18641 )
...
do some odd jobs, test=develop
6 years ago
guru4elephant
5ed713d519
remove ctr reader, all functions are satisfied in dataset ( #18672 )
...
* remove ctr reader, all functions are satisfied in dataset
6 years ago
bingyanghuang
a25be53cb5
QAT int8 MKL-DNN transformation pass with MUL ( #18322 )
6 years ago
Jie Fang
2b4ef509ea
init custom black white list ( #18377 )
...
test=develop
6 years ago
Wojciech Uss
8ed819d8e4
Call the test_slim_int8_* tests through absolute path ( #18386 )
...
test=develop
6 years ago
翟飞跃
19da59ed3f
Remove all the code, API and doc of MKL-DNN INT8v1 ( #18347 )
6 years ago
whs
1bdfd2eb85
Fix checkpoint of Light-NAS ( #18330 )
...
Socket can't be pickled.
test=develop
6 years ago
Hongyu Liu
df2eee71d8
Sequence mask support tensor ( #18249 )
...
* sequnce mask support max length tensor input; test=develop
* add rnn_impl.py; test=develop
* add basic gru lstm unittest; test=develop
* fix api spec; test=develop
* fix sequence_mask op bug;
test=develop
test=document_preview
* change +-*x to elmentwise_op; test=develop
* add mkl flag; test=develop
* fix rnn impl bug; test=develop
* update api spec; test=develop
* fix doc bug; test=develop
* fix lstm bugs; test=develop
6 years ago
Jie Fang
172c2facef
init black/white lists ( #17847 )
...
test=develop
6 years ago
songhao
6b3d96254d
fix some bug when merge sparse embedding parameters, test=develop ( #18223 )
...
1. fix the bug that out_put_var in SaveSelectedRows would be empty string
2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference
3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.
6 years ago
Wojciech Uss
36b60e240a
Enable MKL-DNN for slim FP32 vs. INT8 tests ( #18214 )
...
* Enable MKL-DNN for slim FP32 vs. INT8 tests
test=develop
* added test for MobileNetV1 with MKL-DNN
test=develop
6 years ago
bingyanghuang
976cf46010
slim QAT MKL-DNN readme ( #18187 )
...
* QAT doc first version
* follow the comments, test=develop
* update the benchmark, test=develop
* Change to channel-wise, test=develop
6 years ago
chengduo
5588b923f3
Add multi process reader ( #18115 )
...
* add multi process reader
test=develop
6 years ago
翟飞跃
802ea50956
fix spelling errors ( #17941 )
...
* fix spelling errors; test=develop
* Update API.spec
update md5
* Update API.spec
* change the order of api;test=develop
6 years ago
翟飞跃
78441c5449
add mkldnn Int8v2 slim doc ( #17909 )
6 years ago
翟飞跃
c2fb9b906a
change dirname mobilenet to mobilenetv1 ( #18160 )
6 years ago
bingyanghuang
5cf7e7415d
update mkldnn int8v2 doc ( #18177 )
6 years ago
Wojciech Uss
ca5642c850
unify FP32 vs. INT8 comparison tests output ( #18111 )
...
test=develop
6 years ago
Sylwester Fraczek
accb132f0f
fix slim int8 mkldnn multithreading issue ( #18009 )
6 years ago
chengduo
b5a1c1463d
Update CPU_NUM config ( #18059 )
...
* update CPU_NUM config
test=develop
6 years ago
Wojciech Uss
78e932862c
Added unit test for QAT FP32 & INT8 comparison ( #17814 )
...
* added unit test for QAT FP32 & INT8 comparison
test=develop
* enabled other models and updated filenames
test=develop
* added accuracy check and multiple batch handling
test=develop
* removed quantization_mkldnn_pass.py
test=develop
* cleanup
test=develop
* updated model paths
test=develop
* renamed tests without MKL-DNN
test=develop
* fix reusing mkldnn pool2d primitive
test=develop
* add performance measuring
test=develop
* fix accuracy statistics
test=develop
* removed non-mkldnn tests
test=develop
* added conv2d_depthwise->conv2d mkldnn transformation
test=develop
* format update
test=develop
* fixed creating key for pool2d grad
test=develop
* added pass
* Fix the accuracy issue while using float precision to get the scale.
test=develop
* Fix the format issue when 'X' is not nchw.
test=develop
* removed output comparing and changed number of images
test=develop
* cmake and comment fix
test=develop
* updated acc threshold for QAT comparison tests
test=develop
* added OMP_NUM_THREADS setting
test=develop
* enable all QAT INT8 tests
test=develop
* restored upstream version of a file
test=develop
* modified directory names
test=develop
6 years ago
Huihuang Zheng
0bf2535158
Cherry-pick: fix random CI failure. ( #18011 )
...
* Cherry-pick fix random Python3 CI failure.
In some tests, SWEs used "print('xxx').format('xxx')". The syntax
is only supported in Python2, not python3. However, since those
lines are related to data download, if the CI machines already have
the data, it passes CI tests. That causes random failure.
* Cherry-pick: disable CUDNN case of test_warpctc_op
Also temporary disable a unit test. The test will be fixed under high priority.
6 years ago
Kaipeng Deng
96ee528e3e
fix logging basicConfig cannot be setting after import paddle ( #17786 )
...
* fix logging unable. test=develop
* unset sys.stdout for stream handler. test=develop
* fix newly add basicConfig. test=develop
* fix import error. test=develop
6 years ago
whs
82bfa0a5ba
Enhence auto pruning and fix save/load graph when using py_reader
...
1. Enhence auto pruning.
2. Fix save/load graph when using py_reader.
test=develop
6 years ago
bingyanghuang
90ebce9ead
QAT int8 MKL-DNN transformation pass ( #17819 )
6 years ago
翟飞跃
209a3f4e09
add log into slim mkldnn int8 test ( #17926 )
6 years ago
whs
5df65e506d
Add Ligth-NAS for PaddleSlim ( #17679 )
...
* Add auto pruning strategy.
1. Fix compressor.
2. Enhence graph executor.
3. Add SAController
4. Add auto pruning strategy.
5. Add unitest for auto pruning strategy.
test=develop
* Init light-nas
* Add light nas.
* Some fix.
test=develop
* Fix sa controller.
test=develop
* Fix unitest of light nas.
test=develop
* Fix setup.py.in and API.spec.
test=develop
* Fix unitest.
1. Fix unitest on windows.
2. Fix package importing in tests directory.
* 1. Remove unused comments.
2. Expose eval_epoch option.
3. Remove unused function in search_agent.
4. Expose max_client_num to yaml file.
5. Move flops constraint to on_epoch_begin function
test=develop
* Fix light nas strategy.
test=develop
* Make controller server stable.
test=develop
* 1. Add try exception to compressor.
2. Remove unitest of light-nas for windows.
test=develop
* Add comments
Enhence controller
test=develop
* Fix comments.
test=develop
6 years ago
翟飞跃
993c703bcc
INT8 MKL-DNN v2 integrate to slim ( #17634 )
...
* refactor PR 16865
* delete mergetool files
* test=develop
* test=develop
* test=develop
* test=develop
* create dir for int8 model before call SaveOptimModel
* test=develop
* mkldnn int8 only support linux; test=develop
* refine code; test=develop
* remove comment; test=develop
* refine code; test=develop
* fix bug; test=develop
* add exception for mkldnn_post_training_strategy
* reuse int8v2 CAPI dataset; test=develop
* fix accuracy check bug; test=develop
* remove tab
* convert files to unix format
* test=develop
* reduce CI time;test=develop
* reduce CI time and refine code;test=develop
* refine comment; test=develop
* add cmake FLAGS;test=develop
* remove predict_num;test=develop
6 years ago
Yiqun Liu
8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers ( #17236 )
...
* Enhance fused_elementwise_activation op.
test=develop
* Move the api fused_elementwise_activation to contrib.
test=develop
* Add including files.
test=develop
* Add the support of sigmoid in fused_elementwise_activetion op.
* Update API.spec.
test=develop
6 years ago
chengduo
9322216170
Add data distributed_sampler ( #17573 )
...
* add data parallel batch
6 years ago
Zhen Wang
3398f99608
Adding AddQuantDequantPass for TensorRT int8 ( #17529 )
...
* add quant_dequant_pass, test=develop
* Add quant_dequant before some ops, such as the elementwise_add op. This is required by TensorRT. test=develop
6 years ago
Yibing Liu
f9796b1249
Add LAMB Optimizer support ( #17489 )
...
* Add LAMB optimizer
* Expose LAMB Optimizer's APIs
test=develop, test=document_preview
* Cleanup code & doc
test=develop, test=document_preview
* Update lamb optimizer's formula
test=develop
6 years ago
Jie Fang
30e178fa2c
init auto loss scaling ( #17194 )
...
* init auto loss scaling
test=develop
* change API.spec
* change ifelse to switch and use reduce_sum to optimize checking isfinite
test=develop
* Remove redundant code
test=develop
6 years ago
guomingz
6d1d7c8acf
Fix the KL algorithm bug when calculated the size of tensor. ( #17198 )
...
* resolve #17184
Fix the KL algorithm bug when calculated the size of tensor.
* Add the missing test=develop.
test=develop
6 years ago
Zhen Wang
65541d83b0
add scale pass for calculating the output scales.test=develop ( #17259 )
6 years ago
Zhen Wang
a40121e4c8
fix the initialization process error. test=develop ( #17213 )
6 years ago
wopeizl
06325c4de1
disable the test_distillation_strategy temporarily ( #17227 )
...
test_distillation_strategy always failed on a machine with 4 gpus only, disable temporarily and need to figure out the root cause and add it back later
6 years ago
guomingz
e4a52e082f
resolve #17159 ( #17172 )
...
Update the folder name generation mechanism for saving the quantized model and weights.
The folder name would be unique by adding the timestamp postfix.
test=develop
6 years ago
Yibing Liu
beda78258f
Init mixed precision training interface ( #16856 )
...
* Init mixed precision training interface
* Add fp16 test script
test=develop
* All initializers support float16
test=develop
* Code cleanup & add more code annotations
test=develop
* Update API spec
test=develop
* Add usage example in doc
test=develop
6 years ago
Zhen Wang
b8c166f632
Merge pull request #17029 from wzzju/add_graph_checkpoint
...
add checkpoint functions for graph. test=develop
6 years ago
Zhen Wang
bf6b9d6d37
add checkpoint functions for graph. test=develop
6 years ago
Wang, Chuanqi
45ebafc33e
Split test_calibration test to two tests ( #16977 )
...
* Split test_calibration test to two tests
test=develop
* Modify CMakeLists file
test=develop
6 years ago
whs
486f7d8ed6
Restore quantization and distillation stategy before loading persistables. ( #16958 )
...
test=develop
6 years ago
Tao Luo
bc037c13c7
use multi-thread to speedup CI tests
...
test=develop
6 years ago
chuanqiw
1818d1d9f0
Reduce test_calibration test time by using large batchsize
...
test=develop
6 years ago
AIFollowers
fb1ae722d1
Fix model_stat importing in paddle/fluid/contrib
6 years ago
chengduo
bb80dae7d0
Add DecoupledWeightDecay ( #16427 )
...
* Add DecoupledWeightDecay
6 years ago
whs
73c4f2b7b6
Fix distillation for soft label. ( #16538 )
...
test=develop
6 years ago
Zhen Wang
4c1ec41de3
Merge pull request #16531 from wanghaoshuang/quan_ck
...
[slim] Fix checkpoint of quantization strategy.
6 years ago
AIFollowers
e18ab78f67
add model_stat.py ( #16512 )
...
* Add a tool to summary model's PARAMS, FLOPs in paddle/fluid/contrib.
6 years ago
wanghaoshuang
d41b623a72
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into quan_ck
...
test=develop
6 years ago
wanghaoshuang
6db7c2a500
Fix checkpoint of quantization.
6 years ago
Zhen Wang
f86429dbd9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_slim_quant_bugs
...
test=develop
6 years ago
whs
679a4c28fc
Fix lost of learning rate variable in distillatoin when using lr decay. ( #16471 )
...
test=develop
6 years ago
Zhen Wang
6b854f3e1f
fix the save_in_nodes bug.
6 years ago
Zhen Wang
183bacebe3
clean codes and fix some bugs. test=develop
6 years ago
Tao Luo
1b4e4e7ef7
Merge pull request #16453 from chuanqi129/calibration_readme_refine
...
Update INT8 calibration README
6 years ago
liuwei1031
8d22bc17a4
Memory optimize ( #16410 )
...
* fix cdn issue, test=develop
* fix memory optimize bugs, test=develop
* fix memory optimize bugs, test=develop
* remove add/sub_2 op, test=develop
* disable memory_optimize by default, test=develop
* disable inplace activation in python, test=develop
* fix unittests, test=develop
* fix unittests, test=develop
* bug-fix, test=develop
6 years ago
chuanqiw
c512516ff4
Update INT8 calibration README
...
test=develop
6 years ago
Zhen Wang
27d05203e7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_quan_hang
...
test=develop
6 years ago
Zhen Wang
1c11f817e9
Use the resolve hazard method.
6 years ago
Wang, Chuanqi
85e1cc1e02
Update Readme with new accuracy and performance data measured on 6271 ( #16437 )
...
* Update Readme with new accuracy and performance data measured on 6271
test=develop
* Modify tables and example command
test=develop
6 years ago
whs
e9bec9369b
[slim] Add quantization strategy and distillation strategy. ( #16408 )
...
* Add fsp operator.
1 Add unitest.
2. Add python API.
3. Add layer test.
* Add quantization strategy.
1. Add API.
2. Add unitest.
* Add distillatoin strategy.
* Add unitest config file for quantization
* Fix Copyright
test=develop
* Fix setup.py
* Fix document of layers.py.
test=develop
* Fix unitest in python3.
test=develop
* Fix documents.
test=develop
* 1. refine fsp op by batched gemm
2. remove unused import
test=develop
* Fix test_dist_se_resnext.
1. disable test distillation.
2. reset framework.py
test=develop
* Enable unitest of distillation after fixing Block._clone_variable
test=develop
* Fix cdn issue.
test=develop
6 years ago
Zhen Wang
2ccbfd5e10
Fix some bugs for quantization passes.
6 years ago
liuwei1031
de3b70a101
fix cdn issue, test=develop ( #16423 )
...
* fix cdn issue, test=develop
* fix cdn issue, test=develop
6 years ago
chuanqiw
431068c9ca
Enhance test calibration script on accuracy assert
...
test=develop
6 years ago
whs
2e5831f0dc
[slim] Refine framework of slim and add filter pruning strategy ( #16226 )
...
* First pr of paddle slim.
1. Add framework of paddle slim
2. Add filter pruning strategy
test=develop
* Rename unitest to tests.
test=develop
* Add prettytable into requirements.
test=develop
* Change in_nodes and out_nodes to odered dict.
test=develop
* Remove distillation.
test=develop
* Fix API.spec
test=develop
* Fix unitest.
test=develop
* Fix unitest in windows.
test=develop
* Fix unitest in windows.
test=develop
* Fix unitest.
test=develop
* Hide some functions.
test=develop
* Fix python import in python3.5
test=develop
* Fix compress pass.
test=develop
* Fix unitest of test_dist_ctr.
test=develop
* Enhence flops.
* use os.path.join
* Fix pickle for python3
Fix log and comments.
test=develop
* 1. Remove feed_reader in compress pass
2. Fix cache reader
3. Rename CompressPass to Compressor
4. Add comments for distiller optimizer
5. Remove unused pruner currently
6. Add some comments.
7. Change API.spec
test=develop
* Fix pruning in python3.
test=develop
* Fix unitest in python3.
test=develop
* Fix format in python3.
test=develop
6 years ago
Zhen Wang
ec11135d54
Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass
...
Add channel wise in quant pass.
6 years ago
Zhen Wang
8965819fbb
rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
6 years ago
Zhen Wang
ec88b6cc5a
add channel wise quantization in ir pass.
6 years ago
tangwei12
8ea4218ce1
update load persistables for increment, test=develop ( #15576 )
...
* update load persistables for increment, test=develop
* update load persistables for increment, test=develop
* update API Spec, test=develop
* update API Spec, test=develop
* add doc, test=develop
* add doc, test=develop
* Update lookup_table_utils.py
* Update API.spec
* Update lookup_table_utils.py
test=develop
* Update API.spec
test=develop
* fix api spec
* Update lookup_table_utils.py
test=develop
6 years ago
achao2013
81b4fad8b9
add moving average absmax op and fix bug ( #15155 )
...
* Add moving average absmax op in quantilize-aware training.
6 years ago
Zhen Wang
41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
...
Fix NaN bugs for static quantization strategy (mutil-cards train).
6 years ago
Zhen Wang
b8d1f5038e
Add the executor test for the graph clone API. test=develop
6 years ago
Zhen Wang
4ab0607634
fix NaN bugs.
6 years ago
Zhen Wang
ac6ef06ffa
Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang
1b9c8d5f06
add clone function for IrGraph. test=develop
6 years ago
Zhen Wang
3cf0ee414d
update some details. test=develop
6 years ago
Zhen Wang
0cb50bb983
avoid ce fails on windows.
6 years ago
Zhen Wang
7c8f7df2fe
add some op_des funs to IrOpNode and add some var_des funs to IrVarNode. test=develop
6 years ago
Zhen Wang
33f99d6197
add IrNode&IrVarNode&IrOpNode. test=develop
6 years ago
Zhang, Guoming
e5d3d7c63d
resolve #15724
...
1.Remove the code for setting mkldnn environment in the test_calibration.py;
2.Update the cmake file for MKLDNN environment enabling;
3.Update the INT8 inference doc.
test=develop
6 years ago
Zhen Wang
bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
6 years ago
sneaxiy
7e399b0628
rename
...
test=develop
6 years ago
sneaxiy
42f6d0f899
modify API.spec
...
test=develop
6 years ago
sneaxiy
f85245b409
test=develop
6 years ago
WangZhen
28dfad5e27
fix some bugs about python3. test=develop
6 years ago
WangZhen
2175292634
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
...
test=develop
6 years ago
WangZhen
a7efab7ec1
add comments for public API. test=develop
6 years ago
Haihao Shen
b5ebca47a3
Add INT8 calibration README ( #15548 )
...
* Add calibration README; test=develop
6 years ago
Haihao Shen
1b8047b712
Add INT8 calibration support in Paddle package ( #15569 )
...
* Add INT8 calibration support in Paddle package; test=develop
6 years ago
Haihao Shen
294d594450
Enable performance measurement in INT8 calibration unit test ( #15560 )
...
* Enable performance measurement in INT8 calibration unit test
6 years ago
WangZhen
0db41a9c44
add op_role attr when creating op node.
6 years ago
WangZhen
c67b29c178
fix some bugs of graph.to_program and get_pass.
6 years ago
Haihao Shen
f82515800c
Enable INT8 Calibration Unit Test for MobileNet-V1 ( #15539 )
...
* Enable mobilenet UT in separate test class; use download cache by paddle download utility and cache unzip; and fix typo; test=develop
* Extract cache_unzipping function for reuse; format code style; test=develop
* Simplify the test code by define a combined function for both downloading and unzipping; test=develop
6 years ago
WangZhen
c64f22048a
add convert_to_int8 pass and transform_for_mobile pass and their UTs.
6 years ago
WangZhen
c8095eeb82
add freeze pass, and UT is passed.
6 years ago
Haihao Shen
2a82c5651e
Refine INT8 calibration API; shorten the iteration number to reduce test time; test=develop
6 years ago
乔龙飞 Qiao Longfei
c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
...
Optimize cpp reader
6 years ago
WangZhen
dde19a0ff8
add quantization freeze pass.
6 years ago
guomingz
dbdaf15ca0
[V1.3] Add the calibration tool code for int8 inference and focus test. ( #15062 )
...
* Add the calibration tool code for int8 inference and focus test.
* Fix the calibration tool per the review comments.
test=develop
* Update the calibrator doc and remove extra line.
* Fix the invalid is_negative_input attr set on Mobilenet.
* Add the comments and fix the format issue.
test=develop
* Update the CMakelist.txt for Calibration PR.Disable the Calibration UT if not enable MKLDNN.
test=develop
* Update the CMakeList.txt.
test=develop
* Disable the test_calibration case on WIN and MAC.
test=develop
* Add the missing brackets.
test=develop
* Remove the outdated map operator which not supported on Python3.
test=develop
* Fix the style issue.
test=develop
* 1.Update the CMakeList.txt to disable calibration tool ut when the WITH_MKL is not set;
2.Add the workaround to enable the FLAGS_use_mkldnn for PR_CI(PADDLE).
test=develop
* Fix the typo and format the License header.
test=develop
* 1.Add and Update TODOs per review comments.
2.Code clean.
test=develop
6 years ago
Qiao Longfei
119a3d4deb
update comment test=develop
6 years ago
WangZhen
3b668c1574
Update some comments in the quantization transform pass. test=develop
6 years ago
WangZhen
b913463e83
Update according to the reviewers' suggestion. test=develop
6 years ago
WangZhen
3ce6172052
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
WangZhen
59e5cc51d6
Add quantization transform pass and UT.
6 years ago
Qiao Longfei
cf0a057981
add document for ctr reader
...
test=develop
6 years ago
Qiao Longfei
45578c1b48
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Dun
9f8f0fc2d3
Memory optimization of depthwise conv op and group norm op ( #15313 )
...
* mem opt
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine code test=develop
* refine with cub test=develop
* fix mkldnn test && remove comments && test=develop
* polish code && test=develop
* add only_forward test && test=develop
6 years ago
WangZhen
e2ff300b02
add UT for quantization.
7 years ago
WangZhen
451896fce4
init quantization.
7 years ago
Qiao Longfei
edad60e612
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
whs
938705745e
Init paddle slim ( #14834 )
...
* Init slim.
* Remove distillation demo.
* Fix import errors.
test=develop
* Fix some issues.
test=develop
* Fix configs.
test=develop
* Modify API.spec.
test=develop
* Fix format.
test=develop
* Fix format.
test=develop
* Add some comments.
7 years ago
tangwei12
45dd3491db
lookup table utils fix ( #14730 )
...
* fix utils config, test=develop
* reweite tools in lookup table utils, test=develop
* merge develop, test=develop
* add copy in setup, test=develop
* update api spec, test=develop
* add doc, test=develop
* prettifying doc, test=develop
* Update API.spec
update api.spec, test=develop
* Update lookup_table_utils.py
test=develop
* Update lookup_table_utils.py
test=develop
* merge develop, test=develop
* merge develop, test=develop
* init fix, test=develop
* fix in downloads, test=develop
* fix in setup.in, test=develop
7 years ago
heqiaozhi
caa6b59677
add hdfs_utils & helper & node doc
7 years ago
heqiaozhi
7bd16e3afa
fix some bug & add log
7 years ago
heqiaozhi
5d3ecbfdf5
fix hdfs bug
7 years ago
heqiaozhi
009c7cf6cc
add finialize
7 years ago
heqiaozhi
57ac412b98
download data
7 years ago
Qiao Longfei
daba57f752
complete ctr_reader
7 years ago
Qiao Longfei
978fd6800c
update ctr_reader.py
7 years ago
Qiao Longfei
668ae9083e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
7 years ago
tangwei12
3639d99f99
Fix save and load lookup table/optimizer vars ( #14301 )
...
* fix mkdir conflict
* fix load/save lookup tables
test=develop
* add lookup_table_utils
* fix load optimize vars on pserver
* delete lookup table utils
* fix save and load lookup tables
* fix load optimizer var
* fix load optimizer var, test=develop
* fix python 3 style, test=develop
* move lookup_table_utils to contrib utils
7 years ago
tangwei12
dd6fd4c747
Utils for download and upload files with HDFS ( #14473 )
...
* add hdfs utils
* add hdfs utils
* test=develop
* update hdfs utils and add demo
* fix multi_download return local files
* test=develop
* add sync multi upload, test=develop
7 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
7 years ago
peizhilin
1f12ba6192
gpu support, fix build issue:
...
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string
2. comment out some ops which not supported on windows
3. cuda libs may not be correctly linked to target on windows
7 years ago
Qiao Longfei
aff54ef735
add ctr data
7 years ago
Dang Qingqing
f63ab5619b
Fix conflict.
7 years ago
Dang Qingqing
8f5d918a19
Disable one test in test_quantize_transpiler.
7 years ago
qingqing01
748be49e77
Fix random fail in Python3 ( #13666 )
7 years ago
Dang Qingqing
209f799f91
test,
...
test=develop
7 years ago
qingqing01
479ad4bb92
Merge branch 'develop' into quantize_transpiler_update
7 years ago