Commit Graph

390 Commits (develop)

Author SHA1 Message Date
tianshuo78520a 433cef03e5
fix typo word (#22784)
6 years ago
ShenLiang 3132681e8a
add partial_sum op in contrib (#22292)
6 years ago
ShenLiang e136661304
add partial_concat op in contrib (#22528)
6 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
6 years ago
Wojciech Uss 405bb94bd1
Fix batch_num default value (#22682)
6 years ago
bingyanghuang 6abeb5c54d
fix ernie accuracy doc (#22671)
6 years ago
Wojciech Uss fce37bc593
Doc update with Ernie QAT INT8 benchmarking (#22519)
6 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
6 years ago
cc d143f70a09 Post_training_quantization support set quant 8/16 bits (#22492)
6 years ago
cc 197913ebe1
Add weight quantization in post_training_quanzitaion (#22445)
6 years ago
joanna.wozna.intel 3099d9d47c Restore requantize squash (#22399)
6 years ago
juncaipeng b339dff231
fix use cache file, test=develop (#22240)
6 years ago
juncaipeng 8f7372ca81
add mul and matmul quantization, test=develop (#22054)
6 years ago
Bai Yifan a46bb2e6ab
Fix slim log format, test=develop (#22281)
6 years ago
songyouwei 83037e55ca named_sublayers and named_parameters (#21868)
6 years ago
gongweibao 5e07db15e6
fix init scaling value test=develop (#22145)
6 years ago
Chengmo 418abc92f4
Update pyramid related OP (#21372)
6 years ago
zhoushiyu cee2ccb078
add shuffle batch op (#21674)
6 years ago
lidanqing 9dff56e8e2 change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895)
6 years ago
juncaipeng 8b74fc4fa7
Fix post training quantization (#21745)
6 years ago
Youwei Song f6144d8463 remove build_once & name_scope (#21131)
6 years ago
juncaipeng 5c4106023c
disable qat int8 test for now, test=develop (#21696)
6 years ago
juncaipeng 52f38877e4 Fix ci bug for deleting data files when other test is running (#21661)
6 years ago
lidanqing fbf9eca0d3 QAT Int8 document (#21360)
6 years ago
juncaipeng 1f57ac1241
delete concat in AddQuantDequantPass, test=develop (#21454)
6 years ago
lidanqing c0aa13672e Fp32 vs int8 qat C++ performance (#21244)
7 years ago
Liufang Sang 1840c1652c add config file to avoid load checkpoint test=develop (#21373)
7 years ago
itminner 07e6a94268 paddleslim quantization skip pattern support list of string (#21141)
7 years ago
Zhen Wang be2e3e67d9
Fix some typos in AMP. (#21354)
7 years ago
juncaipeng 84865b806b add resnet50 test for post trainint quantization, test=develop (#21272)
7 years ago
Zeng Jinle b9f8ae8494
Add global value getter setter (#21285)
7 years ago
juncaipeng 29b63f0aa1
support set model_filename and params_filename in post_training_quantization, test=develop (#21213)
7 years ago
Liufang Sang c91cb6c550 fix load checkpoint error in test_reader (#20924)
7 years ago
juncaipeng 00b11a4a1e
Support more ops in post training quantization, test=develop (#21073)
7 years ago
joanna.wozna.intel 37e0e7a96b QAT int8 accuracy little improvement (#21074)
7 years ago
Bai Yifan bd8b0ebaba fix distiller typo, test=develop (#21070)
7 years ago
juncaipeng 2c07727fb0
delete test resnet50 in post train quantization to avoid timeout error, test=develop (#21081)
7 years ago
juncaipeng fa522dffa0
Fix bug in add_quant_dequant_pass, test=develop (#21018)
7 years ago
juncaipeng 175ba39c03
Add post_training_quantization (#20800)
7 years ago
gongweibao 3255fe69bb Add custom black variable name set in amp interface. (#20875)
7 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
7 years ago
Bai Yifan fd5321b3f3 modify slim print precision to round(,6), test=develop (#20833)
7 years ago
whs fa67e6e83e
Fix unitest of pruning in python3 env. (#20825)
7 years ago
bingyanghuang fd49ebcbd8 update int8 benchmark with 6271 data, test=develop test=document_fix (#20736)
7 years ago
wangguanzhong 9a3e22aad4
move nms2 to contrib, test=develop (#20709)
7 years ago
juncaipeng f201b465ec
Move pool2d to add_quant_dequant_pass, test=develop (#20586)
7 years ago
Bai Yifan ffec9195e4
soft_label_distiller fix, test=develop (#20645)
7 years ago
Liufang Sang bc4af386f4 [PaddleSlim] refine slim reader to support dataloader (#20604)
7 years ago
gongweibao 1d82025e89
Add interface so user can get scaled loss when they use customized loss. (#20571)
7 years ago
翟飞跃 36acfaeeda Add fused_embedding_seq layer into fluid.contrib (#19771)
7 years ago
Guo Sheng bd99df715a
Fix basic_gru and docs of gru_unit and dynamic_gru (#19393)
7 years ago
bingyanghuang a5f530a8f9 Update QAT MKL-DNN readme doc (#20029)
7 years ago
Liufang Sang 86c2c362ae fix fuse_reduce_op quantization bug (#20306)
7 years ago
Michał Gallus 540935a825 [Bug-fix][1.6] Improve QAT accuracy (#20174)
7 years ago
gongweibao 7b9e33972a
delete backward return list test=develop (#20294)
7 years ago
xiteng1988 3785f08f68 fix next_tokens of controller (#20060)
7 years ago
bingyanghuang 9de6772510 Follow comment of Merged QAT PR 18970 (#19979)
7 years ago
Bai Yifan 28a559de12
fix PyReader._iterable usage, test=develop (#20045)
7 years ago
whs 3ea2b661c0
Make PaddleSlim support PyReader (#19995)
7 years ago
Wojciech Uss 4286a6270d Add support for new QAT models (#18970)
7 years ago
jhjiangcs 766bd529d1 add optimizer:dpsgd,test=develop (#19915)
7 years ago
whs bdb3e376d0
[PaddleSlim] Enhence compressor api in PaddleSlim (#19894)
7 years ago
juncaipeng b0ceed6fb4
add fake_quant_dequant_op for average pool2d, test=develop (#19880)
7 years ago
Tao Luo a4919d3688
move tree_conv to fluid.contrib.layers (#19918)
7 years ago
Aurelius84 418a0967f3
move match_matrix var_conv2d et.al api into fluid.contrib test=develop (#19859)
7 years ago
Jie Fang d9db94d752 Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714)
7 years ago
Bai Yifan e2c6bada36 Support dispensable student_loss in PaddleSlim distillation (#19824)
7 years ago
Bai Yifan aa63d5ac6d Make the distillation process not save teacher variables in PaddleSlim (#19633)
7 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
7 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
7 years ago
Jie Fang c6a598a276 init new amp, optimize inserting cast op for batchnorm (#18596)
7 years ago
123malin 2f037c3189
fix the diff between async mode and async_half mode (#19535)
7 years ago
whs 1c2aae567a
Skip start epoch and end epoch when dumping strategy in PaddleSlim (#19580)
7 years ago
gongweibao abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
7 years ago
Zeng Jinle 5dce1da680 remove reset recordio usage (#19519)
7 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
7 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ (#19349)
7 years ago
liu zhengxi 32598ffd8f
Python infer api update and add unit test (#19353)
7 years ago
Jiancheng Li 1799c257ad Update Light-NAS to support latency-aware search (#19050)
7 years ago
Zhen Wang 0fe72469ea
Add the max-pool2d quantization support and the partial quantization support. (#19310)
7 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
7 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
7 years ago
chengduo 01c7daade7
Add checking for the fetch_list of Executor.run (#18957)
7 years ago
whs c92b78b060
Fix unitest of light nas. (#18931)
7 years ago
tangwei12 d845848341
do some odd jobs (#18641)
7 years ago
guru4elephant 5ed713d519
remove ctr reader, all functions are satisfied in dataset (#18672)
7 years ago
bingyanghuang a25be53cb5 QAT int8 MKL-DNN transformation pass with MUL (#18322)
7 years ago
Jie Fang 2b4ef509ea init custom black white list (#18377)
7 years ago
Wojciech Uss 8ed819d8e4 Call the test_slim_int8_* tests through absolute path (#18386)
7 years ago
翟飞跃 19da59ed3f Remove all the code, API and doc of MKL-DNN INT8v1 (#18347)
7 years ago
whs 1bdfd2eb85
Fix checkpoint of Light-NAS (#18330)
7 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
7 years ago
Jie Fang 172c2facef init black/white lists (#17847)
7 years ago
songhao 6b3d96254d fix some bug when merge sparse embedding parameters, test=develop (#18223)
7 years ago
Wojciech Uss 36b60e240a Enable MKL-DNN for slim FP32 vs. INT8 tests (#18214)
7 years ago
bingyanghuang 976cf46010 slim QAT MKL-DNN readme (#18187)
7 years ago
chengduo 5588b923f3
Add multi process reader (#18115)
7 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
7 years ago
翟飞跃 78441c5449 add mkldnn Int8v2 slim doc (#17909)
7 years ago
翟飞跃 c2fb9b906a change dirname mobilenet to mobilenetv1 (#18160)
7 years ago
bingyanghuang 5cf7e7415d update mkldnn int8v2 doc (#18177)
7 years ago
Wojciech Uss ca5642c850 unify FP32 vs. INT8 comparison tests output (#18111)
7 years ago
Sylwester Fraczek accb132f0f fix slim int8 mkldnn multithreading issue (#18009)
7 years ago
chengduo b5a1c1463d
Update CPU_NUM config (#18059)
7 years ago
Wojciech Uss 78e932862c Added unit test for QAT FP32 & INT8 comparison (#17814)
7 years ago
Huihuang Zheng 0bf2535158
Cherry-pick: fix random CI failure. (#18011)
7 years ago
Kaipeng Deng 96ee528e3e
fix logging basicConfig cannot be setting after import paddle (#17786)
7 years ago
whs 82bfa0a5ba
Enhence auto pruning and fix save/load graph when using py_reader
7 years ago
bingyanghuang 90ebce9ead QAT int8 MKL-DNN transformation pass (#17819)
7 years ago
翟飞跃 209a3f4e09 add log into slim mkldnn int8 test (#17926)
7 years ago
whs 5df65e506d
Add Ligth-NAS for PaddleSlim (#17679)
7 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
7 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
7 years ago
chengduo 9322216170
Add data distributed_sampler (#17573)
7 years ago
Zhen Wang 3398f99608
Adding AddQuantDequantPass for TensorRT int8 (#17529)
7 years ago
Yibing Liu f9796b1249
Add LAMB Optimizer support (#17489)
7 years ago
Jie Fang 30e178fa2c init auto loss scaling (#17194)
7 years ago
guomingz 6d1d7c8acf Fix the KL algorithm bug when calculated the size of tensor. (#17198)
7 years ago
Zhen Wang 65541d83b0
add scale pass for calculating the output scales.test=develop (#17259)
7 years ago
Zhen Wang a40121e4c8
fix the initialization process error. test=develop (#17213)
7 years ago
wopeizl 06325c4de1
disable the test_distillation_strategy temporarily (#17227)
7 years ago
guomingz e4a52e082f resolve #17159 (#17172)
7 years ago
Yibing Liu beda78258f
Init mixed precision training interface (#16856)
7 years ago
Zhen Wang b8c166f632
Merge pull request #17029 from wzzju/add_graph_checkpoint
7 years ago
Zhen Wang bf6b9d6d37 add checkpoint functions for graph. test=develop
7 years ago
Wang, Chuanqi 45ebafc33e Split test_calibration test to two tests (#16977)
7 years ago
whs 486f7d8ed6
Restore quantization and distillation stategy before loading persistables. (#16958)
7 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
7 years ago
chuanqiw 1818d1d9f0 Reduce test_calibration test time by using large batchsize
7 years ago
AIFollowers fb1ae722d1 Fix model_stat importing in paddle/fluid/contrib
7 years ago
chengduo bb80dae7d0
Add DecoupledWeightDecay (#16427)
7 years ago
whs 73c4f2b7b6
Fix distillation for soft label. (#16538)
7 years ago
Zhen Wang 4c1ec41de3
Merge pull request #16531 from wanghaoshuang/quan_ck
7 years ago
AIFollowers e18ab78f67 add model_stat.py (#16512)
7 years ago
wanghaoshuang d41b623a72 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into quan_ck
7 years ago
wanghaoshuang 6db7c2a500 Fix checkpoint of quantization.
7 years ago
Zhen Wang f86429dbd9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_slim_quant_bugs
7 years ago
whs 679a4c28fc
Fix lost of learning rate variable in distillatoin when using lr decay. (#16471)
7 years ago
Zhen Wang 6b854f3e1f fix the save_in_nodes bug.
7 years ago
Zhen Wang 183bacebe3 clean codes and fix some bugs. test=develop
7 years ago
Tao Luo 1b4e4e7ef7
Merge pull request #16453 from chuanqi129/calibration_readme_refine
7 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
7 years ago
chuanqiw c512516ff4 Update INT8 calibration README
7 years ago
Zhen Wang 27d05203e7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_quan_hang
7 years ago
Zhen Wang 1c11f817e9 Use the resolve hazard method.
7 years ago
Wang, Chuanqi 85e1cc1e02 Update Readme with new accuracy and performance data measured on 6271 (#16437)
7 years ago
whs e9bec9369b
[slim] Add quantization strategy and distillation strategy. (#16408)
7 years ago
Zhen Wang 2ccbfd5e10 Fix some bugs for quantization passes.
7 years ago
liuwei1031 de3b70a101
fix cdn issue, test=develop (#16423)
7 years ago
chuanqiw 431068c9ca Enhance test calibration script on accuracy assert
7 years ago
whs 2e5831f0dc
[slim] Refine framework of slim and add filter pruning strategy (#16226)
7 years ago
Zhen Wang ec11135d54
Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass
7 years ago
Zhen Wang 8965819fbb rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
7 years ago
Zhen Wang ec88b6cc5a add channel wise quantization in ir pass.
7 years ago
tangwei12 8ea4218ce1
update load persistables for increment, test=develop (#15576)
7 years ago
achao2013 81b4fad8b9 add moving average absmax op and fix bug (#15155)
7 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
7 years ago
Zhen Wang b8d1f5038e Add the executor test for the graph clone API. test=develop
7 years ago
Zhen Wang 4ab0607634 fix NaN bugs.
7 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
7 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
7 years ago
Zhen Wang 3cf0ee414d update some details. test=develop
7 years ago
Zhen Wang 0cb50bb983 avoid ce fails on windows.
7 years ago
Zhen Wang 7c8f7df2fe add some op_des funs to IrOpNode and add some var_des funs to IrVarNode. test=develop
7 years ago
Zhen Wang 33f99d6197 add IrNode&IrVarNode&IrOpNode. test=develop
7 years ago
Zhang, Guoming e5d3d7c63d resolve #15724
7 years ago
Zhen Wang bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
7 years ago
sneaxiy 7e399b0628 rename
7 years ago
sneaxiy 42f6d0f899 modify API.spec
7 years ago
sneaxiy f85245b409 test=develop
7 years ago
WangZhen 28dfad5e27 fix some bugs about python3. test=develop
7 years ago
WangZhen 2175292634 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
7 years ago
WangZhen a7efab7ec1 add comments for public API. test=develop
7 years ago
Haihao Shen b5ebca47a3 Add INT8 calibration README (#15548)
7 years ago
Haihao Shen 1b8047b712 Add INT8 calibration support in Paddle package (#15569)
7 years ago
Haihao Shen 294d594450 Enable performance measurement in INT8 calibration unit test (#15560)
7 years ago
WangZhen 0db41a9c44 add op_role attr when creating op node.
7 years ago
WangZhen c67b29c178 fix some bugs of graph.to_program and get_pass.
7 years ago
Haihao Shen f82515800c Enable INT8 Calibration Unit Test for MobileNet-V1 (#15539)
7 years ago
WangZhen c64f22048a add convert_to_int8 pass and transform_for_mobile pass and their UTs.
7 years ago
WangZhen c8095eeb82 add freeze pass, and UT is passed.
7 years ago
Haihao Shen 2a82c5651e Refine INT8 calibration API; shorten the iteration number to reduce test time; test=develop
7 years ago
乔龙飞 Qiao Longfei c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
7 years ago
WangZhen dde19a0ff8 add quantization freeze pass.
7 years ago
guomingz dbdaf15ca0 [V1.3] Add the calibration tool code for int8 inference and focus test. (#15062)
7 years ago
Qiao Longfei 119a3d4deb update comment test=develop
7 years ago
WangZhen 3b668c1574 Update some comments in the quantization transform pass. test=develop
7 years ago
WangZhen b913463e83 Update according to the reviewers' suggestion. test=develop
7 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
7 years ago
WangZhen 59e5cc51d6 Add quantization transform pass and UT.
7 years ago
Qiao Longfei cf0a057981 add document for ctr reader
7 years ago
Qiao Longfei 45578c1b48 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
7 years ago
WangZhen e2ff300b02 add UT for quantization.
7 years ago
WangZhen 451896fce4 init quantization.
7 years ago
Qiao Longfei edad60e612 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
whs 938705745e
Init paddle slim (#14834)
7 years ago
tangwei12 45dd3491db
lookup table utils fix (#14730)
7 years ago
heqiaozhi caa6b59677 add hdfs_utils & helper & node doc
7 years ago
heqiaozhi 7bd16e3afa fix some bug & add log
7 years ago