Commit Graph

274 Commits (abb49df26fe32e2cd73d1c709480f58412c810ae)

Author SHA1 Message Date
Bai Yifan e2c6bada36 Support dispensable student_loss in PaddleSlim distillation (#19824)
6 years ago
Bai Yifan aa63d5ac6d Make the distillation process not save teacher variables in PaddleSlim (#19633)
6 years ago
gongweibao 6c2bc29cc0
Fix float16 optimizer. (#19682)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
Jie Fang c6a598a276 init new amp, optimize inserting cast op for batchnorm (#18596)
6 years ago
123malin 2f037c3189
fix the diff between async mode and async_half mode (#19535)
6 years ago
whs 1c2aae567a
Skip start epoch and end epoch when dumping strategy in PaddleSlim (#19580)
6 years ago
gongweibao abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
6 years ago
Zeng Jinle 5dce1da680 remove reset recordio usage (#19519)
6 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
6 years ago
Leo Chen 6fb310ae29 Fix bug of getting bool Flags from os.environ (#19349)
6 years ago
liu zhengxi 32598ffd8f
Python infer api update and add unit test (#19353)
6 years ago
Jiancheng Li 1799c257ad Update Light-NAS to support latency-aware search (#19050)
6 years ago
Zhen Wang 0fe72469ea
Add the max-pool2d quantization support and the partial quantization support. (#19310)
6 years ago
Tao Luo 2f8c7e021f
remove unused inference_transpiler unit-tests (#19130)
6 years ago
yaoxuefeng 9150cf50fc
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
6 years ago
chengduo 01c7daade7
Add checking for the fetch_list of Executor.run (#18957)
6 years ago
whs c92b78b060
Fix unitest of light nas. (#18931)
6 years ago
tangwei12 d845848341
do some odd jobs (#18641)
6 years ago
guru4elephant 5ed713d519
remove ctr reader, all functions are satisfied in dataset (#18672)
6 years ago
bingyanghuang a25be53cb5 QAT int8 MKL-DNN transformation pass with MUL (#18322)
6 years ago
Jie Fang 2b4ef509ea init custom black white list (#18377)
6 years ago
Wojciech Uss 8ed819d8e4 Call the test_slim_int8_* tests through absolute path (#18386)
6 years ago
翟飞跃 19da59ed3f Remove all the code, API and doc of MKL-DNN INT8v1 (#18347)
6 years ago
whs 1bdfd2eb85
Fix checkpoint of Light-NAS (#18330)
6 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
6 years ago
Jie Fang 172c2facef init black/white lists (#17847)
6 years ago
songhao 6b3d96254d fix some bug when merge sparse embedding parameters, test=develop (#18223)
6 years ago
Wojciech Uss 36b60e240a Enable MKL-DNN for slim FP32 vs. INT8 tests (#18214)
6 years ago
bingyanghuang 976cf46010 slim QAT MKL-DNN readme (#18187)
6 years ago
chengduo 5588b923f3
Add multi process reader (#18115)
6 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
6 years ago
翟飞跃 78441c5449 add mkldnn Int8v2 slim doc (#17909)
6 years ago
翟飞跃 c2fb9b906a change dirname mobilenet to mobilenetv1 (#18160)
6 years ago
bingyanghuang 5cf7e7415d update mkldnn int8v2 doc (#18177)
6 years ago
Wojciech Uss ca5642c850 unify FP32 vs. INT8 comparison tests output (#18111)
6 years ago
Sylwester Fraczek accb132f0f fix slim int8 mkldnn multithreading issue (#18009)
6 years ago
chengduo b5a1c1463d
Update CPU_NUM config (#18059)
6 years ago
Wojciech Uss 78e932862c Added unit test for QAT FP32 & INT8 comparison (#17814)
6 years ago
Huihuang Zheng 0bf2535158
Cherry-pick: fix random CI failure. (#18011)
6 years ago
Kaipeng Deng 96ee528e3e
fix logging basicConfig cannot be setting after import paddle (#17786)
6 years ago
whs 82bfa0a5ba
Enhence auto pruning and fix save/load graph when using py_reader
6 years ago
bingyanghuang 90ebce9ead QAT int8 MKL-DNN transformation pass (#17819)
6 years ago
翟飞跃 209a3f4e09 add log into slim mkldnn int8 test (#17926)
6 years ago
whs 5df65e506d
Add Ligth-NAS for PaddleSlim (#17679)
6 years ago
翟飞跃 993c703bcc INT8 MKL-DNN v2 integrate to slim (#17634)
6 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
6 years ago
chengduo 9322216170
Add data distributed_sampler (#17573)
6 years ago
Zhen Wang 3398f99608
Adding AddQuantDequantPass for TensorRT int8 (#17529)
6 years ago
Yibing Liu f9796b1249
Add LAMB Optimizer support (#17489)
6 years ago
Jie Fang 30e178fa2c init auto loss scaling (#17194)
6 years ago
guomingz 6d1d7c8acf Fix the KL algorithm bug when calculated the size of tensor. (#17198)
6 years ago
Zhen Wang 65541d83b0
add scale pass for calculating the output scales.test=develop (#17259)
6 years ago
Zhen Wang a40121e4c8
fix the initialization process error. test=develop (#17213)
6 years ago
wopeizl 06325c4de1
disable the test_distillation_strategy temporarily (#17227)
6 years ago
guomingz e4a52e082f resolve #17159 (#17172)
6 years ago
Yibing Liu beda78258f
Init mixed precision training interface (#16856)
6 years ago
Zhen Wang b8c166f632
Merge pull request #17029 from wzzju/add_graph_checkpoint
6 years ago
Zhen Wang bf6b9d6d37 add checkpoint functions for graph. test=develop
6 years ago
Wang, Chuanqi 45ebafc33e Split test_calibration test to two tests (#16977)
6 years ago
whs 486f7d8ed6
Restore quantization and distillation stategy before loading persistables. (#16958)
6 years ago
Tao Luo bc037c13c7 use multi-thread to speedup CI tests
6 years ago
chuanqiw 1818d1d9f0 Reduce test_calibration test time by using large batchsize
6 years ago
AIFollowers fb1ae722d1 Fix model_stat importing in paddle/fluid/contrib
6 years ago
chengduo bb80dae7d0
Add DecoupledWeightDecay (#16427)
6 years ago
whs 73c4f2b7b6
Fix distillation for soft label. (#16538)
6 years ago
Zhen Wang 4c1ec41de3
Merge pull request #16531 from wanghaoshuang/quan_ck
6 years ago
AIFollowers e18ab78f67 add model_stat.py (#16512)
6 years ago
wanghaoshuang d41b623a72 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into quan_ck
6 years ago
wanghaoshuang 6db7c2a500 Fix checkpoint of quantization.
6 years ago
Zhen Wang f86429dbd9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_slim_quant_bugs
6 years ago
whs 679a4c28fc
Fix lost of learning rate variable in distillatoin when using lr decay. (#16471)
6 years ago
Zhen Wang 6b854f3e1f fix the save_in_nodes bug.
6 years ago
Zhen Wang 183bacebe3 clean codes and fix some bugs. test=develop
6 years ago
Tao Luo 1b4e4e7ef7
Merge pull request #16453 from chuanqi129/calibration_readme_refine
6 years ago
liuwei1031 8d22bc17a4
Memory optimize (#16410)
6 years ago
chuanqiw c512516ff4 Update INT8 calibration README
6 years ago
Zhen Wang 27d05203e7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_quan_hang
6 years ago
Zhen Wang 1c11f817e9 Use the resolve hazard method.
6 years ago
Wang, Chuanqi 85e1cc1e02 Update Readme with new accuracy and performance data measured on 6271 (#16437)
6 years ago
whs e9bec9369b
[slim] Add quantization strategy and distillation strategy. (#16408)
6 years ago
Zhen Wang 2ccbfd5e10 Fix some bugs for quantization passes.
6 years ago
liuwei1031 de3b70a101
fix cdn issue, test=develop (#16423)
6 years ago
chuanqiw 431068c9ca Enhance test calibration script on accuracy assert
6 years ago
whs 2e5831f0dc
[slim] Refine framework of slim and add filter pruning strategy (#16226)
6 years ago
Zhen Wang ec11135d54
Merge pull request #16341 from wzzju/add_channel_wise_in_quant_pass
6 years ago
Zhen Wang 8965819fbb rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop
6 years ago
Zhen Wang ec88b6cc5a add channel wise quantization in ir pass.
6 years ago
tangwei12 8ea4218ce1
update load persistables for increment, test=develop (#15576)
6 years ago
achao2013 81b4fad8b9 add moving average absmax op and fix bug (#15155)
6 years ago
Zhen Wang 41b8cf0bae
Merge pull request #16162 from wzzju/fix_nan_static_quant
6 years ago
Zhen Wang b8d1f5038e Add the executor test for the graph clone API. test=develop
6 years ago
Zhen Wang 4ab0607634 fix NaN bugs.
6 years ago
Zhen Wang ac6ef06ffa Add the Clone method in Graph. test=develop
6 years ago
Zhen Wang 1b9c8d5f06 add clone function for IrGraph. test=develop
6 years ago
Zhen Wang 3cf0ee414d update some details. test=develop
6 years ago
Zhen Wang 0cb50bb983 avoid ce fails on windows.
6 years ago
Zhen Wang 7c8f7df2fe add some op_des funs to IrOpNode and add some var_des funs to IrVarNode. test=develop
6 years ago
Zhen Wang 33f99d6197 add IrNode&IrVarNode&IrOpNode. test=develop
6 years ago
Zhang, Guoming e5d3d7c63d resolve #15724
6 years ago
Zhen Wang bc95a4ccfe
Merge branch 'develop' into quantization_inference_passes
6 years ago
sneaxiy 7e399b0628 rename
6 years ago
sneaxiy 42f6d0f899 modify API.spec
6 years ago
sneaxiy f85245b409 test=develop
6 years ago
WangZhen 28dfad5e27 fix some bugs about python3. test=develop
6 years ago
WangZhen 2175292634 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into quantization_inference_passes
6 years ago
WangZhen a7efab7ec1 add comments for public API. test=develop
6 years ago
Haihao Shen b5ebca47a3 Add INT8 calibration README (#15548)
6 years ago
Haihao Shen 1b8047b712 Add INT8 calibration support in Paddle package (#15569)
6 years ago
Haihao Shen 294d594450 Enable performance measurement in INT8 calibration unit test (#15560)
6 years ago
WangZhen 0db41a9c44 add op_role attr when creating op node.
6 years ago
WangZhen c67b29c178 fix some bugs of graph.to_program and get_pass.
6 years ago
Haihao Shen f82515800c Enable INT8 Calibration Unit Test for MobileNet-V1 (#15539)
6 years ago
WangZhen c64f22048a add convert_to_int8 pass and transform_for_mobile pass and their UTs.
6 years ago
WangZhen c8095eeb82 add freeze pass, and UT is passed.
6 years ago
Haihao Shen 2a82c5651e Refine INT8 calibration API; shorten the iteration number to reduce test time; test=develop
6 years ago
乔龙飞 Qiao Longfei c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
6 years ago
WangZhen dde19a0ff8 add quantization freeze pass.
6 years ago
guomingz dbdaf15ca0 [V1.3] Add the calibration tool code for int8 inference and focus test. (#15062)
6 years ago
Qiao Longfei 119a3d4deb update comment test=develop
6 years ago
WangZhen 3b668c1574 Update some comments in the quantization transform pass. test=develop
6 years ago
WangZhen b913463e83 Update according to the reviewers' suggestion. test=develop
6 years ago
WangZhen 3ce6172052 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into graph_quantization
6 years ago
WangZhen 59e5cc51d6 Add quantization transform pass and UT.
6 years ago
Qiao Longfei cf0a057981 add document for ctr reader
6 years ago
Qiao Longfei 45578c1b48 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
WangZhen e2ff300b02 add UT for quantization.
7 years ago
WangZhen 451896fce4 init quantization.
7 years ago
Qiao Longfei edad60e612 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
whs 938705745e
Init paddle slim (#14834)
7 years ago
tangwei12 45dd3491db
lookup table utils fix (#14730)
7 years ago
heqiaozhi caa6b59677 add hdfs_utils & helper & node doc
7 years ago
heqiaozhi 7bd16e3afa fix some bug & add log
7 years ago
heqiaozhi 5d3ecbfdf5 fix hdfs bug
7 years ago
heqiaozhi 009c7cf6cc add finialize
7 years ago
heqiaozhi 57ac412b98 download data
7 years ago
Qiao Longfei daba57f752 complete ctr_reader
7 years ago
Qiao Longfei 978fd6800c update ctr_reader.py
7 years ago
Qiao Longfei 668ae9083e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
7 years ago
tangwei12 3639d99f99
Fix save and load lookup table/optimizer vars (#14301)
7 years ago
tangwei12 dd6fd4c747
Utils for download and upload files with HDFS (#14473)
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
peizhilin 1f12ba6192 gpu support, fix build issue:
7 years ago
Qiao Longfei aff54ef735 add ctr data
7 years ago
Dang Qingqing f63ab5619b Fix conflict.
7 years ago
Dang Qingqing 8f5d918a19 Disable one test in test_quantize_transpiler.
7 years ago
qingqing01 748be49e77
Fix random fail in Python3 (#13666)
7 years ago
Dang Qingqing 209f799f91 test,
7 years ago
qingqing01 479ad4bb92
Merge branch 'develop' into quantize_transpiler_update
7 years ago