作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2026, Vol. 52 ›› Issue (6): 68-79. doi: 10.19678/j.issn.1000-3428.0253191

• 计算智能与模式识别 • 上一篇    下一篇

基于顺序分级与数据偏置修正优化的二阶段超分辨率量化方案

郝亮1,2, 苏博何俊1, 王京华1, 徐勇1,3,*()   

  1. 1. 哈尔滨工业大学(深圳)信息学部, 广东 深圳 518055
    2. 河钢数字技术股份有限公司, 河北 石家庄 050035
    3. 深圳市视觉目标检测与判识重点实验室, 广东 深圳 518055
  • 收稿日期:2025-10-23 修回日期:2026-02-10 出版日期:2026-06-15 发布日期:2026-03-27
  • 通讯作者: 徐勇
  • 作者简介:

    郝亮, 男, 正高级工程师、博士研究生, 主研方向为工业机器视觉

    苏博何俊, 硕士研究生

    王京华, 副教授

    徐勇(通信作者), 教授

  • 基金资助:
    河钢集团重点科技项目(HG2025129); 深圳市技术攻关项目(JSGG20220831104402004)

A Two-Phase Super-Resolution Quantization Scheme Optimized by Sequential Grading and Data Bias Correction

HAO Liang1,2, SU Bohejun1, WANG Jinghua1, XU Yong1,3,*()   

  1. 1. Division of Information Science, Harbin Institute of Technology, Shenzhen, Shenzhen 518055, Guangdong, China
    2. HBIS Digital Technology Co., Ltd., Shijiazhuang 050035, Hebei, China
    3. Shenzhen Key Laboratory of Visual Object Detection and Recognition, Shenzhen 518055, Guangdong, China
  • Received:2025-10-23 Revised:2026-02-10 Online:2026-06-15 Published:2026-03-27
  • Contact: XU Yong

摘要:

模型量化技术通过将高精度浮点数映射到低比特离散空间, 能有效降低模型存储与计算开销。如何合理化地考虑参数分布的特点以构建更好的映射方案是模型量化研究的核心。现有后训练量化(PTQ)方案多默认非激活层数据呈对称钟型分布, 却忽略了激活层与输入引入的微小偏置可能导致的分布不对称性, 进而使量化映射偏向一侧, 引入显著近似损失。本文针对图像超分辨率(SR)量化方案进行研究, 对性能获得公认的二阶段PTQ方案进行改进。将预寻找量化上下界过程中的基于最值的均等分配改进为基于排序的非均匀分配, 并在伪量化过程中引入偏置量, 将数据的一部分及均值进行自适应偏移, 缓解数据偏置引发的估计损失。改进后的方案在效果上全面超越原方案, 并同样具有高压缩比以及加速比: 相比原SwinIR-light模型参数量减少60%以上, 超分辨率过程加速3倍以上。

关键词: 模型量化, 图像超分辨率, 数据偏移, 后训练量化, 排序, 偏置

Abstract:

Model quantization technology effectively reduces model storage and computational overhead by mapping high-precision floating-point data to low-bit discrete spaces. A core focus of model quantization research is how to rationally account for the characteristics of parameter distributions to construct superior mapping schemes. Existing Post-Training Quantization (PTQ) schemes nearly universally assume that the data distribution of non-activation layers follows a symmetric bell-shaped curve, but overlook the fact that small biases introduced by the model's activation layers and inputs induce distributional asymmetry. Consequently, the resulting quantization mapping is skewed to one side due to this subtle asymmetry, leading to significant approximation loss. This paper investigates quantization schemes for image Super-Resolution (SR) and proposes improvements to the widely recognized two-stage PTQ scheme. First, the max-min-based equal partitioning employed in the pre-search for quantization bounds is modified to a sorting-based non-uniform partitioning approach. Second, a bias term is introduced during the pseudo-quantization process, where a portion of the data and its mean are adaptively shifted to mitigate estimation loss caused by data bias. The improved scheme outperforms the original counterpart across all performance metrics while maintaining comparable high compression ratio and acceleration ratio—compared to the original SwinIR-light model, it reduces the parameter count by more than 60% and accelerates the SR process by more than 3 times.

Key words: model quantization, image Super-Resolution (SR), data bias, Post-Training Quantization (PTQ), sorting, bias