Neural networks : the official journal of the International Neural Network Society

Bitscaling: Streamlining neural network compression via predictive multi-scale growth of mixed-precision networks.

Yuehao Li, Haifang Jian, Hongchang Wang, Linghe Zhang, Yuhao Liu, Wu Liu

Published: 202510.1016/j.neunet.2025.108327

Abstract

Jointly optimizing model scale and quantization for neural networks achieves significantly higher compression ratios than either technique alone. However, the resulting combinatorial search space renders most existing methods impractical in real-worl…

Preview only. Read the full abstract at the source

View at DOI