Neural networks : the official journal of the International Neural Network Society
Bitscaling: Streamlining neural network compression via predictive multi-scale growth of mixed-precision networks.
Yuehao Li, Haifang Jian, Hongchang Wang, Linghe Zhang, Yuhao Liu, Wu Liu
Published: 202510.1016/j.neunet.2025.108327
Abstract
Jointly optimizing model scale and quantization for neural networks achieves significantly higher compression ratios than either technique alone. However, the resulting combinatorial search space renders most existing methods impractical in real-worl…
Preview only. Read the full abstract at the source