Neural networks : the official journal of the International Neural Network Society
Enhancing end-to-end speech translation via multi-stage knowledge distillation.
Yue Zhou, Yuxuan Yuan, Yanyan Feng, Xiaodong Shi
Published: 202510.1016/j.neunet.2025.108444
Abstract
Knowledge distillation (KD) using machine translation (MT) teacher models has demonstrated effectiveness in enhancing end-to-end speech-to-text translation (ST). However, existing KD methods for ST primarily rely on the teacher's output distributions…
Preview only. Read the full abstract at the source