Neural networks : the official journal of the International Neural Network Society

Enhancing end-to-end speech translation via multi-stage knowledge distillation.

Yue Zhou, Yuxuan Yuan, Yanyan Feng, Xiaodong Shi

Published: 202510.1016/j.neunet.2025.108444

Abstract

Knowledge distillation (KD) using machine translation (MT) teacher models has demonstrated effectiveness in enhancing end-to-end speech-to-text translation (ST). However, existing KD methods for ST primarily rely on the teacher's output distributions…

Preview only. Read the full abstract at the source

View at DOI