IEEE Transactions on Image Processing: a publication of the IEEE Signal Processing Society
Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning.
Hongsheng Zhang, Zhong Ji, Jingren Liu, Yanwei Pang, Jungong Han
Published: 2026
DOI: 10.1109/TIP.2026.3652014
Abstract
Vision Language Models (VLMs), pre-trained on large-scale image-text datasets, enable zero-shot predictions for unseen data but may underperform on specific unseen tasks. Continual learning (CL) can help VLMs effectively adapt to new data distributio…