IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning.

Hongsheng Zhang, Zhong Ji, Jingren Liu, Yanwei Pang, Jungong Han

Published: 2026

DOI: 10.1109/TIP.2026.3652014

Abstract

Vision Language Models (VLMs), pre-trained on large-scale image-text datasets, enable zero-shot predictions for unseen data but may underperform on specific unseen tasks. Continual learning (CL) can help VLMs effectively adapt to new data distributio…
