Caption Assisted Multimodal Large Language Model for Video Moment Retrieval. — SciRadar