Memory-based quadratic interpolation optimization with reinforcement learning for robust PV parameter estimation. — SciRadar