Recommendation of deep reinforcement learning based on value function considering error reduction. — SciRadar