Model-free optical processors using in situ reinforcement learning with proximal policy optimization.