High-dimensional continuous action space control via trust region optimized deep reinforcement learning.