Small Batch Deep Reinforcement Learning