Increasing Policy Network Size Does Not Guarantee Better Performance in Deep Reinforcement Learning