High-density Visual Crowd Counting With Perspective Understanding in Deep Neural Networks