Machine Learning Algorithm and System Co-Design for Hardware Efficiency