Directly Optimizing Evaluation Metrics To Improve Text To Motion