StaTeS-SQL: Soft Q Learning with State-Dependent Temperature Scheduling