Statistical language modelling for large vocabulary speech recognition