Special thanks to the NLP community for the recognition. This was my earliest work in "large" language model at the time , quoted from the paper "Our stacking LSTM models have 4 layers, each with 1000 cells, and 1000-dimensional embeddings" .
Early LSTM Language Model Architecture Recognition from NLP Community
By
–