This is kind of weird, if you think about it. Taken together, the unknown words will be more frequent than the 9998th most frequent term, yet they all share a single vector. So the fidelity of representation isn't being distributed well. How can we give the unknowns more vectors?
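To make the imbalance concrete, here is a minimal sketch that compares the combined count of out-of-vocabulary tokens with the count of the rarest term that still gets its own vector. The function name `unk_vs_rarest`, the toy corpus, and the vocabulary size of 4 are all assumptions made up for illustration; they are not from the original setup.

```python
from collections import Counter


def unk_vs_rarest(tokens, vocab_size):
    """Return (combined count of out-of-vocabulary tokens,
    count of the rarest term that is still in the vocabulary)."""
    counts = Counter(tokens)
    # Keep the vocab_size most frequent terms; everything else is "unknown".
    kept = counts.most_common(vocab_size)
    kept_total = sum(count for _, count in kept)
    unk_total = sum(counts.values()) - kept_total
    rarest_kept = kept[-1][1] if kept else 0
    return unk_total, rarest_kept


# Toy corpus (hypothetical): four common words plus 30 words seen once each.
tokens = ("the " * 50 + "cat " * 20 + "sat " * 10 + "mat " * 5).split()
tokens += [f"rare{i}" for i in range(30)]

unk_total, rarest_kept = unk_vs_rarest(tokens, vocab_size=4)
print(unk_total, rarest_kept)  # 30 5
```

In this toy case the unknowns occur 30 times in total while the rarest in-vocabulary word occurs only 5 times, yet the unknowns would share one vector and the rare word gets one of its own. Real corpora will give different numbers, but the same comparison applies.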
Improving Vector Representation for Unknown Words in NLP