Resizing the embedding table for fine-tuning is pretty awkward. We'd rather have some way of making sure that novel words can get a unique representation, even from a fixed-size embedding table. The hashing trick achieves this.
Hashing Trick for Fixed-Size Embedding Tables in Fine-Tuning
By
–
Leave a Reply