Oh yeah, Bag-of-Words models are more favorite simple baseline for text classification. In my IMDB experiments, BoW ranked ~10% above the Gzip method (but 6% below LLM embeddings)
By
–
Oh yeah, Bag-of-Words models are more favorite simple baseline for text classification. In my IMDB experiments, BoW ranked ~10% above the Gzip method (but 6% below LLM embeddings)