LFM2-ColBERT-350M is also very fast! Its inference speed is on par with GTE-ModernColBERT-v1 (only 150M parameters) for query and document encoding across various batch sizes.
LFM2-ColBERT-350M Achieves Competitive Inference Speed
By
–

By
–

LFM2-ColBERT-350M is also very fast! Its inference speed is on par with GTE-ModernColBERT-v1 (only 150M parameters) for query and document encoding across various batch sizes.