F-VLM, a simple and scalable open-vocabulary detection method that is built upon frozen vision and language models, reduces the training complexity for open-vocabulary detectors and expands detection to novel objects. Learn more and check out the code → https://
goo.gle/3O6Ih9Y
F-VLM: Open-Vocabulary Detection with Frozen Vision Language Models
By
–
Leave a Reply