SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

SmolVLA is a compact, open-source VLA model built for low-cost training and real-world deployment on consumer hardware, enabling efficient language-driven robot control without sacrificing performance.
