Yes, that's one of my points.
Text is insufficient.
We need sensory inputs to learn how the world works.
We can estimate the total amount of visual data seen by a 2-year-old: 2 years of waking time, at 12 hours a day, is 2 x 365 x 12 x 3600, or roughly 32 million seconds.
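The arithmetic above can be checked with a minimal sketch. The factors come from the text; reading the factor 12 as waking hours per day is an assumption, and the names `waking_seconds` and `WAKING_HOURS_PER_DAY` are hypothetical, chosen only for illustration.

```python
# Factors taken from the estimate in the text.
YEARS = 2
DAYS_PER_YEAR = 365
WAKING_HOURS_PER_DAY = 12  # assumption: the "12" in the text means waking hours per day
SECONDS_PER_HOUR = 3600


def waking_seconds(years=YEARS, days=DAYS_PER_YEAR, hours=WAKING_HOURS_PER_DAY):
    """Return the number of waking seconds over the given number of years."""
    return years * days * hours * SECONDS_PER_HOUR


total = waking_seconds()
print(f"{total:,} seconds")  # 31,536,000, i.e. roughly 32 million
```

Changing the waking-hours assumption scales the result linearly, so the "roughly 32 million" figure is an order-of-magnitude estimate rather than a precise count.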
We have 2 million optic nerve fibers, carrying
Sensory Data and Multimodal Learning in AI Development