LOLNerf: Learn from One Lookβ, we explore the ability to learn a high-quality representation from just a single 2-D image. It can understand common 3D objects, like cars and human faces, from just one image – a game changer for object recognition.
MULTIMODAL AI
-
LOLNeRF Revolutionizes Computer Vision With Multi-Angle Capture
By
–
Say goodbye to the hassle of capturing multiple angles and hello to the future of computer vision.
— Shubham Saboo (@Saboo_Shubham_) 27 janvier 2023
LOLNeRF is here to revolutionize the way we see the world.
(A thread) ππ§΅ pic.twitter.com/nHRDNyMuvNSay goodbye to the hassle of capturing multiple angles and hello to the future of computer vision. LOLNeRF is here to revolutionize the way we see the world. (A thread)
-
Generative AI Impact: Images vs Text Which Will Dominate
By
–
Which type of Generative AI do you think will have a bigger commercial impact: Generating images (e.g., diffusion algorithms, stable diffusion) or text (e.g., LLMs, ChatGPT)?
-
Generate Images Using Diffusion Models and Stable Diffusion
By
–
You can generate any image you can imagine. Just say the words. Learn about how to use Diffusion based models for Image Generation. https://
learnopencv.com/image-generati
on-using-diffusion-models/
β¦ #stablediffusion #imagegeneration #diffusionmodels #computervision #ai #deeplearning #machinelearning -
Eager to Explore and Experiment with DeepFloydIF
By
–
i want to play with @DeepFloydIF so badly, let us at it!!!
-
Data2vec 2.0: 16x Faster Self-Supervised Learning Across Modalities
By
–
Data2vec 2.0 can train self-supervised speech, vision & text models up to 16x faster than the most popular existing algorithm for images β achieving the same accuracy. Read more & get the open source code https://
bit.ly/3H9mPf7 -
Groq and Maxeler Tech Demos: AI, HPC, Speech and Music
By
–
Check out demos on our YouTube #SC22 playlist to see @GroqInc and @MaxelerTech capabilities in speech transcriptions, #AI music generation, computational fluid dynamics, converged #HPC, and more! https://
groq.link/sc22demos -
Diffusion Models: The New AI Image Generation Buzz
By
–
πBLOG-TASTIC TUESDAYSπ
— Satya Mallick (@LearnOpenCV) 24 janvier 2023
Diffusion models are the new buzz in the world of AI-based image generation. Learn how they work and the different types of diffusion models for image generation in our latest article.
β‘οΈ https://t.co/wcPfq7tgwQ #stablediffusion #computervision #ai pic.twitter.com/pnRpAQHHm6BLOG-TASTIC TUESDAYS
Diffusion models are the new buzz in the world of AI-based image generation. Learn how they work and the different types of diffusion models for image generation in our latest article. https://
learnopencv.com/image-generati
on-using-diffusion-models/
β¦ #stablediffusion #computervision #ai -
OpenAI Prepares Text-to-Video Model Launch
By
–
After ChatGPT (text-to-text) and DALL.E (text-to-image), get ready for the new text-to-video model by OpenAI It is a legitimate research project at OpenAI and could launch much sooner than anticipated!!
-
Google Research Responds to AI Criticism with 2022 Achievements Overview
By
–
Google PR counterreaction begins, from Jeff Dean, reminding everyone of the extensive contributions from Google Research across LLMs, ViTs, Multimodal, and Generative text/audio/video: https://
ai.googleblog.com/2023/01/google
-research-2022-beyond-language.html?m=1
β¦