AI Dynamics

Global AI News Aggregator

MULTIMODAL AI

Comparative Visualization for AI Image Generation Solutions

By

@saboo_shubham_

–

03 January 2023 11h06

Will be good to prepare a comparative visualization for all the AI image generation solutions out there! Will start working on it soon!!

→ View original post on X — @saboo_shubham_,

3 January 2023
Limitations of bulk conversion services for short-form video content

By

AI Dynamics

–

02 January 2023 3h33

these services might be suitable for bulk conversion tasks (e.g. an audiobook or other long-form stuff) but for smaller tasks like videos (ads, voiceover, etc) they are still (even the best ones) lacking in their tone, cadence, and emote

→ View original post on X — @alexalbert__,

2 January 2023
2023 Tech Wishes: AI, Learning, Video, Open Models, Robotics

By

@reza_zadeh

–

31 December 2022 23h18

In 2023, may your: – Generative AI produce beautiful images & text
– Active Learning framework ask the right questions – generations be as beautiful as image
– Model be OSS without censors
– Robotics simulation be faithful to real world Happy New Year everyone!

→ View original post on X — @reza_zadeh,

31 December 2022
Advances in Surgical AI: Skill Assessment and Patient Outcome Prediction

By

AI Dynamics

–

31 December 2022 22h56

We made strides in surgical #AI which involves assessing the skill of surgeons, predicting patient outcomes, and discovering novel surgeon biomarkers based on multi-modal data and deep learning algorithms. @AjhungMD gives an excellent overview here

→ View original post on X — @animaanandkumar,

31 December 2022
Robust Vision Transformer Architecture Wins Semantic Segmentation Challenge

By

AI Dynamics

–

31 December 2022 22h51

We also developed robust vision transformer architecture, fully attention networks (FAN), with channel-based attention for robustness. We won the Semantic Segmentation Tracking of Robust Vision Challenge at ECCV. https://
arxiv.org/abs/2210.12852

→ View original post on X — @animaanandkumar,

31 December 2022
Deep Dive into Stable Diffusion Technology

By

AI Dynamics

–

25 December 2022 17h16

Diving deep into stable diffusion

→ View original post on X — @akshay_pachaar,

25 December 2022
SD 2.1 Model Served by HuggingFace

By

AI Dynamics

–

25 December 2022 0h29

It’s the same SD 2.1 model that HuggingFace serves

→ View original post on X — @hardmaru,

25 December 2022
Reasoning in Visual Perception: Distinguishing Squares from Circles

By

AI Dynamics

–

24 December 2022 11h12

The former sounds perhaps stranger, so here's an example. Let's say you have to tell whether a given image contains a square or a circle — a canonical perception problem. Sounds easy enough if you have a well-trained visual system, right? How would reasoning come into play?

→ View original post on X — @fchollet,

24 December 2022
Largest Text-Molecule Model Enables ChatGPT-like Molecule Retrieval and Editing

By

AI Dynamics

–

22 December 2022 18h51

We build the largest Text-molecule model that does not rely only on aligned training pairs. Now you can retrieve and edit molecules based on text prompts. This will pave the way for #ChatGPT for #molecules https://
chao1224.github.io/MoleculeSTM @nvidia @Mila_Quebec @Caltech

→ View original post on X — @animaanandkumar,

22 December 2022
ImageNetX: Identifying Vision System Failures at Scale

By

AI Dynamics

–

21 December 2022 18h14

Even today’s best #deeplearning vision systems can fail when pose/lighting/background vary. Our work on ImageNetX is one of the first large scale efforts to pinpoint mistake types of in AI computer vision systems. Explore the dataset

→ View original post on X — @aiatmeta,

21 December 2022