@reach_vb - AI Dynamics - Page 53 of 98

Meta dropped a first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks 🔥

Available on Hugging Face 🤗

pic.twitter.com/POA5piqEnC
— Vaibhav (VB) Srivastav (@reach_vb) 14 décembre 2024

Meta dropped a first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks Available on Hugging Face

→ View original post on X — @reach_vb

15 December 2024

MoEs for Production Usage in AI Systems

By

@reach_vb

–

13 December 2024 14h18

Yeah! But that’s okay no? I see MoEs more for actual production usage.

→ View original post on X — @reach_vb

13 December 2024

DeepSeek-VL2: Enhanced Multimodal Model with Dynamic Tiling

By

@reach_vb

–

13 December 2024 13h48

Improvements from v1: > 2x high-quality training data vs DeepSeek-VL1
> Dynamic image tiling for flexible resolutions + efficient DeepSeek-MoE for LM
> 3-stage training + new multi-modal parallel strategies for efficiency

→ View original post on X — @reach_vb

13 December 2024

DeepSeek-VL2 Achieves SoTA Vision Performance with Fewer Parameters

By

@reach_vb

–

13 December 2024 13h41

The whale strikes again! DeepSeekVL 2 > DeepSeek-VL2-Tiny, DeepSeek-VL2-Small, and DeepSeek-VL2, with 1.0B, 2.8B, and 4.5B activated parameters
> SoTA perf with similar or fewer activated parameters compared to Qwen 2 VL
> Excels at visual question answering, optical

→ View original post on X — @reach_vb

13 December 2024