Skip to content

AI Dynamics

Global AI News Aggregator

Rechercher

@_akhaliq

Reddit thread about not expecting too much from AI

By

@_akhaliq

–

10 June 2023 23h44

Reddit thread: https://
reddit.com/r/ProgrammerHu
mor/comments/145z7g6/do_not_pretend_too_much_from_ai/
…

→ View original post on X — @_akhaliq

10 June 2023
Trending AI News Stories and Papers

By

@_akhaliq

–

09 June 2023 23h38

Trending AI news stories + papers https://
open.substack.com/pub/akhaliq/p/
trending-ai-news-stories-papers-486
…

→ View original post on X — @_akhaliq

9 June 2023
Nvidia releases ATT3D for text-to-3D object synthesis

By

@_akhaliq

–

09 June 2023 19h02

Nvidia just released ATT3D: Amortized Text-To-3D Object Synthesis

project page: https://t.co/3yAK3Hh4II

Text-to-3D modeling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently… pic.twitter.com/tOvP31EFS0
— AK (@_akhaliq) 9 juin 2023

Nvidia just released ATT3D: Amortized Text-To-3D Object Synthesis project page: https://
research.nvidia.com/labs/toronto-a
i/ATT3D/
… Text-to-3D modeling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently

→ View original post on X — @_akhaliq

9 June 2023
Meta Releases MusicGen: Controllable Music Generation Model

By

@_akhaliq

–

09 June 2023 16h25

Meta just released MusicGen, a simple and controllable model for music generation

MusicGen is a single stage auto-regressive Transformer model trained over a 32kHz EnCodec tokenizer with 4 codebooks sampled at 50 Hz. Unlike existing methods like MusicLM, MusicGen doesn't not… pic.twitter.com/kFCOrAmLSh
— AK (@_akhaliq) 9 juin 2023

Meta just released MusicGen, a simple and controllable model for music generation MusicGen is a single stage auto-regressive Transformer model trained over a 32kHz EnCodec tokenizer with 4 codebooks sampled at 50 Hz. Unlike existing methods like MusicLM, MusicGen doesn't not

→ View original post on X — @_akhaliq

9 June 2023
Background Prompting for Improved Object Depth Estimation

By

@_akhaliq

–

09 June 2023 8h18

Background Prompting for Improved Object Depth paper page: https://
huggingface.co/papers/2306.05
428
… Estimating the depth of objects from a single image is a valuable task for many vision, robotics, and graphics applications. However, current methods often fail to produce accurate depth for

→ View original post on X — @_akhaliq

9 June 2023
Test-Time Optimization for Dense Motion Tracking in Videos

By

@_akhaliq

–

09 June 2023 8h13

Tracking Everything Everywhere All at Once

paper page: https://t.co/UwLxRYPGvb

present a new test-time optimization method for estimating dense and long-range motion from a video sequence. Prior optical flow or particle video tracking algorithms typically operate within limited… pic.twitter.com/3ryHUA4c9n
— AK (@_akhaliq) 9 juin 2023

Tracking Everything Everywhere All at Once paper page: https://
huggingface.co/papers/2306.05
422
… present a new test-time optimization method for estimating dense and long-range motion from a video sequence. Prior optical flow or particle video tracking algorithms typically operate within limited

→ View original post on X — @_akhaliq

9 June 2023
Scaling Spherical CNNs: Spectral Domain Convolutions

By

@_akhaliq

–

09 June 2023 8h08

Scaling Spherical CNNs paper page: https://
huggingface.co/papers/2306.05
420
… Spherical CNNs generalize CNNs to functions on the sphere, by using spherical convolutions as the main linear operation. The most accurate and efficient way to compute spherical convolutions is in the spectral domain

→ View original post on X — @_akhaliq

9 June 2023
R-MAE: Regions Meet Masked Autoencoders for Vision Tasks

By

@_akhaliq

–

09 June 2023 8h05

R-MAE: Regions Meet Masked Autoencoders paper page: https://
huggingface.co/papers/2306.05
411
… Vision-specific concepts such as "region" have played a key role in extending general machine learning frameworks to tasks like object detection. Given the success of region-based detectors for

→ View original post on X — @_akhaliq

9 June 2023
LU-NeRF: Scene and Pose Estimation Using Local Unposed NeRFs

By

@_akhaliq

–

09 June 2023 8h02

LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs paper page: https://
huggingface.co/papers/2306.05
410
… A critical obstacle preventing NeRF models from being deployed broadly in the wild is their reliance on accurate camera poses. Consequently, there is growing interest in

→ View original post on X — @_akhaliq

9 June 2023
Grounded Text-to-Image Synthesis with Attention Refocusing

By

@_akhaliq

–

09 June 2023 6h08

Grounded Text-to-Image Synthesis with Attention Refocusing

paper page: https://t.co/3DfgBmfB2I

Driven by scalable diffusion models trained on large-scale paired text-image datasets, text-to-image synthesis methods have shown compelling results. However, these models still fail… pic.twitter.com/nQRFzGoSLj
— AK (@_akhaliq) 9 juin 2023

Grounded Text-to-Image Synthesis with Attention Refocusing paper page: https://
huggingface.co/papers/2306.05
427
… Driven by scalable diffusion models trained on large-scale paired text-image datasets, text-to-image synthesis methods have shown compelling results. However, these models still fail

→ View original post on X — @_akhaliq

9 June 2023

←Previous Page

1 … 123 124 125 126 127 … 138

INNOVATION GENERATIVE AI RESEARCH LLMS TOOLS MACHINE LEARNING CODE MARKET TRENDS BUSINESS TECHNOLOGY BIG TECH ETHICS ENTERPRISE AI SOFTWARE AGENTS APPS COMPUTING AUTOMATION DATA POLICY OPEN SOURCE CULTURE MULTIMODAL AI REGULATION CREATIVE AI PROMPT ENGINEERING ECONOMY SOCIETY INVESTMENT EDUCATION SAFETY AI HARDWARE AGI HARDWARE JOBS STARTUPS INDUSTRY ROBOTICS WORKFORCE SECURITY CYBERSECURITY HEALTHCARE AI SYSTEMS SUSTAINABILITY WEB3 DECENTRALIZED AI

AI Dynamics

Global AI News Aggregator

About
Archives

Rechercher