Trending AI news stories + papers: https://open.substack.com/pub/akhaliq/p/trending-ai-news-stories-papers-74e … (@_akhaliq)
-
Trending AI News Stories and Papers
-
Tab-CoT: Zero-shot Tabular Chain of Thought Framework
Tab-CoT: Zero-shot Tabular Chain of Thought. The paper proposes Tab-CoT, a new chain-of-thought framework that uses a tabular format to carry out complex reasoning in a highly structured manner. Despite its simplicity, the authors show that the approach is capable of performing reasoning…
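To make the idea of a tabular reasoning format concrete, here is a minimal sketch of how a zero-shot prompt could end in a table header that the model is expected to continue row by row. The function name `build_tabular_prompt` and the column names are assumptions chosen for illustration; the summary above does not specify them.

```python
def build_tabular_prompt(question: str, columns: list[str]) -> str:
    """Append a pipe-delimited table header to a question so that a
    language model continues by filling in one reasoning step per row."""
    if not columns:
        raise ValueError("at least one column name is required")
    header = "|" + "|".join(columns) + "|"
    return f"{question.strip()}\n{header}\n"


# Hypothetical column layout, not taken from the summary above.
COLUMNS = ["step", "subquestion", "process", "result"]

prompt = build_tabular_prompt(
    "A shop sells pens at 3 dollars each. How much do 4 pens cost?",
    COLUMNS,
)
print(prompt)
```

The prompt string would then be sent to whichever language model is in use; that call is left out because it depends on the provider.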
-
Photoshop AI Generative Fill Used for Its Intended Purpose
Photoshop AI Generative Fill was used for its intended purpose
-
Photoshop Generative Fill Beta Expands Midjourney Photos
Photoshop Generative Fill Beta used to expand Midjourney photos. pic.twitter.com/kTuC8x4Fvj
AK (@_akhaliq), May 31, 2023
-
Concept Decomposition for Visual Exploration Using Vision-Language Models
Concept Decomposition for Visual Exploration and Inspiration. The paper proposes a method to decompose a visual concept, represented as a set of images, into different visual aspects encoded in a hierarchical tree structure. It uses large vision-language models and their rich latent… pic.twitter.com/J5OduSX7CG
AK (@_akhaliq), May 31, 2023
-
VisorGPT: Learning Visual Prior via Generative Pre-Training
VisorGPT: Learning Visual Prior via Generative Pre-Training. The paper proposes to learn a visual prior via generative pre-training, in a model dubbed VisorGPT. By discretizing visual locations of objects (e.g., bounding boxes, human pose, and instance masks) into sequences, VisorGPT can model visual prior… pic.twitter.com/xj84MvpE14
AK (@_akhaliq), May 31, 2023
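As a rough illustration of what discretizing bounding boxes into sequences can look like, the sketch below quantizes box corners into integer bins and flattens them into a token list. The bin count, the token format (`x10`, `y50`), and the function names are all assumptions made for this example; they are not described in the summary above.

```python
def quantize(value: float, extent: float, num_bins: int) -> int:
    """Map a coordinate in [0, extent] to an integer bin in [0, num_bins - 1]."""
    if extent <= 0 or num_bins <= 0:
        raise ValueError("extent and num_bins must be positive")
    clamped = min(max(value, 0.0), extent)
    # The upper edge (value == extent) is folded into the last bin.
    return min(int(clamped * num_bins / extent), num_bins - 1)


def boxes_to_sequence(boxes, width: float, height: float, num_bins: int = 100):
    """Flatten (label, x0, y0, x1, y1) boxes into one list of string tokens."""
    tokens = []
    for label, x0, y0, x1, y1 in boxes:
        tokens.append(label)
        tokens.append(f"x{quantize(x0, width, num_bins)}")
        tokens.append(f"y{quantize(y0, height, num_bins)}")
        tokens.append(f"x{quantize(x1, width, num_bins)}")
        tokens.append(f"y{quantize(y1, height, num_bins)}")
    return tokens


# Hypothetical annotation: one box on a 320x480 image.
sequence = boxes_to_sequence([("dog", 32.0, 48.0, 160.0, 240.0)], 320.0, 480.0)
print(sequence)
```

A sequence of this kind could be fed to an ordinary autoregressive language model, which is the sense in which locations become something a generative pre-training objective can handle.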
-
LibriTTS-R: Restored Multi-Speaker Text-to-Speech Dataset
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus. The paper introduces a new speech dataset called “LibriTTS-R”, designed for text-to-speech (TTS) use. It is derived by applying speech restoration to the LibriTTS corpus, which consists of 585 hours of speech data at 24 kHz…
-
AlteredAvatar: Fast Style Adaptation for Dynamic 3D Avatars
AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation. The paper presents a method that can quickly adapt dynamic 3D avatars to arbitrary text descriptions of novel styles. Among existing approaches for avatar stylization, direct optimization methods can produce excellent… pic.twitter.com/k9uhlZhWz0
AK (@_akhaliq), May 31, 2023
-
Geometric Algebra Transformers: A New Architecture for Geometric Data
Geometric Algebra Transformers. The paper introduces the Geometric Algebra Transformer (GATr), a general-purpose architecture for geometric data. GATr represents inputs, outputs, and hidden states in the projective geometric algebra, which offers an efficient 16-dimensional vector space…
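The figure of 16 dimensions can be checked with a short count. This sketch assumes the projective geometric algebra in question is the one for 3D space, built from four basis vectors (a detail the summary does not spell out). A geometric algebra over n basis vectors has one basis element per subset of those vectors, so its dimension is 2^n, split by grade into binomial coefficients.

```python
from math import comb

# Assumption: four basis vectors, as in the projective geometric algebra for 3D.
NUM_BASIS_VECTORS = 4

# Number of basis elements per grade: scalar, vectors, bivectors,
# trivectors, pseudoscalar.
grade_sizes = [comb(NUM_BASIS_VECTORS, k) for k in range(NUM_BASIS_VECTORS + 1)]
total_dimensions = sum(grade_sizes)

print(grade_sizes, total_dimensions)
```

Under that assumption the grades contribute 1, 4, 6, 4, and 1 elements, which sum to 16.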
-
Grammar Prompting for Domain-Specific Language Generation with LLMs
Grammar Prompting for Domain-Specific Language Generation with Large Language Models. Large language models (LLMs) can learn to perform a wide range of natural language tasks from just a handful of in-context examples. However, for generating strings from highly structured…