reddit thread: https://
reddit.com/r/ChatGPT/comm
ents/142bzk3/selflearning_of_the_robot_in_1_hour/
…
@_akhaliq
-
Robot Self-Learning in One Hour Reddit Discussion
By
–
-
Neuralangelo: High-Fidelity Neural Surface Reconstruction Method
By
–
Neuralangelo: High-Fidelity Neural Surface Reconstruction
— AK (@_akhaliq) 6 juin 2023
paper page: https://t.co/omcwjlyeha
present Neuralangelo, which combines the representation power of multi-resolution 3D hash grids with neural surface rendering. Two key ingredients enable our approach: (1) numerical… pic.twitter.com/n3beH5Oa4ENeuralangelo: High-Fidelity Neural Surface Reconstruction paper page: https://
huggingface.co/papers/2306.03
092
… present Neuralangelo, which combines the representation power of multi-resolution 3D hash grids with neural surface rendering. Two key ingredients enable our approach: (1) numerical -

GPT Models Meet Robotic Applications: Co-Speech Gesturing Chat System
By
–
GPT Models Meet Robotic Applications: Co-Speech Gesturing Chat System paper page: https://
huggingface.co/papers/2306.01
741
… introduces a chatting robot system that utilizes recent advancements in large-scale language models (LLMs) such as GPT-3 and ChatGPT. The system is integrated with a -

Binary and Ternary Neural Networks for Efficient Natural Language Generation
By
–
Binary and Ternary Natural Language Generation paper page: https://
huggingface.co/papers/2306.01
841
… Ternary and binary neural networks enable multiplication-free computation and promise multiple orders of magnitude efficiency gains over full-precision networks if implemented on specialized -
HeadSculpt: Text-Guided 3D Head Avatar Generation Method
By
–
HeadSculpt: Crafting 3D Head Avatars with Text
— AK (@_akhaliq) 6 juin 2023
paper page: https://t.co/DeebCTOHxp
Recently, text-guided 3D generative methods have made remarkable advancements in producing high-quality textures and geometry, capitalizing on the proliferation of large vision-language and image… pic.twitter.com/JOGeVMvgbIHeadSculpt: Crafting 3D Head Avatars with Text paper page: https://
huggingface.co/papers/2306.03
038
… Recently, text-guided 3D generative methods have made remarkable advancements in producing high-quality textures and geometry, capitalizing on the proliferation of large vision-language and image -

Diffusion Models Effectiveness for Optical Flow and Depth Estimation
By
–
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation paper page: https://
huggingface.co/papers/2306.01
923
… Denoising diffusion probabilistic models have transformed image generation with their impressive fidelity and diversity. We show that they also -

PolyVoice: Language Models for Speech-to-Speech Translation
By
–
PolyVoice: Language Models for Speech to Speech Translation paper page: https://
arxiv.org/abs/2306.02982 propose PolyVoice, a language model-based framework for speech-to-speech translation (S2ST) system. Our framework consists of two language models: a translation language model and a -

Polyglot-Ko: Open-Source Large-Scale Korean Language Models
By
–
A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models paper page: https://
huggingface.co/papers/2306.02
254
… Polyglot is a pioneering project aimed at enhancing the non-English language performance of multilingual language models. Despite the availability of various -

SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model
By
–
SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model paper page: https://
huggingface.co/papers/2306.02
245
… With the development of large language models, many remarkable linguistic systems like ChatGPT have thrived and achieved astonishing success on many tasks, showing the incredible -

VisualGPTScore: Vision-Language Models with Multimodal Generative Pre-Training
By
–
VisualGPTScore: Visio-Linguistic Reasoning with Multimodal Generative Pre-Training Scores paper page: https://
huggingface.co/papers/2306.01
879
… Vision-language models (VLMs) discriminatively pre-trained with contrastive image-text matching losses such as P(match|text, image) have been criticized