@reach_vb

–

25 November 2024 22h29

Check out the model weights and inference code base here:

OuteTTS v0.2: Compact Multilingual TTS with Voice Cloning

By

–

25 November 2024 22h28

Smol TTS keeps getting better! Introducing OuteTTS v0.2 – 500M parameters, multilingual with voice cloning! 🔥

> Multilingual – English, Chinese, Korean & Japanese
> Cross platform inference w/ llama.cpp
> Zero-shot voice cloning
> Trained on 5 Billion audio tokens
> Qwen 2.5… pic.twitter.com/N4c4Ukfrhz
— Vaibhav (VB) Srivastav (@reach_vb) 25 novembre 2024

Smol TTS keeps getting better! Introducing OuteTTS v0.2 – 500M parameters, multilingual with voice cloning! > Multilingual – English, Chinese, Korean & Japanese
> Cross platform inference w/ llama.cpp
> Zero-shot voice cloning
> Trained on 5 Billion audio tokens
> Qwen 2.5

Qwen 2 VL + FLUX: Advanced Open Source Image Generation

By

–

25 November 2024 16h24

Qwen 2 VL + FLUX > supports variation, img2img, inpainting, and controlnet-guided generation
> depth estimation and line detection for precise structural guidance
> aspect ratios up to 1536×1024 MIT licensed!

Hugging Face SmolLM: Lightweight Language Model Released

By

–

25 November 2024 14h03

What are ya waiting for? https://
github.com/huggingface/sm
ollm
…

SmolLM: Open Source Language Model Training and Deployment

By

–

25 November 2024 13h59

SmolLM – run, pre-train, fine-tune, evaluate SoTA fully open source LM Run with Transformers, MLX, Transformers.js, MLC Web-LLM, Ollama, Candle and more! Apache 2.0 licensed codebase – go explore now!

Optimizing llama.cpp quantization performance gains

By

–

24 November 2024 23h00

good reminder: I need to check my llama.cpp quants I suspect I’m leaving perf on the table.

24 November 2024

Llava o1: Open-Source Vision Language Model with CoT

By

–

24 November 2024 21h02

Llava o1: https://
huggingface.co/Xkev/Llama-3.2
V-11B-cot
…

24 November 2024

Major Open Source LLM Releases: Pixtral, Tülu Compete With Claude

By

–

24 November 2024 20h50

Massive week for Open AI/ ML: @MistralAI Pixtral & Instruct Large – ~123B, 128K context, multilingual, json + function calling & open weights @allen_ai Tülu 70B & 8B – competive with claude 3.5 haiku, beats all major open models like llama 3.1 70B, qwen 2.5 and nemotron Llava

24 November 2024

Open Weights and Science Drive AI Use Cases Forward

By