Spaces of the week! We've got Text to Image, Text to Video, multilingual LLMs and much more!
@reach_vb
-
Open Source AI Project Code Available for Experimentation
But it’s open: with all the code open source, you can fiddle with it, try different prompts, and so on. The final output is not at that level yet, but it only gets better from here.
-
F5-TTS Audio Quality Optimization with Reference Speakers
Nice! Lmk how it goes. I’ll be back in the office tomorrow and will take a deeper look. Btw, from my experience w/ F5-TTS, the generation quality depends quite a bit on the reference audio, so it might be worth checking with different speaker prompts.
-
Audio Refinement Step for AI Generation Quality Improvement
Another cool thing to experiment with would be to add an audio refiner as an optional Step 5, with the sole goal of making the generation sound as good as possible. Resemble Enhance would fit in well there.
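A minimal sketch of how an optional refiner step could hang off the end of the pipeline. Everything here is hypothetical: `run_tts`, `generate`, and `fake_refiner` are placeholder names, the "audio" is just a list of floats, and no real Resemble Enhance or TTS API is called.

```python
from typing import Callable, List, Optional

Audio = List[float]  # stand-in for a waveform; a real pipeline would use tensors/arrays


def run_tts(text: str) -> Audio:
    # Hypothetical placeholder for the TTS step: one silent sample per character.
    return [0.0 for _ in text]


def generate(text: str, refiner: Optional[Callable[[Audio], Audio]] = None) -> Audio:
    # Run TTS, then apply the refiner only if one was supplied (the optional "Step 5").
    audio = run_tts(text)
    if refiner is not None:
        audio = refiner(audio)
    return audio


def fake_refiner(audio: Audio) -> Audio:
    # Placeholder refiner; a real one would call an audio enhancement model.
    return [sample + 1.0 for sample in audio]


raw = generate("hi")
refined = generate("hi", refiner=fake_refiner)
```

Keeping the refiner as a plain callable means the step can be skipped entirely or swapped for a different enhancer without touching the rest of the pipeline.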
-
F5-TTS and E2 TTS: Exploring the Best Text-to-Speech Solutions
Heya! I think F5-TTS / E2 TTS is the best atm. It’d be cool to experiment with it.
-
Qwen 2.5 Stack Upgrade Benefits Performance
Ah yeah! I bet you can get massive benefits just by upgrading the stack to Qwen 2.5
-
Llama PDF Pre-Processing Logic Guide Released
You can find it here: https://github.com/meta-llama/llama-recipes/blob/main/recipes/quickstart/NotebookLlama/Step-1PDF-Pre-Processing-Logic.ipynb
-
Open Source AI Stack: Swappable Domain-Specific Models
Beauty of open source: You can replace any part of this stack drop-in with domain-specific/fine-tuned/better models too.
Open intelligence. https://t.co/7b8e5bWxAZ pic.twitter.com/kqbvMyGqEj
Vaibhav (VB) Srivastav (@reach_vb), October 27, 2024
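To make the drop-in idea concrete, here is a rough sketch that treats the NotebookLlama stack from this thread as a small config object and swaps individual parts. The `Stack` class and its field names are made up for illustration, and the strings are informal labels taken from the thread, not verified model repo IDs.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Stack:
    # Labels follow the thread's shorthand; they are not real repo IDs.
    preprocess: str = "Llama-3.2-1B-Instruct"
    transcript: str = "Llama-3.1-70B"
    rewrite: str = "Llama-3.1-8B"
    tts: str = "Parler TTS"


default = Stack()

# Swap two stages for alternatives mentioned in the thread; the rest stay as-is.
custom = replace(default, transcript="Qwen 2.5", tts="F5-TTS")

print(default)
print(custom)
```

Because the config is frozen and `replace` returns a new object, the default stack stays available for side-by-side comparisons.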
-
Meta Releases Llama Recipes GitHub Repository for Quick Start
Check it out here: https://github.com/meta-llama/llama-recipes/tree/main/recipes/quickstart/NotebookLlama
-
Meta Releases NotebookLlama: Open Recipe Using Llama Models
Wow! Meta dropped an open NotebookLM recipe: NotebookLlama 🔥
It uses L3.2 1B/3B for pre-processing the PDF, L3.1 70B for transcript creation, L3.1 8B for re-writes and Parler TTS for text to speech ⚡
Step 1: Pre-process PDF: Use Llama-3.2-1B-Instruct to pre-process the PDF… pic.twitter.com/L7hb5GsMtl
Vaibhav (VB) Srivastav (@reach_vb), October 27, 2024
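The stage-to-model breakdown above can be written down as a small lookup, sketched here. The Hugging Face repo IDs are assumptions: the post only spells out `Llama-3.2-1B-Instruct`, so the other IDs (including the specific Parler TTS checkpoint) are guesses at matching checkpoints rather than what the recipe necessarily uses.

```python
# Stage-to-model map for the recipe as described in the post.
# Repo IDs other than Llama-3.2-1B-Instruct are assumptions, not taken from the post.
PIPELINE = [
    ("pre-process PDF", "meta-llama/Llama-3.2-1B-Instruct"),
    ("write transcript", "meta-llama/Llama-3.1-70B-Instruct"),
    ("re-write transcript", "meta-llama/Llama-3.1-8B-Instruct"),
    ("text to speech", "parler-tts/parler-tts-mini-v1"),
]


def describe(pipeline):
    # Number the stages from 1 to match the "Step N" wording in the post.
    return [
        f"Step {i}: {stage} -> {model}"
        for i, (stage, model) in enumerate(pipeline, start=1)
    ]


for line in describe(PIPELINE):
    print(line)
```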