Running inference on an assortment of FastSpeech2 variants with a new vocoder… after 3 hours of sifting through the stack trace, I realised I forgot to `unsqueeze` my input vector! *cries in torch*
@reach_vb
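For context, the missing step looks something like this: most PyTorch models expect a leading batch dimension, so a 1-D input vector has to be `unsqueeze`d before inference. A minimal sketch (the IDs and shapes here are made up for illustration, not from any real FastSpeech2 checkpoint):

```python
import torch

# A single utterance as a 1-D vector of (made-up) phoneme IDs.
tokens = torch.tensor([3, 14, 15, 9, 2])  # shape: (5,)

# Models that expect batched input need a leading batch dimension.
batched = tokens.unsqueeze(0)             # shape: (1, 5)

print(tokens.shape, batched.shape)
```

Passing the unbatched `(5,)` tensor into a model that indexes a batch dimension typically fails several layers deep, which is why the resulting stack trace can take a while to decode.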
-
Coqui AI Open Source Text-to-Speech Solution Gains Recognition
Brilliant! Open source Text-to-Speech ftw! Kudos @coqui_ai
-
Sharing AI Models on Hub Platform
Fantastic!! Would be cool to get the models on the hub too! Happy to help with it, if you want!
-
Improving Whisper Transcription Quality with Contrastive Search Strategy
In my experience those results were quite suboptimal and didn’t quite result in usable transcriptions. With a better decoding strategy those issues can be alleviated a bit. So the hack is more about using contrastive search with Whisper to enable such use cases. Will run more
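For anyone curious, the core idea of contrastive search can be sketched in a few lines: each candidate token is scored by model confidence minus a degeneration penalty, i.e. its maximum cosine similarity to what has already been generated. This is a hand-rolled illustration with made-up vectors, not Whisper's actual decoding code:

```python
import numpy as np

def contrastive_score(prob, cand_vec, context_vecs, alpha=0.6):
    """Score a candidate token as (1 - alpha) * model confidence
    minus alpha * max cosine similarity to the context (degeneration penalty)."""
    def cos(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
    penalty = max(cos(cand_vec, c) for c in context_vecs)
    return (1 - alpha) * prob - alpha * penalty

# Context: hidden states of the tokens generated so far (made-up numbers).
context = [np.array([1.0, 0.0]), np.array([0.8, 0.6])]

# A repetitive candidate (identical to a context vector) vs a novel one.
repetitive = contrastive_score(0.9, np.array([1.0, 0.0]), context)
novel = contrastive_score(0.5, np.array([0.0, 1.0]), context)

print(repetitive, novel)
```

Despite its higher probability, the repetitive candidate scores lower, which is exactly the mechanism that discourages the degenerate loops greedy decoding can fall into.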
-
Fine-Tuning Whisper Model for Improved Performance
Definitely yes! You can fine-tune Whisper to boost the model performance: https://huggingface.co/blog/fine-tune-whisper
-
Contrastive Search Outperforms Greedy Search in Text Generation
Interesting! Can you try running the next cell with contrastive search? In my experience greedy search doesn’t perform as well! Would be curious to see the results 🙂
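The kind of failure greedy search exhibits can be simulated with a toy next-token table: following the argmax at every step gets stuck in a loop, while a crude repetition penalty (standing in for contrastive search's degeneration penalty) breaks out of it. Purely illustrative, not a real language model:

```python
# Toy next-token model: token -> list of (next token, probability).
NEXT = {
    "the": [("cat", 0.6), ("dog", 0.4)],
    "cat": [("the", 0.7), ("sat", 0.3)],
    "dog": [("ran", 0.9), ("the", 0.1)],
    "sat": [("down", 1.0)],
    "ran": [("home", 1.0)],
    "down": [], "home": [],
}

def greedy(start, steps=6):
    # Always follow the argmax: "the" -> "cat" -> "the" -> "cat" ... forever.
    out, tok = [start], start
    for _ in range(steps):
        cands = NEXT[tok]
        if not cands:
            break
        tok = max(cands, key=lambda c: c[1])[0]
        out.append(tok)
    return out

def penalised(start, steps=6):
    # Halve a candidate's score for each prior occurrence in the output:
    # a crude stand-in for contrastive search's degeneration penalty.
    out, tok = [start], start
    for _ in range(steps):
        cands = NEXT[tok]
        if not cands:
            break
        tok = max(cands, key=lambda c: c[1] * 0.5 ** out.count(c[0]))[0]
        out.append(tok)
    return out

print(greedy("the"))     # loops between "the" and "cat"
print(penalised("the"))  # escapes the loop and reaches an end token
```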
-
AI Transcription Accuracy and Generation Strategy Optimization
Yess! Quite possible – although the accuracy of the transcriptions still needs to be checked. I’ve found it to lose context if not used with the right generation strategy!
-
Contrastive Search Benchmarks Coming Soon for Developers
Definitely! More benchmarks on contrastive search coming soon too. Thanks for empowering millions of developers!
-
Contrastive Search vs Greedy Search for LLM Generation
Yeah! Would definitely recommend contrastive search. It works quite well (however, it over-generates sometimes). Greedy search simply results in lost context.
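In the Hugging Face transformers API, the switch between the two strategies is just a pair of `generate()` kwargs: `penalty_alpha` together with `top_k` enables contrastive search, while `num_beams=1` with `do_sample=False` (the default) is greedy search. A sketch, assuming a causal LM `model` and tokenised `input_ids` are already loaded:

```python
# Greedy search (transformers' default): follow the argmax at each step.
greedy_kwargs = dict(do_sample=False, num_beams=1, max_new_tokens=64)

# Contrastive search: penalty_alpha weighs the degeneration penalty,
# top_k bounds the candidate set considered at each step.
contrastive_kwargs = dict(penalty_alpha=0.6, top_k=4, max_new_tokens=64)

# Usage (hypothetical names; `model` and `input_ids` are assumed to exist):
# output_ids = model.generate(input_ids, **contrastive_kwargs)
```

The `penalty_alpha=0.6, top_k=4` pairing is the commonly cited starting point; over-generation can be reined in with `max_new_tokens` or an explicit stopping criterion.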
-
Fine-tuned Models Performance on Multilingual Translation Tasks
Agreed! From my experience, it works well on languages that were abundant in the train set and also had lang -> en translation pairs. I’ll run some experiments to see how well fine-tuned models perform. Quite lovely to see ya already using this!