In my experience those results were quite suboptimal and didn’t quite result in usable transcriptions. With better decoding strategy those issues can be alleviated a bit. So the hack is more so of using contrastive search with Whisper to enable such use cases. Will run more
Improving Whisper Transcription Quality with Contrastive Search Strategy
By
–
Leave a Reply