EzAudio Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer discuss: https://
huggingface.co/papers/2409.10
819
… demo: https://
huggingface.co/spaces/OpenSou
nd/EzAudio
… Latent diffusion models have shown promising results in text-to-audio (T2A) generation tasks, yet previous models have encountered
EzAudio: Efficient Diffusion Transformer for Text-to-Audio Generation
By
–
