"Intermediate representation" refers to a representation of the music generated from the text prompt. The generator model first turns the text prompt into this intermediate representation, which captures elements of the music such as: -Genre
-Tempo
-Instruments
-Mood
-Era
Intermediate representation in music generation from text prompts
By
–