Structured analysis and generation in music, audio, and beyond