Sorry to interrupt your conversation with Steve. I already see some misunderstanding popping up in your original question which is worrying to me: I want something that is purely interactive -- no timeline necessary.
Timeline is essential in Captivate, and a lot of users do not understand its importance. Interactivity in Captivate is linked with that timeline, because you have the opportunity to pause the timeline using interactive objects or slide events. Based on my long experiences as trainer, consultant for Captivate (will not talk about the thousands of answers I give on this forum), recently I wrote an article about the three stumbling blocks for newbies. You will see that Timeline is by far the first of them.
Challenges for Starters - Captivate blog
I have quite a lot of examples on my blog, interactive tutorials, although very small ones. How to use audio is another subject that you'll find in several posts (I am a professional musician as well). Here is an old one, illustrating what normally is meant by 'interactive tutorial', because there is a lot of misunderstanding about that word; you will see why the timeline is so important
Playtime with Audio and Widgets - Captivate blog
If you ask me whether Captivate is ideal for music tutorials, I would hesitate however, because it doesn't have a way to capture what the user is playing or singing. Do not forget that the hardware needed for correct interpretation of sound is a factor that is underestimated, and that having good quality audio in courses means that you often end up with large files. Not everyone can hear the difference between a WAV and a MP3 file as I do, and the way compression has to happen will define the quality.
Please do read some of my blog posts, and if you have more questions, fire away.