Skip to main content
Inspiring
June 1, 2016
Answered

Timing Text to Speech to Video?

  • June 1, 2016
  • 2 replies
  • 636 views

I am writing a tutorial for coworkers about embedding text to speech within Adobe Captivate 8 and the cpvc file. Ideally, I would be supplying my own vocal instruction with a mic as I am working on the video capture. However, since we are not doing it this way I am going to import the external text to speech file into the cpvc timeline. 

I would like to ask the pros what I am about to write makes sense.

I am going to suggest that before the captivate user makes the video, they should write out the copy of the tutorial. Once the finalized copy is complete they should measure the amount of time the copy takes to complete before doing each section of the video. This way they can match the video split with the audio file imported into that split. In doing so, they won't make the video split stretch out too long to match the audio file or leave silence on the video split. Is this something you would do?

Thanks for your help.

This topic has been closed for replies.
Correct answer Lilybiri

For a CPVC file I will indeed start writing out the script. When recording the video, I will 'speak' the script in a normal rhythm to be able to execute the required actions as will be convenient for the final audio clip to be inserted later on. You can also, if wanted, capture the video in different steps, maybe because later on you'll have to insert static objects like still images for titles. With that approach, less editing will have to be done when you insert the final audio (clips). I never generate them by TTS but record them with Audition and a good microphon.

2 replies

Erik Lord
Inspiring
June 1, 2016

Agreed. For such a use case, I have a script and practice it while moving the mouse around, clicking things, etc.

I often put cues in the script I'm reading too, to indicate what action I should take when reading - mostly for longer sentences where I don't want no movement on the screen.

This is usually enough to be close enough...then I can adjust the screen elements as needed.

(this is one reason I prefer Captivate for NON full-capture video, much easier to adjust elements. If doing 'CPVC' video, I'd default to Camtasia in most situations, just due to experience with it...and timings there need to be much tighter)

Lilybiri
Legend
June 1, 2016

Erik, the OP was talking about a cpvc-project, hence my answer.

Lilybiri
LilybiriCorrect answer
Legend
June 1, 2016

For a CPVC file I will indeed start writing out the script. When recording the video, I will 'speak' the script in a normal rhythm to be able to execute the required actions as will be convenient for the final audio clip to be inserted later on. You can also, if wanted, capture the video in different steps, maybe because later on you'll have to insert static objects like still images for titles. With that approach, less editing will have to be done when you insert the final audio (clips). I never generate them by TTS but record them with Audition and a good microphon.