Skip to main content
Participant
July 16, 2022
Answered

How to align multiple audio clips with same number of segments

  • July 16, 2022
  • 1 reply
  • 423 views

I have 3 audio clips with exact same speech generated by azure text to speech service. As 3 audio clips are using different tone, there are differences in the time finishing same sentences and words. Even I adjust the azure parameter and produce an almost exact total length, each individual sentence still have slightly different speed and finish time.  As the clip becomes longer, the shift difference accumulates and becomes obvious at the later part.

The below audio clips are edited by detecting and cutting out the silent part, each audio clip has exact number of segment.

What I want to do is align each segment at the beginning, even each sentence has slightly difference speed and ending time, the difference is acceptable to me since the shift problem is not accumulating anymore. so like the below image, I can align manually, but there are hundreds of clips. is there a method to this automatically?

the azure service doesn't have any parameter to make this result, so the remaining choice would be adjusting it in adobe audition.

This topic has been closed for replies.
Correct answer SteveG_AudioMasters_

I'm afraid that there isn't any method of auto-aligning clips, no. But even for a few hundred clips, it wouldn't take long - you have the orthogonal line-up snap indication there (the one you can see when you drag the clip along the timeline) which makes it pretty easy to do.

 

One of the problems of automating this would be the issue of deciding exactly what you were aligning with what, and how far it would be acceptable for a clip to be 'pulled'. Since Audition has no idea of what the content of any of the clips is, I could see plenty of places above where errors could slip in...

1 reply

SteveG_AudioMasters_
Community Expert
SteveG_AudioMasters_Community ExpertCorrect answer
Community Expert
July 16, 2022

I'm afraid that there isn't any method of auto-aligning clips, no. But even for a few hundred clips, it wouldn't take long - you have the orthogonal line-up snap indication there (the one you can see when you drag the clip along the timeline) which makes it pretty easy to do.

 

One of the problems of automating this would be the issue of deciding exactly what you were aligning with what, and how far it would be acceptable for a clip to be 'pulled'. Since Audition has no idea of what the content of any of the clips is, I could see plenty of places above where errors could slip in...