Transcription combines speakers

Question

I've been using the transcription feature to create captions for my YouTube channel, and for the most part it's a big help. But the speaker detection tends to merge speakers together, especially female speakers. In my footage, there are 3 female speakers: 2 with Australian accents and 1 with an American accent. Premiere Pro doesn't distinguish between the three of them, even though one has a completely different accent!

Repro steps:

1. Record a video with multiple speakers, both male and female.

2. Use Premiere Pro's audio transcription feature to generate captions.

3. Notice that similar-sounding speakers get treated as the same speaker. The effect is more pronounced for female voices than male voices. (Perhaps it is pitch-related?)

Rach McIntire · Answer

Hi @Tim362650694hsr,

Welcome to the Premiere Pro forums! We are glad to see you here. That definitely does sound frustrating. Can you provide a sample clip for the team to reproduce the issue? Feel free to send me a DM.

We need a few more details to help with this issue. Please see: How do I write a bug report?

I hope we can help you soon. Sorry for the frustration!
