Participant
June 3, 2024
Open for Voting

Transcriptions are bad

  • 3 replies
  • 440 views

Text-based transcriptions don't let you choose speakers, but static ones do?

 

When transcribed, they have different words, and even then the transcriptions are so inaccurate on both ends. What gives?

The first line should be:

"Oh, what? I'm banning? What the hell?"

 

Second:

 

"It's your time to shine"

 

At a quick glance you can see that so much of this is broken English, and trust me, the audio is not that bad when it comes to comprehension, especially the first speaker's lines (blue).

 

Btw, can't color-code speakers? Static transcriptions don't like curse words??

Why isn't there an option to put in unique words like brand names etc.???

 

Adobe loves AI so much now, yet they can't even implement it to get the audio transcribed correctly????

 

TikTok transcribes 1000% better than this. I'm so confused, what is happening here? Where's my money going?????

 

There are so many discrepancies and oddities in this archaic system. How long is this going to last??????

 

Bonus issue:

What's up with the unintuitive way that masks are transformed? I have to select two points while holding Shift just to move one side of a rectangle? Why can't I just drag the side outward like in any other Adobe app?

 

This mediocrity is disappointing and annoying. I'm not even trying to be rude; I'm genuinely so confused by these baffling systems.

 

If these manage to get fixed I'd be impressed, even though all of this should have been a given years ago.

3 replies

Participant
June 4, 2024

Oh, I made a lengthy reply in response to Matt's reply, but it kinda disappeared and I was too lazy to retype it all. Basically, I figured out my main issue was the unintuitive aspect of clicking source clips and trying to retranscribe through there.

 

Not to mention that the source clip button being at the bottom left is jarring as well.

 

I concluded my response by saying something along the lines of, "I guess my transcription was better after doing multiple speakers", but I've already spent 2+ hours today retranscribing every line by hand and I'm only about halfway done with my 30-minute clip.

 

I also don't like having to double-click the text to edit it.

I don't like how it keeps playing if you try to edit while playback is going, so the line keeps moving while you're typing an edit.

I don't like how you can't add transcription "paragraphs" since sometimes it just gets the speaker completely wrong.

I don't like how you can't switch speakers on the fly.

 

Why should I have to go to source clip anyway? Just let us transcribe each individual track or something?

 

The color codes are just there to censor the names, essentially, but I was saying that if there were color codes this would be a lot easier, especially when turned into captions. You could see which speaker it is on the fly.

 

Anyway, transcriptions still suck (no offense). They are literally just wrong on almost every line. When someone says "Ow" it always puts "Oh". And that's just one small example.

 

Feel free to hire me.

 

Stan Jones
Community Expert
June 4, 2024

@HUEY280489983phw,

 

What version of PR are you using?

 

The differences in the transcription are interesting. @Kerstin Ebert @Alexander_DVA Can you comment?

 

In addition to @mattchristensen's point, if you use the auto-transcribe source setting and you do not set the preference to identify speakers, it won't identify them.

 

How are you getting the speaker color codes?

 

Stan

 

mattchristensen
Legend
June 3, 2024

Text-Based Editing transcriptions can identify speakers and let you change the speaker. If you are seeing "Unknown" as the speaker, it means you transcribed with the option to detect speakers turned off, so "Unknown" is placed in the speaker name. Re-transcribe those clips with speaker detection turned on and you will then see Speaker 1, Speaker 2, etc. listed. When you are looking at the source transcript, you can change the name of the speaker.