Skip to main content
Participating Frequently
September 14, 2023

Transcription and Captioning Improvements

  • September 14, 2023
  • 1 reply
  • 557 views

I would like to request various improvements to the automatic transcription for source files.

 

1. Improved accuracy in general. We are seeing a lot of incorrect words and a huge amount of incorrect punctuation in the transcripts. I understand that punctuation is not always obvious in dialogue, but the amount of mistakes has been frustrating. The automatic transcription needs to do better with introductory clauses/phrases, in particular. It also needs to handle people with accents better.

 

2. A way to input commonly-used acronyms and phrases so that the AI can learn them and favor them for your transcripts. Our industry uses a very large number of acronyms and unique jargon. The automatic transcription does horribly with acronyms. If I could add these to a dictionary or list for the automatic transcription to reference and favor when it thinks these words are said, it would be helpful. We use search and replace right now, but the system very often uses many different alternative words/phrases inconsistently instead of the correct one or just one consistent alternative, which makes it difficult to find all incorrect instances.

 

3. The ability to export transcripts without any timestamps. We do not need the timestamps for our transcripts. It is a pain to manually remove them all.

 

4. The ability to import untimed, plain-text transcripts and have Premiere automatically determine the timings and generate captions. Right now, we can only generate captions using the automatic transcription. If we receive a transcript that was already made by another department, however, we cannot just import it and have Premiere automatically add the timings. We are often given transcripts in plain text without the timings. We are not given SRT files. I know that YouTube is able to add timings to plain text, so this feature should, theoretically, be possible.

 

5. More accurate caption length and line numbers. Even though we adjust the settings for maximum number of characters, time, and number of lines, these settings are not applied accurately. For example, we frequently will end up with captions that include just one word and last one second even though we had a large duration and character limit. This seems to be some sort of bug in the way the settings are applied.

 

6. The ability to export the captions as a VTT file. We do not use SRT in our industry. We use VTT. I do not want to have to use a third-party tool to convert an SRT to a VTT.

 

We are really enjoying the text-based editing features. Improving the automatic transcription and captioning would make this process easier and even more useful for us. Thank you!

1 reply

Stan Jones
Community Expert
Community Expert
September 15, 2023

Olivia,

 

Do you see other feature requests or bug reports on #1 and 2? I think they are there, but do not find them in my notes.

 

3 - This was a deliberate change because there were so many users wanting the export with timecodes. I don't know if there is still a feature request to add back an option for text only. My workaround is to export the csv, then use just the text column. There are some formatting issues, and the best option appears to be to copy the text column and paste into Word. If you paste directly into a text editor, double lines get quotes around them.

 

4 - The relatively new "import corrected transcript" moves this closer. I cannot find current feature requests. All my notes are back to the UserVoice system.

 

I have experimented quite a bit, but I don't think I have posted a method yet. But try this workaround:

Create a source media or sequence transcription of your material. Export as .txt. Create a .txt file with just the text of your external transcript (one big paragraph, one space between each section of the transcript). I use a single line for the time code with the first time and last time in the PR transcript, and a single entry for speaker. But at least one user indicates success with just plain text.

 

Import as corrected. PR applies the timecodes.

 

5 - I do believe there are bugs here. Upvote this bug report and see my comments there:

https://community.adobe.com/t5/premiere-pro-bugs/create-captions-minimum-duration-in-seconds-being-completely-ignored/idi-p/13985748

 

6 - You have commented on this feature request:

https://community.adobe.com/t5/premiere-pro-ideas/export-captions-as-vtt-webvtt-files/idi-p/13515926

It was carried over from the UserVoice system (thus all the 1/24/23 dates), and does have some comments since. But not nearly enough for such an important option.

 

Stan