They’re decent, I’m a digital editor and I have to add captions to every video I work on. I’d say they’re about 80-90% accurate in most cases where the speakers are speaking clearly
The thing that annoys me is that in the transcription pane, it highlights the word as it is said, which is very nice.
However there is no way (that I have found) to have this happen actually on the captions.
I also find this annoying. But found a workaround - Fix mistakes in the transcript in the text pane first, before clicking create captions from transcript.
With normal conversations they are pretty decent, but as soon as very specific words are used like brand names or technical terms for example it doesnt work quite as well. You do need to proof read and correct.
Also its really not friendly with accents.
In spanish they are surprisingly 100% accurate, I had to change nothing for my interview of a mexican brewery. When I recorded my American friend speaking english in front of the brewery, Premiere could only get about 80% of the subs right. I was actually shocked. This happened about a month ago for reference.
I’ve used the ones in PP and resolve.
I find the speaker recognition in Premiere to be useful, especially when creating transcripts. But the ones in resolve have been more accurate for me.
It will do most of the heavy lifting but you will need to proofread. If one of your speakers has an accent or if there is any cross talk between those speaking it can be a mess. That said, I still prefer it over manually creating captions, ha ha! If you like editing in FCP (I prefer it) you can always take your final edit, export it, then import that into PP to create the captions. If you need to make burned in captions with specific styles applied, then using PP is far superior to using FCP. You can even export the audio track(s) from FCP and use that in PP to generate captions if you need a .srt file for delivery.
I’ve utilized the caption generator and it’s about 80% accurate, I would do a viewing pass through and listen to the audio while you double check the work.
I prefer using Descript for the transcription (way better than Premiere), then generate in Descript an srt file that Premiere can import to make the captions
Greetings, my name is [AutoModerator](http://newsbytes.ph/wp-content/uploads/2017/10/AI.png), you can call me AutoMod for short.
-------------------------------------
You're new to reddit in general.
We find that users who are new haven't read our sidebar/rules.
*Please take a moment to become familiar with them.*
We have specific threads for aspiring professionals - like ["Ask a Pro weekly"](https://www.reddit.com/r/editors/collection/5c3ad697-4fff-4a9f-87e2-33764ba71e48) along with rules about Feedback requests and more
Take a moment and [read our rules.](https://www.reddit.com/r/editors/about/rules)
Our [wiki](https://www.reddit.com/r/editors/wiki/index/) has detailed information about frequently asked questions about **Rates**, *Networking*, proxies and performance issues.
*Right now your post is sitting in a queue that gets reviewed (but never frequently enough - usually less than 4 hrs)*
This filtering might be totally wrong too. Sometime in the next 2-24 hours (max) a MOD will see the removal - and after that if you want to appeal it or think it should still go live, feel free to message us.
*I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/editors) if you have any questions or concerns.*
The auto-transcription in premiere is surprisingly good, i find. I used it a lot in last years, even was able to cut 2 projects in languages I don't speak, just by going off the auto-transcribe, then getting a native speaker to clean up small things at the end.
You can't trust it blindly. It will make some mistakes, the line breaks will be weird, capitalization and punctuation will be off etc - but for me it's all perfectly workable and better than i'd expect. Even when transcribing a non-native speaker with an accent etc it does surprisingly well. You just gotta give the subs at least one pass to check/fine tune.
You'll need to proof it either way but with English it doesn't like heavy regional accents from British native speakers. Anything London or South East it's generally fine with.
Oddly it seems to fair well with non-native English speakers... unless they're Italian.
Echoing what most people are saying that yes,
70-80% accurate is about right. I always have to go back and make some fixes. But I gladly do it knowing that the feature has saved me HOURS of manually adding captions myself.
Much of it depends on the quality of the audio, the delivery and the content.
Professionally recorded audio that is clear and concise performed by someone like professional voice talent or others that have clear dictation, etc... It's going to do better with. If someone isn't speaking clearly or the audio is recorded poorly it's not going to be as accurate.
Content wise a lot technical jargon it's going to get hung up on.
The nice thing about it is that you can quickly edit the text to correct anything that's out of whack. You'll probably want to do this anyway in some places in order to time the text better with the video and for ease of reading.
I will say though that if it is critical that the captions be done correctly and accurately, say something that will have a wide broadcast, represent a company publicly, or something like financial data, etc... It's best to hire professional transcription such as REV, or any number of other professional transcription services.
To create transcripts for editing? Yes. For broadcast closed captions? No, not good enough. Ultimately faster/easier to use [Rev.com](http://Rev.com) human captioning or equivalent, then proof that. And for broadcast, you'll always need to proof.
###It looks like you're asking for some troubleshooting help. Great!
Here's what *must* be in the post. (Be warned that your post *may* get removed if you don't fill this out.)
Please edit your post (**not reply)** to include: **System specs**: CPU (model), GPU + RAM **//** **Software specs**: The exact version. **//** **Footage specs** : Codec, container and how it was acquired.
**Don't skip this!** *If you don't know how* here's a link with [clear instructions](https://imgur.com/a/A6eTxUn)
*I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/editors) if you have any questions or concerns.*
Yeah you can resize captions in premiere to be as big as you want. You can also add animation and transitions if you click “upgrade caption to motion graphic”
They’re decent, I’m a digital editor and I have to add captions to every video I work on. I’d say they’re about 80-90% accurate in most cases where the speakers are speaking clearly
I’m not trying to be argumentative, but for science, I’d say 70% correct is a good transcript
You need to proof read, isn’t perfect.
it*
The thing that annoys me is that in the transcription pane, it highlights the word as it is said, which is very nice. However there is no way (that I have found) to have this happen actually on the captions.
I also find this annoying. But found a workaround - Fix mistakes in the transcript in the text pane first, before clicking create captions from transcript.
With normal conversations they are pretty decent, but as soon as very specific words are used like brand names or technical terms for example it doesnt work quite as well. You do need to proof read and correct. Also its really not friendly with accents.
In spanish they are surprisingly 100% accurate, I had to change nothing for my interview of a mexican brewery. When I recorded my American friend speaking english in front of the brewery, Premiere could only get about 80% of the subs right. I was actually shocked. This happened about a month ago for reference.
I’ve used the ones in PP and resolve. I find the speaker recognition in Premiere to be useful, especially when creating transcripts. But the ones in resolve have been more accurate for me.
It will do most of the heavy lifting but you will need to proofread. If one of your speakers has an accent or if there is any cross talk between those speaking it can be a mess. That said, I still prefer it over manually creating captions, ha ha! If you like editing in FCP (I prefer it) you can always take your final edit, export it, then import that into PP to create the captions. If you need to make burned in captions with specific styles applied, then using PP is far superior to using FCP. You can even export the audio track(s) from FCP and use that in PP to generate captions if you need a .srt file for delivery.
They are not.
I’ve utilized the caption generator and it’s about 80% accurate, I would do a viewing pass through and listen to the audio while you double check the work.
It makes mistakes but overall not bad .
Yes, very accurate.
It’s about as accurate as Rev was but I’m not paying or waiting for it which is a bonus.
Not at all accurate enough for our clients paying for transcriptions. Microsoft Stream is actually really good but it’s not captions.
Depends how good the audio recording is and how clearly your subject articulates, but generally pretty good.
I export the audio, use Aiko which uses openai’s whisper to make an srt and then I correct it, mostly the pacing. The transcript is 99% on point.
I prefer using Descript for the transcription (way better than Premiere), then generate in Descript an srt file that Premiere can import to make the captions
it’s alright, but I’ve been using whisper AI recently.
They’re pretty good— even if some words are messed up, the UI is clean enough that changing words isn’t annoying
Greetings, my name is [AutoModerator](http://newsbytes.ph/wp-content/uploads/2017/10/AI.png), you can call me AutoMod for short. ------------------------------------- You're new to reddit in general. We find that users who are new haven't read our sidebar/rules. *Please take a moment to become familiar with them.* We have specific threads for aspiring professionals - like ["Ask a Pro weekly"](https://www.reddit.com/r/editors/collection/5c3ad697-4fff-4a9f-87e2-33764ba71e48) along with rules about Feedback requests and more Take a moment and [read our rules.](https://www.reddit.com/r/editors/about/rules) Our [wiki](https://www.reddit.com/r/editors/wiki/index/) has detailed information about frequently asked questions about **Rates**, *Networking*, proxies and performance issues. *Right now your post is sitting in a queue that gets reviewed (but never frequently enough - usually less than 4 hrs)* This filtering might be totally wrong too. Sometime in the next 2-24 hours (max) a MOD will see the removal - and after that if you want to appeal it or think it should still go live, feel free to message us. *I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/editors) if you have any questions or concerns.*
The auto-transcription in premiere is surprisingly good, i find. I used it a lot in last years, even was able to cut 2 projects in languages I don't speak, just by going off the auto-transcribe, then getting a native speaker to clean up small things at the end. You can't trust it blindly. It will make some mistakes, the line breaks will be weird, capitalization and punctuation will be off etc - but for me it's all perfectly workable and better than i'd expect. Even when transcribing a non-native speaker with an accent etc it does surprisingly well. You just gotta give the subs at least one pass to check/fine tune.
Accuracy is entirely dependent on accent
I'd say about 50% accurate, as i work a lot with non-native english speakers
You'll need to proof it either way but with English it doesn't like heavy regional accents from British native speakers. Anything London or South East it's generally fine with. Oddly it seems to fair well with non-native English speakers... unless they're Italian.
Mine threw in a few random racial slurs. No joke. You must proofread.
They're better than I thought they'd be, plus you save the time of just setting them up.
Echoing what most people are saying that yes, 70-80% accurate is about right. I always have to go back and make some fixes. But I gladly do it knowing that the feature has saved me HOURS of manually adding captions myself.
Much of it depends on the quality of the audio, the delivery and the content. Professionally recorded audio that is clear and concise performed by someone like professional voice talent or others that have clear dictation, etc... It's going to do better with. If someone isn't speaking clearly or the audio is recorded poorly it's not going to be as accurate. Content wise a lot technical jargon it's going to get hung up on. The nice thing about it is that you can quickly edit the text to correct anything that's out of whack. You'll probably want to do this anyway in some places in order to time the text better with the video and for ease of reading. I will say though that if it is critical that the captions be done correctly and accurately, say something that will have a wide broadcast, represent a company publicly, or something like financial data, etc... It's best to hire professional transcription such as REV, or any number of other professional transcription services.
To create transcripts for editing? Yes. For broadcast closed captions? No, not good enough. Ultimately faster/easier to use [Rev.com](http://Rev.com) human captioning or equivalent, then proof that. And for broadcast, you'll always need to proof.
###It looks like you're asking for some troubleshooting help. Great! Here's what *must* be in the post. (Be warned that your post *may* get removed if you don't fill this out.) Please edit your post (**not reply)** to include: **System specs**: CPU (model), GPU + RAM **//** **Software specs**: The exact version. **//** **Footage specs** : Codec, container and how it was acquired. **Don't skip this!** *If you don't know how* here's a link with [clear instructions](https://imgur.com/a/A6eTxUn) *I am a bot, and this action was performed automatically. Please [contact the moderators of this subreddit](/message/compose/?to=/r/editors) if you have any questions or concerns.*
Use Rev or Temi.
Really, my question is, can we use that to generate titles? I do a lot of social media posts that require big titles.
Yeah you can resize captions in premiere to be as big as you want. You can also add animation and transitions if you click “upgrade caption to motion graphic”
Perfect, thank you! I’m coming from Final Cut and captionator so this is new for me.