Copy link to clipboard
Copied
Sorry, this is a bit of a rant... but hopefully, I am expressing what others are thinking.
Remember when Adobe demonstrated an advanced Audition version with the ability to edit a recording by directly manipulating the text, both deleting text and adding new text synthesized in the same voice? It was really cool, and Adobe said they were going to release the product soon. What was that? Ten years ago?
Now that I am very well versed in Generative AI tech and work in this field, I understand that what Adobe demonstrated was entirely scripted and fake because the technology to do exactly what they demonstrated simply did not exist.
But I am off-topic. It's now 2024, and Audition (even the Beta version) still doesn't support speech-to-text, and even the text-to-speech generator is 1990s SAPI technology! No one uses SAPI text-to-speech any longer!
Some might say Premiere offers speech-to-text. It sure does, and it is ridiculously bad! Have you compared it to even the oldest iteration of 2023's Whisper AI? You will never use Premiere again!
Every day, I find new reasons to dump this costly Adobe platform. I want Adobe to be much better because I have invested many years in becoming an "expert" in the products. Unfortunately, Adobe continues to disappoint.
Copy link to clipboard
Copied
Remember when Adobe demonstrated an advanced Audition version with the ability to edit a recording by directly manipulating the text, both deleting text and adding new text synthesized in the same voice? It was really cool, and Adobe said they were going to release the product soon. What was that? Ten years ago?
Adobe didn't ever release anything like text to speech because their legal team - quite correctly - forbade them from doing so. The idea of putting words that they didn't speak into somebody's mouth looked, and still does, like a legal minefield.
And let's face it, they clearly don't like speech to text very much either. That said, I've never seen any system that I'd actually call good....
Copy link to clipboard
Copied
Hi SteveG... I mean no disrespect but it is almost as if you are still living in 2019. 🙂
I whole lot has changed from that time... to start, like I said, the technology to do what they demonstrated in 2016 *absolutely did not exist* at that time. It's 2024 and anyone working in the field of LLMs and machine learning (such as myself) now understand exactly what is required in order to clone voices with that kind of accuracy they demonstrated. Adobe lied! Whatever! It's no big deal! Many companies do the same! But they did lie! Also, the legal issues were their second lie to cover up the fact that they had no product to sell.
Consider this, if Adobe couldn't release a voice editing product, why are there over 25 Generative AI companies selling voice cloning products now? Even the Apple iPhone can clone anyone's voice after some training. Others will close with only 5 minutes of good prerecorded speech. Again, the legal argument was nonsense.
"I've never seen any system that I'd actually call good...."
With a response like that, I don't think this is a topic that interests you. Otherwise, you'd be well aware of dozens of products that can now synthesize voice that is indistinguishable from human voice, including mimicking realistic cadence and emotions. I used to spend $$$ on voice actors on already limited budget.
Anyhow, regardless of it all, there is still no excuse that Adobe still gives us 2009 tech SAPI voices for TTS, considering the money this company takes in through subscriptions.
Take care.
Copy link to clipboard
Copied
Well, you're entitled to your view, however morally short-sighted it is.
Copy link to clipboard
Copied