Harmony J
Community Manager
November 20, 2024
Question

Enhance Speech v2 is here!

  • November 20, 2024
  • 26 replies
  • 13866 views

We’re thrilled to introduce Adobe Podcast Enhance Speech v2 to our community. Our team has been hard at work to bring you a tool that transforms your audio and video recordings into studio-quality sound with just a few clicks.

 

Here are the v2 highlights:

 

Natural clarity: Clear, natural dialogue without robotic tones—no matter the language or recording conditions. It handles everything from noisy crowds to poor acoustics and delivers studio-quality audio.

 

Strength slider: Powered by our Sound Lift AI model, the updated slider allows you to balance speech clarity with ambient noise. Want a bit of café ambiance in your podcast? Adjust the slider to find the perfect mix.

 

Noise removal: Background noise, reverb, and music—v2 removes it all, keeping your message sharp and clear.

 

Clarity boost: Struggling with quiet or distant audio? V2 improves clarity in soft recordings without amplifying unwanted noise.

 

Try out Enhance Speech v2 here:

podcast.adobe.com/enhance

 

We’d love to hear your feedback and suggestions. Your thoughts are super valuable as we continue to improve and innovate.

 

We’re excited to see how you use v2 to elevate your audio storytelling!

26 replies

Participating Frequently
November 25, 2024

This version is terrible! It filters out things it shouldn't, like laughter, and the percentage slider is totally useless now. 

CGBESSELLIEU
Inspiring
November 29, 2024

Unfortunately, I can confirm it does filter out laughter.
And with the broken slider, I can't test different percentages on that.

I recognize it's likely difficult to design these models to differentiate between noise and laughter.

Henrik Heigl
Community Expert
November 23, 2024

Hi,

I tested it with the following scenario: I am a young journalist at a concert and want to interview someone. In the background, the band on stage is playing loudly, with a lot of crowd noise, other voices, etc. I use a smartphone (in my case an iPhone) as the recording device. To simulate that scenario, I found a live concert on YouTube, played it over a Bluetooth speaker or something similar, and used the smartphone as a microphone to record a staged interview with myself 😉

After recording, I simply uploaded the file to Enhance Speech.

Results:

The tool does a pretty good job! V2 is definitely WAY better at improving the voice, with no more "robot voice", but it's not quite there yet. In particular, when the background noise changes within the recording, the voice color changes too, which is a bit odd. If the background is much louder than my own voice, or if a recognizable second voice is talking in the background while I pause, the background voice is treated as my own voice.

Conclusion:

I would like some more "control knobs", e.g. enhance speech, background overlay intensity, voice color (more/less radio voice), more/less compression, more/less AI analysis, etc.

Also, separation of voices (Voice1, Voice2, background music, etc.) so that they can be edited separately later in Adobe Podcast Studio. "Edit with Audition" and "Edit with Studio (beta)" buttons are also needed.
Hope that helps. Also, might it be possible to download not only as WAV, but also as MP3 or other formats?
With those additions, I could also imagine some kind of "Enhance Speech VST" plugin for Adobe Audition or other DAWs (just an idea).

Regards, Henrik

CGBESSELLIEU
Inspiring
November 29, 2024

Agreed, control knobs for various key parameters would make the tool more robust!

Participating Frequently
November 22, 2024

The sound in v2 does not allow me to do my job. It is killing my YouTube channel. I am depressed and at a standstill because I am afraid that people subscribed and watched my videos only because of the voice, which sounds extremely terrible without processing. Please, I want v1 back...

Participant
December 1, 2024

absolutely

Participating Frequently
November 22, 2024

PLEASE LET ME USE V1 AGAIN

Participating Frequently
November 22, 2024

Adobe People... Enhance Speech v1 saved my butt on an expensive work project. For that, I will forever be grateful... I was hired to shoot a guided tour in a live compounding pharmacy, and the background industrial noise (that I couldn't control) was so bad that I honestly thought my career was over. Enhance Speech v1 cleaned the audio subtly to the point that it was usable. Nobody even knew what I'd done. Wonderful tech. What I'm hearing in v2 is so alien, obvious, and artificial that I could never use it. Yes, it's clean, but it no longer sounds like the person who was speaking. This is analogous to an AI photography sharpening tool that doesn't actually sharpen a person's portrait; it replaces their face with a totally new face and, therefore, becomes unusable. Please make v1 available again for those who want it. Thank you.

Participating Frequently
November 22, 2024

Wait... I was freaking out, but now I see that v1 is still accessible. Please keep it that way. THANKS!

Participating Frequently
November 22, 2024

How did you switch?

Participant
November 20, 2024

Even so, I think it would be better if there was an option to choose between V1 or V2 because not everyone necessarily likes the audio produced by V2. For example, in my case, I personally prefer V1 and feel more comfortable with the sound it produces. So why isn’t there an option to choose between V1 or V2? Why are we forced to use V2?

Participating Frequently
November 22, 2024

++++