Community Manager

Question

Enhance Speech v2 is here!

Forum|Forum|1 year ago
November 20, 2024
26 replies
14568 views

We’re thrilled to introduce Adobe Podcast Enhance Speech v2 to our community. Our team has been hard at work to bring you a tool that transforms your audio and video recordings into studio-quality sound with just a few clicks.

Here are the v2 highlights:

Natural clarity: Clear, natural dialogue without robotic tones—no matter the language or recording conditions. It handles everything from noisy crowds to poor acoustics and delivers studio-quality audio.

Strength slider: Powered by our Sound Lift AI model, the updated slider allows you to balance speech clarity with ambient noise. Want a bit of café ambiance in your podcast? Adjust the slider to find the perfect mix.

Noise removal: Background noise, reverb, and music—v2 removes it all, keeping your message sharp and clear.

Clarity boost: Struggling with quiet or distant audio? V2 improves clarity in soft recordings without amplifying unwanted noise.

Try out Enhance Speech v2 here:

podcast.adobe.com/enhance

We’d love to hear your feedback and suggestions. Your thoughts are super valuable as we continue to improve and innovate.

We’re excited to see how you use v2 to elevate your audio storytelling!

Enhance Speech

Show previous replies

N

NAF22

Participating Frequently

This version is terrible! It filters out things it shouldn't, like laughter, and the percentage slider is totally useless now.

CGBESSELLIEU

Inspiring

Unfortunately, can confirm it does filter out laughter.
And with the broken slider can't test different percentages on that.

I recognize its likely difficult to design these models to differentiate between noise and laughter.

Henrik Heigl

Community Expert

Hi,

I tested it with the following Test Scenario: I assume I am at a concert as a young journalist and want to interview someone. In the background the Band on the stage plays and its loud, much noise from the crowd, other voices in the background, etc. I use a smartphone as the recording device (in my case an iphone). To simulate that scenario I searched on YouTube a Live concert, play it over some Bluetooth speaker or something similar and use the smartphone as a microphone to record that staged/fictive interview with myself 😉

After that recording i simply upload that file into the enhance speech.

Results:

The Tools makes a pretty good job! V2 is definitely WAY better, no more "robot voice", etc. if it comes to making the voice better, but its not yet there. Especially if the background noise changes within the recording also the Voice-color changes and thats a bit odd. if the background is much louder then the own voice or if there is recognisable second voice that is talking in the background and you make a pause then also the background voice is recognized as the own voice.

Conclusion:

I would wish for some more "control knobs" e.g. enhance speech, background overlay intensity, voice color (more/less radio voice), more/less compression, more/less AI analysis, etc.

Also, separation of voices (Voice1, Voice2, background music, etc.) so that those can be edited later in the Adobe Podcast Studio separately. A Button "edit with Audition" and "edit with Studio (beta) is also needed.
Hope that helps. Also it maybe possible not only to download as wave, but also as mp3 or other formats?
With those additions I could imagine also some kind of "Enhance Speech VST" as Plugin for Adobe Audition or other DAW's (just an idea).

regards,Henrik

CGBESSELLIEU

Inspiring

Agreed, control knobs for various key parameters would make the tool more robust!

A

Arsen_Yasu7211

Participating Frequently

The sound in v2 does not allow me to do my job, it just kills my YouTube channel, I am depressed and standing still because I am afraid that people subscribed to me and watched the video only because of the voice, which sounds extremely terrible without processing, please i want v1 back...

J

jay_scott2337

Participant

absolutely

A

Arsen_Yasu7211

Participating Frequently

PLEASE LET ME USE V1 BACK

E

erics.simmons.401

Participating Frequently

Adobe People... Enhance Speech v1 saved my butt on an expensive work project. For that, I will forever be grateful... I was hired to shoot a guided tour in a live compounding pharmacy, and the background industrial noise (that I couldn't control) was so bad that I honestly thought my career was over. Enhance Speech v1 cleaned the audio subtly to the point that it was usable. Nobody even knew what I'd done. Wonderful tech. What I'm hearing in v2 is so alien, obvious, and artificial that I could never utilize it. Yes, it's clean, but It no longer sounds like the person who was speaking. This is analogous to an AI photography sharpening tool that doesn't actually sharpen a person's portrait; it replaces their face with a totally new face and, therefore, becomes unusable. Please make v1 available again for those who want it. Thank you.

E

erics.simmons.401

Participating Frequently

Wait... I was freaking out, but now I see that v1 is still accessible. Please keep it that way. THANKS!

A

Arsen_Yasu7211

Participating Frequently

How did you switch

L

Lpi_Ral8804

Participant

Even so, I think it would be better if there was an option to choose between V1 or V2 because not everyone necessarily likes the audio produced by V2. For example, in my case, I personally prefer V1 and feel more comfortable with the sound it produces. So why isn’t there an option to choose between V1 or V2? Why are we forced to use V2?

A

Arsen_Yasu7211

Participating Frequently

++++

Sign up

To post, reply, or follow discussions, please sign in with your Adobe ID.

Sign in to Adobe Community

To post, reply, or follow discussions, please sign in with your Adobe ID.

Scanning file for viruses.

This file cannot be downloaded