BabbleLabs Cloud API Speech Enhancement Technology
Clear Cloud enables automatic, high quality speech enhancement and best-in-class general noise reduction for use in audio/video production, speech-driven projects, and voice-sensitive applications. Clear Cloud enhances speech and reduces background noises, such as traffic, crowds and audio artifacts.
BabbleLabs cofounder Raul is on the street in Uruguay! Listen to see what Clear Cloud does with all that Montevideo traffic noise. For more examples, check out Gabby's Lab to see how users around the world are conquering unwanted noise.
Use the Clear Cloud API today and start using automated, single line speech enhancement to take your production and development projects to new levels of clarity, accuracy, and quality. Focus your audience on your content instead of garbled speech and distracting sounds.
Download the Clear Cloud App for Android or iOS today and start exploring how easy it is to capture your on-the-street audio and video and immediately enhance it to hear the Clear Cloud difference. Focus your audience on your content instead of garbled speech and distracting sounds.
Use the Clear Cloud Web interface today and start using automated, single line speech enhancement to take your production and development projects to new levels of clarity, accuracy, and quality.
What happens to the audio/video file I send you?
We apply a pre-processing algorithm to regularize the received signal. Next it gets passed through a neural network (NN) subsystem to isolate and reconstruct the speech signal. Finally, we take the NN output to derive a statistical model of the speech that we use to suppress any noise in the received audio stream.
This yields natural speech with minimal reverberation, mixed with a controlled level of the background noise to preserve the naturalness of the audio file. Also, we preserve the speech signal at the same level it was in the received file, to give you full control over any level adjustment that you might want to apply. The average volume of the speech signal is scaled to 0.9x the original value to avoid saturation. Since noise is removed from the stream, the perceived volume can be meaningfully lower for signals with a lot of noise, and you might wish to normalize the volume to compensate.
Do you support stereo?
Yes! Our algorithms work on each channel individually. You send us the number of output channels for stereo inputs, and we will either process each channel individually or convert the data to mono.
Processing each channel will provide a richer experience, but you will be charged separately for each channel. Channels converted to mono will be charged the rate for a single channel. Streams with more than two channels will always be converted to at most two channels.