FAQs

Let’s start with the questions that seem to be on everyone’s mind. If you don’t see the answer you seek, please fill out the “Submit a help request” form below. For non-technical questions, please Contact Us.

Technical

Where can I learn more about the API?

Right here! Please let us know if you still have questions after reading the API datasheet. You can submit your question by filling out the support form below.

How many calls does the API require?

One call is required to get an authentication token, which is valid for a time interval. After obtaining the authentication token, only one call is required to enhance audio.
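
As a rough sketch of this two-call pattern, the token can be cached and reused until its validity interval runs out, so that each enhancement costs a single call. Everything named here is hypothetical: `TokenCache`, `enhance`, the `fetch_token` and `send_enhance_request` callables, and the 60-second lifetime in the usage note are illustrative stand-ins, not part of the BabbleLabs API.

```python
import time


class TokenCache:
    """Reuse an authentication token until its validity interval expires."""

    def __init__(self, fetch_token, lifetime_seconds, clock=time.monotonic):
        # fetch_token is a hypothetical callable that performs the token call.
        self._fetch_token = fetch_token
        self._lifetime = lifetime_seconds
        self._clock = clock
        self._token = None
        self._expires_at = 0.0

    def get(self):
        now = self._clock()
        if self._token is None or now >= self._expires_at:
            # First use, or the token has expired: make the token call again.
            self._token = self._fetch_token()
            self._expires_at = now + self._lifetime
        return self._token


def enhance(audio_bytes, tokens, send_enhance_request):
    """One enhancement call; send_enhance_request is a hypothetical callable."""
    return send_enhance_request(tokens.get(), audio_bytes)
```

With a cache built as `TokenCache(fetch_token, 60)`, repeated calls to `enhance` within 60 seconds share one token request.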

Do you have a specification sheet?

Yes! Please contact us.

What happens to the audio/video file I send you?

We apply a pre-processing algorithm to regularize the received signal. Next, the signal passes through a neural network (NN) subsystem that isolates and reconstructs the speech signal. Finally, we use the NN output to derive a statistical model of the speech, which we use to suppress noise in the received audio stream. This yields natural speech with minimal reverberation, mixed with a controlled level of background noise to preserve the naturalness of the audio file. We also keep the speech signal close to its level in the received file, so that you retain full control over any level adjustment you might want to apply: the average volume of the speech signal is scaled to 0.9x the original value to avoid saturation. Because noise is removed from the stream, the perceived volume can be noticeably lower for signals with a lot of noise, and you might wish to normalize the volume to compensate.
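
If you do want to normalize the returned audio, one simple option is peak normalization. The sketch below assumes the samples have already been decoded to floats in the range -1.0 to 1.0. The function name and the 0.9 target peak are illustrative choices, not something Clear Cloud applies for you.

```python
def normalize_peak(samples, target_peak=0.9):
    """Scale samples so the loudest one reaches target_peak (assumed value)."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        # Silence or empty input: nothing to scale.
        return list(samples)
    gain = target_peak / peak
    return [s * gain for s in samples]
```

Peak normalization only guarantees headroom. If you care about perceived loudness, a loudness-based method may suit you better.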

What are the advantages of using deep neural networks for noise reduction?

BabbleLabs applies deep neural networks (sophisticated mathematical models trained to perform a complex task) to mimic specific human cognitive skills. Distinguishing speech from background sounds is remarkably difficult, because the interfering sounds often occur at the same time and in the same frequency ranges as the speech, and often fluctuate just as rapidly.

All traditional methods try to identify statistical characteristics of speech vs. noise, and use these statistics to suppress the noise while preserving the speech. This approach can work for stationary (non-varying) noise, but it cannot handle transient noise well. Just as humans learn to extract the thread of speech from background sounds, including interfering speech, neural networks are trained on extensive sequences of real human speech and real noise. These algorithms learn to isolate and regenerate the human speech without the noise by using a longer context of past speech.

BabbleLabs has gathered a unique database of noisy speech and applies hundreds of thousands of hours of natural speech in training its production networks, enabling accurate separation of speech from noise — across different languages, speaking styles, vocabulary, noise types and noise intensities.

Do you support stereo?

Yes! Our algorithms work on each channel individually. For stereo inputs, you tell us how many output channels you want, and we will either process each channel individually or convert the data to mono. Processing each channel will provide a richer experience, but you will be charged separately for each channel. Channels converted to mono will be charged the rate for a single channel. Streams with more than two channels will always be converted to at most two channels.
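
The channel rules above can be sketched as a small helper that estimates how many channels would be processed and billed. The function names are hypothetical, and the handling of mono input (never billed for more channels than the file contains) is an assumption rather than documented behavior.

```python
def billable_channels(input_channels, requested_output_channels):
    """Channels processed and billed, capped at two as described above."""
    if input_channels < 1 or requested_output_channels < 1:
        raise ValueError("channel counts must be at least 1")
    # Assumption: a stream is never billed for more channels than it has.
    return min(input_channels, requested_output_channels, 2)


def billable_channel_minutes(duration_minutes, input_channels, requested_output_channels):
    """Minutes charged across all processed channels."""
    return duration_minutes * billable_channels(input_channels, requested_output_channels)
```

For example, a 10-minute stereo file processed per channel would count as 20 channel-minutes, and as 10 if converted to mono.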

Which sample rates do you support?

Our algorithms work natively at 16,000 Hz. Other sample rates will be down- or up-sampled to 16,000 Hz.
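
To illustrate what resampling to 16,000 Hz means, here is a minimal linear-interpolation sketch. It is not the resampler Clear Cloud uses. Production resamplers also apply a low-pass filter when down-sampling to avoid aliasing, which this example omits. The function name is hypothetical.

```python
def resample_linear(samples, src_rate, dst_rate=16000):
    """Resample a mono signal by linear interpolation (no anti-alias filter)."""
    if not samples:
        return []
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate           # position in the source signal
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)      # clamp at the last sample
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out
```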

Which formats do you support?

Video: .mp4, .mov

Audio: .wav, .mp3, .m4a, .aac, .ogg, .aiff

Do you support streaming?

Yes. You can either:

  1. Submit a multi-part form encoded request suitable for posting an audio file, or
  2. Feed any supported audio stream format directly to our streaming endpoint.
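
For the first option, a multi-part form body can be assembled with the Python standard library alone. This is a sketch of the generic multipart/form-data encoding, not of the Clear Cloud endpoint. The field name `file`, the file name, and the content type are assumptions, so check the API datasheet for the values the service expects.

```python
import uuid


def build_multipart(field_name, filename, payload, content_type="application/octet-stream"):
    """Return (headers, body) for a single-file multipart/form-data request."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    return headers, head + payload + tail


# Hypothetical usage: the field name "file" is an assumption.
headers, body = build_multipart("file", "clip.wav", b"RIFF", "audio/wav")
```
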
What is the typical length of audio or video file that I can process?

We should be able to handle any size file, but we are expecting users to upload movie-length or less (i.e., under 2 hours). Typical uses include 30-60 minute interviews or presentations, short documentaries, and clips designed for social media sharing or YouTube.

What’s the best way to compare my original audio/video material (what I submitted) against the enhanced file that Clear Cloud returns?

Listen to it! Then send us your feedback. We would love to hear what you have to say. Soon, BabbleLabs will also share metrics and a comparative analysis of the input and output streams, to give you a better idea of how we did, both objectively and subjectively.

Where can I find real-world samples of audio/video enhanced by BabbleLabs Clear Cloud?

Our users have been busy conquering unwanted noise. Check out Gabby’s Lab to see what Clear Cloud can do for you in real-world environments and use cases.

App

Do you support Android or iOS?

Both! The Apple version is supported on iOS 11 and higher, and is a universal app. The Android app is supported on OS 5 and higher. We welcome your feedback; please contact us.

Do I need to register to use the app?

You don’t have to register to get started. You can use up to $5 worth of processing time (equivalent to 250 minutes of audio or 125 minutes of video) before we require you to register.
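
The figures above imply per-minute rates of 2 cents for audio and 4 cents for video. The sketch below works through that arithmetic and estimates how much free credit is left after mixed use. It assumes that audio and video draw on the same $5 credit and that usage is counted in whole minutes. The names are illustrative.

```python
FREE_CREDIT_CENTS = 500                          # $5 of free processing
AUDIO_CENTS_PER_MIN = FREE_CREDIT_CENTS // 250   # 2 cents, implied by 250 audio minutes
VIDEO_CENTS_PER_MIN = FREE_CREDIT_CENTS // 125   # 4 cents, implied by 125 video minutes


def free_credit_left_cents(audio_minutes, video_minutes):
    """Remaining free credit in cents, never below zero."""
    used = audio_minutes * AUDIO_CENTS_PER_MIN + video_minutes * VIDEO_CENTS_PER_MIN
    return max(0, FREE_CREDIT_CENTS - used)
```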

Can I use the app with existing video?

You can shoot audio/video and enhance it immediately through the app. You can also enhance existing files stored on your phone. On an iPhone, you can enhance any video stored in your Photos app. On Android, you can enhance video files stored in various places on your phone, including Camera, Google Gallery, and Instagram; the enhanced files will be stored in the app and album you uploaded from. For both iOS and Android, you will find enhanced files in the Clear Cloud folder. On iPhones, this folder lives in the Photos app. On Android phones, it lives in the Gallery app.

Can I pay for processing time via the Clear Cloud App?

For the iPhone app, once you have used up the free processing minutes provided (250 minutes of audio or 125 minutes of video), you can add $4.99 to your app account to continue enhancing your audio and video files. You can also register on our site to be directly billed by BabbleLabs for processing time. For Android users, once you add your billing information on our web site, there is no need to do anything else. After you have signed in to our site, you can find the billing page in the menu under your user name.

How do I upload audio-only files from my phone?

For iPhone users, you can export audio from other apps into Clear Cloud. Enhanced audio files are stored in an audio album managed by the Clear Cloud App.

For Android users, you can only create and enhance audio-only files within the Clear Cloud App. The enhanced audio files are accessible in the Audio portion of the Clear Cloud App, where they can be easily located, named and managed.

Why can’t I hear any sound with my video in the Clear Cloud App for iPhone?

On your iPhone, make sure that the Ring/Silent switch is in the “on” position and the volume is turned up. Check to see if anything is covering the microphone/speaker area on the bottom of the iPhone.

What if I have privacy concerns about allowing the app to access my video and audio files?

Both versions of the app (iOS and Android) keep your audio and video files completely private. Even though the app itself gets access to your existing audio and video clips, it never stores anything in the cloud or shares anything with BabbleLabs.

How do I get help with the Clear Cloud App?

In the app, tap the “hamburger menu” and select Support. Enter your question. We’ll respond to you via the email address you provide. If you prefer, you can submit a help request using the form below.

Pricing

Can I limit my monthly use?

Yes. Once you have signed in on our website, go to the user name drop-down menu, choose "Billing" and set your monthly billing limit. (BabbleLabs has set a default limit of $2,000 per month.)

Can I get charged more than my monthly limit?

Yes. We charge for each audio or video stream after the entire stream has been enhanced. This means that if you are near your monthly limit and upload a stream that exceeds the time remaining, you will be charged for the overage (standard per-minute pricing applies). For example, if you have 5 minutes remaining until you reach your limit and then upload a 25-minute video file, you will be charged 80 cents for the 20 minutes above your limit (20 minutes x 4 cents per minute for video processing).
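
The worked example above can be reproduced with a short calculation. The function name is hypothetical. The default of 4 cents per minute is the video rate quoted in the example.

```python
def overage_charge_cents(minutes_remaining, stream_minutes, cents_per_minute=4):
    """Charge for the part of a stream that runs past the monthly limit."""
    minutes_over = max(0, stream_minutes - minutes_remaining)
    return minutes_over * cents_per_minute


# 5 minutes left, 25-minute video: 20 minutes over at 4 cents each.
charge = overage_charge_cents(5, 25)
```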

How do I know how much I have used against my monthly limit?

Once you are registered and signed in to the BabbleLabs web site, you can click “Usage Statistics” in the upper right corner of the home page to see what portion of your monthly limit has been used.

Can I pay with Bitcoin, Ethereum or other crypto currencies?

If you can figure out a way to pay that results in US dollars, sure!

Privacy and Security

How can I be sure that you are protecting the confidentiality of the audio/video material I submit?

All data is transmitted to and from our servers encrypted using https. We do not store your audio/video material.

Will BabbleLabs listen to my audio or video for the purposes of advertising to me?

No! BabbleLabs wants to enhance, improve and personalize your audio and video streams, not use your streaming data to market to you.

How do I remove / delete my account?

Contact us with your request.

Partners and OEMs

Who do I contact for embedded applications?

We know a lot about embedding Clear Cloud to give your device a differentiated video or audio result! We’d love to hear about your project — please contact us.

Are you looking for partners?

Yes! We’re looking for partners in speech processing and deep learning (cloud or embedded) as well as experts in speech metrics and researchers from university programs in related fields.

We are working toward establishing an active customer community. Likewise, we want to build and foster a community of enthusiastic experts, developers, end-users, and innovators. Interested? Send us your contact info and the area or manner in which you would like to partner.

Are you looking for investors?

We announced a successful round of funding in January 2018. This Series Seed investment of $4 million led by Cognite Ventures is being used for initial development and productization. We are always interested in hearing from our colleagues; we know the technology space we’re in is exceptionally active, with a growing focus on voice interfaces and deep learning.

Careers

Are you looking for interns?

Yes, we are looking for interns! We don’t yet have a formal internship program. Take a look at our Careers information. If you believe you have directly relevant skills — and you have a passion for speech enhancement, speech-centric technology, and deep learning — contact us.

Media and Public Relations

I want to write an article/blog about BabbleLabs. Who should I contact?

We welcome your help in spreading the word about the exciting developments here at BabbleLabs. Please send us an inquiry with a brief abstract through our contact page.

Submit a help request