Audio Transcription API

Transcribe audio clips, videos, and more via API.

Use Cases

Our combination of humans and machine learning is built to solve a wide range of use cases. Our goal is to power innovation, wherever that may be.

Accurate, Incredible Transcription

Powered by a combination of trained humans and machine learning, we provide 99% accuracy through our dead-simple API.

Humans and Machines

We use a combination of humans, machine learning, and audio signal processing to deliver higher quality than any solution by itself. Humans are even smarter when aided by machines.

99% Accuracy

With our combination of skilled humans, machine learning, and sophisticated reviewing technology, we offer best-in-class accuracy. We pride ourselves in our ability to deliver quality.

Elegant Automation

Unlike solutions that require you to upload audio through a website, we provide automation through an easy-to-use API. It's never been simpler to build human-in-the-loop apps.

We Are Scalable

We are built to handle millions of requests per month. Scale is powered by our distributed team of hard-working Scalers augmented with machine learning technology.

Our effective screening, training, and routing software help ensure that we deliver only the highest quality at scale.


Developer First

We relentlessly focus on providing the best experience possible for developers.

We do all the heavy lifting in the background so you can get started with just a few lines of code.

Explore our docs
  attachment_type: 'audio',
  attachment: '',
  verbatim: false,
  callback_url: "",
  phrases: ['hungry', 'foolish']
}, (err, task) => {
  // do something with task

Scale is trusted by world-class companies.

The world's best and most innovative companies use Scale to power large-scale human operations. We consistently impress with our incredible quality and cost effectiveness.


You only pay for what you use. There are no additional costs or termination fees.

Pay as you go!

Use Case1 Day Response1 Week Response
Audio Transcription$0.500 / minute$0.400 / minute
Video Transcription$0.500 / minute$0.400 / minute
Video Captioning$0.500 / minute$0.400 / minute

Completion times are best-effort for non-enterprise users. If you have more than 1000 requests per month or require a task completion time SLA, chat with us about custom pricing!

Interested in this task type? Contact Sales to learn more about its availability


Custom annual plans for high volume.

  • Upfront and Volume Discounts
  • Enterprise-grade SLAs
  • 24/7 Development Support
  • Dedicated Account Managers