Asynchronous Speech-To-Text API Documentation

Rev.ai provides quality speech-to-text recognition via a RESTful API. All public methods and objects are documented here for developer reference. For a real-time speech-to-text solution, use Rev.ai's streaming API instead.

Base Endpoint

The base URL for this version of the API is

https://api.rev.ai/speechtotext/v1

All endpoints described in this documentation are relative to this base URL.

Quick Start

Follow the steps below to submit a file and retrieve its transcript.

Get your Access Token

You can generate your access token from your Rev.ai account. The access token only needs to be generated once and never expires. You can regenerate your token, but doing so invalidates the previous token.

Submit a File

To submit an audio file for transcription to Rev.ai:

curl -X POST "https://api.rev.ai/speechtotext/v1/jobs" -H "Authorization: Bearer $REV_ACCESS_TOKEN" -H "Content-Type: application/json" -d "{\"media_url\":\"https://www.rev.ai/FTC_Sample_1.mp3\",\"metadata\":\"This is a sample submit jobs option\"}"

The response includes the id of the new job (in this case Umx5c6F7pH7r), which will allow you to retrieve your transcript.
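The curl call above can also be issued from code. The following is a minimal sketch using only the Python standard library; the function names are illustrative, and the assumption that the response is a JSON object with a top-level id field is not confirmed by this page.

```python
import json
import urllib.request

BASE_URL = "https://api.rev.ai/speechtotext/v1"


def build_submit_request(access_token, media_url, metadata=None):
    """Build (but do not send) the POST /jobs request from the curl example."""
    payload = {"media_url": media_url}
    if metadata is not None:
        payload["metadata"] = metadata
    return urllib.request.Request(
        f"{BASE_URL}/jobs",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit_job(access_token, media_url, metadata=None):
    """Send the request and return the id of the newly created job."""
    request = build_submit_request(access_token, media_url, metadata)
    with urllib.request.urlopen(request) as response:
        job = json.load(response)
    # Assumption: the response body is a JSON object with a top-level "id".
    return job["id"]
```

Keeping request construction separate from sending makes it possible to inspect the URL, headers, and body without touching the network.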

Get Your Transcript

Once a transcription job's status becomes transcribed, you can retrieve the transcript in JSON format, which is the default output.

Alternatively, you can request the plain text version by specifying it in the Accept header.

You can poll for the status of your job by querying for the job periodically.

Note: Polling is NOT recommended in a production server. Rather, use webhooks to asynchronously receive notifications once the transcription job completes.
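For local experiments, the polling approach described above might look like the sketch below. The helper names, the polling interval, and the paths /jobs/{id} and /jobs/{id}/transcript are assumptions made for illustration; only the status values transcribed, failed, and in_progress appear on this page.

```python
import json
import time
import urllib.request

BASE_URL = "https://api.rev.ai/speechtotext/v1"


def api_get(access_token, path, accept="application/json"):
    """GET a path relative to the base URL and return the raw response body."""
    request = urllib.request.Request(
        f"{BASE_URL}{path}",
        headers={"Authorization": f"Bearer {access_token}", "Accept": accept},
    )
    with urllib.request.urlopen(request) as response:
        return response.read()


def wait_for_job(get_job, job_id, interval_seconds=30, max_checks=60, sleep=time.sleep):
    """Poll until the job is transcribed or failed; return the final job dict."""
    for _ in range(max_checks):
        job = get_job(job_id)
        if job["status"] in ("transcribed", "failed"):
            return job
        sleep(interval_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_checks} checks")


# Hypothetical usage (paths and Accept value are assumptions):
#   get_job = lambda job_id: json.loads(api_get(token, f"/jobs/{job_id}"))
#   job = wait_for_job(get_job, "Umx5c6F7pH7r")
#   if job["status"] == "transcribed":
#       text = api_get(token, f"/jobs/{job['id']}/transcript", accept="text/plain")
```

The job lookup is passed in as a callable so the loop can be exercised without network access, and max_checks bounds the wait so a stuck job cannot block forever.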

If you have any further questions, contact Rev.ai support.

Submitting Files

Two POST request formats can be used to submit a file: application/json or multipart/form-data.

JSON

This is the preferred method of file submission. It uses the media_url property to give the Rev.ai server a direct download URL. This method supports pre-signed URLs. Links to videos hosted on platforms like YouTube are not valid because they are not direct download links.

Important note on pre-signed URLs: Signed URLs usually have a configurable expiration time. To ensure the Rev.ai server can access the link, set the expiration time to 2 hours or more. If you plan to resend the same file, generate a new pre-signed URL.

FormData

Used to send a local file to the Rev.ai server. This allows the customer to send the file directly from the host machine. Certain limits apply to this format; see the Async API Limits section for more details.

Turnaround Time and Chunking

Often, especially for shorter files, your transcript will be ready in 5 minutes or less. Longer audio files generally take no longer than 15 minutes. If you require a faster turnaround time, contact Rev.ai support.

Chunking is the act of breaking audio files into smaller segments. Rev.ai uses this method to decrease the turnaround time of audio files longer than 3 minutes.

Webhooks

If the optional callback_url is provided, the API will make an HTTP POST request to the callback_url with the following request body when the job either completes successfully or fails.

Sample Webhook

On Successful Transcription Job

{
  "job": {
    "id": "Umx5c6F7pH7r",
    "status": "transcribed",
    "created_on": "2018-05-05T23:23:22.29Z",
    "callback_url": "https://www.example.com/callback",
    "duration_seconds": 356.24,
    "media_url": "https://www.rev.ai/FTC_Sample_1.mp3"
  }
}

On Failed Transcription Job

{
  "job": {
    "id": "Umx5c6F7pH7r",
    "status": "failed",
    "created_on": "2018-05-05T23:23:22.29Z",
    "callback_url": "https://www.example.com/callback",
    "failure": "download_failure",
    "failure_detail": "Failed to download media file. Please check your url and file type"
  }
}

Important notes for using webhooks:

The API will make a POST request, not a GET request, to the callback_url. The request body is the job details.
You can unsubscribe from a webhook by responding to the webhook request with a 200 response.
If a webhook invocation does not receive a 200, Rev.ai will retry the callback_url every 30 minutes until either 24 hours have passed or we receive a 200 response.

For initial webhook testing, you can try using a third-party webhook testing tool.
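A callback receiver for the webhook behavior described in this section could be sketched as follows, using Python's built-in http.server. The handler class, the helper name, and the port are hypothetical; the payload fields are the ones shown in the sample webhooks above.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def summarize_webhook(body):
    """Turn a webhook request body (bytes or str) into (job_id, status, detail)."""
    job = json.loads(body)["job"]
    detail = job.get("failure_detail") if job["status"] == "failed" else None
    return job["id"], job["status"], detail


class CallbackHandler(BaseHTTPRequestHandler):
    """Accept the POST sent to callback_url and acknowledge it."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        job_id, status, detail = summarize_webhook(self.rfile.read(length))
        print(f"job {job_id}: {status}" + (f" ({detail})" if detail else ""))
        # Reply 200 so the callback is not retried.
        self.send_response(200)
        self.end_headers()


def serve(port=8080):
    """Run the receiver until interrupted (the port is an arbitrary choice)."""
    HTTPServer(("", port), CallbackHandler).serve_forever()
```

Calling serve() starts the listener. A production receiver would typically queue the job id for later processing and return the 200 quickly rather than doing work inside the handler.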

Async API Limits

The following default limits apply per user, per endpoint, and are configurable by Rev.ai support. If you have any further questions, contact Rev.ai support.

10,000 transcription requests submitted every 10 minutes
500 transcriptions processed every 10 minutes
multipart/form-data requests to the /jobs endpoint have a concurrency limit of 10 and a file size limit of 2 GB
POST requests to the /jobs endpoint that use the media_url property do not have a concurrency limit or file size restriction. They are limited only by the first two limits above
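A client can stay under the submission limit listed above by throttling itself before sending. The sketch below is one possible client-side approach, a sliding-window counter. How the server actually accounts for the 10-minute window is not specified on this page, so treat this as a conservative approximation; the class name is illustrative.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `limit` events in any span of `window_seconds`."""

    def __init__(self, limit, window_seconds, clock=time.monotonic):
        self.limit = limit
        self.window_seconds = window_seconds
        self.clock = clock
        self._events = deque()

    def try_acquire(self):
        """Record an event and return True if it fits in the window, else False."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._events and now - self._events[0] >= self.window_seconds:
            self._events.popleft()
        if len(self._events) >= self.limit:
            return False
        self._events.append(now)
        return True
```

For the default submission limit this would be constructed as SlidingWindowLimiter(10_000, 600), with try_acquire() checked before each POST to /jobs.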

Error Handling

The API indicates failure with 4xx and 5xx HTTP status codes. 4xx status codes indicate an error due to the request provided (e.g., a required parameter was omitted). 5xx status codes indicate an error with Rev.ai's servers.

When a 4xx error occurs during invocation of a request, the API responds with a problem details object as the HTTP response payload.

The problem details information is represented as a JSON object with the following optional properties:

Property	Description
type	a URI representing the type for the error
title	a short human readable description of type
detail	additional details of the error
status	HTTP status code of the error

In addition to the properties listed above, the problem details object may list additional properties that help to troubleshoot the problem.

Example Errors

// Bad Submit Job Request
{
  "parameter": {
    "media_url": [
      "The media_url field is required"
    ]
  },
  "type": "https://www.rev.ai/api/v1/errors/invalid-parameters",
  "title": "Your request parameters didn't validate",
  "status": 400
}

// Invalid Transcript State
{
  "allowed_values": [
    "transcribed"
  ],
  "current_value": "in_progress",
  "type": "https://rev.ai/api/v1/errors/invalid-job-state",
  "title": "Job is in invalid state",
  "detail": "Job is in invalid state to obtain the transcript",
  "status": 409
}
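Because every property of the problem details object is optional and extra properties may appear, client code has to read it defensively. The following sketch shows one way to do that; the ProblemDetails class and parse_problem function are illustrative names, and the field names follow the example errors above.

```python
import json
from dataclasses import dataclass, field


@dataclass
class ProblemDetails:
    """The documented properties, plus whatever else the API included."""

    type: str = ""
    title: str = ""
    detail: str = ""
    status: int = 0
    extras: dict = field(default_factory=dict)


def parse_problem(body):
    """Parse an error payload; missing properties fall back to defaults."""
    data = json.loads(body)
    known = {
        key: data.pop(key)
        for key in ("type", "title", "detail", "status")
        if key in data
    }
    # Anything left over (e.g. "parameter", "allowed_values") goes to extras.
    return ProblemDetails(**known, extras=data)
```

With the "Invalid Transcript State" example, extras would hold allowed_values and current_value, which is enough to decide whether to wait and try again.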

Retrying Failed Requests

Some errors can be resolved simply by retrying the request. The following error codes are likely to be resolved with successive retries.

Status Code	Error
429	Too Many Requests
502	Bad Gateway
503	Service Unavailable
504	Gateway Timeout

Note: With the exception of the 429 status code, it is recommended that the maximum number of retries be limited to 5 attempts per request. The number of retries can be higher for 429 errors, but if you notice consistent throttling, please contact Rev.ai support.
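The retry guidance above can be sketched as a small wrapper. The exponential backoff with jitter is a common client-side choice rather than something this page prescribes, and the function name and delay values are assumptions.

```python
import random
import time

# Status codes listed above as likely to succeed on retry.
RETRYABLE_STATUSES = {429, 502, 503, 504}


def call_with_retries(send, max_retries=5, base_delay=1.0, sleep=time.sleep):
    """Call send() until it returns a non-retryable status or retries run out.

    `send` is any zero-argument callable returning (status_code, body).
    """
    attempt = 0
    while True:
        status, body = send()
        if status not in RETRYABLE_STATUSES or attempt >= max_retries:
            return status, body
        # Exponential backoff with jitter: with the defaults, roughly
        # 1s, 2s, 4s, ... plus up to 1s of random extra delay.
        sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))
        attempt += 1
```

With max_retries=5 the request is sent at most six times (the first attempt plus five retries). A caller handling 429 responses could pass a larger max_retries, in line with the note above.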

Homepage: https://api.apis.guru/v2/specs/rev.ai/v1.json
Provider: rev.ai
OpenAPI version: 3.0.1
Spec (JSON): https://api.apis.guru/v2/specs/rev.ai/v1/openapi.json
Spec (YAML): https://api.apis.guru/v2/specs/rev.ai/v1/openapi.yaml

Tools (9)

Extracted live via the executor SDK.

account.getAccount

Get the developer's account information

captions.getCaptions

Returns the caption output for a transcription job. We currently support SubRip (SRT) and Web Video Text Tracks (VTT) output. Caption output format can be specified in the Accept header. Returns SRT by default.
Note: For streaming jobs, transient failure of our storage during a live session may prevent the final hypothesis elements from saving properly, resulting in an incomplete caption file. This is rare, but not impossible.

jobs.deleteJobById

Deletes a transcription job. All data related to the job, such as input media and transcript, will be permanently deleted. A job can only be deleted once it's completed (either with success or failure).

jobs.getJobById

Returns information about a transcription job

jobs.getListOfJobs

Gets a list of transcription jobs submitted within the last 30 days in reverse chronological order, up to the provided limit number of jobs per call. Note: Jobs older than 30 days will not be listed. Pagination is supported by passing the last job id from a previous call into starting_after.

jobs.submitTranscriptionJob

Starts an asynchronous job to transcribe speech-to-text for a media file. Media files can be specified in two ways: either by including a public URL to the media in the transcription job options or by uploading a local file as part of a multipart/form-data request.

transcript.getTranscriptById

Returns the transcript for a completed transcription job. Transcript can be returned in either JSON or plaintext format. Transcript output format can be specified in the Accept header. Returns JSON by default.
Note: For streaming jobs, transient failure of our storage during a live session may prevent the final hypothesis elements from saving properly, resulting in an incomplete transcript. This is rare, but not impossible. To guarantee 100% completeness, we recommend capturing all final hypotheses as you receive them on the client.

openapi.previewSpec

Preview an OpenAPI document before adding it as a source

openapi.addSource

Add an OpenAPI source and register its operations as tools
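The starting_after pagination described for jobs.getListOfJobs can be sketched as a generator. Here fetch_page is a hypothetical callable standing in for the actual list call; it is assumed to accept limit and starting_after and to return a list of job dicts, each with an id.

```python
def iter_jobs(fetch_page, limit=100):
    """Yield jobs page by page, passing the last seen id as starting_after."""
    starting_after = None
    while True:
        page = fetch_page(limit=limit, starting_after=starting_after)
        yield from page
        # A short (or empty) page means there is nothing left to fetch.
        if len(page) < limit:
            return
        starting_after = page[-1]["id"]
```

Because the generator is lazy, a caller that only needs the most recent few jobs can stop iterating early without requesting further pages.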