Transcription API

Transcription API

Transcribes audio into the input language.

Official Guide
We provide this cost friendly API for high throughput scenarios, there's no rate limit 🚀! For more detailed API spec, see

Available models:

  1. whisper-1

The price of Transcription call is 20% of the price of OpenAI official website. Details: LLM API | PPU Quota | Endpoint Usage


Transcribes audio into the input language.


Authorizationstring✔️Bearer YOUR-GOAPI-KEY
filefile✔️The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
languagestringThe language of the input audio. Supplying the input language in ISO-639-1 format will improve accuracy and latency.
promptstringAn optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.
response_formatstringThe format of the transcript output, in one of these options: json, text, srt, verbose_json, or vtt.

Response Codes:

200: OK
Successful Response
400: Bad Request
The request format does not meet the requirements.
401: Unauthorized
The API key is incorrect
500: Internal Server Error
Service is experiencing an error

Request Example

curl \
  -H "Authorization: Bearer $YOUR_GOAPI_KEY" \
  -H "Content-Type: multipart/form-data" \
  -F file="@/path/to/file/audio.mp3" \
  -F model="whisper-1"