AI-voice detection

Detect whether short audio clips are human (bonafide) or AI-generated (spoofed) and get segment-level confidence scores.

Designed for quick checks

Best for files under 5 MB that contain more than 3 seconds of audio
Supported formats: WAV, MP3, AAC, FLAC, OGG, M4A, MP4, MOV, AVI, MKV
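
A client can check these guidelines before uploading. The following is a minimal sketch, not part of the API: the helper name `check_clip` is hypothetical, reading "5 MB" as 5 * 1024 * 1024 bytes is an assumption, and the page describes these limits as "best for" rather than as hard limits. Duration is not checked, since that needs an audio decoder.

```python
from pathlib import Path

# Extensions derived from the supported-formats list above.
SUPPORTED_EXTENSIONS = {
    ".wav", ".mp3", ".aac", ".flac", ".ogg",
    ".m4a", ".mp4", ".mov", ".avi", ".mkv",
}
# Assumption: "5 MB" means 5 * 1024 * 1024 bytes; the page does not say.
MAX_BYTES = 5 * 1024 * 1024


def check_clip(name: str, size_bytes: int) -> list:
    """Return a list of problems; an empty list means the clip looks suitable."""
    problems = []
    ext = Path(name).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if size_bytes >= MAX_BYTES:
        problems.append("file is not under 5 MB")
    return problems
```

For example, `check_clip("call.WAV", 1_000_000)` returns an empty list, while a `.txt` file or a 6 MB file each produce one problem message.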

Two ways to send audio

A multipart/form-data upload with a file field
An application/json payload with a presigned_url field
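
The JSON variant could be built roughly as follows using only the Python standard library. This is a sketch under assumptions: the function name `build_presigned_request` is hypothetical, and it assumes the JSON body accepts `device` alongside `presigned_url`, mirroring the form fields of the upload variant. The function only builds the request and does not send it.

```python
import json
import urllib.request

API_URL = "https://api.aurigin.ai/v1/predict"  # endpoint from the curl example


def build_presigned_request(api_key: str, presigned_url: str,
                            device: str = "api") -> urllib.request.Request:
    """Build (but do not send) a JSON request that points at a presigned URL."""
    # Assumption: `device` is accepted in the JSON body next to `presigned_url`.
    payload = {"presigned_url": presigned_url, "device": device}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-api-key": api_key},
        method="POST",
    )
```

To send it, pass the returned object to `urllib.request.urlopen` and decode the response body with `json.load`.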

The API automatically analyzes the audio in ~5-second segments and returns both a global verdict and detailed per-segment results.
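
To get a feel for how many segments to expect, the windows can be estimated from the clip duration. This is only an illustration: the page says "~5-second" segments, and how the service handles a trailing partial window is not documented, so the ceiling-based split below is an assumption. The name `expected_segments` is hypothetical.

```python
import math


def expected_segments(duration_s: float, segment_s: float = 5.0) -> list:
    """Split [0, duration_s] into consecutive windows of at most segment_s seconds."""
    count = math.ceil(duration_s / segment_s)
    # Assumption: the last window is simply cut short at the end of the clip.
    return [
        (i * segment_s, min((i + 1) * segment_s, duration_s))
        for i in range(count)
    ]
```

Under this assumption a 10-second clip yields two windows, 0 to 5 and 5 to 10 seconds, and a 12-second clip yields a third window from 10 to 12 seconds.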

POST /v1/predict

curl --request POST \
  --url https://api.aurigin.ai/v1/predict \
  --header 'Content-Type: multipart/form-data' \
  --header 'x-api-key: <api-key>' \
  --form file='@example-file' \
  --form device=api \
  --form model=apollo-4-2025-10-20

{
  "prediction_id": "pred_9b6ff057a7f7",
  "global": {
    "confidence": 0.95,
    "result": "spoofed",
    "reason": null
  },
  "segments": [
    {
      "index": 0,
      "start": 0,
      "end": 5,
      "confidence": 0.96,
      "result": "spoofed"
    },
    {
      "index": 1,
      "start": 5,
      "end": 10,
      "confidence": 0.94,
      "result": "spoofed"
    }
  ],
  "model": "apollo-4-2025-10-20",
  "processing_time": 1.23,
  "audio_duration": 10,
  "warnings": []
}
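
A response like the one above can be reduced to the time ranges flagged as spoofed. The sketch below is one possible post-processing step, not something the API provides; the helper name `spoofed_ranges` is hypothetical and it relies only on the fields visible in the example response.

```python
import json


def spoofed_ranges(response: dict) -> list:
    """Merge back-to-back segments labelled "spoofed" into (start, end) ranges."""
    ranges = []
    for seg in sorted(response["segments"], key=lambda s: s["index"]):
        if seg["result"] != "spoofed":
            continue
        if ranges and ranges[-1][1] == seg["start"]:
            # Extend the previous range when this segment starts where it ended.
            ranges[-1] = (ranges[-1][0], seg["end"])
        else:
            ranges.append((seg["start"], seg["end"]))
    return ranges


# Trimmed copy of the example response above.
sample = json.loads("""
{
  "global": {"confidence": 0.95, "result": "spoofed", "reason": null},
  "segments": [
    {"index": 0, "start": 0, "end": 5, "confidence": 0.96, "result": "spoofed"},
    {"index": 1, "start": 5, "end": 10, "confidence": 0.94, "result": "spoofed"}
  ]
}
""")
print(sample["global"]["result"], spoofed_ranges(sample))  # spoofed [(0, 10)]
```

Merging adjacent segments keeps the output short for long stretches of spoofed audio while still showing where a mixed clip switches between results.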

Authorizations

x-api-key (string, header, required)

Body

Provide either a direct file upload or a presigned URL. Include device to tag the caller and optionally choose a model.

file (required)

device (enum<string>, optional)
Type of device making the request.
Available options: macos, windows, web_app, api

model (enum<string> | null, optional)
Model version to run. Currently only apollo-4-2025-10-20 is available for this endpoint.
Available options: apollo-4-2025-10-20

prediction_id (string, optional)
Custom prediction identifier.
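
For callers that cannot use curl, the upload body can be assembled by hand. The sketch below builds a multipart/form-data body with the Python standard library; the function name `build_multipart` is hypothetical, the `application/octet-stream` part type is an assumption (the page does not say which part type the service expects), and the client-side `device` check simply mirrors the options listed above.

```python
import uuid
from typing import Optional, Tuple

ALLOWED_DEVICES = {"macos", "windows", "web_app", "api"}  # options listed above


def build_multipart(file_name: str, file_bytes: bytes, device: str = "api",
                    model: Optional[str] = None) -> Tuple[bytes, str]:
    """Return (body, content_type) for a multipart/form-data upload."""
    if device not in ALLOWED_DEVICES:
        raise ValueError(f"unknown device: {device}")
    boundary = uuid.uuid4().hex
    fields = {"device": device}
    if model is not None:
        fields["model"] = model

    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    # Assumption: a generic binary part type is acceptable for the file.
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode("utf-8")
        + file_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"
```

The returned content type carries the boundary, so it should be sent as the `Content-Type` header together with `x-api-key`. A bare `multipart/form-data` header without the boundary would leave the server unable to split the parts.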

Response

Global verdict plus per-segment predictions and confidence scores.

prediction_id (string)
Unique identifier for this prediction.

global (object)
Verdict for the whole file. In the example response it contains confidence, result, and reason.

segments (object[])
Array of segment-level predictions. Each segment represents a roughly 5-second chunk of the audio. In the example response each entry contains index, start, end, confidence, and result.

model (string)
Model version used for prediction.

processing_time (number<float>)
Time taken to process the audio, in seconds.

audio_duration (number<float>)
Duration of the audio file, in seconds.

warnings (string[])
Array of warning messages, if any.
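
The response fields map naturally onto typed records. The following sketch uses dataclasses; the class and function names are hypothetical, the child fields of `global` and `segments` are taken from the example response rather than from a documented schema, and treating `reason` as an optional string is an assumption (the example only shows `null`).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    index: int
    start: float
    end: float
    confidence: float
    result: str


@dataclass
class Prediction:
    prediction_id: str
    result: str            # from global.result
    confidence: float      # from global.confidence
    reason: Optional[str]  # from global.reason; type is an assumption
    segments: List[Segment]
    model: str
    processing_time: float
    audio_duration: float
    warnings: List[str]


def parse_prediction(data: dict) -> Prediction:
    """Convert a decoded JSON response into typed records."""
    verdict = data["global"]
    segments = [
        # Read keys explicitly so unexpected extra fields do not raise.
        Segment(s["index"], s["start"], s["end"], s["confidence"], s["result"])
        for s in data["segments"]
    ]
    return Prediction(
        prediction_id=data["prediction_id"],
        result=verdict["result"],
        confidence=verdict["confidence"],
        reason=verdict.get("reason"),
        segments=segments,
        model=data["model"],
        processing_time=data["processing_time"],
        audio_duration=data["audio_duration"],
        warnings=list(data.get("warnings", [])),
    )
```

Flattening `global` into the top-level record is a design choice for convenience; a nested record would work equally well.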
