AI-voice detection
Detect whether audio clips are human (bonafide) or AI-generated (spoofed) and get segment-level confidence scores.
Designed for quick checks
- Supported formats: WAV, MP3, AAC, FLAC, OGG, M4A, MP4, MOV, AVI, MKV
Two ways to send audio
multipart/form-dataupload with afileapplication/jsonpayload with apresigned_url
The API automatically analyzes the audio in ~5-second segments and returns both a global verdict and detailed per-segment results.
Documentation Index
Fetch the complete documentation index at: https://docs.aurigin.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Body
Provide either a direct file upload or a presigned URL. Include device to tag the caller and optionally choose a model.
Optional device type making the request
macos, windows, web_app, api Optional model version to run. Defaults to the latest available model (apollo-4-5-2026-03-26). apollo-4-2026-01-16 remains available for backwards compatibility.
apollo-4-5-2026-03-26, apollo-4-2026-01-16 Optional custom prediction identifier
Maximum silence percentage allowed per segment before it is skipped (0.0-100.0). Segments where silence exceeds this threshold are excluded from model inference. Lower values are stricter (skip more segments); higher values are more permissive. Set to 100 to analyze virtually all segments regardless of silence.
0 <= x <= 100Decision threshold for classifying audio as bonafide or spoofed (0.0-1.0). Lower values bias toward higher spoof detection; higher values reduce false positives. Confidence scores are derived from how far the raw score is from this threshold.
0 <= x <= 1Response
Global verdict plus per-segment predictions and confidence scores.
Unique identifier for this prediction
Array of segment-level predictions. Each segment represents a 5-second window of the audio.
Model version used for prediction
Time taken to process the audio in seconds
Duration of the audio file in seconds
Array of warning messages, if any