Endpoint
Headers
Request Body
You can send audio data in two ways:Option 1: Multipart form data (direct file upload)
file
(required): Audio file (WAV, MP3, M4A, etc.)user_id
(optional): User identifier for tracking
Option 2: JSON with presigned URL
presigned_url
(required): URL to the audio fileuser_id
(optional): User identifier for tracking
Examples
- Direct file upload
- Presigned URL
Response
Response Fields
predictions
(array): AI detection results for each 5-second chunk of the audio"fake"
: Audio segment is likely AI-generated"real"
: Audio segment is likely human-generated- Array length = number of 5-second chunks in the audio file
global_probability
(array): Confidence scores (0.0-1.0) for each prediction- Higher values indicate higher confidence in the prediction
- Corresponds 1:1 with the
predictions
array
error
(array): Error messages for each chunk (null if successful)- Corresponds 1:1 with the
predictions
array - Contains error details if processing failed for a specific chunk
- Corresponds 1:1 with the
Audio Processing Details
The API processes your audio file in 5-second chunks:- Minimum duration: 3 seconds (1 chunk)
- Maximum duration: 50MB file size
- Chunk size: 5 seconds each
- Output: One prediction per 5-second chunk
- A 12-second audio file → 3 chunks → 3 predictions
- A 30-second audio file → 6 chunks → 6 predictions
Supported Formats
- WAV, MP3, M4A, FLAC, OGG
- Mono or stereo
- Various bitrates and sample rates
Error Codes
- 400: Invalid input or file too large (50MB max)
- 403: Authentication failed (check
x-api-key
) - 500: Internal error or upstream unavailability
Authorizations
Response
200 - application/json
OK
Error messages for each 5-second chunk (null if successful)
Confidence scores (0.0-1.0) for each prediction, one per 5-second chunk
AI detection results for each 5-second chunk of the audio. Array length equals the number of 5-second chunks in the audio file.