AI-voice detection
Analyze Audio
Detect AI-generated audio with real-time deepfake analysis
POST
Overview
The /predict endpoint analyzes audio files for AI-generated content (deepfakes, voice cloning, text-to-speech) and returns results in real time for small files.
File Size Limits: For very large files (100MB+), use pre-signed S3 URLs for asynchronous processing.
Authentication
Your API key for authentication
Request Parameters
Audio or video file to analyze. Supported formats: WAV, MP3, AAC, FLAC, OGG, M4A, MP4, MOV, AVI, MKV. Max size: 5 MB (recommended for fastest processing).
Device type making the request. Options: macos, windows, web_app, api.
Custom prediction ID for tracking (auto-generated if not provided). Format: pred_ followed by 12 hex characters. Example: pred_9b6ff057a7f7.
Model version to use. Options:
- stable: Production model (recommended)
- stable-latest: Latest stable release
- dev-v4: Development model (testing only)
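The constraints above can be checked on the client before uploading. The following is a minimal sketch, not part of the official API: the function name and argument names are hypothetical, "5 MB" is read as 5 MiB, and hex digits are assumed to be lowercase as in the example ID.

```python
import re
from pathlib import Path

# Values copied from the parameter descriptions above.
SUPPORTED_EXTENSIONS = {"wav", "mp3", "aac", "flac", "ogg", "m4a", "mp4", "mov", "avi", "mkv"}
DEVICE_OPTIONS = {"macos", "windows", "web_app", "api"}
MODEL_OPTIONS = {"stable", "stable-latest", "dev-v4"}
RECOMMENDED_MAX_BYTES = 5 * 1024 * 1024  # assumption: "5 MB" read as 5 MiB

# Assumption: hex digits are lowercase, as in the example pred_9b6ff057a7f7.
PREDICTION_ID_RE = re.compile(r"pred_[0-9a-f]{12}")


def validate_request(filename, size_bytes, device=None, prediction_id=None, model=None):
    """Return a list of problems found; an empty list means the inputs look valid."""
    problems = []
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or 'none'}")
    if size_bytes > RECOMMENDED_MAX_BYTES:
        problems.append("file exceeds the recommended 5 MB size")
    if device is not None and device not in DEVICE_OPTIONS:
        problems.append(f"unknown device: {device}")
    if prediction_id is not None and not PREDICTION_ID_RE.fullmatch(prediction_id):
        problems.append(f"malformed prediction ID: {prediction_id}")
    if model is not None and model not in MODEL_OPTIONS:
        problems.append(f"unknown model version: {model}")
    return problems
```

Calling `validate_request("call.wav", 1000, "api", "pred_9b6ff057a7f7", "stable")` returns an empty list. The server remains the authority on what it accepts, so this check only catches obvious mistakes early.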
Response
Unique identifier for this prediction
Overall prediction for the entire audio file
Per-segment analysis (one per chunk_duration)
Model version used for prediction
Time taken to process in seconds
Total audio file duration in seconds
List of warnings (e.g., partial chunk failures, quality issues)
Example Request
Example Response
200 - Spoofed Audio Detected
200 - Authentic Audio
400 - Validation Error
401 - Unauthorized
413 - File Too Large
Error Codes
| Code | Description | Solution |
|---|---|---|
| 400 | Validation error | Check file format, duration, parameters |
| 401 | Unauthorized | Verify API key is valid and active |
| 413 | File too large | Reduce file size or use pre-signed S3 URLs |
| 422 | Unsupported format | Convert to supported audio format |
| 500 | Processing failed | Retry; contact support if the error persists |
| 503 | Service unavailable | Retry with exponential backoff |
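One way to use this table in client code is a lookup that says whether a status is worth retrying. This is a sketch under assumptions: the function name is hypothetical, only 500 and 503 are treated as retryable because the table suggests retrying for those two, and the fallback message for unlisted codes is a placeholder of this example.

```python
# Suggested actions copied from the error table above.
ERROR_ACTIONS = {
    400: "Check file format, duration, parameters",
    401: "Verify API key is valid and active",
    413: "Reduce file size or use pre-signed S3 URLs",
    422: "Convert to supported audio format",
    500: "Retry or contact support if persists",
    503: "Retry with exponential backoff",
}

# Assumption: only the codes whose solution mentions retrying are retryable.
RETRYABLE_STATUSES = {500, 503}


def describe_status(status_code):
    """Return (retryable, suggested_action) for an HTTP status code."""
    if 200 <= status_code < 300:
        return (False, "No action needed")
    # Fallback text is a placeholder for codes the table does not list.
    action = ERROR_ACTIONS.get(status_code, "Unlisted status code")
    return (status_code in RETRYABLE_STATUSES, action)
```

A caller can branch on the first element of the tuple and log the second.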
Confidence Score Interpretation
The confidence score indicates how certain the model is about its prediction:
| Confidence | Interpretation | Action |
|---|---|---|
| 0.9 - 1.0 | Very confident | Trust the result |
| 0.7 - 0.9 | Confident | Generally reliable |
| 0.4 - 0.7 | Moderate | Review segments individually |
| 0.0 - 0.4 | Low confidence | Manual review recommended |
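The bands above can be applied programmatically. The table's ranges share their endpoints (0.9 appears in two rows), so this sketch assumes each band includes its lower bound; the function name is hypothetical.

```python
def interpret_confidence(confidence):
    """Map a confidence score to the (interpretation, action) pair from the table.

    Assumption: each band includes its lower bound, so 0.9 is "Very confident".
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0.0 and 1.0")
    if confidence >= 0.9:
        return ("Very confident", "Trust the result")
    if confidence >= 0.7:
        return ("Confident", "Generally reliable")
    if confidence >= 0.4:
        return ("Moderate", "Review segments individually")
    return ("Low confidence", "Manual review recommended")
```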
Best Practices
Optimize Performance
- Use WAV or FLAC for best accuracy
- Keep files under 5MB for fastest processing with this endpoint
Handle Partial Results
Check the warnings array for partial failures. If warnings exist, review the individual segments and their confidence scores.
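A check along these lines might look as follows. This is a sketch only: `warnings` is the field named above, while the `confidences` and `errors` keys are hypothetical names for the per-chunk arrays, and the 0.7 threshold is borrowed from the confidence table.

```python
def chunks_to_review(response, min_confidence=0.7):
    """Return indices of chunks worth a manual look, or [] when there are no warnings."""
    if not response.get("warnings"):
        return []
    # "confidences" and "errors" are hypothetical key names for the per-chunk arrays.
    confidences = response.get("confidences", [])
    errors = response.get("errors", [None] * len(confidences))
    flagged = []
    for index, (confidence, error) in enumerate(zip(confidences, errors)):
        # Flag chunks that failed, have no score, or scored below the threshold.
        if error is not None or confidence is None or confidence < min_confidence:
            flagged.append(index)
    return flagged
```

With one failed chunk and one chunk scored at 0.55, the function returns the indices of both.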
Retry Logic
Implement exponential backoff for transient errors:
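A minimal sketch of such a retry loop is shown here. It assumes that 500 and 503 are the transient statuses and that `send` is any zero-argument callable returning a `(status_code, body)` tuple; both the helper name and that calling convention are hypothetical and independent of any particular HTTP library.

```python
import random
import time

RETRYABLE_STATUSES = {500, 503}  # assumption: the transient errors in the error table


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0, sleep=time.sleep):
    """Call send() until it returns a non-retryable status or attempts run out.

    send is any zero-argument callable returning (status_code, body).
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status_code, body = send()
        last_attempt = attempt == max_attempts - 1
        if status_code not in RETRYABLE_STATUSES or last_attempt:
            return status_code, body
        # Double the delay on each retry, capped at max_delay.
        delay = min(max_delay, base_delay * 2 ** attempt)
        # Add up to 10% jitter so parallel clients do not retry in lockstep.
        sleep(delay + random.uniform(0, delay * 0.1))
```

The `sleep` parameter is injectable so the loop can be tested without waiting. With the defaults, the waits are roughly 1, 2, 4, and 8 seconds.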
Monitor Performance
Track processing_time to identify slow requests:
- < 2s: Excellent
- 2-5s: Good
- 5-10s: Acceptable for large files
- > 10s: Consider async processing
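To track these bands across many requests, the thresholds can be turned into a small tally. This sketch assumes that a value exactly on a boundary falls into the faster band (5s counts as Good, 10s as Acceptable); the function names are hypothetical.

```python
from collections import Counter


def rate_processing_time(seconds):
    """Map a processing_time value to the rating from the list above."""
    if seconds < 2:
        return "Excellent"
    if seconds <= 5:  # assumption: boundary values fall into the faster band
        return "Good"
    if seconds <= 10:
        return "Acceptable for large files"
    return "Consider async processing"


def summarize_processing_times(times):
    """Count how many requests landed in each rating."""
    return Counter(rate_processing_time(t) for t in times)
```

A rising count in the slowest band is a signal to move those files to the asynchronous flow.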
Rate Limits
| Plan | Requests/Minute | Concurrent |
|---|---|---|
| Free | 10 | 2 |
| Starter | 60 | 5 |
| Professional | 300 | 20 |
| Enterprise | Custom | Custom |
Upgrade your plan at app.aurigin.ai/billing for higher limits.
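Clients can stay under the per-minute limit by throttling themselves. The sketch below is a client-side sliding-window limiter. How the server actually measures its window is not documented here, so the 60-second sliding window is an assumption, and the class name is hypothetical. It does not cover the concurrent-request limit.

```python
import time
from collections import deque

# Requests per minute, copied from the table above.
PLAN_LIMITS = {"Free": 10, "Starter": 60, "Professional": 300}


class MinuteRateLimiter:
    """Client-side sliding-window limiter: at most `limit` requests per window."""

    def __init__(self, limit, window=60.0, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock
        self.sent = deque()  # timestamps of requests inside the current window

    def wait_time(self):
        """Seconds to wait before the next request is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            return 0.0
        return self.window - (now - self.sent[0])

    def record(self):
        """Note that a request was just sent."""
        self.sent.append(self.clock())
```

Before each request, sleep for `limiter.wait_time()` seconds, send, then call `limiter.record()`.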
Related Endpoints
Pre-Signed URLs
Asynchronous processing for large files (100MB+)
Authorizations
Response
OK
Error messages for each 5-second chunk (null if successful). Aligns 1:1 with the predictions array.
Confidence scores (0.0-1.0) for each prediction, one per 5-second chunk. Aligns 1:1 with the predictions array.
AI detection results for each 5-second chunk of the audio. Array length equals the number of 5-second chunks in the audio file. Available options: fake, real
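Because the three arrays align 1:1 and each entry covers a 5-second chunk, they can be combined into a per-chunk timeline. This is a sketch, not an official helper: the function name and the output keys are hypothetical, and the arrays are passed as plain arguments because only the predictions array is named in the schema.

```python
CHUNK_SECONDS = 5  # chunk length stated in the schema above


def chunk_timeline(predictions, confidences, errors):
    """Combine the aligned per-chunk arrays into one record per 5-second chunk."""
    if not (len(predictions) == len(confidences) == len(errors)):
        raise ValueError("arrays must align 1:1")
    timeline = []
    for index, (label, confidence, error) in enumerate(zip(predictions, confidences, errors)):
        timeline.append({
            "start": index * CHUNK_SECONDS,        # chunk start offset in seconds
            "end": (index + 1) * CHUNK_SECONDS,    # nominal end; the last chunk may be shorter
            "prediction": label,                   # "fake" or "real"
            "confidence": confidence,
            "error": error,                        # None when the chunk succeeded
        })
    return timeline
```

The second record in the result describes the audio between 5 and 10 seconds, which makes it easy to point a reviewer at the exact stretch that was flagged.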