How It Works
The API processes your audio file in 5-second chunks:- Minimum duration: 3 seconds (1 chunk)
- Maximum duration: 4MB file size
- Chunk size: 5 seconds each
- Output: One prediction per 5-second chunk
- A 12-second audio file → 3 chunks → 3 predictions
- A 30-second audio file → 6 chunks → 6 predictions
Make a Request
- JSON with presigned URL
- Multipart form upload
Authenticate
x-api-key header with every request.Choose upload method
presigned_url when the file already lives in storage.Inspect chunked results
Response Breakdown
Supported Formats
- WAV, MP3, M4A, FLAC, OGG
- Mono or stereo
- Various bitrates and sample rates
Error codes
Error codes
| Code | Description |
|---|---|
400 | Invalid input or file too large (4MB max) |
403 | Authentication failed (check x-api-key) |
500 | Internal error or upstream unavailability |
Tips for production use
Tips for production use
- Cache presigned URLs for their validity window to minimize round trips.
- Retry with exponential backoff when you receive
500errors. - Monitor confidence scores to spot borderline results and trigger manual review.
Authorizations
Body
Provide either a direct file upload or a presigned URL. Include device to tag the caller and optionally choose a model.
Optional device type making the request
macos, windows, web_app, api Optional model version to run. Currently only apollo-4-2025-10-20 is available for this endpoint.
apollo-4-2025-10-20 Optional custom prediction identifier
Response
Global verdict plus per-segment predictions and confidence scores.
Unique identifier for this prediction
Array of segment-level predictions. Each segment represents a 5-second chunk of the audio.
Model version used for prediction
Time taken to process the audio in seconds
Duration of the audio file in seconds
Array of warning messages, if any