Skip to main content

Why VoxaTrace?

The Gap in Voice AI

Voice AI has made incredible progress understanding what you say. Speech-to-text is solved. But none of it understands how you sound.

Pitch. Timbre. Melody. Rhythm. Emotion. Vocal quality. These acoustic qualities are invisible to speech AI — but they're everything for singing apps, vocal training, music education, and any application where the sound of a voice matters.

VoxaTrace fills that gap.

What Makes VoxaTrace Different

Acoustic Intelligence, Not Speech Recognition

FeatureSpeech AIVoxaTrace
OutputText transcriptPitch (Hz), confidence, vocal activity
FocusWords and languageHow the voice sounds
Use caseDictation, commandsSinging, training, games
Latency100ms–1s (cloud)~50ms (on-device)

On-Device, Real-Time

No server calls. No network latency. No data leaving the device.

  • ~50ms latency — fast enough for real-time feedback
  • Works offline — no internet required
  • Privacy by design — audio never leaves the device

Battle-Tested at Scale

  • 8 years of R&D — not a weekend project
  • 5 million users — proven in production
  • Same API on Android and iOS — write once, ship everywhere

Three-Tier API Design

Use as much complexity as you need:

// Tier 1: Just works (80% of users)
val detector = CalibraPitch.createDetector()

// Tier 2: Discoverable options (15% of users)
val detector = CalibraPitch.createDetector(
PitchDetectorConfig.Builder()
.algorithm(PitchAlgorithm.SWIFT_F0)
.voiceType(VoiceType.SOPRANO)
.build()
)

// Tier 3: Full control (5% of users)
val config = PitchDetectorConfig.PRECISE.copy(
confidenceThreshold = 0.6f,
amplitudeGateDb = -50f
)

Algorithm Performance

Two pitch detection algorithms for different needs:

MetricYINSwiftF0
Latency~50ms~50ms
Accuracy (clean audio)GoodExcellent
Accuracy (noisy audio)FairGood
Memory overheadMinimal~10MB (model)
Battery impactLowLow-Medium
DependenciesNoneONNX Runtime

When to Use Each

Choose YIN when:

  • Building a simple tuner
  • Targeting low-end devices
  • Audio is clean and controlled
  • You want minimal dependencies

Choose SwiftF0 when:

  • Building singing or vocal training apps
  • Users are in noisy environments
  • Accuracy matters more than app size
  • You need robust vibrato detection

What You Can Build

VoxaTrace enables any application where the acoustic qualities of voice matter:

ApplicationWhat VoxaTrace Provides
Singing appsPitch detection, scoring, performance feedback
Vocal trainingReal-time intonation analysis, progress tracking
Music educationEar training, sight-singing, pitch matching
Voice gamesPitch as game input (sing to jump, hum to control)
AccessibilityVoice-based input beyond speech recognition
WellnessVocal health monitoring, breathing exercises

Ready to Start?