Calibra Live Eval
Real-time singing evaluation session with segment support.
What is Live Evaluation?
Live evaluation scores a singer's performance in real-time by comparing their pitch to a reference melody. Use it for:
Karaoke apps: Score singing as users perform
Music education: Provide instant feedback on pitch accuracy
Practice apps: Track improvement across attempts
The session breaks a song into segments (phrases or sections), evaluates each one, and tracks the user's progress.
When to Use
| Scenario | Use This? | Why |
|---|---|---|
| Score singing against reference | Yes | Core use case |
| Just detect pitch (no scoring) | No | Use CalibraPitch.createDetector() |
| Analyze recorded audio (not live) | No | Use CalibraMelodyEval |
| Voice activity detection only | No | Use CalibraVAD |
Quick Start
Kotlin

```kotlin
// 1. Create detector and session
val detector = CalibraPitch.createDetector()
val session = CalibraLiveEval.create(lessonMaterial, detector = detector)

// 2. Prepare (loads reference, extracts features)
session.prepareSession()

// 3. Start a segment and feed audio
session.startPracticingSegment(0)
recorder.audioBuffers.collect { buffer ->
    session.feedAudioSamples(buffer.toFloatArray(), sampleRate = 48000)
}

// 4. Get the result
val result = session.finishPracticingSegment()
println("Score: ${result?.score}")

// 5. Clean up
session.closeSession()
```

Swift
```swift
// 1. Create detector and session
let detector = CalibraPitch.createDetector()
let session = CalibraLiveEval.create(
    reference: lessonMaterial,
    detector: detector
)

// 2. Prepare (loads reference, extracts features)
try await session.prepareSession()

// 3. Start a segment and feed audio
session.startPracticingSegment(index: 0)
for await buffer in recorder.audioBuffers {
    session.feedAudioSamples(samples: buffer.toFloatArray(), sampleRate: 48000)
}

// 4. Get the result
if let result = session.finishPracticingSegment() {
    print("Score: \(result.score)")
}

// 5. Clean up
session.closeSession()
```

Usage Tiers
Tier 1: Convenience API (80% of users)
Pass player and recorder handles; the session coordinates everything.
```kotlin
val session = CalibraLiveEval.create(
    reference = lessonMaterial,
    detector = CalibraPitch.createDetector(),
    player = player,
    recorder = recorder
)
session.prepareSession()
session.onSegmentComplete { result -> showScore(result) }
session.startPracticingSegment(0) // Seeks, plays, records, and scores automatically
```

Tier 2: Low-Level API (15% of users)
Manually manage audio; full control over timing.
```kotlin
val session = CalibraLiveEval.create(reference, detector = detector)
session.prepareSession()
session.startPracticingSegment(0)
recorder.audioBuffers.collect { buffer ->
    session.feedAudioSamples(buffer.toFloatArray(), buffer.sampleRate)
}
val result = session.finishPracticingSegment()
```

Phase Progressions
Singalong: IDLE → SINGING → EVALUATED
Singafter: IDLE → LISTENING → SINGING → EVALUATED
Observe the phase StateFlow for UI updates.
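As a sketch of a phase-driven UI, assuming phase is exposed as a StateFlow and is collected from a coroutine scope (the Phase enum values follow the progressions above; showBanner/hideBanner and scope are placeholder app-side names):

```kotlin
// Hypothetical sketch: react to practice-phase transitions in the UI.
scope.launch {
    session.phase.collect { phase ->
        when (phase) {
            Phase.LISTENING -> showBanner("Listen to the reference...")
            Phase.SINGING -> showBanner("Your turn!")
            Phase.EVALUATED -> showBanner("Segment scored")
            else -> hideBanner()
        }
    }
}
```

Because both singalong and singafter modes flow through the same phases, one collector covers both UIs.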
State Machine
```
IDLE ──prepareSession()──► READY
READY ──startPracticingSegment()──► PRACTICING
PRACTICING ──finishPracticingSegment()──► BETWEEN (or COMPLETED if last)
PRACTICING ──discardCurrentSegment()──► BETWEEN
PRACTICING ──seekToSegment()──► PRACTICING (new segment)
BETWEEN ──startPracticingSegment()──► PRACTICING
BETWEEN ──advanceToNextSegment()──► PRACTICING (or COMPLETED if last)
BETWEEN ──finishSession()──► COMPLETED
* ──closeSession()──► (released)
```

Ownership Model
| Dependency | Ownership | Rationale |
|---|---|---|
| detector | Owned (session closes it) | Created specifically for this session |
| player | Borrowed (caller manages) | Shared resource; UI may need direct access |
| recorder | Borrowed (caller manages) | Shared resource; may be reused |
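A minimal teardown sketch following the table above. Only closeSession() comes from this API; the release() calls on the borrowed player and recorder are placeholder caller-side cleanup, not part of the session:

```kotlin
// The session owns the detector: closeSession() releases it for us.
session.closeSession()

// Player and recorder are borrowed, so the caller releases them
// itself, when the screen (not just the session) goes away.
player.release()   // hypothetical caller-side cleanup
recorder.release() // hypothetical caller-side cleanup
```

This split lets you keep one player and recorder alive across several practice sessions.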
See also
For pitch detection without scoring
Input format with reference audio and segments
Configuration for auto-advance, thresholds, etc.
Per-segment scoring results
Properties
Current active segment state, or null if not practicing.
Map of segment index to list of attempts (from SegmentResultStore).
Current playback position in seconds (from player or clock in manual mode).
Whether recording is active.
Real-time pitch point for visualization (includes time and confidence).
Live pitch contour accumulated during the current segment.
Current practice phase. Observe this for unified singalong/singafter UI.
Whether pitch processing is currently enabled.
Reference key in Hz from LessonMaterial.
Current session state. Observe this in your UI.
Current student key in Hz. 0 = same as reference.
Functions
Advance to the next segment.
Manually trigger LISTENING → SINGING transition.
Close the session and release all resources.
Discard the current segment without scoring.
Feed audio samples to the session.
Finish the current segment and get its result.
Finish the session and get aggregated results.
Get all results for a specific segment.
Check if a segment has been completed at least once.
Register callback for phase changes. Called when practice phase transitions (e.g., LISTENING → SINGING).
Register callback for reference end (singafter mode). Called when reference audio finishes playing, before student starts singing.
Register callback for segment completion. Called when a segment finishes with its result.
Register callback for session completion. Called when all segments are finished.
Pause playback and recording.
Prepare the session for practice.
Restart the session from a clean state.
Resume from paused state.
Retry the current segment.
Seek to a specific segment (discards current attempt if practicing).
Seek to a specific time position.
Enable or disable pitch processing (smoothing + octave correction) at runtime.
Set student key for transposition.
Start practicing a specific segment.
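Tying several of the functions above together, a callback-driven practice loop might look like the following sketch. It is not the definitive flow: showScore and onAllDone are placeholder app functions, and onSessionComplete is an assumed name for the "session completion" callback listed above:

```kotlin
val session = CalibraLiveEval.create(
    reference = lessonMaterial,
    detector = CalibraPitch.createDetector(),
    player = player,
    recorder = recorder
)
session.prepareSession()

// Score each segment as it completes, then move on.
session.onSegmentComplete { result ->
    showScore(result)              // placeholder UI call
    session.advanceToNextSegment() // PRACTICING, or COMPLETED if last
}

// All segments finished: show aggregated results and release the session.
session.onSessionComplete { summary ->
    onAllDone(summary)             // placeholder UI call
    session.closeSession()
}

session.startPracticingSegment(0)
```

With the convenience tier, this is the whole loop; the session handles seeking, playback, and recording between callbacks.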