GENERAL backend - no model required
Create a Singing Realtime VAD provider (uses the SwiftF0 pitch model).
Create a Speech VAD provider.