Add a pre-extracted pitch point. When points are supplied this way, internal pitch detection is skipped.
Timestamp in seconds
Pitch in Hz (-1 for unvoiced)
Detection confidence (0.0-1.0)
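
A minimal sketch of how a call with these three parameters might be stored and validated. The names `PitchPoint`, `PitchTrack`, and `add_pitch_point` are hypothetical and are not taken from the library being documented. The range checks are also assumptions based only on the parameter descriptions above.

```python
from dataclasses import dataclass, field
from typing import List

# Sentinel taken from the parameter description: -1 marks an unvoiced point.
UNVOICED = -1.0


@dataclass
class PitchPoint:
    time: float        # timestamp in seconds
    pitch: float       # pitch in Hz, or -1 for unvoiced
    confidence: float  # detection confidence, 0.0 to 1.0

    @property
    def voiced(self) -> bool:
        return self.pitch != UNVOICED


@dataclass
class PitchTrack:
    """Hypothetical container for externally supplied pitch points."""
    points: List[PitchPoint] = field(default_factory=list)

    def add_pitch_point(self, time: float, pitch: float, confidence: float) -> None:
        # Assumed validation: a voiced pitch must be a positive frequency.
        if pitch != UNVOICED and pitch <= 0:
            raise ValueError("pitch must be positive Hz, or -1 for unvoiced")
        # Assumed validation: confidence must lie in the documented range.
        if not 0.0 <= confidence <= 1.0:
            raise ValueError("confidence must be between 0.0 and 1.0")
        self.points.append(PitchPoint(time, pitch, confidence))


track = PitchTrack()
track.add_pitch_point(0.00, 220.0, 0.93)  # voiced point at 220 Hz
track.add_pitch_point(0.01, -1, 0.10)     # unvoiced point
```

In this sketch the caller supplies every point, so nothing in `PitchTrack` runs a detector; a real implementation may handle ordering, duplicates, or out-of-range values differently.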