Utterance Processing
It helps to understand how Utterly Voice
processes each utterance.
The following describes each step:
-
When the microphone is on,
Utterly Voice is monitoring the microphone signal levels.
When the signal goes above the configured microphone threshold
(see Setup),
it starts streaming microphone data to the recognizer.
Once the signal goes below the threshold for a duration of
the configured utterance gap milliseconds value
(see Setup),
it stops streaming microphone data and requests a transcript
for the completed utterance.
-
Depending on the recognizer,
it may have returned a list of possible transcripts
that match the utterance audio.
Based on your settings,
the most likely match is selected.
-
If you have defined swaps,
the transcript may be altered based on the definitions.
-
The transcript is sent to the interpreter.
The interpreter uses your settings
to determine if the utterance contains any commands
or whether text should be typed as-is.
-
Commands within the utterance are executed
and text is typed as dictated by the utterance.
The utterance is added to the user interface history area,
including results from the recognizer, swaps, and the interpreter.