Utterance Processing

It helps to understand how Utterly Voice processes each utterance. The following describes each step:

When the microphone is on, Utterly Voice is monitoring the microphone signal levels. When the signal goes above the configured microphone threshold (see Setup), it starts streaming microphone data to the recognizer. Once the signal goes below the threshold for a duration of the configured utterance gap milliseconds value (see Setup), it stops streaming microphone data and requests a transcript for the completed utterance.
Depending on the recognizer, it may have returned a list of possible transcripts that match the utterance audio. Based on your settings, the most likely match is selected.
If you have defined swaps, the transcript may be altered based on the definitions.
The transcript is sent to the interpreter. The interpreter uses your settings to determine if the utterance contains any commands or whether text should be typed as-is.
Commands within the utterance are executed and text is typed as dictated by the utterance. The utterance is added to the user interface history area, including results from the recognizer, swaps, and the interpreter.