03-05-2012, 11:07 AM
SPEECH RECOGNITION REPORT FONIX PRODUCT EVALUATION
Report.doc (Size: 151.5 KB / Downloads: 140)
ABSTRACT
This document defines a set of evaluation criteria and test methods for speech recognition systems used in vehicles. FONIX product was found the easiest and quickest system to develop speech recognition software, available on the market today. The evaluation of this product in vehicle, noisy environments, and accuracy and suitability of control under various conditions are also included in this document for the testing purposes. The effects of engine noise, interference by turbulent air outside the car, interference by the sounds from the car’s radio/entertainment, and interference by the sounds of the car’s windshield wipers are all considered separately. Recognition accuracy was compared using a variety of different road routines, noisy environments and languages. Testing in “ideal” non-noisy environment of a quiet room has been also performed for comparison.
INTRODUCTION
In this report we concentrate on the speech recognition programs that are human-computer interactive. When software evaluators observe humans testing such software programs, they gain valuable insights into technological problems and barriers that they may never witness otherwise. Testing speech recognition products for universal usability is an important step before considering the product to be a viable solution for its customers later. This document concerns Speech Recognition accuracy in the automobile, which is a critical factor in the development of hands-free human-machine interactive devices. There are two separate issues that we want to test: word recognition accuracy and software friendliness. Major factors that impede recognition accuracy in the automobile include noise sources such as tires and wind noise while the vehicle is in motion, engine noise, noises produced by the car radio/entertainment systems, fans, windshield wipers, horn, turn signals, heater, A/C, temperature sets, cruise control speed setting, headlight, emergency flashers, and others listed below.
But, what is speech recognition?
Speech recognition works like this. You speak into a microphone and the computer transforms the sound of your words into text to be used by your word processor or other applications available on your computer. The computer may repeat what you just said or it may give you a prompt for what you are expected to say next. This is the central promise of interactive speech recognition. Early speech recognition programs made you speak in staccato fashion, insisting that you leave a gap between every two words. You also had to correct any errors virtually as soon as they happened, which means that you had to concentrate so hard on the software that you often forgot what you were trying to say.