Voice based web access is a rapidly developing technology. PhoNET is a solution for these and many other problems faced by the netizens. The basic idea is that using an ordinary phone to browse the web and the primary motivations are: to provide a widely available means for creating new interactive voice applications; addressing needs for mobility; and addressing issues inaccessibility. Basis of the idea are the age old IVR systems used to serve information for the dialers through a pre programmed process. Phonet is a very long journey from the IVRs; it involves the most complex technologies of the century Like Speech Recognition (SR), Text to speech (TTS) conversion and artificial intelligence (AI). This enables a user to be connected to internet as long as he has access to a phone. PhoNET uses the traditional HTML content so the web site need not be rewritten or redesigned. We present a detailed analysis in the most possible simplest way of how the technologies like SR, TTS and AI are integrated to develop a intelligent Platform (PhoNET) to achieve voice based web access which involves Document processing and Document Rendering. In Document Processing we describe two approaches, telephone browsing and transcoding, focusing mostly on the former since that work is more mature. In Document Rendering we present the major problem i.e., the relevance of cognitive thought to text rendering along with its most suitable solution. In the end we examine the challenges and further developments involved in practical application of the proposed technology-The PhoNET
Introduction
Today’s telecom business has seen recent growth, especially in bandwidth infrastructure for long distance (LD) and data. The industry is currently experiencing strong growth in the wireless segment as mobile devices prove to be very popular with both consumers and business. An evolving market segment is “Internet anywhere,” and many companies are trying approaches to present viable products for this market. One approach is Internet access over wireless devices such as cell phones with a screen. However, this method has inherent limitations such as small screen size, lack of a keyboard, the need for a special device (web-enabled phone), the need to rewrite and maintain a special website, and severe bandwidth constraints using wireless data transfer protocols. Another approach that is becoming popular is voice-based limited Internet access, which overcomes all of the limitations of the wireless data devices but one; they still limit access to the few sites that are re-engineered for voice. They typically deliver content such as news, weather, horoscopes, and stock quotes, etc. over the phone. These companies are called “Voice Portals.” Voice portals were the first web applications that tried to integrate websites with voice which gave birth to the enterprise based PBX systems. Our solution, which presents a third option, gives users all of the benefits of the voice portals, yet has complete access to the entire Internet without limitation. With our Voice Internet technology PhoNET, anyone can surf, search, send and receive email, and conduct e-commerce transactions, etc. using their voice from anywhere using any phone, with the more freedom of movement than a standard Internet browser which requires a PC and an Internet connection. PhoNET technology is faster and cheaper than existing alternatives. Today, only the largest of companies are making their Web sites telephone-accessible because existing technology requires a manual, costly and time-consuming re-write of each page. With the voice internet technology-PhoNET, existing Web pages are used, allowing users to leverage their Web investment. The software dynamically converts existing pages into audio format, significantly lowering the up-front investment a business must make to allow users to hear and interact with their Web site by phone.
Motivation
The primary method of access today continues to be the computer, which has certain advantages as well as some limitations. Computers offer a visual Internet experience that is usually rich in content. Some basic computer skills and knowledge are needed to access the Internet. But, computer-based access is proving insufficient for the professional on the move. When in the car or away from the office or computer, accessing the Web is difficult, if not impossible. And, an increasing number of people prefer an interface that allows them to hear and speak rather than see and click or type. Some existing Internet users have also identified problems with the visual Internet experience. Pages are increasingly full of graphics, advertisement banners, etc., which move, flash, and blink as they vie for attention. Some find this “information overload” annoying, and lament the delays it creates by severely taxing the available bandwidth.