Automatic speaker recognition

seminar class · 25-02-2011, 12:06 PM

Automatic+Speaker+Recognition+System.doc (Size: 92 KB / Downloads: 87)
ABSTRACT
Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to user's voice to verify their identity and control access to services such as voice dialing, banking by telephone, telephone shopping, database access services, information services, voice mail, security control for confidential information areas, and remote access to computers.
The goal of this project is to build a simple, yet complete and representative automatic speaker recognition system. Due to the limited space, we will only test our system on a very small (but already non-trivial) speech database. There were 8 female speakers, labeled from S1 to S8. All speakers uttered the same single digit "zero" once in a training session and once in a testing session later on. Those sessions are at least 6 months apart to simulate the voice variation over the time. The vocabulary of digit is used very often in testing speaker recognition because of its applicability to many security applications. For example, users have to speak a PIN (Personal Identification Number) in order to gain access to the laboratory door, or users have to speak their credit card number over the telephone line. By checking the voice characteristics of the input utterance, using an automatic speaker recognition system similar to the one that we will develop, the system is able to add an extra level of security.
1. Principles of Speaker Recognition
Speaker recognition can be classified into identification and verification. Speaker identification is the process of determining which registered speaker provides a given utterance. Speaker verification, on the other hand, is the process of accepting or rejecting the identity claim of a speaker. Figure shows the basic structures of speaker identification and verification systems.
Speaker recognition methods can also be divided into text-independent and text-dependent methods. In a text-independent system, speaker models capture characteristics of somebody’s speech which show up irrespective of what one is saying. In a text-dependent system, on the other hand, the recognition of the speaker’s identity is based on his or her speaking one or more specific phrases, like passwords, card numbers, PIN codes, etc.
All technologies of speaker recognition, identification and verification, text-independent and text-dependent, each has its own advantages and disadvantages and may requires different treatments and techniques. The choice of which technology to use is application-specific. The system that we will develop is classified as text-independent speaker identification system since its task is to identify the person who speaks regardless of what is saying.
At the highest level, all speaker recognition systems contain two main modules (refer to Figure ): feature extraction and feature matching. Feature extraction is the process that extracts a small amount of data from the voice signal that can later be used to represent each speaker. Feature matching involves the actual procedure to identify the unknown speaker by comparing extracted features from his/her voice input with the ones from a set of known speakers. We will discuss each module in detail in later sections.
All speaker recognition systems have to serve two distinguish phases. The first one is referred n sessions or testing phase. In the training phase, each registered speaker has to provide samples of their speech so that the system can build or train a reference model for that speaker. In case of speaker verification systems, in addition, a speaker-specific threshold is also computed from the training samples. During the testing (operational) phase (see Figure), the input speech is matched with stored reference model(s) and recognition decision is made.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Rich Internet Application for Weekly Automatic College Timetable Generation	presentation Abstract	1	793	09-09-2017, 02:36 PM Last Post: jaseela123
	Human Recognition Using Multiple Fingerprints	seminar flower	1	1,456	09-09-2017, 11:03 AM Last Post: jaseela123
	voice recognition excel operation	project maker	1	510	30-08-2017, 01:30 PM Last Post: jaseela123
	Speech Recognition	Computer Science Clay	0	10,522,971	25-08-2017, 09:32 PM Last Post: Computer Science Clay
	Automatic Webpage Update Informer	nit_cal	0	11,050,746	25-08-2017, 09:32 PM Last Post: nit_cal
	Face recognition using Laplacianfaces	mechanical engineering crazy	0	16,063,264	25-08-2017, 09:32 PM Last Post: mechanical engineering crazy
	Discriminative Learing and Recognition of Image set classes Using Canonical Correlati	mechanical engineering crazy	0	6,480,258	25-08-2017, 09:32 PM Last Post: mechanical engineering crazy
	AUTOMATIC GENERATION OF C COMPILER BACKENDS FROM MACHINE DESCRIPTIONS	nit_cal	0	6,987,759	25-08-2017, 09:32 PM Last Post: nit_cal
	An Automatic Face Recognition System for Frontal Face Images using Local Binary Patte	mkaasees	0	245	05-11-2016, 11:28 AM Last Post: mkaasees
	Raspberry Pi Face Recognition Treasure Box	mkaasees	0	262	04-11-2016, 03:46 PM Last Post: mkaasees

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.