Design of a Speaker Recognition Code using MATLAB

**seminar ideas** · 20-07-2012, 02:31 PM

Design of a Speaker Recognition Code using MATLAB

.pdf

Design of a Speaker Recognition.PDF (Size: 236.54 KB / Downloads: 121)

INTRODUCTION

Development of speaker identification systems began as early as the 1960s with
exploration into voiceprint analysis, where characteristics of an individual’s voice were
thought to be able to characterize the uniqueness of an individual much like a fingerprint.
The early systems had many flaws and research ensued to derive a more reliable method
of predicting the correlation between two sets of speech utterances. Speaker
identification research continues today under the realm of the field of digital signal
processing where many advances have taken place in recent years.

APPROACH

This multi faceted design project can be categorized into different sections:
speech editing, speech degradation, speech enhancement, pitch analysis, formant analysis
and waveform comparison. The resulting discussion will be segmented based on these
delineations.

SPEECH EDITING

The file recorded with my slower speech (a17.wav) was found from the ordered
list of speakers. A plot of this file is shown in Figure (1). It was determined that the
length of the vector representing this speech file had a magnitude of 30,000. Thus the
vector was partitioned into two separate vectors of equal length and the vectors were
written to a file in opposite order. The file was then read and played back. The code for
this process can be found in Appendix A.

SPEECH DEGRADATION

The file recorded with my faster speech (a18.wav) was found from the ordered list
of speakers. Speech degradation was performed by adding Gaussian noise generated by
the MATLAB function randn() to this file. A comparison was then made between the
clean file and the signal with the addition of Gaussian noise. The code for this process
can be found in Appendix B.

PITCH ANALYSIS

The file recorded with my slower speech (a17.wav) was found from the ordered
list of speakers. Pitch analysis was conducted and relevant parameters were extracted.
The average pitch of the entire wav file was computed and found to have a value of
154.8595 Hz. The graph of pitch contour versus time frame was also created to see how
the pitch varies over the wav file, Figure (3). The results of pitch analysis can be used in
speaker recognition, where the differences in average pitch can be used to characterize a
speech file. The code for this process can be found in Appendix D.

FORMANT ANALYSIS

Formant analysis was performed on my slow speech file (a17.wav). The first five
peaks in the power spectral density were returned and the first three can be seen in Figure
(4). Also, the vector position of the peaks in the power spectral density were calculated
and can be used to characterize a particular voice file. This technique is used in the
waveform comparison section. The code for this process can be found in Appendix E.

WAVEFORM COMPARISON

Using the results and information learned from pitch and formant analysis, a
waveform comparison code was written. Speech waveform files can be characterized
based on various criteria.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	JPEG Decoder Design Report	seminar flower	1	3,349	25-03-2018, 06:49 PM Last Post: magnumZ
	A Novel Data Embedding Method Using Adaptive Pixel Pair Matching Report	project girl	3	4,489	15-01-2018, 01:56 PM Last Post: dhanabhagya
	Design and Implementation of High-Performance FPGA Signal Processing Datapaths	seminar class	1	300,513	20-09-2017, 01:31 PM Last Post: jaseela123
	Detecting False Data in Wireless Sensor Network using Efficient Becan Scheme	seminar tips	1	3,235	20-09-2017, 01:03 PM Last Post: jaseela123
	A Design & Implementation of Collision Avoidance System (CAS) for Automobiles	seminar flower	1	1,142	19-09-2017, 04:25 PM Last Post: jaseela123
	Color Image Indexing Using BTC	seminar tips	1	1,436	19-09-2017, 02:52 PM Last Post: jaseela123
	Mobile Messenger Using Ad-hoc Networks	seminar code	1	682	19-09-2017, 02:50 PM Last Post: jaseela123
	System Analysis (Modeling of the Existing and Proposed System using OOD)	seminar flower	1	2,459	15-09-2017, 03:39 PM Last Post: jaseela123
	DESIGN AND PERFORMANCE ANALYSIS OF OPTICAL CDMA SYSTEM USING NEWLY DESIGNED MULTIWAVE	project girl	1	1,270	15-09-2017, 01:34 PM Last Post: jaseela123
	Secure Online Examination Management using XML full report	seminar class	1	294,875	14-09-2017, 12:51 PM Last Post: jaseela123

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.