data mining full report

**seminar flower** · 22-08-2012, 03:28 PM

Report Project Datamining

.pdf

datamining_2.pdf (Size: 1.16 MB / Downloads: 58)

Introduction

Given a set of images humans generally have no problems making sense of the contents,
recognizing the objects displayed and using associations to classify them in categories.
Automation of all but the simplest of these tasks is daunting but very much needed when looking
for specific types of images in large databases (e.g. the Internet).
We have built a system that is capable of retrieving images while taking into account the
feedback of the user requesting the images. In the following chapters the workings of the system
and the process to this implementation will be explained.

Problem description

The system retains a collection of images each of which is a member of a class (e.g. birds,
buildings, flowers).
The user selects an image from the collection and inputs it to the system. The system then returns
those images which it thinks are most similar to the given image (and its associated class).
After this first step the user gives feedback to the system marking those images which are from
the correct class. The system uses this feedback to improve the results in the next iteration.
These iterations continue until an acceptable number of correct images has been returned or a
specified maximum number of iterations have taken place

Graphical User Interface

The Graphical User Interface (GUI) is an important part of our program, because in this
assignment, the user has to select the images that are relevant for his or her query. We chose to
use Java for the GUI, because the Matlab GUI development tool (GUIDE) is not suitable for
serious GUI building.

Relevance Feedback

The first method we tried was based on Rocchio's formula. It starts out by representing all
documents as points in an n-dimensional space, the image-space D. The query-image, is
represented as a vector, Q. After the initial run D is split into a set of relevant images and
irrelevant images, Dr and Dirr respectively. It then updates the query-vector Q using the following
formula:
Q* = Q + α Σ Dr – β Σ Dirr
After some experimentation this method was rejected. It didn't produce the results we'd hoped
for. This algorithm uses a purely nearest neighbor approach where the query point gets bumped
through the image-space at each iteration. This does not work if the classes are not well separated
in the feature space.
The next method had an individual weight for each feature individually instead of for each image
as a whole. Based on the relevant and the irrelevant images a feature-weight vector was updated
to reflect the relevance or irrelevance of certain features for the current requested image.
The idea was to have the algorithm focus on those features which were important to the
classification problem at hand and ignore features that were not important. This approach was
promising but did not provide the desired results and left us stranded.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	JPEG Decoder Design Report	seminar flower	1	3,349	25-03-2018, 06:49 PM Last Post: magnumZ
	PIPELINED 2D DCT FOR JPEG IMAGE COMPRESSION REPORT	project girl	1	2,867	25-03-2018, 06:37 PM Last Post: magnumZ
	A Novel Data Embedding Method Using Adaptive Pixel Pair Matching Report	project girl	3	4,489	15-01-2018, 01:56 PM Last Post: dhanabhagya
	SURA Project Monitors Report	seminar projects maker	1	1,177	21-09-2017, 01:30 PM Last Post: jaseela123
	RIA based E- Shopping Portal for Electronic Gadgets Report	study tips	1	1,588	21-09-2017, 01:25 PM Last Post: jaseela123
	THREE DIMENSIONAL PASSWORD FOR MORE SECURE AUTHENTICATION A MAIN PROJECT REPORT	study tips	1	1,300	21-09-2017, 12:56 PM Last Post: jaseela123
	CONSUMER PERCEPTION TOWARDS ONLINE GROCERY STORES” PROJECT REPORT	project maker	1	1,966	21-09-2017, 12:41 PM Last Post: jaseela123
	Training and Placement Cell Management Report	study tips	1	1,838	20-09-2017, 04:05 PM Last Post: jaseela123
	Factory Time Attendance Management System Report	study tips	1	2,638	20-09-2017, 01:19 PM Last Post: jaseela123
	Detecting False Data in Wireless Sensor Network using Efficient Becan Scheme	seminar tips	1	3,235	20-09-2017, 01:03 PM Last Post: jaseela123

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.