Analysis of semi supervised learning methods towards multi label text classification

**seminar tips** · 19-11-2012, 05:53 PM

Analysis of semi supervised learning methods towards multi label text classification

.pdf

Analysis of semi supervised learning methods.pdf (Size: 271.99 KB / Downloads: 36)

ABSTRACT

The area of multi label text classification is getting more attention of researchers because of its role in the field of information retrieval , text mining , web mining etc. Supervised methods from machine learning are mainly used for its realization. But as it needs labeled data for classification all the time , semi supervised methods are now a day getting popular in the MLTC domain. The goal of Semi supervised learning is to reduce the classification errors using readily available unlabeled data in conjunction with available labeled data. This paper mainly provides survey and analysis of various semi supervised methods used in multi label text classification task ; This overview concludes that consideration of semantic aspects of input document datasets , their representation in conjunction with smoothness and manifold assumptions in semi supervised learning may give more relevant classification results.

INTRODUCTION
Recently the area of Multi label text classification has attracted significant attention from lot of researchers, as playing a crucial role in many applications such as web page classification , classification of news articles, information retrieval etc[6].Generally Supervised methods are used in working principle of multi label classification. But in real practice availability of labeled data is rare and that of unlabeled data is plenty [9]. Major limitation of existing supervised algorithms for multi label text classifiers is that they need labeled training data to learn accurately[9][10]. But acquisition of labeled training data is not as easy as that of getting unlabeled data. We need human intervention to label the given text document which is not only time consuming but error prone also [14]. This demands other sources of information that can reduce the need for labeled data. So now a day many researchers are looking towards semi supervised learning as promising solution to the give problem.

OVERVIEW OF MULTI LABEL TEXT CLASSIFICATION

The goal of text classification system is to determine the correct class of a new text document based on some training examples. Thus consideration of semi supervised machine learning method for building text classifier is an interesting area for research. Some of the research in the area of text classification focuses on some specific properties of text data. One such a property is its multi-labelity [3]. Multi-label text classification system is one key domain in this research area. Multi-label classification studies the problem in which a data instance can have multiple labels [4]. Semi supervised methods for text classification is also present in the literature. But very few techniques are available for solving multi-label text classification problem.

Expectation Maximization (EM) based text classification.

Nigam and Mccallum [9] developed this algorithm in 1999. It was very popular attempt to introduce semi supervised learning for text document classification. In this technique the authors have proposed updation in the basic EM technique by considering unlabeled data as incomplete data as it is coming without labels. EM is a class of iterative algorithms for max. Likelihood or max. a posteriori estimation in problems with incomplete data.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	DNSSEC (A Protocol towards securing the Internet Infrastructure)	Computer Science Clay	3	64,035,515	25-05-2018, 04:02 PM Last Post: pttytopa8058
	Design and Analysis Of Algorithms : Seminar Report and PPT	seminar projects maker	1	1,315	21-09-2017, 12:04 PM Last Post: jaseela123
	Software Test Factory (A proposal of a process model to create a Test Factory) Semi	seminar code	1	680	15-09-2017, 01:25 PM Last Post: jaseela123
	MAC Protocol for Reliable Multicast over Multi-Hop Wireless Ad Hoc Networks pdf	study tips	1	1,029	15-09-2017, 12:39 PM Last Post: jaseela123
	A Decision Support System to improve e-Learning Environments pdf	project girl	1	1,265	09-09-2017, 09:33 AM Last Post: jaseela123
	E-learning With Virtual Classroom	seminar paper	1	4,026	06-09-2017, 03:18 PM Last Post: jaseela123
	COMPUTER FORENSIC ANALYSIS PPT	seminar projects maker	1	815	31-08-2017, 04:53 PM Last Post: jaseela123
	OBJECT ORIENTED ANALYSIS AND DESIGN TWO MARK AND SIXTEEN MARK Q and A	seminar ideas	1	1,982	29-08-2017, 11:23 AM Last Post: jaseela123
	Dragging the world towards wireless galaxy Report	study tips	0	1,210	25-08-2017, 09:32 PM Last Post: study tips
	A Multi-Dimensional Approach to Internet Security	Electrical Fan	0	9,063,061	25-08-2017, 09:32 PM Last Post: Electrical Fan

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.