Latent Dirichlet Allocation

**study tips** · 08-06-2013, 01:02 PM

Latent Dirichlet Allocation

ABSTRACT

We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian model, in which each item of a collection is modeled as a finite mixture over an underlying set of topics. Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities. In the context of text modeling, the topic probabilities provide an explicit representation of a document. We present efficient approximate inference techniques based on variational methods and an EM algorithm for empirical Bayes parameter estimation. We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model. We consider the problem of modeling text corpora and other collections of discrete data. The goal is to find short descriptions of the members of a collection that enable efficient processing of large collections while preserving the essential statistical relationships that are useful for basic tasks such as classification, novelty detection, summarization, and similarity and relevance judgments LDA is based on a simple exchangeability assumption for the words and topics in a document; it is therefore realized by a straightforward application of de Finetti’s representation theorem. We can view LDA as a dimensionality reduction technique, in the spirit of LSI, but with proper underlying generative probabilistic semantics that make sense for the type of data that it models. Exact inference is intractable for LDA, but any of a large suite of approximate inference algorithms can be used for inference and parameter estimation within the LDA framework.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Dynamic Memory Allocation	presentation Abstract	0	311	26-05-2015, 03:06 PM Last Post: presentation Abstract
	Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment	seminar code	0	443	28-08-2014, 10:42 AM Last Post: seminar code
	Dynamic Memory Allocation : Seminar Report and PPT	seminar projects maker	0	285	09-06-2014, 04:23 PM Last Post: seminar projects maker
	Joint Routing and Spectrum Allocation for Multi-Hop Cognitive Radio Networks	seminar post	0	297	07-05-2014, 11:04 AM Last Post: seminar post
	Memory Allocation	seminar post	0	343	14-04-2014, 04:56 PM Last Post: seminar post
	Report on Exploiting Dynamic Resource Allocation for Efficient Parallel Data	study tips	0	894	21-08-2013, 04:36 PM Last Post: study tips
	Multi-Path Routing and Rate Allocation for Multi-Source Video On-Demand Streaming pdf	study tips	0	466	30-07-2013, 02:41 PM Last Post: study tips
	Distributed Bees Algorithm for Task Allocation in Swarm of Robots pdf	study tips	0	539	10-06-2013, 03:44 PM Last Post: study tips
	An SMDP-Based Service Model for Inter domain Resource allocation in mobile Cloud ppt	study tips	0	902	06-05-2013, 04:23 PM Last Post: study tips
	INTERDOMAIN RESOURCE ALLOCATION BASED ON SMDP MODEL REPORT	study tips	0	816	01-03-2013, 11:23 AM Last Post: study tips

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.