Learning Bounds for Domain Adaptation
ABSTRACT
Empirical risk minimization offers well-known learning guarantees when training and test data come from the same domain. In the real world, though, we often wish to adapt a classifier from a source domain with a large amount of training data to a different target domain with very little training data. In this work we give uniform convergence bounds for algorithms that minimize a convex combination of source and target empirical risk. The bounds explicitly model the inherent trade-off between training on a large but inaccurate source data set and a small but accurate target training set. Our theory also gives results when we have multiple source domains, each of which may have a different number of instances, and we exhibit cases in which minimizing a non-uniform combination of source risks can achieve much lower target error than standard empirical risk minimization.
INTRODUCTION
Domain adaptation addresses a common situation that arises when applying machine learning to diverse data. We have ample data drawn from a source domain to train a model, but little or no training data from the target domain where we wish to use the model. Domain adaptation questions arise in nearly every application of machine learning. In face recognition systems, training images are obtained under one set of lighting or occlusion conditions, while the recognizer will be used under different conditions. In speech recognition, acoustic models trained on one speaker need to be used by another. In natural language processing, part-of-speech taggers, parsers, and document classifiers are trained on carefully annotated training sets but applied to texts from different genres or styles. We investigate the task of domain adaptation when we have a large amount of training data from a source domain but wish to apply a model in a target domain with a much smaller amount of training data.
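The convex-combination objective described above can be sketched concretely. The following is a minimal illustration, not the paper's algorithm: it trains a logistic-regression model by gradient descent on alpha times the target empirical risk plus (1 - alpha) times the source empirical risk. The synthetic domains (a shifted decision boundary), the choice alpha = 0.5, and the optimizer settings are all assumptions made for demonstration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_loss(w, X, y):
    # Mean logistic loss of the linear model w on labels y in {0, 1}.
    p = sigmoid(X @ w)
    eps = 1e-12
    return float(-np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps)))

def minimize_combined_risk(Xs, ys, Xt, yt, alpha, lr=0.5, steps=2000):
    # Plain gradient descent on alpha * target risk + (1 - alpha) * source risk.
    w = np.zeros(Xs.shape[1])
    for _ in range(steps):
        gs = Xs.T @ (sigmoid(Xs @ w) - ys) / len(ys)  # source risk gradient
        gt = Xt.T @ (sigmoid(Xt @ w) - yt) / len(yt)  # target risk gradient
        w -= lr * (alpha * gt + (1 - alpha) * gs)
    return w

def with_bias(X):
    # Append a constant feature so the model can learn an intercept.
    return np.hstack([X, np.ones((len(X), 1))])

rng = np.random.default_rng(0)
# Large source sample: its decision boundary is shifted relative to the target.
Zs = rng.normal(size=(1000, 2))
Xs, ys = with_bias(Zs), (Zs[:, 0] + 0.5 > 0).astype(float)
# Small target sample: the domain we actually care about.
Zt = rng.normal(size=(20, 2))
Xt, yt = with_bias(Zt), (Zt[:, 0] > 0).astype(float)

w = minimize_combined_risk(Xs, ys, Xt, yt, alpha=0.5)
```

Sweeping alpha trades the two sources of error against each other: alpha near 0 leans on the plentiful but shifted source data, while alpha near 1 trusts only the small target sample.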
Our main result is a uniform convergence learning bound for algorithms that minimize convex combinations of source and target empirical risk. The bound reflects the trade-off between the size of the source data and the accuracy of the target data, and we give a simple approximation to it that is computable from finite labeled and unlabeled samples. This approximation makes correct predictions about model test error for a sentiment classification task. Our theory also extends in a straightforward manner to a multi-source setting, which we believe helps to explain the success of recent empirical work in domain adaptation.
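The multi-source setting can likewise be sketched: minimize a non-uniform weighted combination of per-source empirical risks, with the weights on the probability simplex. This is only an illustrative toy, not the paper's construction; the shared linear model with squared loss, the three synthetic sources of different sizes and noise levels, and the particular weights are all assumed for the example.

```python
import numpy as np

def weighted_multisource_fit(sources, betas, lr=0.1, steps=500):
    # Gradient descent on sum_j betas[j] * (squared-error risk on source j).
    # sources: list of (X, y) pairs, one per source domain.
    assert np.isclose(sum(betas), 1.0)  # weights lie on the simplex
    d = sources[0][0].shape[1]
    w = np.zeros(d)
    for _ in range(steps):
        grad = np.zeros(d)
        for (X, y), b in zip(sources, betas):
            grad += b * X.T @ (X @ w - y) / len(y)  # beta_j * grad of risk_j
        w -= lr * grad
    return w

rng = np.random.default_rng(1)
w_true = np.array([1.0, -2.0])

def make_source(n, noise):
    # Synthetic source: same underlying function, different size and noise.
    X = rng.normal(size=(n, 2))
    return X, X @ w_true + noise * rng.normal(size=n)

# Three sources; up-weighting the cleaner, more target-like ones can beat
# pooling all instances uniformly, as the multi-source results suggest.
sources = [make_source(500, 1.0), make_source(200, 0.3), make_source(50, 0.1)]
w = weighted_multisource_fit(sources, betas=[0.2, 0.4, 0.4])
```

Uniform pooling would implicitly weight each source by its instance count; choosing non-uniform weights instead lets the small, low-noise sources dominate the objective.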