online intrusion alert aggregation with generative data stream modeling pdf

14-03-2012, 09:37 PM

what is necessity of this project? why the project has been taken up? has this become successful?

seminar paper · 15-03-2012, 10:57 AM

to get information about the topic "online intrusion alert aggregation with generative data stream modeling" full report ppt and related topic refer the link bellow

https://seminarproject.net/Thread-online...m-modeling

17-04-2012, 08:49 PM

i need documentation and coding for online intrusion alert aggregation with generative data stream modeling

17-04-2012, 09:04 PM

i need a documentation and coding for ONLINE INTRUSION ALERT AGGREGATION WITH GENERATIVE DATA STREAM MODELING

**seminar ideas** · 18-04-2012, 10:09 AM

to get information about the topic "online intrusion alert aggregation with generative data stream modeling " full report refer the link bellow

https://seminarproject.net/Thread-online...m-modeling

18-04-2012, 03:11 PM

why we use DARPA dataset in Online Intrusion Alert Aggregation with Generative Data Stream Modeling

lakshmichaitanya · 18-04-2012, 03:20 PM

what is meant by DARPA dataset in Online Intrusion Alert Aggregation with Generative Data Stream Modeling

**seminar flower** · 05-09-2012, 10:04 AM

Online Intrusion Alert Aggregation with Generative Data Stream Modeling

.pdf

Online Intrusion Alert Aggregation.pdf (Size: 2.58 MB / Downloads: 56)

Abstract

Alert aggregation is an important subtask of intrusion detection. The goal is to identify and to cluster different
alerts—produced by low-level intrusion detection systems, firewalls, etc.—belonging to a specific attack instance which has been
initiated by an attacker at a certain point in time. Thus, meta-alerts can be generated for the clusters that contain all the relevant
information whereas the amount of data (i.e., alerts) can be reduced substantially. Meta-alerts may then be the basis for reporting to
security experts or for communication within a distributed intrusion detection system. We propose a novel technique for online alert
aggregation which is based on a dynamic, probabilistic model of the current attack situation. Basically, it can be regarded as a data
stream version of a maximum likelihood approach for the estimation of the model parameters.

INTRODUCTION

INTRUSION detection systems (IDS) are besides other
protective measures such as virtual private networks,
authentication mechanisms, or encryption techniques very
important to guarantee information security. They help to
defend against the various threats to which networks and
hosts are exposed to by detecting the actions of attackers or
attack tools in a network or host-based manner with misuse
or anomaly detection techniques [1].
At present, most IDS are quite reliable in detecting
suspicious actions by evaluating TCP/IP connections or log
files, for instance. Once an IDS finds a suspicious action, it
immediately creates an alert which contains information
about the source, target, and estimated type of the attack
(e.g., SQL injection, buffer overflow, or denial of service). As
the intrusive actions caused by a single attack instance—
which is the occurrence of an attack of a particular type that
has been launched by a specific attacker at a certain point in
time—are often spread over many network connections or
log file entries, a single attack instance often results in
hundreds or even thousands of alerts. IDS usually focus on
detecting attack types, but not on distinguishing between
different attack instances.

RELATED WORK

Most existing IDS are optimized to detect attacks with high
accuracy. However, they still have various disadvantages
that have been outlined in a number of publications and a lot
of work has been done to analyze IDS in order to direct
future research (cf. [5], for instance). Besides others, one
drawback is the large amount of alerts produced. Recent
research focuses on the correlation of alerts from (possibly
multiple) IDS. If not stated otherwise, all approaches
outlined in the following present either online algorithms
or—as we see it—can easily be extended to an online version.
Probably, the most comprehensive approach to alert
correlation is introduced in [6]. One step in the presented
correlation approach is attack thread reconstruction, which can
be seen as a kind of attack instance recognition. No
clustering algorithm is used, but a strict sorting of alerts
within a temporal window of fixed length according to the
source, destination, and attack classification (attack type). In
[7], a similar approach is used to eliminate duplicates, i.e.,
alerts that share the same quadruple of source and
destination address as well as source and destination port.
In addition, alerts are aggregated (online) into predefined
clusters (so-called situations) in order to provide a more
condensed view of the current attack situation. The
definition of such situations is also used in [8] to cluster
alerts. In [9], alert clustering is used to group alerts that
belong to the same attack occurrence. Even though called
clustering, there is no clustering algorithm in a classic sense

ANOVEL ONLINE ALERT AGGREGATION
TECHNIQUE

In this section, we describe our new alert aggregation
approach which is—at each point in time—based on a
probabilistic model of the current situation. To outline the
preconditions and objectives of alert aggregation, we start
with a short sketch of our intrusion framework. Then, we
briefly describe the generation of alerts and the alert format.
We continue with a new clustering algorithm for offline
alert aggregation which is basically a parameter estimation
technique for the probabilistic model. After that, we extend
this offline method to an algorithm for data stream clustering
which can be applied to online alert aggregation. Finally, we
make some remarks on the generation of meta-alerts.

Collaborating Intrusion Detection Agents

In our work, we focus on a system of structurally very
similar so-called intrusion detection (ID) agents. Through
self-organized collaboration, these ID agents form a
distributed intrusion detection system (DIDS).
Fig. 1 outlines the layered architecture of an ID agent:
The sensor layer provides the interface to the network and
the host on which the agent resides. Sensors acquire raw
data from both the network and the host, filter incoming
data, and extract interesting and potentially valuable (e.g.,
statistical) information which is needed to construct an
appropriate event. At the detection layer, different detectors,
e.g., classifiers trained with machine learning techniques
such as support vector machines (SVM) or conventional
rule-based systems such as Snort [24], assess these events
and search for known attack signatures (misuse detection)
and suspicious behavior (anomaly detection).

Alert Generation and Format

In this section, we make some comments on the information
contained in alerts, the objects that must be aggregated, and
on their format. As the concrete content and format depend
on a specific task and on certain realizations of the sensors
and detectors, some more details will be given in Section 4
together with the experimental conditions.
At the sensor layer, sensors determine the values of
attributes that are used as input for the detectors as well as
for the alert clustering module. Attributes in an event that
are independent of a particular attack instance can be used
for classification at the detection layer. Attributes that are
(or might be) dependent on the attack instance can be used in
an alert aggregation process to distinguish different attack
instances. A perfect partition into dependent and independent
attributes, however, cannot be made. Some are clearly
dependent (such as the source IP address which can
identify the attacker), some are clearly independent such
as the destination port which usually is 80 in case of webbased
attacks), and lots are both (such as the destination
port which can be a hint to the attacker’s actual target
service as well as an attack tool specifically designed to
target a particular service only). In addition to the attributes
produced by the sensors, alert aggregation is based on
additional attributes generated by the detectors. Examples
are the estimated type of the attack instance that led to the
generation of the alert (e.g., SQL injection, buffer overflow,
or denial of service), and the degree of uncertainty
associated with that estimate.

Conclusion

The experiments demonstrated the broad applicability of
the proposed online alert aggregation approach. We
analyzed three different data sets and showed that
machine-learning-based detectors, conventional signaturebased
detectors, and even firewalls can be used as alert
generators. In all cases, the amount of data could be
reduced substantially. Although there are situations as
described in Section 3.3—especially clusters that are
wrongly split—the instance detection rate is very high

**study tips** · 22-02-2013, 04:17 PM

Online Intrusion Alert Aggregation with Generative Data Stream Modeling

.pdf

Online Intrusion.pdf (Size: 2.58 MB / Downloads: 32)

Abstract

Alert aggregation is an important subtask of intrusion detection. The goal is to identify and to cluster different
alerts—produced by low-level intrusion detection systems, firewalls, etc.—belonging to a specific attack instance which has been
initiated by an attacker at a certain point in time. Thus, meta-alerts can be generated for the clusters that contain all the relevant
information whereas the amount of data (i.e., alerts) can be reduced substantially. Meta-alerts may then be the basis for reporting to
security experts or for communication within a distributed intrusion detection system. We propose a novel technique for online alert
aggregation which is based on a dynamic, probabilistic model of the current attack situation. Basically, it can be regarded as a data
stream version of a maximum likelihood approach for the estimation of the model parameters. With three benchmark data sets, we
demonstrate that it is possible to achieve reduction rates of up to 99.96 percent while the number of missing meta-alerts is extremely
low. In addition, meta-alerts are generated with a delay of typically only a few seconds after observing the first alert belonging to a new
attack instance.

INTRODUCTION

INTRUSION detection systems (IDS) are besides other
protective measures such as virtual private networks,
authentication mechanisms, or encryption techniques very
important to guarantee information security. They help to
defend against the various threats to which networks and
hosts are exposed to by detecting the actions of attackers or
attack tools in a network or host-based manner with misuse
or anomaly detection techniques [1].
At present, most IDS are quite reliable in detecting
suspicious actions by evaluating TCP/IP connections or log
files, for instance. Once an IDS finds a suspicious action, it
immediately creates an alert which contains information
about the source, target, and estimated type of the attack
(e.g., SQL injection, buffer overflow, or denial of service). As
the intrusive actions caused by a single attack instance—
which is the occurrence of an attack of a particular type that
has been launched by a specific attacker at a certain point in
time—are often spread over many network connections or
log file entries, a single attack instance often results in
hundreds or even thousands of alerts. IDS usually focus on
detecting attack types, but not on distinguishing between
different attack instances.

RELATED WORK

Most existing IDS are optimized to detect attacks with high
accuracy. However, they still have various disadvantages
that have been outlined in a number of publications and a lot
of work has been done to analyze IDS in order to direct
future research (cf. [5], for instance). Besides others, one
drawback is the large amount of alerts produced. Recent
research focuses on the correlation of alerts from (possibly
multiple) IDS. If not stated otherwise, all approaches
outlined in the following present either online algorithms
or—as we see it—can easily be extended to an online version.
Probably, the most comprehensive approach to alert
correlation is introduced in [6]. One step in the presented
correlation approach is attack thread reconstruction, which can
be seen as a kind of attack instance recognition. No
clustering algorithm is used, but a strict sorting of alerts
within a temporal window of fixed length according to the
source, destination, and attack classification (attack type). In
[7], a similar approach is used to eliminate duplicates, i.e.,
alerts that share the same quadruple of source and
destination address as well as source and destination port.
In addition, alerts are aggregated (online) into predefined
clusters (so-called situations) in order to provide a more
condensed view of the current attack situation. The
definition of such situations is also used in [8] to cluster
alerts. In [9], alert clustering is used to group alerts that
belong to the same attack occurrence. Even though called
clustering, there is no clustering algorithm in a classic sense.
The alerts from one (or possibly several) IDS are stored in a
relational database and a similarity relation—which is based
on expert rules—is used to group similar alerts together.
Two alerts are defined to be similar, for instance, if both
occur within a fixed time window and their source and
target match exactly. As already mentioned, these approaches
are likely to fail under real-life conditions with
imperfect classifiers (i.e., low-level IDS) with false alerts or
wrongly adjusted time windows.

Alert Generation and Format

In this section, we make some comments on the information
contained in alerts, the objects that must be aggregated, and
on their format. As the concrete content and format depend
on a specific task and on certain realizations of the sensors
and detectors, some more details will be given in Section 4
together with the experimental conditions.
At the sensor layer, sensors determine the values of
attributes that are used as input for the detectors as well as
for the alert clustering module. Attributes in an event that
are independent of a particular attack instance can be used
for classification at the detection layer. Attributes that are
(or might be) dependent on the attack instance can be used in
an alert aggregation process to distinguish different attack
instances. A perfect partition into dependent and independent
attributes, however, cannot be made. Some are clearly
dependent (such as the source IP address which can
identify the attacker), some are clearly independent such
as the destination port which usually is 80 in case of webbased
attacks), and lots are both (such as the destination
port which can be a hint to the attacker’s actual target
service as well as an attack tool specifically designed to
target a particular service only). In addition to the attributes
produced by the sensors, alert aggregation is based on
additional attributes generated by the detectors. Examples
are the estimated type of the attack instance that led to the
generation of the alert (e.g., SQL injection, buffer overflow,
or denial of service), and the degree of uncertainty
associated with that estimate.

Conclusion

The experiments demonstrated the broad applicability of
the proposed online alert aggregation approach. We
analyzed three different data sets and showed that
machine-learning-based detectors, conventional signaturebased
detectors, and even firewalls can be used as alert
generators. In all cases, the amount of data could be
reduced substantially. Although there are situations as
described in Section 3.3—especially clusters that are
wrongly split—the instance detection rate is very high

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	abstract for online recharge	Guest	1	2,165	14-09-2017, 03:24 PM Last Post: jaseela123
	domotics pdf	Guest	1	3,433	14-09-2017, 12:41 PM Last Post: jaseela123
	ieee pdf blu ray disc	Guest	4	4,761	12-09-2017, 07:44 AM Last Post: suryaelangovan
	applet programming in java pdf	Guest	1	1,994	09-09-2017, 10:48 AM Last Post: jaseela123
	ieee pdf nanoscale materials devices for future communication networks	Guest	1	2,402	28-08-2017, 04:39 PM Last Post: jaseela123
	speed control of dc motor using dac project pdf	santanukandar	1	4,396	26-08-2017, 04:46 PM Last Post: jaseela123
	smart remote data collection project	Guest	1	2,271	26-08-2017, 02:40 PM Last Post: jaseela123
	microcontroller based moving message lcd display pdf	Guest	1	1,674	26-08-2017, 01:41 PM Last Post: jaseela123
	analysis of recent mergers of reliance and bp pdf	Guest	1	1,106	26-08-2017, 01:13 PM Last Post: jaseela123
	tension control brakes seminar report pdf	umeshck	1	3,534	26-08-2017, 12:57 PM Last Post: jaseela123

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.