COMPUTER NETWORKS RESEARCH LAB

Department of Electrical and Computer Engineering, McGill University

NAVIGATION

Home
People
Photos

RESEARCH

Projects
Publications

LINKS

AAPN
MITACS
McGill TSP
McGill ECE

LOCAL ACCESS

Local Info

Project Description Return to Network Monitoring Projects

Detailed Description of the Algorithm

Possible Applications:
The algorithm is intended to operate on traffic statistics collected (in distributed manner) at core nodes of large backbone networks, such as the 11 core routers of the Abilene network.

At each timestep t, the algorithm requires as inputs a traffic measurement vector x_t and an associated scalar y_t. An example choice for x_t is the flow vector (the number of packets or bytes in each source-destination flow) and for y_t the total number of packets in the network, as recorded during the measurement interval corresponding to timestep t.

Kernel Recursive Least Squares

The recursive least squares algorithm is a popular method of obtaining linear predictors of a data sequence. It is suitable for online learning scenarios as it observes input samples sequentially and has modest storage and computational requirements. It does not need to store historical data and its computational cost per timestep is independent of time.

Kernel machines use a kernel mapping function to produce non-linear and non-parametric learning algorithms [4]. The idea behind kernel machines is that a suitable kernel function, when applied to a pair of input data vectors, may be interpreted as an inner product in a high dimensional Hilbert space known as the feature space [3]. This allows inner products in the feature space (inner products of the feature vectors) to be computed without explicit knowledge of the feature vectors themselves, by simply evaluating the kernel function:

where x_i , x_j denote input vectors and j represents the mapping onto the feature space.

Popular kernel functions include the Gaussian kernel with variance s²:

and the polynomial kernel of degree d:

A special case of the polynomial kernel is the linear kernel:

Note that the linear kernel is simply the inner product (dot product) of its arguments.

The Kernel Recursive Least Squares (KRLS) algorithm combines the principles of recursive least squares and kernel machines, providing an efficient and non-parametric approach for performing online data mining and anomaly diagnosis.

The Key Idea:

The objective of the Kernel-based Online Anomaly Detection (KOAD) algorithm is to build a dictionary of input vectors, such that the mapping of the input vectors onto the feature space forms an approximately linearly independent basis in the feature space [3].

Fig.1: Simplified depiction of space spanned by 2 sample dictionary elements, D{1} and D{2}. d is a distance metric, and n₁, n₂ are thresholds where n₁ < n₂. When d > n₂, we infer that the current observation is too far away from the space of normality as defined by the dictionary, and instantly declare a red alarm; when n₁ < d < n₂, we raise an orange alarm, keep track of the usefulness of the arriving x_t for a short interim period, and then take a firm decision on it.

Outline of KRLS Anomaly Detection Algorithm:

The algorithm proceeds by evaluating the degree of linearly dependence of the feature vector at each timestep t, j(x(t)), on the current dictionary. The algorithm evaluates d_t, the error in projecting the current feature vector onto the dictionary, according to:

where x₁...x_m represent members of the dictionary at timestep t. The value of d_t is then compared with the thresholds n₁ and n₂ to initiate the immediate relevant response, as shown in Fig.2. A complete description of the algorithm is provided in the associated technical report [6].

Fig.2: Overview of KRLS online anomaly detection algorithm. Detailed description is provided in [6].