Intrusion Detection Using Neural Networks and Support Vector Machine


1

INTRUSION DETECTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINE

Srinivas Mukkamala, Guadalupe Janoski, Andrew Sung
Dept. of CS, New Mexico Institute of Mining and Technology

IEEE WCCI IJCNN 2002, World Congress on Computational Intelligence, International Joint Conference on Neural Networks

2

Outline

Approaches to intrusion detection using neural networks and support vector machines
DARPA dataset
Neural Networks
Support Vector Machines
Experiments
Conclusion and Comments

3

Approaches

Key ideas are to discover useful patterns or features that describe user behavior on a system, and to use the set of relevant features to build classifiers that can recognize anomalies and known intrusions.

Neural networks and support vector machines are trained with normal user activity and attack patterns; significant deviations from normal behavior are flagged as attacks.

4

DARPA Data for Intrusion Detection

DARPA (Defense Advanced Research Projects Agency): an agency of the US Department of Defense responsible for the development of new technology for use by the military.

The benchmark comes from a KDD (Knowledge Discovery and Data Mining) competition designed by DARPA.

Attacks fall into four main categories:
DOS: denial of service
R2L: unauthorized access from a remote machine
U2R: unauthorized access to local super user (root) privileges
Probing: surveillance and other probing

5

Features

http://kdd.ics.uci.edu/databases/kddcup99/task.html

6

Neural Networks

[Figure: a biological neuron. The dendrites gather incoming signals, the soma (cell body) combines the signals and decides whether to trigger, and the axon sends the output signal.]

7

Divide and Conquer

[Figure: a single perceptron. INPUTs x1 and x2 are combined with WEIGHTs w1 and w2 and threshold θ; the ACTIVATION tests which side of the line in the plane w1x1 + w2x2 − θ = 0 a point falls on, separating points A, B, C, D.]

[Figure: one line cannot separate all four points, so three perceptrons N1, N2, N3 are combined. A table lists the ±1 outputs of N1, N2, N3 for each of the points A, B, C, D, and a final Σ unit combines out1, out2, out3 into the overall classification.]
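A minimal sketch of this divide-and-conquer combination (the weights and thresholds below are illustrative assumptions, not the slide's exact values):

```python
import numpy as np

def perceptron(x, w, theta):
    """One unit: the sign of w1*x1 + w2*x2 - theta, i.e. which side of the line."""
    return 1 if np.dot(w, x) - theta > 0 else -1

def network(x):
    # Three first-layer perceptrons, each drawing one line in the plane
    # (weights and thresholds here are illustrative only).
    out1 = perceptron(x, np.array([1.0, 0.0]), 0.0)    # N1
    out2 = perceptron(x, np.array([0.0, 1.0]), 0.0)    # N2
    out3 = perceptron(x, np.array([-1.0, -1.0]), 0.0)  # N3
    # A final unit combines the three +/-1 outputs into one decision.
    return perceptron(np.array([out1, out2, out3]), np.ones(3), 0.5)

print(network(np.array([1.0, 2.0])))   # -> 1
```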

8

Feed Forward Neural Network (FFNN)

[Figure: a network with Layer 1, Layer 2, Layer 3, Layer 4. In Layer 1, neuron N1 combines inputs x0(0), x1(0), x2(0) through weights w01(1), w11(1), w21(1) into the sum S1(1) and emits x1(1).]

In general, neuron Nj in layer l computes the cumulated signal

Sj(l) = Σ_{i=0}^{d(l-1)} wij(l) xi(l-1)

and passes it through the hyperbolic function

tanh(S) = (e^S − e^(−S)) / (e^S + e^(−S))

to get the activated output

xj(l) = tanh(Sj(l))

We decide the architecture; the weights are determined automatically.
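A minimal sketch of this forward computation, assuming the usual convention of a constant bias input x0 = 1 in every layer:

```python
import numpy as np

def forward(x, weights):
    """Forward pass of an FFNN. weights[l] has shape (1 + d_{l-1}, d_l);
    its row 0 holds the bias weights w_0j, and x_0 is fixed to 1."""
    xs = [np.asarray(x, dtype=float)]
    for W in weights:
        S = np.concatenate(([1.0], xs[-1])) @ W   # cumulated signals S_j(l)
        xs.append(np.tanh(S))                     # activated outputs x_j(l)
    return xs                                     # activations of every layer
```

For a 41-40-40-1 network like the one in the experiments, the weight matrices would have shapes (42, 40), (41, 40), and (41, 1).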

9

[Figure: input flows through layers of Σ units to the output; g(x) is the classifier composed of the weights w.]

Training data: {(xn, yn)}, n = 1, …, N

Error function:

E(w) = (1/N) Σ_{n=1}^{N} (g(xn) − yn)²

How to minimize E(w)? Stochastic Gradient Descent (SGD):

w is a random small value at the beginning
for T iterations:
    wnew ← wold − η · ∇w(En)   (η is the learning rate)
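A minimal SGD loop matching this update rule (grad_En is a placeholder assumption for the per-example gradient):

```python
import numpy as np

def sgd(grad_En, dim, N, eta=0.1, T=1000):
    """grad_En(w, n): gradient of the error on training example n.
    eta is the learning rate, T the number of iterations."""
    rng = np.random.default_rng(0)
    w = rng.normal(scale=0.01, size=dim)   # w starts as random small values
    for _ in range(T):
        n = rng.integers(N)                # visit one stochastic example
        w = w - eta * grad_En(w, n)        # w_new <- w_old - eta * grad(E_n)
    return w
```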

10

Back Propagation Algorithm

[Figure: layers 1, 2, …, L−1, L; neuron Nj in layer l computes Sj(l) from inputs xi(l-1) through weights wij(l) and outputs xj(l).]

Forward: for l = 1, 2, …, L compute Sj(l) and xj(l):

Sj(l) = Σ_{i=0}^{d(l-1)} wij(l) xi(l-1),   xj(l) = tanh(Sj(l))

For one example, the error of the single-output network is

E = (x1(L) − y)² = (tanh(S1(L)) − y)² = (tanh(Σ_i wi1(L) xi(L-1)) − y)²

By the chain rule,

∂E/∂wi1(L) = ∂E/∂S1(L) · ∂S1(L)/∂wi1(L) = 2(tanh(S1(L)) − y)(1 − tanh²(S1(L))) · xi(L-1) = δ1(L) xi(L-1)

In general,

∂E/∂wij(l) = δj(l) xi(l-1),   where δi(l-1) = Σ_j δj(l) wij(l) (1 − tanh²(Si(l-1)))

Backward: for l = L, L−1, …, 1 compute δi(l).
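A minimal sketch of one backprop training step on a single example, under the same bias convention as the forward pass above (single-output tanh network, squared error):

```python
import numpy as np

def backprop_step(x, y, weights, eta=0.1):
    """One SGD step on example (x, y). weights[l] has shape
    (1 + d_{l-1}, d_l); row 0 is the bias weight."""
    # forward: store the x_j(l) of every layer
    xs = [np.asarray(x, dtype=float)]
    for W in weights:
        xs.append(np.tanh(np.concatenate(([1.0], xs[-1])) @ W))
    # delta at the output layer: dE/dS_1(L) for E = (x_1(L) - y)^2
    delta = 2.0 * (xs[-1] - y) * (1.0 - xs[-1] ** 2)
    # backward: dE/dw_ij(l) = delta_j(l) * x_i(l-1)
    for l in range(len(weights) - 1, -1, -1):
        x_prev = np.concatenate(([1.0], xs[l]))
        grad = np.outer(x_prev, delta)
        # propagate delta one layer back, dropping the bias row (no delta for x_0)
        delta = (weights[l][1:] @ delta) * (1.0 - xs[l] ** 2)
        weights[l] = weights[l] - eta * grad
    return weights
```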

11

Feed Forward NNet

[Figure: a feed-forward network of Σ units connected by weights, layers 1 … L; neuron Nj in layer l computes Sj(l) from inputs xi(l-1) through weights wij(l).]

Consists of layers 1, 2, …, L
wij(l) connects neuron i in layer (l−1) to neuron j in layer l
Cumulated signal: Sj(l) = Σ_{i=0}^{d(l-1)} wij(l) xi(l-1)
Activated output: xj(l) = θ(Sj(l)), where θ is often tanh

Minimize E(w) and determine the weights automatically:
SGD (Stochastic Gradient Descent):
    w is a random small value at the beginning
    for T iterations: wnew ← wold − η · ∇w(En)
Gradient by back propagation: ∂E/∂wij(l) = δj(l) xi(l-1)
    Forward: compute Sj(l) and xj(l)
    Backward: compute δi(l)
Stop when the desired error rate is met.

12

Support Vector Machine

A supervised learning method
Known as the maximum margin classifier
Finds the max-margin separating hyperplane

13

SVM – hard margin

[Figure: in the (x1, x2) plane, the hyperplane <w, x> − θ = 0 separates the two classes; the margin boundaries <w, x> − θ = +1 and <w, x> − θ = −1 lie a distance 2/∥w∥ apart.]

max_{w, θ} 2/∥w∥   subject to   yn(<w, xn> − θ) ≥ 1

equivalently

argmin_{w, θ} (1/2)<w, w>   subject to   yn(<w, xn> − θ) ≥ 1

14

Quadratic programming

The standard QP form:

V* = argmin_v (1/2) Σ_i Σ_j aij vi vj + Σ_i bi vi   subject to   Σ_i rki vi ≥ qk

V* ← quadprog(A, b, R, q)

The SVM problem:

argmin_{w, θ} (1/2) Σ_{d=1}^{D} wd²   subject to   (−yn) θ + Σ_{d=1}^{D} yn (xn)d wd ≥ 1

Let V = [θ, w1, w2, …, wD]. Adapt the problem for quadratic programming: find A, b, R, q and put them into the quadratic solver.

15

Adaptation

V = [θ, w1, w2, …, wD], i.e. v0, v1, v2, …, vD

The objective (1/2) Σ_{d=1}^{D} wd² and the constraints (−yn) θ + Σ_{d=1}^{D} yn (xn)d wd ≥ 1 give:

A of size (1+D)×(1+D): a00 = 0, a0j = 0, ai0 = 0; for i, j ≠ 0: aij = 1 if i = j, 0 if i ≠ j
b of size (1+D)×1: b0 = 0 and bi = 0 for i ≠ 0
R of size N×(1+D): rn0 = −yn and rnd = yn (xn)d for d > 0
q of size N×1: qn = 1
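A minimal sketch of this adaptation, assuming the cvxopt QP solver in place of the slide's generic quadprog(A, b, R, q) interface (cvxopt minimizes (1/2)v'Pv + p'v subject to Gv ≤ h, so R v ≥ q is passed as −R v ≤ −q):

```python
import numpy as np
from cvxopt import matrix, solvers

def hard_margin_svm(X, y):
    """X: (N, D) inputs, y: (N,) labels in {-1, +1}. Returns (theta, w)."""
    X, y = np.asarray(X, float), np.asarray(y, float)
    N, D = X.shape
    P = np.zeros((1 + D, 1 + D))
    P[1:, 1:] = np.eye(D)       # objective (1/2) sum_d w_d^2; theta unpenalized
    P[0, 0] = 1e-8              # tiny ridge so the solver sees a definite P
    p = np.zeros(1 + D)
    R = np.hstack([-y[:, None], y[:, None] * X])  # r_n0 = -y_n, r_nd = y_n (x_n)_d
    q = np.ones(N)
    sol = solvers.qp(matrix(P), matrix(p), matrix(-R), matrix(-q))
    v = np.array(sol['x']).ravel()   # v = [theta, w_1, ..., w_D]
    return v[0], v[1:]
```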

16

SVM – soft margin

Allow possible training errors.

Tradeoff c:
Large c: thinner margin, cares more about errors
Small c: thicker margin, cares less about errors

argmin_{w, θ} (1/2)<w, w> + c Σ_n ξn   subject to   yn(<w, xn> − θ) ≥ 1 − ξn,  ξn ≥ 0

(the ξn are the errors; c is the tradeoff)

17

Adaptation

V = [θ, w1, w2, …, wD, ξ1, ξ2, …, ξN]

A: (1+D+N)×(1+D+N)
b: (1+D+N)×1
R: (2N)×(1+D+N)
q: (2N)×1

18

Primal form and Dual form

Primal form:

argmin_{w, θ} (1/2)<w, w> + c Σ_n ξn   subject to   yn(<w, xn> − θ) ≥ 1 − ξn,  ξn ≥ 0

Variables: 1+D+N; Constraints: 2N

Dual form:

argmin_{0 ≤ αn ≤ C} (1/2) Σ_n Σ_m αn yn αm ym <xn, xm> − Σ_n αn   subject to   Σ_n yn αn = 0

Variables: N; Constraints: 2N+1

19

Dual form SVM

Find the optimal α*, then use α* to solve for w* and θ.

αn = 0: correctly classified, or on the margin
0 < αn < C: exactly on the margin (free support vector)
αn = C: misclassified, or on the margin

[Figure: the margin boundaries with support vectors highlighted: free SVs (0 < αn < C) lie on the margin, αn = C points lie on or inside it, αn = 0 points lie outside.]
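A minimal sketch of this recovery step, assuming alpha is the output of a dual QP solver (array names are illustrative):

```python
import numpy as np

def recover_w_theta(alpha, X, y, C, tol=1e-6):
    """Recover the primal solution from the dual optimum alpha.
    w* = sum_n alpha_n y_n x_n; theta comes from any free SV
    (0 < alpha_s < C), since y_s(<w, x_s> - theta) = 1 there
    implies theta = <w, x_s> - y_s."""
    w = (alpha * y) @ X
    free = np.where((alpha > tol) & (alpha < C - tol))[0]
    s = free[0]                    # any free support vector
    theta = X[s] @ w - y[s]
    return w, theta
```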

20

Nonlinear SVM

Nonlinear mapping X → Φ(X), e.g. {(x)1, (x)2} ∈ R² → {1, (x)1, (x)2, (x)1², (x)2², (x)1(x)2} ∈ R⁶

This needs the kernel trick: the dual only uses inner products,

argmin_{0 ≤ αn ≤ C} (1/2) Σ_n Σ_m αn yn αm ym <Φ(xn), Φ(xm)> − Σ_n αn   subject to   Σ_n yn αn = 0

and for this Φ the inner product can be computed directly as the polynomial kernel (1 + <xn, xm>)².
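A minimal sketch of the kernelized dual as a QP, again assuming the cvxopt solver:

```python
import numpy as np
from cvxopt import matrix, solvers

def dual_svm(X, y, C, kernel=lambda a, b: (1.0 + a @ b) ** 2):
    """Dual soft-margin SVM with a kernel; the default is the slide's
    polynomial kernel (1 + <xn, xm>)^2. Returns the optimal alpha."""
    X, y = np.asarray(X, float), np.asarray(y, float)
    N = len(y)
    K = np.array([[kernel(X[n], X[m]) for m in range(N)] for n in range(N)])
    P = matrix(np.outer(y, y) * K)                  # quadratic term
    p = matrix(-np.ones(N))                         # linear term: - sum_n alpha_n
    G = matrix(np.vstack([-np.eye(N), np.eye(N)]))  # box 0 <= alpha_n <= C
    h = matrix(np.hstack([np.zeros(N), C * np.ones(N)]))
    A = matrix(y.reshape(1, N))                     # equality: sum_n y_n alpha_n = 0
    b = matrix(0.0)
    return np.array(solvers.qp(P, p, G, h, A, b)['x']).ravel()
```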

21

Experiments

Automated parsers process the raw TCP/IP dump data into machine-readable form.
7312 training records (different types of attacks and normal data), each with 41 features.
6980 testing records to evaluate the classifiers.

Pipeline: Pre-processing → Training → Testing

              Support Vector Machine      Neural Network
Details       RBF kernel, C = 1000,       3-layer 41-40-40-1 FFNN,
              204 support vectors         scaled conjugate gradient descent,
              (29 free)                   desired error rate = 0.001
Accuracy      99.5%                       99.25%
Time spent    17.77 sec                   18 min
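For comparison, a rough modern equivalent of this setup could be sketched with scikit-learn (an assumption, not the paper's implementation; scikit-learn's MLP does not offer scaled conjugate gradient, so a different optimizer stands in):

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.neural_network import MLPClassifier

# Stand-in random data with the experiment's shapes (7312 x 41 train,
# 6980 x 41 test); the real study used preprocessed KDD records.
rng = np.random.default_rng(0)
X_train, y_train = rng.normal(size=(7312, 41)), rng.integers(0, 2, 7312)
X_test, y_test = rng.normal(size=(6980, 41)), rng.integers(0, 2, 6980)

svm = SVC(kernel='rbf', C=1000)                      # RBF kernel, C = 1000
nn = MLPClassifier(hidden_layer_sizes=(40, 40),      # 41-40-40-1 layout
                   activation='tanh', max_iter=500)  # SCG unavailable; Adam used
svm.fit(X_train, y_train)
nn.fit(X_train, y_train)
print(svm.score(X_test, y_test), nn.score(X_test, y_test))
```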

22

Conclusion and Comments

Speed: SVM training time is significantly shorter.
SVMs avoid the "curse of dimensionality" through the max-margin objective.
Accuracy: both achieve high accuracy.
SVMs can only make binary classifications, while IDS requires multiple-class identification.
How should the features be determined?
