Lecture 10: SVM and MIRA


Outline: margin, maximizing the margin, the norm, support vector machines (SVM), Margin Infused Relaxed Algorithm (MIRA)


Machine Learning for Language Technology, Lecture 10: SVM and MIRA

Marina Santini, Department of Linguistics and Philology, Uppsala University, Uppsala, Sweden

Autumn 2014

Acknowledgement: Thanks to Prof. Joakim Nivre for course design and materials


Margin

Maximizing Margin (i)

Maximizing Margin (ii)

Maximizing Margin (iii)

Max Margin = Min Norm

Maximizing the margin


•  The notion of margin: a way of predicting what will be a good separation on the test set.

•  Intuitively, if we make the margin between opposite groups as wide as possible, our chances of guessing correctly on the test set should increase.

•  The generalization error on unseen test data is proportional to the inverse of the margin: the larger the margin, the smaller the generalization error (see the sketch below).
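As a concrete illustration, here is a minimal sketch (my own example with a hypothetical toy dataset, not code from the lecture) of how the geometric margin of a given weight vector w can be computed: it is the smallest distance y_i (w · x_i) / ||w|| over the training points.

```python
import numpy as np

def geometric_margin(w, X, y):
    """Smallest signed distance from any training point to the
    hyperplane w . x = 0; it is positive only if w separates the data."""
    return np.min(y * (X @ w)) / np.linalg.norm(w)

# Hypothetical toy data: two linearly separable classes in the plane.
X = np.array([[2.0, 2.0], [1.0, 3.0], [-1.0, -2.0], [-2.0, -1.0]])
y = np.array([1, 1, -1, -1])

# Both vectors separate the data, but they achieve different margins;
# maximizing the margin means preferring the first kind of separator.
print(geometric_margin(np.array([1.0, 1.0]), X, y))  # ~2.12
print(geometric_margin(np.array([3.0, 1.0]), X, y))  # ~1.58
```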

Support Vector Machines (SVM) (i)

Support Vector Machines (SVM) (ii)

Margin Infused Relaxed Algorithm (MIRA)

MIRA

Perceptron vs. SVMs/MIRA


Perceptron: If the training set is separable by some margin, the Perceptron will find a weight vector that separates the data, but it will not necessarily pick the vector that maximizes the margin. If we are lucky, it will be a vector with the largest margin, but there is no guarantee.

SVMs/MIRA: These want the weight vector that maximizes the margin. Here the margin is normalized to 1: we put a constraint on the weight vector saying that every training example must be classified with a margin of at least 1. We keep the margin fixed and minimize the norm. That is, we want the smallest weight vector that gives us margin 1.
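In symbols, this is the constrained optimization problem below (my reconstruction of the standard hard-margin formulation, assuming a hyperplane through the origin and labels y_i in {-1, +1}):

```latex
\min_{\mathbf{w}} \; \|\mathbf{w}\|
\qquad \text{subject to} \qquad
y_i \,(\mathbf{w} \cdot \mathbf{x}_i) \ge 1 \quad \text{for all } i
```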

Strictly speaking, we do not minimize the norm: we minimize the norm squared divided by 2, which makes the math easier (trust the people who suggested this :-)).
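To make the contrast concrete, here is a minimal sketch of the two update rules for the binary case (my own illustration, not code from the lecture; it assumes labels y in {-1, +1}, a hyperplane through the origin, and the standard MIRA/passive-aggressive step size):

```python
import numpy as np

def perceptron_update(w, x, y):
    """Perceptron: if (x, y) is misclassified, add y * x to w.
    Any separating vector can result; there is no margin guarantee."""
    if y * np.dot(w, x) <= 0:
        w = w + y * x
    return w

def mira_update(w, x, y):
    """MIRA: make the smallest change to w (in Euclidean norm) such
    that (x, y) is classified with a margin of at least 1."""
    loss = max(0.0, 1.0 - y * np.dot(w, x))  # hinge loss on this example
    if loss > 0.0:
        tau = loss / np.dot(x, x)            # smallest step that fixes the margin
        w = w + tau * y * x
    return w

# Hypothetical toy run: a few passes over two separable points.
X = np.array([[2.0, 1.0], [-1.0, -1.5]])
y = np.array([1, -1])
w = np.zeros(2)
for _ in range(5):
    for xi, yi in zip(X, y):
        w = mira_update(w, xi, yi)
print(w, y * (X @ w))  # every example now has margin >= 1
```

Note how the MIRA step size tau is exactly large enough to reach margin 1 on the current example and no larger, which is the "relaxed" minimal-change idea behind the algorithm.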

Summary  

The  end  
