Persian Part Of Speech Tagging


Persian Part Of Speech Tagging

Mostafa Keikha

Database Research Group (DBRG)

ECE Department, University of Tehran


Decision Trees

Decision Tree (DT): a tree in which the root and each internal node are labeled with a question. The arcs out of a node represent the possible answers to its question, and each leaf node represents a predicted solution to the problem. Decision trees are a popular technique for classification: the leaf node reached indicates the class to which the corresponding tuple belongs.


Decision Tree Example
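The example figure from this slide is not preserved in the transcript. As a stand-in, here is a minimal, hypothetical tree of the kind just described, written as nested Python dicts; the questions and classes are invented for illustration (the "-ha" test alludes to the Persian plural suffix):

```python
# A hypothetical decision tree: internal nodes hold a question and one
# subtree per possible answer; leaves hold a predicted class.
tree = {
    "question": "does the word end in '-ha'?",           # Persian plural suffix
    "yes": {"class": "NOUN"},                            # leaf: predict NOUN
    "no": {
        "question": "is the previous tag a preposition?",
        "yes": {"class": "NOUN"},
        "no": {"class": "VERB"},
    },
}
```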


Decision Trees

A decision tree model is a computational model consisting of three parts:

- the tree itself,
- an algorithm to create the tree, and
- an algorithm that applies the tree to data.

Creating the tree is the most difficult part. Applying it is basically a search similar to that in a binary search tree (although a DT may not be binary).


Decision Tree Algorithm
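The algorithm figure on this slide is also not preserved. Given the representation sketched above, applying a tree is the root-to-leaf search described on the previous slide; a minimal sketch, assuming the answers are supplied in a feature dictionary keyed by question:

```python
def classify(node, features):
    """Walk the tree from the root: ask each node's question,
    follow the arc for the answer, stop at a leaf."""
    while "class" not in node:               # still at an internal node
        answer = features[node["question"]]  # e.g. "yes" / "no"
        node = node[answer]                  # follow the matching arc
    return node["class"]

# Usage with the hypothetical tree above:
# classify(tree, {"does the word end in '-ha'?": "no",
#                 "is the previous tag a preposition?": "yes"})  # -> "NOUN"
```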


Using DT in POS Tagging

Compute ambiguity classes:

- Each term may occur with different tags.
- The ambiguity class of a term is the set of all its possible tags.
- Count the number of occurrences of each tag within each ambiguity class (a code sketch follows the table below).

Ambiguity class    # of occurrences
a b c d            10  20  25  40
b c d              40  39  50
b d                60  55
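A minimal sketch of this computation, assuming the training data is available as a list of (word, tag) pairs; all identifiers are illustrative:

```python
from collections import defaultdict

def ambiguity_classes(tagged_corpus):
    """tagged_corpus: list of (word, tag) pairs from a tagged corpus.
    Returns {ambiguity_class: {tag: count}}, where a word's ambiguity
    class is the frozenset of all tags it was ever seen with."""
    word_tags = defaultdict(set)                  # word -> tags seen with it
    for word, tag in tagged_corpus:
        word_tags[word].add(tag)
    counts = defaultdict(lambda: defaultdict(int))
    for word, tag in tagged_corpus:               # tally tags inside each class
        counts[frozenset(word_tags[word])][tag] += 1
    return counts
```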


Using DT in POS Tagging

Create a decision tree over the ambiguity classes: at each level, delete the tag with the minimum number of occurrences, until a single tag remains.

a b c d    10 20 25 40
    ↓ delete a (minimum)
b c d      40 39 50
    ↓ delete c (minimum)
b d        60 55
    ↓ delete d (minimum)
b
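A minimal sketch of this pruning, with one simplification: it keeps a single fixed count per tag, whereas the slide re-tallies the counts at each level (b's count grows from 20 to 40 to 60):

```python
def prune_to_best_tag(tag_counts):
    """tag_counts: {tag: occurrences} for one ambiguity class.
    At each level, delete the tag with the minimum count; the classes
    produced on the way down form one path of the decision tree."""
    counts = dict(tag_counts)
    levels = [set(counts)]
    while len(counts) > 1:
        weakest = min(counts, key=counts.get)   # minimum-occurrence tag
        del counts[weakest]
        levels.append(set(counts))
    return levels   # the last element is the single surviving tag
```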


Using DT in POS Tagging

Advantages:
- Easy to understand
- Easy to implement

Disadvantage:
- Context independent


Using DT in POS Tagging

Known Tokens Results

Run      % of all tokens   Tokens     Correct    Accuracy
1        97.97             393923     363764     92.34%
2        98.06             355630     328965     92.50%
3        97.96             397528     367789     92.51%
4        97.92             410561     381578     92.94%
5        97.97             403079     372305     92.36%
Average  97.976            392144.2   362880.2   92.474%


POS tagging using HMMs

Let W be a sequence of words: W = w1 w2 … wn.

Let T be the corresponding tag sequence: T = t1 t2 … tn.

Task: find the T that maximizes P(T | W):

T' = argmaxT P(T | W)


POS tagging using HMMs

By Bayes' rule,

P(T | W) = P(W | T) * P(T) / P(W)

and since P(W) does not depend on the choice of T,

T' = argmaxT P(W | T) * P(T)

Transition probability:

P(T) = P(t1) * P(t2 | t1) * P(t3 | t1 t2) * … * P(tn | t1 … tn-1)

Applying the trigram approximation:

P(T) ≈ P(t1) * P(t2 | t1) * P(t3 | t1 t2) * … * P(tn | tn-2 tn-1)

Introducing a dummy tag, $, to represent the beginning of a sentence:

P(T) ≈ P(t1 | $) * P(t2 | $ t1) * P(t3 | t1 t2) * … * P(tn | tn-2 tn-1)
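As a concrete sketch, P(T) can be computed from a table of trigram probabilities; log space avoids numerical underflow. The table and its layout are assumptions for illustration, not the authors' code:

```python
import math

def tag_sequence_logprob(tags, trigram_prob):
    """log P(T) under the trigram approximation; trigram_prob maps
    (t_{i-2}, t_{i-1}, t_i) -> probability. The sentence start is
    padded with the dummy tag $ so every factor is a trigram."""
    padded = ["$", "$"] + list(tags)
    logp = 0.0
    for i in range(2, len(padded)):
        logp += math.log(trigram_prob[(padded[i - 2], padded[i - 1], padded[i])])
    return logp   # exponentiate to recover P(T) itself
```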


POS tagging using HMMs

Smoothing Transition Probabilities

Trigram counts suffer from the sparse-data problem, so the transition probabilities are smoothed with the linear interpolation method:

P'(ti | ti-2 ti-1) = λ1 P(ti) + λ2 P(ti | ti-1) + λ3 P(ti | ti-2 ti-1)

such that the λs sum to 1.
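A direct transcription of this formula into code; the probability-table names are illustrative, and missing entries default to 0:

```python
def smoothed_transition(t, t1, t2, unigram, bigram, trigram, lambdas):
    """P'(t | t2 t1) = λ1·P(t) + λ2·P(t | t1) + λ3·P(t | t2 t1),
    where t2 = t_{i-2}, t1 = t_{i-1}, and λ1 + λ2 + λ3 = 1.
    The tables hold relative frequencies from the training corpus."""
    l1, l2, l3 = lambdas
    return (l1 * unigram.get(t, 0.0)
            + l2 * bigram.get((t1, t), 0.0)
            + l3 * trigram.get((t2, t1, t), 0.0))
```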


POS tagging using HMMs

Calculation of λs
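The slide's calculation is not preserved in this transcript. A standard way to estimate these λs, sketched below as an assumption rather than the authors' documented method, is the deleted-interpolation algorithm of Brants (2000):

```python
def deleted_interpolation(uni, bi, tri, N):
    """Estimate (λ1, λ2, λ3) by deleted interpolation (Brants, 2000).
    uni/bi/tri: tag n-gram frequency dicts; N: total number of tag
    tokens in the training corpus."""
    l1 = l2 = l3 = 0.0
    for (t1, t2, t3), f in tri.items():
        # each estimate, with this trigram's own counts removed
        c3 = (f - 1) / (bi[(t1, t2)] - 1) if bi[(t1, t2)] > 1 else 0.0
        c2 = (bi[(t2, t3)] - 1) / (uni[t2] - 1) if uni[t2] > 1 else 0.0
        c1 = (uni[t3] - 1) / (N - 1)
        best = max(((c1, 1), (c2, 2), (c3, 3)))[1]  # largest surviving estimate
        if best == 1:
            l1 += f
        elif best == 2:
            l2 += f
        else:
            l3 += f
    s = l1 + l2 + l3
    return l1 / s, l2 / s, l3 / s   # normalize so the λs sum to 1
```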


POS tagging using HMMs

Emission probability:

P(W | T) ≈ P(w1 | t1) * P(w2 | t2) * … * P(wn | tn)

Context dependency: to make the model more dependent on the context, the emission probability is instead calculated as:

P(W | T) ≈ P(w1 | $ t1) * P(w2 | t1 t2) * … * P(wn | tn-1 tn)


POS tagging using HMMs

A smoothing technique is applied to the emission probabilities as well:

P'(wi | ti-1 ti) = θ1 P(wi | ti) + θ2 P(wi | ti-1 ti)

where the θs sum to 1 and are different for different words.
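In code, mirroring the transition smoothing above; the per-word lookup of θs is an assumption about how "different for different words" is realized:

```python
def smoothed_emission(w, t_prev, t, uni_emit, bi_emit, thetas):
    """P'(w | t_prev t) = θ1·P(w | t) + θ2·P(w | t_prev t),
    with θ1 + θ2 = 1 and the θ pair chosen per word."""
    th1, th2 = thetas[w]              # word-specific interpolation weights
    return (th1 * uni_emit.get((w, t), 0.0)
            + th2 * bi_emit.get((w, t_prev, t), 0.0))
```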

[Several slides of derivation, equations (1)–(6) and accompanying figures, are not preserved in this transcript.]

POS tagging using HMMs

Lexicon generation probability


POS tagging using HMMs

P(N V ART N | flies like a flower) = 4.37 × 10⁻⁶
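The probability tables behind this figure are not included in the transcript, so the value 4.37 × 10⁻⁶ cannot be reproduced here. The sketch below shows only the shape of the computation for one candidate tag sequence, using bigram transitions for brevity and hypothetical tables:

```python
def score(words, tags, trans, emit):
    """P(T) * P(W | T) for one candidate tag sequence, with bigram
    transitions trans[(t_prev, t)] and simple emissions emit[(w, t)]."""
    p = 1.0
    prev = "$"                       # dummy start-of-sentence tag
    for w, t in zip(words, tags):
        p *= trans[(prev, t)] * emit[(w, t)]
        prev = t
    return p

# score("flies like a flower".split(), ["N", "V", "ART", "N"], trans, emit)
```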


POS tagging using HMMs

Known Tokens Results

Run      % of all tokens   Tokens     Correct    Accuracy
1        98.07             394290     382211     96.94%
2        98.16             355969     345913     97.18%
3        98.04             397849     385737     96.96%
4        98.02             410970     398487     96.96%
5        98.07             403460     391475     97.03%
Average  98.072            392507.6   380764.6   97.01%


Unknown Tokens Results

Run      % of all tokens   Tokens    Correct   Accuracy
1        1.93              7760      5829      75.12%
2        1.84              6689      5357      80.09%
3        1.96              7956      6153      77.34%
4        1.98              8283      6435      77.69%
5        1.93              7945      6246      78.62%
Average  1.928             7726.6    6004.0    77.77%


Overall Results

Run      Tokens     Correct    Accuracy
1        402050     388040     96.52%
2        362658     351270     96.86%
3        405805     391890     96.57%
4        419253     404922     96.58%
5        411405     397721     96.67%
Average  400234.2   386768.6   96.64%
