26
Computational Computational Linguistics Linguistics Lecture 1: Introduction Lecture 1: Introduction What is Linguistics? Prescriptive Grammar: Descriptive Grammar: Some Linguistic Methods Grammatical Theory Linguistics Beyond Grammar linguistics What is computational linguistics? Dr. Saeed Dr. Saeed Rahati Rahati

Lec 1

  • Upload
    quchani

  • View
    822

  • Download
    3

Embed Size (px)

DESCRIPTION

computational linguistics Ferdowsi university

Citation preview

Page 1: Lec 1

Computational LinguisticsComputational LinguisticsComputational LinguisticsComputational Linguistics

Lecture 1: IntroductionLecture 1: IntroductionLecture 1: IntroductionLecture 1: Introduction

What is Linguistics?Prescriptive Grammar:Descriptive Grammar:Some Linguistic MethodsGrammatical TheoryLinguistics Beyond Grammarlinguistics What is computational linguistics?Dr. Saeed Dr. Saeed

RahatiRahatiDr. Saeed Dr. Saeed RahatiRahati

Page 2: Lec 1

What is Linguistics?What is Linguistics?

Linguistics is the study of human language, broadly construed.

Linguistics is a scientific discipline with established theories, analytic methods, and real-world applications.

Linguists often study individual languages, but...

When linguists study individual languages, they have larger issues in mind.

Page 3: Lec 1

3

Linguistics is descriptive, not prescriptive

… What is Linguistics? … What is Linguistics?

Page 4: Lec 1

4

Rules against certain usages.Few if any rules for what is

allowed.Condemns forms generally in

use.Explicitly normative enterprise.

Prescriptive Grammar:Prescriptive Grammar:

Page 5: Lec 1

Rules characterizing what people do say.

Tries to do so in a way that reflects internalized generalizations that people have made.

Linguists are fundamentally concerned with linguistic knowledge.

Descriptive Grammar:Descriptive Grammar:

Page 6: Lec 1

6

• parkway vs. driveway• maternity dress vs. paternity

suit• bathing trunks (pl) vs. bikini

(sing)• you are vs. *you is• Aren’t I clever? vs. *I aren’t

clever.

Anyway, language isn’t logical:Anyway, language isn’t logical:

Page 7: Lec 1

7

◮ Fieldwork◮ Formal analysis of patterns in data sets◮ Psycholinguistic experiments◮ Computational modeling◮ Corpus analysis

Some Linguistic MethodsSome Linguistic Methods

Page 8: Lec 1

8

◮ Phonetics: The study of speech sounds◮ Phonology: The study of sound systems◮ Morphology: The study of word structure◮ Syntax: The study of sentence structure◮ Semantics: The study of linguistic meaning◮ Pragmatics: The study of language use

Grammatical TheoryGrammatical Theory

Page 9: Lec 1

9

Phonetics: The Study of Speech SoundsPhonetics: The Study of Speech Sounds

Page 10: Lec 1

10

Phonology: The Study of Sound SystemsPhonology: The Study of Sound Systems

Page 11: Lec 1

11

◮ missile: ‘ICBM’◮ anti-tank-missile: ‘missile targetting tanks’◮ anti-aircraft-missile: ‘missile targetting aircraft’◮ anti-missile-missile: ‘missile targetting ICBMs’

Morphology: The Study Of Word StructureMorphology: The Study Of Word Structure

Page 12: Lec 1

12

◮ Rule: Anti-X-missile is a missile targetting Xs.◮ What kind of missile targets anti-missile-missiles?◮ anti-anti-missile-missile-missile◮ anti-anti-anti-missile-missile-missile-missile:‘missile targetting anti-anti-missile-missile-missiles’

Morphological RulesMorphological Rules

Page 13: Lec 1

13

◮ I saw the woman with the telescope.I [saw [the woman] [with the telescope]].I [saw [[the woman] [with the telescope]]].

◮ Put the block in the box on the table in the bedroom.◮ Put the block in the box on the table in the bedroom near the kitchen.

Syntax: The Study of Sentence StructureSyntax: The Study of Sentence Structure

Page 14: Lec 1

14

◮ Structural Ambiguity produces semantic ambiguity.◮ Both in morphology and syntax.◮ Lexical Ambiguity: We screened the candidates.◮ Both Together: I saw her duck.

Semantics: The Study of Linguistic MeaningSemantics: The Study of Linguistic Meaning

Page 15: Lec 1

15

Q: Is Palin a Republican?A: Is the Pope Catholic?◮ Why don’t you move up to the City?◮ Why should I stand here and listen to this?◮ Do you think I’m saying this just to hear the sound of my own voice?

Pragmatics: The Study of Language UsePragmatics: The Study of Language Use

Page 16: Lec 1

16

◮ Historical Linguistics: How languages change over time.◮ Sociolinguistics: How languages vary socially. How language is used as a social resource.◮ Psycholinguistics: What goes on in people’s heads as they use language.◮ Language Acquisition: How people learn language. (first language acquisition; second language acquisition)◮ Computational Linguistics: Making computers process (generate/‘understand’/translate...) human languages.

Linguistics Beyond GrammarLinguistics Beyond Grammar

Page 17: Lec 1

17

more

rig

oro

us

less

rigoro

us

more

fl

akey

literarycriticism

physicsbiology

chemistrypsychology

computational linguistics

neuropsychologylinguistics?

Computational LinguisticsComputational Linguistics

Page 18: Lec 1

18

What defines the rigor of a field? Whether results are reproducible Whether theories are

testable/falsifiable Whether there are a common set of

methods for similar problems Whether approaches to problems

can yield interesting new questions/answers

Page 19: Lec 1

Computational Linguistics

19Linguistics

Page 20: Lec 1

Computational Linguistics

20

more

rig

oro

us

less

rigoro

us

literarycriticism

engineering sociologylinguistics

Linguistics

Page 21: Lec 1

more

rig

oro

us le

ss

rigoro

us

oth

er a

reas o

f so

ciolin

gu

istics(e

.g. D

eb

ora

h

Tan

nen

)“th

eore

tical”

ling

uistics

(e.g

. m

inim

alist

synta

x)

som

e a

reas o

f so

ciolin

gu

istics(e

.g. B

ill Lab

ov)

psy

cholin

gu

istics

exp

erim

en

tal p

hon

etics

histo

rical lin

gu

istics

“theore

tical”

ling

uistics

(e.g

. lexica

l-fun

ction

al

gra

mm

ar)

The true situation with linguistics

Page 22: Lec 1

22

What is computational linguistics? Text normalization/segmentation Morphological analysis Automatic word pronunciation prediction Transliteration Word-class prediction: e.g. part of speech tagging Parsing Semantic role labeling Machine translation Dialog systems Topic detection Summarization Text retrieval Bioinformatics Language modeling for automatic speech recognition Computer-aided language learning (CALL)

Page 23: Lec 1

23Computational linguistics

Often thought of as natural language engineering

But there is also a serious scientific component to it.

Page 24: Lec 1

24

Goals of Computational Linguistics/ Natural Language Processing To get computers to deal with

language the way humans do: They should be able to understand

language and respond appropriately in language

They should be able to learn human language the way children do

They should be able to perform linguistic tasks that skilled humans can do, such as translation

Yeah, right

Page 25: Lec 1

25Some interesting themes…

Finite-state methods: Many application areas Raises many interesting questions about how

“regular” language is Grammar induction:

Linguists have done a poor job at their stated goal of explaining how humans learn grammar

Computational models of language change: Historical evidence for language change is only

partial. There are many changes in language for which we have no direct evidence.

Page 26: Lec 1

26Why CL may seem ad hoc

Wide variety of areas (as in linguistics)

If it’s natural language engineering the goal is often just to build something that works

Techniques tend to change in somewhat faddish ways… For example: machine learning

approaches fall in and out of favor