25
Corpus Linguistics Małgorzata Warzecha

Corpus Linguistics Presentation

Embed Size (px)

DESCRIPTION

A power point presentation on Corpus Linguistics. It contains some history, current trends and example of software used for creating corpora. Duration: roughly 40 minutes. Created for Methodology course at Jagiellonian University, English studies. Author: Małgorzata Warzecha

Citation preview

Corpus Linguistics

Corpus LinguisticsMagorzata Warzecha

What is a corpus?

What is a corpus?

Corpus:From the Latin for body (plural corpora), a corpus is a body of language representative of a particular variety of language or genre which is collected and stored in electronic form for analysis using concordance software.

CASS: Briengs, Tony McEnryTypes of corpora:1. Specialised corpusTypes of corpora:1. Specialised corpus2. General corpusTypes of corpora:1. Specialised corpus2. General corpus3. Multilingual corporaTypes of corpora:1. Specialised corpus2. General corpus3. Multilingual corpora4. Parallel corpusTypes of corpora:1. Specialised corpus2. General corpus3. Multilingual corpora4. Parallel corpus5. Learner corpus

Types of corpora:1. Specialised corpus2. General corpus3. Multilingual corpora4. Parallel corpus5. Learner corpus6. Historical or Diachronic corpusTypes of corpora:1. Specialised corpus2. General corpus3. Multilingual corpora4. Parallel corpus5. Learner corpus6. Historical or Diachronic corpus7. Monitor corpusWhat is corpus linguistics? A theory of language or a methodology of language?What is corpus linguistics?NOT a theory of language! but:

a collection of methods for studying language

Corpus linguistics is perhaps best described in simple terms as the study of language based on examples of real life language use.

History of corpus linguisticsEarly Corpus Linguistics

Criticism

Modern Approach to Corpus Linguistics1. Early Corpus LinguisticsHarris (1993: 27) summarises the approach well: 'The approach began ... with a large collection of recorded utterances from some language, a corpus. The corpus was subjected to a clear, stepwise, bottom-up strategy of analysis.'

1. Early Corpus LinguisticsExamples of early corpus research:Language acquisition: diary studies period 1876-1926; Preyer (1889), Stern (1924)Spelling conventions: Kding (1897), 11 million German wordsLanguage pedagogyComparative Linguistics: Eaton (1940) Syntax, Semantics: Fries (1952)