14
Corpus linguistics in lexicography Group members: Sarah Khairuddin (0713976) Eslam Abdurabuh (0614532) Nurul Diana Md. Rabi (0634264) Noraini Mohd Noor (0728928)

lexicography

  • Upload
    ayfa

  • View
    2.695

  • Download
    3

Embed Size (px)

Citation preview

Page 1: lexicography

Corpus linguistics in lexicographyGroup members: Sarah Khairuddin (0713976)Eslam Abdurabuh (0614532) Nurul Diana Md. Rabi (0634264) Noraini Mohd Noor (0728928)

Page 2: lexicography

Definition

Lexicography is a scholarly discipline that involves compiling, writing, or editing dictionaries.

It is divided into two related areas:1. Practical Lexicography.2. Theoretical Lexicography.

Page 3: lexicography

Scope The basic concern of lexicography is 'word' which is studied in

different branches of linguistics, phonetics, grammar, stylistics etc.

Lexicography focuses on the design, compilation, use and evaluation of general dictionaries, i.e. dictionaries that provide a description of the language in general use.

Thus,

1. Practical Lexicography focuses on writing, or editing dictionaries.

Profiling the intended users, Defining words. Choosing the appropriate structures for presenting the data in the

dictionary. Selecting words and affixes for systematization as entries. Selecting collocations, phrases and examples. Choosing lemma forms for each word or part of word to be

lemmatized. Organizing definitions. Specifying pronunciations of words.

Page 4: lexicography

2. Theoretical Lexicography: is the analysis or description of the vocabulary of a particular language, and the meaning that links certain words to others in a dictionary.

Related aspects: Dictionary criticism. Dictionary history. Dictionary typology. Dictionary structure. Dictionary use. Dictionary IT.

Page 5: lexicography

Corpus used in Lexicography

Written Part:

Extracts from regional and national newspapers

Specialist periodicals and journals for all ages and interests

Academic books and popular fiction, Published and unpublished letters Memoranda, School and university essays, Among many other kinds of text.

Page 6: lexicography

Spoken Part:

Orthographic transcriptions of unscripted informal conversations (recorded by volunteers selected from different age, region and social classes in a demographically balanced way)

Spoken language collected in different contexts, ranging from formal business or government meetings to radio shows and phone-ins.

Page 7: lexicography

Examples of the Corpus

Collins Cobuild. British National Corpus (BNC). Longman Corpus Network. American National Corpus.

Page 8: lexicography

Relevance or application of lexicography to language learning/language research

Giving definitions to avoid ambiguity As a main source for record keeping

in preserving the collection of words Served as a guideline on how words

are changing New words are been introduced and

old words die out Give status labels for example slang,

jargon, taboo, etc

Page 9: lexicography

CONTRIBUTIONS OF LEXICOGRAPHY AND CORPUS LINGUISTICS TO A THEORY LANGUAGE(2000)•Author : Patrick Hans •Objective of the study : -To see the relevance of transforming generative linguistic theory to lexicography.-To see the relevance of using a device machine( corpus ) that can generate all and only the grammatical utterances in grammar.•Findings and synopsis : By studying the corpus evidence for a natural-kind term spider, we can develop a sort of collectivecognitive profile of the word and its meaning: the corpus prompts us into considering what mightbe said.

Page 10: lexicography

Corpus-based cognitive profile of the noun spider: Many thousands of species of spiders are known. Spiders are carnivores. Some species of spiders hunt prey. Some spiders bite. Some species of spiders are poisonous. Many species of spiders spin webs, with threads of extremely strong

silk. Spiders lurk in the centre of their webs. Spiders control what is going on in their webs. Spiders have eight legs. Their legs are thin, hairy, and long in proportion to body size. Spiders have eight eyes. Spiders spend a lot of time being motionless. Spiders’ movement is sudden. Spiders crawl. Spiders scuttle. Spiders are swift and agile. Spiders can run up walls. Many people have a dread of spiders. People are much concerned with trying to get spiders out of the bath.

Page 11: lexicography

LEXICOGRAPHY AND CORPUS LINGUISTICS (1992)

Author : Fred Karlson Objective of the study :

-To introduce the first English Corpus projects and its use.

To clarify What can corpus linguistics, in the broad sense just defined, contribute to lexicography, on top of what COBUILD and other completed projects have already demonstrated by way of using raw concordances derived from large text corpora?

Synopsis : features of new English corpus projects. The systematic design and collection of the corpora,

their large size, the idea of making them generally available to the research community, careful evaluation of the problems of representativeness and sampling, and, especially in the case of the Brown Corpus, full-scale computerization.

Page 12: lexicography

Methodology: collecting texts and using them for linguistic

description, developing linguistically suitable computational

tools for annotation and processing of large text corpora. (Statistical corpus processing, Grammatical annotation of large corpora).

Findings: It makes possible the generation of frequency

lists for lemmas. It makes possible comparisons concerning the

lexical composition of text types on the lemma level

raise the level of abstraction somewhat and help the lexicographer in structuring the corpus data

Collocation phenomena and syntactic frames are much easier to spot

Page 13: lexicography

Dictionary Production Software TLex Suite 2010 TLex Dictionary Compilation Software tlTerm Professional Termbase

Software  tlCorpus Concordance Software  tlReaderFeatures:TLex contains many specialized features that

allow you to dramatically reduce dictionary production time and increase the quality and consistency of your dictionaries (from single-user projects to large teams).

Page 14: lexicography

These include an integrated Corpus Query System, real-time preview, full customizability, advanced styles system,

"smart cross-references" with tracking and auto-updating, automated lemma reversal, automated numbering and sorting,  multi-user support for managing teams, and much more.

TLex can be used for all languages, for all kinds of dictionaries.