Upload
ayfa
View
2.695
Download
3
Embed Size (px)
Citation preview
Corpus linguistics in lexicographyGroup members: Sarah Khairuddin (0713976)Eslam Abdurabuh (0614532) Nurul Diana Md. Rabi (0634264) Noraini Mohd Noor (0728928)
Definition
Lexicography is a scholarly discipline that involves compiling, writing, or editing dictionaries.
It is divided into two related areas:1. Practical Lexicography.2. Theoretical Lexicography.
Scope The basic concern of lexicography is 'word' which is studied in
different branches of linguistics, phonetics, grammar, stylistics etc.
Lexicography focuses on the design, compilation, use and evaluation of general dictionaries, i.e. dictionaries that provide a description of the language in general use.
Thus,
1. Practical Lexicography focuses on writing, or editing dictionaries.
Profiling the intended users, Defining words. Choosing the appropriate structures for presenting the data in the
dictionary. Selecting words and affixes for systematization as entries. Selecting collocations, phrases and examples. Choosing lemma forms for each word or part of word to be
lemmatized. Organizing definitions. Specifying pronunciations of words.
2. Theoretical Lexicography: is the analysis or description of the vocabulary of a particular language, and the meaning that links certain words to others in a dictionary.
Related aspects: Dictionary criticism. Dictionary history. Dictionary typology. Dictionary structure. Dictionary use. Dictionary IT.
Corpus used in Lexicography
Written Part:
Extracts from regional and national newspapers
Specialist periodicals and journals for all ages and interests
Academic books and popular fiction, Published and unpublished letters Memoranda, School and university essays, Among many other kinds of text.
Spoken Part:
Orthographic transcriptions of unscripted informal conversations (recorded by volunteers selected from different age, region and social classes in a demographically balanced way)
Spoken language collected in different contexts, ranging from formal business or government meetings to radio shows and phone-ins.
Examples of the Corpus
Collins Cobuild. British National Corpus (BNC). Longman Corpus Network. American National Corpus.
Relevance or application of lexicography to language learning/language research
Giving definitions to avoid ambiguity As a main source for record keeping
in preserving the collection of words Served as a guideline on how words
are changing New words are been introduced and
old words die out Give status labels for example slang,
jargon, taboo, etc
CONTRIBUTIONS OF LEXICOGRAPHY AND CORPUS LINGUISTICS TO A THEORY LANGUAGE(2000)•Author : Patrick Hans •Objective of the study : -To see the relevance of transforming generative linguistic theory to lexicography.-To see the relevance of using a device machine( corpus ) that can generate all and only the grammatical utterances in grammar.•Findings and synopsis : By studying the corpus evidence for a natural-kind term spider, we can develop a sort of collectivecognitive profile of the word and its meaning: the corpus prompts us into considering what mightbe said.
Corpus-based cognitive profile of the noun spider: Many thousands of species of spiders are known. Spiders are carnivores. Some species of spiders hunt prey. Some spiders bite. Some species of spiders are poisonous. Many species of spiders spin webs, with threads of extremely strong
silk. Spiders lurk in the centre of their webs. Spiders control what is going on in their webs. Spiders have eight legs. Their legs are thin, hairy, and long in proportion to body size. Spiders have eight eyes. Spiders spend a lot of time being motionless. Spiders’ movement is sudden. Spiders crawl. Spiders scuttle. Spiders are swift and agile. Spiders can run up walls. Many people have a dread of spiders. People are much concerned with trying to get spiders out of the bath.
LEXICOGRAPHY AND CORPUS LINGUISTICS (1992)
Author : Fred Karlson Objective of the study :
-To introduce the first English Corpus projects and its use.
To clarify What can corpus linguistics, in the broad sense just defined, contribute to lexicography, on top of what COBUILD and other completed projects have already demonstrated by way of using raw concordances derived from large text corpora?
Synopsis : features of new English corpus projects. The systematic design and collection of the corpora,
their large size, the idea of making them generally available to the research community, careful evaluation of the problems of representativeness and sampling, and, especially in the case of the Brown Corpus, full-scale computerization.
Methodology: collecting texts and using them for linguistic
description, developing linguistically suitable computational
tools for annotation and processing of large text corpora. (Statistical corpus processing, Grammatical annotation of large corpora).
Findings: It makes possible the generation of frequency
lists for lemmas. It makes possible comparisons concerning the
lexical composition of text types on the lemma level
raise the level of abstraction somewhat and help the lexicographer in structuring the corpus data
Collocation phenomena and syntactic frames are much easier to spot
Dictionary Production Software TLex Suite 2010 TLex Dictionary Compilation Software tlTerm Professional Termbase
Software tlCorpus Concordance Software tlReaderFeatures:TLex contains many specialized features that
allow you to dramatically reduce dictionary production time and increase the quality and consistency of your dictionaries (from single-user projects to large teams).
These include an integrated Corpus Query System, real-time preview, full customizability, advanced styles system,
"smart cross-references" with tracking and auto-updating, automated lemma reversal, automated numbering and sorting, multi-user support for managing teams, and much more.
TLex can be used for all languages, for all kinds of dictionaries.