Upload
umcs-pl
View
1
Download
0
Embed Size (px)
Citation preview
Please cite as: Krajka, J. (2006). Web-based pronunciation teaching. The Teacher 6(40) 2006,
7-14.
WEB-BASED PRONUNCIATION TEACHING
Dr Jarosław Krajka
Uniwersytet Marii Curie-Skłodowskiej, Lublin
Szkoła Wyższa Psychologii Społecznej, Warszawa
Introduction
The development of pronunciation skills is a prerequisite for appropriate oral
production and interaction, with the comprehensibility of the learner’s speech being of utmost
importance for effective communication. As Godwin-Jones (2000) puts it, thanks to the
replacement of analog audio by digital audio for classroom pronunciation practice, random
access, variable playback speeds, and incorporation into interactive language learning
applications are all enabled.
The focus of the present paper will be to explore the possibilities opened by Web-
based instruction as far as pronunciation work is concerned. The use of Internet sites, online
audio materials, Web Text-to-Speech synthesis services (TTS) and software, ready-made or
teacher-prepared audio quizzes, may all contribute to the development of pronunciation
knowledge and skills. Here, we will adopt a bottom-up approach, starting with the articulation
of individual sounds (segments), through practice leading to language production in discourse,
communicative exercises or dialogues. This assumption has been justified by most usual
teaching procedure in mainstream language education, though one might reflect on the
opposite, top-down view, as well.
Phases of pronunciation teaching in the Web-enhanced classroom
1. Discrimination
Exposing students to the sounds of the target language, in isolation and in context, in
various positions of the word, is essential in developing discrimination skills. The new sounds
in the target language, as well as sounds that are substantially different in quality and/or
quantity, need to be exposed to students in the exercises that aim at improving noticing of the
new sounds, growing in complexity from sounds in isolation to more complicated words and
phrases.
The use of online audio resources to help in preparing materials for the discrimination
of sounds is not to be underestimated. Free availability of a variety of audio files, both
downloadable as .mp3 or .wav (podcasts) and streaming (playing “live” from the websites),
create favourable conditions for exposing students to the target language speech in its natural
form. However, before exposing students to the authentic flow of connected speech, usually at
fast tempo and often accompanied by background noises (e.g., in the news bulletins), it might
be useful to collect audio resources where particular sounds might be prominently featured.
The creation of a sort of “sound library”, with a great amount of recordings of words with the
key sound in various positions (front, mid and back), all in convenient digital format, should
be the first step in preparing discrimination activities for students.
In the case of materials available online, they need to be either downloaded (right-
clicking on an item and selecting “Save Target As”) or recorded with audio recorders/editors
such as Audacity (http://audacity.sourceforge.net). A ready-made collection of words
especially selected and recorded for pronunciation practice, most convenient for such a
teacher-made sound library, might be Sobkowiak’s Fonetyka angielska w pigułce
(http://elex.amu.edu.pl/ifa/pigulka/start.htm). Apart from being an extremely comprehensive
source of theoretical knowledge on how to articulate the sounds of the English language, the
site features downloadable recordings of exemplary words, all in .wav format. Due to that, the
teacher can either guide all the students to the site and have them click on the links to relevant
words to play them online, or can right-click the hyperlinked words and choose “Save Target
As” to retrieve the recordings of individual words for later offline use.
A similar operation can be performed with online dictionaries featuring audio
recordings of words, such as Merriam-Webster (http://www.m-w.com) or collections of
recordings of individual words (Words aloud, http://kweto.com/etjaichi/nicolascueto/). As the
pronunciation of specific words in online dictionaries is usually in the streaming format
(playing “live” from the website), one needs to record it with an audio recorder such as
Audacity (see Figure 1 below for the demonstration). Thanks to the use of Audacity, the
teacher can record the word multiple times, with gaps in between for students to repeat it, as
can be demonstrated by the soundwave form in the Audacity window below.
Figure 1. A pronunciation recording of an online dictionary entry with Audacity.
Alternatively, teachers might make their own recordings of selected words using
Audacity, so that the audio files are made using the teacher’s specific accent that students
might be most familiar with. Teacher-made recordings might be most useful for creating
minimal pairs, especially the longer strings in which words are repeated a number of times in
order to train the students’ ears.
Finally, some of the Text-to-Speech (TTS) synthesis tools listed below can be used to
automatically generate individual words or phrases with particular sounds (see Godwin-Jones,
2000, for a discussion of the technical aspects involved in speech synthesis technologies on
the Web). It needs to be kept in mind, however, that TTS readers, though quick and
convenient to use, will produce synthesized speech that may not fully portray the target
language, and it might not be fully beneficial to follow only these for sound discrimination:
• Nuance RealSpeak (http://www.nuance.com/realspeak/demo/);
• Acapela Group (http://www.acapela-group.com/index.asp, http://demo.acapela-
group.com/);
• AT&T Labs Research (http://public.research.att.com/~ttsweb/tts/, with one demo at
http://public.research.att.com/~ttsweb/tts/demo.php and the other at
http://elvis.naturalvoices.com/demos/);
• VLC’s Text-to-Speech Test Page
(http://www.edict.com.hk/texttospeech/TTStest.htm);
• Bell Labs Text-to-Speech (http://www.bell-labs.com/project/tts/voices-java.html);
• TextVoice Generator (http://guoshesen.51.net/download.html) - a fully functional TTS
reader generating sound files with the text/words input.
The selection of words for pronunciation work, both as far as discrimination and
practice is concerned, is the issue that teachers should approach with necessary caution.
Ideally, the words that students work with should be suited to their language level, both the
ones that they have already been exposed to receptively and the ones that are a part of the
productive lexicon, with no room for random selection. Similarly, the selection of words and
phrases needs to focus on the sounds in question in various positions of the words, both
monosyllabic and multisyllabic. To this end, it is advisable to use various rhyming services
(Rhymezone, http://www.rhymezone.com or Rhymer, http://www.rhymer.com, to quote just
two), which will provide the teacher with a number of words that rhyme with the words input.
In this way, while introducing a word with a given sound in front, mid and back positions, the
teacher will be provided with a large number of rhyming words, with a various number of
syllables, which they can choose from for pronunciation practice. It is also interesting to note
that Rhymezone (http://www.rhymezone.com) allows searching for homophones and similar
sounding words, which are additional opportunities for collecting materials for pronunciation
work, while Rhymer (http://www.rhymer.com) gives the chance to find various types of
rhymes (end, beginning, first/last syllable or double rhymes).
Thus, some of the above methods can serve to facilitate the compilation or creation of
audio materials for some of the techniques for improving discrimination as listed by Ur
(1996), Komorowska (2002) and Harmer (2002):
• minimal pairs: Ss distinguish the same/different words;
• give the number: with sounds from a minimal pair coded with numbers, Ss give the
number;
• repeated word: with a minimal pair with three words, where one sound is repeated to
make it more difficult, Ss distinguish sounds;
• counting repetitions: with some of the words from a minimal pair repeated, Ss are to
count repetitions;
• odd word out: Ss find the word with a different sound;
• label the word: Ss match the sounds with the words;
• matching the heard with the seen;
• matching the picture with the word heard;
• playing noughts and crosses (with words or sounds).
A variety of online audio resources as listed below can be used for collecting audio
materials especially to teach connected speech and suprasegmental features such as intonation
and sentence stress. It might be especially useful to collect radio broadcasts from different
English-speaking countries, as in this way students would be exposed to different varieties of
the language, which is crucial in the light of the New Matura requirements:1
• Australian Broadcasting Corporation (http://www.abc.net.au/);
• BBC (http://www.bbc.co.uk);
• BBC World Service (http://www.bbc.co.uk/worldservice/);
• Canadian Broadcasting Corporation (http://www.cbc.ca/);
• National Public Radio (http://www.npr.org/);
• Shoutcast (http://www.shoutcast.com);
• Voice of America (http://www.voanews.com/english/portal.cfm);
• World Radio Network (http://www.wrn.org/audio.html).
The issue of exposing students to a variety of accents and pronunciation patterns, not
only following the geographical divisions but also taking into account such criteria as the
social class, the profession or the age, can be accomplished with some of the audio/video
archives available on the Internet. It is especially BBC’s Video Nation
(http://www.bbc.co.uk/videonation/) that might be a perfect example here, as the archive
enables materials search by geographical location, in this way accessing samples of specific
accents with a wealth of additional information about speakers. Other sources of varied audio
materials, containing samples of different accents though perhaps less flexible as far as
seeking specific speakers, can be found in the following audio/video directories:
1 According to the information provided by the Regional Examination Board (Czarnecka-Cicha, 2003, personal communication), the New Matura listening comprehension tasks are to be distinguished by standard British English pronunciation on the basic level (poziom podstawowy) and other geographical varieties of the language on the advanced level (poziom rozszerzony).
• Recordings of English sounds: New Zealand Pronunciation
(http://www.ualberta.ca/~johnnewm/NZEnglish/sounds.html), Evaluating English
Accents WorldWide
(http://www.otago.ac.nz/anthropology/Linguistic/Sounds/Sounds.html), The Speech
Accent Archive (http://accent.gmu.edu/browse.php).
• Audio and video resource centres: Archive.org (http://www.archive.org),
Creativecommons.org (http://www.creativecommons.org);
• Podcast directories: Podomatic (http://www.podomatic.com), Jellycast
(http://www.jellycast.com), Podcasts for educators, schools and colleges
(http://recap.ltd.uk/podcasting/), Odeo.com (http://odeo.com/listen), Archive.org
(http://www.archive.org/);
• Historical speeches and recordings: American Rhetoric
(http://www.americanrhetoric.com/), Library of Congress American Memory Project
(http://memory.loc.gov/ammem/browse/ListSome.php?format=Sound+Recording),
The History Channel
(http://www.historychannel.com/broadband/searchbrowse/index.jsp?searchtype=searc
h&searchterm=*&mediatype=Audio), Vincent Voice Library
(http://www.lib.msu.edu/vincent/);
• The military: Pentagon Channel (http://www.pentagonchannel.mil), DefenseLink
(http://www.defenselink.mil/transcripts/);
• Television: ITN (http://www.itn.co.uk), ITV (http://www.itv.com), CentralTV
(http://www.centraltv.co.uk), Channel 4 (http://www.channel4.co.uk).
2. Presentation
The teacher’s initial responsibility is to have as good a pronunciation as possible,
being an adequate model to be followed. For that, knowing the sounds, producing them in
daily speech, having the theoretical knowledge of how to produce them and consciously
applying this knowledge in one’s production, even to the point of overpronuncing, are
necessary for proper teaching. Expanding significantly on this knowledge, with the help of
such online resources as Fonetyka angielska w pigułce,
http://elex.amu.edu.pl/ifa/pigulka/start.htm, Sharon Widmayer's Sounds of English,
http://www.soundsofenglish.org/pronunciation/index.htm or John Maidment's Tip of the Day,
http://www.phon.ucl.ac.uk/home/johnm/eptotd/tiphome.htm, should lead the teachers to
greater awareness of articulation of sounds.
Verbal presentation of new sounds, even enhanced by the use of articulation
descriptions and audio recordings of specific sounds in isolation, needs to be supplemented
with the skilful use of properly selected visuals. It is not only the question of knowing the
place and manner of articulation, but, ideally, seeing these in practice, for better visualization
of the abstract concepts. To this end, using visual aids such as vowel charts (see
http://www.arts.gla.ac.uk/IPA/ipachart.html) or mouth diagrams
(http://www.stuff.co.uk/media/calcul.html,
http://www.soundsofenglish.org/pronunciation/sounds.htm) is to increase the impact of the
presentation. An extraordinary tool for presenting pronunciation of the sounds of English in a
visual way is Phonetics: The Sounds of American English
(http://www.uiowa.edu/~acadtech/phonetics/english/frameset.html), which uses JavaScript
animation to expose the articulation of the sounds in a mouth diagram, with video alongside
recordings of sample words.
Figure 2. Visual presentation of articulation at Phonetics: The Sounds of American English
(http://www.uiowa.edu/~acadtech/phonetics/english/frameset.html).
Similar opportunities for effective presentation of the sounds of English are opened by
Tutorpal IPA Java-script animated slideshows (http://www.tutorpal.com/slideshow/). Each
sound is provided with the animated articulation sequence, both in the lip view and the mouth
diagram, in the context of selected words. There is also the breath indicator and vocal cords
movement (for voiced/voiceless) information. As each sound can be pronounced by the male
and the female voice, one can get a greater range of articulation models. Extremely interesting
options are muting the voice, so that one can only see the articulation animated and pronounce
the sound themselves, changing the speed of articulation on the screen, and varying the words
to be pronounced.
Associating sounds with the graphic representation is obviously the key element of
pronunciation teaching, however, one must also make every effort possible to develop the
associations between the object designated and its phonetic representation. In this respect, it is
Ship or Sheep: minimal pair ESL pronunciation practice
(http://www.fonetiks.org/shiporsheep/) which comes highly useful, as the minimal pairs are
all illustrated with pictures of objects, and moving the mouse over the words has them
instantly pronounced.
Visual TTS, or synchronization of a facial image (a talking head) with synthesized
speech, provides realistic mouth movements matching the speech sounds to increase the
intelligibility of the speech, not only to hear the differences in e.g. English vowels, but also
enhance understanding by seeing the sounds being pronounced (Sobkowiak, 2003). CSLU
Speech Toolkit (http://www.cslu.ogi.edu/toolkit/), is a combination of speaker- and
vocabulary-independent recognition technology, Text-to-Speech synthesis and facial
animation software within an authoring environment for building spoken language systems.
The toolkit contains the feature of facial animation, with an animated 3D talking head called
Baldi, producing sounds automatically synchronizing natural or synthetic speech with realistic
lip, tongue, mouth and facial movements. Especially by making the talking head transparent,
the teacher can display the organs of speech from different perspectives to fully grasp the
production of sounds. Depending on the technological considerations, the teacher might either
install the Speech Toolkit on individual computers, use the teacher computer with the LCD
projector to give a whole-class presentation, or make screenshots of the sounds and put them
up on the class website/class Moodle or embed in Microsoft Word documents.
Figure 3. Baldi made transparent (after Sobkowiak, 2003).
The indispensable element of the presentation of new sounds is contrasting them with
the first language sounds. For that aim, a useful technique can be exposing students to
“foreign-sounding” speech, produced by people speaking with strong interference of the first
language on their L2. While searching for such speakers using some of the online audio
directories provided above may be successful, a much simpler (though inauthentic and
limited) way of generating such foreign-sounding speech might be to use free online TTS
demos, selecting a different speaker than the language input. In this way, the synthesized
speaker voice is used to pronounce foreign vowels, consonants and clusters, which, in many
cases, can provide interesting material for further pronunciation study.
English speaker-Polish text TTS Polish speaker-English text TTS
Nuance RealSpeak
(http://www.nuance.com/realspeak/demo/)
Nuance RealSpeak
(http://www.nuance.com/realspeak/demo/)
Acapela Group (http://demo.acapela-group.com/) Acapela Group (http://demo.acapela-
group.com/)
AT&T Labs Research
(http://public.research.att.com/~ttsweb/tts/demo.php)
AT&T Labs Research
(http://elvis.naturalvoices.com/demos/)
VLC’s Text-to-Speech Test Page
(http://www.edict.com.hk/texttospeech/TTStest.htm)
Bell Labs Text-to-Speech (http://www.bell-
labs.com/project/tts/voices-java.html)
Figure 4. Free online Text-to-Speech synthesis tools.
The presentation phase of pronunciation work needs to introduce students to the
phonetic script, in the receptive mode at least, so that they develop not only the grapho-
phonemic associations, but also learn to decode the phonetic alphabet symbols to denote
particular sounds of English. To this end, it might be necessary for the teacher to expose
students to the phonetic script in online dictionaries in the decoding process. However, it
needs to be pointed out that due to coding problems many of the online reference tools fail to
properly reproduce the phonetic alphabet, and lack of this knowledge can lead to
misunderstandings and misconceptions about the phonetic representation of words. Thus, it is
advisable for teachers to incorporate online dictionary consultation activities, where
specifically such tools as Cambridge Dictionaries Online (http://dictionary.cambridge.org)
will be used due to their perfect rendering of the phonetic script, while the use of other
dictionaries will be discouraged for the same reason.
3. Practice
The practice phase of pronunciation instruction takes learners from discrimination and
presentation to the production of new sounds. At the beginning, this should rely on imitation
in various modes of work: chorally, in smaller groups, finally individually to be corrected by
the teacher. The imitation of teacher speech or recorded models of sounds, words, sentences,
should be collected in digital form as explained above, so that students can work on repeating
and imitating individual sounds and words also on their own out of class, in extensive
pronunciation practice. For that purpose, it might be ideal to create pronunciation practice
webpages on the class website or the class Moodle, where for particular sounds students
would be provided with articulation explanations (or links to their visualizations at
http://www.uiowa.edu/~acadtech/phonetics/english/frameset.html or
http://www.tutorpal.com/slideshow/) and recordings of examples (teacher-made or linked to
relevant sections of Fonetyka angielska w pigułce,
http://elex.amu.edu.pl/ifa/pigulka/start.htm).
Providing that copyright restrictions are adhered to, the teachers might reflect on
manipulating recordings of sounds, words or sentences as for changing pitch, tempo, adding
silence, fading in or out, in order to suit the level of materials to the needs of the class. Such
operations are fairly easy to execute with most contemporary audio editors, and especially
Audacity (http://audacity.sourceforge.net) recommended here for ease of use and widespread
availability.
An alternative mode of work might be providing students with tasks in which they are
given the phonetic transcription of words to pronounce and confront with the online
dictionary recording. To this end, the teacher-made pronunciation worksheets should link to
relevant word entries with pronunciation transcripts at Cambridge Dictionaries Online,
http://dictionary.cambridge.org, while Merriam-Webster, http://www.m-w.com, for
recordings.
As the next step in the development of learners’ pronunciation skills, the teacher might
implement in-class learner speech recording with Audacity, for learners to listen to online
clips (e.g., pronunciation recordings of key words at Merriam-Webster dictionary), record
their own pronunciation, then play both and compare. This particular procedure might be
facilitated to the greatest extent if the teacher instructs students on how to mix tracks in
Audacity, so that individual learners could mix the original recording (with gaps) with their
own version of the pronunciation. In this way, learners might be encouraged to develop their
“inner criteria for correctness” (see Caleb Gattegno’s The Silent Way, Gattegno, 1972).
Dictations might be another interesting possibility to develop learners’ pronunciation
skills, and specifically the sound-spelling correspondence. An example of the procedure is
provided at Graded Dictations (http://www.fonetiks.org/dictations/), where English passages
are played four times: first, the whole passage is read at normal speed to listen for gist;
second, each phrase is read slowly twice, with punctuation, as students write; then the whole
passage is read again for work to be checked; and finally, the written text is shown for
students to compare and count their mistakes. While the use of ready-made dictations as the
ones provided at http://www.fonetiks.org/dictations/ or http://www.d-
oliver.net/pronunciation/dictations/menu.htm will surely be of significant help to many
teachers, it might be worth reflecting on preparing such materials on one’s own, following the
procedure below:
• Prepare a text in a word-processor;
• Record it with Audacity or use TextVoice Generator to create a synthesized
version;
• Save the sound file with a distinct name;
• Open HotPotatoes JCloze exercise module (http://hotpot.uvic.ca) and paste the
text;
• Gap the text for the words that are to be focused on in the dictation;
• Make a link to the sound file;
• Save the exercise and export it as HTML for viewing.
In this case, it needs to be made sure that both the sound file and the exercise file are in the
same folder, either of the class website or the Moodle. The use of an interactive authoring tool
such as HotPotatoes makes it possible for the dictation activity to be done in the self-study
mode, thus extending the impact of pronunciation instruction beyond the confines of the
classroom.
Some other ready-made pronunciation activities can be found at the sites given below.
While teachers might direct students to these in order to develop their knowledge and skills, it
must be remembered that the contents of the tasks do not always have to be suitable for
particular learners, especially as for the selection of words. Thus, teachers are advised to
reflect on creating similar activities using the toolkit containing an authoring tool
(HotPotatoes, http://hotpot.uvic.ca), an audio recorder (Audacity,
http://audacity.sourceforge.net), a TTS reader (TextVoice Generator,
http://guoshesen.51.net/download.html):
• Practice of individual sounds or combinations of them: Steve Chadwick's Diphthong
Calculator (http://www.stuff.co.uk/media/calcul.html), John Maidment's The Vowel
Machine (http://www.btinternet.com/~eptotd/vm/soundmachines.htm), Vowels and
Diphthongs of American and British English
(http://faculty.washington.edu/dillon/PhonResources/newstart.html);
• Practice of word-sound-transcription correspondence: John Maidment’s Transcriber
(http://www.btinternet.com/~eptotd/vm/transcriber/transcri.htm);
• Minimal pair work: Jim Duber's Minimal Pairs (http://www-
writing.berkeley.edu/chorus/call/cuttingedge.html), Minimal Pairs for English RP
(http://pages.britishlibrary.net/marlodge/wordlist/), Adam Rado's English Learning
Fun Center (http://www.elfs.com and
http://www.elfs.com/SAY%20WHAT/Say%20Whatx.html), American English
Pronunciation Practice (http://www.manythings.org/pp/), ESL Gold’s Sounds and
Distinctions (http://www.eslgold.net/pronunciation/sounds_distinctions.html);
• Interactive listen-and-type exercises: Takako's Great Adventures
(http://www.faceweb.okanagan.bc.ca/tga/), Sounds of English
(http://www.soundsofenglish.org/activities/ex1.htm);
• Homophones/homographs/similar sounding words: John Higgins’ Homepage
(http://pages.britishlibrary.net/marlodge/), English Pronunciation
(http://international.ouc.bc.ca/pronunciation/), Dennis Online Learning Page
(http://www.d-oliver.net/pronunciation/dictations/menu.htm), Fonetiks.org
(http://fonetiks.org/difficult.html);
• Pronunciation practice of most common words: ESLgame.com Quizzes and Tests
(http://www.eslgame.com/eslgame/study/words/words.asp).
4. Production
The final phase of pronunciation work, production, can be distinguished from practice
by its much greater focus on meaning and content, being less characterized by discrete
emphasis on particular sounds. Producing sounds in various types of communicative activities
that will naturally involve the use of certain words and sounds should ensure smooth
integration of more explicit knowledge about the articulation of sounds into the learner’s
language inventory. Once a lot of input for reception has been given to students, mainly
focused on isolating and discriminating sounds, more favourable conditions for the use of
particular sounds in meaningful production and interaction will be created.
The activities for this phase might involve the use of pictures, such as associating
pictures with sounds, talking about pictures using words with a given sound, question-answer
about the things in the pictures using a given sound. Such collections of pictures can be found
in Microsoft Office ClipArt resources, as well as through online graphics search engines, both
copyrighted (http://www.google.com/images; http://www.altavista.com/image;
http://www.ditto.com/; http://www.picsearch.com/; http://multimedia.lycos.com/;
http://www.webplaces.com/search/; http://www.search-engine-index.co.uk/Images_Search/)
and free-domain ones (http://www.zettweb.com/grab-a-picture/;
http://www.deleage.com/en/5000_photos.htm; http://www.pdimages.com/;
http://pics.tech4learning.com/; http://www.peacegallery.wyenet.co.uk/; http://gimp-
savvy.com/PHOTO-ARCHIVE/).
Various teacher-made materials that aim at enriching the classroom with words
containing particular sounds might involve tongue-twisters (Adam Rado's Mouth Manglers,
http://www.elfs.com/MMz.html, Pizzaz! Tongue Twisters,
http://darkwing.uoregon.edu/~leslieob/twisters.html, 1st International Collection of Tongue
Twisters, http://www.uebersetzung.at/twister/en.htm, The Tongue Twister Database,
http://www.geocities.com/Athens/8136/tonguetwisters.html, Thinks.com Tongue Twisters,
http://thinks.com/words/tonguetwisters.htm), limericks, teacher- or student-made poems,
rhymes and chants (facilitated by such rhyming sites as Rhymezone,
http://www.rhymezone.com or Rhymer, http://www.rhymer.com). Such materials can be
geared specifically at certain sounds (as exemplified at
http://international.ouc.bc.ca/pronunciation/twisters02.htm), yet at the same time can be
skillfully interwoven thematically, structurally and lexically with the curriculum.
The integration of Text-to-Speech synthesis tools into the learner’s toolkit in the
receptive phase can enable pronunciation practice while reading online materials of various
kinds. Such an example of receptive work enhanced with pronunciation practice is displayed
in song lyrics hyperlinked to sound files (http://fonetiks.org/mom/), where “mousing over”
pop lyrics will have them performed. Ideally, however, it should be up to the learner to decide
which contents will be read and pronounced alongside reading. To this end, it might be
interesting to enrich the learning environment by equipping the Internet lab browsers with
VLC Text-to-speech Web Narrator (http://www.edict.com.hk/texttospeech/ttsBrowser.htm).
This particular program has the functionality of a Web browser but is enhanced with the
following TTS options: synthesizing the highlighted piece of text, reading the whole website
text aloud and displaying it in a "speech balloon", looking up any word on a webpage directly
in the VLC Net Dictionary together with its TTS sound file to play its pronunciation, finally
recording one's own pronunciation of a sound, word or sentence and playing it back to
compare with the original. This enhanced Web browser may become a highly useful tool for
learners using the Web and needing to get instant help of dictionary lookup and pronunciation
practice.
Figure 5. VLC Text-to-Speech Web Narrator
(http://www.edict.com.hk/texttospeech/ttsBrowser.htm).
Creating and performing dialogues which by their topic will involve the production of
specified sounds, intonation patterns or word/sentence stress can be triggered by listening and
imitating models provided in the receptive phase, based on some of the sources for online
audio as enumerated above. In the beginning stages of learning, students might use TTS to
fully generate dialogs, which can be later imitated. Such a service, provided by Text-to-
Speech Dialog Wizard and Text-to-Speech Interactive Dialog Wizard
(http://www.edict.com.hk/texttospeech/ttswizard.htm and
http://www.edict.com.hk/texttospeech/ttswizard2.htm respectively), will enable creating a
conversation dialog with up to 10 conversational moves with different voices, and the HTML
file generated can be saved and used on one’s PC or website. The resulting dialogue is sent to
the speech engine, utterances are recorded as separate .wav files, and the whole is returned as
an HTML page with all the audio files embedded.
The role of the development of pronunciation in authentic language interaction is not
to be underestimated, as the teacher needs to observe and diagnose problems which might
occur in meaning-focused exercises. The reflection on the quality of speech produced by
students, especially during the online exchanges using various conferencing tools is of prime
importance for the effective consolidation of theoretical knowledge with communicative
skills. Thus, it is advisable for teachers to use audio recorders such as Audacity to record
computer conferences for later analysis and remedial teaching.
Pronunciation-oriented online teaching
Taking into account a much bigger role of pronunciation than grammatical correctness
in communication, both direct and indirect instruction can be implemented in the Web-based
environment, working along the directions outlined above. When dealing with the role of
online pronunciation training in Web-based instruction, the following types of paradigms can
be followed:
• integrated: when pronunciation is blended with grammar and lexis;
• remedial: when a given phonetic issue is revised;
• practice: when solely pronunciation is practised (Kelly, 2000: 14);
• whole lesson: especially useful when teaching connected speech and intonation
patterns;
• discrete slots: when pronunciation is practised in limited amounts throughout the
lesson;
• integrated phases;
• opportunistic teaching: teacher's remarks about a problematic issue made when a
need arises (Harmer, 2002).
Thus, depending on the specific needs of the class and the curricular requirements, the teacher
might plan and execute Web-based pronunciation activities in one of the modes above, either
as a follow-up activity of other skills (especially in the post-listening and reading phases) or
as preparation for speaking activities.
The application of TTS technology as outlined above can be seen in the integrative
lesson outline as described by Shesen (2002). In the task focusing on reading and listening
comprehension the teacher first plays the text without students looking at it, to understand and
answer general comprehension questions. When playing the text once more, the teacher
manipulates with the reading speed, to make sure that students get the detailed information
needed, until they are able to follow every single word. In the next phase the teacher shows
the contents (with the "Hide/Show" button) and students analyze the text with new or key
words highlighted and explained. Students practise the pronunciation of these by listening to a
relevant sentence, repeating one's own pronunciation and comparing with a model sentence.
Finally, the teacher makes students read the whole text, increasing their reading speed until
they achieve fluency.
Conclusion
Computer-Assisted Pronunciation Teaching (CAPT) is a huge area of research and
practice, with a great number of extremely sophisticated commercial products and more and
more advanced solutions in the sphere of speech recognition and synthesis. However, the
discussion in the present work focused on the ways of enhancing pronunciation teaching in
the Web-based environment, both in-class and self-study, with the tools and techniques that
could be readily implemented by every single language teacher with no additional expenses or
extra technical expertise. It is to be hoped that soon teachers will become confident in the use
of simple TTS and audio editing tools such as the ones outlined above, and the
implementation of more sophisticated programs, with Visual TTS and talking heads, will help
raise the quality of EFL pronunciation teaching.
References
Gattegno, C. (1972) Teaching Foreign Languages in Schools: The Silent Way. New York: Educational
Solutions.
Godwin-Jones, B. (2000). Speech technologies for language learning. Language Learning & Technology, 3 (2),
January 2000, 6-9, http://llt.msu.edu/vol3num2/emerging/index.html.
Harmer, J. (2002). The Practice of English Language Teaching. Harlow: Pearson Education.
Kelly, R. (2000). How to Teach Pronunciation. Harlow: Pearson Education.
Komorowska, H. (2002). Metodyka nauczania języków obcych. Warszawa: Fraszka Edukacyjna.
Shesen, G. (2002). Simulated English native speakers. Teaching English with Technology, 2 (1), January 2002,
http://www.iatefl.org.pl/call/j_tech7.htm.
Sobkowiak, W. (2003). TTS in EFL CALL – Some pedagogical considerations. Teaching English with
Technology, 3 (4), October 2003, http://www.iatefl.org.pl/call/j_article15.htm#sob.
Ur, P. (1996) A Course in Language Teaching. Cambridge: Cambridge University Press.