Upload
tranthien
View
214
Download
1
Embed Size (px)
Citation preview
A HXGH PERFORMANCE SOFT KEYBOARD FOR MOBILE SYSTEMS
A Thesis
Presented to
The Facuity of Graduate Studies
of
The University of Guelph
BY
S H A W XIA ZHANG
In partial fulfillment of requirement
for the degree of
Master of Science
February, 1998
O Shawn Xia Shang, 1998
National Libtary Bibliothèque nationale du Canada
Acquisitions and Acquisitions et Bibliographie Services seMces bibliographiques 395 Weiiington Street 395. nre Wellingtori OttawaON K I A M Ot&wa ON K1A ON4 canada Canada
The author has granted a non- L'auteur a accorde une licence non exclusive licence aIlowing the exclusive permettant à la National Library of Canada to Bibliothèque nationale du Canada de reproduce, loan, distribute or sell reproduire, prêter, distribuer ou copies of this thesis in microform, vendre des copies de cette thèse sous paper or electronic formats. la fome de microfichelnlm. de
reproduction sur papier ou sur format électronique.
The author retains ownership of the L'auteur conserve la proprié,te du copyright in this thesis. Neither the droit d'auteur qui protège cette thése. thesis nor substantial extracts fiom it Ni la thèse ni des extraits substantiels may be printed or otheMrise de celle-ci ne doivent être imprimés reproduced without the author's ou autrement reproduits sans son permission. autorisation.
ABSTRACT
A HIGH PERFORMANCE SOFT KEXBOARD FOR MOBILE SYSTEMS
S h a w X. Zhang
University of Guelph, 1998
Supervisor:
Professor 1. S. MacKenzie
The major issues in text entry on mobile cornputer systems are presented. Two formai
usability experiments were conducted to explore particular problems in the design and
usability of high performance soft keyboards.
An experiment with 12 participants compared text entry rates on large and small sofi
keyboards with QWERTY layouts. Participants were also given a standard touch typing
test. The resuits showed that touch typing skills on physical keyboards do not necessarily
transfer to stylus tapping on a soft keyboard. Participants touch typing speeds ranged
fiom 19 to 74 wpm, whereas their stylus tapping speeds ranged from 15.4 to 25.0 wprn
on a large sol3 keyboard and from 13.1 to 26.0 wprn on a small soft keyboard. The
correlations were low: .416 for the large soft keyboard and .523 for the smdl soft
keyboard.
We also designed a completely new high performance sofi keyboard, based on a
linguistic model, Fitts' law, and other rnodels. The new design supports a predicted
maximum text entry speed of 58.2 wpm. This is about 35% faster than with a QWERTY
layout (at a maximum of about 43.2 wprn).
A second longitudinai text-entry experiment with five participants was conducted to
compare our hi& performance soft keyboard with a QWERW sofi keyboard. The
results showed that, with practice, users can perfom better with the new design. The
crossover point occurred at the 1 0 ~ session of the 20-session experiment. This
corresponds to about 3 hours of practice. The error rates were significantly Iower with
the high performance layout than with the QWERTY layout, and this was consistent
throughout the experiment. Over the coune of the experiment, mean entry speeds
improved, finishing at 44.3 wpm for the high performance layout design, and 40.3 wpm
for the QWERTY Iayout. A regression model based on the power law of leamhg
predicts that at session 50, the mean entry rates would reach 60.7 and 44.8 wpm, for the
high performance and QWERTY Iayouts, respectively. These are very close to the rates
predicted in our model.
To Hua, Lili and Our parents.
Acknowledgement
1 wish to thank my thesis advisor Dr. Scott MacKenzie for his knowledgeable and skillfùl
guidance and supervision during my study. It was his knowledge, enthusiasm and
achievement in human computer interaction that made me interested in the area of HCI.
Working with him was also a great pleasure.
1 would also like to thank Dr. Fangju Wang, for serving in my supervisory comrnittee.
Many thanks to Dr. Tom Wilson who chaired my thesis examination comrnittee.
Thanks are given to the members of the Input Research Group - Guelph, Input Research
Group - Toronto, and al1 who have contributed in this research, in big way or small way.
Thanks are aiso given to my fellow graduate student and fnend William Soukoreff, for
sharing his knowledge and helping me in many ways in my study.
My greatest thanks go to my wife Hua Shen for her love and unmatched support, patience
and encouragement. My daughter Lili brightened my days with her love and cheers, and
sometimes a littie rnischief. Thanks to my parents for dways being there with me.
Table of Content
Acknowledgement ............................................................................................................. i . . Table of Content ................................................................................................................. ii
List of Figures .................................................................................................................... iv ....................................................................................................... Chapter 1 . Introduction 1
. . 1 1. Mobile Computing ..................................... ... 1 7 1.2. Text Entry with MobiIe Systems ........................................................................... -
1.2.1. How Pen-Based Computers Work .................................................................... 4 . . . . 1.2.2. Handwntmg Recognition .................................................................................. 5 1.2.3. Physical Keypads and Sofi Keyboards ............................................................. 5
1.2.3.1. Physical and Safi Keypads ...................................................................... 6 1.2.3.2. Soft Keyboards ......................................................................................... 9
.............................................................................................. 1.3. Focus of this Work 11 ............................................................................................ Chapter 2 . Literature Review 12
2.1. Text Entry Using Recognition Software ............................................................. 12 2.1.1. Character-Based Recognition ......................................................................... 13 2.1.2. Word-Based Recognition ............. .... ........................................................... 16
2.2. Text Entq Using Sofl Keyboards ..................................................................... 17 CC 97 . 2.3. The Elusive Crossover Pont ............................................................................ 23
............................................................................ 2.4. ModeIs ,., ............................ 24 2.4.1. The Linguistic Mode1 and Space Key Extension ............................................ 24 2.4.2. Fitts' Law ......................................................................................................... 26
................................................................................ 3.4.3. Visual Scan Time Mode1 28 ................................................................................. 2.4.4. Saccadic Eye Movement 29
2.4.5. Soukoreff and MacKenzie's Optimal Angle Mode1 ........................................ 30 ............................................................................................ 2.4.6. Key Repeat Time 32
Chapter 3 . The Design of Soft Keyboards ...................................................................... 34 3.1. Issues Related to Sofi Keyboard Design ............................................................. 34
3.1.1. Visual Scan: Fixed vs . Random .................................................................... 34 3.1.2. Ski11 Transfer from Previous Experience ....................................................... 35 3.1.3. Size: Large vs . Small ................................................................................... 36
............................................................................................... 3.1.4. Leaming Mode1 36 3.1 .5 . Possibilities for Alternatives ........................................................................... 37
....................................................................................... 3.2. Keyboard Optimization 38 3.2.1. Models ............................................................................................................. 38 3.2.2. Extra Design Concems ................................................................................... 39 3.2.3. Current Designs and Speed Predictions .......................................................... 40
3.3. The Optimal Soft Keyboard - OPTI ................................................................... 41 ............................................................................................. . 3.3.1 Design Rationale 41
3.3.2. The OPTI Design .......................................................................................... 43 Chapter 4 . Experiment 1 ................................................................................................... 44
4.1. Method ................................................................................................................. 44 . . 4.1.1. Participants ...................................................................................................... 44
4.1.2. Apparatus ........................................................................................................ 44 ............................................................................................................. 4.1.3. Design 48
4.1 .4 . Data Collection ............................................................................................... 49 4.1.5. Procedure ..................................................................................................... 50
4.2. Results and Discussion ........................................................................................ 52 4.2.1. Visual Search Time on a Soft Keyboard ......................................................... 52 4.2.2. Ski11 Transfer fiom Touch Typing to Stylus Tapping .................................... 53 4.2 .3 . Text Entry Speed ...................... ... . ..,.,. ...................................................... 54 4.2.4. Error Rates ............................... ... .... ..................................................... 57
...................................................................................................... 4.3. The Next Step 58 Chapter 5 . Experirnent 2 .............................. ..... .............................................................. 59
5.1. Method ................................................................................................................. 59 . . 5.1.1. Partmpants ..................................................................................................... 59 5.1.2. Apparatus ........................................................................................................ 59
............................................................................................................. 5.1.3. Design 61 .............................................................................................. . 5.1 .4 Data Collection 62
...................................................... . .......................... 5.1 .5 Procedure .... . ...... 63 5.2. Resuits and Discussion ........................ .. .......................................................... 65
5.2.1. Text Entry Speed - The Leaming Curve ........................................................ 65 .................................................................................................... 5.2.2. Error Rates 68
.................................................................................. 5.2.2.1 . Cumulative Errors 68 .......................................................................................... 5.2.2.2. Chunk Errors 70 .......................................................................................... 5.2.3. Use of Space Keys 72
...................................................................................................... Chapter 6 . Conclusion 76 Bibliography .................................................................................................................... 79
...................................................................................................................... Appendices -87 A . Single Letter Frequencies of Comrnon English .................................................. 87
.................................................. B . Digraphs Involving Ten Most Frequent Letters 88 ............................................................................... . C . General Description of V 1 0 90
D . Listing of phrases used in expenment one .......................................................... 93 E . Sample text used in the typing test in the experiments ....................................... 94 F . Listing of phrases used in experiment two .......................................................... 95
................................................................................ G . General description of V2.0 96 ........................................ H . Calculating the theoretical MT of a randorn keyboard 98
List of Figures
Figure 1 3Com's PamZPiZot (a) and Microsoft's PaZmPC @) ...................................................... 2 Figure 2 Nokia's 9000 Series Communicutor ............................................................................... 2 Figure 3 A hypothetical keypad (a) .............................................................................................. 7 Figure 4 A hypothetical keypad (b) .............................................................................................. 8 Figure 5 Single-stroke irnplementation of the Roman alphabet (a) Unisfrokes (b)
Graffiti ..................................................................................................................... -14 Figure 6 A Q WERTY layout .................................................................................................... 1 8 Figure 7 An ABC layout ............................................................................................................ - 1 8 Figure 8 JustType keyboard ........................................................................................................ 19
...................................................................... Figure 9 A Dvorak layout showing alphabet only 19
...................................................................... Figure 10 A Fitaly layout showing alphabet only 19 ........................................................ Figure 1 1 A telephone keypad with letters Q, Z and Space 20
Figure 12 The elusive crossover point ........................................................................................ 23 Figure 1 3 The reciprocal tapping task on two targets with width ( W) and
............................................................. amplitude (A) for establishing the Fitts' law 27 Figure 14 Performing a Fitts' law tapping task ........................................................................... 27
........................................................................ Figure 15 optimal angle model: expert behavior 31 ................................................................. Figure 16 An approximation of optimal angle mode1 1 . . .............................................................................................. Figure 1 7 Distnbution of digraphs 33
............................................... Figure 18 The OPTI Keyboard for text entry on mobile systems 43 ...................... .................................................................... Figure 19 Control panel of V 1 . 0 ... 46
......................................................................... Figure 20 A session with the large fixed Iayout 47 ............................................................... Figure 21 A session with the large randomized layout 47
......................................................................... Figure 22 A session with the small fixed layout 48 .............................................................. Figure 23 A session with the srna11 randomized layout 48
Figure 24 Text entry speed ........................................................................................................ -55 ................................................................................................. Figure 25 Control panel of V2.0 60
............................................................. Figure 26 A session with OPTI layout in Expenment 2 64 .................. .............................. Figure 27 A session with QWERTY layout in Experiment 2 .. 65
Figure 28 Performance over sessions . (a) Entry speed (b) Learning curves and th .................................................................................... extrapolations to 50 session 66
......................................................................... Figure 29 Mean cumulative error over sessions 69 Figure 30 Mean chunk error over sessions ................................................................................. 71 Figure 3 1 Use o f optimal spaces with O P n over sessions ..................................................... ....73
...................................................... Figure 32 Theoretical and actual optimal use of space keys 74 Figure 33 A five by six version of the optimal keyboard ........................................................... 78
............. .............................. Figure 34 A Keyboard without fixed letter-key assignments .......... 98
Chapter 1. Introduction
Since the inception of pen-based computing in the early 1990s technology in this area has
advanced steadily. As more technologies are available and more developers are involved in
research and development, both hardware and software have continually progressed towards
products with better performance and higher user acceptance.
1.1. Mobile Computing
Pen-based computing systems inciude personai digital assistants (PDAs), personal information
managers (PIMs), slates, and other pen-based and handheld products.
More important, however, is the shifi in attention from the limited idea of '"pen-based
computing" to the more general notion of "mobiie computing". The latter encompasses a vast
temtory, including not only PDAs, PIMs, slates, and other traditional pen-based products, but
also cellular phones, pagers, remote controls for consumer products. or future "convergent"
products for interfacing to computing, cable, telephone, consumer electronics, and intemet
systems.
Compared to systems just a few years ago, today's mobile systems have more computing
power and functionality. The devices are both lighter and more rugged, and they are
ergonornicaily designed. Above d l , hand writing recognition has improved tremendously in
recent years.
The development of pen-based systems is as diversified as desktop systems. Different
processors and operating systems are found in different products. Examples of products are
3Com's Palm Pilot (Figure 1 a), Sharp's SE-500, HP's ZOOM, Psion's Series 3c, MicrosofYs
PalmPC (Figure 1 b) and Nokia's 9000 Communicator (Figure 2). Examples of operating
systems are Palm OS, Magic Cap and Microsoft Windows CE.
(a) (b) Figure 1 3Comfs PamZPilot (a) and Microsoft's PalmPC (b)
Figure 2 Nokia's 9000 Series Communicator
1.2. Text Entry with Mobile Systems
The rapidly expanding market for mobile cornputing systems has elevaied to the fore a dificuit
challenge: the need to manually enter alphanumeric information.
For mobile systems, the full-sized QWERTY keyboard has either disappeared, or has shnink to
such a size that touch typing with ten fingers becomes impossible. In many situations with
mobile computers, users need to hold the computer with one hand or both hands. Here, the
main input channel is reduced fiom ten fingers to as few as one figer. The term "one fmger"
is used loosely here because the "finger" can sometirnes be replaced by a stylus or a pen. (It
can even be extended to eye-tracking input where the only input channel is the focal point).
Without a Ml-size physicai keyboard, these systems resort to altemate techniques for text
entry, such as handwriting recognition, tapping on a soft keyboard, or a miniature physical
keyboard that can no longer facilitate touch-typing. Interacting with the system using Pen-
based technology brought forth both a new technique and new challenges.
Indeed, one of the attractions of pen interfaces is the prospect of using familiar pen and paper
skills for interacting with computers (Frankish et al. 1995). Besides functioning as a mouse,
Pen input has two additional advantages: a) manipulation is directly under users' focus. and b)
the keyboard and mouse are converged into one input device. This is translated into ease of
annotating and drawing; and the possibility of writing text information into the computer.
Even by today's standards, pen input is still simpler to implement and is more effective than
other input means such as voice recognition or eye tracking, which are not of prirnary interest
in this research.
1.2.1. How Pen-Based Computers Work
Pen-enabled displays in most of today's mobile computea are formed with two layers: a
display layer that displays the information, and a digitizing layer which either overlays or
underlays the display layer. The digitizing layer detects the pen's action over it: position,
movement, pressure or other signals sent by the pen or sensed by the layer, such as angle of the
pen or the click of the side burton on the barre1 of the pen.
When two layers are perfectly aligned on top of each other, user perceives one display that
"reacts" to the pen input. The computer, however, has several jobs to complete behind the
scene.
It has to detect any action by the pen, process the signals and display results back to the user
with minimum Iag. It is up to the software or hardware to display any interactions between the
pen and the computer. In other words, "writing" on a computer's display does not necessarily
leave an "ink trail" on the screen: a difference fiom writing with a normal pen and paper. In
translating pen input into text information, a field called handwriting recognition emerged.
Pen input fares very well for simple painting, selecting and annotating. When it cornes to
entering texhial information into the computer with a Pen, two techniques are commonly used:
handwriting recognition or so fi keyboards. These are described in the following sections.
1.2.2. Handwriting Recognition
One clarification is due; namely, that handwriting recognition is not a synonyrn for pen-based
cornputing or mobile cornputing. Handw-riting recognition is ody a subset of these. However,
the poor performance of handwriting recognition in the early 1990s greatly hindered the
acceptance of pen-based systems.
With early systems, daims of unconstrained, cursive hand-written input did not meet the
expectations of a demanding user community. Today, handwriting recognition products are
much improved (Blickenstorfer, 1996) and the support for languages other than English is
emerging. The pen-based systems, particularly PDAs and PIMs, are much better received
today than in the early 1990s.
Handwriting recognition can be implemented at either the sohvare or firmware level.
Recognition "engines" come in many varieties. Recognition performance can be enhanced by
exploiting context, dictionaries, constraining the symbol set @y using "modes". for example),
custornizing for specific users or training. Detailed discussion of different schemes in
handw-riting recognition is found in Chapter 2.
1.2.3. Physical Keypads and Soft Keyboards
Besides using handwriting recognition, text information can be input into mobile systems
through a soft keyboard or a physical keyboard. Mobile systems that support handwriting
recognition generally include a soft keyboard as well.
1.23.1. Pfiysical and Soft Keypads
A keypad is usually a small set of keys, either physicdly built in or irnplemented as software.
A keypad can be used to enter text. There is no clear line between a keypad and a keyboard.
Generally the number of keys found on a keypad is less than on a keyboard. Multiple letters
and sometimes, the rnix of letters and nurnbers are assigned to a key on a keypad. One
question with a keypad therefore, is how to distinguish what is intended when a key is hit.
Several schemes can be used:
By entering extra information to explicitly indicate the desired letter. For example, when
each key is assigned with three letters, users can hit a key once to select the first letter.
twice for the second, and so forth. But the interval between the multiple hits for letter
selection m u t be clearly distinguishabie fkom the interval between different letters. This
makes it dificult for users with different motor skills to use the device. It also leaves less
room for errors. For example, to enter a word ' T A Y on a keypad as shown in Figure 3,
the following steps must be taken.
1. Quickly tap the key labeled "ABC" three times to select letter "C",
2. Brief pause until "C" is registered,
3. Tap once on the key labeled "ABC" to select letter "A",
4. Brief pause until "A" is registered,
5. Quickly tap on the key labeled "STU" two times to select latter "T",
6. Brief pause until "T" is registered.
Figure 3 A hypothetical keypad (a)
If the keys also have numbers assigned to them, like a telephone keypad (Figure 4), then users
can tap a key once to select the set of letters and then tap a numeric keys to indicate the
position of the desired letter in the set. The following steps are needed to enter "CAT':
1. tap on key " 1 " to select "ABC",
2. tap on key "3" to indicate Ietter C's position,
3. tap on key "1" to select "ABC",
4. tap on key "1" to indicate letter A's position,
5. tap on key "7" to select "STU",
6. tap on key "2" to indicate letter T's position.
Figure 4 A hypothetical keypad (b)
This scheme requires that users switch back and forth between identiQing a letter on a key and
identiQing the position of the letter by identifying a numenc key.
The advantage of this method is that the timing is not cntical and every letter is composed of
two taps. The user can relax and take time to enter the text without worrying about the
tirneout.
Both methods above are explicit. Substantial learning and numerous steps are required for
even a smaii text-entry task.
Another method uses disambiguation to ease the cognitive load fiom users. Users only
have to tap on keys that contain the desired letters. An algorithm in the background catches
the sequence, finds the most likely word for the sequence, and then proposes the word(s) to
the user for approval. This will rely heavily on the disambiguation algorithm for accuracy
and for reducing user fnistration. For example, to enter the word "CAT" on a hypothetical
keyboard (Figure 4), a user only has to tap "1 ", " 1 ", and "7". Disarnbiguation choices
include '%AT7, "BAT' and "ACT' and possibly others. Extra step(s) will be required to
select the word from the given list. If the htended word is not on the list of choices given7
the scheme has to allow for explicit construction of words such as "ABU" (the little
monkey in Disney's Aladdin). This method must be learned at an extra cost of time and
effort.
Using a keypad to enter the alphabet can reduce the physical space required (either a physical
keypad or screen space for a software keypad). The greatest problem is usability.
The reverse of the above, however, is much easier. Using a well-known name to code a
sequence of digits has been widely used. An example is a phone number 1-800-UOGUELPH
(Don't call, this number does not exist as this thesis is written).
1.2.3.2. Soft Keyboards
A soft keyboard is a keyboard image displayed on the system's screen. It is tapped on with a
Pen. Entering text with a soft keyboard is similar to selecting a button in a dialog box. No
speciai ski11 is required except the simple point-and-click mouse action. In the case of a pen, it
is a "point-and-tap" action.
There are several advantages in using soft keyboards:
1) There is no ambiguity as long as each key is assigned only one letter per mode. Different
letten c m be assigned to the same key (as with physicai keyboards) and can be accessed via a
mode shift. Becaw keys can be assigned different captions through software, clutîered
prktings sometimes found on physical keyboards can be avoided.
2) They do not have to be visible al1 the tirne, as opposed to a physical keyboard. Compared
with physically attached keyboards, soft keyboards do not add extra weight to the device.
They can be activated when needed and hidden when finished.
3) They are easier to implement and consume fewer system resources compared with
handwriting recognition.
4) They do not Wear and tear and are easily upgradable.
However, there are several disadvantages of the sofi keyboards:
1) They take screen real estate when activated. An economicai problem here is that. since the
soft keyboard will likely be used with one frnger or pen, we only need to access one letter at a
time. But a soft keyboard has to display at least a set of keys at the sarne time and, therefore, at
any time, most of these keys are not used.
2) They require a minimum key size to be operable. If the keys are too small. not only will
they be hard to recognize, but they will also be difficult to tap. Performance might suffer with
a smaller keyboard. Other factors may also affect the usability of a small soft keyboard, such
as screen resolution, ambient lighting, user's vision and user's hand-eye coordination.
10
3) They generally lack feedback, especially tactile feedback. Most pen-based display panels
are pressure sensitive. Ody a small pressure is required to activate a key. Most of the soft
keyboards rely on visud feedback - a visual "button-down" effect or reverse video - when a
key is tapped. As well, usen c m not sense the edge of a key with the fmger or pen. Finally.
eyes-free operation with a soft keyboard is not possible.
However, the advantages of soft keyboards hold the potential of supporting the easiest and
fastest way of entering text uifonnation into a mobile system, despite the drawbacks.
The most cornmoniy adopted layout for soft keyboard is QWERTY. The QWERTY layout
was designed to slow down typing speeds to reduce mechanical jam on typewriters (Potosnak,
1988). There is no obvious reason why it is the layout of choice for soft keyboards. We will
present a full discussion on this in the following chapten.
1.3. Focus of this Work
This research is focused on the issues in entering textural information into a cornputer system
using soft keyboards. These issues include the following:
Theoretical and empirical models related to soft keyboard design and usability;
Prediction of text entry rates with soft keyboards;
Exploring the problem of optimizing a keyboard layout for maximum entry rates;
Testing the usability of the optimal soft keyboard with text entry tasks.
Chapter 2. Literature Review
The notion of "text" here refers to either or both alphabetic and nurneric information. The
alphabetic information is language specific and, hereby, only refers to the English ianguage.
However, most of the models and rules will apply to other languages as well.
Advances in technology enables us today to have a choice of using either keyboards or
handwriting recognition to interact with cornputers. We also have a better understanding of
when and where handwriting recognition will be more appropriate than a keyboard.
2.1. Text Entry Using Recognition Software
To implement handwriting recognition, a system must fint be capable of accurately capturing
the x-y coordinate data of pen-tip movement. This development in the late 1950's precipitated
considerable activity in on-line handwriting recognition. This intense activity lasted through
the 1 9601s, ebbed in the 197û's, and was renewed in the 1980's (Tappert et al., 1990).
Generally, handwriting recognition cm be categorized as on-line and off-line recognition. The
distinction lies on two factors: when to capture the data and when to recognize the data.
On-line recognition usually captures data irnmediately and recognizes it as soon as the data are
captured. Off-line recognition captures the data immediately and fmishes with recognition
days, months, or even years later. It can also be that the data are captured at a later time.
(Optical character recognition or OCR, is a typical off-line recognition process).
12
An ideal on-line handwriting recognizer should match user input in both speed and accuracy.
It should not introduce any noticeable lag and should always work at user's pace. At the same
thne it should deliver satisfactory recognition accuracy. However, the ideal may never arrive:
one can always scribble something worse than a physician's prescription.
There are several ways to categorize handwriting recognition technology: some recognition
engines are character-based while others are word-based. Most begin recognition
instantaneously but some use deferred recognition. Some rely on printed characters whereas
others attempt to recognize cursive writing (Blickenstorfer, 1997). Many recognizers make no
effort to force users to provide manageable input but others have invented a set of symbols that
users m u t use to enter text (MacKenzie and Zhang, 1997).
2.1.1. Character-Based Recognition
Character-based recognition makes no effort to recognize words. It relies on the user to input
high quality writing. It also relies on constraining the possible characters in a small set. Mode
shifts are inevitable. For example, if user scnbbles a letter "C", it is dificult for a character-
based recognizer to know if it is in upper case or lower case without first being provided with
an additional constraint.
Another difficulty in implementing recognition algorithms for handwriting or hand printing is
known as the "segmentation problem". This occurs because some letters, or symbols, are
composed of multiple strokes. This c m also be thought of as the problem of "when-to-
recognize".
Several solutions are used for the "'when-to-recognize" problem. These include using a delay
tirneout, limiting the input in boxes or a comb-shaped entry line, or inventing a new stroke
alphabet in which each letter is created with a single stroke.
A single stroke, in this sense, is a continuous gesture of any shape created in one action. The
stroke begins when the pen touches the writing surface of the system and ends when the pen is
raised. This greatiy simplifies recognition because the segmentation problem is avoided.
Examples of single-stroke alphabets are Unistrokes (Goldberg and Richardson, 1993) invented
at Xerox PARC and Grami (EHickenstorfer, 1995) by Palm Cornputhg (Figure 5). Graflti
has been built in to several PDA's including 3Com PulmPilot and HP OmniGo 100. Another
commercial product Gesture Mosaic by Mosaic Input Technologies clairns to be "Virtually
100% accurate". "Each character has a unique single stroke for fast input" and can achieve text
entry speed of "up to or greater than 40 wpm" (Gesture Mosaic advertisement). No formal
usability studies have been reported on Unistrokes and Gesture Mosaic.
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Figure 5 Single-stroke implementation of the Roman alphabeti
(a) Unisrrokes (b) GraDti
' The black dot indicates the starting position for each stroke.
14
The major concern with handwriting recognition is recognition accuracy. Evaluation of
character-based recognizers is relatively easy. Each letter is either correctly recognized or not.
Evaluation of word-based recognizers can be more difficult depending on how accuracy is
evaluated.
A usability study of Grami showed that after only 5 minutes of learning (alphabet only) an
accuracy of 97% could be achieved. Resuits also showed that participants in the study
demonstrated complete ski11 retention on Graflti following a one-week lapse without further
practice (MacKenzie and Zhang, 1997).
Handwriting recognition usability is of concem not only for alphabetic entry, but also for
numenc entry. MacKenzie et al. (1994) tested four different methods for numeric entry.
Although the data were nurneric, entry speeds were calculated using the commonly accepted
five-character-per-word formula for text entry. Tapping on a soft keypad was the fastest (30
wpm) and most accurate (1.2% error rate), compared to hand printing (using a Microsoft's
handwriting recognition software included in the Microsoft Windows for Pen Computing 1.1)
which scored second on both speed (18.5 wpm) and accuracy (10.4% error rate). The other
two stroking methods, moving Pie Menu and Pie Pad received the lowest performance scores.
In a survey of 18 recognizers, eight developers quoted accuracy of 98-100% without
qualification. The others qualified claims with statements such as "writer dependent", "50% to
100% depending on application", "up to 100% based on training effects", or "100% if
characters written as prescribed" (Blickenstorfer, 1996). One reason it is difficult to quantifi
accuracy is that the hurnan element must be considered. As the survey noted, "our approach
15
was simply to ask what each developer claims the accuracy of their recognizer to be. In the
absence of a standard benchmark for recognition accuracy, this and our subjective expenence
with the products is d l we have to go on" (Blickenstorfer, 1996).
2.1.2. Word-Based Recognition
A word-based recognizer on the other hand, will fmt try to recognize each single character in a
word, make a match in its intemal library and come up with the closest suggestion for user
approval. This is in some sense similar to disambiguation as discussed earlier with keypad text
entry (section 1.2.3.1).
Handwriting recognition has advanced recently and today's handwriting recognition software is
much more satisfactory than products fiom the early 1990s. The 1st word on handwriting
recognition is that for a handwriting recognizer to achieve 100% accuracy is only illusionary
because even human beings are not able to recognize every handwritten text without any doubt
(Neisser and Weene, 1960). Hurnans are able to recognize about 96% of hand-printed single
characters, i.e., without contextual information (Frankish et al., 1995). There will aiways be an
obligation for the writer to write clearly. Generally recognition rates of 97% or higher are
acceptable by most w r s (LaLomia, 1994). Furthemore, there is always a tradeoff on speed
and accuracy with on-line recognizers.
2.2. Text Entry Using Soft Keyboards
In the world of desktop computing, the idea of using a non-QWERTY layout is a tired notion
that is of minor interest today. Even though altemate key layouts, such as Dvorak (Potosnak
1988), or chord keyboards (Noyes, 1983; Gopher et al., 1985; Gopher and Raij, 1988), exist
and some can support higher entry rates, substantial practice is required to become proficient in
their use. This, combined with the large installed base for QWERTY keyboards, has ensured
the continued role of QWERTY as the keyboard layout for desktop computing.
For soft keyboards, however, the arguments for using a QWERTY layout are diminished
because touch-typing ski11 will not necessarily transfer to "touch tapping". The motor ski11 of
two-handed eyes-fiee touch-typing is very different from the simple act of one-handed eyes-on
tapping with a stylus on a sofi keyboard. That is, if we compared two touch typists - one 25
wpm and one 75 wpm - in their ability to touch tap, it is not certain that a 75-wpm typist
would be façter, or substantially fàster, than would a 25-wpm typist.
Although it is not necessary to keep the antiquated, inefficient QWERTY layout, most people
still prefer a QWERTY layout for a soft keyboard. One reason is the familiarity with the
QWERTY layout on desktop keyboards. Another reason is that most designers saive to
duplicate a physical keyboard as closely as possible instead of optimizing a layout for the
screen (Blickenstorfer, 1997). Yet another reason is that most of the layout designs other than
QWERTY are either still at a research Ievel, or have not been tested for their usability and user
acceptance. Besides the keypads that have been introduced earlier, several commercial,
hypothetical and research-level layout designs will be discussed below.
17
MacKenzie et al. (1994) tested two soft keyboard layouts, QWERTY and ABC (a keyboard
with two rows of letters in alphabetical order and hence named), and found that Q WERTY was
faster (23 wpm) than ABC (1 3 wpm) but has a higher error rate (1.1 %) compared with that of
the ABC layout (0.6%).
Figure 6 A QWERTY layout
Figure 7 An ABC layout
Other research by MacKenzie, Soukoreff, and Zhang (1996) extended to six diflerent layouts.
In addition to the QWERTY and ABC layout, four other layouts were also tested. These
included a keypad style layout with disambiguation narned JzistType (Figure 8), a DVORAK
layout (Figure 9), a commercial design cailed FlTALY (Figure 10) and a telephone keypad
layout (Figure 11). On a telephone keypad, Ietters "Q", "Z", and Space are assigned to keys
"7", "9" and Y", respectively. This arrangement allowed w r s to enter the full English
alphabet. Disarnbiguation is also assurned for the telephone keypad layout.
Figure 8 Just Type ke yboard
Figure 9 A Dvorak layout showing alphabet oniy
Figure 10 A FitaZy layout showing alphabet oniy
Figure 1 1 A telephone keypad with letters Q, Z and Space
Again, QWERTY was found to be the fastest (21 wpm), followed by ABC (10.7 wpm). The
experiment, however, tested only the users' initial exposure to the layouts, except for
QWERTY. This is a major defkiency, because the experiment failed to show the potential of
the different layouts given sufficient practice.
Besides a fixed layout, researchers have also explored other possibilities. One of these was a
soft keyboard introduced as IBM/BellSouth's Simon - a combination of PDA and ce11 phone.
It was small and only displayed five or six lei-ters. If you picked one, the letters then changed
to the ones most likely to be used next in the English language (Blickenstorfer, 1997).
Although displaying a small set seemed to be a good idea in saving screen space, the usabiiity
of such keyboards must be tested. Factors such as visual search will add to the difficulty of
use.
Using a mode switch c m also reduce the size of the key set. This idea was borrowed from
physical keyboards where a Shift key switches between different assignments on the sarne
physical key. Mode switching c m be extended to different activities. This can also create
problems in usability. For example, with disambiguation-enabled keyboards or keypads, a user
must occasionally switch between tapping/entering text and selecting a word fiom a list of
choices. If the word is not in the list, the user hm to switch modes, such as "customize user
dic tionary ", "custornize List order", etc.
Another mode switch example is seen in some existing soi? keyboard designs. When a user
taps on the shiA key to switch between upper case and lower case, the key captions of the
entire keyboard are switched to reflect the new selection. The user is forced to switch between
searching among lower case and searching on a keyboard with al1 upper case captions. This
not only adds to difficulty in use but also to complexity in the software and lags due to
repainting the screen.
This is not a problem with physical keyboards for two reasons: (a) letters are al1 printed on the
key caps physically, and (b) most of the operations are eyes-fiee, especially with touch-typists.
Modes are necessary for reducing the size of a set. They add cognitive load upon users,
however. Unless necessary, over-use and over-expression of modes should be avoided.
It is obvious that any new technologies, even a new keyboard layout, must be leamed before a
user can be proficient. Two questions thus have &sen:
1. 1s it better than the existing technology? In other words, is it worth learning?
2. How difficult is it to learn, or how long will it take for an average user to become
proficient?
For soft keyboard designs, Soukoreff and MacKenzie (1995) presented a theoretical model for
the upper and lower bounds of typing speed using sofi keyboards. This was an attempt to
ariswer the fïrst question above. Using the model, MacKenzie el al. (1 997) predicted the lower
bounds and upper bounds of six different keyboard layouts using different text entry schemes.
The lower bound, or the enhy-level speed of a keyboard, represents the immediate usability of
a design. It predicts the performance of a novice user. The upper bound, on the other hand,
predicts the best performance an average user can possibly achieve. The two questions above.
therefore, can be re-addressed as how to establish the lower and upper bounds of a new
technology.
The theoretical and empirical models for calculating the predictions of lower and upper bond
speed of a sofi keyboard are discussed in detail in section 2.4.
While both the lower and upper bounds can be predicted, the lower bound is of less interest.
The upper bound represents the performance potential of a technology. The achievement of
that will be of primary interest in this thesis.
Whether a theoretical upper bound prediction is achievable and how long it will take before
users c m achieve this performance are two separate issues. The former will be determined by
the accuracy and sufficiency of models and the latter must be deterrnined empirically. In other
words, a specific learning mode1 based on experimentation must be established.
23. The Elusive "Crossover" Point 4
Ernpirical evaluation is the fmal "road test" of a new technology. In general when users are
involved, the new technology rnight start with lower performance than that of the current
technology. But because of the potentiai of new technology, the performance rnay catch up
and evennially exceed the performance of existing technology with prolonged use or training.
This is known as a crossover point of the two performance curves dong a time axis (Figure
12).
Crossover
b Time
Figure 12 The elusive crossover point
This crossover point rnay not be achievable in al1 cases. If, for example, it is not known
whether the new technology will eventually perform better than current technology, the "New"
curve might very well stay below the "Current" curve. A usability study might be terminated
before the two c w e s meet. In many cases, the problem is not knowing "when" the two curves
23
will cross at the planning stage of a usability study. This problem can be partially solved by
running a srna11 "pilot" study before the full user test to estirnate the position of the crossover
point.
McQueen et al. (1995) conducted a usability study on numeric entry using two entry methods
with a stylus. The crossover point of total entry time occurred at session 7 (20 minutes each)
of a total of 20 sessions.
Although with a prematurely terminated study, the crossover point can be reached by
mathematically extmpolating user performance data, the case will not be as convincing.
2.4. Models
As stated in Focus of this Work (section 1 . 3 , our goal was to design an optimal soft keyboard
that supports higher text entry speed than the 'status quo' Q WERTY layout. The models
descnbed below are al1 related to sofi keyboard design. and most of them will be used directly
in designing and predicting the upper text entry speed limit of an optimal keyboard.
2.4.1. The Linguistic Mode1 and Space Key Extension
In a paper by Maymer and Tresselt (1965) 20,000 words were sampled with a total of 87.201
letters. The work addressed many details on gathering a representative sample. Each letter
was examined for its frequency not only as a whole, but also for its different positions within
words.
Maymer and Tresselt's corpus provides a set of representative frequencies of alphabet usage in
"common English". These letter frequencies will be used in our prediction model.
Maymer and Tresselt's frequencies for letten in comrnon English, however, did not include
spaces. This was due to the linguist's focus on language. For text entry on cornputers,
however. a blank space is a character which must be inserted within text just as any other
character is inserted (Dix eî al., 1993). From the usen' perspective, however, this difference is
not visible. But for designing a soft keyboard, a space is the 27" character of the alphabet.
To modiQ the original character fiequency table, Soukoreff and MacKenzie (1995) added a
space for each word. The total nurnber of characters became 107,201. Dividing the total
number of characters by the total number of words in the sample, it results in about five letters
(including spaces) per word for the sample. This is usually considered the word size for
common English.
Another important piece of information in Maymer and Tresselt's work is the digraph
frequencies. A digraph is a pair of letters in a given order. "ON" is considered different fiom
"NO", for example. A digraph fiequency is the frequency of the digraph in the sample.
Again with the help of the letter positions in words provided by the original work. Soukoreff
and MacKenzie (1995) created a 27 x 27 digraph fiequency table.
2.4.2. Fitts' Law
Fitts (1954) described a mode1 for predicting the movement thne for rapid target selection
tasks. With a simple setup as illustmted in Figure 13 and Figure 14, Fitts found that the
movement time (MT, in seconds) is a fûnction of target width (ÇY) and amplitude (A). The
original Fitts formula is
MacKenzie (1989) found that the original formula bj
better fit of Fitts' experimental data:
LW = a + b x l o g , -+1 -(W 1
annon and Wea~ ler ( 1945) provided a
The logarithmic term of the formula is called the index of task difficulty ID, and carries the
unit "bits". Shannon's formula therefore, always gives a positive value (2 O) of ID with any
given A and W. The term 6 carries the units "seconds per bit". The reciprocal of b carries the
uni& "bits per second" or just "bps". This terrn is "throughput", as specified in ISO 9241 -92.
[SO 9241-9 is in cornmittee cirafi at the time of this writing.
26
Figure 13 The reciprocal tapping task on two targets with width (ÇY) and amplitude (A)
for establishing the Fitts' Iaw
*-.*/ Figure 14 Performing a Fitts' law tapping task
The study by MacKenzie et al. (1 99 1 ) measured the throughput value for pointing tasks using a
stylus as a cornputer's pointing device. The value was found to be 4.9 bits per second. The
intercept value (a) in the formula varies with experimental data, but is close to zero.
Theoretically it should be exactly zero, meaning that a task with zero index of difficulty would
take no time to complete.
Work by Epps (1986), Kerr and Langolf (1977), MacKenzie et al. (1991) and othen has
demonstrated that Fitts' law is applicable to the movement of a stylus as an input device. By
extension, Fitts' iaw is applicable when stylus movements are between 'keys' on a QWERTY
keyboard simulated on a pen-based computer's display (Soukoreff and MacKenzie, 1995).
MacKenziels formula will be used throughout this thesis work for calculating the movement
tirne of this type of tasks:
2.4.3. Visual Scan Time Mode1
Hick and Hyrnan's visual scan time prediction mode1 (Hick, 1952 and Hyman, 1953) has been
widely accepted to calculate the time (also called Reaction Time, RT, in seconds) required for
finding an item among N items:
RT = 0.2 x log, N (4)
where N is the total number of items to search fiom and the dope 0.2 has a unit of seconds/bit.
For example, by substituting N with 26 we get the visual scan time for searching a letter among
26 alphabets:
This model, however, is sensitive to the dope of the equation. In Welford's extensive review
of choice reaction time studies (Welford, 1968: 60- 104), slopes Vary fiom about 1 60 mshit to
about 320 rnslbit. The RT calculated with Welford's slopes therefore, range fiom 761 ms to
1.52 seconds. respectively.
The choice reaction time model describes the user behavior on an unfamiliar set of objects,
such as a new keyboard layout. With an expert user, total familiarity is assumed. Therefore,
for experts the dominating time required is to move fiom key to key without any searching.
Hick-Hyman's model, therefore, will o d y be used to predict the novice behavior, or. in other
words, the lower bound speed of a keyboard.
However. it should be noted that this does not mean the "slowest entry speed" as opposed to
the upper bound. which means the "fastest speed". The lower bound should be interpreted as
the entry-level performance of a typical novice user.
2.4.4. Saccadic Eye Movement
The nature of hurnan eye movement known as saccadic movement also plays a role. Simply
speaking, when a person reads or looks at a picture or even stares at a spot on the wall, his or
her eyes make multiple, rapid, jumpy movements constantly. These movements are called
saccades. A penon with normal vision usually makes about two saccades per second. Most of
these eye movements range in magnitude from four minutes of arc to 15 degrees (Bahill and
Stark, 1979). If the area is too small or the items are too close together, it will be difficult to
keep the eyes still (try holding a book at an arm length and count the number of lines of the
ISBN barcode). To scan a larger area, head movement will be involved and it will take longer
time and more muscles will be involved to cornplete the coverage.
2.4.5. Soukoreff and MacKenziels Optimal Angle Mode1
Soukoreff and MacKenzie ( 1995) described a model to predict the theoretical upper speed lirnit
by stylus tapping on keyboards. This model States that if there is a key that has width or height
longer than one unit then an expert user would approach that key at the angle that equals the
outgoing angle toward the next letter. This is depicted in Figure 15. The original mode1 was
used for a QWERTY keyboard where one space key is laid out at the bottorn of the keyboard
and is stretched across the entire length. But this mode1 c m be applied to any keyboard
designs where such situations &se.
Figure 15 optimal angle model: expert behavior
An approximation of the optimal angle mode1 (as explained in Figure 16) is to simpli@ the hit
points on the long key(s) only at some discrete points. For example, to hit a square key, this
mode1 would assume that expert user would only hit the middle point. For a 2-unit long key
the expert user would hit either the point 0.5 unit from lefi edge or the point 0.5 unit fiom the
right edge.
Key 3
Figure 16 An approximation of optimal angle model
This mode1 will greatly simpliQ the calculation while having minimal impact on calculation
accuracy. A detailed discussion is found in the next chapter where we present the design of the
optimal soft keyboard.
2.4.6. Key Repeat Time
Common English also has cases where double letters such as "tt" or "Ii" are used.
Theoreticaily, entering double letters requïres no laterai pen movement. Therefore this key
repeat time cannot be modeled by Fitts' law. For an expert user, there will be no search time
required either. With stylus tapping on a soft keyboard, only the key repeat time will account
for the text entry time. A simple empincal test (Soukoreff and MacKenzie, 1995) showed that
when tapping with a stylus on a sofi keyboard displayed on a Wacom tablet, the key repeat
time was 0.153 seconds. In the test users were asked to tap on a Wacom tablet for one minute
and the repeat tirne was averaged. In the real world, double letters only require two taps. The
speed would degrade appreciably toward the end of one minute of tapping. The repeat time
wouid be best represented by the first few seconds of tapping. Another brute-force test was
conducted with 5 subjects to confirm the previous empincal result. In the second test. the
subjects tapped on a soft keyboard 25 times before the average was taken. This short sequence
only took 3 to 4 seconds. Ten such sequences were collected for each subject. The result was
slightiy shorter key repeat time: about 0.127 seconds. However. the importance of this should
not be exaggerated as double tapping only concributes a small portion to text entry time (about
2% of total digraphs found in cornmon English. Figure 17).
Ch-Ch Ch4 pace S pace-C h Duble
Digraphs
Figure 17 Distribution of digraphs
We have described al1 the models required for designing and predicting the upper bound entry
speed for a soft keyboard. The next chapter will describe the design of an optimal soft
keyboard for pen-based text entry.
Chapter 3. The Design of Soft Keyboards
In this chapter we will address design issues for an optimal sofi keyboard. We begin with a
discussion of issues pertinent to novices and then discuss issues important for the design of an
optimal soft keyboard - one that can support the highest possible text entry rates for experts.
3.1. Issues Related to Soft Keyboard Design
3.1.1. Visual Scan: Fixed vs. Random
Visual scan time is a dorninating factor for novice performance. On physicai devices such as a
keyboard or a design on paper, a problem in testing visual scan time is how to maintain the
'hovice status" of the user. Since humans learn fiorn experience and observation, novice status
diminishes quickly once a user starts to use a keyboard. Each time the user searches for a key,
adjacent keys are observed and information about these keys is stored in memory (key captions
and positions). As use progresses, the user shifts fiom being a novice to being an intermediate
user and, finally, toward being an expert.
To validate Hick-Hyman's mode1 and rneasure visual scan time on a sofl keyboard, finding a
way to maintain the "novice status" becomes interesting.
To capture the true novice expenence, the user m u t remain at the novice level while
performance data are collected. The unfamiiiarity of key assignments can be achieved through
software by randomly reassigning letters to keys after each tap.
3.1.2. Ski11 Transfer from Previous Experience
One expectation is that people who can touch-type on a QWERTY keyboard might benefit
from the familiar layout even when the keyboard size is reduced to allow only one-finger input.
With touch typing, people do not think of the location of the keys. Instead, they translate text
directly to their fmgea using motor memory. Experimental psychologists cal1 this a "motor
program" (Heuer et al.. 1985). This is also known as "eyes fiee" operation on a keyboard.
Will a touch typist actualiy benefit fiom the familiar QWERTY layout in a one-finger tapping
task? This is a question that we sought to answer.
Being farniliar with a keyboard layout fiom previous experience may have effect on initiai
performance. Even partial familiarity can benefit the initial performance. For example. when
tapping on a QWERTY keyboard, knowing generaily which half of the keyboard the letter 'T"
is in can lead to a quick reduction of the search scope - from the full keyboard to half the
keyboard. This translates into shorter search time (refer to section 2.4.3).
When searching dominates the performance, a user is still in a novice or a near novice state.
As use progresses, visual search will become less necessary. Other barriers will become more
and more prominent in affecting the performance. Worst of al1 is the time it takes to move
from one key to the next. Whether this is just speculation or reality remains to be answered.
35
3.13, Sue: Large vs. Smaii
Mobile systems today are genuig smaller and smaller, for example, the 3Com P a l d i l o i is
only 8 x 12 x 2.5 cm. The liquid crystai display area is only 5.5 x 5.5 cm. Touch screen
interfaces are even found on wristwatches such as Casio's VDB-MI, where the bottom 113 of
the screen displays context sensitive menu buttons that a user can operate with a finger.
Implernenting a sofi keyboard on such a small display area requires careful design, not to
mention the physicai dificulty when operating such a keyboard with one's finger. An
interesting question arises: Does keyboard size (within lirnits) affect user performance?
Fitts' law States that as long as the ratio of M W remains constant, the movement tirne will be
the sarne (refer to section 2.4.2). This oniy holds within lirnits, however. In practice, other
factors can affect rnovement time. Different sizes of keyboards may require different muscle
groups that in turn have different throughputs (Balakrishnan and MacKenzie, 1997). Saccadic
eye movement (see section 2.4.4) may also play a role in working with different sizes of
keyboards.
3.1.4. Learning Mode1
"Practice makes perfect".
Leaming effects have been well studied over the years. The learning curve which gives entry
time as a function of amount of practice can be approximated as follows (Card et al., 1978, De
Jong, 1957):
where T! is time required to complete a certain task at the first time; TV is the predicted time for
the same task on the triai; N is the nurnber of triais; and a is an empincally detennined
constant.
3.1.5. Possibilities for Alternatives
To summarize, we have the following motivations to designing a soft keyboard with supenor
performance to Q WERTY:
a) QWERTY is not optimized for either touch typing or pen-based text input. If we can
establish that ski11 transfer fiom touch typing to one-fingered tapping is not significant, then we
can project that keeping the Q WERTY layout is not necessary.
b) For experts, the most important factor that dominates the speed of pen-based input with a
soft keyboard is the cave1 t h e arnong keys. If we can reduce the pen's travel distance fiom
key to key, we can increase text entry speed.
c) Soft keyboard optimization is language specific. We have learned a great deal about the
single letter and digraph fiequencies of cornmon English.
d) The theoretical and empincal models that have been discussed are suficient for keyboard
design. Although there might be other models related to keyboard performance, they are
considered trivial (e.g. the effect of order of letter arrangement on user cognitive load, see
more discussions in Section 3.3.1).
With the above tools in hand, we are in a position to design a soft keyboard that will perform
better than Q WERTY.
Once again it should be noted that a new keyboard design might have a lower performance
initiaily. If our theoreticai models are correct we should expect that performance will exceed
that of QWERTY, given sufficient practice.
We should also note that no matter how confident and optimistic we are about a design,
empiricai testing with users is essential. A learning model is the final tool for our design.
3.2. Keyboard Optimization
3.2.1. Modeis
Here we bnefly list the rnodels used to design our keyboard; that is, to calculate the theoretical
upper text entry speed and to empirically test the design. Details on these models can be
found in section 2.4.
Linguistic model: single letter and digraph frequencies, including the space character, for
cornrnon English. The five-characterdword definition denved from this work will be used
in the calculation.
Fitts' law: for predicting the movement time fiom key to key. The Shannon I MacKenzie
formula will be used (Equation 3 on p. 28). The throughput value for a stylus is 4.9
bits/sec. For long keys, the width is set to the "smaller of' the two dimensions (MacKenzie
& Buxton, l992), which equals the height (refer to Figure 16 on p. 3 1 ).
Optimal angle model, especially the approximation of the Optimal angle model.
Key repeat the. We will use the value obtained by Soukoreff and MacKenzie (1995) of
153 ms per tap.
Learning model: to predict and extrapolate behavior based on an empirical test.
3.2.2. Extra Design Concerns
This section presents extra design concerns. These concems are specific to the design of sofi
keyboards:
The keyboard should have no dead space. Dead space is the space between keys where no
action is designated. Dead space is not necessary and will only increase the key-to-key
movement time.
The entire shape of the keyboard should be rectangular to fil1 the shape of a typicai window
where the soft keyboard is displayed.
There is no constra.int on the shape of keys. For the sake of simplicity, we will make al1 the
keys rectangular. There is no limit on the size of keys either. A keyboard can have keys of
any size as long as the entire keyboard meets the two constraints above.
3.23. Current Designs and Speed Predictions
In short, the calculation of the upper bound entry rate is to sum al1 the digraph distances
weighted by the probability of each digraph. This will be the mean distance (norrnalized to
have the same units as the key height) a pen needs to travel for entering comrnon English.
By substituting A with the mean distance and W with the height of key (square key has both
width and height of one unit) in MacKenzie's formula (equation 3) the mean movement time is
calculated. The rnean movement time (secondcharacter) is then converted into the entry speed
(words/minute) using equation 7.
I X
60 (shin) Speed (wpm) = -
MT (s) 5 (chadword)
When the design contains keys with length longer than one unit, the calculation of movement
tirne is dealt separately. The approximation of the optimal angle model is used to calculate the
shortest path of each digraph depending on the characters that follow the long key.
Note that the approximation only calculates near-but-less-than optimal performance.
Therefore, the prediction slightly 'under estimates' the actual upper bound speed.
The prediction for double letters uses the key repeat time model (Section 2.4.6). Although key
repeat time is applied to al1 the double letter digraphs, not al1 appear in cornrnon English.
Those that are not in common English will have zero digraph probability and will not
contribute to the movement time calculation of the keyboard.
Using the models just descnbed, the upper limit of entry speed of a QWERTY sofi keyboard
(Figure 6, p. 18) was calculated to be 43.2 wpm (with a slightly sub-optimal approximation for
char-space-char movement). Table 1 lists the upper bound text entry speed predictions of
several commercial and experimental soft keyboards, including the designs that will be
discussed in the following sections.
Table 1 Upper Bound Entry Speed Predictions of Several Keyboards
Key board Speed (Vm) Description
Q WERTY 43 -2
Dvorak 38.7
Fitaiy 55.9
JustType 44.2
Physical keyboard layout
Physical keyboard layout
Soft keyboard. No disambiguation
Sofi keyboard. With Disarnbiguation
Phone pad 22.6 Using 2-key scheme, see sec. 1.2.3.1
OPTI 58.2 Soft keyboard. Designed at HCI Lab
5 x 6 OPTI 59.4 Soft keyboard. Designed at HCI Lab
Note: The 5 x 6 OPTI is presented in Addendum in Chapter 6.
3.3. The Optimal Soft Keyboard - OPTI
3.3.1. Design Rationale
Several steps are involved in designing a soft keyboard.
When examining the single letter fiequencies (Appendix A), the first five letters - space, E, T,
A and H - make up 50% of the cumulative frequencies. The next 11 letîers rnake up another
40% and the last 1 1 account for only 10Y0 of the total. The design started by considering only
the more fiequent letters.
The single-letter fiequencies simply indicate that some letters are more fiequently used than
others. They reveal nothing about relationships between letters. The second step was to
consider digraphs. First, we picked the top ten lettee (fiom E to L) fkom the single-letter
fiequency lin. For each letter we listed the top 10 digraphs starting and ending with the letter.
The result is listed in Appendix B. For example, the digraph " H E accounts for 3.155
occurrences whereas "EH" did not make the list. The former done makes it desirable to put
"H" and " E as close together as possible on a keyboard.
It makes sense to have the letter sequence "HE" to appear in the design radier than "EH. The
ordering may play some role with users' cognitive behavior: A sequence of "THE" appearing
on a keyboard would feel more naturai than a "HTE" or "EHT". It's speculated that even with
expert users, naturaily ordered sequences make it easier and more cornfortable to use.
However, this rationale did not have a strong influence on the design. We expected that with
expert users, familiarity dimuiishes the importance of ordering.
Another rationale was to keep the lettee of similar shapes separate from each other to avoid
confusion during a quick scan. Examples include letiers "C" and "G". "Q" and "O", " M and
"W', and "F" and "P". Again, this is not rigorously accounted for as it is felt to have only a
minor effect on an expert's behavior.
3.3.2. The OPTI Design
Assembling the keyboard takes some imagination. Whereas testing the design can be done
mathematically. The final design of a new keyboard is shown in Figure 1 8.
Caiculation of the upper bound tapping speed is done with a specially designed spreadsheet
that ailows quick visual inspection and modification of the design.
Figure 18 The OPTI Keyboard for text entry on mobile systems
The upper limit of entry speed of the OPTI soft keyboard was calculated to be 58.2 wpm (with
a similar sub-optimal approximation). This is about 35% higher than the predicted speed of
43.2 wpm given earlier for the Q WERTY layout (Section 3.2.3).
Chapter 4. Experiment 1
Experiment 1 was designed to gain a better understanding of following issues in sofi keyboard
design:
1. Will Hick-Hyrnan's search Ume mode1 apply to novice behavior with a soft keyboard?
2. To what extent does the ski11 of touch typing transfer to stylus tapping?
3. Will the size of the keyboard affect the usability of a sofi keyboard?
4.1. Method
4.1.1. Participants
Twelve students and staff fiom the University of Guelph were paid to participate in this
experiment. Six were male and six were female, al1 were nght handed. Some were students in
computer science while others were in other life science disciplines. Ali had expenence using
cornputers on a daily basis. They were recruited fiom those who participated in our previous
experiments or fiom those who responded to our posting to the departmental graduate students
and staff ernail lists. No screening was done on the participants.
4.1.2. Apparatus
The experiment software was developed with Borland C++ 4.0 and ObjectWindows Library
(OWL 2.0). The host system was a Packard Bell 486DX-50 PC m i n g Microsofi Windows
44
for Pen ComputingTM 1.1. A Wacom PL-IOOV tablet was atîached to the system. The Wacom
tablet is both an LCD display and a digitizer. It has a 12-inch B & W backlit LCD display
panel with 640 by 480 pixel resolution. Using the combination of the tablet and host computer
enabled the experiment to run without system lag and allowed user entry to also appear on a
regula. VGA monitor. A special stylus used on the tablet functions as a pointing device.
Instead of pressing and releasing a mouse button to 'click'. users tap-and-lift the stylus on the
tablet surface to complete the same action. The stylus and tablet use the standard WindowsM
mouse driver.
The software developed for the experiment 1 was simply code-named V1 .O. It displays two
different sized keyboards for the text entry task. Each size also has two different layouts. One
is the standard QWERTY layout, showing the alphabet only. Another layout generates
randomized key assignments each t h e a key is tapped. The keys on the large keyboard are 28
x 28 pixels each, or 10 x 10 mm measured on the screen. The keys on the srnail keyboard are
18 x 18 pixels each, or 6.4 x 6.4 mm on screen. The size and layout can be selected as
experiment conditions from the control panel of the software (Figure 19). Further description
of the expenment software are found in Appendix C.
I
l nput device - Soft keyboard
(f Stglus I tablet Large (28x28 pixel]
r Mouse r S maIl (1 8x1 8 pixel]
r Other P Key radomization
@ Practice r Run Experiment
/ Number of Sentences n
. - - -
Figure 19 Control panel of V 1 -0
The task implemented in software was to memorize a short phrase of text and enter it with each
of the soft keyboards (Figure 20, Figure 21, Figure 22 and Figure 23)). This approach
simulates a text creation task in that the user knows exactly what to enter. This is in contrast to
text copy tasks which require the user's focus of attention to continually switch between the
source text and the keyboard. Figure 20 shows a session with the large fixed layout. A sample
phrase is displayed on the first line and user input is on the second line, shown as unfinished.
Figure 21 shows a session with the randomized keyboard layout. Note that keys are
randomized when a key is tapped. The space key is not randomized. Figure 22 and Figure 23
show two sessions with the small keyboard.
The scale of keyboards in the figures is 1 : 1.
THE CAPITAL OF OUR NATION THE CAPITAL OF OUA
Figure 20 A session with the large fmed layout
NO EXCHANGE WITHOUT A BILL
NO EXCHANGE WA
Figure 2 1 A session with the large randomized layout
DASHIHG THROUGH THE SHOW DASHING THROUGH A
Figure 22 A session with the small fixed layout
INFORMATION SUPER HIGHWAY I N F O m T I O N SUPEA
Figure 23 A session with the srnail randomized layout
4.1.3. Design
The experiment was a 2 x 2 within-subjects factorial design. The factors and levels of the
experiment were as follows:
Size (small, large)
Layout { fixed, random)
There were four different combinations of layout and size. Each participant finished four
blocks, one for each condition. Each block contained 10 phrases (Appendix D). The order of
the conditions was balanced to reduce interactions between factors.
4.1.4. Data Collection
For each key a user tapped on the keyboard, the following information was collected in the
data files.
Given character;
Given character's position (ID on the keyboard);
User entered character;
User entered character's position;
Time elapsed between characters (in milliseconds);
Error. (1 if the user character was different From the given character; O if the user
character was correct.)
The first Ietter in each phrase was excluded from the timing measurement and from the
calculation of the correlation of the Ietter fkequencies in the test phrases with common English.
This is because it is irrelevant to take the entry tirne after the phrase is displayed until the
participant taps the first letter.
Ten short sample phrases were used in the experiment. For each block, ten phrases were
presented to the participants in randorn-without-replacement order. The sarnple texts set is
listed in Appendix D.
The correlation coefficient for the sample text phrases against common English calculated at
letter fiequency level, as just described, was .95 162. The implication of this high correlation is
that the sample text is highly representative of common Engiish.
4.1.5. Procedure
The task was to copy the sample phrases using each of the soft keyboards.
The Wacom tablet was positioned on a separate desktop away from the experimenter's host
machine. The ceiling lights were turned off to remove the glare on the LCD panel and a table
lamp was used to provide ambient lighting for the lab.
Prior to the fornial test, al1 participants were given a typing test. The program Typing Tutor N
on an Apple Macinfosh cornputer was used to collect the data. The text used for the typing test
was modified fiom a longer story to fit on one screen and to show alphabetic characters with
minimum punctuation (See Appendix E).
After the typing test, participants were given written instructions explaining the task and
device. They were specifically asked to focus on both speed and accuracy. They were also
asked to ignore rnistakes and carry on with the rest of the phrase when a mistake was made.
The purpose of the experiment was not explained.
They were then given the tablet and the stylus. The tablet was tilted off the desk to provide a
good viewing angle (about 15"). It was also adjusted to have appropnate contrast and
bnghtness.
Prior to completing the session, participants were given a practice session with one phrase for
each condition.
In the formal test session, one phrase was displayed at a time. With the fixed keyboard layout
the participants only needed to copy the text. The visual search time was presumed minimal
since the layout was fixed as per a QWERTY keyboard. With the randomized layout.
however, they had to visually search for the letter on the keyboard afier each letter was
entered. When an error was made, a "click" was played through speakers.
The typing test was given again at the end of the last session.
4.2. Results and Discussion
43.1. Visual Search Time on a Soft Keyboard
The movement time Ton a random keyboard has two components
where MTradom is the average movement time on a randomized keyboard and RT is the visual
search tirne.
Ideally if we know MTrDndO, then what remains is RT.
In fact, we observed that the movement between keys was confounded and movernent time
could not be modeled by Fitts' law. This was a disappointing result; however, it does provide
additional insight into novice behavior on sofi keyboards. During a visual search when using
the randomized keyboard layout, we observed participants to consistently lifi the pen after each
tap. They held it in mid-air so that the hand was not in the way, then they reached back to the
tablet when the next letter was found. Some participants simply swung the pen over the
keyboard to follow the focal movement of eyes and tapped down as soon as the next letter was
spotted. In either case, this is not a simple target selection. Therefore the totai movement time
has three components:
Where MTcom+ is the movement tirne for the confounded pen movement a described above.
As long as we do not have a mode1 for MT,o~pod we cannot predict the RT. However,
MTrdo,,, can be caiculated mathematically (Appendix H).
4.2.2. S M Transfer from Touch Typing to Stylus Tapping
The typing speeds of our participants ranged fkom 19 to 74 wpm. n i e tapping speed ranged
from 15.4 to 25.1 wpm for the large keyboard and from 13.1 to 26.0 wpm for the small
keyboard, respectively. The relationship between the typing speed and the tapping speed for
the four keyboards is shown in Table 2.
There was little correlation between touch typing speed and stylus tapping speed on the soft
keyboard - r = -416 for the large keyboard and r = 323 for the small keyboard, respectively.
The simple interpretation is that a fast typist does not necessady tap faster on a sofi keyboard
than does a slow typist. Typing ski11 was not significantly transferred to tapping on the soft
keyboard. Although farniliarity does help, these two tasks are fundarnentally different. both in
mental mode1 and in the motor skills involved. This is support for our belief that more
effective soft keyboard design is warranted.
Table 2 Typing and tapping speeds of the twelve participants
Participant T ~ P P ~ ( w m )
Fixed L Fixed S Rand L Rand S
Average entry speed 21.17 19.97 5.33 5 -52
Standard Deviation 14.77 2.89 3.79 0.93 0.92
Correlation Coeficient (r) O. 116 O. 523 O. 073 0.081
Note: Correlation Coefficients are with typing speed (2nd col.)
4.2.3. Text Entry Speed
The text entry speed in this experirnent refiects only about the first 15 to 20 minutes o f use.
Our focus in this experiment was not on prolonged user leaming and proficiency.
The andysis of variance of text entry speed (Table 3) showed that There was a significant main
effect of keyboard layout (F i ,~1 = 314.53, p < -0001). There was no effect for keyboard size
(FI,, 1 = 2.626, p > .05). There was a significant interaction eflect between keyboard layout and
size (FiVI 1 = 7.18, p < -05).
54
Table 3 ANOVA: Entry Speed for Experiment 1
Source df SM MS F P
Participant 11 150.700 13.700
Layout 1 2752.15 2752.15 3 14.53 .O00 1
Layout * Participant 11 96.249 8.750
S ize 1 3.131 3.13 1 2.626 -1334
Size * Participant I I 13.1 18 1.193
Layout * Size 1 5.727 5.727 7.182 .O2 14
Layout * Size * Participant 11 8.772 .797
The large fixed keyboard was slightly faster (21.17 wpm) than the small fixed keyboard (1 9.97
wpm). When the keyboard was randomized, however, the smaller keyboard showed a faster
entry speed (5 .j2 wpm) than the larger one (5.34 wpm) (Figure 24).
Large Small
QWERTY Randorn
Digraphs
Figure 24 Text entry speed
The shorter search time for the srnall keyboard, as reflected by the difference in visual field
width, might have played a role. The large keyboard is about 50% wider and taller. Recall
&om section 2.4.4, that the maximum saccade is about 15 degrees. The distance fiom
participants' eyes to the tablet was about 30 cm; and the width of the small keyboard was
measured at 6.3 cm. This results in a field width of about 12 degrees. Using the same
assumption with the width of the large keyboard rneasured at 9.5 cm, the field width was about
17 degrees. This means on the average, more saccadic eye movement might have involved in
searchuig with large keyboard.
Although we did not enforce a precise head position, this calculation suggests that more
saccadic movement might be necessary for large keyboards. In other words, participants could
not get the full vision of the entire keyboard without having to move their eyes more ofien on
the large keyboards. This extra eye movement may be one of the factors that increased the
average search time. In the worst case, participants may have to move their heads to complete
the search.
With the large keyboard, the users' hands tended to block a relatively large area of the
keyboard. Moving their hands away to regain the full view took longer as well. Al1 these
factors would contribute to the slower performance with large, randomized keyboard.
Participants were not instnicted to anchor any body part. They tended to anchor wrists on the
rim of the tablet when using fixed keyboards. With the random keyboard. they would anchor
their elbows on the desk.
4.2.4. Error Rates
The anaiysis of variance (Table 4) revealed that there was a significant difference in entry error
rates between the two keyboard layouts (F1.11 = 17.87, p < .005). The fixed layout had an error
rate of 2.96% and the random layout had an error rate of only 0.73%.
Table 4 ANOVA: Error Rates for Experiment 1
Source df SM MS F P
Participant 11 39.219 3.565
Layout 1 59.185 59.185 17.870 .O0 14
Layout * Participant 1 1 36.432 3.312
Size 1 6.149 6.149 3.415 .O9 16
Size * Participant 11 19.805 1.800
Layout * Size 1 6.149 6.1 49 5.527 .O384
Layout * Size * Participant I I 12.238 1.1 13
The difference in error rate for the random layout we attribute to participants being more
cautious about making erron with the random layout than with the fixed layout. The
confidence in being familiar with the fixed QWERTY layout caused participants to be more
relaxed on errors.
4.3. The Next Step
From this experiment we have gained an understanding of the fundamental issues in using a
soft keyboard. We now move on to the n e a experiment to Mly test the usability of our
optimal keyboard for pen-based text input.
Chapter 5. Experiment 2
The second experiment was a longitudinal design to explore learning effects with the O P n
keyboard. We also sought to qualitatively assess its usability and acceptance, and this requires
an evaluation wîth users. The effects of the factors described in the design of the OPTI can
o d y be established through an acnial usability test.
5.1. Method
5.1.1. Participants
Five University computer science students participated in the experiment. Four were male and
one was female. Al1 were right handed and used desktop cornputers on a regular basis. None
had regular expenence on using a pen-based computer. They were recruited from a pool of
subjects who participated in other non-related experiments. Since we are testing a keyboard
specifically designed for English, we picked oniy those whose first language was English. Al1
were well informed about the time cornmitment required for the experiment.
5.1.2. Apparatus
The expenment software was developed with Borland C++ 4.0 and ObjectWindows Library
(OWL 2.0). The host system was a Packard Bell 486DX-50 PC running Microsoft Windows
for Pen Computing 1.1. A Wacom PL-I OU V tablet was attached to the system. These were the
same as in Experirnent 1.
The software developed for the experiment hvo was code-named V2.0. V2.0 displays two
different keyboard layouts for a text enûy task. One is the standard QWERTY Iayout showing
.the alphabet only. The other is the "OPTI", as s h o w earlier in Figure 18. The control panel is
shown in Figure 26. Further descriptions of the experiment software are found in Appendix G.
The experiment was conducted in the HCI Lab at the University of Guelph's Computing and
Information Science Department. To minimize interference fiom any other source the lab was
completely booked for the experiment. The entire experiment took about four weeks including
Saturdays. A speciai web site was created for information updates and scheduling.
-Run Mode F Practiçe r Run Expehent
Input device
6 S tylui / tablet
r Mouse
Keyboard Layout - @ QWERTY
r OPTl
r Correlation 1 Block Size: 0 50
Figure 25 Control panel of V2.0
5.13. Design
The experirnent had 2 x 20 within-subjects factoriai design. The two factors were:
Keyboard layout (QWERTY. OPTI)
Session (20 sessions).
Each session was about 45 minutes long and was divided into two 20-22 minute periods. One
of the two layout conditions was assigned in each half-session period in aitemating order from
session to session. The order of the conditions was balanced between participants to reduce
interactions.
Each haif-session contained severai blocks of triais. The number of blocks for each half-
session penod was controlled such that as many blocks as possible were collected within the
allotted time. Therefore. in the early sessions, fewer blocks (5 to 6) were administered than in
later sessions (9 to 1 1). A 5 minute recess was allowed between the two half-sessions.
Each block contained 10 phrases. These 10 phrases were randomly selected from a source file
of 70 phrases. Phrases were not repeated within blocks but repeats were allowed from block to
block.
The sarnple phrase set was tested for its correlation against the common English. The result
was r = .9845 for single-letter correlation and r = .9418 for digraph correlation. The complete
list of the phrase set is shown in Appendix F.
Each participant was scheduled to cornpiete 20 sessions for the experirnent. Sessions were
scheduled Mondays through Saturdays. They were separated by at least two houn and no
more than two days. This was to sirnulate "regdar use" of the system while trying to avoid
fatigue and to fit participants' reguiar daily schedules.
This is a longitudinal experiment aiming at practicing participants toward expert performance
on the OPTi design. Data collection was designed so that as many details as possible about
user input were collected.
5.1.4. Data Collection
For each key a user tapped on the keyboard, the following information was collected in the
data files.
Given character;
Given character's position (ID on the keyboard);
User entered character;
User entered character's position. The four space keys were identified separately;
Time elapsed between characters (in milliseconds);
Error (1 if the user character was different from the given character; O if user character
was correct).
5.1.5. Procedure
Each participant was given a typing test prior to the fmt session. A program Typing Tutor N
on an Apple Macintosh computer ninning System 7 was used. The text used for typing test was
the same as for Experiment 1 (Appendix E).
Each participant was then given written instructions explaining the task and the goal of the
experiment. They were asked specificaily to aim for both entry speed and accuracy. The
instructions also stated that if they made more than IO% errors with a sentence (about 3
mistakes) they should slow down on the next sentence to increase accuracy.
As designed, the length of each half-session period was controlled with a tirner. Once started
the software was self-administered. The entire session was monitored on a separate CRT
connected to the system.
When participants had to skip a session and the next scheduled session was more than two
days, then the data fiom the upcoming session was not used. Instead, the participant was asked
to use that session as a 'warm-up' session and a new make up session was scheduled.
They were then given the tablet and the stylus. The tablet was tilted off the desk to provide a
good viewing angle (about 25'). It was also adjusted to have appropriate contrast and
brightness. The overhead lights were tumed off to reduce the glare on the tablet's display
panel. The height of the desk was 26 inches, a standard height for typing. The desktop can be
raised by about 2 inches to allow for different body sizes of participants.
The participants were asked to copy each short phrase by tapping on the sofi keyboard. A sofi
audio feedback "tick" was played to the users through speakers. When an error was made, a
louder "click", which was rnuch more prominent, was played. The participants were asked to
ignore errors and carry on with the next correct Ietter pointed at by the cursor (Figure 26
and Figure 27).
A typing test was given again when participants ffished the 20" session.
A plot chart was set up during the experiment to keep the participants motivated. Performance
expectations were not explained, however. Instead, they were constantly reminded to do their
best on both layouts.
1 NON PROFIT ORGAHI ZATION
NON PROFIT ORGANIA
Figure 26 A session with OPTI layout in Experiment 2
VIDE0 C-RA OITH ZOOM ïENS
VIDE0 CAMERA BITH A
Figure 27 A session with QWERTY layout in Experiment 2
5.2. Results and Discussion
5.2.1. Text Entry Speed - The Learning Curve
The most interesting and exciting result of this expenment was the learning curves (Figure 28).
The analysis of variance of text entry speed (Table 5) showed that there was no main effect for
keyboard (Fi,., = 0.60, p > .OS). There was a significant effect of session (F19.76 = 89.2. p <
-0001) and a significant keyboard by session interaction (F19,76 = 34.3, p < ,0001).
OPTI had a poor performance at the beginning, averaging 17 wpm, while QWERTY averaged
about 28 wpm. Through the extended use of OPTI, OPTI keyboard eventuaily out-performed
QWERTY significantly (Figure 28 a). The crossover occurred at the tenth session. This is just
under four hours of practice. As the experiment progressed OPTI exhibited better and better
65
performance in entry speed while QWERTY tended to level off. The t ea entry speed for OPTI
reached nearly 45 wpm by the 2oLh session and the performance of QWERTY reached about 40
wpm-
OPTl Q W E R N
- OPTI ------ QWERTY
Session
Figure 28 Performance over sessions. (a) Entry speed (b) Learning curves and
extrapolations to 50' session
Table 5 ANOVA: Entry Speed for Experiment 2
Source df SS MS F P
Participant 4 2465 -967 6 16.492
Key board 1 44.180 44.180 -600 -48 18
Keyboard * participant 4 294.476 73.619
Session 19 6204.993 326.579 89.205 .O00 1
Session * participant 76 278.235 3.66 1
Keyboard * session 19 935.522 49.23 8 34.3 15 .O00 1
Keyboard * session * part. 76 109.050 1.435
When we plotted two leaming cuves, the squared correlation coefficients were r' = 0.997 for
OPTI and r2 = 0.980 for QWERTY (see Section 3.1.4 for more about learning model). This
impiies that the fitted learning models provide a very good prediction of user behaviour. In
both cases over 98% of the variance is accounted for in the modek.
Considering that this longitudinal study lasted only 20 sessions. Most of the participants had
not become "experts" on the OPTI layout by merely ten hours of use. So, we mathematicaily
extended the learning curves for another thirty sessions to project their performance with an
even longer practice. The extrapolation of QWERTY performance stopped at 44.8 wprn and
OPTI would reach to 60.7 wpm. These two values are close matches with our theoretical
upper bound predictions: 43.2 wprn for Q WERTY and 58.4 wprn for OPTI (Figure 28 b).
5.2.2. Error Rates
An error was recorded when the user-entered character was different from the given character.
It couid occur under different circurnstances, and the technique of analyzing these errors can
affect the way we explain user behavior.
Figure 29 and Figure 30 show the error rates of the two keyboards over sessions. Two
different types of errors are discussed below.
5.2.2.1. Cumulative Errors
The cumulative errors were collected each time an entry error occurred. This is the "normal"
to measure errors. It provides a generic description of how accurate a device is in perfonning
given task. It does not attend to how an error occurred. The cumulative error rates ranged
fkom 2.07% for OPTI and 3.21% for QWERTY on the first session to 4.18% for OPTi and
4.84% for QwERTY on the 2 0 ~ session.
An analysis of variance (Table 6) revealed that there was a significant difference in cumulative
mors between the two keyboard designs (Fin = 12.294, p < .OS). QWERTY had consistently
higher error rates throughout the expenment. There was also a significant increase in
cumulative erron over sessions (F19,76 = 4.419, p <.OOl). This may have occurred because
entry speed increased over sessions, thus participants' input tended to continue into the reaction
time phase of error. It was noted that participants wanted to challenge themselves on entry
speed and tended to be more relaxed on errors. They were instmcted to focus on both speed
and accuracy (section 5.1.5) and were reminded of this when errors got too high. However, it
seems that cumulative error may not actuaily descnbe participants' behavior during the
experiment.
+ Opti r,. Qwerty
1.0 .
0.0 - -- - - - - - - - - - - - - . -
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Sessions
Figure 29 Mean cumulative error over sessions
Table 6 ANOVA: Cumulative Erroe for Experirnent 2
Source df SS MS F P
Part. 4 122.723 30.68 1
Ke yboard 1 39.761 39.76 1 1 2.294 ,0248
Keyboard * participant 4 12.937 3 -234
Session 19 1 15.520 6.080 4.3 19 .O00 1
Session * participant 76 104.574 1.376
Keyboard * session 19 10.148 .534 1.582 -0830
Keyboard * session * partic. 76 25.657 -338
5.2.2.2. Chunk Errors
The experiment was designed to control the error by forcing the user to always keep
synchronization with the given text. As the entry speed increased, another type of error may
have occurred more ofien. This has been called a chunk error (Matias, MacKenzie and
Buxton, 1996). This occurs, for example, when a user is unsynchronized with the given text as
follows :
Al1 the entries starting fiom the second space d e r "THE" (as s h o w underlined) would be
counted as errors. Thus eight (8) errors will be recorded. To correct for this, the user must
ignore "A" and enter "S". as indicated by the cursor +'", to regain synchronization.
In the above exarnple, it would be unfair to ascribe eight errors to the user, simply because the
words were shifted by one place. To compensate for this. we count a series of more than one
(inclusive) continuous error as only one error. We cal1 this a chunk error.
0.0 ---- - - - - - . - - - - - - - - - . - . - - - .
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Sessions
Figure 30 Mean chu& error over sessions
Since a chunk error is a subset of a cumulative error, the chunk error rate must be lower than
the cumulative error rate. An anaiysis of variance (Table 7) showed that chunk error rates
increased significantly over sessions on both keyboards (Fi.4 = 16.695, p < .05). Chunk error
rates were d s o significantly lower for OPTI than for QWERTY = 4.809, p < -0001).
Matias et al. (1996) found that chun. errors did not increase as much as the cumulative errors
when the total entry speed increased. This suggested that the size of the chunks got longer and
longer since it was more and more diEcult for user to react to errors when typing speed
increased. This was not the case in this study, and we attribute this to the different motor skiils
required. Matias' experiment tested touch typing with five fingers. The interaction between
fingers can cause a " d m roll" effect, which makes it difficult to stop when an error occurs. In
our experiment, however, such interaction did not exist because there was only one channel of
input at a time. Recall that a "chunk" has length of one for one-or-more concurrent error
characters. The consistent increase in chunk error rates with cumulative error rates suggests
that chunks did not get longer as entry speed increased.
Table 7 ANOVA: Chunk Errors for Experiment 2
Source df SS MS F P -- -
Participant 4 73 -280 18.320
Keyboard. 1 3 7.424 3 7.424 16.695 .O 150
Keyboard * participant 4 8.967 2.242
Session 19 64.986 3 -420 4.809 .O00 1
Session * participant 76 54.055 -71 1
Keyboard * session 19 4.260 -224 1.398 A538
Keyboard * session * partic. 76 12.189 .160
5.2-3. Use of Space Keys
nie OPTI iayout had four space keys (Figure 18, p. 43). For any character-space-character
sequence at Least one space key would create the shortest path, which is therefore referred to as
the optimal space key. The experiment was designed so that the use of different space keys
was transparent to users, but the data file would distinguish them as different keys.
In the experiment? participants were instructed that they could use any of the space keys as they
entering text. As the sessions progress, one would expect three major scenarios: (a) users
would attempt to use the optimal space key; (b) users would always use the closest space key,
following the first character of the character-space-character sequence or (c) users would tend
to use a "favorite" space key.
A favorite space is a personal choice. It does not necessarily have to be optimal. It rnight be
the one that stays visible more often than others, for example. We did not dwell on how users
chose or used space keys because they were only a small part of the keyboard. Figure 31
shows that the use of optimal space keys increased fiom 38% to 47%. Considering that for
each space entered there was one optimal space key4 and three non-optimal space keys, we c m
see that the optimal space did get more use than non-optimal space keys.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Session
Figure 3 1 Use of optimal spaces with OPTI over sessions
Having four space keys is convenient. But forcing users to always use the optimal space key
requires extra judgement on the fly and this is not likely to occur.
'' When the character-space-charmer pattern is symrnetrical there wouid be two or more space keys that create the same shortest path. In an extreme case with E-Space-E, ail the four space keys are equally optimal.
The four space keys were not created equal. Figure 32 shows the theoretical optimal percentage
of the four space keys and the actual use of these four keys. Note that the position of space
keys is Space 1 (top left), Space 2 (top right), Space 3 (bottom lefi), and Space 4 (bottom right)
(See Figure 18 on p. 43). Space 1 holds the highest share of being optimal but it was the least
fiequently used, causing a failback in overail performance.
O 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Session
Figure 32 Theoretical and actual optimal use of space keys
Recall that the ratio of the movement tirne of character-only digraphs to digraphs involving
spaces is 62:38. This means that 62% of the time, the pen travels fiom character to character
when entering common English. Table 8 lists the character and space time distributions of
cornrnon English across several keyboard designs. The mean MT gives the overall upper
bound speed limit. The M T C h - C ~ and iMTchhSpch are the percentage t h e required for movement
74
involving characters only and for movement involving both space and characten. From the
table we can see that in al1 cases space keys are doing very well in reducing the total MT: 38%
of the entry task takes only about 10% of the time, whereas over 60% of the entry task would
take 90% of the time for digraphs involving only characters. In other words, the designs are
biased in favour of the space key(s).
An ided distribution of the time would be one that matches the digraph ratio (62:38) as closely
as possible. Here we see a great potential of designing an even faster keyboard if MTCh-Chr and
not A47',ch-qKh, can be reduced.
Table 8 Movement Time Distributions
Common English Y
Digraphs Digraph QWERTY OPTI 5 x 6 OPTI
Fitaly Distribution
Mean MT (s) 0.278 0.206 0.202 0.21 1
Char - char 60% 89.6% 90.0% 89.0% 9 1 .O%
Double letters 3%
Char - sp & sp - char 38% 10.4% 10.0% 1 1 .O% 9.0%
Note: the 5 x 6 OPTI design is presented in the addendurn on p. 77.
Although the design was not handedness specific, later analysis showed that space 1 is most
optimal, followed by space 2 (refer to Figure 18 on p. 43 for key assignments and Figure 32 for
the distribution). This might inform the future design of soft keyboard as well.
Chapter 6. Conclusion
In this work, we have explored rnany issues in text entry using sofi keyboards and a stylus.
These include:
The effect of keyboard size;
The visual search time for novice users;
The theoreticai expert behavior in using a soft keyboard;
The relationship between touch typing ski11 and use of a soft keyboard;
Major issues in developing an optimal keyboard for stylus tapping.
A longitudinal study of user performance with soi? keyboards.
Predictions for the theoreticai upper limit for severd keyboard layouts;
The design of a new optimal sofi keyboard for pen-based text entry with a predicted upper
speed limit of 58.4 wprn.
We have the following major findings:
8 There was no significant effects for keyboard size on performing text entry within the size
range we tested;
The difference between our finding and Hick-Hyman's weil-studied formula on visual
search time RT can be attributed to a user behavior not modeled, namely, hand traveling
patterns during visual search.
76
There was a Iow correlation between touch typing speed and tapping speed. The low
correlation can be attributed to the different motor act, and different mental models in
transferrhg text entry tasks into key selection.
The empiricai results strongly supported our keyboard design and prediction. Usen were
able to perform better with the OPTI keyboard design over the traditionai QWERTY layout
after 4 hours of practice. A mode1 extrapolation suggested that the theoretical upper bound
b i t codd be reached if about 50 sessions (1 7 hours) of practice were received.
Data have shown that the OP- design is not only faster but also significantly more
accurate than a OWERTY sofi kevboard.
Addendum:
After we have concluded our experimentation on soft keyboard, we continued to explore other
design possibilities. One design that yielded even faster predictions than OPTI was a 5 x 6
design. It has a slightly better the-proportion between kfT'Ch-Ch and MTCh-spch. The theoretical
upper speed limit is 59.4 wpm (Figure 33). We also believe that an even faster keyboard is
possible.
Figure 33 A five by six version of the optimal keyboard
Since we now believe that spaces should be given less credit in text entry with soft keyboards,
perhaps a user custornizable "favorite space" would be a better choice. A favorite space key
could be one that is not in the way of alphabetic entry most of the time. Hitting a space could
be like casually visiting a neutral zone with minimum attention required. An example is to
have the entire rest of the window, or even the entire screen, as a space key.
A favorite space key could also be the one that is always the closest and easiest to hit. This
would always require the shortest movement time. An example would be a large space in the
center of the keyboard.
Bibliography
1. Bahill. (1979). The trajectones of saccadic eye movements. Scientifc Arnerican. 40, 108-
118.
2. Balaicrishan, R., & M a c K e ~ e , 1. S. (1997). Performance differences in the fingers, wrist,
and foreami in cornputer input control. Proceedings of the CHI '97 Conference on Human
Factors in Computing Systems, 3 03-3 10. New York: ACM.
3. Blickenstorfer, C. H. (1995, March). Graffiti: Wow!!!! Pen Computing Magarine, pp. 30-
31.
4. Blickenstorfer, C. H. (1996, August). Handwriting recognition is alive and well. Pen
Computing Magazine, pp. 3 0-33.
5. Blickenstorfer, C. H. (1997, April). A new look at handwriting recognition. Pen
Computing, pp. 76-8 2 .
6. Brown, C. M. L. (1988). Cornparison of typing and handwriting in "two-finger typists".
Proceedings of the Human Factors Society 32nd Annual Meeting, 3 8 1 -3 8 5. Santa Monica,
CA: Human Factors and Ergonornics Society.
7. Card, S. K., English, W. K., & Burr, B. J. (1978). Evaiuation of mouse, rate-controlled
isometnc joystick, step keys, and text keys for text selection on a CRT. Ergonornics, 21(8),
601-613.
8. Card, S. K., Moran, T. P.. & Newell, A. (Eds.). (1983). m e psychology of human-cornputer
interaction: Lawrence Erlbaum Associates,
9. C hmg, L., & MacKenzie. 1. S. ( 1 994). A comparison of two handwriting recognizers for
pen-based computes Proceedings of the CASCON 94,364-371.
10. De Jong, J. R. (1957). The effects of increasing ski11 on cycle tirne and its consequences for
time standards. Ergonornics, 1, 5 1-60.
11. Dix, A., Finlay, J., Abowd, G., & Beale, R. (Eds.). (1993). Human-cornputer interaction:
Prentice Hall.
12. Epps, B. W. (1986). Comparison of six cursor control devices based on Fitts' law models.
Proceedings of the 30th Annual Meeting of the Human Factors Societv, 327-33 1. Human
Factors Society.
13. Fitts, P. M. (1 954). The information capacity of the human motor system in controlling the
amplihide of movement. Journal of Erperimentd Psychology, 4 7(6), 3 8 1 -39 1.
14. Fitts, P. M., & Peterson, J. R. (1964). Information capacity of discrete motor responses.
Journal of Experirnental Psychology, 6 7(2), 1 O3 - 1 1 2.
15. Frankish, C., Hull, R., & Morgan, P. (1 995). Recognition accuracy and user acceptance of
pen interfaces. Proceedings of the Proceedings of the CHIt95 Conference on Human
Factors in Computing Systems, 503-5 10. New York: ACM.
16. Goldberg, D., & Goodisman, A. (1991). Stylus user interfaces for manipulating text,
Proceedings of the ACM SIGGRAPH Symposium on User Interface Sofmare and
TechnoZogy (pp. 12% 13 5).
17. Goldberg, D., & Richardson, C. (1993). Touch-typing with a stylus. Proceedings of the
M E R C H I '93,8047. ACM.
18. Gopher, D., Hilsemath, H., & Raij, D. (1 985). Steps in the development of a new data entry
device based upon two hand chord keyboard Proceedings of the Human Factors Sociew
29th Annual Meeting, 132-1 36. Human Factors Society.
19. Gopher, D., & Raij, D. (1988). Typing with a two-handed chord keyboard: will the qwerty
become obsolete? lEEE Transaction on System, Man and Cybernetics. 18,60 1-609.
20. Graham, E. D. (1 996). Virtual pointing on a cornputer display: Non-linear control-display
mappings. Proceedings of the Graphies Interface '96, 39-46. Canadian Human-Cornputer
Communications Society.
2 1. Greenstein, J. S., & Arnaut, L. Y. (1 987). Human factors aspects of manual cornputer input
Devices. In G. Salvendy (Ed.), Handbook of Human Factors (pp. 1450-1489). New York:
John Wiley & Sons.
8 1
22. Heuer, H., Kleinbeck, U., & Schmidt, K. H. K. H. Schmidt (Eds.). ( L 985). Motor behavior:
Programrning, coniroi. and acquisition. New York: S pringer-Verlag.
23. Hick W. E. ( 1 952). On the rate of gain of information. Quarterly Journal of Erperimenîal
Psychoiogy, J(11-26).
24. Hyman, R. (1953). Stimulus information as a determinant of reaction tirne. Journal of
Ejcperimental PsychoZo~, 45, 1 88- 1 96.
25. Jacob, R. J. K. (1994). Eye tracking in advanced interface design. In W. Barfield & T. A. F.
III (Eds. ), Virtual Environments and Advanced Interface Design . Oxford: Oxford.
26. Kerr, B. A., & Langolf, G. D. (1977). Speed of aiming movements. Q m e r l y Journal of
Experimental Psychology, 29,475-48 1 .
27. LaLomia, M. 1. (1 992). User acceptance of computer applications with speech, handwriting
and keyboard input devices, Proceedings of ACM CH1'92 Conference on Human Factors in
Computing Systems -- Posters and Short Talks (pp. 5 8).
38. LaLornia, M. J. (1994). User acceptance of handwritten recognition accuracy. Proceedings
of the Cornpanion Proceedings of the CHI'9-f Conference on Human Factors in Computing
Systems, 107- 108. New York: ACM.
29. MacKenzie, 1. S. (1989). A Note on the information-theoretic basis for Fitts' Iaw. Journal
of Motor Behavior, 21(3), 323-330.
30. MacKenzie, 1. S.. Seiien, A., & Buxton, W. (1991). A Cornparison of input devices in
elemental pointhg and dragging tasks. Proceedings of the CH191, 16 1 - 166. ACM Press.
3 1. MacKenzie, 1. S. (1 992). Movement tirne perdiction in human-cornputer interfaces.
Proceedings of the Graphics Inferface '92, 1 40- 1 50. Canadian Information Processing
Society.
32. MacKenzie, 1. S., & Buxton, W. (1 992). Extending Fitts' Iaw to two-dimensional tasks.
Proceedings of the CHI '92,2 19-225. Addison Wesley.
33. MacKenzie, 1. S., & Ware, C. (1993). Lag as a determinant of human performance in
interactive systems. Proceedings of the M E R C H I '93,4881193.
34. MacKenzie, 1. S., Nonnecke, B., Riddersma, S., McQueen, C., & Meltz, M. (1994).
Alphanumeric entry on pen-based cornputers. International Journal of Human-Cornputer
Sfudies, J 1(5), 775-792.
35. MacKenzie, 1. S., Nomecke, R. B., McQueen, J. C., Riddersma, S., & Meltz, M. (1994). A
cornparison of three rnethods of character entry on pen-based computen. Proceedings of
the Human Factors and Ergonornics Socieîy 38th Annual Meeting, 330-334.
36. M a c K e ~ e , 1. S., Nonnecke, R. B., McQueen, J. C., Riddersma, S., & Meltz, M. (1994). A
cornparison of four methods of numeric entry on pen-based cornputers. Proceedings of the
Graphics Interface '95475-82. Canadian Hurnan-Cornputer Communications Society.
37. MacKenzie, 1. S., & Zhang, S. X. (1 997). The immediate usability of Graffiti Proceedings
of the Grophics Interface '97, 129-1 37. Canadian Information Processing Society.
38. MacKenzie, 1. S., Soukoreff, R. W., & Zhang, S. (1997). Text enhy using soi3 keyboards.
Behavior and Information Technology, (subrnitted for publication).
39. Matias, E., MacKenzie, 1. S., & Buxton, W. (1993). Half-QWERTY: A one-handed
key board facilitating ski11 transfer fiom Q WERTY, Proceedings of A CM INTERCHI'93
Conference on Human Factors in Computing Systems @p. 88-94).
40. Matias, E., MacKenzie, 1. S., & Buxton, W. (1996). One-handed touch typing on a
Q WERTY keyboard. Human-Compter Interaction, 1 1, 1-27.
41. Maymer, M. S., & Tresselt, M. E. (1965). Tables of single-letter and digram frequency
counts for various word-length and letter-position combinations. Phychonomic Mnograph
Supplements, 1 (S), 1 3-32.
42. McQueen, C., MacKenzie, 1. S., Nonnecke, B., Riddersma, S., & Meltz, M. (1994). A
cornparison of four methods of numeric entry on pen-based computers. Proceedings of the
Proceedings of Graphics Interface W. 75-82. Toronto: Canadian Information Processing
Society.
43. McQueen, J. C., MacKe~xie, 1. S., & Zhang, S. X. (1 995). An extended study of numeric
entry on pen-based computers. Proceedings of the Graphics Interface '95, 215-222.
Canadian Information Processing Society.
84
44. Miller, G. A. (1953). What is information measurement? American Psychologist. 8, 3-1 1 .
45. Neisser, U., & Weene, P. (1960). A note on human recognition of hand-printed characters.
Information and Control, 3, 1 9 1 - 1 96.
46. Newell, A., Br Card S. (1985). The prospects for phychological science in hurnan-
computer interaction. Homan Cornputer Interaction, 1,209-242.
47. Noyes, J. (1983). Chord keyboard. Applied Ergonornics, I . I ( l ) , 55-59.
48. Noyes, J. (1983). The QWERTY keyboard: A review. International Journal of Man-
Machine Studies. 18(3), 265-28 1.
49. Pausch, R., Crea, T., & Conway, M. (1992). A literature survey for virtual environments:
military flight simulator visual systems and simdator sickness. Presence, 1(3), 344-363.
50. Potosnak, K. M. (1988). Keys and keyboards. In M. Helander (Ed.), Handbook of Human-
Cornputer Interaction (pp. 475-494). New York, NY: North-HoIland.
5 1. Raij, D., Gopher, D., & Kimchi, R. (1987). Perceptual clnd motor determinants of efficient
data entry, Proceedings of the Human Factors Society 3 1st Annual Meeting (pp. 820-824).
52. Shannon, C. E., & Weaver, W. (Eds.). (1949). The mathematical theory of communication.
Urbana, IL: University of Illinois Press.
53. Soukoreff, R. W., & MacKenzie, 1. S. (1995). Theoreticai upper and lower bounds on
typing speed using a stylus and a soft keyboard. Behmiour and Informution Technology.
I4(6), 370-379.
54. Soukoreff, R. W., M a c K e ~ e , 1. S., & Zhang, S. (1997). Text entry using soft keyboards.
Behavior and information Technology, (submitted for publication).
55. Strong, E. P. ( 1 956). A cornparif ive experimenf in sirnplz~ed keyboard reîraining and
standard keyboard supplementary training . Washington, DC: General Services
Administration.
56. Tappert, C. C., Suen, C. Y., & Wakahara, T. (1990). The state of the art in on-line
h a n d k ting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence.
12(8), 787-808.
57. Welford, A. T. ( 1 968). Fundarnentals of skill, Movernenf: Relationships berneen speed and
accuracy @p. 145-1 60). London: Methuen & Co Ltd.
Appendices
A. Single Letter Frequencies of Common ~n~l ish' .
5 From Maymer and Tresselt, 1965.
Single Letter count Space 1 19998
Cumulative fieq. % 1 8.66
B. Digraphs Involving Ten Most Frequent ~et ters~ .
The order of ten ietters are: E, T, A, H, O, S, N, R, 1, and L
From Soukoreff and MacKenzie, 1995. 7 DC = Digraph count. A "-" represents "Space". For example, HE occwed 3 155 tirnes. E-Space occurred 4904 tirne, etc. A blank indicates that there is no such digraph in the sample.
B. (continued)
C. General Description of V1.O
The experiment software generates two different sizes of soft keyboards each with two layouts:
fixed QWERTY or randornized. When key randomization is enabled, the key assignment is
completely randornized each time a key is tapped. Since the space key has a different shape
than the others, it is not reassigned with randomization.
The key dimensions of the large keyboard is 10 x 10 mm measured on the Wacom PL-IOOV
12" digitizing tablet screen. The small keys were 6.4 x 6.4 mm. The large keys have the same
height, in pixels, as buttons found in a typical MS Windows application's dialog box. The
small keyboard has nearly the same size as the on-screen keyboard found on a 3Comts
PaIrnPiIot.
V1 .O also allows for different input devices. When a stylus is selected the prograrn hides the
mouse cursor. Other device selections will enable the mouse cursor. It also has different
modes for practicing or for ninning experiment. In the latter case, al1 data are saved to files.
The original and participant-entered text are displayed in a 16 point bold uppercase mono-
spaced Courier font to help keep participants synchronized during text entry.
Once the first character is entered, the program collects a senes of data values:
The time elapsed since the last character was entered,
The given character at the cursor position (a caret """),
90
The position of the given character on the keyboard (The positions are numbered fiom
O for the top-left key to 25 for the bottom right key of the third row. Space is numbered
26.1,
The character entered by the participant,
The position of the participant-entered character and
Emr statu, (O = correct character, 1 = incorrect character).
An audible "tick" signal indicates an error.
The program automaticaily stops at the end of the phrase and displays a summary of text entry
speed, average elapsed tirne between characters, and error rate;
Utilities 1
Help - i
I
Options I
Data file namingl
Device 1 Sample set size 1
1 Load sample phrase set
I 1
Correlation test
Practice 1 Experirnent
I I I
I File information
I Subject ID
Session ID
Block ID
Setup keyboard
D. Listing of phrases used in experiment one
ALL GOOD BOYS DESERVE FUDGE
THE BACK YARD OF OUR HOUSE
MARY HAD A LITTLE LAMB
PROTECT OUR ENVIRONMENT
VIDE0 CAMERA WITH ZOOM LENS
LIFE IS BUT A DREAM
THE QUICK BROWN FOX JUMPED
BLACK AND WHITE IMAGE
CLOSED FOR THE SEASON
MOM MADE HER A TURTLENECK
E. Sample text used in the typïng test in the experiments
A queen asked her advisor, the grand vizier, why she never seemed able to fmd mental
contentment. The vizier indicated that her mind is constantly occupied with exquisite
possessions. When one's mind is absorbed with thoughts of jewels, palaces, and gold, it cannot
fuid equilibrium. The queen pondered the vizier's words, and realized that he had spoken the
tmth.
Listing of phrases used in experiment two
MOM MADE HER A TURTLENECK GOLDILOCKS AND THE THREE BEARS WE DID SOME GROCERY SHOPPING THE ASSIGNMENT IS DUE TODAY WHAT YOU SEE IS WHAT YOU GET FOR YOUR INFORMATION ONLY A QUARTER OF A CENTURY THE STORE WILL CLOSE AT TEN HEAD SHOULDERS KNEES AND TOES VANILLA FLAVORED ICECREAM FREQUENTLY ASKED QUESTIONS ROUND ROBIN SCHEDULMG INFORMATION SUPER HIGHWAY MY FAVORITE WEB BROWSER THE LASER PRINTER IS JAMMED ALL GOOD BOYS DESERVE FUDGE TKE SECOND LARGEST COUNTRY CALL FOR MORE DETAILS SHE ARRIVED N S T iN TIME HAVE A GOOD WEEKEND VIDE0 CAMERA WITH ZOOM LENS MONKEY SEE MONKEY DO THAT IS VERY UNFORTUNATE THE BACK YARD OF OUR HOUSE THIS IS A VERY GOOD IDEA THE MARCH READiNG WEEK OUR FAX NUMBER HAS C W G E D THANK YOU FOR THE HELP NO EXCHANGE WITHOUT A BILL THE EARLY BIRD GETS THE WORM BUCKLE UP FOR SAFETY THIS IS TOO MUCH TO HANDLE PROTECT OUR ENVIRONMENT THE GROWMG WORLD POPULATION
35. THE LIBRARY IS CLOSED TODAY
36. MARY HAD A LITTLE LAMB 37. TEACHING SUPPORT SERVICES 38. WE WILL GO SWIMMING TODAY 39. WE ACCEPT PERSONAL CHEQUES 40. NON PROFIT ORGANIZATION 4 1. USER FRIENDLY NTERFACE 42. EAT HEALTHY FOOD EVERY DAY 43. GET SOME HANDS ON EXPERIENCE 44. THIS WATCH IS TOO EXPENSIVE 45. UNITED STATES POSTAL SERVICE 46. COMMUNICATE THROUGH EMAIL 47. THE CAPITAL OF OUR NATION 48. TRAVEL AT THE SPEED OF LIGHT 49. 1 DO NOT FULLY AGREE WITH YOU 50. GAS BILLS ARE SENT OUT MONTHLY 5 1. EARTH QUAKES ARE PREDICTABLE 52. LIFE IS BUT A DREAM 53. TAKE IT TO THE RECYCLING DEPOT 54. SEND THIS BY REGISTERED MAIL 55. FALL IS MY FAVORITE SEASON 56. A FOX IS A VERY SMART ANIMAL 57. THE KIDS ARE VERY EXCITED 58. THIS PARKING LOT IS FULL 59. MY BIKE HAS A FLAT TIRE 60. DO NOT WALK TOO QUICKLY 6 1. DUCKS GO QUACK WHEN HUNGRY 62. IT HAS A LIMITED WARRANTY 63. THE FOUR SEASONS OF THE YEAR 64. THE SUN RISES IN THE EAST 65. IT WAS VERY W D Y TODAY 66. DO NOT WORRY ABOUT THIS 67. DASHING THROUGH THE SNOW 68. WANT TO JOiN US FOR LUNCH 69. STAY AWAY FROM STRANGERS 70. ACCOMPANIED BY AN ADULT
G. General description of V2.O
V2.0 is designed for ninning Iongitudinaily with a variable number of blocks per session. It is
designed to be self-contained. In each session, the expenmenter oniy needs to enter user id,
session id and the size of the block and select mode. The rest of the process is then under
participant's control without intervention by the experimenter.
It generates two different sessions with either OPTI or QWERTY layout. The data collected in
files are similar to that of V1 .O. It was tested extensively prior to the experiment.
I Start program menu I
Control panel - I
1 1
Mode
Load sample phrase set fiom file
Data file naming / I Layout
1
Automatic reset mouse cursor
4 Correlation coefficient
Help
1 Set experiment conditions I I 4 I Subject ID
Starting block ID
v v I Setup keyboard
4
Program structure of V2.0.
Session ID
& Calculating the theoretical MT of a random keyboard
MTmdo, is the mean movement t h e on a keyboard with key distributed in a fixed physicai
position without fixed letter-key assignments (Figure 34).
Figure 34 A Keyboard without fixed Ietter-key assignments
When we type on this keyboard, every tirne we hit a key, the next letter can be in any position
on the keyboard. Therefore the mean movernent time on this keyboard can be calculated
mathernaticall y without information about any speci fic language.
If the space is the only known key we can mode1 the movement time of such keyboard as
foilows.
Number the character keys from 1 to 26. The mean distance MDch=h br character to character
is:
And the mean distance fiom character to space (nurnbered as key 27) to character M&-yr,,:
Where DI- , is the distance between key 1 and key 1 (here is zero), Dl-r is the distance between
key 1 and key 2 and is the distance between key 1 and key 27, and so forth.
Therefore, the total movement time can be modeled by Fitts' law given that the mean distance
MD = MD,-J,~~ + MDch;.pi:
APPLIED - IMAGE. lnc - - 1653 East Main Street - -* - Rochester. NY 14609 USA -- -- - - Phone: 71 W82-0300 -- -- - - Fax: 716/28&5989
O 1993. Appiied Imfqe. Inc.. All M t s Reserved