27
KTH-TMH Stockholm [email protected] Looking for feedback expressions in speech corpora

KTH-TMH Stockholm [email protected]

  • Upload
    lysa

  • View
    35

  • Download
    0

Embed Size (px)

DESCRIPTION

Loredana Cerrato. Looking for feedback expressions in speech corpora. KTH-TMH Stockholm [email protected]. General idea. - PowerPoint PPT Presentation

Citation preview

Page 1: KTH-TMH Stockholm loredana@speech.kth.se

KTH-TMH [email protected]

 

Looking for feedback expressions

in speech corpora

Page 2: KTH-TMH Stockholm loredana@speech.kth.se

General idea

Study of the distribution, typology, use and function of verbal and non-verbal feedback expressions in human communication and in human machine communication.

Why?

Try to model human behaviour and try to reproduce it in speech technology application

Page 3: KTH-TMH Stockholm loredana@speech.kth.se

Definition of feedback in human communcation

Allwood JAllwood J. et al. et al. 1992, On the semantics and pragmatics of linguistic feedback. 1992, On the semantics and pragmatics of linguistic feedback

“Control strategies of the channel of communication and of the informative flux, used by participant to a conversation to support the interaction”.

”Feedback mechanism is one of the most important cohesion devices in human conversation”

”Feedback consists of a set of verbal and bodily means used, in a minimal intrusive way, both to elicit and give information about the basic communicative functions: contact, perception, comprehension, attitudinal reactions”

Page 4: KTH-TMH Stockholm loredana@speech.kth.se

General ways of expressing verbal feedback

• short verbal expressions: yes, no, ok together with some prosodical and phonological phenomena (like varying pitch contour and vocalic lengthening)

• short non lexical items: ah, ah-ah, mm, mh-mh

• repetition: either the last word uttered by the interlocutor or by repeating the core words of the last sentence

• reformulation: of the meaning of the received message.

Page 5: KTH-TMH Stockholm loredana@speech.kth.se

General ways of expressing non-verbal feedback

• Facial expressions: smile, eyebrow rising, gaze.

• Head movements: nods, shakes, tilts ..

• Gestures: hand, shoulder movements • Postures: trunk position.

-Combination of verbal and non verbal expressions

Page 6: KTH-TMH Stockholm loredana@speech.kth.se

Looking for verbal feedback expressions in a corpus of Italian spoken dialogues (human-human

communication)Material: • 12 spontaneous-elicited dialogues of ca. 12

minutes each

• AVIP database: Archive of Spoken Italian VarietiesNapoli, Bari, Pisa

(Collected in 1998-1999, it containes 16 dialogues transcribed at different levels: ortographic, phonetic, phonemic, prosodic and DA)

• Map task technique

Cerrato L. 1999

Page 7: KTH-TMH Stockholm loredana@speech.kth.se

Map task technique

Go straight till the lake, then at the park turn left.

Yes…park? What park?

I don´t have any parks

GIVER FOLLOWEROnly audio recordings

Task-oriented conversation

Giver: gives instruction

Follower: receives the instruction to accomplish the task

the auditive channel of communication is maximisedno visual channel, therefore feedback expressions are produced in a great number.

Page 8: KTH-TMH Stockholm loredana@speech.kth.se

Analysis

1) Identification and labeling of feedback2) Distributional analysis 3) Acoustic phonetic analysis:

– Duration and Pitch contour analysis With the aim of detecting acoustic cues related to the function

carried out by feedback expressions:• show attention/continuation of contact• aknowledge reception• agreement, acceptance• refusal• require clarifications• require further explanation• require further indications • show reception and acquisition of knowledge

Page 9: KTH-TMH Stockholm loredana@speech.kth.se

Distribution of feedback expressions

• 1/3 of the turns consist of feedback expressions If a dialogue counts c.a 260 turns,

90 are feedback expressions

• Feedback is expressed in a non-intrusive way

• Distinction between giving and requesting feedback

• Regional variants

A contradiction: if feedback is expressed in a ”non-intrusive way” can it be counted as a turn of its own?

Page 10: KTH-TMH Stockholm loredana@speech.kth.se

Characterisazion of feedback expressions and their functions

Type of expression

Minimal responses mmh

ah

si

no

Function

•show attention,continuation of contact

•acknowledge reception

•agreement, acceptance

•refusal

F0 contour

F0 flat or slightly falling

Extended responses•Repetitions

•Reformulations

•Questions

•require clarifications

•require further explanation

•require further indications

•show reception and acquisition of knowledge

Rising F0

Page 11: KTH-TMH Stockholm loredana@speech.kth.se

Intentional F0 variation can modify the meaning of the feedback

expression

• “Ah”: acquisition of knowledge

– Flat F0: the acquired knowledge satisfies a previous need of the listener

– Rising-falling F0: the information received doesn’t satisfy a need, but

disappoints the listener

Page 12: KTH-TMH Stockholm loredana@speech.kth.se

Looking for verbal feedback expressions in a corpus of Swedish spoken dialogues (human-human

communication)Material: 4 real dialogues between

1 travel agent and 4 customers recorded in a travel agency in Gothenburg

Göteborg Spoken Language Corpus.

Cerrato, L. 2002a

Page 13: KTH-TMH Stockholm loredana@speech.kth.se

Analysis

1) Identification and labeling of feedback2) Distributional analysis 3) Analysis of its function:

– show continuation of contact and perception– acknowledge comprehension– express a point of view (attitudinal reactions)

4) Acoustic phonetic analysis:– Duration and Pitch contour analysis

Page 14: KTH-TMH Stockholm loredana@speech.kth.se

Results: Customers-agent distribution for each dialogue

• The number of feedback expressions of the customers reaches more than 50% in all the dialogues.

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

dial 1 dial 2 dial 3 dial 4

C feedback

A feedback

Page 15: KTH-TMH Stockholm loredana@speech.kth.se

Feedback functions

0%

10%

20%

30%

40%

50%

60%

70%

dial 1 dial 2 dial 3 dial 4

continuation

aknow ledgement

point of view

Customers

0%

10%

20%

30%

40%

50%

60%

dial 1 dial 2 dial 3 dial 4

continuation

acknowledgement

point of view

Agent

Continuation of contact

Acknowledgement of information

Expression of a point of view

Page 16: KTH-TMH Stockholm loredana@speech.kth.se

Results of the acoustic analysis

 short duration and flat pitch contour – active listening attitude

– not intend to take the turn;

  longer duration and rising pitch contour – active listening attitude intention to take the turn

0 550 ms

0 290 ms

Page 17: KTH-TMH Stockholm loredana@speech.kth.se

Conclusions on feedback

• It is possible to categorize verbal feedback expressions according to their typology and to the meaning they convey.

• Intentional variation of F0 seems to play an important role for assigning a particular meaning to the expressions used.

• Question and hypothesis:

Is it possible to categorize visual feedback in the same way?

Page 18: KTH-TMH Stockholm loredana@speech.kth.se

The study of ”visual feedback”in collaboration with Jens Allwod & c.

• What is the relation between co-occurring visual and verbal feedback expressions?

– Do visual expressions emphasise the meaning of the verbal expression?

– Or do they convey different shades of meaning?

• When visual feedback occur independently of verbal feedback expressions what is its function and meaning?

Need for a multimodal database

Page 19: KTH-TMH Stockholm loredana@speech.kth.se

Feedback expressions are produced along different modalities • Verbal: words and sentences

• Prosodic: speech rhythm, pauses, intensity, stress, intonation

• Gestural: hand and arm movements

• Facial: head and eye movement, gaze, smile and other facial expressions

• Bodily: posture, trunk and leg movements

Page 20: KTH-TMH Stockholm loredana@speech.kth.se

Feedback coding in a ”multimodal corpus”

• Tool: Multitool (Leif Grönqvist, Magnus Gunnarsson)

• Corpus: GSLC- Travel agency activity• Annotation: Multitier annotation of feedback

expressionsText: transcription (MSO) Function: reports the function of the feedback

expression and the function of the utterance that precede the feedback expression

Gesture: reports the “non-verbal feedback expressions”, analysed in terms of movements.

Gaze: [should] report the direction of the speakers’ gaze

Page 21: KTH-TMH Stockholm loredana@speech.kth.se

Feedback in human machine interactions

System’s feedback• Process feedback

– ”I am active” ”I am retrieving information”

• Information feedback– ”I have understood”: explicit,

implicit feedback that indicates the reception and comprehension of the message

Explicit feedback:User: I want to go to LondonSystem: did you say you wanted to go to London?Implicit feedback: User: I want to go to LondonSystem: At what time do you wish to arrive in London?

Users’ feedback

Explicit, Implicit

To show:

Perception,

Understanding

Acknowledgement

Acceptation,

Refusal

Ecc.

Page 22: KTH-TMH Stockholm loredana@speech.kth.se

Analysis of users’ feedback in human-machine interactions

AUGUST• 18% of users (of 1206)

gave feedback at least once• 6% of utterances (of 6876)

contained feedback expressions

• In 89% of these cases, feedback appeared in a turn of its own.

ADAPT94% of users (of 33) gave feedback at least once18% of utterances (of 1845) contained feedback expressionsOnly in 6% of these cases, feedback appeared in a turn of its own (it occurred mostly at the beginning of longer sentences)

Results adapted from Bell L. & Gustafson J. 2000;

Page 23: KTH-TMH Stockholm loredana@speech.kth.se

August

•August was an experimental spoken dialogue system able to give information about tourist sights and restaurants in Stockholm, about KTH and the life and works of August Strindberg.•The August database counts more than 10.000 utterances produced by 2685 users, all visitors at the Stockholm Cultural Centre, where the dialogue system was displayed as part of the Cultural Capital of Europe '98 program.

Page 24: KTH-TMH Stockholm loredana@speech.kth.se

AdaptAdapt is an experimental dialogue system in the domain of Stockholm’ s real-estate.

The Adapt database was collected in 1999 by means of the Wizard of Oz technique.

It includes a total of 50 dialogues produced by 33 users.

Page 25: KTH-TMH Stockholm loredana@speech.kth.se

Comparison between feedback strategies in Human-human communication and in

human-machine communication

• Material: 4 dialogues from GCSL and 4 dialogues from ADAPT

• Method:Distributional, functional and acoustic analysis

• Results:1. Humans produce feedback also when interacting with

machines, but in different ways. 2. The majority of feedback expressions in GSLC travel agency

dialogues consist of short monosyllabic words produced often on their own in a non-intrusive way.

3. In the interaction with ADAPT this kind of short feedback were never produced, they rather appeared at the beginning of longer sentences.

Cerrato L. 2002b

Page 26: KTH-TMH Stockholm loredana@speech.kth.se

Measuring head movements related to feedback

in collaboration with Linköping university Quantitative measurement of the entity of head

movement related to feedback expressions with the Qualisys MacReflex motion tracking system.

A pilot study: • recording of 4 elicited dialogues between humans recorded in

controlled situation,• transcription and coding of the dialogues• identification of “typical visual” feedback• quantification of the movements• implementation of the results in talking heads to make them look

more natural.

Page 27: KTH-TMH Stockholm loredana@speech.kth.se

Main References

• Allwod J ., 1999, The Swedish Spoken Language Corpus at Göteborg University. In Proceedings of Fonetik 99, Gothenburg Papers in Theoretical Linguistics 81.

• Allwod J., Nivre J., Ahlsén E. , 1992, On the Semantics and Pragmatics of Linguistic Feedback in Journal of Semantics 64.

• Allwood J., Grönqvist L., Ahlsén E., Gunnarsson M. ,2002 Annotations and Tools for an Activity Based Spoken Language Corpus", Proc. of 2nd SIGdial Workshop on Discourse and Dialogue, Aalborg, Denmark

• Bell L & Gustafson .,2000 Positive and negative user feedback in a spoken dialogue corpus, In Proc of ICSLP 2000, Beijing, China.

• Cerrato L, 1999 Il feedback verbale in dialoghi elicitati con la tecnica del map task. In Atti del GFS 10 Napoli, Italia, 25-35

• Cerrato L. 2002a, Some charactersitcs of feedback expressions in Swedish, in TMH.OPSR Vol.43 Fonetik 2002 p. 101-104

• Cerrato L. 2002b, A comparison between feedback strategies in Human-Human and Human-machine communciation in proceedings of ICSLP-2002, Denver, Colorado USA, (in press)