52
Turn Taking Ravi M H

TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

  • Upload
    dominh

  • View
    221

  • Download
    3

Embed Size (px)

Citation preview

Page 1: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn  Taking

Ravi  M  H

Page 2: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

SIGNALS  AND  RULES

Communication  mechanism  for  managing  taking  of  turns  in  face-­‐to-­‐face  interactionSignals,  cues  and  rules  are  usedThree  basic  signals

Turn-­‐yielding  signalsAttempt-­‐suppressing  signals  by  the  speakerBack  channel  signals  by  the  auditor.

Interactional  rulesProhibition  of  interruptionRequirement  for  properly  timed  signalsAcknowledge  understandingConfirm  continued  attention  of  the  listener

Page 3: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Sources  of  data

InterviewsStrong  intrinsic  motivationAvoids  more  artificial  topic  for  discussion

VideotapingSingle  camera  placementRecording  less  subtle  expressions,  smiles  and  grimacesHigh  quality  monophonic  audio  track  was  obtained.

Transcripts  (more  in  next  slide)Requirements-­‐ max  behavioral  breadth  and  continuityDepends  on  view  of  wealth  of  communication  engaged

Page 4: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Transcripts  -­‐Phonemes

Segmental  phonemes    -­‐ pronouncing  syllables  within  framework  of  English  sound  systemSupra  segmental  phonemes    -­‐ intonation,  stress,  pitch  and  juncture

Para  languageVocal  behaviors  not  part  of  sound  system  of  language.Intensity,  pitch  height,  extent.

Body  motionHead  gestures  and  movements,            Shoulder  movementsFacial  expressions Hand  gestures  and  movementsFoot  movements   Leg  movementsPostures  and  posture  shiftsUse  of  Artifacts,  pipe,  Kleenex,  papers  and  clipboard.

Page 5: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn  taking  mechanism

Rules  delimit  appropriate  responses  by  participant

Rules  and  signals  establish  empirical  expectations  w.r.t turn  taking  activities.

Speaker   claims  speaking  turn  any  momentAuditor   does  not  claim  speaking  turn.

Page 6: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Simultaneous  turns   simultaneous  talking,  and  no  contention

Turn  taking  Mechanism  is  not  designed  to  resolve  simultaneous  turns

Turn  yieldingRules

Turn-­‐yielding  signal  is  often  met  with  turn  taking  signalBut  if  not,  then  results  in  contention,  or  simultaneous  talking.

Page 7: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

SignalTurn  yielding  signal,  6  discrete  behavioral  cues

Intonation  :  any  pitch  level  terminal  junction  combination.Paralanguage  :  Drawl  :  drawl  on  final  syllable.Body  motionSocio  centric  sequences,-­‐ appearance  of  one  of  several  

Paralanguage  :  pitch/loudness    Syntax:  completion  of  grammatical  clause  

Page 8: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Attempt-­‐Suppressing  signalRules

Speaker  maintains  turn  for  himSpeakers  hands  being  engaged  in  gesticulationSelf  and  object  adaptorsDropping  of  the  gesticulating  hand

Page 9: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Back-­‐Channel  communicationRules

-­‐Head  nods  from  auditor.Signals  

Has  large  and  complex  set  of  signals.Sentence  completions.-­‐ auditors  complete  a  sentence  speaker  beginsBrief  requests  for  clarificationRestatement  in  few  words  of  an  preceding  thought  by  a  speaker

Page 10: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Coordinating  through  gaze

Formalizing  common  pattern  in  a  computational  model  of  gaze-­‐coordinated  turn  takingDialogue  actsLinguistic  actions   realization  of  acts  throuobservable  communicative  behviours

Page 11: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Two  sequences  useful  for  turn-­‐takingMutual-­‐break   after  finishing  conversation  with  utterance.

Used  when  conversation  proceeds  smoothlyMutual-­‐hold   recipient  begins  speaking  without  immediately  looking  away.

Gaze  actions  signal  their  intention  to  give  or  keep  the  turnGaze  is  one  of  many  indicators  of  turn-­‐taking  behavior

Page 12: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Utterance  segmentation  and  turn-­‐taking

Utterance  is  when  user  speaks  and  ends  when  user  ceases  to  speak.Detection  is  on  triggers  on  silence/non-­‐speech.Nailon -­‐ online  real-­‐time  prosodic  analysis  toolFinding  the  end  of  utterance  is  important  in  turn-­‐taking  mechanism.

Page 13: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Application  of  utterance  detection

Spoken  language  understandingTopic  detectionInformation  retrievalInteraction  controlTurn-­‐takingBack-­‐channelingSystem  barge-­‐in

Page 14: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Utterance  unitsHumans  use  prosodic  boundaries  to  delimit  speech  during  speaking  and  listeningEnd-­‐of-­‐utterance(EOD)  detectors  rely  on  silence  thresholdProblem   during  spontaneous  speech   silent  pauses  also  within  segments,  called  utterance  units.

Humans  can  distinguish,  but  how  would  the  machine  ???

Page 15: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Implementation  was  done  by  Jens  Edlund,  Heldner and  Gustafson.Segmenting  speech  into  pause  bounded  utterance  units

Augmented  end-­‐of-­‐utterance  detection  along  with  boundary  tones.mid  level  boundary  tones    used  to  single  out  internal  pauses  from  final  ones.Voice  Activity  Detection  (VAD)  used  to  discriminate  speech  from  non-­‐speech.

Page 16: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

LimitationsAcoustic  analysis  was  not  tuned  for  children.Children  speech  is  different  than  grown-­‐ups.

Future  scopeAdd  more  features,  and  test  on  greater  variety  of  dataMerge  information  by  prosodic  analysis  with  other  semantic  interpretations,  dialogue  context,  etc.

Page 17: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

3.Conversation  acts  in  task  oriented  system

Grounding      -­‐ establish  a  mutual  understanding  or  common  ground  of  conversation  contentEx  :  TRAINS  Project,  having  manager  (M)  talking  to  system(S)  to  accomplish  a  task  in  that  domain.The  domain  knowledge  is  a  must  in  task  oriented  acts.Pervasiveness  of  acknowledge  and  agreement  signals  striking  features  of  task-­‐oriented  dialogues.

Page 18: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Assumptions  and  tradeoffs

Each  Utterance  encodes  a  single  speech  actSpeech  acts  are  single  agent  plans.    Listener  is  passively  presentUtterances  are  heard  and  understood  correctlyTradeoffs  :  utterances  can  be  misunderstood

» Visual  cues,  head  nodding,  and  continued  eye  contact  are  not  taking  into  account,  which  is  general  in  human  nature.

» Presence  of  multiple  agents  -­‐ as  this  cant  be  stand  alone  system  and  has  to  be  part  of  a  framework.

Page 19: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Conversation  acts

4  levels  of  actions  for  maintaining  the  coherence  and  content  of  conversation.1.  Core  speech  acts   Discourse  Unit  acts

Inform  Request  Promise

2.  Argumentation  ActsElaborate  SummarizeClarify  Q&AConvince  Find-­‐Plan

Page 20: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Argumentation  acts

Page 21: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

3.  Grounding  ActsInitiateContinueAcknowledgeRepairReqRepairReqAckCancel

Page 22: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

4.  Turn-­‐taking  ActsKeep-­‐turnRelease-­‐turn  (assign-­‐turn)Take   turn

Certain  sound  patterns,  such  as  "uhh",  seem  to  carry  no  semantic  content  beyond  keeping  the  turn.Pauses  are  opporutnities for  anyone  to  take  the  turn.

uhh-­‐

Page 23: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

How  to  recognize  my  conversation  acts??  

Monitoring  speech  input  for  certain  set  of  featuresCan  be  overridden  by  evidence  from  other  featuresCan  be  done  through  context saved  through  states  accumulated  in  tracking  a  discourse  unit  through  a  sequence.

Page 24: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Conversations  are  primarily  concerned  to  solve  a  problemGather  InformationSocial  goals.

Recognizing  Turn-­‐taking  acts.Dependent  on  social  settingDetermined  using  the  same  channel  as  that  of  system  that  is  regulating.

Page 25: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Local  initiative   other  party  is  under  obligation  to  speak  or  respond  to

QuestionsRequests

If  no  obligations,  then  higher  level  goal  and  expectative  will  derive  the  local  control

Page 26: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Finite  state  turn-­‐taking  model

Model  to  control  turn  taking  behavior  of  conversation  agents

Uses  cost  matrixDecision  theoretic  principles

Knowing  what  to  say  is  equally  important  as  to  knowing  when to  sayMost  of  models  caters  to

hand-­‐coded  expert  knowledgeAnd  not  data-­‐driven  optimization

Page 27: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Builds  upon  previous  work  on  finite-­‐state  model  of  conversational  floorSimple  and  generalGrounded  in  decision  theoryLends  well  to  data-­‐driven  optimization

Page 28: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Jaffe  and  Feldstein  studiedMean  duration  of  pausesSwitching  pausesSimultaneous  speechVocalization  in  recorded  dyadic  conversations.Proposed  first  order  Markov  models  to  capture  alternation  of  speech  and  silence  in  dialog.

Page 29: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

6  finite  state  model

Page 30: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn  transitions  with  gap

Turn  transitions  with  overlap

Page 31: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Failed  interruptions

Time  outs

Page 32: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Cost  of  Turn-­‐Taking  Actions

CS  is  the  cost  of  interrupting  a  system  prompt  before  its  end  when  the  user  is  not  claiming  the  floor  (false  interruption)CO(t  )  is  the  cost  of  remaining  in  an  overlap  that  is  already  t  ms  longCU  is  the  cost  of  grabbing  the  floor  when  the  user  is  holding  it  (cut-­‐in)CG(  t)  is  the  cost  of  remaining  in  a  gap  that  is  already  t  ms  long

Page 33: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Decision  Theoretic  Action  Selection

the  optimal  decision  at  any  point  in  time  is  the  one  that  yields  the  lowest  expected  cost,  here  the  expected  cost  of  action  A  is:

observable  features  of  the  world,  and  C(A,  S)  is  the  cost  of  action  A  in  state  S

Page 34: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Finite  state  Turn-­‐Taking  Machine  relies  on  three  core  elements

A  non-­‐deterministic  finite-­‐state  machineA  cost  matrix  that  models  the  impact  of  different  system  actionsA  decision-­‐theoretic  action  selection  mechanism

Page 35: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn-­‐taking  framework  for  spoken  dialogue  system  which  bid  for  turn.

Observation  in  other  modelsPosition  for  Possible  turn  release  is  predictableEmphasis  on  release  detection  for  transitions.Use  pause  threshold,  and  predict  release-­‐turn  using  lexical  cues.Other  uses  Reinforcement  learning  to  tailor  onset  of  system  utterances.

But,  Question  is  who  will  resolve  turn-­‐conflicts  among  these  ????????  None

Page 36: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Bidding  approach,  for  turn-­‐conflict  resolution,  is  negotiative  framework   turn-­‐bidding  model

Useful,  when  amount  of  time  to  speak  was  reduced  ,  conflicts  grew.This  markedly  increases  value  of  utterance  and  urgency  to  speak.Utterances  overlap  due  to  competing  interest  of  conversant  in  taking  turn

Common  point-­‐ people  do  not  wait  for  their  turn-­‐release  cues.

Common  among  women  :POpponents  are  suppressed  due  to  insufficient  conversational  

be  more  important  than  opponent  taking  turn

Page 37: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Other  approaches

1.  Keep  or  release  turn  ApproachWorks  well  when  they  have  smooth  transition  of  release-­‐turnUsing  non-­‐syntactic  cues  such  as  gestures,  falling  pitch,  changing  gaze  directionUsually  some  system  uses  pause-­‐threshold  in  some  models.Some  system  attempt  to  estimate  probability  of  TRP  occurrance,  on  syntactic  and  prosodic  cues.Goes  awry  when  listener  attempts  to  take  tht run  without  turn-­‐yielding  cues.

Page 38: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

2.  Turn  ResolutionDetect  turn-­‐conflict.

Discrepancy  of  mutual  belief  between  conversantsEasy  when  previous  turn-­‐holder  immediately  releases  the  turn.But  cant  predict  when  both  compete  for  turn

Turn-­‐Conflict  resolutionIncreasing  strength  of  cues,relatively  increasing  the  volumeGives  more  turn-­‐taking  cues  than  turn-­‐releasing  cues

Page 39: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn-­‐Bidding  model

3.  Psycholinguistic  FrameworkModelled on

Individuals  interest  in  having  turnImportance  driven  approachTo  remedy  some  understanding  discrepancy.Pauses   first  one  to  speak  at  the  onset  time  of  utterance.

Page 40: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Computational  frameworkImplementing  the  psycholinguistic  concept  aboveBids  of  various  levels  are  used

Shorter,  short,  mid,  long,  longer(strongest  to  weakest)If  the  bid  is  tied,  winner  is  decided  randomly

Reinforcement  learning  is  usedDevelop  dialogue  system   to  assign  importance  

than  on  some  possibly  arbitrary  designer  decision.

Page 41: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn  bidding  approachIs  efficientIs  crucial  for  conversational  successCouple  high-­‐valued  utterances  with  short  onsets,  uses  Reinforcement  learning  to  global  dialogue  importance.Can  model  agents  with  more  intricate  patterns  of  interactionsEntire  cost  of  dialogue  is  optimized,  not  just  factors  for  turn  transition.Improvements  can  be  made  in  multi-­‐agent  environ

Page 42: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turn-­‐taking  failures

Usually  system  have  fixed,  turn-­‐taking  strategy  Depends  on  length  of  pause  after  utterance.Others  have,  turn  based  on  content  and  prosody  of  user  utterance.

speak  immediately?

Page 43: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Usually  humans  stop  utterances  in  certain  situation   this  helps.Errors  in  speech  interval  detectionMisrecognition  of  users  intention  to  release  a  turn.Overlap  user  and  system  utterancesDiscontinuations   hard  to  detect,  as  it  has  only  word  fragments,  and  not  grammatical.

Page 44: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Remedial  measures

Discontinuation  detectionWord  error  rates  are  very  highOut  of  grammar  utterances.Prosody  recognition

Overlapping  information  used  for  predicting  speech  recognition  performance  degradation

Page 45: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Turns  in  Multiparty  conversationChallenges

To  reason  about  source  and  target  of  utterancesState  and  dynamics  of  the  floorParticipant  may  speak  to  another  participant,  or  system,  or  contribute  in  discussion,  and  wait  for  others  to  pitch  in.System  cannot  take  floor-­‐ though  it  can  predict  end-­‐of-­‐turnInferences  to  be  made  about  

multiparty  conversationReasoning  under  uncertainty  about  possible  outcomesTradeoff  between  acting  and  waiting  for  additional  infoTake  into  account  its  own  delay  in  perception  and  rendering  pipelines  of  system.

Page 46: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Any  remedies

Decision  theoretic  approachHighlight  opportunities  and  directions  Moving  from  heuristics  to  principles  decision  policiesInferences  about  conversation  dynamics  and  system  processing  delays.Explore  behaviors  of  expected-­‐utility  policies  and  compare  with  heuristic  procedures.

Page 47: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Various  two-­‐parties  approach

Turn  constructional  units  in  basic  model,  separated  by  transition  relevance  places.Gaze,  gesture  and  non-­‐verbal  communication  channels  for  turn  talking  regulationUsually  have  dyadic  settings,  usually  with  2  pplMachine  learning  employed,  plus  prosodic,  syntactic  and  semantic  features  to  predict  turns.Bidding  approach  (as  seen  earlier)Finite  state  machine  and  decision  theoretic  approach  for  grabbing  floor  (as  seen  earlier)

Page 48: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Reference  model   Turn-­‐taking  modelComponents

Conversational  floor-­‐ participant  ratified  to  speak  owns  it.4  Floor  management  actions

Hold  action   performed  by  one  who  currently  has  floor.Release   releasing  the  floor  to  someone  elseTake  Null  -­‐ participants  indifference,  for  no  claim

Sensing  component-­‐ estimate  the  current  speakerDecision  component   select  floor  managemnt actions  Dialog  management  layer   generates  systems  semantic  contributionsTurn-­‐taking  behavioral  control  component   renders  actions  into  synchronized  gaze,  gesture  and  speech.

Page 49: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Heuristic  approach  (on  turn  taking  model)Based  on  heuristics  for  sensing,  decicion making  and  behavioral  control.Current  speaker  identified  via  handcrafted  models,  using  sound  source  localization  information  from  microphone  array  and  from  visual  scene  analysis.Additional  rules   non  understanding,  and  utterances.Also  has  non-­‐verbal  gestures,  

avatars  face  turning  towards  other  participantLifting  eyebrows

Drew  incorrect  inferences   floor  releasedLed  to  turn  taking  problems,  as  floor  transition  battles  and  turn-­‐initial  overlaps

Page 50: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Decision  -­‐Theoretic    approach

Making  turn-­‐taking  decisionsContinuously  deliberate  about  key  uncertainties  and  resolve  tradeoffs  bw waiting  and  taking  floorTo  reduce  floor  battles  and  minimize  gaps  in  convRestrict  subspace  of  turn-­‐taking  policies

System  takes  floor  only  if  no  one  else  speaksAlways  takes  floor  once  a  silence  longer  than  a  specified  duration  is  observed.

Stochastic  input  processing  delay(ID)    also  calculated  by  system

Page 51: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Decision  -­‐

Takes  into  accountSystem  processing  delaysCost  functionProbabilistic  model  for  tracking  uncertainities

Potential  outcomesFloorTransitionToSystemFloorTransitionToOtherFloorTransitionBattle

Cost  ModellingTwo  case  :  one  where  floor  was  released  to  ssytem

Uses  a  learnt  model  for  better  floor  release  actions  [See  note]

Page 52: TurnTaking - University of Southern Californiaprojects.ict.usc.edu/nld/cs599s13/LectureNotes/cs599s13dialogue3... · Spoken%languageunderstanding Topic%detection Informationretrieval

Thank  you!!!!