32
1 Introduction to Introduction to Dialogue Systems Dialogue Systems Personal Assistants are Personal Assistants are becoming a reality becoming a reality Dr Natalia Konstantinova University of Wolverhampton 11 April 2014

Dialogue systems and personal assistants

Embed Size (px)

DESCRIPTION

This presentation covers dialogue systems: their definition, basic structure (covering all modules: natural language understanding, dialogue manager, natural language generation), evaluation and the way they can be used. We also provide details about future directions and discusses current personal assistants: SIRI, S-Voice, Cortana, Maluuba etc.

Citation preview

Page 1: Dialogue systems and personal assistants

1

Introduction to Dialogue Systems Introduction to Dialogue Systems

Personal Assistants are becoming a realityPersonal Assistants are becoming a reality

Dr Natalia KonstantinovaUniversity of Wolverhampton

11 April 2014

Page 2: Dialogue systems and personal assistants

2

OutlineOutline

• What is a dialogue system?• System structure and classification;• Evaluation;• Examples of existing systems;• Future directions;• IQA;

Page 3: Dialogue systems and personal assistants

3

DefinitionDefinition

• Artificial intelligence – idea to teach machines to think and act as humans.

• NLP – give machines the ability to read, understand and use natural language.

• Dialogue systems – part of artificial intelligence challenge.

Page 4: Dialogue systems and personal assistants

4

Optimistic viewOptimistic view

• Hollywood and Artificial Intelligence (robots that can think and act like humans)

• Video

Page 5: Dialogue systems and personal assistants

5

Realistic viewRealistic view

• Talk to Alan (or to some other HAL personalities)

• Chat with ALICE • Three bots talking

Page 6: Dialogue systems and personal assistants

6

What is a dialogue system?What is a dialogue system?

Ideas?• • • •

Page 7: Dialogue systems and personal assistants

7

DefinitionDefinition

• Editors of the Journal of Dialogue Systems :“A dialogue system is a computational device or

agent that • (a) engages in interaction with other human

and/or computer participant(s); • (b) uses human language in some form such as

speech, text, or sign; • and (c) typically engages in such interaction

across multiple turns or sentences.”

Page 8: Dialogue systems and personal assistants

8

Other termsOther terms

• Conversational agents (Jurafsky and H.Martin, 2006), (Lester, Branting, and Mott, 2004)

• “Chatterbot” or “chatbot”, first coined by Mauldin (1994):• simple dialogue systems, primarily based on

simple analysis of keywords in the input and usage of different templates

Page 9: Dialogue systems and personal assistants

9

Where are they used?Where are they used?

Usually embedded in such applications as:• customer service,• help desks,• website navigation,• guided selling,• technical support(Lester, Branting, and Mott, 2004)

Page 10: Dialogue systems and personal assistants

10

““Body” for a dialogue systemBody” for a dialogue system

• Embodied conversational agents (Cassell et al., 2000):• has a “body”, where both verbal and

nonverbal devices advance and regulate the dialogue between the user and the computer.

• Financial advisers, sales agents at online shops

Page 11: Dialogue systems and personal assistants

11

Embodied conversational agentsEmbodied conversational agents

Page 12: Dialogue systems and personal assistants

12

System structureSystem structure

• They generally consist of 5 main components (Jurafsky and H.Martin, 2006):1. speech recognition;2. natural language understanding (NLU);3. dialogue management;4. natural language generation (NLG);5. speech synthesis.

Page 13: Dialogue systems and personal assistants

13

System structureSystem structure

• Some modules are optional:• e.g. speech recognition and speech synthesis

• Dialogue systems involving speech are more complicated:• need to deal with errors in speech recognition

• Speech recognition can be dialogue-state dependant

Page 14: Dialogue systems and personal assistants

14

NLUNLU

• Aim of NLU module:• produce a semantic representation

appropriate for a dialogue task.

Page 15: Dialogue systems and personal assistants

15

Dialogue managerDialogue manager

• One of the most important parts of DS(Dale, Moisi, and Somers, 2000):

• interpret the speech acts;• carry out problem-solving actions;• formulate response;• in general maintain the system's idea of the

state of the discourse (e.g. dialogue move tree)

Page 16: Dialogue systems and personal assistants

16

Dialogue managerDialogue manager

• Interlink of NLU and NLG

• Responsible for the content generation • (taking decisions about what to say and how)

Page 17: Dialogue systems and personal assistants

18

NLGNLG

• Chooses syntactic structures and words to express the intended meaning, which was formulated by a dialogue manager.

• How?:• Templates to generate “prompts” (generated

outputs)• Advanced natural language generators

Page 18: Dialogue systems and personal assistants

19

Speech synthesisSpeech synthesis

• Is optional• Uses output on NLG module to generate

natural speech

Page 19: Dialogue systems and personal assistants

20

System classificationSystem classification

• Jurafsky and H.Martin (2006):4 main types of dialogue management (DM)

architectures:1. finite-state DM;2. frame/form based DM;3. information-state DM;4. plan-based DM.

Page 20: Dialogue systems and personal assistants

21

Finite-state DMFinite-state DM• A set of states• System totally controls the conversation

with the user

Page 21: Dialogue systems and personal assistants

22

Frame/form based DMFrame/form based DM

• Simple and the most widely used• Asks questions to fill in the slots in the

frame• Perform a database query

• E.g. booking a holiday

Page 22: Dialogue systems and personal assistants

23

Information-state DMInformation-state DM

• More complicated• Incorporates several ways to achieve a result.• Components:

• the information state (the “discourse context” or “mental model”);

• dialogue act interpreter (or “interpretation engine”);• dialogue act generator (or “generation engine”);• set of update rules (to update information state);• control structure to select needed update rule.

Page 23: Dialogue systems and personal assistants

24

Plan-based DMPlan-based DM

• The most sophisticated one• It interprets conversation as creation of a

plan and then interprets a plan “in reverse”• Is often referred as BDI (belief, desire andintentions) model.

Page 24: Dialogue systems and personal assistants

25

Other classificationsOther classifications

• system-initiative (or single initiative systems) mixed initiative systems

• spoken dialogue systems text dialogue systems

• multi-modal dialogue systems unimodal dialogue systems

• domain restricted dialogue systems Open domain dialogue systems

Page 25: Dialogue systems and personal assistants

Example of architectureExample of architecture

26

Page 26: Dialogue systems and personal assistants

27

EvaluationEvaluation

• How to make an objective evaluation?• Task-based evaluation (Dale, Moisi, and

Somers, 2000):• task completion success;• efficiency cost;• quality costs.

Page 27: Dialogue systems and personal assistants

28

EvaluationEvaluation

• Asking people to complete a question list and rank the quality of the system giving grades:• E.g. evaluate naturalness

• Maybe not very objective

Page 28: Dialogue systems and personal assistants

29

DifficultiesDifficulties

• Necessity to collect training corpus:• Wizard-of-Oz experiments • Prompting experiments

• Error handling

Page 29: Dialogue systems and personal assistants

30

ChatbotsChatbots• http://www.chatbots.org/

Page 30: Dialogue systems and personal assistants

Competitors of SIRICompetitors of SIRI

• Cortana by Microsoft;• Voice Mate by LG;• S-Voice by Samsung;• Google Now;• E.g. Android versions: Maluuba; Robin; Iris;

Vlingo; Skyvi; • More similar apps;

31

Page 31: Dialogue systems and personal assistants

Further directionsFurther directions

• Currently DM in all commercial systems is rule- based;

• What can be used? • Reinforcement learning (hierarchical RL);• Online learning;• Dialogue manager based on partially observable

Markov decision process (POMDP);• Quality-adaptive DM;

32

Page 32: Dialogue systems and personal assistants

33

ReferencesReferences• Cassell, Justine, Joe Sullivan, Scott Prevost, and Elizabeth F. Churchill, editors. 2000. Embodied

Conversational Agents. Cambridge, MA: MIT Press. • Dale, Robert, Hermann Moisi, and Harold Somers, editors. 2000. Handbook of Natural Language

Processing. Marcel Dekker, Inc. • Jurafsky, Daniel and James H.Martin. 2006. Speech and language processing an introduction to

natural language processing, computational linguistics, and speech recognition. Prentice-Hall, Inc. • Lester, James, Karl Branting, and Bradford Mott. 2004. Conversational agents. In Munindar P

Singh, editor, The Practical Handbook of Internet Computing. Chapman & Hall. • Mauldin, Michael L. 1994. Chatterbots, Tinymuds, and the Turing test: Entering the Loebner prize

competition. In Proceedings of the Eleventh National Conference on Artificial Intelligence. AAAI Press.

• Mitkov, Ruslan, editor. 2003. Handbook of Computational linguistics. Oxford University Press, USA.

• Sacks, H., E. A. Schegloff, and G. Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language, 50(4):696-735.

• Varges, S., F. Weng, and H. Pon-Barry. 2007. Interactive question answering and constraint relaxation in spoken dialogue systems. Natural Language Engineering, 15(1):9-30.

• Webb, Nick and Bonnie Webber. 2009. Special issue on interactive question answering:

Introduction. Natural Language Engineering, 15(1):1-8, January.