32
œæƒføß '~ f October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

基于深度学习的对话系统

冯岩松

北京大学

October 19, 2016

Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

Page 2: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

对话系统

任务

使用自然语言进行多轮次的交互

Feng et al. (PKU) Question Answering October 19, 2016 2 / 14

Page 3: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

对话系统

任务

使用自然语言进行多轮次的交互

输输输入入入:::自然语言句子

资资资源源源:::领域知识、背景知识等显式或隐式资源

输输输出出出:::符合当前上下文语境或满足任务需求的自然语言句子

Feng et al. (PKU) Question Answering October 19, 2016 2 / 14

Page 4: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

对话系统

任务

使用自然语言进行多轮次的交互

输输输入入入:::自然语言句子资资资源源源:::领域知识、背景知识等显式或隐式资源输输输出出出:::符合当前上下文语境或满足任务需求的自然语言句子

[Wen, 2016]

Feng et al. (PKU) Question Answering October 19, 2016 2 / 14

Page 5: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

任务框架

语言理解

负责将输入句子解析为形式化表示

对话管理

对话状态跟踪(Dialogue StateTracker,或 BeliefTracking)对话内容选择(ResponseSelection)

语言生成

输出满足要求的自然语言句子

外部资源

领域知识、世界知识

Feng et al. (PKU) Question Answering October 19, 2016 3 / 14

Page 6: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

任务框架

语言理解

负责将输入句子解析为形式化表示

对话管理

对话状态跟踪(Dialogue StateTracker,或 BeliefTracking)对话内容选择(ResponseSelection)

语言生成

输出满足要求的自然语言句子

外部资源

领域知识、世界知识

Feng et al. (PKU) Question Answering October 19, 2016 3 / 14

Page 7: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

任务框架

语言理解

负责将输入句子解析为形式化表示

对话管理

对话状态跟踪(Dialogue StateTracker,或 BeliefTracking)对话内容选择(ResponseSelection)

语言生成

输出满足要求的自然语言句子

外部资源

领域知识、世界知识

Feng et al. (PKU) Question Answering October 19, 2016 3 / 14

Page 8: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

任务框架

语言理解

负责将输入句子解析为形式化表示

对话管理

对话状态跟踪(Dialogue StateTracker,或 BeliefTracking)对话内容选择(ResponseSelection)

语言生成

输出满足要求的自然语言句子

外部资源

领域知识、世界知识

Feng et al. (PKU) Question Answering October 19, 2016 3 / 14

Page 9: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

类型

任任任务务务导导导向向向 : 完成某项任务特定商业任务:订旅馆、订机票、售后服务……评价方式:任务的完成程度

开开开放放放域域域:没有特别限定某个任务

聊天机器人,小冰,小X……评价方式:多为主观评价,以及对话长度、是否重复等

传统对话系统

大量手工设计的特征、规则或模板

对上下文语境、用户的建模能力有限

训练数据有限,领域相关性强,迁移到其他领域的代价很高

几乎无法提供聊天功能

Feng et al. (PKU) Question Answering October 19, 2016 4 / 14

Page 10: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

类型

任任任务务务导导导向向向 : 完成某项任务特定商业任务:订旅馆、订机票、售后服务……评价方式:任务的完成程度

开开开放放放域域域:没有特别限定某个任务

聊天机器人,小冰,小X……评价方式:多为主观评价,以及对话长度、是否重复等

综综综合合合类类类

个人助理类,siri,Cortana,度秘……评价方式:任务完成情况,主观感受

传统对话系统

大量手工设计的特征、规则或模板

对上下文语境、用户的建模能力有限

训练数据有限,领域相关性强,迁移到其他领域的代价很高

几乎无法提供聊天功能

Feng et al. (PKU) Question Answering October 19, 2016 4 / 14

Page 11: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

当神经网络来了……

语言理解

对话管理

对话状态跟踪对话内容选择

语言生成

外部资源

领域知识、世界知识

[Wen, 2016]

Feng et al. (PKU) Question Answering October 19, 2016 5 / 14

Page 12: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

当神经网络来了……

语言理解 =⇒神经网络模型对话管理 =⇒神经网络模型

对话状态跟踪对话内容选择

语言生成 =⇒神经网络模型外部资源 =⇒神经网络模型

领域知识、世界知识

[Wen, 2016]

Feng et al. (PKU) Question Answering October 19, 2016 5 / 14

Page 13: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

方法:Encoding-Decoding

[Shang et al., 2015]

Feng et al. (PKU) Question Answering October 19, 2016 6 / 14

Page 14: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

方法:Encoding-Decoding

[Shang et al., 2015]

Feng et al. (PKU) Question Answering October 19, 2016 6 / 14

Page 15: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

方法:Seq2Seq

[Vinyals and Le, 2015]Feng et al. (PKU) Question Answering October 19, 2016 7 / 14

Page 16: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

感觉聊天还不错!

语言理解

RNN捕捉输入序列对话管理

由隐层来负责

语言生成

RNN负责输出序列外部资源

拼个向量?

然而,这是对话场景!可能会有多个轮次的

Feng et al. (PKU) Question Answering October 19, 2016 8 / 14

Page 17: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

感觉聊天还不错!

语言理解

RNN捕捉输入序列对话管理

由隐层来负责

语言生成

RNN负责输出序列外部资源

拼个向量?

然而,这是对话场景!可能会有多个轮次的

[Li et al., 2016]

Feng et al. (PKU) Question Answering October 19, 2016 8 / 14

Page 18: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

感觉聊天还不错!

语言理解

RNN捕捉输入序列对话管理

由隐层来负责

语言生成

RNN负责输出序列外部资源

拼个向量?

然而,这是对话场景!可能会有多个轮次的

[Li et al., 2016]

Feng et al. (PKU) Question Answering October 19, 2016 8 / 14

Page 19: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

感觉聊天还不错!

语言理解

RNN捕捉输入序列对话管理

由隐层来负责

语言生成

RNN负责输出序列外部资源

拼个向量?

然而,这是对话场景!可能会有多个轮次的

[Li et al., 2016]

Feng et al. (PKU) Question Answering October 19, 2016 8 / 14

Page 20: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

感觉聊天还不错!

语言理解

RNN捕捉输入序列对话管理

由隐层来负责

语言生成

RNN负责输出序列外部资源

拼个向量?

然而,这是对话场景!可能会有多个轮次的

[Li et al., 2016]

Feng et al. (PKU) Question Answering October 19, 2016 8 / 14

Page 21: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

感觉聊天还不错!

语言理解

RNN捕捉输入序列对话管理

由隐层来负责

语言生成

RNN负责输出序列外部资源

拼个向量?

然而,这是对话场景!可能会有多个轮次的

这个机器人好神奇……

回复质量好像不够稳定

用户建模缺失

能完成订票任务吗?这End2End会不会把我的票订错?

Feng et al. (PKU) Question Answering October 19, 2016 8 / 14

Page 22: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

挑战

1 如何对当前的场景 (Context)建模

已发生的对话是否有任务场景

2 如何对用户建模

人格一致:性别、年龄、背景……

3 如何有好的对话回合

有信息量、不重复、回合多

4 如何把任务目标融入进来

发挥神经网络特长解决个别模块,如语言生成

Feng et al. (PKU) Question Answering October 19, 2016 9 / 14

Page 23: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

加入用户建模

还是Seq2Seq,但强调人物建模

[Li

et al., 2016]

Feng et al. (PKU) Question Answering October 19, 2016 10 / 14

Page 24: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

加入用户建模

还是Seq2Seq,但强调人物建模

每人一个模型……

Feng et al. (PKU) Question Answering October 19, 2016 10 / 14

Page 25: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

任务完成类

多数系统不会完全依赖 Seq2Seq这种端对端系统模式单独模块应用神经网络模型,如语言生成、决策判断等

来个端对端任务类系统?

[Williams and Zweig, 2016]

Feng et al. (PKU) Question Answering October 19, 2016 11 / 14

Page 26: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

任务完成类

多数系统不会完全依赖 Seq2Seq这种端对端系统模式单独模块应用神经网络模型,如语言生成、决策判断等

来个端对端任务类系统?你可以的!

[Wen et al., 2016]

Feng et al. (PKU) Question Answering October 19, 2016 11 / 14

Page 27: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

数据集

聊天、微博、论坛、电影……

[Serban et al., 2015]Feng et al. (PKU) Question Answering October 19, 2016 12 / 14

Page 28: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

数据集

聊天、微博、论坛、电影……

[Serban et al., 2015]

Feng et al. (PKU) Question Answering October 19, 2016 12 / 14

Page 29: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

数据集

聊天、微博、论坛、电影……

[Serban et al., 2015]Feng et al. (PKU) Question Answering October 19, 2016 12 / 14

Page 30: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

数据集

任务类:旅游、参观、家居……

[Serban et al., 2015]

Feng et al. (PKU) Question Answering October 19, 2016 12 / 14

Page 31: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

参考文献-4

Oriol Vinyals, Quoc V. Le, A Neural Conversational Model, In ICML DeepLearning Workshop 2015

Lifeng Shang, Zhengdong Lu, Hang Li, Neural Responding Machine forShort-Text Conversation, ACL 2015.

Hierarchical Neural Network Generative Models for Movie Dialogues”,Iulian Vlad Serban, Alessandro Sordoni, Yoshua Bengio, Aaron C.Courville, Joelle Pineau, In AAAI 2015.

A Diversity-Promoting Objective Function for Neural ConversationModels”, Jiwei Li, Michel Galley, Chris Brockew, Jianfeng Gao, BillDolan, NAACL 2016.

A Persona-Based Neural Conversation Model”, Jiwei Li, Michel Galley,Chris Brockew, Jianfeng Gao, Bill Dolan, In ACL 2016

Jason D. Williams and Geoffrey Zweig, End-to-end LSTM-based dialogcontrol optimized with supervised and reinforcement learning, In arXiv2016

Tsung-Hsien Wen, Deep Learning for NLP, Tutorial, INLG 2016

Feng et al. (PKU) Question Answering October 19, 2016 13 / 14

Page 32: October 19, 2016 · 2016-10-19 · October 19, 2016 Feng et al. (PKU) Question Answering October 19, 2016 1 / 14

参考文献-4

Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li,Jianshe Zhou, DocChat: An Information Retrieval Approach for ChatbotEngines Using Unstructured Documents, In ACL 2016

Iulian V. Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courvilleand Joelle Pineau, Building End-To-End Dialogue Systems UsingGenerative Hierarchical Neural Network Models, In AAAI 2016

Tsung-Hsien Wen, David Vandyke, Nikola Mrksic, Milica Gasic, LinaRojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young,“ANetwork-based End-to-End Trainable Task-Oriented Dialogue System”,In arXiv 2016.

Tsung- Hsien Wen, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona,Pei-Hao Su, Stefan Ultes, David Vandyke, and Steve Young, ConditionalGeneration and Snapshot Learning in Neural Dialogue Systems, InEMNLP 2016.

Iulian Vlad Serban, Ryan Lowe, Laurent Charlin, and Joelle Pineau, ASurvey of Available Corpora For Building Data-Driven DialogueSystems, In arXiv 2016

Feng et al. (PKU) Question Answering October 19, 2016 14 / 14