Upload
amice-mcbride
View
215
Download
0
Tags:
Embed Size (px)
Citation preview
AQUAINT Program: Overview
Dr. John Prange, Info-X R&D Thrust DirectorDr. Lynn Franklin, Dep Info-X R&D Thrust Director
[email protected]; [email protected] (Prange) / 443-479-6604 (Franklin)
301-688-7092 (ARDA Office)http://www.ic-arda.org
October 2004
2
Where is the Taj Mahal?
Let’s Start with a Simple, Factual, Question ---
Question ????
???
How Do We Find Information Today?
3
Traditional Information Retrieval (IR) Approach
Question ?
System SpecificQuery
e.g. Boolean Key WordEquation
DataSource
e.g Large Text
Archive
Traditional Information Retrieval
Ranked List of Hopefully “Relevant”
Documents. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . .. . . . . . . . . .. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
4
Where is the Taj Mahal?
Or Is It ???
Use Your Favorite Search Engine
It Depends !!!
Answer: Agra, India
8
Single, Factoid Question ?
Ranked List of Hopefully “Relevant”
Documents. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . .. . . . . . . . . .. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
System SpecificQuery; often Tailored
to Question TypeTraditional Information Retrieval
SingleData
Source
Move Closerto the Questione.g. QuestionClassification
QA
Next Generation Approaches:Question Answering (QA) Systems
“Answer”
Move Closerto the Answere.g. Passage
Retrieval
ShallowAnalysis
9
“Ask Jeeves” Approach
•Start with Your Question
• Identify Key Words & Classifies the Type of Question
• Respond with rephrased “Questions” for which “Ask Jeeves” knows the Answer
• Provide Additional Web Sites as a fall back position (a la --- a more traditional web search engine)
10
Direct KnowledgeEntry by Domain
Experts
ParallelDevelopment by
Distributed Teams
Rapid KnowledgeFormation
Comprehensive(Million-Axiom)
Knowledge BasesGene rate plau sib le
crisis scenarios
Uncover co nnectedactivities, thre ats
Reason a bout no velcrisis situations
Mon ito r and in terpre tmassive da ta steams
Gene rate po ssiblecour se s of actions
Perfor m vulnerab ilityana lyses
Reason a bout no velbatt le fie ld sit uations
Mon ito r and in terpre tchang in g battlefield
event s
Answe r cause & effe ctque st io ns about events
Answe r question s aboutfor ce capabilities
Retrieve f acts relevant toa crisis
CrisisUnderstanding
Answe r question s aboutter rain
Commander’sAssociate
10 K
100 K
1,000 K
Need to create newknowledge at a rate of 400
axioms per hour
(With HPKB technology, a 5-personteam can create knowledge at a
rate o f 40 axioms per hour)
Biological Weapons (BW)Knowledge
• Basic knowledge of space, time,causality, general physics
• Biology, & biologica l threats• BW R&D, produce, weaponize• Geo-po litical behavior & terrorism
Required
6 Months 12 Months
10 K
100 K
1,000 K
HPKB
Development Time
UpperOntology
Mid-LevelTheories
Domain-S pecificTheories
Rapid Knowledge Formation (RKF)
Structured Knowledge-Base Approach
Deepest QA but Limited to Given Subject Domain
•Create comprehensive Knowledge Base(s) or other Structured Data Base(s)
• At the 10K Axiom Level -- Capable of Answering factual questions within domain
• At the 100K Axiom Level -- Answer cause & effect/capability Questions
• At the 1000K Axiom Level -- Answer Novel Questions; ID alternatives
11
Overarching Context / Operational Requirement
Who is thisadvisor?
What do weknow about
him/her?
What are his/her views?
What influence does he/she have on FM?
And still more questions ???
In a foreign news broadcast a team of analysts observe a previously unknown individual conferring with the Foreign Minister. They suspect
that he/she is really a new senior advisor.
Does this signal that other
policy changes are coming?
Information Analysts
Advanced Question Answering
12
Overarching Context /Operational Requirement
AdvancedQA
Extract & AnalyzeResults
DeeperAutomated
Understanding
Answers
Interpret Results& Formulate the Answers
Provide Answers in a Form
Analysts Want
Ranked Lists of
“Relevant” Data Objects
System SpecificQueries; Fully Tailoredto Series of Questions
ExtendTraditional Information Retrieval
MultipleHeterogeneous
DataSources
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
Multi-Media Multi-Media Structured Structured
Other Other
Text Text Voice Voice
InterpretingComplex
QA Scenario within a
Larger ContextFactoid
Questions?
WhyQuestions
?
InterpretiveQuestions?
Judgement Questions?
OtherQuestions?
Information Analysts
Predictive Questions
?
Advanced Question Answering
13
1616QA Workshop - ACL 2001QA Workshop - ACL 2001
System SpecificQuery
e.g. Boolean Key WordEquation
System SpecificQuery
e.g. Boolean Key WordEquation
Ranked List of Hopeful ly “Relevant”
Documents
Ranked List of Hopeful ly “Relevant”
Documents
Traditional Information Retrieval
Traditional Information Retrieval
DataSource
e.g Large Text
Archive
DataSource
e.g Large Text
Archive
Traditional InformationRetrieval (IR) Approach
Question ?Question ?
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
2121QA Workshop - ACL 2001QA Workshop - ACL 2001
System SpecificQuery; often Ta ilored
to Question Type
System SpecificQuery; often Ta ilored
to Question Type
Ranked List of Hopeful ly “Relevant”
Documents
Ranked List of Hopeful ly “Relevant”
Documents. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . .. . . . . . . . . .. . . . . . . . . .. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
Traditional Information Retrieval
Traditional Information Retrieval
SingleData
Source
SingleData
Source
Single, FactoidQuestion ?
Single, FactoidQuestion ?Move Closer
to the Questione.g. QuestionClassi fication
Move Closerto the Questione.g. QuestionClassi fication
Q&AQ&A
Next Generation Approaches:Question & Answering (Q&A) Systems
“Answer” “Answer”
Move Closerto the Answere.g. P assage
Retrieval
Move Closerto the Answere.g. P assage
Retrieval
ShallowAnalysisShallowAnalysis
Commercial World & Current R&D EffortsAre Addressing the Next GenerationBut Only Selected Content Understanding Barriers Are Being Aggressively Attacked
26QA Workshop - ACL 2001
Overarching Context /Operational Requirement
AdvancedQA
Extract & AnalyzeResults
DeeperAutomated
Understanding
Answers
Interpret Results& Formulate the Answers
Provide Answers in a Form
Analysts Want
Ranked Lists of
“Re levant” Data Objects
System SpecificQueries; Fully Tailoredto S eries of Questions
ExtendTraditionalInformationRetrieval
MultipleHeterogeneous
DataSources
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
. . . . . . . . . .
. . . . . . . . . .. . . . . . . . . .
Mult i-Media Mult i-Media St ructured St ructured
Other Other
Text Text Voice Voice
Inte rpretingComplex
QA Scenario within a
Larger Context FactoidQuest ions?
WhyQuest ions?
InterpretiveQuest ions?
JudgementQuest ions?
OtherQuest ions?
InformationAnalysts
PredictiveQuest ions?
Advanced Question Answering
Multiple KeyBarriers toContentUnderstandingWill Be AggressivelyAttacked
Advanced Question Answering Is Skipping Ahead Two Generations
14
AQUAINTAdvanced QUestion & Answering for INTelligence
• What it is and What it is not . . .
– Question & Answering Aimed at the “Information Professional” --- Not just the Casual User
– Rich, Contextually-based Question Scenarios --- Full Range of Questions --- Not just Isolated, Factoid Questions
– Places much higher premium on knowledge and reasoning across very broad domains
– Open Domain, Multiple Media, Multiple Languages, Multiple Genre, Structured and Unstructured Data --- Not just a Focused Data Environment
15
Increasing Complexity Levels of Questions & Answers
Level 1”Simple
Factual QA’s"
Level 2"Template &
Multi-valued QA’s”
Level 3“Cross Media &
Cross Document QA’s"
Level 4”Context-Based
QA Scenarios”
FULL COMPLEXITY OF QUESTIONS & ANSWERS RANGES:
FROM: TO:
Questions: Simple Facts Questions: Complex; Uses Judgement TermsKnowledge of User Context Needed;Broad Scope
Answers: Simple Answers found in Answers: Search Multiple Sources (in multipleSingle Document Media/languages); Fusion of
information; Resolution of conflictingdata; Multiple Alternatives; AddingInterpretation; Drawing Conclusions
Current Near Term Mid Term Long Term
Advanced QA:Ramping up to the Full Complexity of Questions & Answers
16
Future
Fully Intersected;Automatically
Generated;Variable Structure/Format;
Full Context Responses
Full Context-Based
QuestionScenario
Level III
Full Context-Based
QuestionScenario
Fully Intersected;Automatically
Generated;Variable Structure/Format;
Full Context Responses
Level II
Variable NarrativeSummary;
Multi-Media Presentations;
Simple InterpretedResults
Cross MediaCross Document
Simple Judgement
Level I
Fixed Templatesor
Tabular Lists
Mulit-ValuedFactual QuestionsQuestions
Answers
Today
50/250 BytePassage from
Single TextDocument
SingleFactualIsolated
Questions
Data Chasm
Missing Data
MANY Heterogeneous Data Sources;
All Types, Sizes, Locations
IncreasingVolumes
(Petabyte & up)
Synthesis Across“Documents”/Media
ContradictoryDataReliability
of Data & SourceMultiple
Perspectives
Advanced QA:Attacking the Data Chasm
17
Structured / Semi-Structured
KB’s DB’s“Tagged Data”(e.g. Web Data)
Unstructured
HumanLanguage
Media
Sensor
Economic
Geospatial
VisualData
Video Still Images
Text
Documents
Speech
Multi-Media
Technical /Abstract
Other
Newswire / News Broadcast
Technical
Formal / Informal Communication
Other
Language
English
ForeignLanguage 1
ForeignLanguage 2
ForeignLanguage N
Genre
Advanced QA:Complex QA Across Data Types
18
Advanced QA:Much Deeper Understanding of Human Language is Required
• Some times SMALL differences can produce significantly different results/interpretations:
– Stop Words
• “Books {by; for; about} kids”
– Attachments
• “The man saw the woman in the park with the telescope.”
– Co-reference
• “John {persuaded; promised} Bill to go. He just left.”
• “Mary took the pill from the bottle. She swallowed it.”
• Other times BIG differences can produce the same/similar results:
– “Name the films in which Jude Law starred.”
– “Jude Law played a leading role in which movies?”
– “In what Hollywood productions did Jude Law receive top billing?”
19
Advanced QA:Is Time Our Achilles Heel?
• Real Difficulties Exist in:
– Extracting, correctly interpreting time references & then creating manageable timelines
– Estimating & updating changing reliability of information over time
– Processing information in time sequence e.g. Tracking the details of an evolving event over time -- A whole different set of problems
• And of course:
– We can’t forget all of the issues related to the timeliness of the system’s response to our question(s) -- we’ll need at least “near real time responses”
March April May June July August
20
• Different sources do not report simultaneously on an event.
• Data from different sources may be near real-time or take years to arrive.
• The hypothesis of today may be thrown out by new data arriving next week.
Event
H Hour
Event Planning Aftermath of the Event
H-n H-nH+1H-1
Collectors 1,2,3 observe event
Collector 1 reports
Collector 2 reports Collector
3 reports
Collector 4 observe event-related info &
reports
Collector 5 picks up historical
event planning material in a
raid
Collector 1 observes
event planning &
reports
• Analysis is dependent on a time continuum where data on a future event is found in the historical patterns established in event planning stages. As incoming data is evaluated against historical data, outcomes may change.
Advanced QA:The Challenge of Time in Analysis
21
Advanced QA:The Need for Ever Increasing Knowledge -- Of All Types
** Knowledge Requirement would be better represented with a whole “quiver of arrows” of different sizes, lengths and types
DIMENSIONS OF THE QUESTIONPART OF THE QA PROBLEM
DIMENSIONS OF THE ANSWERPART OF THE QA PROBLEM
Context
Judgement
Scope
Fusion
Interpretation
MultipleSources
QA R&D Program
QA R&D Program
Advanced AdvancedSimpleFactual
Question
SimpleAnswer,SingleSource
Increasing
Knowledge Requirements **
IncreasingKnowledgeRequirements **
22
• A Different Paradigm may be useful when handling QA Scenarios:
• Current Analytic Paradigm:
– Sequentially “Filter Down” to the
final result
Processing & Analysis
Data
Results
– Works when QA’s are independent, isolated activities
– Cast a “wider net” while searching
for “golden nuggets” (Answers)
AnswersSpace of Data Objects and Sources
How Wide to Cast the “Net”?
What Info to Retain? In what form?
For how long?
– Automatically Extract, Represent,
and Preserve “closely related”
background information within
context of the QA Scenario
Background
Discarded
Advanced QA:The Need for a Different Paradigm
23
Overarching Context / Operational Requirement
Who is thisadvisor?
What do weknow about
him/her?
What are his/her views?
What influence does he/she have on FM?
And still more questions ???
In a foreign news broadcast a team of analysts observe a previously unknown individual conferring with the Foreign Minister. They suspect
that he/she is really a new senior advisor.
Does this signal that other
policy changes are coming?
Information Analysts
Advanced QA:Need for Improved Reasoning & Learning
FOCUS
24
Advanced Reasoning:• Use Multi-level Plans• Create and evaluate chains of reasoning• Reason across hetero- geneous data sources• Infer answers from data extracted from multiple sources when the answer is not explicitly stated • Utilize Link Analysis & Evidence Discovery• Plus other strategies
New SeniorAdvisor
Associates Associates Follow-upLeads
Follow-upLeads
“Bio”………..….……..…….………..….……..…….………..….……..…….…………...
“Views: Past & Present” .….… ….…...……. ….…...……. ….…...……. ….…...……. ….…..
Summarized Results
Collected Views
TV & RadioBroadcasts,Newspapers
& OtherArchives
Raw “Bio”Information
Education
Past Positions
Family
Travels
Other Activities
Summarized Results
Cross Fertilization
Advanced Learning:• Automatically learn new or modify existing reasoning strategies
Advanced QA:Need for Improved Reasoning & Learning
25
Interested External
Stakeholders
ARDA’s Info-X Program Partners
Active IC /Government
Partners
RecentAdditions
• NGIC• DHS
26
QUESTION????
Clarification
Other Analysts
Question & RequirementContext; Analyst Background
Knowledge
Multimedia Examples
Natural Statement ofQuestion;
Use of
QueryAssessment,
Advisor,Collaboration
Question Under- standing andInterpretation
Knowledge Bases;Technical Databases
AQUAINT:R&D Focused on Three Functional Components
Question & Answer Context
•Relevant information extracted and combined where possible;•Accumulation of Knowledge across “Documents”•Cross “Document” Summaries created;•Language/Media Independent Concept Representation•Inconsistencies noted;•Proposed Conclusions and Inferences Generated
Determinethe
Answer
Relevant “Documents”
MultipleRanked
Lists
Single, Merged
Ranked List ofRelevant “Documents”
Queries
Relevant“Knowledge”
KBQueries
Multiple Sources;Multiple Media;Multi-Lingual;Multiple Agencies
MultipleSource
SpecificQueries
Translate Queriesinto Source Specific Retrieval Languages
Partially Annotated & Structured Data
Automatic Metadata Creation
SupplementalUse
Supple- mentalUse
Query Refinement based on Analyst
Feedback
Iterative Refinementof Results based
on Analyst Feedback
AnalystFeed-back
FINAL ANSWER
Results of Analysis• Formulate Answer for Analyst in form they want
• Multimedia Navigation Tools for Analyst Review
AnswerFormulation
ProposedAnswer
AnswerContext
Operational Requirement / Cognitive Environment
27
Cross Cutting/Enabling Technologies Research Issues
QUESTION????
FINAL ANSWER
AnswerFormulation
Question Under-
standing and Inter-pretation
InformationRetrievalProcess
Analysis &SynthesisProcess
Determinethe Answer
AQUAINTPhase I
Solicitation
Annotated and ‘Ground Truthed’ Data
Component Level / End-to-End Testing & Evaluation
Component Integration and System Architecture Issues
SeparateCoordinated
Activities
AQUAINT:Separate, Coordinated Activities
28
AQUAINT Program Contractors
CarnegieMellonUniv. Univ. of
Albany
Univ. ofMassachusetts
BBN (2)
IBM
Columbia Univ.
Rutgers Univ.
Princeton Univ.
Univ. of Texas-Dallas
Language Computer Corp. (2)
CycorpSAIC
Univ. of SouthernCalifornia
/ Info ScienceInstitute
SRI
Stanford Univ.
Univ. of California-Berkeley
Univ. of Colorado-Boulder
OriginalHNC Software New Mexico
State University (2)
Univ. of Maryland –Baltimore County (UMBC)
CoGen Tex
Language Computer Corp.
Univ. of SouthernCalifornia
/ Info ScienceInstitute
CarnegieMellon
Univ. (2)
+ New
33
AQUAINT Program Phase 2 Contractors
Prime Contractors (18)
ColumbiaUniv.
Palo AltoResearch
Center
Arizona State Univ.
IBM T. J.Watson Center
Univ. of Illinois-Urbana-Champaign
Language ComputerCorporation (2)
Univ. of TexasAt Dallas
Cycorp
CarnegieMellon
Univ. (2)
Univ. ofPittsburgh
Sub Contractors (16)
Rutgers Univ.
USC
USC / ISI (2)
TexasTech
MonmouthUniv.
UC-Berkeley(ICSI)
StanfordUniv. (2)
Univ. of Colorado
Univ. ofPennsylvania
Lehman College
Univ. of Utah
Cornell Univ.
GeorgetownUniv.
MITRE
BBN
BrandeisUniv.
Princeton Univ.
Univ. ofAlbany
SPAWAR
MIT
34
AQUAINT Phase 2 Projects (Spring 04 – Spring 06)
Total End-to-End Systems (10) (Systems 1-5)
Organization Title Investigator Topical Focus
Data Dimension ARDA Agent
University of Texas at Dallas // University of California-
Berkeley (International Computer Science Institute)
// Stanford University
AQUINAS: Answering QUestions using INference and Advanced Semantics
Sanda Harabagiu // Srini Narayanan // Chris
Manning
End-to-End
System
Focused Data Strategy
CIA
Cycorp GINKO, an End-to-end Intelligence QA System
Based on Contextualized Knowledge Dossiers
David Schneider &
Michael Whitbrock
End-to-End
System
Diverse Data Strategy
CIA
Language Computer Corporation
Advanced Techniques for Multimodal Question
Answering
Dan Moldovan End-to-End
System
Diverse Data Strategy
CIA
Language Computer Corporation
HIREQA-ML: High Precision Interactive Question
Answering Using Multiple Languages
Sanda Harabagiu End-to-End
System
Diverse Data Strategy
CIA
Massachusetts Institute of Technology
A Tripartite Question Answering Architecture for
Integrating Diverse Knowledge Resources
Boris Katz End-to-End
System
Diverse Data Strategy
CIA
35
Total End-to-End Systems (10) (Systems 6-10)
Organization Title Investigator Topical Focus
Data Dimension ARDA Agent
Columbia University // University of Colorado //
Stanford Univ. // Univ. of Texas-
Dallas
Fusing Rich Information Extracted from Multiple Media and Languages to
Generate Contextualized Complex Answers
Vasileios Hatzivassiloglou & Kathleen McKeown // Dan Jurafsky & James Martin & Wayne Ward
End-to-End
System
Diverse Data Strategy
DIA
University at Albany,(SUNY) /
Rutgers University / Lehman College
(CUNY)
HITIQA-2: the Intelligence Analyst’s Assistant in High Quality Interactive
Question Answering
Tomek Strzalkowski // Paul Kantor // Boris
Yamrom
End-to-End
System
Diverse Data Strategy
DIA
BBN Technologies Breaking the Cross-lingual Barrier to Question Answering
Ralph Weischedel End-to-End
System
Diverse Data Strategy
NSA
IBM T. J. Watson Research Center
A Question-Answering (QA)-Based Information Gathering Environment
David Ferrucci & John Prager
End-to-End
System
Diverse Data Strategy
NSA
Carnegie Mellon University
JAVELIN II: Scenarios and Variable-Precision Reasoning for Advanced
Question Answering from Multilingual, Distributed Sources
Eric Nyberg & Teruko Mitamura
End-to-End
System
Diverse Data Strategy
NSA
AQUAINT Phase 2 Projects (Spring 04 – Spring 06)
36
Organization Title Investigator Topical Focus Data Dimension ARDA Agent
Arizona State University // Texas Tech University //
Monmouth University
Answering complex questions and performing deep reasoning in advanced question answering
systems
Chitta Baral / Richard Scherl // Michael Gelfond
Component Elements
Diverse Data Strategy
CIA
Carnegie Mellon University // University of
Southern California
Informedia Contexture: Analyzing and Synthesizing Video and Verbal
Context for Intelligence Analysis Dialogues
Howard D. Wactlar // Ram Nevatia
Component Elements
Diverse Data Strategy
NSA
Emphasis on One or more Advanced QA System Components (2)
AQUAINT Phase 2 Projects (Spring 04 – Spring 06)
37
Organization Title Investigator Topical Focus Data Dimension ARDA Agent
Palo Alto Research Center, Inc.
Two-way Bridge between Language and Logic
Daniel G. Bobrow & Ron Kaplan
Cross Cutting / Enabling
Technologies
Focused Data Strategy
CIA
Princeton University // University of Southern California (Information
Sciences Institute)
WordNet for Question Answering
Christiane Fellbaum / George Miller // Jerry
Hobbs
Cross Cutting / Enabling
Technologies
Focus Data Strategy
NSA
Brandeis University // Georgetown University // University of Southern California (Information
Sciences Institute)
Temporal Awareness Algorithms for Natural
Language Texts
James Pustejovsky // Inderjeet Mani // Jerry
Hobbs
Cross Cutting / Enabling
Technologies
Focus Data Strategy
NSA
University of Pittsburgh // Cornell University //
University of Utah
Opinions in Question Answering
Janyce Wiebe // Claire Cardie // Ellen Riloff
Cross Cutting / Enabling
Technologies
Diverse Data Strategy
DIA
University of Illinois at Urbana-Champaign //
University of Pennsylvania
Kindle: Knowledge and Inference via Description
Logics for Natural Language
Dan Roth // Martha Palmer
Cross Cutting / Enabling
Technologies
Diverse Data Strategy
NSA
SPAWAR // MITRE Gazatteer Exploitation for Question Answering
Beth Sundheim // Scott Mardis
Cross Cutting / Enabling
Technologies
Focused Data Strategy
ARDA
Focused Effort -- Cross Cutting / Enabling Technologies (6)
AQUAINT Phase 2 Projects (Spring 04 – Spring 06)
38
HIGHLIGHTS• Dramatic progress on linguistic approach that converts question and
relevant passages into logical forms and then arrives at answer through a powerful combination of an extended “WordNet” and a logic prover
AQUAINTAdvanced QUestion & Answering for INTelligence
39
HIGHLIGHTS• Dramatic progress on linguistic approach that converts question and
relevant passages into logical forms and then arrives at answer through a powerful combination of an extended “WordNet” and a logic prover
• Made significant strides in extending QA from isolated, factoid questions to far more complex “Who is / What is” questions that require combining information from multiple, potentially duplicative or contradictory document sources
AQUAINTAdvanced QUestion & Answering for INTelligence
40
More Complex Question Types
• Definitions– What is Tikrit?
• Biographies – Who is Mahmoud Abbas?
• Events– What happened in Baghdad on Thanksgiving?
• Different Perspectives / Opinions– What people think of Mahmoud Abbas’ resignation?
• Lists– What names of chewing gums are found in the AQUAINT corpus?
• Relationships– The analyst is interested in the line of succession of the Saudi
government, and the relationship between the individuals in their royal family. King Fahd is the current ruler, but is in poor health. Who is next in line, and what is his relationship to King Fahd? Who, if anyone, has been designated as second in line?
41
Example Definition *
What is Tikrit?
Tikrit is a power center for Sunni Arab tribes that may hold out for as long as possible out of fear of losing power to the nation’s Shiite majority (12). Baghdad may be the capital of Iraq, but Tikrit is Saddam country (15). Other experts caution that the years of preferential treatment towards the residents of Tikrit may cause them to stand by Saddam Hussein to the end (4). …
* Reference: Columbia Univ. / Univ. of Colorado AQUAINT Briefing
42
HIGHLIGHTS• Dramatic progress on linguistic approach that converts question and
relevant passages into logical forms and then arrives at answer through a powerful combination of an extended “WordNet” and a logic prover
• Made significant strides in extending QA from isolated, factoid questions to far more complex “Who is / What is” questions that require combining information from multiple, potentially duplicative or contradictory document sources
• Progress made on developing multi-engine QA system that combines linguistic, statistical & KB approaches
AQUAINTAdvanced QUestion & Answering for INTelligence
43
Available Answering Agents
• Predictive Annotation Agent– General-purpose agent, used in almost all cases.
• Statistical Query Agent– Also general-purpose. Courtesy Roukos/Ittycheriah
• Description Agent– Generic descriptions (appositions, parentheticals etc.)
• Structured Knowledge Agent– Answers from WordNet/KSP/Cyc
• Pattern-Based Agent– Looks for specific syntactic patterns based on semantic form
• Dossier Agent– Calls PIQUANT recursively with multiple factoid questions
• Profile Agent– Currently standalone – used for Relationship Pilot
* Reference: IBM AQUAINT Briefing
44
PIQUANT Architecture *
KSP-BasedAnswering Agent
KSP-BasedAnswering Agent
Predictive Annot.Answering AgentPredictive Annot.Answering Agent
Answering Agents
StatisticalAnswering Agent
StatisticalAnswering Agent
Definitional QAnswering Agent
Definitional QAnswering Agent
Question
AnswerResolution
Answer
QGoals
Answers
QFrameAnswerClassification
QuestionAnalysis
Knowledge Source Portal
Semantic Search
WordNet
Cyc
KeywordSearch
Pattern-BasedAnswering Agent
Pattern-BasedAnswering Agent
QPlanGenerator
QPlanExecutor
AQUAINT
TREC
EB
Web
CNS
* Reference: IBM AQUAINT Briefing
45* Reference: IBM AQUAINT Briefing
Multiple QA Agents Approach *What is the largest city in England?
• Text Match– Find text that says “London is the largest city in England” (or
paraphrase). Confidence is confidence of NL parser * confidence of source.
• “Superlative” Search– Find a table of English cities and their populations, and sort.– Find a list of the 10 largest cities in the world, and see which are in
England. • Uses logic: if L > all objects in set R then L > all objects in set E R.
– Find the population of as many individual English cities as possible, and choose the largest.
• Heuristics– London is the capital of England. (Not guaranteed to imply it is the
largest city, but this is very frequently the case.)
• Complex Inference – E.g. “Birmingham is England’s second-largest city”; “Paris is larger
than Birmingham”; “London is larger than Paris”; “London is in England”.
46
HIGHLIGHTS• Dramatic progress on linguistic approach that converts question and
relevant passages into logical forms and then arrives at answer through a powerful combination of an extended “WordNet” and a logic prover
• Made significant strides in extending QA from isolated, factoid questions to far more complex “Who is / What is” questions that require combining information from multiple, potentially duplicative or contradictory document sources
• Progress made on developing multi-engine QA system that combines linguistic, statistical & KB approaches
• Executed Pilot Evaluations for multiple complex QA Types; Developed Metrics for evaluating QA Systems at the Scenario Task Level; Full Evaluation of all End-to-End QA Systems late in Phase 2
AQUAINTAdvanced QUestion & Answering for INTelligence
47
June Sunrise over Kirkwall Bay in the Orkney Islands of Scotland
Your QuestionsYour Questions& Comments& Comments
48
Contact Information
Dr. John Prange, Info-X R&D Thrust Program DirectorDr. Lynn Franklin, Info-X R&D Thrust Program Dep Dir
• Web Pages: http://www.ic-arda.org (Internet)
• Phones: 443-479-8006 (Prange) 443-479-6604
(Franklin) 301-688-7092 (ARDA Office)
800-276-3747 (ARDA Office)
• FAX: 301-688-7401 (ARDA Office)
• E-Mail: [email protected] (Internet E-Mail)
[email protected] (Internet E-Mail) [email protected](Internet E-Mail)
• Location: Room 12A69 NBP #1Suite 6644
9800 Savage RoadFort Meade, MD 20755-6644