System Architecture
(4) User Study
• Experts
• Novice Users
• Much less training needed: Minutes vs. Days
• Unanimously found the system useful and easy to use
“[Compared to what he has used before] This is 100 times better.” --- A subject
(2) UI Components
(3) VAQL to AQL
Yunyao Li*, Elmer Kim**, Marc A. Touchette, Ramiya Venkatachalam, Hao Wang
*IBM Research – Almaden, **Treasure Data, Inc., IBM Silicon Valley Lab
VINERy: A Visual IDE for Information Extraction
(1) VAQL (Visual Annotation Query Language)
• Extract Constructs
  • Atomic
    • Pre-built
    • Dictionary
    • Regular Expression
    • Literal
    • Proximity
  • Composite
    • Sequence Pattern
    • Union
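To make the atomic and composite extract constructs concrete, here is a minimal Python sketch. It is not VAQL or AQL, and it is not how SystemT implements them. The `Span` type, the function names, and the `max_gap` parameter are all hypothetical, chosen only to illustrate how dictionary and regular-expression matches can be combined through a sequence pattern and a union.

```python
import re
from typing import List, NamedTuple


class Span(NamedTuple):
    """A matched region of the document (hypothetical type for this sketch)."""
    begin: int
    end: int
    text: str


def extract_regex(text: str, pattern: str, flags: int = 0) -> List[Span]:
    """Atomic construct: one span per regular-expression match."""
    return [Span(m.start(), m.end(), m.group()) for m in re.finditer(pattern, text, flags)]


def extract_dictionary(text: str, entries: List[str]) -> List[Span]:
    """Atomic construct: match any dictionary entry as a whole word, ignoring case."""
    pattern = r"\b(?:" + "|".join(re.escape(e) for e in entries) + r")\b"
    return extract_regex(text, pattern, re.IGNORECASE)


def sequence(text: str, left: List[Span], right: List[Span], max_gap: int = 1) -> List[Span]:
    """Composite construct: a left span followed by a right span within max_gap characters."""
    out = []
    for l in left:
        for r in right:
            if 0 <= r.begin - l.end <= max_gap:
                out.append(Span(l.begin, r.end, text[l.begin:r.end]))
    return out


def union(*span_lists: List[Span]) -> List[Span]:
    """Composite construct: merge several result sets, deduplicated and sorted by position."""
    return sorted(set(s for spans in span_lists for s in spans))


text = "Contact John Smith or Mary Jones at 555-1234."
first_names = extract_dictionary(text, ["John", "Mary"])
capitalized = extract_regex(text, r"\b[A-Z][a-z]+\b")
persons = sequence(text, first_names, capitalized)
phones = extract_regex(text, r"\b\d{3}-\d{4}\b")
entities = union(persons, phones)

for span in entities:
    print(span)
```

In this sketch each construct takes span lists and returns a span list, so the output of one construct can be fed into another, much as boxes are wired together on a canvas.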
Information extraction is a critical building block for a wide range of emerging applications.
To satisfy the increasing text analytics demands of real-world applications, it is crucial to lower the barrier to entry and empower novices to develop high-quality IE extractors.
VINERy is SystemT’s latest effort towards this goal.
VINERy automatically generates performant and readable AQL programs for execution and, if needed, for further development in AQL.
VINERy has been available as part of IBM BigInsights since version 4.0.
• Refinement Constructs
  • Projection
  • Expression
  • Consolidation
  • Filter
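As a rough illustration of three of the refinement constructs (filter, consolidation, and projection), the Python sketch below post-processes a list of candidate spans. It is an assumption-laden analogy, not VAQL or AQL. The `Span` type and function names are hypothetical, and the consolidation policy shown (drop any span contained in another span) is only one possible choice.

```python
from typing import Callable, List, NamedTuple


class Span(NamedTuple):
    """A matched region of the document (hypothetical type for this sketch)."""
    begin: int
    end: int
    text: str


def filter_spans(spans: List[Span], keep: Callable[[Span], bool]) -> List[Span]:
    """Filter: keep only the spans that satisfy a predicate."""
    return [s for s in spans if keep(s)]


def consolidate(spans: List[Span]) -> List[Span]:
    """Consolidation (assumed policy): drop spans fully contained in another span."""
    unique = sorted(set(spans))
    return [
        s for s in unique
        if not any(o != s and o.begin <= s.begin and s.end <= o.end for o in unique)
    ]


def project(spans: List[Span]) -> List[str]:
    """Projection: keep only the field of interest, here the matched text."""
    return [s.text for s in spans]


candidates = [
    Span(0, 4, "John"),
    Span(0, 10, "John Smith"),
    Span(5, 10, "Smith"),
    Span(14, 16, "IE"),
]

long_enough = filter_spans(candidates, lambda s: len(s.text) > 2)
merged = consolidate(long_enough)
names = project(merged)
print(names)
```

The filter step removes the short "IE" match, and consolidation then removes the two partial name matches that overlap the full name.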
[Figure: UI components. Labels: Document Viewer, Canvas, Project Pane, Extractor Catalog, Result Grid, Property Pane]
How does VINERy compare to other existing ways of performing IE tasks?
[Chart: ratings on a scale from 1 (much worse) to 5 (much better) for Learning Curve, Time Required, Ease of Use, and Effort Required]