14
Data Clone Detection and Visualization in Spreadsheets icse 13 Felienne Hermans, Ben Sedee, Martin Pinzger and Arie van Deursen Delft University of Technology

Data Clone Detection and Visualization in Spreadsheets icse 13

  • Upload
    argyle

  • View
    28

  • Download
    0

Embed Size (px)

DESCRIPTION

Data Clone Detection and Visualization in Spreadsheets icse 13. Felienne Hermans , Ben Sedee , Martin Pinzger and Arie van Deursen Delft University of Technology. BACKGROUND. Spreadsheets are widely used Copy-paste actions are widely used - PowerPoint PPT Presentation

Citation preview

Page 1: Data Clone Detection and Visualization in Spreadsheets icse  13

Data Clone Detection and Visualization in

Spreadsheetsicse 13

Felienne Hermans, Ben Sedee, Martin Pinzger and Arie van DeursenDelft University of Technology

Page 2: Data Clone Detection and Visualization in Spreadsheets icse  13

BACKGROUND

• Spreadsheets are widely used

• Copy-paste actions are widely used

• If formulas’s values are copied as plain text in a different location, data can be easily out of sync.

Page 3: Data Clone Detection and Visualization in Spreadsheets icse  13

GOAL

• Data clone detection

• Data clone visualization

Page 4: Data Clone Detection and Visualization in Spreadsheets icse  13

DATA CLONE DETECTION

• Algorithm– Cell classification– Lookup creation– Pruning– Cluster finding– Cluster matching

Page 5: Data Clone Detection and Visualization in Spreadsheets icse  13

CLONE VISUALIZATION

• Dataflow diagrams

• Pop-ups

Page 6: Data Clone Detection and Visualization in Spreadsheets icse  13

EVALUATION

Page 7: Data Clone Detection and Visualization in Spreadsheets icse  13

Comparative Causality: Explaining the Differences

Between Executionsicse 13

William N. Sumner Xiangyu ZhangPurdue University

Page 8: Data Clone Detection and Visualization in Spreadsheets icse  13

BACKGROUND

• A fine-grained causal inference technique.

• Causal State Minimization in Delta Debugging

• CSM has its limitations.

Page 9: Data Clone Detection and Visualization in Spreadsheets icse  13

LIMITATIONS of CSM

• 1. Confounding caused by Partial State Replacement

Page 10: Data Clone Detection and Visualization in Spreadsheets icse  13

LIMITATIONS of CSM

• 2. Execution Omission

• 3. Efficiency

Page 11: Data Clone Detection and Visualization in Spreadsheets icse  13

SOLUTION

• Confounding & Efficiency– They build a new model without confounding– The model is to simplify the original code and

reexecute with this new code

Page 12: Data Clone Detection and Visualization in Spreadsheets icse  13
Page 13: Data Clone Detection and Visualization in Spreadsheets icse  13

SOLUTION

• Execution Omission– Do state replacement both in the correct

execution and in the buggy execution.

Page 14: Data Clone Detection and Visualization in Spreadsheets icse  13

EVALUATION