
Authors

Marc Fisher, Gregg Rothermel, Tyler Creelan, & Margaret Burnett

Abstract

Spreadsheets are widely used but often contain faults. Thus, in prior work we presented a dataflow testing methodology for use with spreadsheets, which studies have shown can be used cost-effectively by end-user programmers.

To date, however, the methodology has been investigated across a limited set of spreadsheet language features. Commercial spreadsheet environments are multiparadigm languages, utilizing features not accommodated by our prior approaches. In addition, most spreadsheets contain large numbers of replicated formulas that severely limit the efficiency of dataflow testing approaches.

We show how to handle these two issues with a new dataflow adequacy criterion and automated detection of areas of replicated formulas, and report results of a controlled experiment investigating the feasibility of our approach.

Sample

Representations of a formula

This figure shows three equivalent ways of representing the same formula:

  • Spreadsheet formula.
  • Imperative code.
  • Abstract syntax tree.
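
The figure itself is not reproduced here, so the following is only a minimal sketch of the idea, using a hypothetical formula, `=IF(A1 > 10, A1 * 2, 0)`, that is not taken from the paper. It writes that one formula in each of the three forms listed above. The class names (`Ref`, `Num`, `BinOp`, `If`) and the `evaluate` helper are assumptions made for illustration, not the authors' representation.

```python
from dataclasses import dataclass

# 1. Spreadsheet formula (hypothetical example, not the one in the figure).
FORMULA_TEXT = "=IF(A1 > 10, A1 * 2, 0)"


# 2. Imperative code: the same computation as explicit control flow.
def imperative(cells):
    if cells["A1"] > 10:
        return cells["A1"] * 2
    else:
        return 0


# 3. Abstract syntax tree: the same computation as nested nodes.
@dataclass
class Ref:
    name: str  # cell reference, e.g. "A1"


@dataclass
class Num:
    value: float  # numeric literal


@dataclass
class BinOp:
    op: str  # only ">" and "*" are supported in this sketch
    left: object
    right: object


@dataclass
class If:
    cond: object
    then: object
    orelse: object


AST = If(
    cond=BinOp(">", Ref("A1"), Num(10)),
    then=BinOp("*", Ref("A1"), Num(2)),
    orelse=Num(0),
)


def evaluate(node, cells):
    """Walk the tree and compute its value for the given cell contents."""
    if isinstance(node, Num):
        return node.value
    if isinstance(node, Ref):
        return cells[node.name]
    if isinstance(node, BinOp):
        left = evaluate(node.left, cells)
        right = evaluate(node.right, cells)
        if node.op == ">":
            return left > right
        if node.op == "*":
            return left * right
        raise ValueError(f"unsupported operator: {node.op}")
    if isinstance(node, If):
        branch = node.then if evaluate(node.cond, cells) else node.orelse
        return evaluate(branch, cells)
    raise TypeError(f"unknown node type: {type(node).__name__}")
```

Evaluating the tree and running the imperative version on the same cell values should give the same result, which is the sense in which the representations are equivalent. The imperative form makes the two branches of the `IF` explicit, which is the kind of structure a dataflow analysis can work with.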

Publication

IEEE International Symposium on Software Reliability Engineering, November 2006, pages 13-22

Full article

Scaling a dataflow testing methodology to the multiparadigm world of commercial spreadsheets