i-nth logo

Authors

Xiaoyan Wang, Quan Yu, & Guowei Yang

Abstract

Spreadsheets are widely used in data recording and data analysis in various fields. Since most spreadsheet users are non-programmers, they are prone to various errors in the process of designing and using the spreadsheet, which may bring huge economic losses to the organization.

Therefore, a lot of related work has emerged to support the users in detecting and fixing errors in spreadsheets. In these studies, most of them are based on the assumption that the original data is error-free, because formulas in the spreadsheet are considered more error-prone than input cells.

However, the assumption is not reasonable in the field of frequently using spreadsheet templates, because they have been professionally validated during usage. In such cases, root errors often arise from wrong input values rather than cells with formulas.

Therefore, this paper proposes an error repair method for input cells in templated spreadsheet based on symbol execution and constraint satisfaction solving.

  • First, find the output cells in the spreadsheet according to dependencies.
  • Second, for suspicious output cells, the input cells that determine their results are obtained via symbolic execution.
  • Finally, the repair candidates of the input cells are obtained by constraint satisfaction solving with domain knowledge.

Sample

Overview of our approach
Overview of our approach

Our approach uses symbolic execution techniques to find input cells associated with wrong output cells.

With collected data relations, we model the repair candidate solving problem as a constraint satisfaction problem (CSP).

Key steps in the process are:

  • Extracting formulas.
  • Finding faulty output cells.
  • Symbolic execution.
  • Constraint solving.

Publication

2018, 25th Asia-Pacific Software Engineering Conference (APSEC), December, pages 705-706

Full article

Automated repair of data faults in templated spreadsheets