Comprehensive path-sensitive data-flow analysis

Thakur, Aditya V. and Govindarajan, R.
Sixth International Symposium on Code Generation and Optimization (CGO), 2008

Data-flow analysis is an integral part of any aggressive optimizing compiler. We propose a framework for improving the precision of data-flow analysis in the presence of complex control flow. We first perform data-flow analysis to determine those control-flow merges that cause a loss of data-flow analysis precision. The control-flow graph of the program is then restructured such that performing data-flow analysis on the restructured graph gives more precise results. The proposed framework is both simple, involving the familiar notion of product automata, and general, since it is applicable to any forward data-flow analysis. Apart from proving that our restructuring process is correct, we also show that restructuring is effective in that it necessarily leads to more optimization opportunities. Furthermore, the framework handles the trade-off between the increase in data-flow precision and the code size increase inherent in the restructuring. We show that determining an optimal restructuring is NP-hard, and we propose and evaluate a greedy strategy. The framework has been implemented in the Scale research compiler and instantiated for the specific problem of Constant Propagation. On the SPECINT 2000 benchmark suite we observe an average speedup of 4% in running time over the Wegman-Zadeck conditional constant propagation algorithm and 2% over a purely path-profile-guided approach.
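To make the idea of precision loss at a control-flow merge concrete, here is a minimal sketch in Python. It is not the paper's framework or the Scale implementation: the lattice encoding, the `meet` and `analyze` helpers, and the toy graphs are all assumptions chosen for illustration. It tracks a single variable `x` through an acyclic graph, first with a merge node `m`, then with `m` duplicated once per incoming path, which is roughly what restructuring would achieve in this tiny case.

```python
# Illustrative toy only: constant propagation for one variable `x`.
# All names here (TOP, BOTTOM, meet, analyze, node labels) are hypothetical.
TOP = "top"        # no information yet
BOTTOM = "bottom"  # known not to be a single constant


def meet(a, b):
    """Combine the facts arriving from two predecessors."""
    if a == TOP:
        return b
    if b == TOP:
        return a
    return a if a == b else BOTTOM


def analyze(preds, assigns, order):
    """Forward analysis over an acyclic graph.

    preds:   node -> list of predecessor nodes
    assigns: node -> constant assigned to x in that node (absent if none)
    order:   nodes listed in topological order
    Returns a dict mapping each node to the value of x at its exit.
    """
    out = {}
    for node in order:
        value = TOP
        for pred in preds.get(node, []):
            value = meet(value, out[pred])
        # An assignment in the node overrides whatever flowed in.
        out[node] = assigns.get(node, value)
    return out


# Original graph: both branches merge at `m`, so x=1 and x=2 are combined.
original = analyze(
    preds={"a": ["entry"], "b": ["entry"], "m": ["a", "b"]},
    assigns={"a": 1, "b": 2},
    order=["entry", "a", "b", "m"],
)

# Restructured graph: `m` is split into one copy per incoming path.
restructured = analyze(
    preds={"a": ["entry"], "b": ["entry"], "m_a": ["a"], "m_b": ["b"]},
    assigns={"a": 1, "b": 2},
    order=["entry", "a", "b", "m_a", "m_b"],
)

print(original["m"])
print(restructured["m_a"], restructured["m_b"])
```

In the original graph the analysis can no longer treat `x` as a constant after the merge, while each duplicated copy keeps its own constant. The cost is the extra copy of `m`, which reflects the precision versus code size trade-off the abstract mentions.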


@inproceedings{thakur_govindarajan_CGO08,
  author = {Thakur, Aditya V. and Govindarajan, R.},
  title = {Comprehensive path-sensitive data-flow analysis},
  booktitle = {Sixth International Symposium on Code Generation and
                    Optimization {(CGO)}},
  year = {2008},
  pages = {55--63},
  doi = {10.1145/1356058.1356066},
  publisher = {ACM}
}