The Impact of Entropy and Solution Density on Selected SAT Heuristics
Abstract
:1. Introduction
- Suggesting the concept of entropy as a new proxy to the freedom of variables,
- Showing evidence that for satisfiable formulas there are better predictors of the effectiveness of various SAT heuristics than the sat/unsat dichotomy suggested by Oh, and in particular that entropy predicts hardness consistently across all those heuristics (albeit not in all cases with strong statistical significance), and
- Setting the foundations for future research into approximating entropy fast, which may eventually indeed lead to constructing faster portfolios based on entropy-based hardness prediction.
2. Entropy
3. A Preliminary: Standardized Linear Regression
- Standardization of the data: given data points , their standardization is defined for by
- Bootstrapping: Bootstrapping, parameterized by a value k, is a well-known technique for improving the precision of various statistics, such as the confidence interval. Technically, bootstrap is applied as follows: Given the original n samples, uniformly sample it n times with replacement (i.e., without taking the sampled points out, which implies that the same point can be selected more than once); repeat this process k times. Hence we now have data points. For our experiments we took , which is a rather standard value when using this technique. In each of the experiments that will be reported later on , hence we have a total of data points for each experiment.
- Two regression tests: The entropy and density data consists of pairs of the form , and , respectively, where is the index of the heuristic (e.g., in Section 4.3 we will compare the effectiveness of two restart strategies, so the indices 1 and 2 refer to those strategies). Hence the corresponding data is four series of points , and , where . To compare the predictive power of entropy, density and Oh’s criterion of SAT/UNSAT, we performed two statistical tests (recall that the data is standardized, and hence comparable):
- –
- The test: A linear regression test over the series , and the series .
- –
- The test: A linear regression test over the series and , and similarly for density (i.e., four tests all together). We then checked the significance of for each of these 4 tests. In addition, we checked the hypothesis for each of the measures. The result of this last test appears in the Appendix A.
Intuitively, the two models tell us slightly different things: the first tells us whether the gap between the two heuristics is correlated with the measure, and the second tells us whether there is a significant difference in the value of (the slope of the linear model) between the two heuristics. As we will see in the results, the p-value obtained by these models can be very different. - Plots: The plots are based on the original (non-standardized) data. To reduce the clutter (from 5000 points), we rounded all values to 2 decimal points and then aggregated them. Aggregation means that points (i.e., n points with an equal x value) are replaced with a single point ( ). However the trend-lines in the various plots are depicted according to the original data, before rounding and aggregation. The statistical significance of these trend-lines appears in the Appendix A.
4. Empirical Findings
4.1. The Benchmark Set
4.2. Entropy and Density Predict Hardness
4.3. A Refinement of Oh’s Results
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
Appendix A. Regression-Tests Results
Exp. | Bench. | Measure | Confidence | p-Val | Confidence | p-Val | Confidence | p-Val |
---|---|---|---|---|---|---|---|---|
Interval () | interval () | Interval () | ||||||
1 | B-Rand | Entropy | (−17.207, 16.04) | 0.48 | (−17.878, 16.155) | 0.05 | (−15.836, −2.328) | 0 |
B-Mod | Entropy | (−806.83, 675.0) | 0.46 | (−779.3, 699.96) | 0.44 | (−387.3, 164.9) | 0.003 | |
B-Rand | Density | (−21.39, 108.65) | 0.09 | (−19.30, 111.97) | 0.39 | (−25.01, −5.53) | 0 | |
B-Mod | Density | (−6829, 2647) | 0.2 | (−6663, 2883) | 0.16 | (−519.4, 758.4) | 0.003 | |
2 | B-Rand | Entropy | (−24.16, 2.49) | 0.04 | (−24.59, 1.71) | 0.39 | (0.640, 11.138) | 0.01 |
B-Mod | Entropy | (−651.6, 550.48) | 0.44 | (−622.03, 545.26) | 0.49 | (−190.64, 251.56) | 0.33 | |
B-Rand | Density | (−86.37, 17.50) | 0.09 | (−85.24, 16.11) | 0.47 | (−0.723, 14.450) | 0.01 | |
B-Mod | Density | (−2472, 5240) | 0.23 | (−2578, 5455) | 0.13 | (−717.8, 371.4) | 0.33 | |
3 | B-Rand | Entropy | (−53.50, 22.17) | 0.22 | (−52.71, 23.33) | 0.001 | (−40.55, −10.27) | 0 |
B-Mod | Entropy | (−938.9, 570.7) | 0.33 | (−921.4, 611.0) | 0.48 | (−4.2, 553.3) | 2 × 10 | |
B-Rand | Density | (−101.57, 183.8) | 0.30 | (−105.89, 183.08) | 0.05 | (−55.67, −13.35) | 0 | |
B-Mod | Density | (−9939, −937) | 0.007 | (−9991, −832) | 0.16 | (308.4, 1543.6) | 2 × 10 | |
4 | B-Rand | Entropy | (−96.55, −68.38) | 0 | (−97.27, −69.05) | 0.125 | (42.11, 53.69) | 0 |
B-Mod | Entropy | (−2042, −559) | 0.0004 | (−2091, −529) | 0.38 | (1127, 1717) | 0 | |
B-Rand | Density | (−340.3, −224.1) | 0 | (−343.9, −226.5) | 0.47 | (50.44, 67.76) | 0 | |
B-Mod | Density | (−21,540, −11,007) | 0.003 | (−21,474, −10,950) | 0.46 | (2357, 3788) | 0 |
References
- Oh, C. Between SAT and UNSAT: The Fundamental Difference in CDCL SAT. In Theory and Applications of Satisfiability Testing (SAT 2015); Heule, M., Weaver, S., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2015; Volume 9340, pp. 307–323. [Google Scholar]
- Williams, R.; Gomes, C.P.; Selman, B. Backdoors to Typical Case Complexity. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, Acapulco, Mexico, 9–15 August 2003; Gottlob, G., Walsh, T., Eds.; Morgan Kaufmann: Burlington, MA, USA, 2003; pp. 1173–1178. [Google Scholar]
- Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
- Sang, T.; Bacchus, F.; Beame, P.; Kautz, H.A.; Pitassi, T. Combining Component Caching and Clause Learning for Effective Model Counting. In Proceedings of the Seventh International Conference on Theory and Applications of Satisfiability Testing (SAT 2004), Vancouver, BC, Canada, 10–13 May 2004; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
- Thurley, M. sharpSAT—Counting Models with Advanced Component Caching and Implicit BCP. In Proceedings of the 9th International Conference on Theory and Applications of Satisfiability Testing—SAT 2006, Seattle, WA, USA, 12–15 August 2006; Biere, A., Gomes, C.P., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2006; Volume 4121, pp. 424–429. [Google Scholar]
- Heradio, R.; Fernández-Amorós, D.; Perez-Morago, H.; Adán, A. Speeding up Derivative Configuration from Product Platforms. Entropy 2014, 16, 3329–3356. [Google Scholar] [CrossRef] [Green Version]
- Perez-Morago, H.; Heradio, R.; Fernández-Amorós, D.; Bean, R.; Cerrada, C. Efficient Identification of Core and Dead Features in Variability Models. IEEE Access 2015, 3, 2333–2340. [Google Scholar] [CrossRef] [Green Version]
- Heradio, R.; Perez-Morago, H.; Fernández-Amorós, D.; Bean, R.; Cabrerizo, F.J.; Cerrada, C.; Herrera-Viedma, E. Binary Decision Diagram Algorithms to Perform Hard Analysis Operations on Variability Models. In New Trends in Software Methodologies, Tools and Techniques, Proceedings of the Fifteenth SoMeT_16, Larnaca, Cyprus, 12–14 September 2016; Fujita, H., Papadopoulos, G.A., Eds.; Frontiers in Artificial Intelligence and Applications; IOS Press: Amsterdam, The Netherlands, 2016; Volume 286, pp. 139–154. [Google Scholar]
- Giráldez-Cru, J.; Levy, J. A Modularity-Based Random SAT Instances Generator. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, Buenos Aires, Argentina, 25–31 July 2015; Yang, Q., Wooldridge, M., Eds.; AAAI Press: Palo Alto, CA, USA, 2015; pp. 1952–1958. [Google Scholar]
- Newman, M.E.J.; Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 2004, 69, 026113. [Google Scholar] [CrossRef] [PubMed]
- Ansótegui, C.; Giráldez-Cru, J.; Levy, J. The community structure of SAT formulas. In Proceedings of the Theory and Applications of Satisfiability Testing—SAT 2012, Trento, Italy, 17–20 June 2012; Springer: Berlin/Heidelberg, Germany, 2012; pp. 410–423. [Google Scholar]
- Newsham, Z.; Ganesh, V.; Fischmeister, S.; Audemard, G.; Simon, L. Impact of Community Structure on SAT Solver Performance. In Theory and Applications of Satisfiability Testing—SAT 2014, Proceedings of the 17th International Conference, Held as Part of the Vienna Summer of Logic, VSL 2014, Vienna, Austria, 14–17 July 2014; Sinz, C., Egly, U., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2014; Volume 8561, pp. 252–268. [Google Scholar]
- Heule, M.; Dufour, M.; van Zwieten, J.; van Maaren, H. March_eq: Implementing Additional Reasoning into an Efficient Look-Ahead SAT Solver. In Theory and Applications of Satisfiability Testing, Proceedings of the 7th International Conference, SAT 2004, Vancouver, BC, Canada, 10–13 May 2004; Hoos, H.H., Mitchell, D.G., Eds.; Revised Selected Papers; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2004; Volume 3542, pp. 345–359. [Google Scholar]
- Audemard, G.; Simon, L. Predicting Learnt Clauses Quality in Modern SAT Solvers. In Proceedings of the 21st International Joint Conference on Artificial Intelligence, Pasadena, CA, USA, 11–17 July 2009; pp. 399–404. [Google Scholar]
- Full Experimental Results. Available online: http://ie.technion.ac.il/~ofers/entropy/supp.tar.gz (accessed on 15 September 2018).
- Luby, M.; Sinclair, A.; Zuckerman, D. Optimal Speedup of Las Vegas Algorithms. Inf. Process. Lett. 1993, 47, 173–180. [Google Scholar] [CrossRef]
Solver | p-Value | |||
---|---|---|---|---|
MiniSat-HACK-999ED | (−84.29, −72.58) | (−84.93, −73.56) | (5.37, 16.96) | 0.716 |
MiniSat-HACK-999ED | (−86.31, −75.36) | (−82.97, −72.64) | (−7.51, 1.44) | 0.200 |
(modified to luby) | ||||
MiniSat-HACK-999ED | (−72.84, −63.61) | (−72.31, −62.91) | (−4.80, 3.57) | 0.738 |
(modified for 2 phases) | ||||
SWDiA5BY | (−91.61, −79.17) | (−90.97, −78.77) | (−5.95, 4.92) | 0.84 |
COMiniSatPS | (−74.68, −64.58) | (−75.41, −65.43) | (−3.79, 5.37) | 0.76 |
lingeling-ayv | (−76.19, −66.61) | (−71.70, −61.76) | (−8.99, −0.35) | 0.029 |
Glucose | (−91.24, −79.34) | (−90.56, −78.88) | (−6.00, 4.85) | 0.845 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cohen, D.; Strichman, O. The Impact of Entropy and Solution Density on Selected SAT Heuristics. Entropy 2018, 20, 713. https://doi.org/10.3390/e20090713
Cohen D, Strichman O. The Impact of Entropy and Solution Density on Selected SAT Heuristics. Entropy. 2018; 20(9):713. https://doi.org/10.3390/e20090713
Chicago/Turabian StyleCohen, Dor, and Ofer Strichman. 2018. "The Impact of Entropy and Solution Density on Selected SAT Heuristics" Entropy 20, no. 9: 713. https://doi.org/10.3390/e20090713
APA StyleCohen, D., & Strichman, O. (2018). The Impact of Entropy and Solution Density on Selected SAT Heuristics. Entropy, 20(9), 713. https://doi.org/10.3390/e20090713