A Dataset and Experimental Evaluation of a Parallel Conflict Detection Solution for Model-Based Diagnosis
Abstract
1. Introduction
- A collection of constraint satisfaction problem (CSP) instances of varying size and complexity;
- The corresponding conflict sets computed in parallel and sequential modes;
- Execution traces, performance metrics, and parameter configurations for all runs.
2. Methods
2.1. QuickXPlain: Sequential Baseline
2.2. Parallel QuickXPlain: Speculative Parallelization
2.3. Implementation and Runtime Setup
2.4. Benchmarking Protocol and Dataset Generation
- Computed conflict set(s) in plain text.
- Execution time, number of recursive calls, and thread count.
- Hash validation to verify correctness and identity of outputs.
3. Data Files
3.1. File Structure and Contents
3.2. Dataset Generation and Metadata
3.3. Reproducibility Support
- Reproducible CSP instances in CNF format (folder cnf/);
- Result logs from benchmark runs (folder results/);
- Documentation in the README explaining parameter usage and available scripts.
4. Technical Validation
4.1. Correctness and Minimality
4.2. Speculative Parallelism Strategy
4.3. Performance Benchmark
4.4. Reproducibility
5. Usage Notes
5.1. Target Use Cases
- Benchmarking and Performance Evaluation: Researchers can employ the dataset to test the scalability of new conflict detection algorithms under standardized conditions. The execution logs and ground truth conflicts enable fair comparisons.
- Educational Integration: Instructors can incorporate the repository in AI, logic programming, or software engineering curricula. The visual models and step-by-step code enhance conceptual understanding of constraint reasoning and diagnosis.
- Algorithmic Development: Developers can extend or replace the conflict checking modules with their own solvers or optimizations while using the same CSP inputs and validation protocols. Moreover, recent work has emphasized integrating conflict detection tools into SAT- and SMT-based pipelines for richer constraint representation and interoperability [23].
- Speculative Execution Studies: The codebase includes tuning options for speculative depth, number of threads, and workload distribution, which support further studies in parallel and speculative algorithms.
- Toolchain Integration: Practitioners working with model-based diagnostic tools, product configurators, or rule engines can adapt the conflict detection backend to plug into their existing systems.
5.2. Illustrative Workflow
5.3. Summary of Potential Applications
5.4. Recommendations
6. Conclusions
Limitations and Future Work
- Extending the dataset with real-world industrial models and larger configurations.
- Refining speculative task scheduling and prioritization heuristics.
- Enabling compatibility with declarative formats (e.g., XCSP3) for broader applicability.
- Investigating hybrid parallelism strategies across distributed computing environments.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Reiter, R. A Theory of Diagnosis from First Principles. Artif. Intell. 1987, 32, 57–95. [Google Scholar] [CrossRef]
- de Kleer, J.; Williams, B.C. Diagnosing multiple faults. Artif. Intell. 1987, 32, 97–130. [Google Scholar] [CrossRef]
- Junker, U. QUICKXPLAIN: Preferred explanations and relaxations for over-constrained problems. In Proceedings of the 19th National Conference on Artificial Intelligence (AAAI-04), San Jose, CA, USA, 25–29 July 2024; AAAI Press: Menlo Park, CA, USA, 2004; pp. 167–172. [Google Scholar]
- Borrego-Díaz, J.; Galán Páez, J. Knowledge representation for explainable artificial intelligence. Complex Intell. Syst. 2022, 8, 1579–1601. [Google Scholar] [CrossRef]
- Felfernig, A.; Reiterer, S.; Reinfrank, F.C.; Ninaus, G.; Jeran, M. Conflict Detection and Diagnosis in Configuration. In Knowledge-Based Configuration: From Research to Business Cases, 1st ed.; Elsevier B.V.: Amsterdam, Netherlands, 2014; pp. 73–87. ISBN 978-0-12-415817-7. [Google Scholar]
- Felfernig, A.; Friedrich, G.; Jannach, D.; Zanker, M. Constraint-Based Recommender Systems. In Recommender Systems Handbook; Springer: Berlin/Heidelberg, Germany, 2011; pp. 187–215. [Google Scholar] [CrossRef]
- Baumeister, J.; Seitz, F. A Survey of Tools for Knowledge Base Debugging. KI—KüNstliche Intell. 2013, 27, 93–99. [Google Scholar]
- Dev Gupta, S.; Genç, B.; O’Sullivan, B. Explanation in Constraint Satisfaction: A Survey. In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI 2021), Montreal, QC, Canada, 19–26 August 2021; International Joint Conferences on Artificial Intelligence Organization: San Diego, CA, USA, 2021; pp. 4400–4407. [Google Scholar] [CrossRef]
- Rodler, P. A formal proof and simple explanation of the QuickXPlain algorithm. Artif. Intell. Rev. 2022, 55, 6185–6206. [Google Scholar] [CrossRef] [PubMed]
- Vidal, C.; Felfernig, A.; Galindo, J.; Atas, M.; Benavides, D. Explanations for over-constrained problems using QuickXPlain with speculative executions. J. Intell. Inf. Syst. 2021, 57, 491–508. [Google Scholar] [CrossRef]
- Console, L.; Friedrich, G.; Dupré, D.T. Model-based Diagnosis Meets Error Diagnosis in Logic Programs. In Proceedings of the Automated and Algorithmic Debugging—AADEBUG 1993, Linköping, Sweden, 3–5 May 1993; Fritzson, P.A., Ed.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 1993; Volume 749, pp. 127–146. [Google Scholar] [CrossRef]
- Torlak, E.; Chang, F.S.H.; Jackson, D. Finding Minimal Unsatisfiable Cores of Declarative Specifications. In Proceedings of the FM 2008: Formal Methods, Turku, Finland, 26–30 May 2008; Cuellar, J., Maibaum, T., Sere, K., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2008; Volume 5014, pp. 326–341. [Google Scholar] [CrossRef]
- Wieringa, S. Understanding, Improving and Parallelizing MUS Finding Using Model Rotation. In Proceedings of the Principles and Practice of Constraint Programming—CP 2012, Québec City, QC, Canada, 8–12 October 2012; Milano, M., Ed.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2012; Volume 7514, pp. 672–687. [Google Scholar] [CrossRef]
- Ahn, J.W.; O’Boyle, M.F. Speculative parallelism for irregular programs using optimistic thread scheduling. IEEE Trans. Parallel Distrib. Syst. 2020, 31, 2893–2907. [Google Scholar]
- Silva, C.V.; Felfernig, A.; Galindo, J.; Atas, M.; Benavides, D. A Parallelized Variant of Junker’s QuickXPlain Algorithm. In Proceedings of the Foundations of Intelligent Systems. 25th International Symposium, ISMIS 2020, Graz, Austria, 23–25 September 2020; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2020; Volume 12117, pp. 457–468. [Google Scholar] [CrossRef]
- Gent, I.P.; Walsh, T. CSPLib: A benchmark library for constraints. Constraints 2001, 5, 293–316. [Google Scholar] [CrossRef]
- Liffiton, M.H.; Grimes, D.; Sarrafzadeh, A. Benchmarking and fairness in constraint satisfaction solving. Constraints 2022, 27, 195–219. [Google Scholar]
- Pineau, J.; Vincent-Lamarre, P.; Sinha, K.; Larivière, V.; Beygelzimer, A.; d’Alché Buc, F.; Fox, E.; Larochelle, H. Improving reproducibility in machine learning research (a report from the NeurIPS 2019 reproducibility program). J. Mach. Learn. Res. 2021, 22, 1–20. [Google Scholar] [CrossRef]
- Vidal-Silva, C.; Duarte, V.; Cardenas-Cobo, J.; Serrano-Malebran, J.; Veas, I.; Rubio-León, J. Reviewing Automated Analysis of Feature Model Solutions for the Product Configuration. Appl. Sci. 2023, 13, 174. [Google Scholar] [CrossRef]
- Benavides, D.; Segura, S.; Trinidad, P.; Cortés, A.R. FAMA: Tooling a Framework for the Automated Analysis of Feature Models. In Proceedings of the First International Workshop on Variability Modelling of Software-Intensive Systems (VaMoS 2007), Limerick, Ireland, 16–18 January 2007; Pohl, K., Heymans, P., Kang, K.C., Metzger, A., Eds.; Lero Technical Report. Volume 2007-01, pp. 129–134. [Google Scholar]
- Benavides, D.; Felfernig, A.; Galindo, J.A.; Reinfrank, F. Automated Analysis in Feature Modelling and Product Configuration. In Proceedings of the Safe and Secure Software Reuse—ICSR 2013, Pisa, Italy, 18–20 June 2013; Favaro, J., Morisio, M., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2013; Volume 7925, pp. 160–175. [Google Scholar] [CrossRef]
- Stodden, V. Enhancing reproducibility in computational methods. Nat. Comput. Sci. 2022, 2, 76–78. [Google Scholar] [CrossRef] [PubMed]
- Barbosa, H.; Barrett, C.; Brain, M.; Kremer, G.; Lachnitt, H.; Mann, M.; Mohamed, A.; Mohamed, M.; Niemetz, A.; Nötzli, A.; et al. cvc5: A Versatile and Industrial-Strength SMT Solver. In Proceedings of the Tools and Algorithms for the Construction and Analysis of Systems—TACAS 2022, Munich, Germany, 2–7 April 2022; Fisman, D., Rosu, G., Eds.; Lecture Notes in Computer Science. Springer: Cham, Switzerland, 2022; Volume 13243, pp. 415–442. [Google Scholar] [CrossRef]
- Penco, R.; Pintar, D.; Vranić, M.; Šoštarić, M. Large Language Model-Driven Framework for Automated Constraint Model Generation in Configuration Problems. Appl. Sci. 2025, 15, 6518. [Google Scholar] [CrossRef]
- Usman, M.; Wang, W.; Vasic, M.; Wang, K.; Vikalo, H.; Khurshid, S. A Study of the Learnability of Relational Properties: Model Counting Meets Machine Learning (MCML). In Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2020), London, UK, 15–20 June 2020; Association for Computing Machinery: New York, NY, USA, 2020; pp. 1098–1111. [Google Scholar] [CrossRef]
- Xu, Y.; Liu, Z.; Tang, J. Hybrid Symbolic-Neural Reasoning for Large-Scale Constraint Solving. J. Artif. Intell. Res. 2023, 76, 255–282. [Google Scholar]
Instance | QX (1T) | PQX (2T) | PQX (4T) | Speedup (4T) | Memory (MB) | Efficiency (%) |
---|---|---|---|---|---|---|
csp_20 | 210 ms | 142 ms | 108 ms | 1.94× | 54.3 | 48.5 |
csp_40 | 765 ms | 410 ms | 278 ms | 2.75× | 67.1 | 68.8 |
csp_80 | 2435 ms | 1302 ms | 712 ms | 3.42× | 82.6 | 85.5 |
File/Folder | Description | Format |
---|---|---|
instances/ | Collection of example CSP problem instances. Each file defines variables, domains, and constraints in a structured dictionary format. Used as input for the QuickXPlain runs. | .json, .py |
conflicts/ | Output directory containing computed minimal conflict sets. Each file includes the list of constraints involved in an unsatisfiable subset. | .txt |
timing_results/ | Automatically generated folder with CSV logs recording execution times, number of recursive calls, and thread usage. Useful for performance evaluation and reproducibility. | .csv |
quickxplain.py | Main Python script implementing both the standard and parallel variants of QuickXPlain. Includes command-line options for thread control. | .py |
run_all.py | Helper script that executes multiple instances in batch mode and aggregates performance metrics. | .py |
readme.md | Documentation describing installation, usage, command-line arguments, expected input/output, and example runs. | Markdown |
config/ | Optional directory for storing external configuration files or solver settings (not mandatory for basic runs). | .yaml (future use) |
Validation Criteria | Sequential QX | Parallel QX (PQX) | Match Rate |
---|---|---|---|
Correct conflict sets | 500/500 | 500/500 | 100% |
Minimal conflict sets | 500/500 | 500/500 | 100% |
Structural consistency | ✓ | ✓ | 100% |
Instance | QX Mean (ms) | QX Std Dev | PQX Mean (ms) | PQX Std Dev | p-Value |
---|---|---|---|---|---|
csp_20 | 210 | 4.2 | 108 | 3.8 | <0.01 |
csp_40 | 765 | 11.5 | 278 | 10.2 | <0.01 |
csp_80 | 2435 | 38.7 | 712 | 26.3 | <0.01 |
Scenario | Description | Dataset Components |
---|---|---|
Benchmarking | Compare runtime, depth, and parallel speedup of conflict algorithms | timings.csv, conflicts/ |
Educational Use | Teach CSPs, recursion, and model-based diagnosis with visual support | instances/, Figures, README.md |
Algorithm Testing | Replace QX core logic with a custom solver or method | quickxplain.py, run_all.py |
Reproducibility Studies | Run identical experiments across systems or environments | config.yaml, docker/ |
Speculative Research | Analyze effectiveness of speculative parallel branches | Execution logs, Figure 3 and Figure 4 |
Contribution | Description |
---|---|
Open benchmark dataset | Includes synthetic and real CSPs derived from feature models and configuration problems |
Parallel conflict detection tool | Speculative, multithreaded variant of QuickXPlain with correctness validation |
Reproducibility framework | Public codebase with configuration scripts, test suite, and containerized environment |
Empirical validation | Runtime speedup up to 3.4× on large CSPs, confirmed by benchmark results |
Educational utility | Structured code and visual figures suitable for teaching and algorithmic prototyping |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cabezas-Quinto, J.J.; Vidal-Silva, C.; Serrano-Malebrán, J.; Márquez, N. A Dataset and Experimental Evaluation of a Parallel Conflict Detection Solution for Model-Based Diagnosis. Data 2025, 10, 139. https://doi.org/10.3390/data10090139
Cabezas-Quinto JJ, Vidal-Silva C, Serrano-Malebrán J, Márquez N. A Dataset and Experimental Evaluation of a Parallel Conflict Detection Solution for Model-Based Diagnosis. Data. 2025; 10(9):139. https://doi.org/10.3390/data10090139
Chicago/Turabian StyleCabezas-Quinto, Jessica Janina, Cristian Vidal-Silva, Jorge Serrano-Malebrán, and Nicolás Márquez. 2025. "A Dataset and Experimental Evaluation of a Parallel Conflict Detection Solution for Model-Based Diagnosis" Data 10, no. 9: 139. https://doi.org/10.3390/data10090139
APA StyleCabezas-Quinto, J. J., Vidal-Silva, C., Serrano-Malebrán, J., & Márquez, N. (2025). A Dataset and Experimental Evaluation of a Parallel Conflict Detection Solution for Model-Based Diagnosis. Data, 10(9), 139. https://doi.org/10.3390/data10090139