SAT feature analysis for machine learning classification tasks
dc.contributor.author | Dalla, Marco | en |
dc.contributor.author | Provan-Bessell, Benjamin | en |
dc.contributor.author | Visentin, Andrea | en |
dc.contributor.author | O'Sullivan, Barry | en |
dc.contributor.funder | Science Foundation Ireland | en |
dc.contributor.funder | European Regional Development Fund | en |
dc.date.accessioned | 2023-11-03T14:12:23Z | |
dc.date.available | 2023-11-03T14:12:23Z | |
dc.date.issued | 2023-07-02 | en |
dc.description.abstract | The extraction of meaningful features from CNF instances is crucial to applying machine learning to SAT solving, enabling algorithm selection and configuration for solver portfolios and satisfiability classification. While many approaches have been proposed for feature extraction, their relevance to these tasks is unclear. Their applicability and comparison of the information extracted and the computational effort needed are complicated by the lack of working or updated implementations, negatively affecting reproducibility. In this paper, we analyse the performance of five sets of features presented in the literature on SAT/UNSAT and problem category classification over a dataset of 3000 instances across ten problem classes distributed equally between SAT and UNSAT. To increase reproducibility and encourage research in this area, we released a Python library containing an updated and clear implementation of structural, graph-based, statistical and probing features presented in the literature for SAT CNF instances; and we define a clear pipeline to compare feature sets in a given learning task robustly. We analysed which of the computed features are relevant for the specific task and the tradeoff they provide between accuracy and computational effort. The results of the analysis provide insights into which features mostly affect an instance's satisfiability and which can be used to identify the problem's type. These insights can be used to develop more effective solver portfolios and satisfiability classification algorithms. | en |
dc.description.status | Peer reviewed | en |
dc.description.version | Accepted Version | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.citation | Dalla, M., Provan-Bessell, B., Visentin, A. and O’Sullivan, B. (2023) ‘SAT feature analysis for machine learning classification tasks’, Sixteenth International Symposium on Combinatorial Search, Proceedings of the International Symposium on Combinatorial Search, 16(1), pp. 138–142, https://doi.org/10.1609/socs.v16i1.27292 | en |
dc.identifier.doi | https://doi.org/10.1609/socs.v16i1.27292 | en |
dc.identifier.endpage | 142 | en |
dc.identifier.isbn | 978-1-57735-882-4 | en |
dc.identifier.issn | 2832-9163 | en |
dc.identifier.issn | 2832-9171 | en |
dc.identifier.startpage | 138 | en |
dc.identifier.uri | https://hdl.handle.net/10468/15190 | |
dc.language.iso | en | en |
dc.publisher | The AAAI Press | en |
dc.relation.ispartof | Proceedings of the International Symposium on Combinatorial Search | en |
dc.relation.ispartof | Sixteenth International Symposium on Combinatorial Search | en |
dc.relation.project | info:eu-repo/grantAgreement/SFI/SFI Research Centres Programme::Phase 1/16/RC/3918/IE/Confirm Centre for Smart Manufacturing/ | en |
dc.relation.project | info:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/ | en |
dc.relation.project | info:eu-repo/grantAgreement/SFI/SFI Research Centres Programme::Phase 2/12/RC/2289-P2s/IE/INSIGHT Phase 2/ | en |
dc.relation.project | info:eu-repo/grantAgreement/SFI/SFI Centres for Research Training Programme::Data and ICT Skills for the Future/18/CRT/6223/IE/SFI Centre for Research Training in Artificial Intelligence/ | en |
dc.rights | © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org) | en |
dc.subject | Machine and deep learning in search | en |
dc.subject | Search in Boolean satisfiability | en |
dc.subject | Portfolios of search algorithms | en |
dc.subject | Time | en |
dc.subject | Memory | en |
dc.subject | Solution quality trade-offs | en |
dc.subject | Combinatorial Search | en |
dc.subject | SAT CNF instances | en |
dc.subject | Artificial intelligence | en |
dc.subject | SAT/UNSAT | en |
dc.subject | SATfeatPy | en |
dc.subject | Problem category classification | en |
dc.subject | Machine learning | en |
dc.subject | AI | en |
dc.title | SAT feature analysis for machine learning classification tasks | en |
dc.type | Conference item | en |
oaire.citation.issue | 1 | en |
oaire.citation.volume | 16 | en |