SAT feature analysis for machine learning classification tasks

Dalla, Marco; Provan-Bessell, Benjamin; Visentin, Andrea; O'Sullivan, Barry

SAT feature analysis for machine learning classification tasks

dc.contributor.author	Dalla, Marco	en
dc.contributor.author	Provan-Bessell, Benjamin	en
dc.contributor.author	Visentin, Andrea	en
dc.contributor.author	O'Sullivan, Barry	en
dc.contributor.funder	Science Foundation Ireland	en
dc.contributor.funder	European Regional Development Fund	en
dc.date.accessioned	2023-11-03T14:12:23Z
dc.date.available	2023-11-03T14:12:23Z
dc.date.issued	2023-07-02	en
dc.description.abstract	The extraction of meaningful features from CNF instances is crucial to applying machine learning to SAT solving, enabling algorithm selection and configuration for solver portfolios and satisfiability classification. While many approaches have been proposed for feature extraction, their relevance to these tasks is unclear. Their applicability and comparison of the information extracted and the computational effort needed are complicated by the lack of working or updated implementations, negatively affecting reproducibility. In this paper, we analyse the performance of five sets of features presented in the literature on SAT/UNSAT and problem category classification over a dataset of 3000 instances across ten problem classes distributed equally between SAT and UNSAT. To increase reproducibility and encourage research in this area, we released a Python library containing an updated and clear implementation of structural, graph-based, statistical and probing features presented in the literature for SAT CNF instances; and we define a clear pipeline to compare feature sets in a given learning task robustly. We analysed which of the computed features are relevant for the specific task and the tradeoff they provide between accuracy and computational effort. The results of the analysis provide insights into which features mostly affect an instance's satisfiability and which can be used to identify the problem's type. These insights can be used to develop more effective solver portfolios and satisfiability classification algorithms.	en
dc.description.status	Peer reviewed	en
dc.description.version	Accepted Version	en
dc.format.mimetype	application/pdf	en
dc.identifier.citation	Dalla, M., Provan-Bessell, B., Visentin, A. and O’Sullivan, B. (2023) ‘SAT feature analysis for machine learning classification tasks’, Sixteenth International Symposium on Combinatorial Search, Proceedings of the International Symposium on Combinatorial Search, 16(1), pp. 138–142, https://doi.org/10.1609/socs.v16i1.27292	en
dc.identifier.doi	https://doi.org/10.1609/socs.v16i1.27292	en
dc.identifier.endpage	142	en
dc.identifier.isbn	978-1-57735-882-4	en
dc.identifier.issn	2832-9163	en
dc.identifier.issn	2832-9171	en
dc.identifier.startpage	138	en
dc.identifier.uri	https://hdl.handle.net/10468/15190
dc.language.iso	en	en
dc.publisher	The AAAI Press	en
dc.relation.ispartof	Proceedings of the International Symposium on Combinatorial Search	en
dc.relation.ispartof	Sixteenth International Symposium on Combinatorial Search	en
dc.relation.project	info:eu-repo/grantAgreement/SFI/SFI Research Centres Programme::Phase 1/16/RC/3918/IE/Confirm Centre for Smart Manufacturing/	en
dc.relation.project	info:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/	en
dc.relation.project	info:eu-repo/grantAgreement/SFI/SFI Research Centres Programme::Phase 2/12/RC/2289-P2s/IE/INSIGHT Phase 2/	en
dc.relation.project	info:eu-repo/grantAgreement/SFI/SFI Centres for Research Training Programme::Data and ICT Skills for the Future/18/CRT/6223/IE/SFI Centre for Research Training in Artificial Intelligence/	en
dc.rights	© 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org)	en
dc.subject	Machine and deep learning in search	en
dc.subject	Search in Boolean satisfiability	en
dc.subject	Portfolios of search algorithms	en
dc.subject	Time	en
dc.subject	Memory	en
dc.subject	Solution quality trade-offs	en
dc.subject	Combinatorial Search	en
dc.subject	SAT CNF instances	en
dc.subject	Artificial intelligence	en
dc.subject	SAT/UNSAT	en
dc.subject	SATfeatPy	en
dc.subject	Problem category classification	en
dc.subject	Machine learning	en
dc.subject	AI	en
dc.title	SAT feature analysis for machine learning classification tasks	en
dc.type	Conference item	en
oaire.citation.issue	1	en
oaire.citation.volume	16	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: SOCS2023_SATFeatPy_AV.pdf
Size:: 453.6 KB
Format:: Adobe Portable Document Format
Description:: Accepted version

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science - Conference Items
Insight Centre for Data Analytics - Conference Items