UNLPSat TextGraphs-16 Natural Language Premise Selection task: Unsupervised Natural Language Premise Selection in mathematical text using sentence-MPNet

dc.contributor.authorTrust, Paul
dc.contributor.authorKadusabe, Provia
dc.contributor.authorYounis, Haseeb
dc.contributor.authorMinghim, Rosane
dc.contributor.authorMilios, Evangelos
dc.contributor.authorZahran, Ahmed
dc.contributor.funderScience Foundation Irelanden
dc.date.accessioned2022-11-08T15:25:27Z
dc.date.available2022-11-08T15:25:27Z
dc.date.issued2022-10-16
dc.date.updated2022-10-20T08:41:19Z
dc.description.abstractThis paper describes our system for the submission to the TextGraphs 2022 shared task at COLING 2022: Natural Language Premise Selection (NLPS) from mathematical texts. The task of NLPS is about selecting mathematical statements called premises in a knowledge base written in natural language and mathematical formulae that are most likely to be used to prove a particular mathematical proof. We formulated this task as an unsupervised semantic similarity task by first obtaining contextualized embeddings of both the premises and mathematical proofs using sentence transformers. We then obtained the cosine similarity between the embeddings of premises and proofs and then selected premises with the highest cosine scores as the most probable. Our system improves over the baseline system that uses bag of words models based on term frequency inverse document frequency in terms of mean average precision (MAP) by about 23.5% (0.1516 versus 0.1228).en
dc.description.statusNot peer revieweden
dc.description.versionPublished Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationTrust, P., Kadusabe, P., Younis, H., Minghim, R., Milios, E. and Zahran, A. (2022) 'UNLPSat TextGraphs-16 Natural Language Premise Selection task: Unsupervised Natural Language Premise Selection in mathematical text using sentence-MPNet', TextGraphs-16: Graph-based Methods for Natural Language Processing, Gyeongju, Republic of Korea, 16 October, pp. 119-123. Available at: https://aclanthology.org/2022.textgraphs-1.13 (Accessed: 8 November 2022)en
dc.identifier.endpage123en
dc.identifier.startpage119en
dc.identifier.urihttps://hdl.handle.net/10468/13837
dc.language.isoenen
dc.publisherAssociation for Computational Linguisticsen
dc.relation.urihttps://aclanthology.org/2022.textgraphs-1.13
dc.rights© 2022, the Authors. This paper is distributed under the terms of the Creative Commons Attribution Licence 4.0.en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subjectNatural Language Premise Selectionen
dc.titleUNLPSat TextGraphs-16 Natural Language Premise Selection task: Unsupervised Natural Language Premise Selection in mathematical text using sentence-MPNeten
dc.typeConference itemen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2022.textgraphs-1.13.pdf
Size:
205.13 KB
Format:
Adobe Portable Document Format
Description:
Published Version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: