UCCNLP@SMM4H’22:Label distribution aware long-tailed learning with post-hoc posterior calibration applied to text classification

dc.contributor.authorTrust, Paul
dc.contributor.authorKadusabe, Provia
dc.contributor.authorZahran, Ahmed
dc.contributor.authorMinghim, Rosane
dc.contributor.authorOmala, Kizito
dc.contributor.funderScience Foundation Irelanden
dc.date.accessioned2022-11-09T09:37:06Z
dc.date.available2022-11-09T09:37:06Z
dc.date.issued2022-10
dc.date.updated2022-10-20T09:15:42Z
dc.description.abstractThe paper describes our submissions for the Social Media Mining for Health (SMM4H) workshop 2022 shared tasks. We participated in 2 tasks: (1) classification of adverse drug events (ADE) mentions in english tweets (Task-1a) and (2) classification of self-reported intimate partner violence (IPV) on twitter (Task 7). We proposed an approach that uses RoBERTa (A Robustly Optimized BERT Pretraining Approach) fine-tuned with a label distribution-aware margin loss function and post-hoc posterior calibration for robust inference against class imbalance. We achieved a 4% and 1 % increase in performance on IPV and ADE respectively when compared with the traditional fine-tuning strategy with unweighted cross-entropy loss.en
dc.description.statusNot peer revieweden
dc.description.versionPublished Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationTrust, P., Kadusabe, P., Zahran, A., Minghim, R. and Omala, K. (2022) 'UCCNLP@SMM4H’22:Label distribution aware long-tailed learning with post-hoc posterior calibration applied to text classification', Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, Workshop & Shared Task, Gyeongju, Republic of Korea, 12-17 October, pp. 90-94. Available at: https://aclanthology.org/2022.smm4h-1.26.pdf (Accessed: 9 November 2022)en
dc.identifier.endpage94en
dc.identifier.startpage90en
dc.identifier.urihttps://hdl.handle.net/10468/13838
dc.language.isoenen
dc.publisherAssociation for Computational Linguisticsen
dc.relation.urihttps://aclanthology.org/2022.smm4h-1.26
dc.rights© 2022, the Authors. This paper is distributed under the terms of the Creative Commons Attribution Licence 4.0.en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subjectSocial Media Mining for Healthen
dc.titleUCCNLP@SMM4H’22:Label distribution aware long-tailed learning with post-hoc posterior calibration applied to text classificationen
dc.typeConference itemen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2022.smm4h-1.26.pdf
Size:
230.18 KB
Format:
Adobe Portable Document Format
Description:
Published Version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: