Bayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributions

dc.contributor.authorTrust, Paul
dc.contributor.authorProvia, Kadusabe
dc.contributor.authorOmala, Kizito
dc.contributor.funderScience Foundation Irelanden
dc.date.accessioned2023-02-22T12:07:16Z
dc.date.available2023-02-22T12:07:16Z
dc.date.issued2022-12
dc.description.abstractTransformers have achieved a state of the art performance across most natural language processing tasks. However the performance of these models degrade when being trained on skewed class distributions (class imbalance) because training tends to be biased towards head classes with most of the data points . Classical methods that have been proposed to handle this problem (re-sampling and re-weighting) often suffer from unstable performance, poor applicability and poor calibration. In this paper, we propose to use Bayesian methods and Venn-Abers predictors for well calibrated and robust training against class imbalance. Our proposed approach improves f1-score of the baseline RoBERTa (A Robustly Optimized Bidirectional Embedding from Transformers Pretraining Approach) model by about 6 points (79.0% against 72.6%) when training with class imbalanced data.en
dc.description.statusPeer revieweden
dc.description.versionPublished Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationTrust, P., Provia, K. and Omala, K. (2022) 'Bayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributions', Proceedings of the 3rd Workshop on Figurative Language Processing (FLP), pp. 94-99. Available at: https://aclanthology.org/2022.flp-1.13/ (Accessed: 22 February 2023)en
dc.identifier.endpage99en
dc.identifier.startpage94en
dc.identifier.urihttps://hdl.handle.net/10468/14236
dc.language.isoenen
dc.publisherAssociation for Computational Linguisticsen
dc.relation.urihttps://aclanthology.org/2022.flp-1.13/
dc.rights© 2022, Association for Computational Linguistics.en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subjectNatural language processingen
dc.subjectTransformersen
dc.subjectBayesian methodsen
dc.subjectVenn-Abers predictorsen
dc.titleBayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributionsen
dc.typeConference itemen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2022.flp-1.13.pdf
Size:
178.17 KB
Format:
Adobe Portable Document Format
Description:
Published Version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: