Bayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributions

Trust, Paul; Provia, Kadusabe; Omala, Kizito

Bayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributions

dc.contributor.author	Trust, Paul
dc.contributor.author	Provia, Kadusabe
dc.contributor.author	Omala, Kizito
dc.contributor.funder	Science Foundation Ireland	en
dc.date.accessioned	2023-02-22T12:07:16Z
dc.date.available	2023-02-22T12:07:16Z
dc.date.issued	2022-12
dc.description.abstract	Transformers have achieved a state of the art performance across most natural language processing tasks. However the performance of these models degrade when being trained on skewed class distributions (class imbalance) because training tends to be biased towards head classes with most of the data points . Classical methods that have been proposed to handle this problem (re-sampling and re-weighting) often suffer from unstable performance, poor applicability and poor calibration. In this paper, we propose to use Bayesian methods and Venn-Abers predictors for well calibrated and robust training against class imbalance. Our proposed approach improves f1-score of the baseline RoBERTa (A Robustly Optimized Bidirectional Embedding from Transformers Pretraining Approach) model by about 6 points (79.0% against 72.6%) when training with class imbalanced data.	en
dc.description.status	Peer reviewed	en
dc.description.version	Published Version	en
dc.format.mimetype	application/pdf	en
dc.identifier.citation	Trust, P., Provia, K. and Omala, K. (2022) 'Bayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributions', Proceedings of the 3rd Workshop on Figurative Language Processing (FLP), pp. 94-99. Available at: https://aclanthology.org/2022.flp-1.13/ (Accessed: 22 February 2023)	en
dc.identifier.endpage	99	en
dc.identifier.startpage	94	en
dc.identifier.uri	https://hdl.handle.net/10468/14236
dc.language.iso	en	en
dc.publisher	Association for Computational Linguistics	en
dc.relation.uri	https://aclanthology.org/2022.flp-1.13/
dc.rights	© 2022, Association for Computational Linguistics.	en
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en
dc.subject	Natural language processing	en
dc.subject	Transformers	en
dc.subject	Bayesian methods	en
dc.subject	Venn-Abers predictors	en
dc.title	Bayes at FigLang 2022 Euphemism detection shared task: Cost-sensitive Bayesian fine-tuning and Venn-Abers predictors for robust training under class skewed distributions	en
dc.type	Conference item	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2022.flp-1.13.pdf
Size:: 178.17 KB
Format:: Adobe Portable Document Format
Description:: Published Version

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science - Conference Items